Query psy9560
Match_columns 503
No_of_seqs 191 out of 328
Neff 2.6
Searched_HMMs 46136
Date Fri Aug 16 22:46:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9560.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9560hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3977|consensus 100.0 2.8E-63 6E-68 465.5 14.5 204 19-222 1-211 (221)
2 PF00992 Troponin: Troponin; 100.0 6E-41 1.3E-45 295.7 6.1 129 61-190 1-132 (132)
3 KOG3634|consensus 99.7 1.1E-16 2.5E-21 160.7 13.5 87 67-153 193-279 (361)
4 KOG1029|consensus 95.8 0.058 1.3E-06 61.3 10.5 45 115-159 448-506 (1118)
5 PHA03247 large tegument protei 95.7 1.4 3E-05 55.8 22.2 9 312-320 2715-2723(3151)
6 KOG1029|consensus 95.1 0.26 5.6E-06 56.4 12.6 16 106-121 446-461 (1118)
7 PHA03247 large tegument protei 94.7 9.3 0.0002 49.1 24.5 9 206-214 2535-2543(3151)
8 PRK07764 DNA polymerase III su 92.2 22 0.00048 40.9 21.2 64 283-346 512-595 (824)
9 KOG1924|consensus 90.2 1.5 3.3E-05 50.5 9.6 26 107-135 302-327 (1102)
10 PF09726 Macoilin: Transmembra 89.0 7.7 0.00017 43.8 13.8 26 128-153 632-657 (697)
11 KOG3054|consensus 88.6 1.4 3E-05 45.1 7.1 11 146-156 245-255 (299)
12 KOG2891|consensus 86.6 11 0.00025 39.6 12.3 10 112-121 408-417 (445)
13 PRK07764 DNA polymerase III su 85.7 26 0.00056 40.4 15.7 59 90-150 166-224 (824)
14 PRK12323 DNA polymerase III su 85.6 46 0.00099 38.3 17.2 25 91-116 171-195 (700)
15 TIGR02231 conserved hypothetic 85.1 14 0.0003 39.3 12.5 53 102-154 122-174 (525)
16 KOG1144|consensus 84.3 2.5 5.3E-05 48.9 6.9 28 23-50 231-258 (1064)
17 PF06705 SF-assemblin: SF-asse 84.2 22 0.00047 34.5 12.4 60 107-173 95-155 (247)
18 KOG2412|consensus 82.4 9.7 0.00021 42.5 10.2 57 108-168 298-354 (591)
19 KOG1924|consensus 82.3 14 0.0003 43.3 11.6 9 302-310 460-468 (1102)
20 PRK04863 mukB cell division pr 81.9 36 0.00077 41.8 15.4 62 98-159 429-493 (1486)
21 COG1340 Uncharacterized archae 77.5 59 0.0013 33.9 13.4 47 107-153 203-249 (294)
22 COG4839 FtsL Protein required 77.1 5.8 0.00013 36.5 5.5 38 116-153 58-95 (120)
23 KOG4722|consensus 76.4 30 0.00064 38.3 11.4 112 18-147 238-360 (672)
24 PF10234 Cluap1: Clusterin-ass 75.3 17 0.00037 37.0 8.9 88 66-160 127-223 (267)
25 KOG0163|consensus 74.9 41 0.00088 39.7 12.4 29 108-136 988-1016(1259)
26 PF10779 XhlA: Haemolysin XhlA 74.2 12 0.00026 30.4 6.1 47 107-153 2-48 (71)
27 KOG0804|consensus 74.1 75 0.0016 35.2 13.6 42 112-153 408-449 (493)
28 PF07889 DUF1664: Protein of u 73.3 26 0.00056 32.2 8.7 81 69-153 41-124 (126)
29 PF01486 K-box: K-box region; 73.1 14 0.00031 31.2 6.6 48 99-152 41-88 (100)
30 KOG4460|consensus 72.4 63 0.0014 36.8 12.8 12 180-191 723-734 (741)
31 PF00992 Troponin: Troponin; 71.8 5.3 0.00012 36.3 4.0 36 81-117 97-132 (132)
32 PF09744 Jnk-SapK_ap_N: JNK_SA 71.6 12 0.00026 35.3 6.4 85 61-153 47-131 (158)
33 PRK12323 DNA polymerase III su 70.6 2.1E+02 0.0044 33.3 16.5 20 327-346 355-375 (700)
34 PF13864 Enkurin: Calmodulin-b 70.0 59 0.0013 27.7 9.7 53 68-126 6-59 (98)
35 COG1340 Uncharacterized archae 69.3 40 0.00088 35.1 10.0 44 108-151 59-102 (294)
36 PF10168 Nup88: Nuclear pore c 67.5 18 0.00039 41.0 7.8 59 133-192 601-661 (717)
37 COG4026 Uncharacterized protei 66.6 57 0.0012 33.6 10.2 48 107-154 159-206 (290)
38 KOG1103|consensus 66.4 1.4E+02 0.003 32.7 13.5 48 106-153 240-287 (561)
39 PF15175 SPATA24: Spermatogene 66.2 15 0.00033 35.1 5.8 56 106-167 72-128 (153)
40 PF04859 DUF641: Plant protein 66.0 65 0.0014 29.9 9.7 77 70-153 44-122 (131)
41 PF10148 SCHIP-1: Schwannomin- 65.3 19 0.00042 36.4 6.7 49 100-148 169-224 (238)
42 PHA03011 hypothetical protein; 65.0 23 0.0005 32.4 6.5 47 107-153 67-113 (120)
43 KOG4593|consensus 65.0 44 0.00095 38.4 10.0 41 106-146 153-193 (716)
44 PTZ00266 NIMA-related protein 64.4 91 0.002 37.2 12.7 17 118-134 510-526 (1021)
45 PF06034 DUF919: Nucleopolyhed 63.3 14 0.0003 30.6 4.4 47 107-155 8-54 (62)
46 KOG1853|consensus 62.6 35 0.00075 35.6 8.0 48 105-152 99-146 (333)
47 COG3883 Uncharacterized protei 61.9 12 0.00026 38.2 4.7 51 107-157 55-105 (265)
48 KOG0161|consensus 61.0 1.2E+02 0.0026 38.6 13.4 57 107-163 904-960 (1930)
49 PF09728 Taxilin: Myosin-like 61.0 52 0.0011 33.7 8.9 17 107-123 74-90 (309)
50 KOG2072|consensus 60.9 1.2E+02 0.0025 36.2 12.4 36 98-139 656-691 (988)
51 KOG0996|consensus 60.8 32 0.0007 41.6 8.3 52 115-174 462-513 (1293)
52 PRK14950 DNA polymerase III su 59.9 68 0.0015 35.1 10.1 14 99-112 174-187 (585)
53 PRK11546 zraP zinc resistance 59.7 22 0.00047 33.4 5.6 50 104-153 44-103 (143)
54 PF08581 Tup_N: Tup N-terminal 59.7 34 0.00074 29.2 6.3 33 121-153 21-53 (79)
55 PF04102 SlyX: SlyX; InterPro 59.7 37 0.0008 27.6 6.3 27 129-155 22-48 (69)
56 TIGR02168 SMC_prok_B chromosom 59.4 2.6E+02 0.0057 31.4 14.6 8 143-150 334-341 (1179)
57 KOG0994|consensus 59.0 66 0.0014 39.5 10.3 36 117-152 1709-1744(1758)
58 cd00520 RRF Ribosome recycling 58.6 70 0.0015 30.3 8.8 51 54-109 99-149 (179)
59 PF00261 Tropomyosin: Tropomyo 58.6 1.7E+02 0.0037 28.4 11.6 19 108-126 159-177 (237)
60 TIGR00606 rad50 rad50. This fa 57.7 1.1E+02 0.0025 36.5 12.2 69 82-154 361-434 (1311)
61 PF10498 IFT57: Intra-flagella 57.7 1.1E+02 0.0024 32.3 10.8 73 69-153 243-315 (359)
62 PF07304 SRA1: Steroid recepto 57.6 7.9 0.00017 36.0 2.4 35 299-333 41-75 (157)
63 PRK00083 frr ribosome recyclin 57.3 68 0.0015 30.8 8.5 52 54-110 103-154 (185)
64 KOG1655|consensus 57.0 20 0.00043 35.9 5.0 59 116-174 17-83 (218)
65 PF09726 Macoilin: Transmembra 57.0 3.6E+02 0.0077 31.1 15.4 46 105-153 535-580 (697)
66 PF09755 DUF2046: Uncharacteri 56.8 59 0.0013 34.2 8.6 27 110-136 177-203 (310)
67 KOG4466|consensus 56.1 96 0.0021 32.5 9.8 139 28-185 36-190 (291)
68 KOG2507|consensus 56.1 35 0.00076 37.6 7.1 64 29-94 209-273 (506)
69 PF01086 Clathrin_lg_ch: Clath 55.7 24 0.00052 34.4 5.4 42 113-158 182-223 (225)
70 PF07926 TPR_MLP1_2: TPR/MLP1/ 55.6 64 0.0014 28.8 7.6 82 70-153 27-119 (132)
71 PF01576 Myosin_tail_1: Myosin 55.6 3.9 8.4E-05 46.7 0.0 109 40-157 108-233 (859)
72 KOG1318|consensus 55.6 27 0.00059 37.7 6.2 23 53-77 261-283 (411)
73 PF10044 Ret_tiss: Retinal tis 54.9 11 0.00024 33.3 2.7 25 100-124 54-78 (95)
74 PF07946 DUF1682: Protein of u 53.8 63 0.0014 32.9 8.2 8 53-60 299-306 (321)
75 PF14335 DUF4391: Domain of un 53.5 24 0.00052 33.9 4.9 71 106-192 143-220 (221)
76 PF12718 Tropomyosin_1: Tropom 53.4 33 0.00072 31.5 5.6 47 107-153 17-63 (143)
77 TIGR00496 frr ribosome recycli 53.2 89 0.0019 29.8 8.6 52 54-110 94-145 (176)
78 KOG3091|consensus 52.4 81 0.0017 35.2 9.1 52 107-158 379-430 (508)
79 TIGR01005 eps_transp_fam exopo 51.9 2.2E+02 0.0048 31.7 12.6 44 111-154 288-331 (754)
80 PRK11020 hypothetical protein; 51.7 36 0.00079 31.5 5.5 44 109-153 3-52 (118)
81 PF10046 BLOC1_2: Biogenesis o 51.5 35 0.00075 29.4 5.1 48 105-156 50-97 (99)
82 KOG0681|consensus 51.3 30 0.00065 39.1 5.8 126 33-187 281-415 (645)
83 PF10146 zf-C4H2: Zinc finger- 51.2 2.7E+02 0.0058 27.9 13.4 35 119-153 68-102 (230)
84 KOG0163|consensus 51.2 62 0.0013 38.3 8.3 19 32-50 937-955 (1259)
85 KOG4196|consensus 51.1 35 0.00076 32.2 5.4 34 122-155 78-111 (135)
86 PF05667 DUF812: Protein of un 51.0 3.2E+02 0.0069 30.9 13.6 44 82-126 372-416 (594)
87 KOG2891|consensus 50.8 3.5E+02 0.0075 29.1 14.0 11 37-47 341-351 (445)
88 PF11544 Spc42p: Spindle pole 50.6 7.3 0.00016 33.4 0.9 29 127-155 7-35 (76)
89 PF07058 Myosin_HC-like: Myosi 50.4 30 0.00065 36.7 5.3 68 106-173 2-71 (351)
90 KOG4460|consensus 50.3 51 0.0011 37.5 7.3 71 83-157 613-687 (741)
91 PF09755 DUF2046: Uncharacteri 50.1 34 0.00073 35.9 5.6 42 112-153 107-149 (310)
92 PRK04863 mukB cell division pr 49.9 6.2E+02 0.014 31.8 18.0 67 88-155 536-602 (1486)
93 PF00804 Syntaxin: Syntaxin; 49.9 19 0.00042 28.7 3.2 18 109-126 54-71 (103)
94 PF05622 HOOK: HOOK protein; 49.3 5.6 0.00012 44.1 0.0 85 107-191 242-339 (713)
95 PF02841 GBP_C: Guanylate-bind 48.9 2.8E+02 0.0061 27.7 11.7 44 109-152 252-297 (297)
96 KOG0250|consensus 48.2 2.1E+02 0.0046 34.7 12.1 88 79-173 394-481 (1074)
97 KOG2751|consensus 47.5 78 0.0017 34.8 8.0 89 69-167 181-269 (447)
98 TIGR00606 rad50 rad50. This fa 47.4 4.5E+02 0.0098 31.8 14.8 52 100-152 295-346 (1311)
99 cd00632 Prefoldin_beta Prefold 47.3 62 0.0013 27.7 6.0 46 107-152 59-104 (105)
100 KOG0996|consensus 46.9 3E+02 0.0065 34.0 13.0 42 100-141 896-937 (1293)
101 PF00038 Filament: Intermediat 46.9 41 0.0009 32.9 5.5 96 107-212 50-152 (312)
102 PF04880 NUDE_C: NUDE protein, 46.8 19 0.00041 34.5 3.1 49 106-155 2-53 (166)
103 COG4942 Membrane-bound metallo 46.8 44 0.00096 36.3 6.1 48 106-153 61-108 (420)
104 PF08232 Striatin: Striatin fa 46.6 36 0.00078 31.0 4.7 12 142-153 28-39 (134)
105 PRK12750 cpxP periplasmic repr 46.5 1.6E+02 0.0035 27.9 9.1 66 53-120 50-119 (170)
106 PF12329 TMF_DNA_bd: TATA elem 45.8 41 0.0009 27.9 4.5 43 106-148 28-70 (74)
107 PRK04325 hypothetical protein; 45.7 51 0.0011 27.5 5.1 30 124-153 8-37 (74)
108 smart00338 BRLZ basic region l 45.7 61 0.0013 25.4 5.3 37 112-148 27-63 (65)
109 PRK14954 DNA polymerase III su 45.6 3.7E+02 0.008 30.4 13.1 21 302-322 320-340 (620)
110 PRK02793 phi X174 lysis protei 45.6 57 0.0012 27.0 5.3 25 128-152 11-35 (72)
111 PF14662 CCDC155: Coiled-coil 45.4 2.5E+02 0.0054 28.0 10.4 45 108-152 120-164 (193)
112 KOG4364|consensus 45.2 95 0.0021 36.1 8.5 16 294-309 663-678 (811)
113 PF08764 Coagulase: Staphyloco 44.7 3.7E+02 0.008 28.2 12.0 43 77-119 73-126 (282)
114 PRK14959 DNA polymerase III su 44.7 2.9E+02 0.0063 31.5 12.1 47 90-137 165-212 (624)
115 PRK10361 DNA recombination pro 44.6 4.5E+02 0.0098 29.2 13.2 42 109-150 145-186 (475)
116 KOG4031|consensus 44.3 1.3E+02 0.0027 30.5 8.3 33 117-153 179-211 (216)
117 PF06246 Isy1: Isy1-like splic 44.2 35 0.00076 34.5 4.6 43 110-153 50-99 (255)
118 PF11559 ADIP: Afadin- and alp 44.2 80 0.0017 28.3 6.5 49 105-153 29-80 (151)
119 PF06005 DUF904: Protein of un 44.0 1E+02 0.0023 25.8 6.7 46 107-152 7-52 (72)
120 KOG1144|consensus 43.8 48 0.001 39.1 6.1 11 37-47 237-247 (1064)
121 PF09849 DUF2076: Uncharacteri 43.7 44 0.00096 33.7 5.2 32 122-153 38-69 (247)
122 KOG4643|consensus 43.6 3.8E+02 0.0083 32.8 13.1 144 25-174 77-242 (1195)
123 PF09756 DDRGK: DDRGK domain; 43.5 8.9 0.00019 37.1 0.4 18 139-156 132-154 (188)
124 PF07956 DUF1690: Protein of U 43.4 2.8E+02 0.006 25.9 11.1 66 56-121 24-92 (142)
125 KOG2008|consensus 43.2 3.8E+02 0.0083 29.1 12.0 226 53-317 69-299 (426)
126 PF12718 Tropomyosin_1: Tropom 43.2 75 0.0016 29.2 6.2 17 79-95 14-30 (143)
127 PF14772 NYD-SP28: Sperm tail 43.1 56 0.0012 27.9 5.1 37 112-151 52-88 (104)
128 PF03357 Snf7: Snf7; InterPro 43.0 47 0.001 29.2 4.8 28 125-152 8-35 (171)
129 PF10805 DUF2730: Protein of u 42.9 49 0.0011 28.9 4.8 42 107-155 38-81 (106)
130 PF10146 zf-C4H2: Zinc finger- 42.7 1.8E+02 0.0039 29.1 9.2 43 111-153 32-74 (230)
131 KOG3859|consensus 42.5 2.1E+02 0.0046 30.8 10.0 73 69-149 325-401 (406)
132 PRK02224 chromosome segregatio 42.3 3.7E+02 0.008 30.3 12.5 15 139-153 627-641 (880)
133 PF03978 Borrelia_REV: Borreli 42.2 34 0.00073 33.0 3.9 76 61-156 8-83 (160)
134 PF05103 DivIVA: DivIVA protei 42.1 26 0.00056 30.0 2.9 53 101-153 15-67 (131)
135 COG1579 Zn-ribbon protein, pos 42.1 76 0.0016 32.1 6.6 49 107-155 34-82 (239)
136 PHA02562 46 endonuclease subun 41.9 1.6E+02 0.0034 31.0 9.1 47 107-153 354-400 (562)
137 TIGR02169 SMC_prok_A chromosom 41.9 2.8E+02 0.0061 31.4 11.5 42 112-153 302-343 (1164)
138 PF02994 Transposase_22: L1 tr 41.8 40 0.00088 35.2 4.8 60 108-167 141-200 (370)
139 PF08614 ATG16: Autophagy prot 41.8 51 0.0011 31.0 5.1 49 105-153 103-151 (194)
140 cd07627 BAR_Vps5p The Bin/Amph 41.4 32 0.00068 32.8 3.7 88 88-187 104-193 (216)
141 PRK11637 AmiB activator; Provi 41.3 3.2E+02 0.0069 28.6 11.2 46 108-153 79-124 (428)
142 KOG0992|consensus 41.2 2.6E+02 0.0057 31.8 10.9 53 107-159 363-423 (613)
143 PF09278 MerR-DNA-bind: MerR, 41.1 41 0.00088 25.7 3.6 16 102-118 13-28 (65)
144 PF11932 DUF3450: Protein of u 41.0 74 0.0016 30.9 6.2 45 108-152 53-97 (251)
145 PRK13848 conjugal transfer pro 40.2 1.3E+02 0.0029 27.2 7.0 46 73-121 16-61 (98)
146 PF07462 MSP1_C: Merozoite sur 40.0 66 0.0014 36.3 6.2 18 133-150 55-72 (574)
147 PRK04406 hypothetical protein; 39.9 55 0.0012 27.5 4.4 34 120-153 6-39 (75)
148 KOG0963|consensus 39.7 4.2E+02 0.009 30.6 12.3 101 29-133 120-225 (629)
149 PF04156 IncA: IncA protein; 39.7 3E+02 0.0066 25.2 10.4 24 114-137 126-149 (191)
150 PF05793 TFIIF_alpha: Transcri 39.4 41 0.00088 36.8 4.6 60 277-341 460-519 (527)
151 PRK04325 hypothetical protein; 39.4 33 0.00072 28.5 3.1 46 108-153 6-51 (74)
152 PHA03264 envelope glycoprotein 39.4 4E+02 0.0086 29.3 11.6 36 306-348 224-259 (416)
153 PF07820 TraC: TraC-like prote 39.4 1.4E+02 0.0029 26.8 6.9 48 72-122 14-61 (92)
154 PF04102 SlyX: SlyX; InterPro 39.2 97 0.0021 25.2 5.6 25 129-153 8-32 (69)
155 PRK00846 hypothetical protein; 39.1 58 0.0013 27.9 4.5 30 124-153 12-41 (77)
156 PF15254 CCDC14: Coiled-coil d 39.1 1.7E+02 0.0037 34.6 9.4 46 110-155 507-552 (861)
157 KOG4302|consensus 38.8 1.7E+02 0.0037 33.6 9.3 25 106-130 249-273 (660)
158 PF08700 Vps51: Vps51/Vps67; 38.8 1.2E+02 0.0026 24.4 6.1 24 103-126 18-41 (87)
159 PF04728 LPP: Lipoprotein leuc 38.6 1.2E+02 0.0026 24.9 6.0 49 107-155 6-54 (56)
160 PF01765 RRF: Ribosome recycli 37.5 1.3E+02 0.0028 27.9 6.9 36 54-93 85-120 (165)
161 KOG0247|consensus 37.3 3.2E+02 0.0069 32.3 11.1 110 29-164 492-612 (809)
162 PF00170 bZIP_1: bZIP transcri 37.0 1.2E+02 0.0025 23.9 5.6 34 112-145 27-60 (64)
163 PTZ00454 26S protease regulato 37.0 91 0.002 32.9 6.5 49 105-153 16-64 (398)
164 KOG3032|consensus 36.9 1E+02 0.0022 31.8 6.5 8 204-211 254-262 (264)
165 PRK04406 hypothetical protein; 36.9 87 0.0019 26.3 5.2 27 127-153 27-53 (75)
166 PTZ00121 MAEBL; Provisional 36.8 3.8E+02 0.0082 34.3 12.0 6 172-177 1848-1853(2084)
167 TIGR00237 xseA exodeoxyribonuc 35.9 5.7E+02 0.012 27.3 14.6 14 8-21 242-255 (432)
168 KOG4661|consensus 35.9 1E+02 0.0022 35.5 6.9 14 122-135 681-694 (940)
169 KOG0979|consensus 35.8 9.1E+02 0.02 29.7 15.1 22 69-90 253-274 (1072)
170 COG1196 Smc Chromosome segrega 35.8 4.5E+02 0.0097 31.4 12.4 45 109-153 272-316 (1163)
171 KOG1850|consensus 35.7 2.6E+02 0.0057 30.2 9.5 21 106-126 75-95 (391)
172 PRK11546 zraP zinc resistance 35.7 3.9E+02 0.0085 25.3 10.3 74 55-136 43-118 (143)
173 PRK03963 V-type ATP synthase s 35.0 3.7E+02 0.0081 24.9 11.0 20 109-128 83-102 (198)
174 PRK00286 xseA exodeoxyribonucl 34.3 5.7E+02 0.012 26.8 15.6 15 8-22 247-261 (438)
175 KOG3654|consensus 34.3 37 0.00081 38.2 3.3 46 22-67 405-459 (708)
176 PF05531 NPV_P10: Nucleopolyhe 34.3 1.2E+02 0.0026 26.1 5.6 47 107-153 14-63 (75)
177 PF07851 TMPIT: TMPIT-like pro 33.9 1.3E+02 0.0028 31.9 6.9 47 107-153 14-60 (330)
178 KOG0962|consensus 33.4 2.1E+02 0.0045 35.4 9.3 17 32-48 831-847 (1294)
179 KOG0239|consensus 33.4 1.6E+02 0.0035 33.6 8.1 48 106-153 229-276 (670)
180 PF05266 DUF724: Protein of un 33.3 3.6E+02 0.0077 26.2 9.3 50 107-156 134-183 (190)
181 PRK07003 DNA polymerase III su 33.2 9.2E+02 0.02 28.9 16.6 44 91-135 166-210 (830)
182 PRK00295 hypothetical protein; 33.1 93 0.002 25.6 4.7 25 128-152 8-32 (68)
183 KOG2004|consensus 33.0 50 0.0011 38.7 4.1 49 126-192 279-328 (906)
184 PF07106 TBPIP: Tat binding pr 33.0 2.6E+02 0.0057 25.6 8.1 12 66-77 32-43 (169)
185 PF12128 DUF3584: Protein of u 32.8 3.4E+02 0.0073 32.6 10.8 22 106-127 773-794 (1201)
186 KOG0239|consensus 32.7 2.6E+02 0.0056 32.1 9.5 47 107-154 269-315 (670)
187 KOG0250|consensus 32.7 4.1E+02 0.009 32.4 11.3 32 122-153 398-429 (1074)
188 PF12709 Kinetocho_Slk19: Cent 32.6 1.4E+02 0.003 26.4 5.8 47 107-153 15-70 (87)
189 PRK14127 cell division protein 32.5 1.7E+02 0.0037 26.5 6.6 54 100-153 19-72 (109)
190 PF04728 LPP: Lipoprotein leuc 32.4 90 0.002 25.6 4.4 36 118-153 3-38 (56)
191 PF10186 Atg14: UV radiation r 32.2 4.2E+02 0.009 25.3 9.6 50 105-154 57-106 (302)
192 PRK10803 tol-pal system protei 32.2 78 0.0017 31.5 4.9 15 139-153 61-75 (263)
193 PRK06800 fliH flagellar assemb 32.1 2.4E+02 0.0053 28.5 8.1 27 69-95 50-76 (228)
194 KOG0978|consensus 31.9 81 0.0018 36.4 5.5 46 108-153 59-104 (698)
195 COG3937 Uncharacterized conser 31.7 1.2E+02 0.0025 27.9 5.4 32 102-136 37-68 (108)
196 PF10174 Cast: RIM-binding pro 31.6 9.2E+02 0.02 28.4 14.5 49 107-155 436-488 (775)
197 smart00761 HDAC_interact Histo 31.5 89 0.0019 28.1 4.6 36 114-149 55-90 (102)
198 TIGR03017 EpsF chain length de 31.1 6E+02 0.013 26.1 11.7 44 112-155 216-277 (444)
199 PRK07191 flgK flagellar hook-a 30.7 85 0.0018 33.4 5.2 48 109-156 136-183 (456)
200 PLN02316 synthase/transferase 30.7 61 0.0013 38.7 4.5 42 26-67 262-306 (1036)
201 PRK03992 proteasome-activating 30.7 1.4E+02 0.0031 30.9 6.7 47 107-153 4-50 (389)
202 PRK02119 hypothetical protein; 30.5 89 0.0019 26.1 4.2 30 124-153 8-37 (73)
203 PF10212 TTKRSYEDQ: Predicted 30.4 5.8E+02 0.013 28.9 11.4 62 109-181 439-500 (518)
204 PRK00736 hypothetical protein; 30.3 1.1E+02 0.0024 25.2 4.7 24 129-152 9-32 (68)
205 KOG2072|consensus 30.2 8.6E+02 0.019 29.5 13.0 8 180-187 756-763 (988)
206 PRK10884 SH3 domain-containing 30.0 1.5E+02 0.0032 29.1 6.3 48 106-153 120-167 (206)
207 PF06156 DUF972: Protein of un 29.9 3.8E+02 0.0083 24.0 8.3 67 107-173 11-81 (107)
208 KOG2129|consensus 29.7 96 0.0021 34.4 5.4 29 106-134 196-224 (552)
209 KOG0933|consensus 29.5 3.2E+02 0.007 33.3 9.8 43 114-156 839-881 (1174)
210 PF07946 DUF1682: Protein of u 29.5 1.6E+02 0.0034 30.1 6.6 27 24-50 261-287 (321)
211 PF04977 DivIC: Septum formati 29.5 1.3E+02 0.0027 23.6 4.8 28 126-153 25-52 (80)
212 PF04156 IncA: IncA protein; 29.5 1.7E+02 0.0037 26.8 6.3 25 107-131 98-122 (191)
213 PRK09174 F0F1 ATP synthase sub 29.4 5.4E+02 0.012 25.0 11.3 27 109-135 173-199 (204)
214 KOG0964|consensus 29.3 9E+02 0.019 29.9 13.1 134 16-151 690-833 (1200)
215 PF10211 Ax_dynein_light: Axon 29.2 1.4E+02 0.003 28.6 5.9 27 127-153 122-148 (189)
216 PF10168 Nup88: Nuclear pore c 28.9 4.3E+02 0.0094 30.5 10.5 16 80-95 601-616 (717)
217 PF13851 GAS: Growth-arrest sp 28.9 1.5E+02 0.0032 28.7 6.0 117 68-186 28-147 (201)
218 PRK15422 septal ring assembly 28.8 1.4E+02 0.0031 26.1 5.2 51 105-155 26-76 (79)
219 PRK08147 flgK flagellar hook-a 28.7 91 0.002 33.8 5.0 48 109-156 137-184 (547)
220 PF03082 MAGSP: Male accessory 28.6 1.7E+02 0.0038 30.0 6.6 77 87-166 130-206 (264)
221 COG3851 UhpB Signal transducti 28.6 2.8E+02 0.0061 30.7 8.5 66 79-151 279-354 (497)
222 PF00769 ERM: Ezrin/radixin/mo 28.4 6E+02 0.013 25.3 12.5 25 69-93 45-69 (246)
223 cd06225 HAMP Histidine kinase, 28.4 32 0.0007 21.4 1.1 39 116-155 8-46 (48)
224 PF06698 DUF1192: Protein of u 28.3 80 0.0017 25.9 3.5 28 99-126 16-43 (59)
225 PF04120 Iron_permease: Low af 28.1 3.1E+02 0.0068 25.6 7.7 46 83-128 74-119 (132)
226 PF13801 Metal_resist: Heavy-m 28.0 3.2E+02 0.0069 22.0 11.0 76 54-135 40-115 (125)
227 KOG4438|consensus 27.8 4.9E+02 0.011 29.0 10.1 59 78-147 261-321 (446)
228 COG4026 Uncharacterized protei 27.7 1.7E+02 0.0036 30.4 6.3 47 107-153 131-177 (290)
229 PF04048 Sec8_exocyst: Sec8 ex 27.5 2.2E+02 0.0049 25.8 6.6 59 109-167 56-116 (142)
230 PF07798 DUF1640: Protein of u 27.4 4.5E+02 0.0097 24.6 8.7 9 182-190 143-151 (177)
231 PF08295 Sin3_corepress: Sin3 27.3 1.3E+02 0.0028 26.7 4.9 35 114-148 54-88 (101)
232 PF03656 Pam16: Pam16; InterP 27.2 58 0.0013 30.0 2.8 56 91-147 57-114 (127)
233 PRK15396 murein lipoprotein; P 27.1 2.5E+02 0.0055 24.1 6.4 53 105-157 26-78 (78)
234 KOG3866|consensus 27.1 93 0.002 33.5 4.6 32 99-135 220-251 (442)
235 PF10264 Stork_head: Winged he 27.0 1.1E+02 0.0025 26.4 4.4 57 273-330 3-59 (80)
236 PF08946 Osmo_CC: Osmosensory 27.0 75 0.0016 25.4 3.0 36 107-152 4-39 (46)
237 KOG0978|consensus 26.9 7.1E+02 0.015 29.1 11.7 25 129-153 598-622 (698)
238 KOG0243|consensus 26.9 5.1E+02 0.011 31.6 10.8 59 29-95 410-468 (1041)
239 PF05276 SH3BP5: SH3 domain-bi 26.9 6.8E+02 0.015 25.4 12.0 99 68-169 78-178 (239)
240 PF11932 DUF3450: Protein of u 26.9 2.1E+02 0.0044 27.9 6.7 26 109-134 68-93 (251)
241 KOG1899|consensus 26.8 3.5E+02 0.0076 31.7 9.1 21 114-135 164-184 (861)
242 COG2118 DNA-binding protein [G 26.8 5.3E+02 0.011 24.1 9.0 16 104-119 89-104 (116)
243 KOG2572|consensus 26.6 1.8E+02 0.0039 32.3 6.7 64 101-165 125-197 (498)
244 KOG0994|consensus 26.6 1.1E+03 0.024 30.0 13.3 40 10-49 1407-1448(1758)
245 smart00540 LEM in nuclear memb 26.3 38 0.00083 26.2 1.3 17 100-116 2-18 (44)
246 PF05278 PEARLI-4: Arabidopsis 26.3 2.4E+02 0.0052 29.3 7.2 39 130-168 212-250 (269)
247 KOG0161|consensus 26.1 1E+03 0.023 31.0 13.7 42 112-153 1246-1287(1930)
248 PHA02562 46 endonuclease subun 26.0 8E+02 0.017 25.9 12.6 45 109-153 232-276 (562)
249 PF09789 DUF2353: Uncharacteri 26.0 7.5E+02 0.016 26.3 10.8 78 60-153 2-100 (319)
250 PF07223 DUF1421: Protein of u 26.0 58 0.0012 34.6 2.9 30 297-332 316-345 (358)
251 PRK07739 flgK flagellar hook-a 25.9 1.1E+02 0.0023 33.2 4.9 47 109-155 148-194 (507)
252 PF12725 DUF3810: Protein of u 25.9 1.8E+02 0.0038 30.0 6.3 72 91-166 84-166 (318)
253 KOG4643|consensus 25.8 1.3E+03 0.028 28.7 13.6 59 69-136 497-555 (1195)
254 KOG0652|consensus 25.6 1.1E+02 0.0023 32.7 4.7 55 102-156 11-65 (424)
255 PF00505 HMG_box: HMG (high mo 25.6 1.4E+02 0.0031 22.5 4.3 40 103-142 25-64 (69)
256 PRK13729 conjugal transfer pil 25.4 1.7E+02 0.0036 32.6 6.3 39 110-148 75-120 (475)
257 PF01920 Prefoldin_2: Prefoldi 25.2 2.2E+02 0.0047 23.4 5.6 39 107-145 1-39 (106)
258 TIGR02338 gimC_beta prefoldin, 25.2 4.5E+02 0.0098 22.8 8.5 44 107-150 63-106 (110)
259 PLN00138 large subunit ribosom 25.1 2.4E+02 0.0052 25.6 6.2 60 286-346 4-63 (113)
260 KOG3230|consensus 25.0 7.5E+02 0.016 25.3 11.4 62 55-119 7-68 (224)
261 PF05010 TACC: Transforming ac 24.9 6.9E+02 0.015 24.8 12.0 63 113-176 78-140 (207)
262 PF05667 DUF812: Protein of un 24.8 1.1E+03 0.023 26.9 14.1 44 110-157 487-530 (594)
263 PF06248 Zw10: Centromere/kine 24.5 2.2E+02 0.0047 31.1 7.0 70 104-173 7-88 (593)
264 KOG0992|consensus 24.4 2.5E+02 0.0054 32.0 7.4 33 119-152 153-185 (613)
265 PF01690 PLRV_ORF5: Potato lea 24.4 43 0.00092 36.9 1.7 16 276-291 23-38 (465)
266 PRK15178 Vi polysaccharide exp 24.4 6.4E+02 0.014 27.7 10.3 89 67-156 242-338 (434)
267 PRK00295 hypothetical protein; 24.4 4E+02 0.0087 21.9 7.2 25 129-153 23-47 (68)
268 COG3937 Uncharacterized conser 24.3 5.7E+02 0.012 23.6 8.5 23 51-73 34-56 (108)
269 PF09371 Tex_N: Tex-like prote 24.2 2.1E+02 0.0045 27.9 6.1 56 101-157 36-100 (193)
270 KOG4005|consensus 24.2 7.5E+02 0.016 26.0 10.2 7 16-22 41-47 (292)
271 TIGR03319 YmdA_YtgF conserved 24.2 5.8E+02 0.013 28.2 10.1 115 22-143 57-178 (514)
272 PRK07521 flgK flagellar hook-a 24.1 2E+02 0.0044 30.9 6.6 47 109-155 131-177 (483)
273 PF06013 WXG100: Proteins of 1 24.0 2.8E+02 0.006 20.9 5.7 40 104-157 4-43 (86)
274 PF05278 PEARLI-4: Arabidopsis 24.0 3.5E+02 0.0076 28.1 7.9 25 130-154 240-264 (269)
275 KOG0995|consensus 24.0 6.4E+02 0.014 29.0 10.4 69 69-137 292-372 (581)
276 PF07200 Mod_r: Modifier of ru 24.0 1.9E+02 0.0041 25.8 5.4 51 101-151 1-67 (150)
277 PF14389 Lzipper-MIP1: Leucine 24.0 4.6E+02 0.01 22.4 9.1 37 107-150 50-86 (88)
278 COG5293 Predicted ATPase [Gene 23.9 7.4E+02 0.016 28.3 10.7 83 37-132 316-404 (591)
279 PF10046 BLOC1_2: Biogenesis o 23.9 2.4E+02 0.0053 24.3 5.9 40 111-150 59-98 (99)
280 PF08317 Spc7: Spc7 kinetochor 23.8 2.2E+02 0.0047 29.0 6.5 56 119-174 231-289 (325)
281 PF08232 Striatin: Striatin fa 23.7 2.1E+02 0.0046 26.1 5.7 39 115-153 29-74 (134)
282 PF14915 CCDC144C: CCDC144C pr 23.7 6E+02 0.013 27.1 9.5 56 66-130 178-233 (305)
283 COG3883 Uncharacterized protei 23.7 2E+02 0.0043 29.7 6.1 38 107-144 62-99 (265)
284 PF08826 DMPK_coil: DMPK coile 23.7 3.9E+02 0.0084 22.1 6.7 37 116-152 16-52 (61)
285 PF09340 NuA4: Histone acetylt 23.7 1.2E+02 0.0025 25.8 3.8 29 107-135 5-33 (80)
286 PF08286 Spc24: Spc24 subunit 23.6 27 0.00057 30.7 0.0 34 120-153 1-34 (118)
287 KOG0976|consensus 23.6 7.6E+02 0.016 30.1 11.1 43 107-149 266-308 (1265)
288 PRK02119 hypothetical protein; 23.5 2.2E+02 0.0047 23.8 5.3 24 129-152 27-50 (73)
289 PF04100 Vps53_N: Vps53-like, 23.3 90 0.002 32.8 3.7 62 105-168 58-119 (383)
290 PF05983 Med7: MED7 protein; 23.2 4.1E+02 0.0088 25.1 7.6 85 56-153 69-159 (162)
291 PF07227 DUF1423: Protein of u 23.1 4.4E+02 0.0096 29.2 8.8 45 100-144 401-445 (446)
292 PRK04778 septation ring format 23.1 3.4E+02 0.0075 29.8 8.1 52 105-156 377-428 (569)
293 COG0536 Obg Predicted GTPase [ 23.1 65 0.0014 34.6 2.7 51 82-132 292-346 (369)
294 PF15233 SYCE1: Synaptonemal c 23.0 3.2E+02 0.0069 26.0 6.8 44 108-151 31-74 (134)
295 PF13797 Post_transc_reg: Post 22.8 78 0.0017 27.2 2.7 48 281-331 28-76 (87)
296 TIGR01837 PHA_granule_1 poly(h 22.8 5.1E+02 0.011 23.2 7.9 26 51-76 32-57 (118)
297 KOG3647|consensus 22.6 4.2E+02 0.0091 28.2 8.2 45 113-159 121-165 (338)
298 PRK08471 flgK flagellar hook-a 22.5 1.4E+02 0.0031 33.3 5.2 48 109-156 141-188 (613)
299 PF15469 Sec5: Exocyst complex 22.5 1.6E+02 0.0035 27.0 4.8 87 108-194 3-102 (182)
300 cd09237 V_ScBro1_like Protein- 22.4 5.4E+02 0.012 26.5 9.0 96 68-166 88-185 (356)
301 PF05791 Bacillus_HBL: Bacillu 22.4 2.4E+02 0.0052 26.7 6.0 34 122-155 146-179 (184)
302 KOG3863|consensus 22.4 1.1E+02 0.0023 35.0 4.2 72 53-129 497-568 (604)
303 PF08781 DP: Transcription fac 22.3 1.6E+02 0.0034 27.9 4.8 34 119-152 2-35 (142)
304 KOG3054|consensus 22.3 4.4E+02 0.0095 27.7 8.2 11 111-121 200-210 (299)
305 PF10018 Med4: Vitamin-D-recep 22.2 2.9E+02 0.0062 26.2 6.5 42 109-151 7-48 (188)
306 PF11945 WASH_WAHD: WAHD domai 22.2 2.3E+02 0.005 29.4 6.3 48 105-157 26-74 (297)
307 cd00890 Prefoldin Prefoldin is 22.0 2.9E+02 0.0062 23.5 5.9 45 108-152 84-128 (129)
308 PF05178 Kri1: KRI1-like famil 22.0 45 0.00098 29.4 1.1 15 33-47 2-16 (101)
309 TIGR02169 SMC_prok_A chromosom 22.0 1.2E+03 0.026 26.6 13.6 35 114-148 402-436 (1164)
310 KOG2412|consensus 21.9 3.8E+02 0.0082 30.7 8.2 28 23-50 204-231 (591)
311 PF10226 DUF2216: Uncharacteri 21.8 2.1E+02 0.0046 28.6 5.6 54 100-153 4-59 (195)
312 PLN02316 synthase/transferase 21.7 77 0.0017 37.9 3.2 35 25-59 442-476 (1036)
313 KOG0976|consensus 21.7 7.4E+02 0.016 30.2 10.6 23 68-90 110-132 (1265)
314 PRK00247 putative inner membra 21.7 4.7E+02 0.01 28.7 8.7 20 25-44 337-356 (429)
315 KOG2689|consensus 21.7 4.3E+02 0.0092 27.9 8.0 40 54-93 126-165 (290)
316 COG1196 Smc Chromosome segrega 21.7 1.4E+03 0.031 27.4 14.2 38 116-153 300-337 (1163)
317 smart00513 SAP Putative DNA-bi 21.5 75 0.0016 22.5 2.0 17 102-118 2-18 (35)
318 PF09302 XLF: XLF (XRCC4-like 21.5 1.2E+02 0.0026 27.6 3.7 42 109-150 130-171 (171)
319 cd00632 Prefoldin_beta Prefold 21.4 1.6E+02 0.0035 25.1 4.3 46 108-153 3-53 (105)
320 PRK00846 hypothetical protein; 21.4 2E+02 0.0043 24.7 4.8 48 107-154 9-56 (77)
321 PF13358 DDE_3: DDE superfamil 21.3 70 0.0015 26.2 2.0 24 282-305 122-145 (146)
322 PF08317 Spc7: Spc7 kinetochor 21.2 8.1E+02 0.017 25.1 9.8 23 129-151 234-256 (325)
323 PRK06665 flgK flagellar hook-a 21.1 2.4E+02 0.0052 31.6 6.6 47 109-155 148-194 (627)
324 PF02646 RmuC: RmuC family; I 20.9 1.7E+02 0.0036 29.7 5.0 44 107-150 236-279 (304)
325 PRK02793 phi X174 lysis protei 20.9 4.9E+02 0.011 21.6 7.2 26 128-153 25-50 (72)
326 PF14271 DUF4359: Domain of un 20.9 1.4E+02 0.0031 26.2 3.9 48 285-333 21-70 (107)
327 COG3416 Uncharacterized protei 20.7 3.6E+02 0.0079 27.6 7.1 13 56-68 1-13 (233)
328 TIGR02492 flgK_ends flagellar 20.6 2.9E+02 0.0063 28.0 6.6 47 109-155 136-182 (322)
329 PF14282 FlxA: FlxA-like prote 20.5 3.1E+02 0.0068 24.0 5.9 43 107-153 22-65 (106)
330 KOG4674|consensus 20.4 4E+02 0.0086 34.3 8.6 63 105-167 566-629 (1822)
331 PF10481 CENP-F_N: Cenp-F N-te 20.4 4.6E+02 0.01 27.8 8.0 73 72-153 44-123 (307)
332 cd01109 HTH_YyaN Helix-Turn-He 20.4 4E+02 0.0086 22.8 6.5 11 103-113 57-67 (113)
333 PF11214 Med2: Mediator comple 20.4 1.2E+02 0.0026 27.6 3.4 26 24-49 80-105 (105)
334 COG0011 Uncharacterized conser 20.3 25 0.00055 31.3 -0.8 53 100-153 45-97 (100)
335 PF05266 DUF724: Protein of un 20.3 8E+02 0.017 23.8 10.5 48 105-152 125-172 (190)
336 PF01895 PhoU: PhoU domain; I 20.3 3.8E+02 0.0082 20.3 5.8 42 109-150 4-46 (88)
337 PF07323 DUF1465: Protein of u 20.2 37 0.00081 32.3 0.2 22 109-130 114-135 (156)
338 PTZ00400 DnaK-type molecular c 20.1 4.1E+02 0.0089 29.8 8.1 18 105-122 595-612 (663)
339 KOG1118|consensus 20.1 2.5E+02 0.0053 30.3 6.0 27 113-139 146-172 (366)
340 PF10552 ORF6C: ORF6C domain; 20.0 4.2E+02 0.0091 23.2 6.7 77 112-188 9-92 (116)
No 1
>KOG3977|consensus
Probab=100.00 E-value=2.8e-63 Score=465.48 Aligned_cols=204 Identities=42% Similarity=0.613 Sum_probs=186.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhHHHHhhhh-ccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCC
Q psy9560 19 SAEDEAKRLKLEELERKRAEVRRRMEEASGAR-RKKGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGE 97 (503)
Q Consensus 19 mad~ed~kRKqeErERKkAEvRKRlEEA~KkK-AKKgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpP 97 (503)
|.|.++..|+++++++|++|||+|||+++..+ +||||+|++||++||+|||+||+++|+++++.+++||++||++|+.+
T Consensus 1 ~~dg~~~~rka~~re~kk~evrkrleeA~~~~~~KKgfltpeRKkkLrkLlm~kAaedLkqqq~~kEqErqr~LaeR~i~ 80 (221)
T KOG3977|consen 1 EVDGDDAARKAQEREAKKAEVRKRLEEAGMPKKEKKGFLTPERKKKLRKLLMQKAAEDLKQQQELKEQERQRYLAERTIP 80 (221)
T ss_pred CCccchhhhhccchhHHHHHHHHHHHHhcccchhhcccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccCC
Confidence 46778899999999999999999999999766 89999999999999999999999999999999999999999999988
Q ss_pred CCCCCCCCH-HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHH-H--
Q psy9560 98 PKDTDDMAE-DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAK-L-- 173 (503)
Q Consensus 98 PLnIDGLSe-dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~-L-- 173 (503)
+.|+++|+. ..|++||++||.+|+.||+|||||+++|.+++.|||+|||+||||||||+||+|||||+|+|+|.. |
T Consensus 81 lp~~d~l~d~g~Lq~ly~~l~arv~~leEEkYDi~~~v~qt~~EIndLtikvnDLRGKFvkPtLkkVsks~~kf~ka~~~ 160 (221)
T KOG3977|consen 81 LPDVDSLDDRGLLQDLYRELHARVDALEEEKYDIEAKVTQTETEINDLTIKVNDLRGKFVKPTLKKVSKSADKFLKALLG 160 (221)
T ss_pred CCCCCcccchHHHHHHHHHHHHHHHHHHHhhcchhheeehhhhhHHHHHHHHHHhcccccCccHHHHHhhhHHHHHHhhc
Confidence 999998876 569999999999999999999999999999999999999999999999999999999999976643 3
Q ss_pred Hhhhhccchhhhcccccccccchhh--hhhccCccccccchhhccCCCCCC
Q psy9560 174 QKKAAEFNFRSQLKAVKKREFSMED--AEKEKKPEWVKEPESKKKRIPIPE 222 (503)
Q Consensus 174 ~KhkvsmDfRANLKqVKKEe~~leE--eekeeVgDWRKNiE~K~~~e~~~~ 222 (503)
.||+++||||+|||+|||+++..+. +++.+||||||||+..+++|....
T Consensus 161 ~k~~~k~DlRanLK~VKKed~~~e~~~kkk~ek~dW~K~~~~~~~mE~~~k 211 (221)
T KOG3977|consen 161 SKHKVKMDLRANLKQVKKEDTEKERPNKKKREKGDWRKNIEPESGMEGRKK 211 (221)
T ss_pred cchhhhHHHHHHHHHhhhhhHHHhhhhhhcccchhhhhccCccccCccchh
Confidence 3789999999999999999775433 345689999999999888876554
No 2
>PF00992 Troponin: Troponin; InterPro: IPR001978 The troponin (Tn) complex regulates Ca2+ induced muscle contraction. Tn contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). This family includes troponin T and troponin I. Troponin I binds to actin and troponin T binds to tropomyosin [, , ].; PDB: 1J1D_C 1MXL_I 1LXF_I 2KRD_I 1J1E_F 2L1R_B 2KGB_I 1YTZ_I 1YV0_I 1VDJ_A ....
Probab=100.00 E-value=6e-41 Score=295.70 Aligned_cols=129 Identities=42% Similarity=0.640 Sum_probs=120.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhh
Q psy9560 61 KKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFE 140 (503)
Q Consensus 61 KkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyE 140 (503)
|++||+|||++|..+|..++++++++|.+||++||+ +++|+|+|.++|+++|++||++|++||++|||+|++|.+++||
T Consensus 1 K~~lK~L~~~kA~~el~~e~~~keeek~~~L~eri~-~l~~~g~~~~eL~~~~k~lh~ri~~leeEryd~E~kv~k~~~E 79 (132)
T PF00992_consen 1 KRMLKDLLLLKAKIELHFEQRKKEEEKRRYLAERIP-PLKLDGLSRAELQELCKELHERIDKLEEERYDLEEKVAKQDYE 79 (132)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHCCCTS-S--GTTSSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-CCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 679999999999999999999999999999999996 6689999999999999999999999999999999999999999
Q ss_pred HHHHHHHHHHhhCCCCCCccccccccHH-HHHHH--Hhhhhccchhhhccccc
Q psy9560 141 ITELGKQVNDLRGKFVRPILKKVSKYEN-KFAKL--QKKAAEFNFRSQLKAVK 190 (503)
Q Consensus 141 InELniKVnDLRGKFKKPaLKKVRkSAn-m~a~L--~KhkvsmDfRANLKqVK 190 (503)
|++||++|+|++|||+||+||+|++|++ |+.+| .+|.++||||+|||+||
T Consensus 80 i~elk~kv~d~rgkfkkp~lkkvkk~~~~~~~~~l~~k~~~~~~lr~~lK~vk 132 (132)
T PF00992_consen 80 IEELKKKVNDLRGKFKKPTLKKVKKSADAMRRKLLGIKHKVSMDLRANLKEVK 132 (132)
T ss_dssp HHHHCCCCCCCCCSHCCHHHHHHHHHHHHHHHHHHCTTCSHHHHHHHHHHHHH
T ss_pred HHHHHhHHHHHcccccchHHHHHHHHHHHHHHhhhCcccccHHHHHHHhhccC
Confidence 9999999999999999999999999995 34444 38999999999999996
No 3
>KOG3634|consensus
Probab=99.70 E-value=1.1e-16 Score=160.73 Aligned_cols=87 Identities=20% Similarity=0.360 Sum_probs=72.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHH
Q psy9560 67 LLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGK 146 (503)
Q Consensus 67 LLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELni 146 (503)
+.++.|+.+++..++|.+.+|+++|.--|..+++||+|..++|++++|+||+|||+||.+||||+.++++|+|||++||.
T Consensus 193 gn~~qak~e~~mtKeQqEdakk~~l~aiRkk~~~~~~~~e~~LkeKiKELhqrI~kLE~EKyDLekR~eRQeYDlkeL~e 272 (361)
T KOG3634|consen 193 GNIVQAKQEMGMTKEQQEDAKKKFLLAIRKKPLNISELPENDLKEKIKELHQRICKLETEKYDLEKRHERQEYDLKELNE 272 (361)
T ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHHHHhcccccccCCHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHhhhccHHHHHH
Confidence 34455667777667776655555554444489999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhC
Q psy9560 147 QVNDLRG 153 (503)
Q Consensus 147 KVnDLRG 153 (503)
|+++...
T Consensus 273 Rqrq~~r 279 (361)
T KOG3634|consen 273 RQRQVQR 279 (361)
T ss_pred HHHHHHH
Confidence 9888643
No 4
>KOG1029|consensus
Probab=95.79 E-value=0.058 Score=61.26 Aligned_cols=45 Identities=16% Similarity=0.237 Sum_probs=24.9
Q ss_pred HHHHHHHHHhhhhhhHHHHHhhhhh--------------hHHHHHHHHHHhhCCCCCCc
Q psy9560 115 EYWQRIYNLEAIKYELDREIMLKDF--------------EITELGKQVNDLRGKFVRPI 159 (503)
Q Consensus 115 ELHdRIdkLEEEKYDLE~KVkKQDy--------------EInELniKVnDLRGKFKKPa 159 (503)
-|..+|.+|++--.|+..++..++. ||..|..||.+++-|..+=+
T Consensus 448 tLn~k~qqls~kl~Dvr~~~tt~kt~ie~~~~q~e~~isei~qlqarikE~q~kl~~l~ 506 (1118)
T KOG1029|consen 448 TLNFKLQQLSGKLQDVRVDITTQKTEIEEVTKQRELMISEIDQLQARIKELQEKLQKLA 506 (1118)
T ss_pred HHHHHHHHHhhhhhhheeccchHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3444445555555555555555444 34556667777777665543
No 5
>PHA03247 large tegument protein UL36; Provisional
Probab=95.75 E-value=1.4 Score=55.78 Aligned_cols=9 Identities=67% Similarity=0.763 Sum_probs=3.7
Q ss_pred hhhcCchhh
Q psy9560 312 AVSATPQPL 320 (503)
Q Consensus 312 ~~~~~~~~~ 320 (503)
.++++|-+.
T Consensus 2715 ~~pa~P~pp 2723 (3151)
T PHA03247 2715 LVSATPLPP 2723 (3151)
T ss_pred CCCCCCCCC
Confidence 344444433
No 6
>KOG1029|consensus
Probab=95.12 E-value=0.26 Score=56.36 Aligned_cols=16 Identities=6% Similarity=-0.023 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHH
Q psy9560 106 EDEIGDLCQEYWQRIY 121 (503)
Q Consensus 106 edELQEkCKELHdRId 121 (503)
...|+.+.++|..+|+
T Consensus 446 letLn~k~qqls~kl~ 461 (1118)
T KOG1029|consen 446 LETLNFKLQQLSGKLQ 461 (1118)
T ss_pred HHHHHHHHHHHhhhhh
Confidence 3445555555555554
No 7
>PHA03247 large tegument protein UL36; Provisional
Probab=94.65 E-value=9.3 Score=49.09 Aligned_cols=9 Identities=22% Similarity=0.686 Sum_probs=4.7
Q ss_pred cccccchhh
Q psy9560 206 EWVKEPESK 214 (503)
Q Consensus 206 DWRKNiE~K 214 (503)
-|-+-.|+-
T Consensus 2535 ~w~~glEe~ 2543 (3151)
T PHA03247 2535 TWIRGLEEL 2543 (3151)
T ss_pred hhhhhhhhh
Confidence 465555553
No 8
>PRK07764 DNA polymerase III subunits gamma and tau; Validated
Probab=92.23 E-value=22 Score=40.93 Aligned_cols=64 Identities=16% Similarity=0.178 Sum_probs=34.3
Q ss_pred hhhhhHHhhccccCCCCccHHHHHH-------------------HHHHhhhc-CchhhHHHHHHHHHhhhhccccccCCC
Q psy9560 283 PEELWLYLKDTVSASNPVSIDDLVE-------------------KLMSAVSA-TPQPLLKSVVKDTLRRQASHEETEGGE 342 (503)
Q Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 342 (503)
..+.|--|...|.-.|-.++--|+. -|...++. ----+|.++|.+.|..+..-+-+-|+.
T Consensus 512 lr~~W~~Il~~V~~~~r~~~~~~l~~a~v~~~~g~~L~L~f~t~~la~~f~~~~~a~vl~~al~e~lG~~~~V~a~vg~~ 591 (824)
T PRK07764 512 LRERWPEILAAVPKRSRKTWAILLPEATVLGVRGDTLVLGFSTGGLARRFASPGNAEVLVTALAEELGGDWQVEAVVGPA 591 (824)
T ss_pred HHHHHHHHHHHHhhccHHHHHHhcccceeecccCCEEEEecCCHHHHHHHhcccchHHHHHHHHHHhCCceEEEEEeCCC
Confidence 4588999988888777311111111 12222222 122356667777776666666666655
Q ss_pred CCCC
Q psy9560 343 PKPE 346 (503)
Q Consensus 343 ~~~~ 346 (503)
+...
T Consensus 592 ~~~~ 595 (824)
T PRK07764 592 PGAA 595 (824)
T ss_pred cccc
Confidence 5443
No 9
>KOG1924|consensus
Probab=90.23 E-value=1.5 Score=50.53 Aligned_cols=26 Identities=23% Similarity=0.272 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHh
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIM 135 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVk 135 (503)
.+|+=.| |+-|..|...-|||.+++.
T Consensus 302 ~~l~vac---mq~INal~t~p~dldfRlh 327 (1102)
T KOG1924|consen 302 QQLQVAC---MQFINALVTSPSDLDFRLH 327 (1102)
T ss_pred HHHHHHH---HHHHHHhcCCHHHhhHHHH
Confidence 3444444 3467777777777776664
No 10
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=88.96 E-value=7.7 Score=43.81 Aligned_cols=26 Identities=27% Similarity=0.254 Sum_probs=23.1
Q ss_pred hhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 128 YELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 128 YDLE~KVkKQDyEInELniKVnDLRG 153 (503)
=.++..+.++|.||.||++||.+|.-
T Consensus 632 ei~~~~~~~~d~ei~~lk~ki~~~~a 657 (697)
T PF09726_consen 632 EIAQGQLRKKDKEIEELKAKIAQLLA 657 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 36677899999999999999999976
No 11
>KOG3054|consensus
Probab=88.58 E-value=1.4 Score=45.06 Aligned_cols=11 Identities=55% Similarity=0.776 Sum_probs=8.0
Q ss_pred HHHHHhhCCCC
Q psy9560 146 KQVNDLRGKFV 156 (503)
Q Consensus 146 iKVnDLRGKFK 156 (503)
.=|.|=||||.
T Consensus 245 tGVmDDRGKfI 255 (299)
T KOG3054|consen 245 TGVMDDRGKFI 255 (299)
T ss_pred eeeecCCCceE
Confidence 34677799995
No 12
>KOG2891|consensus
Probab=86.55 E-value=11 Score=39.55 Aligned_cols=10 Identities=20% Similarity=0.132 Sum_probs=5.3
Q ss_pred HHHHHHHHHH
Q psy9560 112 LCQEYWQRIY 121 (503)
Q Consensus 112 kCKELHdRId 121 (503)
++|+|.++|.
T Consensus 408 kikeleek~~ 417 (445)
T KOG2891|consen 408 KIKELEEKIK 417 (445)
T ss_pred HHHHHHHHHH
Confidence 3456655553
No 13
>PRK07764 DNA polymerase III subunits gamma and tau; Validated
Probab=85.73 E-value=26 Score=40.44 Aligned_cols=59 Identities=7% Similarity=-0.085 Sum_probs=32.2
Q ss_pred HhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 90 VIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 90 iLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
.|.-|| ..+.+..|+.++|.++++++..+.. +.-+.-.|.+.+.....++.++..-+..
T Consensus 166 TIrSRc-~~v~F~~l~~~~l~~~L~~il~~EG-v~id~eal~lLa~~sgGdlR~Al~eLEK 224 (824)
T PRK07764 166 TIRSRT-HHYPFRLVPPEVMRGYLERICAQEG-VPVEPGVLPLVIRAGGGSVRDSLSVLDQ 224 (824)
T ss_pred HHHhhe-eEEEeeCCCHHHHHHHHHHHHHHcC-CCCCHHHHHHHHHHcCCCHHHHHHHHHH
Confidence 356688 4678888888888877776554422 2112222334444444445444433333
No 14
>PRK12323 DNA polymerase III subunits gamma and tau; Provisional
Probab=85.62 E-value=46 Score=38.27 Aligned_cols=25 Identities=16% Similarity=0.190 Sum_probs=18.1
Q ss_pred hhhhcCCCCCCCCCCHHHHHHHHHHH
Q psy9560 91 IDERCGEPKDTDDMAEDEIGDLCQEY 116 (503)
Q Consensus 91 LAERcpPPLnIDGLSedELQEkCKEL 116 (503)
|.-|| ..+.+..++.+++.+.++.+
T Consensus 171 IrSRC-q~f~f~~ls~eei~~~L~~I 195 (700)
T PRK12323 171 VLSRC-LQFNLKQMPPGHIVSHLDAI 195 (700)
T ss_pred HHHHH-HhcccCCCChHHHHHHHHHH
Confidence 44589 47789999988877666544
No 15
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=85.08 E-value=14 Score=39.27 Aligned_cols=53 Identities=19% Similarity=0.090 Sum_probs=49.3
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 102 DDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 102 DGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK 154 (503)
...+..+|.++..-+.+++..|..++++++.+++..+.+|..|+.+++.|.+.
T Consensus 122 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~l~~~ 174 (525)
T TIGR02231 122 NEPDLKEWFQAFDFNGSEIERLLTEDREAERRIRELEKQLSELQNELNALLTG 174 (525)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC
Confidence 35578999999999999999999999999999999999999999999999883
No 16
>KOG1144|consensus
Probab=84.31 E-value=2.5 Score=48.92 Aligned_cols=28 Identities=32% Similarity=0.374 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHHHhhhh
Q psy9560 23 EAKRLKLEELERKRAEVRRRMEEASGAR 50 (503)
Q Consensus 23 ed~kRKqeErERKkAEvRKRlEEA~KkK 50 (503)
|.++|-++|+.|+.+|.|+|++|.++.+
T Consensus 231 E~qkreeEE~~r~eeEEer~~ee~E~~~ 258 (1064)
T KOG1144|consen 231 ERQKREEEERLRREEEEERRREEEEAQE 258 (1064)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567777888888877777766655443
No 17
>PF06705 SF-assemblin: SF-assemblin/beta giardin
Probab=84.25 E-value=22 Score=34.53 Aligned_cols=60 Identities=18% Similarity=0.258 Sum_probs=33.2
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhh-HHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYE-LDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAKL 173 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYD-LE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~L 173 (503)
+.|.+.|..||..|..--.++-+ |+.. ...|..+|++|+..|..=..-|+.+-.+++.+|
T Consensus 95 ~~L~~ri~~L~~~i~ee~~~r~~~ie~~-------~~~l~~~l~~l~~~~~~Er~~R~erE~~i~krl 155 (247)
T PF06705_consen 95 DSLNDRIEALEEEIQEEKEERPQDIEEL-------NQELVRELNELQEAFENERNEREEREENILKRL 155 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45777777777777665555443 2222 233444555555555554445555555555555
No 18
>KOG2412|consensus
Probab=82.38 E-value=9.7 Score=42.52 Aligned_cols=57 Identities=21% Similarity=0.353 Sum_probs=41.6
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHH
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYEN 168 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAn 168 (503)
++....+-.|..++.+-.-+-|++.-+.+.+-++..+++. |+. -.-|.|--++++..
T Consensus 298 ~~~~ds~m~w~~~d~i~q~k~d~v~pi~~kd~~lk~~~~~---~kr-~in~~~~qis~~~~ 354 (591)
T KOG2412|consen 298 SSASDSQMFWNSQDAIAQSKLDLVNPILKKDEELKNYNQS---LKR-AINPPFSQISKSNG 354 (591)
T ss_pred CchhHHHhhhhHHHHHHHHHHHHHhhhhhHHHHHHHHHHH---HHh-hcCCChhhhhhccH
Confidence 3556777788888888889999999988887766666554 444 35677777777773
No 19
>KOG1924|consensus
Probab=82.33 E-value=14 Score=43.27 Aligned_cols=9 Identities=44% Similarity=0.693 Sum_probs=4.0
Q ss_pred HHHHHHHHH
Q psy9560 302 IDDLVEKLM 310 (503)
Q Consensus 302 ~~~~~~~~~ 310 (503)
||++|++-|
T Consensus 460 iD~~vdkak 468 (1102)
T KOG1924|consen 460 IDKMVDKAK 468 (1102)
T ss_pred HHHHHHHHH
Confidence 344444444
No 20
>PRK04863 mukB cell division protein MukB; Provisional
Probab=81.91 E-value=36 Score=41.83 Aligned_cols=62 Identities=11% Similarity=0.157 Sum_probs=54.3
Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHh---hCCCCCCc
Q psy9560 98 PKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDL---RGKFVRPI 159 (503)
Q Consensus 98 PLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDL---RGKFKKPa 159 (503)
++.+++|+.++|+.....|.+++..+..+.-+++.++...+..++.|+...+.+ .|+.-+..
T Consensus 429 ~~~~~~~SdEeLe~~LenF~aklee~e~qL~elE~kL~~lea~leql~~~~~~l~~~~Gkv~~~~ 493 (1486)
T PRK04863 429 LCGLPDLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSE 493 (1486)
T ss_pred HhCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCHHH
Confidence 456788999999999999999999999999999999999999999999998774 66544443
No 21
>COG1340 Uncharacterized archaeal coiled-coil protein [Function unknown]
Probab=77.49 E-value=59 Score=33.89 Aligned_cols=47 Identities=28% Similarity=0.329 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|+|+..+.+||..|..+-...=.+...+.....+|.+|+.+|+.|+-
T Consensus 203 De~Rkeade~he~~ve~~~~~~e~~ee~~~~~~elre~~k~ik~l~~ 249 (294)
T COG1340 203 DELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRA 249 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56777777777777666666666666666666677777777766665
No 22
>COG4839 FtsL Protein required for the initiation of cell division [Cell division and chromosome partitioning]
Probab=77.08 E-value=5.8 Score=36.52 Aligned_cols=38 Identities=21% Similarity=0.323 Sum_probs=33.5
Q ss_pred HHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 116 YWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 116 LHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+.-+++++..+--||++++..+..++.+|.+.|+||..
T Consensus 58 ~q~~~yqvq~ei~~Le~kIs~q~~e~~dlkqeV~dLss 95 (120)
T COG4839 58 VQTKAYQVQGEITDLESKISEQKTENDDLKQEVKDLSS 95 (120)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHhhhhhHHHHHHHhcc
Confidence 44667788888889999999999999999999999965
No 23
>KOG4722|consensus
Probab=76.44 E-value=30 Score=38.30 Aligned_cols=112 Identities=21% Similarity=0.304 Sum_probs=64.6
Q ss_pred HhhHHHHHHHHHHHHHHHHHHHHHhHHHHhhhhccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCC
Q psy9560 18 ASAEDEAKRLKLEELERKRAEVRRRMEEASGARRKKGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGE 97 (503)
Q Consensus 18 ~mad~ed~kRKqeErERKkAEvRKRlEEA~KkKAKKgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpP 97 (503)
-.+|+.|=-.|-+++|.+-.+-|.+|.|+... +|+.|+ ..- ++..+.++.....|++.|.++.-
T Consensus 238 ~~ad~~dfgKkHEeKQ~~AeeLRekLqE~Kal-------------KLkeLl-eRe-edVRk~kE~L~dqkRqllE~kll- 301 (672)
T KOG4722|consen 238 GHADDDDFGKKHEEKQKHAEELREKLQEAKAL-------------KLKELL-ERE-EDVRKKKEALKDQKRQLLEAKLL- 301 (672)
T ss_pred cccccchhhhhhHHHHHHHHHHHHHHHHHHHH-------------HHHHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHh-
Confidence 34566666678888888888889999988753 344433 211 22223344555566677777662
Q ss_pred CCCCCCCCHHHHHHHHHHHH---------HHHHHHhh--hhhhHHHHHhhhhhhHHHHHHH
Q psy9560 98 PKDTDDMAEDEIGDLCQEYW---------QRIYNLEA--IKYELDREIMLKDFEITELGKQ 147 (503)
Q Consensus 98 PLnIDGLSedELQEkCKELH---------dRIdkLEE--EKYDLE~KVkKQDyEInELniK 147 (503)
.-+.-.+-.|+.++|.-. .-|..|+. -||||-.|...-..-++++|..
T Consensus 302 --hAe~kRd~ni~aiikkAkdddakV~EiaFI~TLea~NKRhdlLaK~~e~Eekl~E~nee 360 (672)
T KOG4722|consen 302 --HAEDKRDKNIMAIIKKAKDDDAKVMEIAFIATLEADNKRHDLLAKDAEVEEKLLEKNEE 360 (672)
T ss_pred --hhhhcchhhHHHHHHHhhhhhHHHHHHHHHHHHHHhhhHHHHHHhhHHHHHHHHHHHHH
Confidence 222223344555544332 23556654 4899988876655555555543
No 24
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=75.27 E-value=17 Score=37.04 Aligned_cols=88 Identities=23% Similarity=0.342 Sum_probs=65.7
Q ss_pred HHHHHHHH---HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCH------HHHHHHHHHHHHHHHHHhhhhhhHHHHHhh
Q psy9560 66 LLLRKKAA---EELKREQERRALERKRVIDERCGEPKDTDDMAE------DEIGDLCQEYWQRIYNLEAIKYELDREIML 136 (503)
Q Consensus 66 SLLLqKAa---EELEKEKEqKeEEKKKiLAERcpPPLnIDGLSe------dELQEkCKELHdRIdkLEEEKYDLE~KVkK 136 (503)
|=|.++++ ..|++|- ..++.|.+.|+ . +++|+++.. ..++.-+..+.+.|+.|..+--.|+.|++|
T Consensus 127 seit~~GA~LydlL~kE~-~lr~~R~~a~~--r--~~e~~~iE~~l~~ai~~~~~~~~~~~~~l~~l~~de~~Le~KIek 201 (267)
T PF10234_consen 127 SEITQRGASLYDLLGKEV-ELREERQRALA--R--PLELNEIEKALKEAIKAVQQQLQQTQQQLNNLASDEANLEAKIEK 201 (267)
T ss_pred HHHHHHHHHHHHHHhchH-hHHHHHHHHHc--C--CcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33555555 3456664 66677888877 2 356665321 345666777788899999999999999999
Q ss_pred hhhhHHHHHHHHHHhhCCCCCCcc
Q psy9560 137 KDFEITELGKQVNDLRGKFVRPIL 160 (503)
Q Consensus 137 QDyEInELniKVnDLRGKFKKPaL 160 (503)
...|+.-...|+.-|+. +||+.
T Consensus 202 kk~ELER~qKRL~sLq~--vRPAf 223 (267)
T PF10234_consen 202 KKQELERNQKRLQSLQS--VRPAF 223 (267)
T ss_pred HHHHHHHHHHHHHHHHh--cChHH
Confidence 99999999999999998 77764
No 25
>KOG0163|consensus
Probab=74.87 E-value=41 Score=39.67 Aligned_cols=29 Identities=17% Similarity=0.207 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhh
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIML 136 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkK 136 (503)
+|-.-+++=..+-.++|-||-|-+.-++-
T Consensus 988 qla~e~eee~k~q~~~Eqer~D~~la~Rl 1016 (1259)
T KOG0163|consen 988 QLAKEAEEEAKRQNQLEQERRDHELALRL 1016 (1259)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Confidence 34444444445566677777777665543
No 26
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=74.20 E-value=12 Score=30.42 Aligned_cols=47 Identities=17% Similarity=0.251 Sum_probs=41.8
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+++++.+..++.+++.++..-=++|.....++.+|..|+.++.++..
T Consensus 2 ~~i~e~l~~ie~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~l~~I~~ 48 (71)
T PF10779_consen 2 QDIKEKLNRIETKLDNHEERIDKLEKRDAANEKDIKNLNKQLEKIKS 48 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46888889999999998888888999999999999999999998865
No 27
>KOG0804|consensus
Probab=74.09 E-value=75 Score=35.24 Aligned_cols=42 Identities=24% Similarity=0.138 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|.+.+--|+.+|.+..=-+..-+.-.|..|+||+.+|.||.=
T Consensus 408 l~knq~vw~~kl~~~~e~~~~~~~s~d~~I~dLqEQlrDlmf 449 (493)
T KOG0804|consen 408 LIKNQDVWRGKLKELEEREKEALGSKDEKITDLQEQLRDLMF 449 (493)
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhe
Confidence 334444555555555555566677788999999999999964
No 28
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=73.32 E-value=26 Score=32.25 Aligned_cols=81 Identities=11% Similarity=0.209 Sum_probs=41.2
Q ss_pred HHHHHHHHHHHHHH---HHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHH
Q psy9560 69 RKKAAEELKREQER---RALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELG 145 (503)
Q Consensus 69 LqKAaEELEKEKEq---KeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELn 145 (503)
|..|...+.+.-++ .-..=|+.|..|+. ++|. +.|+..++.+.+-+.+..|-++--+|...|..-..-+..|.
T Consensus 41 m~~A~~~v~kql~~vs~~l~~tKkhLsqRId---~vd~-klDe~~ei~~~i~~eV~~v~~dv~~i~~dv~~v~~~V~~Le 116 (126)
T PF07889_consen 41 MSDAVASVSKQLEQVSESLSSTKKHLSQRID---RVDD-KLDEQKEISKQIKDEVTEVREDVSQIGDDVDSVQQMVEGLE 116 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHh-hHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Confidence 34555555433222 23344566778883 4554 66777777766666665555444444333333333444444
Q ss_pred HHHHHhhC
Q psy9560 146 KQVNDLRG 153 (503)
Q Consensus 146 iKVnDLRG 153 (503)
.||.+|.+
T Consensus 117 ~ki~~ie~ 124 (126)
T PF07889_consen 117 GKIDEIEE 124 (126)
T ss_pred HHHHHHhc
Confidence 44444443
No 29
>PF01486 K-box: K-box region; InterPro: IPR002487 MADS genes in plants encode key developmental regulators of vegetative and reproductive development. The majority of the plant MADS proteins share a stereotypical MIKC structure. It comprises (from N- to C-terminal) an N-terminal domain, which is, however, present only in a minority of proteins; a MADS domain (see PDOC00302 from PROSITEDOC, IPR002100 from INTERPRO), which is the major determinant of DNA-binding but which also performs dimerisation and accessory factor binding functions; a weakly conserved intervening (I) domain, which constitutes a key molecular determinant for the selective formation of DNA-binding dimers; a keratin-like (K-box) domain, which promotes protein dimerisation; and a C-terminal (C) domain, which is involved in transcriptional activation or in the formation of ternary or quaternary protein complexes. The 80-amino acid K-box domain was originally identified as a region with low but significant similarity to a region of keratin, which is part of the coiled-coil sequence constituting the central rod-shaped domain of keratin [, , ]. The K-box protein-protein interaction domain which mediates heterodimerization of MIKC-type MADS proteins contains several heptad repeats in which the first and the fourth positions are occupied by hydrophobic amino acids suggesting that the K-box domain forms three amphipathic alpha-helices referred to as K1, K2, and K3 [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=73.10 E-value=14 Score=31.23 Aligned_cols=48 Identities=23% Similarity=0.264 Sum_probs=37.3
Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 99 KDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 99 LnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
-+|++||..+|+.|=+.|...|.+|-+-|+.+ -..+|..|+.++..|.
T Consensus 41 edL~~Ls~~eL~~LE~~Le~aL~~VR~rK~~~------l~~~i~~l~~ke~~l~ 88 (100)
T PF01486_consen 41 EDLESLSLKELQQLEQQLESALKRVRSRKDQL------LMEQIEELKKKERELE 88 (100)
T ss_pred ccccccchHHHHHHHHhhhhhHHHHHHHHHHH------HHHHHHHHHHHHHHHH
Confidence 58999999999999999999999999777653 3345666666665553
No 30
>KOG4460|consensus
Probab=72.36 E-value=63 Score=36.84 Aligned_cols=12 Identities=33% Similarity=0.393 Sum_probs=7.7
Q ss_pred cchhhhcccccc
Q psy9560 180 FNFRSQLKAVKK 191 (503)
Q Consensus 180 mDfRANLKqVKK 191 (503)
--.|.|.|+||-
T Consensus 723 ~~i~~~~k~VK~ 734 (741)
T KOG4460|consen 723 EHIREMVKQVKD 734 (741)
T ss_pred HHHHHHHHHHHH
Confidence 345667777774
No 31
>PF00992 Troponin: Troponin; InterPro: IPR001978 The troponin (Tn) complex regulates Ca2+ induced muscle contraction. Tn contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). This family includes troponin T and troponin I. Troponin I binds to actin and troponin T binds to tropomyosin [, , ].; PDB: 1J1D_C 1MXL_I 1LXF_I 2KRD_I 1J1E_F 2L1R_B 2KGB_I 1YTZ_I 1YV0_I 1VDJ_A ....
Probab=71.83 E-value=5.3 Score=36.31 Aligned_cols=36 Identities=8% Similarity=-0.106 Sum_probs=30.5
Q ss_pred HHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHH
Q psy9560 81 ERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYW 117 (503)
Q Consensus 81 EqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELH 117 (503)
......|+.+++.+. .+++++|++..+|++.++++|
T Consensus 97 p~lkkvkk~~~~~~~-~~l~~k~~~~~~lr~~lK~vk 132 (132)
T PF00992_consen 97 PTLKKVKKSADAMRR-KLLGIKHKVSMDLRANLKEVK 132 (132)
T ss_dssp HHHHHHHHHHHHHHH-HHHCTTCSHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHH-hhhCcccccHHHHHHHhhccC
Confidence 345566778888888 688999999999999999987
No 32
>PF09744 Jnk-SapK_ap_N: JNK_SAPK-associated protein-1; InterPro: IPR019143 This entry represents the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have a RhoGEF domain (IPR000219 from INTERPRO) at their C-terminal end.
Probab=71.61 E-value=12 Score=35.27 Aligned_cols=85 Identities=19% Similarity=0.250 Sum_probs=42.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhh
Q psy9560 61 KKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFE 140 (503)
Q Consensus 61 KkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyE 140 (503)
.+.+...+|+...+.|..+.+.+.+-|+++ ..++ ..+ +|.++.-+++|..+|..|+++.=-|+.+++-..-.
T Consensus 47 ~~~~e~~~L~~d~e~L~~q~~~ek~~r~~~-e~~l---~~~----Ed~~~~e~k~L~~~v~~Le~e~r~L~~~~~~~~~q 118 (158)
T PF09744_consen 47 EHEVELELLREDNEQLETQYEREKELRKQA-EEEL---LEL----EDQWRQERKDLQSQVEQLEEENRQLELKLKNLSDQ 118 (158)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH---HHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh
Confidence 344444455555555554444433333333 2223 111 36666677777777777776666666555443333
Q ss_pred HHHHHHHHHHhhC
Q psy9560 141 ITELGKQVNDLRG 153 (503)
Q Consensus 141 InELniKVnDLRG 153 (503)
+.-|..+-.+|+.
T Consensus 119 ~~rlee~e~~l~~ 131 (158)
T PF09744_consen 119 SSRLEEREAELKK 131 (158)
T ss_pred ccccchhHHHHHH
Confidence 4444444444443
No 33
>PRK12323 DNA polymerase III subunits gamma and tau; Provisional
Probab=70.63 E-value=2.1e+02 Score=33.29 Aligned_cols=20 Identities=30% Similarity=0.330 Sum_probs=8.3
Q ss_pred HHHhhhh-ccccccCCCCCCC
Q psy9560 327 DTLRRQA-SHEETEGGEPKPE 346 (503)
Q Consensus 327 ~~~~~~~-~~~~~~~~~~~~~ 346 (503)
-+|-|.. -+-.+.||+--|.
T Consensus 355 m~llrll~~~p~~~~~~~~~~ 375 (700)
T PRK12323 355 MTLLRMLAFRPGQSGGGAGPA 375 (700)
T ss_pred HHHHHHhhCCCCCCCCCCCCC
Confidence 3443433 3333445554433
No 34
>PF13864 Enkurin: Calmodulin-binding
Probab=70.00 E-value=59 Score=27.74 Aligned_cols=53 Identities=23% Similarity=0.312 Sum_probs=32.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh-hcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhh
Q psy9560 68 LRKKAAEELKREQERRALERKRVIDE-RCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAI 126 (503)
Q Consensus 68 LLqKAaEELEKEKEqKeEEKKKiLAE-RcpPPLnIDGLSedELQEkCKELHdRIdkLEEE 126 (503)
||.+-+.++..+.+. +.+++.+ .|+ ..+--|++++.+++...|..+.+.|..+
T Consensus 6 YL~~~k~e~~~e~~~----~~~~~~~~~~~--~~~~~l~eeER~~lL~~Lk~~~~el~~e 59 (98)
T PF13864_consen 6 YLKKRKKEIAEEEEE----REREEEDQECP--PGMRLLSEEERQELLEGLKKNWDELNKE 59 (98)
T ss_pred HHHHHHHHHHHHHHH----HHHHHHhHhCc--cccccCCHHHHHHHHHHHHHHHHHHHHH
Confidence 455555555544332 2222222 353 3566689999999999998888887765
No 35
>COG1340 Uncharacterized archaeal coiled-coil protein [Function unknown]
Probab=69.34 E-value=40 Score=35.05 Aligned_cols=44 Identities=18% Similarity=0.241 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHh
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDL 151 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDL 151 (503)
.|++.-.+|-++|..|-+.|-++..++...-.++..|..+.+..
T Consensus 59 elr~~rdeineev~elK~kR~ein~kl~eL~~~~~~l~e~~~~~ 102 (294)
T COG1340 59 ELREERDEINEEVQELKEKRDEINAKLQELRKEYRELKEKRNEF 102 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Confidence 35555566666666666666666666666666666666665544
No 36
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=67.50 E-value=18 Score=41.01 Aligned_cols=59 Identities=15% Similarity=0.131 Sum_probs=31.9
Q ss_pred HHhhhhhhHHHHHHHHHHhhCCCCCCccccccccH-HHHHHHHhhh-hccchhhhccccccc
Q psy9560 133 EIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYE-NKFAKLQKKA-AEFNFRSQLKAVKKR 192 (503)
Q Consensus 133 KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSA-nm~a~L~Khk-vsmDfRANLKqVKKE 192 (503)
|+..-..-=+.|..|+..+...+.. .+...+.++ +|+..|+.-. .--+|++.|++||+.
T Consensus 601 R~e~a~d~Qe~L~~R~~~vl~~l~~-~~P~LS~AEr~~~~EL~~~~~~l~~l~~si~~lk~k 661 (717)
T PF10168_consen 601 RYEEAKDKQEKLMKRVDRVLQLLNS-QLPVLSEAEREFKKELERMKDQLQDLKASIEQLKKK 661 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc-cCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333344566666665554422 233345555 5556665322 224588999999876
No 37
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=66.62 E-value=57 Score=33.56 Aligned_cols=48 Identities=21% Similarity=0.234 Sum_probs=34.6
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK 154 (503)
.+|+..+.++.++|..|+-++--|+.++++.--++.+|+.|..+|--+
T Consensus 159 eele~e~ee~~erlk~le~E~s~LeE~~~~l~~ev~~L~~r~~ELe~~ 206 (290)
T COG4026 159 EELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDELEPG 206 (290)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHhccc
Confidence 346667777777777777777777777777777777777777776553
No 38
>KOG1103|consensus
Probab=66.41 E-value=1.4e+02 Score=32.68 Aligned_cols=48 Identities=21% Similarity=0.176 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+.++..++.||.--...|..++-..+.+.+..+.|+..|+.-|.||-.
T Consensus 240 eaqvek~i~EfdiEre~LRAel~ree~r~K~lKeEmeSLkeiVkdlEA 287 (561)
T KOG1103|consen 240 EAQVEKLIEEFDIEREFLRAELEREEKRQKMLKEEMESLKEIVKDLEA 287 (561)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Confidence 455666666766666677777777777777778888888888877743
No 39
>PF15175 SPATA24: Spermatogenesis-associated protein 24
Probab=66.24 E-value=15 Score=35.09 Aligned_cols=56 Identities=20% Similarity=0.279 Sum_probs=39.3
Q ss_pred HHHHHHHHHHHHHHHH-HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccH
Q psy9560 106 EDEIGDLCQEYWQRIY-NLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYE 167 (503)
Q Consensus 106 edELQEkCKELHdRId-kLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSA 167 (503)
-|+|..+|.++-..|. . --+.-++-.+.+|++|+++|..-+--|-++.|.-+++-.
T Consensus 72 ~dqL~~KC~~~~~ei~c~------kqed~LngKe~~I~eLk~~l~sQK~~~Hk~qlsdl~Iqk 128 (153)
T PF15175_consen 72 KDQLITKCNEIESEIICH------KQEDILNGKENEIKELKQRLASQKQNFHKRQLSDLRIQK 128 (153)
T ss_pred HHHHHHHHHHHHHHHHhc------chhhhhcccccchHHHHhhhHHHHHHHhhccchhhHHhh
Confidence 3667777776654433 0 344568888999999999998887767777776666544
No 40
>PF04859 DUF641: Plant protein of unknown function (DUF641); InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=66.03 E-value=65 Score=29.94 Aligned_cols=77 Identities=21% Similarity=0.259 Sum_probs=50.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCC--CCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHH
Q psy9560 70 KKAAEELKREQERRALERKRVIDERCGEPKDTDD--MAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQ 147 (503)
Q Consensus 70 qKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDG--LSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniK 147 (503)
+-|..-+-.|-+.--+=|+.|+.-...+...... -..+++|.+.+-|.-.+.+| +..+..+|-||..|+.+
T Consensus 44 ~aAD~~vVsEL~~Ls~LK~~y~~~~~~~~~~~~~l~a~~~e~qsli~~yE~~~~kL-------e~e~~~Kdsei~~Lr~~ 116 (131)
T PF04859_consen 44 QAADEAVVSELRRLSELKRRYRKKQSDPSPQVARLAAEIQEQQSLIKTYEIVVKKL-------EAELRAKDSEIDRLREK 116 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCCCccccccccchHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHH
Confidence 3344445556667778888888766632211112 12366677777666666654 56689999999999999
Q ss_pred HHHhhC
Q psy9560 148 VNDLRG 153 (503)
Q Consensus 148 VnDLRG 153 (503)
+.++..
T Consensus 117 L~~~~~ 122 (131)
T PF04859_consen 117 LDELNR 122 (131)
T ss_pred HHHHHH
Confidence 998865
No 41
>PF10148 SCHIP-1: Schwannomin-interacting protein 1; InterPro: IPR015649 SCHIP-1 is a coiled-coil protein that specifically associates with schwannomin in vitro and in vivo. The product of the neurofibromatosis type 2 (NF2) tumour suppressor gene, known as schwannomin or merlin, is involved in NF2-associated and sporadic schwannomas and meningiomas. It is closely related to the ezrin-radixin-moesin family members, which link membrane proteins to the cytoskeleton. Association with SCHIP-1 can be observed only with some naturally occurring mutants of schwannomin, or a schwannomin spliced isoform lacking exons 2 and 3, but not with the schwannomin isoform exhibiting growth-suppressive activity []. This entry consists of mammalian SCHIP-1 proteins from Mus musculus (Mouse) and Homo sapiens (Human).
Probab=65.31 E-value=19 Score=36.35 Aligned_cols=49 Identities=18% Similarity=0.203 Sum_probs=34.5
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhhhhh-------HHHHHhhhhhhHHHHHHHH
Q psy9560 100 DTDDMAEDEIGDLCQEYWQRIYNLEAIKYE-------LDREIMLKDFEITELGKQV 148 (503)
Q Consensus 100 nIDGLSedELQEkCKELHdRIdkLEEEKYD-------LE~KVkKQDyEInELniKV 148 (503)
.+..|+..+||-|.+.||.+|..|-++-.. |-...--.--+|.||+...
T Consensus 169 ~L~~~~~~qLq~i~~~l~~~i~~ln~~Lv~~L~~RD~Lh~eqds~lvDieD~t~~~ 224 (238)
T PF10148_consen 169 DLTKMNVPQLQVIVNDLHEQIEALNEELVQLLLERDDLHMEQDSMLVDIEDLTRQL 224 (238)
T ss_pred HHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccceeeeHHHHHHHH
Confidence 367789999999999999999998887433 3333333334677776654
No 42
>PHA03011 hypothetical protein; Provisional
Probab=65.04 E-value=23 Score=32.41 Aligned_cols=47 Identities=23% Similarity=0.193 Sum_probs=40.6
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
++|..-+.+|.+--..++.++-|++.-+..|+-+|.-|+.+|..|+-
T Consensus 67 deL~~qYN~L~dEYn~i~Ne~k~~~~iIQdn~d~I~~LraeIDkLK~ 113 (120)
T PHA03011 67 DELIAQYNELLDEYNLIENEIKDLEIIIQDNDDEIHFLRAEIDKLKE 113 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHH
Confidence 45666677788888889999999999999999999999999988764
No 43
>KOG4593|consensus
Probab=65.03 E-value=44 Score=38.44 Aligned_cols=41 Identities=15% Similarity=0.024 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHH
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGK 146 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELni 146 (503)
..+|..+.+.+-..|-++..+.|+++.+++...-.|+.+..
T Consensus 153 ~ae~~~lr~k~dss~s~~q~e~~~~~~~~~~~~s~l~~~ek 193 (716)
T KOG4593|consen 153 LAELGTLRNKLDSSLSELQWEVMLQEMRAKRLHSELQNEEK 193 (716)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667777777778888888888888877766555544433
No 44
>PTZ00266 NIMA-related protein kinase; Provisional
Probab=64.39 E-value=91 Score=37.25 Aligned_cols=17 Identities=29% Similarity=0.323 Sum_probs=10.0
Q ss_pred HHHHHHhhhhhhHHHHH
Q psy9560 118 QRIYNLEAIKYELDREI 134 (503)
Q Consensus 118 dRIdkLEEEKYDLE~KV 134 (503)
++|..++.+++..+.+.
T Consensus 510 er~~r~e~e~~e~~rr~ 526 (1021)
T PTZ00266 510 ERVDRLERDRLEKARRN 526 (1021)
T ss_pred HHHHHHHHHHHHHHHhh
Confidence 55666777766544443
No 45
>PF06034 DUF919: Nucleopolyhedrovirus protein of unknown function (DUF919); InterPro: IPR009265 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf29. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several short baculovirus proteins of unknown function.
Probab=63.32 E-value=14 Score=30.58 Aligned_cols=47 Identities=21% Similarity=0.386 Sum_probs=35.3
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
++|.++|+-=++ ..+....|.--.++.|+-.||.++..||+++|-.|
T Consensus 8 ~QLd~I~~~K~~--l~ik~~H~Ekl~kitK~p~El~~i~~kl~~~R~~F 54 (62)
T PF06034_consen 8 QQLDEINQMKRQ--LTIKSQHWEKLKKITKNPKELQEIEKKLQELRQNF 54 (62)
T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHH
Confidence 455555553333 34566677778889999999999999999999766
No 46
>KOG1853|consensus
Probab=62.62 E-value=35 Score=35.61 Aligned_cols=48 Identities=13% Similarity=0.205 Sum_probs=42.7
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
+..++.++|..||.+|..||--.-|||.--+-..|-+.|+.+|+|..-
T Consensus 99 dlsqt~aikeql~kyiReLEQaNDdLErakRati~sleDfeqrLnqAI 146 (333)
T KOG1853|consen 99 DLSQTHAIKEQLRKYIRELEQANDDLERAKRATIYSLEDFEQRLNQAI 146 (333)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccHHHHhhhhhhhhHHHHHHHHHHHH
Confidence 346789999999999999999999999888888999999999998853
No 47
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=61.90 E-value=12 Score=38.18 Aligned_cols=51 Identities=18% Similarity=0.189 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVR 157 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKK 157 (503)
+.|...+.++..+|+.++++-=-++..+++...+|.+|+.+|.+.+-.|++
T Consensus 55 ~~L~~qi~~~~~k~~~~~~~i~~~~~eik~l~~eI~~~~~~I~~r~~~l~~ 105 (265)
T COG3883 55 ESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVERQELLKK 105 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666677777777777777777788999999999999999998887765
No 48
>KOG0161|consensus
Probab=61.00 E-value=1.2e+02 Score=38.65 Aligned_cols=57 Identities=25% Similarity=0.261 Sum_probs=37.9
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccc
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKV 163 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKV 163 (503)
++|+..++++-.++...++..-+++.+.++...+|.+|+..|.|+-|...|=-+.++
T Consensus 904 ~~le~~l~~~~~~~e~~ee~~~~le~~~~~~~~e~~~l~~~~~~~E~~~~k~~~Ek~ 960 (1930)
T KOG0161|consen 904 QELEKELKELKERLEEEEEKNAELERKKRKLEQEVQELKEQLEELELTLQKLELEKN 960 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 556667777777777777777777777777777777777777777664444333333
No 49
>PF09728 Taxilin: Myosin-like coiled-coil protein; InterPro: IPR019132 Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca(2+)-dependent exocytosis in neuroendocrine cells []. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localises to the nucleus in osteoblasts and dimerises with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription [].
Probab=61.00 E-value=52 Score=33.74 Aligned_cols=17 Identities=24% Similarity=0.491 Sum_probs=7.8
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNL 123 (503)
Q Consensus 107 dELQEkCKELHdRIdkL 123 (503)
+.|..||++|-.+-..|
T Consensus 74 ~KLE~LCRELQk~Nk~l 90 (309)
T PF09728_consen 74 SKLESLCRELQKQNKKL 90 (309)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34555555554443333
No 50
>KOG2072|consensus
Probab=60.87 E-value=1.2e+02 Score=36.18 Aligned_cols=36 Identities=22% Similarity=0.326 Sum_probs=18.0
Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhh
Q psy9560 98 PKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDF 139 (503)
Q Consensus 98 PLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDy 139 (503)
+.++.+|+.++++.. .|..|+-++-.|+.+++.+.+
T Consensus 656 ~ed~e~lD~d~i~~~------q~eel~Ke~kElq~rL~~q~K 691 (988)
T KOG2072|consen 656 LEDLEKLDADQIKAR------QIEELEKERKELQSRLQYQEK 691 (988)
T ss_pred hHHhhhcCHHHHHHH------HHHHHHHHHHHHHHHHHHHHh
Confidence 456666776665432 334444444444444444433
No 51
>KOG0996|consensus
Probab=60.80 E-value=32 Score=41.59 Aligned_cols=52 Identities=25% Similarity=0.214 Sum_probs=32.2
Q ss_pred HHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHHHH
Q psy9560 115 EYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAKLQ 174 (503)
Q Consensus 115 ELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~L~ 174 (503)
+|-+..+.|+-+---+-.++.+...++..+..++|.++| +++++..-|.+|.
T Consensus 462 ~l~e~~~~l~~~t~~~~~e~~~~ekel~~~~~~~n~~~~--------e~~vaesel~~L~ 513 (1293)
T KOG0996|consen 462 ELDEILDSLKQETEGIREEIEKLEKELMPLLKQVNEARS--------ELDVAESELDILL 513 (1293)
T ss_pred HHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHH
Confidence 344444445444445556666777788888888888888 3445554466553
No 52
>PRK14950 DNA polymerase III subunits gamma and tau; Provisional
Probab=59.94 E-value=68 Score=35.13 Aligned_cols=14 Identities=7% Similarity=0.012 Sum_probs=6.0
Q ss_pred CCCCCCCHHHHHHH
Q psy9560 99 KDTDDMAEDEIGDL 112 (503)
Q Consensus 99 LnIDGLSedELQEk 112 (503)
..+..++.++|...
T Consensus 174 i~f~~l~~~el~~~ 187 (585)
T PRK14950 174 FDFHRHSVADMAAH 187 (585)
T ss_pred eeCCCCCHHHHHHH
Confidence 34444444444433
No 53
>PRK11546 zraP zinc resistance protein; Provisional
Probab=59.74 E-value=22 Score=33.44 Aligned_cols=50 Identities=16% Similarity=0.378 Sum_probs=28.6
Q ss_pred CCH---HHHHHHHHHHHHHHHHHhhh----hhhHHHHHhhh--hhh-HHHHHHHHHHhhC
Q psy9560 104 MAE---DEIGDLCQEYWQRIYNLEAI----KYELDREIMLK--DFE-ITELGKQVNDLRG 153 (503)
Q Consensus 104 LSe---dELQEkCKELHdRIdkLEEE----KYDLE~KVkKQ--DyE-InELniKVnDLRG 153 (503)
|+. +.++.|.++++..+..|-.+ +|+|+.-+... |-+ |+.|...|+|||+
T Consensus 44 LT~EQQa~~q~I~~~f~~~t~~LRqqL~aKr~ELnALl~~~~pD~~kI~aL~kEI~~Lr~ 103 (143)
T PRK11546 44 LTTEQQAAWQKIHNDFYAQTSALRQQLVSKRYEYNALLTANPPDSSKINAVAKEMENLRQ 103 (143)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHH
Confidence 554 45777788888888777665 45555433322 112 5555555555555
No 54
>PF08581 Tup_N: Tup N-terminal; InterPro: IPR013890 The N-terminal region of the Tup protein has been shown to interact with the Ssn6 transcriptional co-repressor []. ; PDB: 3VP9_B 3VP8_B.
Probab=59.70 E-value=34 Score=29.17 Aligned_cols=33 Identities=18% Similarity=0.224 Sum_probs=28.3
Q ss_pred HHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 121 YNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 121 dkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
..+...+=|++.+|..|-.||+.++++|.+|--
T Consensus 21 ~~~k~~~~e~e~ki~~Qi~Em~~ir~~v~eLE~ 53 (79)
T PF08581_consen 21 NSYKHQKDEYEHKINSQIQEMQQIRQKVYELEQ 53 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345558889999999999999999999999965
No 55
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=59.65 E-value=37 Score=27.63 Aligned_cols=27 Identities=26% Similarity=0.393 Sum_probs=13.7
Q ss_pred hHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 129 ELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 129 DLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.|+..|.+|..+|..|+.+|..|..++
T Consensus 22 ~Ln~~v~~Qq~~I~~L~~~l~~L~~rl 48 (69)
T PF04102_consen 22 ELNDVVTEQQRQIDRLQRQLRLLRERL 48 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHT-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555555555555543
No 56
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=59.36 E-value=2.6e+02 Score=31.40 Aligned_cols=8 Identities=25% Similarity=0.634 Sum_probs=2.8
Q ss_pred HHHHHHHH
Q psy9560 143 ELGKQVND 150 (503)
Q Consensus 143 ELniKVnD 150 (503)
.|..++.+
T Consensus 334 ~~~~~l~~ 341 (1179)
T TIGR02168 334 ELAEELAE 341 (1179)
T ss_pred HHHHHHHH
Confidence 33333333
No 57
>KOG0994|consensus
Probab=59.03 E-value=66 Score=39.51 Aligned_cols=36 Identities=22% Similarity=0.251 Sum_probs=22.5
Q ss_pred HHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 117 WQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 117 HdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
.++|+.|+-+.-+-+..+.....+|..|+.||+.+.
T Consensus 1709 l~~l~dLe~~y~~~~~~L~~~~aeL~~Le~r~~~vl 1744 (1758)
T KOG0994|consen 1709 LDRLKDLELEYLRNEQALEDKAAELAGLEKRVESVL 1744 (1758)
T ss_pred HHHHHHHHHHHhhhhHHHHHHHHHhhhHHHHHHHHH
Confidence 345555665555566666666677777777766543
No 58
>cd00520 RRF Ribosome recycling factor (RRF). Ribosome recycling factor dissociates the posttermination complex, composed of the ribosome, deacylated tRNA, and mRNA, after termination of translation. Thus ribosomes are "recycled" and ready for another round of protein synthesis. RRF is believed to bind the ribosome at the A-site in a manner that mimics tRNA, but the specific mechanisms remain unclear. RRF is essential for bacterial growth. It is not necessary for cell growth in archaea or eukaryotes, but is found in mitochondria or chloroplasts of some eukaryotic species.
Probab=58.61 E-value=70 Score=30.31 Aligned_cols=51 Identities=24% Similarity=0.257 Sum_probs=32.0
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHH
Q psy9560 54 GFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEI 109 (503)
Q Consensus 54 gKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedEL 109 (503)
..||.+||+.|- +.++...++-+..-+--|+.++..-- +...-..+++|++
T Consensus 99 P~lT~E~R~~lv----K~~k~~~E~~Kv~iRniR~~~~~~lK-k~~k~~~iseD~~ 149 (179)
T cd00520 99 PPLTEERRKELV----KDAKKIAEEAKVAIRNIRRDANDKIK-KLEKEKEISEDEV 149 (179)
T ss_pred CCCCHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH-HhhccCCCCchHH
Confidence 479999988654 45566666666677777777777544 3333334565543
No 59
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=58.58 E-value=1.7e+02 Score=28.41 Aligned_cols=19 Identities=11% Similarity=-0.041 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHHHhhh
Q psy9560 108 EIGDLCQEYWQRIYNLEAI 126 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEE 126 (503)
.|...+.++.++.+.+++.
T Consensus 159 ~lE~~~~~~~~re~~~e~~ 177 (237)
T PF00261_consen 159 SLEASEEKASEREDEYEEK 177 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HhhhhhhhhhHHHHHHHHH
Confidence 3333333344443333333
No 60
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=57.70 E-value=1.1e+02 Score=36.54 Aligned_cols=69 Identities=10% Similarity=0.056 Sum_probs=34.5
Q ss_pred HHHHHHHHHhhhhcCCCCCCCCCCHH-----HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 82 RRALERKRVIDERCGEPKDTDDMAED-----EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 82 qKeEEKKKiLAERcpPPLnIDGLSed-----ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK 154 (503)
+....+..++.+-. ...+|+|++.+ ++..+...|-. .|....-+|+.....++.+|..+..++++|+..
T Consensus 361 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~e~~~~~~~~q~~L~ei~~~ 434 (1311)
T TIGR00606 361 EHIRARDSLIQSLA-TRLELDGFERGPFSERQIKNFHTLVIE---RQEDEAKTAAQLCADLQSKERLKQEQADEIRDE 434 (1311)
T ss_pred HHHHHHHHHHHHHH-HhcCcCCCCCcccchHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555 34567765533 33333332222 233334445555555555566666666666553
No 61
>PF10498 IFT57: Intra-flagellar transport protein 57 ; InterPro: IPR019530 Eukaryotic cilia and flagella are specialised organelles found at the periphery of cells of diverse organisms. Intra-flagellar transport (IFT) is required for the assembly and maintenance of eukaryotic cilia and flagella, and consists of the bi-directional movement of large protein particles between the base and the distal tip of the organelle. IFT particles contain multiple copies of two distinct protein complexes, A and B, which contain at least 6 and 11 protein subunits. IFT57 is part of complex B but is not, however, required for the core subunits to stay associated []. This protein is known as Huntington-interacting protein-1 in humans.
Probab=57.66 E-value=1.1e+02 Score=32.34 Aligned_cols=73 Identities=7% Similarity=0.160 Sum_probs=48.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 69 RKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKV 148 (503)
|.+-..++. ....+...|++||+.... .|...++.+.+++..+.+.-=.+..-|.....++++++.++
T Consensus 243 L~kl~~~i~-~~lekI~sREk~iN~qle-----------~l~~eYr~~~~~ls~~~~~y~~~s~~V~~~t~~L~~IseeL 310 (359)
T PF10498_consen 243 LDKLQQDIS-KTLEKIESREKYINNQLE-----------PLIQEYRSAQDELSEVQEKYKQASEGVSERTRELAEISEEL 310 (359)
T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHhH-----------HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 444455555 467788999999998662 13334666677777777665566677777777777777776
Q ss_pred HHhhC
Q psy9560 149 NDLRG 153 (503)
Q Consensus 149 nDLRG 153 (503)
..++.
T Consensus 311 e~vK~ 315 (359)
T PF10498_consen 311 EQVKQ 315 (359)
T ss_pred HHHHH
Confidence 66544
No 62
>PF07304 SRA1: Steroid receptor RNA activator (SRA1); InterPro: IPR009917 This entry consists of several hypothetical mammalian steroid receptor RNA activator proteins. The SRA-RNAs encode stable proteins that are widely expressed and upregulated in breast cancer cell lines. SRA-RNA is a steroid receptor co-activator which acts as a functional RNA. This domain is also found at the C terminus of Sec31, a component of the coat protein complex II (COPII, which promotes the formation of transport vesicles from the endoplasmic reticulum (ER). COPII has two main functions, the physical deformation of the endoplasmic reticulum membrane into vesicles and the selection of cargo molecules. ; PDB: 2YRU_A.
Probab=57.61 E-value=7.9 Score=35.99 Aligned_cols=35 Identities=26% Similarity=0.434 Sum_probs=25.8
Q ss_pred CccHHHHHHHHHHhhhcCchhhHHHHHHHHHhhhh
Q psy9560 299 PVSIDDLVEKLMSAVSATPQPLLKSVVKDTLRRQA 333 (503)
Q Consensus 299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (503)
...|.+++.-|...+...++.+-|.+|+|+-||..
T Consensus 41 ~~~i~~~~~~L~~v~~~~~~~~~kr~~~D~~KRL~ 75 (157)
T PF07304_consen 41 EQPIEEVLRELQRVLEACPPSIKKRVVDDIEKRLN 75 (157)
T ss_dssp ---HHHHHHHHHHHHHHHHTTS-HHHHHHHHHHHH
T ss_pred hhhHHHHHHHHHHHHHHcccccchhHHHHHHHHHH
Confidence 34577777778888888888889999999988853
No 63
>PRK00083 frr ribosome recycling factor; Reviewed
Probab=57.33 E-value=68 Score=30.80 Aligned_cols=52 Identities=23% Similarity=0.277 Sum_probs=32.6
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHH
Q psy9560 54 GFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIG 110 (503)
Q Consensus 54 gKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQ 110 (503)
..||.+||+.|- +.++...++-+..-+--|+.++..-- +...-.++|+|+++
T Consensus 103 P~lT~E~R~elv----K~~k~~~E~aKv~iRniRr~~~~~iK-k~~k~~~iseD~~k 154 (185)
T PRK00083 103 PPLTEERRKELV----KQVKKEAEEAKVAIRNIRRDANDKLK-KLEKDKEISEDELK 154 (185)
T ss_pred CCCCHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhcCCCChHHHH
Confidence 469999988754 44555555556677777888777544 33223456776543
No 64
>KOG1655|consensus
Probab=56.97 E-value=20 Score=35.89 Aligned_cols=59 Identities=20% Similarity=0.231 Sum_probs=46.2
Q ss_pred HHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCcc--------ccccccHHHHHHHH
Q psy9560 116 YWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPIL--------KKVSKYENKFAKLQ 174 (503)
Q Consensus 116 LHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaL--------KKVRkSAnm~a~L~ 174 (503)
|.+.|+.|+.----|+.|+.+.|.+|-.++.+|.-.|---.++.| |.-+||+++..+|.
T Consensus 17 L~dai~~v~~r~dSve~KIskLDaeL~k~~~Qi~k~R~gpaq~~~KqrAlrVLkQKK~yE~q~d~L~ 83 (218)
T KOG1655|consen 17 LQDAIDSVNKRSDSVEKKISKLDAELCKYKDQIKKTRPGPAQNALKQRALRVLKQKKMYENQKDSLD 83 (218)
T ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 446677777767778899999999999999999988876666654 45678888777774
No 65
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=56.95 E-value=3.6e+02 Score=31.08 Aligned_cols=46 Identities=33% Similarity=0.480 Sum_probs=37.5
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+..+--+-|| .|+..||.|-+-|...++..+-.|..|...+++||-
T Consensus 535 ~r~e~~e~~r---~r~~~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~ 580 (697)
T PF09726_consen 535 TRQECAESCR---QRRRQLESELKKLRRELKQKEEQIRELESELQELRK 580 (697)
T ss_pred ccchhHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444554 888999999999999999999999999999988887
No 66
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=56.76 E-value=59 Score=34.16 Aligned_cols=27 Identities=26% Similarity=0.452 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHHhh
Q psy9560 110 GDLCQEYWQRIYNLEAIKYELDREIML 136 (503)
Q Consensus 110 QEkCKELHdRIdkLEEEKYDLE~KVkK 136 (503)
.-|++.||.++++|+.+|--|+.++.+
T Consensus 177 E~lvN~L~Kqm~~l~~eKr~Lq~~l~~ 203 (310)
T PF09755_consen 177 EALVNRLWKQMDKLEAEKRRLQEKLEQ 203 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 447777777777777777777776653
No 67
>KOG4466|consensus
Probab=56.11 E-value=96 Score=32.47 Aligned_cols=139 Identities=19% Similarity=0.220 Sum_probs=73.5
Q ss_pred HHHHHHHHHHHHHHhHHHHhhhh-----ccCCCCCHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHhhhhcC
Q psy9560 28 KLEELERKRAEVRRRMEEASGAR-----RKKGFMTPERKKKLRL------LLRKKAAEELKREQERRALERKRVIDERCG 96 (503)
Q Consensus 28 KqeErERKkAEvRKRlEEA~KkK-----AKKgKMT~eRKkkLKS------LLLqKAaEELEKEKEqKeEEKKKiLAERcp 96 (503)
|.+.=+-|.+..+++|++-..-. ..-++|-.+|+.-|+. +++..+..+-+.|.+. .|+- +..+|
T Consensus 36 keq~yk~kLa~Lq~~Leel~~g~~~eYl~~~~~L~~~~kerl~~aely~e~~~e~v~~eYe~E~~a---Ak~e-~E~~~- 110 (291)
T KOG4466|consen 36 KEQMYKDKLAQLQAQLEELGQGTAPEYLKRVKKLDESRKERLRVAELYREYCVERVEREYECEIKA---AKKE-YESKK- 110 (291)
T ss_pred HHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHH-HHHHH-
Confidence 44555667789999999987654 2345566666655443 3444444444333221 1211 23334
Q ss_pred CCCCCCCCCHHHHH-HHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCC--ccccccccH-HHH-H
Q psy9560 97 EPKDTDDMAEDEIG-DLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRP--ILKKVSKYE-NKF-A 171 (503)
Q Consensus 97 PPLnIDGLSedELQ-EkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKP--aLKKVRkSA-nm~-a 171 (503)
.+ |+ .|.-+|-++|.+|+++|+-|+..-.. .++..|-.| ++|.-|--| .-+++++|+ +.+ -
T Consensus 111 ~l----------Lke~l~seleeKkrkieeeR~smDlts~~--~e~~~l~~r--k~rrd~~~p~k~r~~r~~sa~~~~~y 176 (291)
T KOG4466|consen 111 KL----------LKENLISELEEKKRKIEEERLSMDLTSDS--MESKPLYTR--KLRRDPNDPEKGRDKRNKSAPDQLVY 176 (291)
T ss_pred HH----------HHHHHHHHHHHHHHHHHHHHhhhhhcccc--cccchHHhh--hcccCCCCcccccccccCCChHHHHH
Confidence 11 11 24567888999999999887765432 344554444 444444333 235666666 222 3
Q ss_pred HHHhhhhccchhhh
Q psy9560 172 KLQKKAAEFNFRSQ 185 (503)
Q Consensus 172 ~L~KhkvsmDfRAN 185 (503)
||+.....-|+|.|
T Consensus 177 ~L~d~~i~eD~~~i 190 (291)
T KOG4466|consen 177 QLQDLNILEDLRTI 190 (291)
T ss_pred hhhhhhHHHHHhhh
Confidence 44444334444443
No 68
>KOG2507|consensus
Probab=56.05 E-value=35 Score=37.58 Aligned_cols=64 Identities=23% Similarity=0.297 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHHhHHHHhhhh-ccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy9560 29 LEELERKRAEVRRRMEEASGAR-RKKGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDER 94 (503)
Q Consensus 29 qeErERKkAEvRKRlEEA~KkK-AKKgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAER 94 (503)
|.-.|++++|++.-.++.-+.+ ++|--++..++. +.-.+++..+..++|+.+....|+|||++-
T Consensus 209 qkr~Ek~kee~~hekeee~~rr~agkE~~~~~a~~--~ee~tkrm~eqrkrer~e~~aareRI~aqi 273 (506)
T KOG2507|consen 209 QKRVEKTKEEVEHEKEEEPLRRQAGKEPIRPTAPG--TEENTKRMREQRKRERKEGTAARERILAQI 273 (506)
T ss_pred ccchhhhHHHHHHHhhhhhHhhhhccccccccccc--chHHHHHHHHHHHHHHHhhHHHHHHHHHHh
Confidence 3344555555554444444333 445555555443 334567788888889999999999999863
No 69
>PF01086 Clathrin_lg_ch: Clathrin light chain; InterPro: IPR000996 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents clathrin light chains, which are more divergent in sequence than the heavy chains []. In higher eukaryotes, two genes encode distinct but related light chains, each of which can yield two separate forms via alternative splicing. In yeast there is a single light chain whose sequence is only distantly related to that of higher eukaryotes. Clathrin light chains have a conserved acidic N-terminal domain, a central coiled-coil domain and a conserved C-terminal domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030130 clathrin coat of trans-Golgi network vesicle, 0030132 clathrin coat of coated pit; PDB: 3LVG_E 3LVH_D.
Probab=55.65 E-value=24 Score=34.43 Aligned_cols=42 Identities=14% Similarity=0.306 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCC
Q psy9560 113 CQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRP 158 (503)
Q Consensus 113 CKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKP 158 (503)
..-.|+||++|= ||+.+..+..+++.-++.=+..|+|+-.-|
T Consensus 182 ~~t~WerV~~Li----d~~~~~~~~~kD~sRmR~iLl~LK~~~~~P 223 (225)
T PF01086_consen 182 PGTEWERVAKLI----DFNPKSSKSGKDVSRMREILLKLKGDENAP 223 (225)
T ss_dssp ----------------------------------------------
T ss_pred CCCcHHHHHHHh----CCCCCCCCCCCcHHHHHHHHHHhhhhccCC
Confidence 456799999886 777777777788888887777887764433
No 70
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=55.64 E-value=64 Score=28.82 Aligned_cols=82 Identities=18% Similarity=0.223 Sum_probs=41.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhh-----------hhHHHHHhhhh
Q psy9560 70 KKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIK-----------YELDREIMLKD 138 (503)
Q Consensus 70 qKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEK-----------YDLE~KVkKQD 138 (503)
+.+.++|+.....-.+.-.+|-.|-+-.--++. ....|++-+.++-..|..|..+. =.|+..-..-.
T Consensus 27 ~~~~~dl~~q~~~a~~Aq~~YE~El~~Ha~~~~--~L~~lr~e~~~~~~~~~~l~~~~~~a~~~l~~~e~sw~~qk~~le 104 (132)
T PF07926_consen 27 QSLREDLESQAKIAQEAQQKYERELVKHAEDIK--ELQQLREELQELQQEINELKAEAESAKAELEESEASWEEQKEQLE 104 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 445555555555555555566554441000011 11233333444444444444443 33444445567
Q ss_pred hhHHHHHHHHHHhhC
Q psy9560 139 FEITELGKQVNDLRG 153 (503)
Q Consensus 139 yEInELniKVnDLRG 153 (503)
.+|.+++.|+.||..
T Consensus 105 ~e~~~~~~r~~dL~~ 119 (132)
T PF07926_consen 105 KELSELEQRIEDLNE 119 (132)
T ss_pred HHHHHHHHHHHHHHH
Confidence 789999999999863
No 71
>PF01576 Myosin_tail_1: Myosin tail; InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=55.60 E-value=3.9 Score=46.73 Aligned_cols=109 Identities=19% Similarity=0.251 Sum_probs=0.0
Q ss_pred HHhHHHHhhhhccCCCCCHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCH-----
Q psy9560 40 RRRMEEASGARRKKGFMTPERKKKLRLLL--------RKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAE----- 106 (503)
Q Consensus 40 RKRlEEA~KkKAKKgKMT~eRKkkLKSLL--------LqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSe----- 106 (503)
|+-||+.... --..|+.-||+|...+- |++.+..|++++.....+-..+.. +|+.+..
T Consensus 108 rr~LEe~~~~--~e~~~~~lrkkh~~~~~eL~eqle~lqk~k~~lEK~k~~l~~e~~dL~~-------~l~~~~k~k~~~ 178 (859)
T PF01576_consen 108 RRDLEEANLQ--HEATLAELRKKHQDAVAELNEQLEQLQKQKAKLEKEKSQLEAELDDLQA-------QLDSLQKAKQEA 178 (859)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHH--HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH-------HHHHHHHHHHHH
Confidence 5556554421 12567777887765531 455555555444333333222211 2222221
Q ss_pred ----HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCC
Q psy9560 107 ----DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVR 157 (503)
Q Consensus 107 ----dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKK 157 (503)
-.|..-+.+|..+++.++..+=||.....+...||.+|+.++.|+..++..
T Consensus 179 Ek~~K~lE~qL~El~~klee~er~~~el~~~k~kL~~E~~eL~~qLee~e~~~~~ 233 (859)
T PF01576_consen 179 EKKRKQLEAQLNELQAKLEESERQRNELTEQKAKLQSENSELTRQLEEAESQLSQ 233 (859)
T ss_dssp -------------------------------------------------------
T ss_pred HhHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 124556678888888999999999999999999999999999999886554
No 72
>KOG1318|consensus
Probab=55.60 E-value=27 Score=37.74 Aligned_cols=23 Identities=9% Similarity=0.075 Sum_probs=12.9
Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH
Q psy9560 53 KGFMTPERKKKLRLLLRKKAAEELK 77 (503)
Q Consensus 53 KgKMT~eRKkkLKSLLLqKAaEELE 77 (503)
|+.+..+|.- |+-+|++++.-..
T Consensus 261 ~~~~~~~~~n--KgtILk~s~dYIr 283 (411)
T KOG1318|consen 261 KCNSEDMKSN--KGTILKASCDYIR 283 (411)
T ss_pred CCCcchhhcc--cchhhHHHHHHHH
Confidence 4444444433 6667777775544
No 73
>PF10044 Ret_tiss: Retinal tissue protein; InterPro: IPR018737 Rtp is a family of proteins of approximately 112 amino acids in length which is conserved from nematodes to humans. The proposed tertiary structure is of almost entirely alpha helix interrupted only by loops located at proline residues. Three sites in the protein sequence reveal two types of possible post-translation modification. A serine residue, at position 41, is a candidate for protein kinase C phosphorylation. Glycine residues at position 69 and 91 are probable sites for acetylation by covalent amide linkage of myristate via N-myristoyl transferase. Rtp is differentially expressed in the trout retina between parr and smolt developmental stages (smoltification). It is likely to be a house-keeping protein [].
Probab=54.94 E-value=11 Score=33.31 Aligned_cols=25 Identities=12% Similarity=0.291 Sum_probs=21.1
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHh
Q psy9560 100 DTDDMAEDEIGDLCQEYWQRIYNLE 124 (503)
Q Consensus 100 nIDGLSedELQEkCKELHdRIdkLE 124 (503)
.++.|+.++|.+++|+||+++++|=
T Consensus 54 eLs~Lt~~~L~~~Ik~L~~~aYqLG 78 (95)
T PF10044_consen 54 ELSSLTPDQLIEKIKKLQDEAYQLG 78 (95)
T ss_pred HHHcCCHHHHHHHHHHHHHHHHHHh
Confidence 4567789999999999999998764
No 74
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=53.83 E-value=63 Score=32.89 Aligned_cols=8 Identities=25% Similarity=0.455 Sum_probs=4.3
Q ss_pred CCCCCHHH
Q psy9560 53 KGFMTPER 60 (503)
Q Consensus 53 KgKMT~eR 60 (503)
..+||++.
T Consensus 299 ~~~lspee 306 (321)
T PF07946_consen 299 LSKLSPEE 306 (321)
T ss_pred HhcCCHHH
Confidence 45566654
No 75
>PF14335 DUF4391: Domain of unknown function (DUF4391)
Probab=53.46 E-value=24 Score=33.88 Aligned_cols=71 Identities=21% Similarity=0.261 Sum_probs=48.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhh-------hHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHHHHhhhh
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKY-------ELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAKLQKKAA 178 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKY-------DLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~L~Khkv 178 (503)
...|..+...|.++|..+..... ++.+.+.+. .+|..|..+|..|+++.+|= ..-..
T Consensus 143 ~~~l~~lY~~l~~~i~~~~~~~~~g~~~~~~~~~~~~~~-~~i~~L~kei~~L~~~~~kE---------------kq~nr 206 (221)
T PF14335_consen 143 GLNLDALYESLVNQIIALNAAPNTGEFEKTSLWERIERL-EQIEKLEKEIAKLKKKIKKE---------------KQFNR 206 (221)
T ss_pred cccHHHHHHHHHHHHhcchhhhhcCccccCCHHHHHHHH-HHHHHHHHHHHHHHHHHHhc---------------cCHHH
Confidence 34466677888888888887776 778877766 57899999999888854330 01124
Q ss_pred ccchhhhccccccc
Q psy9560 179 EFNFRSQLKAVKKR 192 (503)
Q Consensus 179 smDfRANLKqVKKE 192 (503)
+|.+...||+.+++
T Consensus 207 kveln~elk~l~~e 220 (221)
T PF14335_consen 207 KVELNTELKKLKKE 220 (221)
T ss_pred HHHHHHHHHHHHhc
Confidence 56666667766653
No 76
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=53.38 E-value=33 Score=31.47 Aligned_cols=47 Identities=15% Similarity=0.135 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+.|...+|+|-++...+|.+-=+|..|+.....+|..|..++.+++.
T Consensus 17 e~~e~~~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~ 63 (143)
T PF12718_consen 17 EELEAKVKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKE 63 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566666666666666666666666666666666666666666554
No 77
>TIGR00496 frr ribosome recycling factor. This model finds only eubacterial proteins. Mitochondrial and/or chloroplast forms might be expected but are not currently known. This protein was previously called ribosome releasing factor. By releasing ribosomes from mRNA at the end of protein biosynthesis, it prevents inappropriate translation from 3-prime regions of the mRNA and frees the ribosome for new rounds of translation. EGAD|53116|YHR038W is part of the frr superfamily.
Probab=53.24 E-value=89 Score=29.80 Aligned_cols=52 Identities=23% Similarity=0.227 Sum_probs=33.1
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHH
Q psy9560 54 GFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIG 110 (503)
Q Consensus 54 gKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQ 110 (503)
..||.+||+.| .+.++...++-+..-+--|+.++..-- +...-.++|+|+++
T Consensus 94 P~lT~E~Rkel----vK~~k~~~E~aKv~iRniRr~~~~~iK-k~~k~~~iseD~~k 145 (176)
T TIGR00496 94 PPLTEERRKEL----VKHAKKIAEQAKVAVRNVRRDANDKVK-KLEKDKEISEDEER 145 (176)
T ss_pred CCCCHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhcCCCChhHHH
Confidence 46899998764 445566666666677778888877644 33223456666543
No 78
>KOG3091|consensus
Probab=52.41 E-value=81 Score=35.18 Aligned_cols=52 Identities=17% Similarity=0.145 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRP 158 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKP 158 (503)
.+++.+-.+|-+||-+|--.-.-|..+=.-.+.+=++|+.|..+|.+-...|
T Consensus 379 ~~~k~r~~~Ls~RiLRv~ikqeilr~~G~~L~~~EE~Lr~Kldtll~~ln~P 430 (508)
T KOG3091|consen 379 EEAKNRHVELSHRILRVMIKQEILRKRGYALTPDEEELRAKLDTLLAQLNAP 430 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCccHHHHHHHHHHHHHHhcCh
Confidence 5566666777777766643332222222233445567888888888877777
No 79
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=51.91 E-value=2.2e+02 Score=31.70 Aligned_cols=44 Identities=20% Similarity=0.274 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 111 DLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 111 EkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK 154 (503)
....+|..++..++.+..++..+..-+.-.|..|+.+|.+|+.+
T Consensus 288 ~~i~~L~~~l~~l~~~~~~l~~~y~~~hP~v~~l~~qi~~l~~~ 331 (754)
T TIGR01005 288 DLIQRLRERQAELRATIADLSTTMLANHPRVVAAKSSLADLDAQ 331 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHHHH
Confidence 46677777777788777777776666555666666666665553
No 80
>PRK11020 hypothetical protein; Provisional
Probab=51.66 E-value=36 Score=31.46 Aligned_cols=44 Identities=9% Similarity=0.073 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHHHHhhh------hhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 109 IGDLCQEYWQRIYNLEAI------KYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEE------KYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|+.=.+.||++||.+.-. |=| ..++.+...||+.|+.+|+.|+|
T Consensus 3 ~K~Eiq~L~drLD~~~~Klaaa~~rgd-~~~i~qf~~E~~~l~k~I~~lk~ 52 (118)
T PRK11020 3 EKNEIKRLSDRLDAIRHKLAAASLRGD-AEKYAQFEKEKATLEAEIARLKE 52 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCC-HHHHHHHHHHHHHHHHHHHHHHH
Confidence 344457788888776432 222 23567788899999999999988
No 81
>PF10046 BLOC1_2: Biogenesis of lysosome-related organelles complex-1 subunit 2 ; InterPro: IPR019269 This entry represents a family of proteins that play a role in cellular proliferation, as well as in the biogenesis of specialised organelles of the endosomal-lysosomal system [].
Probab=51.48 E-value=35 Score=29.38 Aligned_cols=48 Identities=13% Similarity=0.299 Sum_probs=24.6
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
+..+|..++.+|...+.+|+ +|+..|.+-..=+..|-.-+..|-.|||
T Consensus 50 ~~~~l~~k~~~l~~~l~~Id----~Ie~~V~~LE~~v~~LD~ysk~LE~k~k 97 (99)
T PF10046_consen 50 NLEDLNQKYEELQPYLQQID----QIEEQVTELEQTVYELDEYSKELESKFK 97 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34666666666655555555 3444444444444444444444444444
No 82
>KOG0681|consensus
Probab=51.33 E-value=30 Score=39.10 Aligned_cols=126 Identities=17% Similarity=0.251 Sum_probs=0.0
Q ss_pred HHHHHHHHHhHHHHhhhhccCCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhcCCCCCC-CCCCHHH
Q psy9560 33 ERKRAEVRRRMEEASGARRKKGFMTPE--RKKKLRLLLRKKAAEELKREQERRALE-RKRVIDERCGEPKDT-DDMAEDE 108 (503)
Q Consensus 33 ERKkAEvRKRlEEA~KkKAKKgKMT~e--RKkkLKSLLLqKAaEELEKEKEqKeEE-KKKiLAERcpPPLnI-DGLSedE 108 (503)
++|++ |||+++.+.| +..++-.. ||---..|--.+ +.+.+++ +-- +++.. +++|| .-|+.++
T Consensus 281 e~kqe---kRlq~~a~lk-rv~k~~~re~~redeqql~~~~--------kaq~e~e~~~D-~~q~~-~ll~v~~eL~~d~ 346 (645)
T KOG0681|consen 281 EKKQE---KRLQEQAALK-RVEKINARENRREDEQQLESYN--------KAQGEQESNLD-LEQKF-PLLNVPAELDEDQ 346 (645)
T ss_pred HHHHH---HHHHHHHHHh-hHHHHHHHHhhhhhHHHHHHHH--------HhhhchhcCcc-Hhhhc-hhhcchhhhCHHH
Q ss_pred HHHHHHHHHHHHHHHh-----hhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHHHHhhhhccchh
Q psy9560 109 IGDLCQEYWQRIYNLE-----AIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAKLQKKAAEFNFR 183 (503)
Q Consensus 109 LQEkCKELHdRIdkLE-----EEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~L~KhkvsmDfR 183 (503)
|+++-++.+-+....- .+| ..|.+.+....--..|..=|+|+|+|.+ ..|.+.+.+.-||
T Consensus 347 lk~k~~qr~lkas~dar~rar~ek-e~Er~~k~~~~r~~~~~swl~e~r~k~~--------------~ller~~~kk~lk 411 (645)
T KOG0681|consen 347 LKEKKKQRILKASTDARLRARVEK-ELERLNKLEEEREENLISWLEELREKLE--------------KLLERISQKKRLK 411 (645)
T ss_pred HHHHHHHHHHHhhhhhhccccccc-hHHHhhcccccccccHHHHHHHHHHHHH--------------HHHHHHHHHHHHH
Q ss_pred hhcc
Q psy9560 184 SQLK 187 (503)
Q Consensus 184 ANLK 187 (503)
.+||
T Consensus 412 ~e~~ 415 (645)
T KOG0681|consen 412 QELK 415 (645)
T ss_pred Hhhh
No 83
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=51.18 E-value=2.7e+02 Score=27.89 Aligned_cols=35 Identities=23% Similarity=0.198 Sum_probs=18.0
Q ss_pred HHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 119 RIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 119 RIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
-|.+++++|=.+...+.+.--|+.-|+..||.+|.
T Consensus 68 iIkqa~~er~~~~~~i~r~~eey~~Lk~~in~~R~ 102 (230)
T PF10146_consen 68 IIKQAESERNKRQEKIQRLYEEYKPLKDEINELRK 102 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444445555554555555555555554
No 84
>KOG0163|consensus
Probab=51.16 E-value=62 Score=38.28 Aligned_cols=19 Identities=42% Similarity=0.560 Sum_probs=10.1
Q ss_pred HHHHHHHHHHhHHHHhhhh
Q psy9560 32 LERKRAEVRRRMEEASGAR 50 (503)
Q Consensus 32 rERKkAEvRKRlEEA~KkK 50 (503)
|.|+++|..+|+|+.++++
T Consensus 937 R~rrEaeek~rre~ee~k~ 955 (1259)
T KOG0163|consen 937 RKRREAEEKRRREEEEKKR 955 (1259)
T ss_pred HHhhhhhHHHHHHHHHHHH
Confidence 3344455555666666555
No 85
>KOG4196|consensus
Probab=51.07 E-value=35 Score=32.15 Aligned_cols=34 Identities=24% Similarity=0.362 Sum_probs=16.9
Q ss_pred HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 122 NLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 122 kLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.||.+|--|...|.+...|+..|...+.-++.||
T Consensus 78 eLE~~k~~L~qqv~~L~~e~s~~~~E~da~k~k~ 111 (135)
T KOG4196|consen 78 ELEKEKAELQQQVEKLKEENSRLRRELDAYKSKY 111 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444555555555555555555555544444433
No 86
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=51.02 E-value=3.2e+02 Score=30.89 Aligned_cols=44 Identities=18% Similarity=0.194 Sum_probs=24.4
Q ss_pred HHHHHHHHHhhhhcCCCC-CCCCCCHHHHHHHHHHHHHHHHHHhhh
Q psy9560 82 RRALERKRVIDERCGEPK-DTDDMAEDEIGDLCQEYWQRIYNLEAI 126 (503)
Q Consensus 82 qKeEEKKKiLAERcpPPL-nIDGLSedELQEkCKELHdRIdkLEEE 126 (503)
..+-++..-|.+|.--+| |-+ -|..+|+.+|..-.+++..|..+
T Consensus 372 ~~~le~~~~l~~k~~~lL~d~e-~ni~kL~~~v~~s~~rl~~L~~q 416 (594)
T PF05667_consen 372 NEELEEELKLKKKTVELLPDAE-ENIAKLQALVEASEQRLVELAQQ 416 (594)
T ss_pred HHHHHHHHHHHHHHHHHhcCcH-HHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444442222 222 36788988888777777666544
No 87
>KOG2891|consensus
Probab=50.78 E-value=3.5e+02 Score=29.07 Aligned_cols=11 Identities=45% Similarity=0.595 Sum_probs=5.0
Q ss_pred HHHHHhHHHHh
Q psy9560 37 AEVRRRMEEAS 47 (503)
Q Consensus 37 AEvRKRlEEA~ 47 (503)
+|..||++|+.
T Consensus 341 eeekkr~eeae 351 (445)
T KOG2891|consen 341 EEEKKREEEAE 351 (445)
T ss_pred HHHHHHHHHHH
Confidence 34444444444
No 88
>PF11544 Spc42p: Spindle pole body component Spc42p; InterPro: IPR021611 Spc42p is a 42kDa component of the S.cerevisiae spindle body that localises to the electron dense central region of the SPB. Spc42p is a phosphoprotein which forms a polymeric layer at the periphery of the SPB central plaque. This functions during SPB duplication and also facilitates the attachment of the SPB to the nuclear membrane. ; PDB: 2Q6Q_B.
Probab=50.56 E-value=7.3 Score=33.44 Aligned_cols=29 Identities=38% Similarity=0.472 Sum_probs=24.0
Q ss_pred hhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 127 KYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 127 KYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
-=||-.++..++.||..||+-|.+||+|.
T Consensus 7 Nk~L~~kL~~K~eEI~rLn~lv~sLR~KL 35 (76)
T PF11544_consen 7 NKELKKKLNDKQEEIDRLNILVGSLRGKL 35 (76)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34666678888999999999999999943
No 89
>PF07058 Myosin_HC-like: Myosin II heavy chain-like; InterPro: IPR009768 This family represents a conserved region within a number of myosin II heavy chain-like proteins that seem to be specific to Arabidopsis thaliana.
Probab=50.40 E-value=30 Score=36.66 Aligned_cols=68 Identities=19% Similarity=0.254 Sum_probs=50.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC--CCCccccccccHHHHHHH
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF--VRPILKKVSKYENKFAKL 173 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF--KKPaLKKVRkSAnm~a~L 173 (503)
.++||..-.||..+|..+.+|.--|+...+++-.||+-|.+-|.+|---. -.=+-.-||-|-.++..|
T Consensus 2 Vdd~QN~N~EL~kQiEIcqEENkiLdK~hRQKV~EVEKLsqTi~ELEEaiLagGaaaNavrdYqrq~~el 71 (351)
T PF07058_consen 2 VDDVQNQNQELMKQIEICQEENKILDKMHRQKVLEVEKLSQTIRELEEAILAGGAAANAVRDYQRQVQEL 71 (351)
T ss_pred chhhhhhcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHH
Confidence 46899999999999999999999999999999999999999998883210 012334455555555444
No 90
>KOG4460|consensus
Probab=50.28 E-value=51 Score=37.53 Aligned_cols=71 Identities=13% Similarity=0.223 Sum_probs=30.3
Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHH----HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCC
Q psy9560 83 RALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIY----NLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVR 157 (503)
Q Consensus 83 KeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRId----kLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKK 157 (503)
+.+++.-.|++|+-. +.- +.+.|...|+.|.++.. .|-.---|+-..|..-..++.-|..-|.-++.+|+|
T Consensus 613 ~i~e~a~~La~R~ee---a~e-~qe~L~~~~~~L~~~~~~~lp~l~~AErdFk~Elq~~~~~~~~L~~~iET~~~~~~K 687 (741)
T KOG4460|consen 613 SLREMAERLADRYEE---AKE-KQEDLMNRMKKLLHSFHSELPVLSDAERDFKKELQLIPDQLRHLGNAIETVTMKKDK 687 (741)
T ss_pred HHHHHHHHHHHHHHH---HHH-hHHHHHHHHHHHHhcccccCCcchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 344444455555521 111 44555555555544432 111122233333444444444444444444444444
No 91
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=50.06 E-value=34 Score=35.88 Aligned_cols=42 Identities=19% Similarity=0.406 Sum_probs=39.0
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhh-hhhhHHHHHHHHHHhhC
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIML-KDFEITELGKQVNDLRG 153 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkK-QDyEInELniKVnDLRG 153 (503)
+.+.|-.+|.+|-.+|-||+..+.+ |.|.|+-|..+|.+|+.
T Consensus 107 ltn~L~rkl~qLr~EK~~lE~~Le~EqE~~V~kL~k~i~~Le~ 149 (310)
T PF09755_consen 107 LTNDLSRKLNQLRQEKVELENQLEQEQEYLVNKLQKKIERLEK 149 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH
Confidence 6788999999999999999999998 88999999999999965
No 92
>PRK04863 mukB cell division protein MukB; Provisional
Probab=49.88 E-value=6.2e+02 Score=31.78 Aligned_cols=67 Identities=21% Similarity=0.257 Sum_probs=54.5
Q ss_pred HHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 88 KRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 88 KKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.+.|.+-|. -+...--+.+.|..+..++...|..|+++.-.+-.+.......+++|+++|..|...+
T Consensus 536 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~qL~~~i~~l~~~a 602 (1486)
T PRK04863 536 ERLLAEFCK-RLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRLAARA 602 (1486)
T ss_pred HHHHHHHHH-HhCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 455666653 2233323689999999999999999999999999999999999999999999999944
No 93
>PF00804 Syntaxin: Syntaxin; InterPro: IPR006011 Syntaxins A and B are nervous system-specific proteins implicated in the docking of synaptic vesicles with the presynaptic plasma membrane. Syntaxins are a family of receptors for intracellular transport vesicles. Each target membrane may be identified by a specific member of the syntaxin family []. Members of the syntaxin family [, ] have a size ranging from 30 Kd to 40 Kd; a C-terminal extremity which is highly hydrophobic and anchors the protein on the cytoplasmic surface of cellular membranes; a central, well conserved region, which seems to be in a coiled-coil conformation. ; GO: 0016020 membrane; PDB: 1S94_B 1EZ3_A 3C98_B 1BR0_A 1FIO_A 2XHE_B.
Probab=49.86 E-value=19 Score=28.70 Aligned_cols=18 Identities=28% Similarity=0.267 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHHHhhh
Q psy9560 109 IGDLCQEYWQRIYNLEAI 126 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEE 126 (503)
+..+++.++.+|..|+..
T Consensus 54 i~~~~~~~~~~lk~l~~~ 71 (103)
T PF00804_consen 54 IKQLFQKIKKRLKQLSKD 71 (103)
T ss_dssp HHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 444555555666555554
No 94
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=49.29 E-value=5.6 Score=44.07 Aligned_cols=85 Identities=20% Similarity=0.271 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC------------CCccccccccHHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV------------RPILKKVSKYENKFAKLQ 174 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK------------KPaLKKVRkSAnm~a~L~ 174 (503)
.+|+..++.|.+.+..+++.+-|+..++...+.+|.+|++++.+|...-. |-.-.||.++++.+..++
T Consensus 242 ~~l~~ql~~L~~el~~~e~~~~d~~~~~e~le~ei~~L~q~~~eL~~~A~~a~~LrDElD~lR~~a~r~~klE~~ve~YK 321 (713)
T PF05622_consen 242 ADLRAQLRRLREELERLEEQRDDLKIELEELEKEIDELRQENEELQAEAREARALRDELDELREKADRADKLENEVEKYK 321 (713)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 56888889999999999999999999999999999999999999876321 122224444443333332
Q ss_pred -hhhhccchhhhcccccc
Q psy9560 175 -KKAAEFNFRSQLKAVKK 191 (503)
Q Consensus 175 -KhkvsmDfRANLKqVKK 191 (503)
|....-|||..+|..+-
T Consensus 322 kKLed~~~lk~qvk~Lee 339 (713)
T PF05622_consen 322 KKLEDLEDLKRQVKELEE 339 (713)
T ss_dssp ------------------
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 22355677877776554
No 95
>PF02841 GBP_C: Guanylate-binding protein, C-terminal domain; InterPro: IPR003191 Guanylate-binding protein is a GTPase that is induced by interferon (IFN)-gamma. GTPases induced by IFN-gamma are key to the protective immunity against microbial and viral pathogens. These GTPases are classified into three groups: the small 47-kd GTPases, the Mx proteins, and the large 65- to 67-kd GTPases. Guanylate-binding proteins (GBP) fall into the last class. In humans, there are seven GBPs (hGBP1-7) []. Structurally, hGBP1 consists of two domains: a compact globular N-terminal domain harbouring the GTPase function (IPR015894 from INTERPRO), and an alpha-helical finger-like C-terminal domain. Human GBP1 is secreted from cells without the need of a leader peptide, and has been shown to exhibit antiviral activity against Vesicular stomatitis virus and Encephalomyocarditis virus, as well as being able to regulate the inhibition of proliferation and invasion of endothelial cells in response to IFN-gamma [].; GO: 0003924 GTPase activity, 0005525 GTP binding; PDB: 1DG3_A 2D4H_A 2B8W_B 2B92_A 2BC9_A 1F5N_A.
Probab=48.93 E-value=2.8e+02 Score=27.73 Aligned_cols=44 Identities=20% Similarity=0.177 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHH--hhhhhhHHHHHHHHHHhh
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREI--MLKDFEITELGKQVNDLR 152 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KV--kKQDyEInELniKVnDLR 152 (503)
|..--+.|.+....+...+.-...++ .....++..|+..|++|+
T Consensus 252 me~e~~~~~~e~e~~l~~k~~eq~~~l~e~~~~~~~~l~~ei~~L~ 297 (297)
T PF02841_consen 252 MEEEREQLLQEQERLLEQKLQEQEELLKEGFQEEAEKLQKEIQDLQ 297 (297)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 33333344444444433333333322 334566777777787774
No 96
>KOG0250|consensus
Probab=48.19 E-value=2.1e+02 Score=34.69 Aligned_cols=88 Identities=11% Similarity=0.131 Sum_probs=62.4
Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCC
Q psy9560 79 EQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRP 158 (503)
Q Consensus 79 EKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKP 158 (503)
.+..+.+.|-+.|...+- .+.+ -...|++..++++..+...++++|-++-++....+-|......+++|+. -|+=
T Consensus 394 ~~~~e~e~k~~~L~~eve---k~e~-~~~~L~~e~~~~~~~~~~~~ee~~~i~~~i~~l~k~i~~~~~~l~~lk~-~k~d 468 (1074)
T KOG0250|consen 394 SELEERENKLEQLKKEVE---KLEE-QINSLREELNEVKEKAKEEEEEKEHIEGEILQLRKKIENISEELKDLKK-TKTD 468 (1074)
T ss_pred hhHHHHHHHHHHHHHHHH---HHHH-HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cccc
Confidence 345667777788887773 3343 3467888899999999999999999988888888888888888888866 4443
Q ss_pred ccccccccHHHHHHH
Q psy9560 159 ILKKVSKYENKFAKL 173 (503)
Q Consensus 159 aLKKVRkSAnm~a~L 173 (503)
.|.+.. .+|.+.|
T Consensus 469 kvs~FG--~~m~~lL 481 (1074)
T KOG0250|consen 469 KVSAFG--PNMPQLL 481 (1074)
T ss_pred hhhhcc--hhhHHHH
Confidence 333333 1555555
No 97
>KOG2751|consensus
Probab=47.46 E-value=78 Score=34.78 Aligned_cols=89 Identities=18% Similarity=0.192 Sum_probs=65.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 69 RKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKV 148 (503)
+.++.+.|+.+.++..++.++.+.++- ++++ ...+++....+|++.-.+.=.+-|++-+-+-..+.+|..|+.++
T Consensus 181 ~~~e~~~l~~eE~~L~q~lk~le~~~~----~l~~-~l~e~~~~~~~~~e~~~~~~~ey~~~~~q~~~~~del~Sle~q~ 255 (447)
T KOG2751|consen 181 LLKELKNLKEEEERLLQQLEELEKEEA----ELDH-QLKELEFKAERLNEEEDQYWREYNNFQRQLIEHQDELDSLEAQI 255 (447)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccchHHHHHHHH
Confidence 345667777777788888888888766 4565 67889999999999988888888888888888888888877764
Q ss_pred HHhhCCCCCCccccccccH
Q psy9560 149 NDLRGKFVRPILKKVSKYE 167 (503)
Q Consensus 149 nDLRGKFKKPaLKKVRkSA 167 (503)
. +..+-|+|.++.-
T Consensus 256 ~-----~s~~qldkL~ktN 269 (447)
T KOG2751|consen 256 E-----YSQAQLDKLRKTN 269 (447)
T ss_pred H-----HHHHHHHHHHhhh
Confidence 2 2445566655554
No 98
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=47.38 E-value=4.5e+02 Score=31.77 Aligned_cols=52 Identities=15% Similarity=0.167 Sum_probs=42.1
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 100 DTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 100 nIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
.+++ +.++|+++...++.++...+.+.=+++..+.....++..|+.+.++|.
T Consensus 295 ~l~~-s~eEL~~ll~~f~~~~~e~~~~~~~le~e~~~l~~el~~l~~~~~~l~ 346 (1311)
T TIGR00606 295 VFQG-TDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELL 346 (1311)
T ss_pred cCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3555 889999999999998888888888888888888888888887766653
No 99
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=47.29 E-value=62 Score=27.68 Aligned_cols=46 Identities=17% Similarity=0.181 Sum_probs=35.3
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
..+-+.+..|.++|..|+.+.=.|+..+.....++++|..+|..|+
T Consensus 59 ~~~~ea~~~Le~~~e~le~~i~~l~~~~~~l~~~~~elk~~l~~~~ 104 (105)
T cd00632 59 QEKEEARTELKERLETIELRIKRLERQEEDLQEKLKELQEKIQQAQ 104 (105)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3456677888888888888877777777777778888888777765
No 100
>KOG0996|consensus
Probab=46.93 E-value=3e+02 Score=34.04 Aligned_cols=42 Identities=19% Similarity=0.113 Sum_probs=29.1
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhH
Q psy9560 100 DTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEI 141 (503)
Q Consensus 100 nIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEI 141 (503)
.|.|...+--++++++++++|++|+.+.--+...++.++..|
T Consensus 896 ~i~~e~~q~qk~kv~~~~~~~~~l~~~i~k~~~~i~~s~~~i 937 (1293)
T KOG0996|consen 896 EIGGEKVQAQKDKVEKINEQLDKLEADIAKLTVAIKTSDRNI 937 (1293)
T ss_pred HhhchhhHHhHHHHHHHHHHHHHHHHHHHHhHHHHhcCcccH
Confidence 445545555678899999999999888776666665544433
No 101
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=46.91 E-value=41 Score=32.88 Aligned_cols=96 Identities=20% Similarity=0.267 Sum_probs=66.7
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccH-HHHHHHHh-----hhhcc
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYE-NKFAKLQK-----KAAEF 180 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSA-nm~a~L~K-----hkvsm 180 (503)
.-+..-.++|+..|+.+-.++-.|+..+.....+|++|+.|+.+. .+.+.+. +.+..|.| ....+
T Consensus 50 ~~ye~el~~lr~~id~~~~eka~l~~e~~~l~~e~~~~r~k~e~e---------~~~~~~le~el~~lrk~ld~~~~~r~ 120 (312)
T PF00038_consen 50 EMYEEELRELRRQIDDLSKEKARLELEIDNLKEELEDLRRKYEEE---------LAERKDLEEELESLRKDLDEETLARV 120 (312)
T ss_dssp HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cchhhHHHHhHHhhhhHHHHhhHHhhhhhhHHHHHHHHHHHHHHH---------HHHHHHHHHHHhhhhhhhhhhhhhHh
Confidence 345556678889999999999999999999999999999998887 3344444 44555542 23668
Q ss_pred chhhhccccccc-ccchhhhhhccCccccccch
Q psy9560 181 NFRSQLKAVKKR-EFSMEDAEKEKKPEWVKEPE 212 (503)
Q Consensus 181 DfRANLKqVKKE-e~~leEeekeeVgDWRKNiE 212 (503)
||.+.+.+.+.+ .| +..-=.+++.+++..+.
T Consensus 121 ~le~~i~~L~eEl~f-l~~~heeEi~~L~~~~~ 152 (312)
T PF00038_consen 121 DLENQIQSLKEELEF-LKQNHEEEIEELREQIQ 152 (312)
T ss_dssp HHHHHHHHHHHHHHH-HHHHHHHHHHTTSTT--
T ss_pred HHHHHHHHHHHHHHH-HHhhhhhhhhhhhhccc
Confidence 999999988886 33 22212335777777654
No 102
>PF04880 NUDE_C: NUDE protein, C-terminal conserved region; InterPro: IPR006964 This domain represents the C-terminal conserved region of NUDE proteins. Emericella nidulans (Aspergillus nidulans) NUDE, acts in the cytoplasmic dynein/dynactin pathway and is required for distribution of nuclei []. It is a homologue of the nuclear distribution protein RO11 of Neurospora crassa. NUDE interacts with the NUDF via an N-terminal coiled coil domain; this is the only domain which is absolutely required for NUDE function.; PDB: 2V66_B 2V71_B.
Probab=46.84 E-value=19 Score=34.51 Aligned_cols=49 Identities=18% Similarity=0.180 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhh---hhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAI---KYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEE---KYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.++|..++.+-.++-.-||+| |=.|...|.|..-|+.||++-+ .++-|.
T Consensus 2 LeD~EsklN~AIERnalLE~ELdEKE~L~~~~QRLkDE~RDLKqEl-~V~ek~ 53 (166)
T PF04880_consen 2 LEDFESKLNQAIERNALLESELDEKENLREEVQRLKDELRDLKQEL-IVQEKL 53 (166)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH---------------------
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHh
Confidence 456666666666666666654 6667777777788888888776 666543
No 103
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=46.76 E-value=44 Score=36.28 Aligned_cols=48 Identities=19% Similarity=0.178 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.++|+.-.++|-..|..|+.+.-.++..+++.+..|.+++.+++.|++
T Consensus 61 ~~kL~~~lk~~e~~i~~~~~ql~~s~~~l~~~~~~I~~~~~~l~~l~~ 108 (420)
T COG4942 61 RAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEV 108 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHHHHHH
Confidence 366778888888888888888888888888888888888888888865
No 104
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=46.58 E-value=36 Score=31.02 Aligned_cols=12 Identities=25% Similarity=0.454 Sum_probs=4.8
Q ss_pred HHHHHHHHHhhC
Q psy9560 142 TELGKQVNDLRG 153 (503)
Q Consensus 142 nELniKVnDLRG 153 (503)
.+|+.||..|.|
T Consensus 28 aEmkarIa~LEG 39 (134)
T PF08232_consen 28 AEMKARIAFLEG 39 (134)
T ss_pred HHHHHHHHHHHH
Confidence 334444444433
No 105
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=46.54 E-value=1.6e+02 Score=27.94 Aligned_cols=66 Identities=21% Similarity=0.231 Sum_probs=41.5
Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHH
Q psy9560 53 KGFMTPERKKKLRLLLRKKAAEELKR----EQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRI 120 (503)
Q Consensus 53 KgKMT~eRKkkLKSLLLqKAaEELEK----EKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRI 120 (503)
+-.||.+-|..||. |++...++++. .+....++++....+.. .++.-+.+|++.+++++.++++.-
T Consensus 50 ~L~LTdeQk~qik~-i~~~~r~~~k~~~~~~r~~~~~~m~a~~~~~~-~Ll~a~~FDeaavral~~~~~~~~ 119 (170)
T PRK12750 50 QLDLTDAQKEQLKE-MREANRAEMKAKYSGNREQSHAEMKAHHAKVQ-ALVLADDFDEAAANDLAKQMVEKQ 119 (170)
T ss_pred hCCCCHHHHHHHHH-HHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHH-HHHhcCCCCHHHHHHHHHHHHHHH
Confidence 45789999999998 45555555554 23333444444443333 344557788999998888776663
No 106
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=45.77 E-value=41 Score=27.94 Aligned_cols=43 Identities=19% Similarity=0.256 Sum_probs=32.8
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKV 148 (503)
+..+..++|.|-..|..++.+.-++..++.....+|..|+.+.
T Consensus 28 el~~~~~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~~~l 70 (74)
T PF12329_consen 28 ELKLNNTIKKLRAKIKELEKQIKELKKKLEELEKELESLEERL 70 (74)
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3456778888888888888777777777777777777777765
No 107
>PRK04325 hypothetical protein; Provisional
Probab=45.74 E-value=51 Score=27.47 Aligned_cols=30 Identities=20% Similarity=0.184 Sum_probs=18.4
Q ss_pred hhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 124 EAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 124 EEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
++--.|||.||..|+.-|.+||.-|...+.
T Consensus 8 e~Ri~~LE~klAfQE~tIe~LN~vv~~Qq~ 37 (74)
T PRK04325 8 EDRITELEIQLAFQEDLIDGLNATVARQQQ 37 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333445666677777777777766665554
No 108
>smart00338 BRLZ basic region leucin zipper.
Probab=45.69 E-value=61 Score=25.44 Aligned_cols=37 Identities=24% Similarity=0.247 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKV 148 (503)
...+|..+|..|+.+..+|..+|.....+|..|+...
T Consensus 27 ~~~~Le~~~~~L~~en~~L~~~~~~l~~e~~~lk~~~ 63 (65)
T smart00338 27 EIEELERKVEQLEAENERLKKEIERLRRELEKLKSEL 63 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4566667777777777777777777777777666553
No 109
>PRK14954 DNA polymerase III subunits gamma and tau; Provisional
Probab=45.58 E-value=3.7e+02 Score=30.42 Aligned_cols=21 Identities=5% Similarity=0.104 Sum_probs=10.2
Q ss_pred HHHHHHHHHHhhhcCchhhHH
Q psy9560 302 IDDLVEKLMSAVSATPQPLLK 322 (503)
Q Consensus 302 ~~~~~~~~~~~~~~~~~~~~~ 322 (503)
.++++++++.........-|.
T Consensus 320 ~~~v~~~l~~qA~~fs~~~L~ 340 (620)
T PRK14954 320 PESVKERYQQSAGGLTPAAVM 340 (620)
T ss_pred CHHHHHHHHHHHHhCCHHHHH
Confidence 445566665544444433333
No 110
>PRK02793 phi X174 lysis protein; Provisional
Probab=45.58 E-value=57 Score=27.05 Aligned_cols=25 Identities=24% Similarity=0.399 Sum_probs=13.6
Q ss_pred hhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 128 YELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 128 YDLE~KVkKQDyEInELniKVnDLR 152 (503)
.|||.++..|+.-|.+||.-|...+
T Consensus 11 ~~LE~~lafQe~tIe~Ln~~v~~Qq 35 (72)
T PRK02793 11 AELESRLAFQEITIEELNVTVTAHE 35 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555555555555443
No 111
>PF14662 CCDC155: Coiled-coil region of CCDC155
Probab=45.41 E-value=2.5e+02 Score=27.99 Aligned_cols=45 Identities=18% Similarity=0.220 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
.|+...++|+.....|--.-|-++.-+...+..|++-+.+|.+|.
T Consensus 120 ~lk~~~~eL~~~~~~Lq~Ql~~~e~l~~~~da~l~e~t~~i~eL~ 164 (193)
T PF14662_consen 120 GLKKRSKELATEKATLQRQLCEFESLICQRDAILSERTQQIEELK 164 (193)
T ss_pred hHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHH
Confidence 355555555555555555555555555555555555555555554
No 112
>KOG4364|consensus
Probab=45.17 E-value=95 Score=36.07 Aligned_cols=16 Identities=31% Similarity=0.493 Sum_probs=9.0
Q ss_pred ccCCCCccHHHHHHHH
Q psy9560 294 VSASNPVSIDDLVEKL 309 (503)
Q Consensus 294 ~~~~~~~~~~~~~~~~ 309 (503)
++..|-.||.||+...
T Consensus 663 l~hgn~ns~~~ii~E~ 678 (811)
T KOG4364|consen 663 LSHGNENSINDIIDED 678 (811)
T ss_pred hhcccccchHHHHHHH
Confidence 4555666666665443
No 113
>PF08764 Coagulase: Staphylococcus aureus coagulase; InterPro: IPR014874 Staphylococcus aureus secretes a cofactor called coagulase. Coagulase is an extracellular protein that forms a complex with human prothrombin, and activates it without the usual proteolytic cleavages []. The resulting complex directly initiates blood clotting. ; PDB: 1NU7_H 1NU9_C 2A1D_D.
Probab=44.73 E-value=3.7e+02 Score=28.15 Aligned_cols=43 Identities=5% Similarity=0.115 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHhhhhcCCC-----------CCCCCCCHHHHHHHHHHHHHH
Q psy9560 77 KREQERRALERKRVIDERCGEP-----------KDTDDMAEDEIGDLCQEYWQR 119 (503)
Q Consensus 77 EKEKEqKeEEKKKiLAERcpPP-----------LnIDGLSedELQEkCKELHdR 119 (503)
.+....-+..|+.|.......| .+|-||++.....|...|-+.
T Consensus 73 ~~k~~~~~~~k~~~~~~~~~np~~~~k~~~f~~~~i~~LT~~rYn~I~~sLK~a 126 (282)
T PF08764_consen 73 EKKKDEYEKYKKWYKKKKNENPNTKIKMKKFDKYNIYGLTMERYNYIYESLKEA 126 (282)
T ss_dssp HHHHHHHHHHHHHHHHHHCCSTTTTS----TTT------BHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHhhhcCcccccccccccccchhhcCHHHHHHHHHHHHHH
Confidence 4444444455666666432212 478888887655554444433
No 114
>PRK14959 DNA polymerase III subunits gamma and tau; Provisional
Probab=44.70 E-value=2.9e+02 Score=31.48 Aligned_cols=47 Identities=13% Similarity=0.082 Sum_probs=29.3
Q ss_pred HhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHH-HHhhhhhhHHHHHhhh
Q psy9560 90 VIDERCGEPKDTDDMAEDEIGDLCQEYWQRIY-NLEAIKYELDREIMLK 137 (503)
Q Consensus 90 iLAERcpPPLnIDGLSedELQEkCKELHdRId-kLEEEKYDLE~KVkKQ 137 (503)
.|..|| ..+.+..|+.++|...++++-.+.. .++.+-.++-.++...
T Consensus 165 TI~SRc-q~i~F~pLs~~eL~~~L~~il~~egi~id~eal~lIA~~s~G 212 (624)
T PRK14959 165 TIVSRC-QHFTFTRLSEAGLEAHLTKVLGREGVDYDPAAVRLIARRAAG 212 (624)
T ss_pred HHHhhh-hccccCCCCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHcCC
Confidence 356788 4678999999999877776444332 2455544554444443
No 115
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=44.59 E-value=4.5e+02 Score=29.23 Aligned_cols=42 Identities=5% Similarity=0.088 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
|++-...+-.+|..++.++-.--..++.+-..+.++|.+|..
T Consensus 145 l~e~l~~f~~~v~~~~~~~~~~~~~L~~qi~~L~~~n~~i~~ 186 (475)
T PRK10361 145 LREQLDGFRRQVQDSFGKEAQERHTLAHEIRNLQQLNAQMAQ 186 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555555555544333334444444555566666543
No 116
>KOG4031|consensus
Probab=44.30 E-value=1.3e+02 Score=30.51 Aligned_cols=33 Identities=15% Similarity=0.409 Sum_probs=26.4
Q ss_pred HHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 117 WQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 117 HdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|+||.+| .|+.-|+.++-+++.-|+.=+..|++
T Consensus 179 WErv~kL----~D~n~k~sk~gkD~SRlrslL~~LK~ 211 (216)
T KOG4031|consen 179 WERVAKL----CDFNPKSSKQGKDVSRLRSLLISLKQ 211 (216)
T ss_pred HHHHHHH----HcCCccchhccccHHHHHHHHHHhhh
Confidence 6777766 47888888999999888888877766
No 117
>PF06246 Isy1: Isy1-like splicing family; InterPro: IPR009360 Isy1 protein is important in the optimisation of splicing [].; PDB: 1X4T_A.
Probab=44.21 E-value=35 Score=34.53 Aligned_cols=43 Identities=16% Similarity=0.262 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHH-------HHHHHhhC
Q psy9560 110 GDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELG-------KQVNDLRG 153 (503)
Q Consensus 110 QEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELn-------iKVnDLRG 153 (503)
..+.+|+-.+|.+|-..-+. |++|+-.+-|||.|- .+|..|.|
T Consensus 50 ~~ii~EIs~kv~~Iqd~~L~-E~~IRdLNDeINkL~rEK~~WE~rI~~LGG 99 (255)
T PF06246_consen 50 RQIIKEISRKVTRIQDPSLG-EFQIRDLNDEINKLIREKRHWERRIKELGG 99 (255)
T ss_dssp HHHHHHHHHHHHHHHH--S--HHHHHHHHHHHHHHHHHHHHHHHHHHHHTS
T ss_pred HHHHHHHHHHHHHHHhcCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHhCC
Confidence 45677777777777666553 555555555555543 23556666
No 118
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=44.17 E-value=80 Score=28.33 Aligned_cols=49 Identities=14% Similarity=0.170 Sum_probs=40.1
Q ss_pred CHHHHHHHHHHHHHHHHHHhhh---hhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAI---KYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEE---KYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+.+.+..+|+=||+-|...+.+ +=+|..++.+.++||..|+..|..|+.
T Consensus 29 ~~~~~~~vin~i~~Ll~~~~r~~~~~e~l~~~~~~l~~d~~~l~~~~~rL~~ 80 (151)
T PF11559_consen 29 SEDNDVRVINCIYDLLQQRDRDMEQREDLSDKLRRLRSDIERLQNDVERLKE 80 (151)
T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 4567778888888888777665 557888999999999999999998876
No 119
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=44.03 E-value=1e+02 Score=25.78 Aligned_cols=46 Identities=20% Similarity=0.140 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
++|..++..+-+.|..|.-+-..|..+......+...|+.....|+
T Consensus 7 ~~LE~ki~~aveti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~ 52 (72)
T PF06005_consen 7 EQLEEKIQQAVETIALLQMENEELKEKNNELKEENEELKEENEQLK 52 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 6788888888888888888888888887777777777777776666
No 120
>KOG1144|consensus
Probab=43.81 E-value=48 Score=39.08 Aligned_cols=11 Identities=45% Similarity=0.444 Sum_probs=4.4
Q ss_pred HHHHHhHHHHh
Q psy9560 37 AEVRRRMEEAS 47 (503)
Q Consensus 37 AEvRKRlEEA~ 47 (503)
.|.|.|++|..
T Consensus 237 eEE~~r~eeEE 247 (1064)
T KOG1144|consen 237 EEERLRREEEE 247 (1064)
T ss_pred HHHHHHHHHHH
Confidence 33344444433
No 121
>PF09849 DUF2076: Uncharacterized protein conserved in bacteria (DUF2076); InterPro: IPR018648 This family of hypothetical prokaryotic proteins has no known function but includes putative perimplasmic ligand-binding sensor proteins.
Probab=43.75 E-value=44 Score=33.68 Aligned_cols=32 Identities=9% Similarity=0.323 Sum_probs=26.8
Q ss_pred HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 122 NLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 122 kLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+-=.--|=|-..|-.|+.=|+.|+.||.+|..
T Consensus 38 ~qP~A~Y~laQ~vlvQE~AL~~a~~ri~eLe~ 69 (247)
T PF09849_consen 38 RQPDAPYYLAQTVLVQEQALKQAQARIQELEA 69 (247)
T ss_pred hCCchHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344788889999999999999999999987
No 122
>KOG4643|consensus
Probab=43.55 E-value=3.8e+02 Score=32.81 Aligned_cols=144 Identities=18% Similarity=0.224 Sum_probs=76.8
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHhhhhccCCC-------CCHHHHHHHHHHH--HHHHHHHH-HHHHHHHHHHHHHHhhh-
Q psy9560 25 KRLKLEELERKRAEVRRRMEEASGARRKKGF-------MTPERKKKLRLLL--RKKAAEEL-KREQERRALERKRVIDE- 93 (503)
Q Consensus 25 ~kRKqeErERKkAEvRKRlEEA~KkKAKKgK-------MT~eRKkkLKSLL--LqKAaEEL-EKEKEqKeEEKKKiLAE- 93 (503)
+-|-...+.+=+.+.-+-++...--+.-+.. +|..|++++++++ ++-+-+.. ++..+.--.. +.|+..
T Consensus 77 e~a~~~~iaevtd~~~~vleld~~er~~~~q~~~hir~llk~r~~~~k~~id~~qe~se~i~e~~le~vGl~-~~~~~s~ 155 (1195)
T KOG4643|consen 77 EMAQMRTIAEVTDEECQVLELDNEERAQKIQILEHIRLLLKDRKKKWKSVIDDLQEASEKIAEKLLELVGLE-KKYRESR 155 (1195)
T ss_pred HHHHHHHHHHhhhhHHhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-ceeeccc
Confidence 3444445555555555555555544422333 3556666666654 33333322 2221111111 223222
Q ss_pred ----hcCCCCCCCCCCHHHHHHHHHHHHHHHHHHh---hhhhhHHH----HHhhhhhhHHHHHHHHHHhhCCCCCCcccc
Q psy9560 94 ----RCGEPKDTDDMAEDEIGDLCQEYWQRIYNLE---AIKYELDR----EIMLKDFEITELGKQVNDLRGKFVRPILKK 162 (503)
Q Consensus 94 ----RcpPPLnIDGLSedELQEkCKELHdRIdkLE---EEKYDLE~----KVkKQDyEInELniKVnDLRGKFKKPaLKK 162 (503)
-|..+.++.+-+...|..-|-.|-.+|..|. +|||+.-. .+...+.||..|++-+.++.. --++
T Consensus 156 s~~~~~~sp~~~~~~~~~hL~velAdle~kir~LrqElEEK~enll~lr~eLddleae~~klrqe~~e~l~-----ea~r 230 (1195)
T KOG4643|consen 156 SGKELYKSPYDIVVKKNLHLEVELADLEKKIRTLRQELEEKFENLLRLRNELDDLEAEISKLRQEIEEFLD-----EAHR 230 (1195)
T ss_pred cCCCCCCCcchhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHh
Confidence 2444556666666778888888888888776 45665433 344455666666666655433 1246
Q ss_pred ccccHHHHHHHH
Q psy9560 163 VSKYENKFAKLQ 174 (503)
Q Consensus 163 VRkSAnm~a~L~ 174 (503)
.++|.+.++.|.
T Consensus 231 a~~yrdeldalr 242 (1195)
T KOG4643|consen 231 ADRYRDELDALR 242 (1195)
T ss_pred hhhhhhHHHHHH
Confidence 677887788774
No 123
>PF09756 DDRGK: DDRGK domain; InterPro: IPR019153 This is a family of proteins of approximately 300 residues. They contain a highly conserved DDRGK motif. The function is unknown. ; PDB: 1WI9_A.
Probab=43.49 E-value=8.9 Score=37.13 Aligned_cols=18 Identities=50% Similarity=0.654 Sum_probs=7.9
Q ss_pred hhHHHHHHH-----HHHhhCCCC
Q psy9560 139 FEITELGKQ-----VNDLRGKFV 156 (503)
Q Consensus 139 yEInELniK-----VnDLRGKFK 156 (503)
+-|++|... |.|-||||.
T Consensus 132 ~ri~~L~~~g~ltGv~DdrGkfI 154 (188)
T PF09756_consen 132 NRIQELEAEGRLTGVIDDRGKFI 154 (188)
T ss_dssp HHHHHHHHHSSS-EEE-TT--EE
T ss_pred HHHHHHHHCCCceeeEcCCCCeE
Confidence 345555444 445588884
No 124
>PF07956 DUF1690: Protein of Unknown function (DUF1690) ; InterPro: IPR012471 Family of uncharacterised fungal proteins.
Probab=43.37 E-value=2.8e+02 Score=25.86 Aligned_cols=66 Identities=15% Similarity=0.284 Sum_probs=36.4
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCC---CCCCCCCCHHHHHHHHHHHHHHHH
Q psy9560 56 MTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGE---PKDTDDMAEDEIGDLCQEYWQRIY 121 (503)
Q Consensus 56 MT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpP---PLnIDGLSedELQEkCKELHdRId 121 (503)
-..+|.+.+-..|-++.+++|.+-+.+....=+.+|...... --+.++||...|.+.+..|-.+|.
T Consensus 24 tD~sR~q~~e~~iq~Rva~eL~~L~~~~~~~~~~~l~~~~~~ed~~~~~~~lSs~~v~~~i~~L~~kLe 92 (142)
T PF07956_consen 24 TDSSRAQTLELHIQERVAEELKRLEEEELKKFEEALEKSLLSEDEKDQDPGLSSKSVNEEIEKLREKLE 92 (142)
T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHHHHHHHHHHH
Confidence 346788888888888888888755444332222233322210 123355666666665555555554
No 125
>KOG2008|consensus
Probab=43.20 E-value=3.8e+02 Score=29.10 Aligned_cols=226 Identities=21% Similarity=0.196 Sum_probs=118.7
Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hhhhcCCCCCCCCCCH-HHHHHHHHHHHHHHHHHhhhhhhH
Q psy9560 53 KGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRV-IDERCGEPKDTDDMAE-DEIGDLCQEYWQRIYNLEAIKYEL 130 (503)
Q Consensus 53 KgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKi-LAERcpPPLnIDGLSe-dELQEkCKELHdRIdkLEEEKYDL 130 (503)
+.|.--.|+-.--.+-.|||++..++...-.+..|+.+ |.+.. +++-+-++. -+.+++...-.+++..+|.++.--
T Consensus 69 rpf~elk~~er~~r~e~QkAa~~FeRat~vl~~AkeqVsl~~~s--L~~~~~~~~~~~~~evlnh~~qrV~EaE~e~t~a 146 (426)
T KOG2008|consen 69 RPFWELKRVERQARLEAQKAAQDFERATEVLRAAKEQVSLAEQS--LLEDDKRQFDSAWQEVLNHATQRVMEAEQEKTRA 146 (426)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hhcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44544444332222456889988887666655555543 33333 233232232 345677777778888899999999
Q ss_pred HHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHHHHhhhhccchhhhcccccccccchhhhhhccCcccc--
Q psy9560 131 DREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAKLQKKAAEFNFRSQLKAVKKREFSMEDAEKEKKPEWV-- 208 (503)
Q Consensus 131 E~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~L~KhkvsmDfRANLKqVKKEe~~leEeekeeVgDWR-- 208 (503)
|.-+.-.-..+..|-.++..|-.+- |=++||-|-|-..-. +-..-|-+++|+|-- ++-+-...++++.
T Consensus 147 E~~Has~a~~~l~l~~~~R~~ek~n-~~AIkKSrpYfE~k~-----~~t~~le~qk~tv~~----Leaev~~~K~~Y~~s 216 (426)
T KOG2008|consen 147 ELVHASTAARYLALMGRMRQLEKKN-KRAIKKSRPYFELKA-----KYTVQLEQQKKTVDD----LEAEVTLAKGEYKMS 216 (426)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh-HHHHhhcchHHHHHH-----HHHHHHHHHHHHHHH----HHHHHHHhhccHHHH
Confidence 9888888888888888887776643 336666666643111 112223333333321 1112222333443
Q ss_pred -ccchhhccCCCCCCCCCCCCCCCccCcccCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCChhhhh
Q psy9560 209 -KEPESKKKRIPIPEPEPEPESATDASDVEDGKETPLPPEGEQPPVEEVPPPPEPEPEPPVEEIPPTPPSPVTPNPEELW 287 (503)
Q Consensus 209 -KNiE~K~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (503)
+|+|--+.+.-..--. .-+-.-.||-++.+.. --.|+.- |-++.+|..+..+
T Consensus 217 lrnLE~ISd~IHeeRss--qs~~apssd~e~~~s~-------~~s~e~l------------------pg~~p~p~~~~~~ 269 (426)
T KOG2008|consen 217 LRNLEMISDEIHEERSS--QSAMAPSSDGEGAESS-------STSVEDL------------------PGSKPEPDAISVA 269 (426)
T ss_pred HHHHHHHHHHHHHhhhh--hhccCCCccccccccc-------ccchhhC------------------CCCCCCCchhhhh
Confidence 6666544443222110 0000111222222111 1111111 1112255677788
Q ss_pred HHhhccccCCCCccHHHHHHHHHHhhhcCc
Q psy9560 288 LYLKDTVSASNPVSIDDLVEKLMSAVSATP 317 (503)
Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (503)
..-+|.-|.+|-||-||--..-.+.+|..|
T Consensus 270 ~~~~~dds~~~~~~e~~~e~~~~~s~~sg~ 299 (426)
T KOG2008|consen 270 SEAFEDDSCSNFVSEDDSETQSVSSFSSGP 299 (426)
T ss_pred hhhcccchhhcccccccccccccccccCCC
Confidence 888888999999998886555444444443
No 126
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=43.18 E-value=75 Score=29.25 Aligned_cols=17 Identities=0% Similarity=0.179 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHhhhhc
Q psy9560 79 EQERRALERKRVIDERC 95 (503)
Q Consensus 79 EKEqKeEEKKKiLAERc 95 (503)
.+....+++-+-|.++.
T Consensus 14 ~r~e~~e~~~K~le~~~ 30 (143)
T PF12718_consen 14 DRAEELEAKVKQLEQEN 30 (143)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44445555555555554
No 127
>PF14772 NYD-SP28: Sperm tail
Probab=43.14 E-value=56 Score=27.92 Aligned_cols=37 Identities=32% Similarity=0.284 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHh
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDL 151 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDL 151 (503)
+|.+||+-| +..+--++.-|.++|.-|..|...+.+.
T Consensus 52 ~~~eL~~~i---e~q~~~~e~ii~~Kd~lI~~L~~eL~~~ 88 (104)
T PF14772_consen 52 KPQELRKEI---EEQKQACERIIDRKDALIKELQQELKEA 88 (104)
T ss_pred cHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566776654 4456677788888888898888776553
No 128
>PF03357 Snf7: Snf7; InterPro: IPR005024 This is a family of eukaryotic proteins which are variously described as either hypothetical protein, developmental protein or related to yeast SNF7. The family contains human CHMP1. CHMP1 (CHromatin Modifying Protein; CHarged Multivesicular body Protein), is encoded by an alternative open reading frame in the PRSM1 gene [] and is conserved in both complex and simple eukaryotes. CHMP1 contains a predicted bipartite nuclear localisation signal and distributes as distinct forms to the cytoplasm and the nuclear matrix in all cell lines tested. Human CHMP1 is strongly implicated in multivesicular body formation. A multivesicular body is a vesicle-filled endosome that targets proteins to the interior of lysosomes. Immunocytochemistry and biochemical fractionation localise CHMP1 to early endosomes and CHMP1 physically interacts with SKD1/VPS4, a highly conserved protein directly linked to multivesicular body sorting in yeast. Similar to the action of a mutant SKD1 protein, over expression of a fusion derivative of human CHMP1 dilates endosomal compartments and disrupts the normal distribution of several endosomal markers. Genetic studies in Saccharomyces cerevisiae (Baker's yeast) further support a conserved role of CHMP1 in vesicle trafficking. Deletion of CHM1, the budding yeast homologue of CHMP1, results in defective sorting of carboxypeptidases S and Y and produces abnormal, multi-lamellar prevacuolar compartments. This phenotype classifies CHM1 as a member of the class E vacuolar protein sorting genes []. ; GO: 0015031 protein transport; PDB: 2V6X_B 2W2U_D 2GD5_D 3FRT_B 3FRV_A 4ABM_D 3EAB_H 3HTU_D.
Probab=43.04 E-value=47 Score=29.23 Aligned_cols=28 Identities=18% Similarity=0.198 Sum_probs=12.2
Q ss_pred hhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 125 AIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 125 EEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
.....|+..+.+.+.+|..|..++..+.
T Consensus 8 ~~~~~L~~~~~~le~~i~~~~~~~k~~~ 35 (171)
T PF03357_consen 8 KTIRRLEKQIKRLEKKIKKLEKKAKKAI 35 (171)
T ss_dssp HHHHHHHHHHHHHHHHHHHCHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333444444444444455544443
No 129
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=42.88 E-value=49 Score=28.92 Aligned_cols=42 Identities=19% Similarity=0.344 Sum_probs=28.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhh--hhhHHHHHHHHHHhhCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLK--DFEITELGKQVNDLRGKF 155 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQ--DyEInELniKVnDLRGKF 155 (503)
++|++....+.+||+.|| .+|... ..++++|+..|.+++|.+
T Consensus 38 ~~l~~~~~~~~~Rl~~lE-------~~l~~LPt~~dv~~L~l~l~el~G~~ 81 (106)
T PF10805_consen 38 EKLEERLDEHDRRLQALE-------TKLEHLPTRDDVHDLQLELAELRGEL 81 (106)
T ss_pred HHHHHHHHHHHHHHHHHH-------HHHHhCCCHHHHHHHHHHHHHHHhHH
Confidence 445555566666666655 445554 678888999999998844
No 130
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=42.72 E-value=1.8e+02 Score=29.06 Aligned_cols=43 Identities=14% Similarity=0.174 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 111 DLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 111 EkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.+..+|..-+..|..||--+...++.-+.||++|..-|.+++.
T Consensus 32 ~~L~e~~kE~~~L~~Er~~h~eeLrqI~~DIn~lE~iIkqa~~ 74 (230)
T PF10146_consen 32 KCLEEYRKEMEELLQERMAHVEELRQINQDINTLENIIKQAES 74 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555556666666666666666666666666666655543
No 131
>KOG3859|consensus
Probab=42.49 E-value=2.1e+02 Score=30.76 Aligned_cols=73 Identities=19% Similarity=0.256 Sum_probs=45.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHh----hhhhhHHHHHhhhhhhHHHH
Q psy9560 69 RKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLE----AIKYELDREIMLKDFEITEL 144 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLE----EEKYDLE~KVkKQDyEInEL 144 (503)
-+|-.+-|. +-++++++=+...-.|+ +- .+.+|++.-++||.+.+.|- +||--||.+.+..+.|++.+
T Consensus 325 eaKr~e~~~-e~qrkEee~rqmFvqrv-ke------kE~elke~Ekel~~kf~~lkr~h~eEk~kle~~rr~Leee~~~f 396 (406)
T KOG3859|consen 325 EAKRNEFLG-ELQRKEEEMRQMFVQRV-KE------KEAELKEAEKELHEKFDRLKRLHQEEKKKLEEKRKQLEEEVNAF 396 (406)
T ss_pred HHHHHHHHH-HHHHhHHHHHHHHHHHH-HH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444 33444444444444577 32 56789999999999998875 45666666666666666666
Q ss_pred HHHHH
Q psy9560 145 GKQVN 149 (503)
Q Consensus 145 niKVn 149 (503)
..|+.
T Consensus 397 ~~rk~ 401 (406)
T KOG3859|consen 397 QRRKT 401 (406)
T ss_pred HHHHH
Confidence 66654
No 132
>PRK02224 chromosome segregation protein; Provisional
Probab=42.32 E-value=3.7e+02 Score=30.32 Aligned_cols=15 Identities=13% Similarity=0.209 Sum_probs=7.4
Q ss_pred hhHHHHHHHHHHhhC
Q psy9560 139 FEITELGKQVNDLRG 153 (503)
Q Consensus 139 yEInELniKVnDLRG 153 (503)
.+|.+|+.+|..|..
T Consensus 627 ~~l~~~r~~i~~l~~ 641 (880)
T PRK02224 627 ERLAEKRERKRELEA 641 (880)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344455555555543
No 133
>PF03978 Borrelia_REV: Borrelia burgdorferi REV protein; InterPro: IPR007126 This family consists of several REV proteins from Borrelia burgdorferi (Lyme disease spirochete) and Borrelia garinii. The function of REV is unknown although it has been shown that the gene is induced during the ingesting of host blood suggesting a role in the metabolic activation of borreliae to adapt to physiological stimuli [].
Probab=42.17 E-value=34 Score=33.05 Aligned_cols=76 Identities=16% Similarity=0.182 Sum_probs=41.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhh
Q psy9560 61 KKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFE 140 (503)
Q Consensus 61 KkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyE 140 (503)
|+++=++|+-.++..-..++ ++-..|-.-+ +.|+.+.=-+..+.+|++|..|.+.-+|.--.
T Consensus 8 klff~~~lfvmaCkaYv~eK-----K~Insl~~~v------~~l~nk~d~~~yknyk~ki~eLke~lK~~~NA------- 69 (160)
T PF03978_consen 8 KLFFISMLFVMACKAYVEEK-----KEINSLIEDV------SKLNNKSDAEAYKNYKKKINELKEDLKDVSNA------- 69 (160)
T ss_pred HHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHH------HHhcccccHHHHHHHHHHHHHHHHHHHHhhhH-------
Confidence 34555666666776653222 1222233333 33322112258899999999998876665211
Q ss_pred HHHHHHHHHHhhCCCC
Q psy9560 141 ITELGKQVNDLRGKFV 156 (503)
Q Consensus 141 InELniKVnDLRGKFK 156 (503)
+|..++.+|++.|.
T Consensus 70 --Eleekll~lq~lfq 83 (160)
T PF03978_consen 70 --ELEEKLLKLQKLFQ 83 (160)
T ss_pred --HHHHHHHHHHHHHH
Confidence 45556666666554
No 134
>PF05103 DivIVA: DivIVA protein; InterPro: IPR007793 The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate minicells []. Inactivation of divIVA produces a minicell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype []. These proteins appear to contain coiled-coils.; PDB: 2WUK_C 2WUJ_A.
Probab=42.09 E-value=26 Score=30.04 Aligned_cols=53 Identities=23% Similarity=0.285 Sum_probs=31.7
Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 101 TDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 101 IDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+-|.+.+++-.+...|.+.|..|..+.=+|...+......|.+++.+..+|+.
T Consensus 15 ~rGYd~~eVD~fl~~l~~~~~~l~~e~~~L~~~~~~l~~~l~~~~~~~~~l~~ 67 (131)
T PF05103_consen 15 MRGYDPDEVDDFLDELAEELERLQRENAELKEEIEELQAQLEELREEEESLQR 67 (131)
T ss_dssp EEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCT-------------
T ss_pred CCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHH
Confidence 46889999999999999888888877666666666666666666655555554
No 135
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=42.09 E-value=76 Score=32.08 Aligned_cols=49 Identities=14% Similarity=0.176 Sum_probs=41.4
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
+.++..-..+-+.+..++.+.=||+..|.+...+|.+++.|+.+++-|.
T Consensus 34 ~k~~~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e~kl 82 (239)
T COG1579 34 KKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKL 82 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666667778888899999999999999999999999999998866
No 136
>PHA02562 46 endonuclease subunit; Provisional
Probab=41.89 E-value=1.6e+02 Score=31.04 Aligned_cols=47 Identities=17% Similarity=0.073 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
..+..-.+.|...|..|+...+|++.++.....++++|+..+.++..
T Consensus 354 ~~~~~~~~~l~~ei~~l~~~~~~~~~~l~~l~~~l~~~~~~~~~~~k 400 (562)
T PHA02562 354 ITLVDKAKKVKAAIEELQAEFVDNAEELAKLQDELDKIVKTKSELVK 400 (562)
T ss_pred HHHHHHHHHHHHHHHHHHhhhhchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555566677777777777777777777777777777766666543
No 137
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=41.88 E-value=2.8e+02 Score=31.43 Aligned_cols=42 Identities=14% Similarity=0.100 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
....+-..|..++.+.-+++.++.+...+|..|+.++..+..
T Consensus 302 ~~~~~~~~~~~~~~~l~~l~~~~~~~~~~~~~l~~~~~~l~~ 343 (1164)
T TIGR02169 302 EIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELER 343 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444555555555555556666655555544
No 138
>PF02994 Transposase_22: L1 transposable element; InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=41.83 E-value=40 Score=35.15 Aligned_cols=60 Identities=22% Similarity=0.261 Sum_probs=48.8
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccH
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYE 167 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSA 167 (503)
.|..-+.++=+||..||+.-.+|+..+....+.|..|..++.||.....+-.||=|.+.+
T Consensus 141 ~l~~Ri~e~Eeris~lEd~~~~i~~~~~~~~k~i~~l~~kl~DlEnrsRRnNiRIiGiPE 200 (370)
T PF02994_consen 141 SLNSRIDELEERISELEDRIEEIEQAIKELEKRIKKLEDKLDDLENRSRRNNIRIIGIPE 200 (370)
T ss_dssp ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTEEEEES---
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhccCCceeEEecCC
Confidence 577778888899999998888899999999999999999999999888899998888876
No 139
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=41.78 E-value=51 Score=31.00 Aligned_cols=49 Identities=24% Similarity=0.174 Sum_probs=30.8
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
...+|+..+.+....|..|..+...|+.++...+.+|.+++.-+.+|+-
T Consensus 103 ~l~~l~~~~~~~~~~l~~l~~~~~~L~~~~~~l~~~l~ek~k~~e~l~D 151 (194)
T PF08614_consen 103 ELQELEKELSEKERRLAELEAELAQLEEKIKDLEEELKEKNKANEILQD 151 (194)
T ss_dssp -----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567788888888888888888888888888888888887777766654
No 140
>cd07627 BAR_Vps5p The Bin/Amphiphysin/Rvs (BAR) domain of yeast Sorting Nexin Vps5p. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. Vsp5p is the yeast counterpart of human SNX1 and is part of the retromer complex, which functions in the endosome-to-Golgi retrieval of vacuolar protein sorting receptor Vps10p, the Golgi-resident membrane protein A-ALP, and endopeptidase Kex2. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in
Probab=41.43 E-value=32 Score=32.85 Aligned_cols=88 Identities=11% Similarity=0.091 Sum_probs=0.0
Q ss_pred HHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccH
Q psy9560 88 KRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYE 167 (503)
Q Consensus 88 KKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSA 167 (503)
+.+|..|+ +.+. +.+.+..-...++.++++|...----..|+...+.||.++..++...+-.|-. +|.
T Consensus 104 k~~~~~R~-~~~~----~~~~~~~~L~k~~~~~~Kl~~~~~s~~~K~~~~~~ei~~~e~~~~~a~~~~e~-------is~ 171 (216)
T cd07627 104 RAAFAQRQ-KLWQ----YWQSAESELSKKKAQLEKLKRQGKTQQEKLNSLLSELEEAERRASELKKEFEE-------VSE 171 (216)
T ss_pred HHHHHHHH-HHHH----HHHHHHHHHHHHHHHHHHHhccCCChHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHH
Q ss_pred HHHHHHH--hhhhccchhhhcc
Q psy9560 168 NKFAKLQ--KKAAEFNFRSQLK 187 (503)
Q Consensus 168 nm~a~L~--KhkvsmDfRANLK 187 (503)
.+-.-|. +.....|||+||+
T Consensus 172 ~~k~El~rF~~~r~~dfk~~l~ 193 (216)
T cd07627 172 LIKSELERFERERVEDFRNSVE 193 (216)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
No 141
>PRK11637 AmiB activator; Provisional
Probab=41.34 E-value=3.2e+02 Score=28.61 Aligned_cols=46 Identities=11% Similarity=0.109 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.|......+...|..++.+-=+++.++.+.+.+|.+|..+|..++.
T Consensus 79 ~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~ 124 (428)
T PRK11637 79 KQEEAISQASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQER 124 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444445555566666666666666666666666666666655544
No 142
>KOG0992|consensus
Probab=41.18 E-value=2.6e+02 Score=31.79 Aligned_cols=53 Identities=21% Similarity=0.208 Sum_probs=40.9
Q ss_pred HHHHHHHHHHHHHHH--------HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCc
Q psy9560 107 DEIGDLCQEYWQRIY--------NLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPI 159 (503)
Q Consensus 107 dELQEkCKELHdRId--------kLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPa 159 (503)
.+++-+....+++.. +|++.|-+|..+..+.+.-|.+|.+..+..|.+.-++.
T Consensus 363 ~ee~tlla~~~dr~se~~e~teqkleelk~~f~a~q~K~a~tikeL~~El~~yrr~i~~~~ 423 (613)
T KOG0992|consen 363 KEEKTLLAAADDRFSEYSELTEQKLEELKVQFTAKQEKHAETIKELEIELEEYRRAILRNA 423 (613)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence 444555544444432 59999999999999999999999999998877776666
No 143
>PF09278 MerR-DNA-bind: MerR, DNA binding; InterPro: IPR015358 This entry represents a family of DNA-binding domains that are predominantly found in the prokaryotic transcriptional regulator MerR. They adopt a structure consisting of a core of three alpha helices, with an architecture that is similar to that of the 'winged helix' fold []. ; PDB: 3QAO_A 1R8D_B 1JBG_A 2VZ4_A 2ZHH_A 2ZHG_A 1Q09_A 1Q08_B 1Q0A_B 1Q07_A ....
Probab=41.08 E-value=41 Score=25.73 Aligned_cols=16 Identities=25% Similarity=0.399 Sum_probs=9.3
Q ss_pred CCCCHHHHHHHHHHHHH
Q psy9560 102 DDMAEDEIGDLCQEYWQ 118 (503)
Q Consensus 102 DGLSedELQEkCKELHd 118 (503)
-|+|.+++++++ .|++
T Consensus 13 lGfsL~eI~~~l-~l~~ 28 (65)
T PF09278_consen 13 LGFSLEEIRELL-ELYD 28 (65)
T ss_dssp TT--HHHHHHHH-HHCC
T ss_pred cCCCHHHHHHHH-hccC
Confidence 367888888877 4433
No 144
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=40.97 E-value=74 Score=30.91 Aligned_cols=45 Identities=22% Similarity=0.363 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
+|..-++.|...|..|+...=-++..|..++.+|.+|+.+|.++.
T Consensus 53 ~L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~ 97 (251)
T PF11932_consen 53 ELLAEYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIE 97 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444445555555554444455555555555555555544443
No 145
>PRK13848 conjugal transfer protein TraC; Provisional
Probab=40.24 E-value=1.3e+02 Score=27.18 Aligned_cols=46 Identities=28% Similarity=0.378 Sum_probs=36.1
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHH
Q psy9560 73 AEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIY 121 (503)
Q Consensus 73 aEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRId 121 (503)
++.|+ +.+.++.||-.-|+-+++ +-+|+ +++++|+..+.+|-.|+.
T Consensus 16 qe~lk-~~e~keAERigRiAlKAG-Lgeie-I~d~eL~~aFeeiAaRFR 61 (98)
T PRK13848 16 QEQLK-QAETREAERIGRIALKAG-LGEIE-IEEAELQAAFEELAKRFR 61 (98)
T ss_pred HHHHH-HHHHHHHHHHHHHHHHcC-ccccc-cCHHHHHHHHHHHHHHHh
Confidence 34444 567788888888998996 76777 689999999999988865
No 146
>PF07462 MSP1_C: Merozoite surface protein 1 (MSP1) C-terminus; InterPro: IPR010901 This entry represents the C-terminal region of merozoite surface protein 1 (MSP1), which is found in a number of Plasmodium species. MSP-1 is a 200 kDa protein expressed on the surface of the Plasmodium vivax merozoite. MSP-1 of Plasmodium species is synthesised as a high-molecular-weight precursor and then processed into several fragments. At the time of red cell invasion by the merozoite, only the 19 kDa C-terminal fragment (MSP-119), which contains two epidermal growth factor-like domains, remains on the surface. Antibodies against MSP-119 inhibit merozoite entry into red cells, and immunisation with MSP-119 protects monkeys from challenging infections. Hence, MSP-119 is considered a promising vaccine candidate [].; GO: 0009405 pathogenesis, 0016020 membrane
Probab=39.99 E-value=66 Score=36.28 Aligned_cols=18 Identities=28% Similarity=0.182 Sum_probs=11.7
Q ss_pred HHhhhhhhHHHHHHHHHH
Q psy9560 133 EIMLKDFEITELGKQVND 150 (503)
Q Consensus 133 KVkKQDyEInELniKVnD 150 (503)
+++..-.||+.|+.+|+-
T Consensus 55 ~~~~L~~eI~~LK~~~q~ 72 (574)
T PF07462_consen 55 KWNALGEEIEELKKKIQV 72 (574)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 345556678888777653
No 147
>PRK04406 hypothetical protein; Provisional
Probab=39.86 E-value=55 Score=27.51 Aligned_cols=34 Identities=21% Similarity=0.291 Sum_probs=23.1
Q ss_pred HHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 120 IYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 120 IdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|+.+++--.|||.++..|+.-|.+||.-|...+.
T Consensus 6 ~~~le~Ri~~LE~~lAfQE~tIe~LN~~v~~Qq~ 39 (75)
T PRK04406 6 IEQLEERINDLECQLAFQEQTIEELNDALSQQQL 39 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445545556777888888888888877766554
No 148
>KOG0963|consensus
Probab=39.74 E-value=4.2e+02 Score=30.63 Aligned_cols=101 Identities=18% Similarity=0.219 Sum_probs=49.5
Q ss_pred HHHHHHHHHHHHHhHHHHhhhhccCCCCCHHHHH--HHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCC
Q psy9560 29 LEELERKRAEVRRRMEEASGARRKKGFMTPERKK--KLRLLLR---KKAAEELKREQERRALERKRVIDERCGEPKDTDD 103 (503)
Q Consensus 29 qeErERKkAEvRKRlEEA~KkKAKKgKMT~eRKk--kLKSLLL---qKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDG 103 (503)
-.|+++.+.++-++..+-.-.|.+...+..-+.. ++..++. ..+...++.+.++.-.+|.+-|-+.-.. +-.
T Consensus 120 ~~e~~~lk~~lee~~~el~~~k~qq~~v~~l~e~l~k~~~~~~~~ie~~a~~~e~~~~q~~~e~e~~L~~~~~~---~~~ 196 (629)
T KOG0963|consen 120 SEENEELKEELEEVNNELADLKTQQVTVRNLKERLRKLEQLLEIFIENAANETEEKLEQEWAEREAGLKDEEQN---LQE 196 (629)
T ss_pred hhhHHHHHHHHHHHHHHHhhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHH
Confidence 3446666666555555554433233333322221 1122221 1233455555666666666666655531 110
Q ss_pred CCHHHHHHHHHHHHHHHHHHhhhhhhHHHH
Q psy9560 104 MAEDEIGDLCQEYWQRIYNLEAIKYELDRE 133 (503)
Q Consensus 104 LSedELQEkCKELHdRIdkLEEEKYDLE~K 133 (503)
-...|..+...||..|.....+.||+..+
T Consensus 197 -q~~~le~ki~~lq~a~~~t~~el~~~~s~ 225 (629)
T KOG0963|consen 197 -QLEELEKKISSLQSAIEDTQNELFDLKSK 225 (629)
T ss_pred -HHHHHHHHHHHHHHHHHhhhhHHHHHHHh
Confidence 12445556666666666666666666665
No 149
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=39.65 E-value=3e+02 Score=25.22 Aligned_cols=24 Identities=25% Similarity=0.256 Sum_probs=9.5
Q ss_pred HHHHHHHHHHhhhhhhHHHHHhhh
Q psy9560 114 QEYWQRIYNLEAIKYELDREIMLK 137 (503)
Q Consensus 114 KELHdRIdkLEEEKYDLE~KVkKQ 137 (503)
+.+.++++.+++...++...+...
T Consensus 126 ~~~~~~l~~l~~~~~~~~~e~~~l 149 (191)
T PF04156_consen 126 KSVEERLDSLDESIKELEKEIREL 149 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444433333333333
No 150
>PF05793 TFIIF_alpha: Transcription initiation factor IIF, alpha subunit (TFIIF-alpha); InterPro: IPR008851 Transcription initiation factor IIF, alpha subunit (TFIIF-alpha) or RNA polymerase II-associating protein 74 (RAP74) is the large subunit of transcription factor IIF (TFIIF), which is essential for accurate initiation and stimulates elongation by RNA polymerase II [].; GO: 0003677 DNA binding, 0045893 positive regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1F3U_F 1NHA_A 1I27_A 1J2X_A 2K7L_A 1ONV_A.
Probab=39.45 E-value=41 Score=36.81 Aligned_cols=60 Identities=25% Similarity=0.363 Sum_probs=42.6
Q ss_pred CCCCCChhhhhHHhhccccCCCCccHHHHHHHHHHhhhcCchhhHHHHHHHHHhhhhccccccCC
Q psy9560 277 SPVTPNPEELWLYLKDTVSASNPVSIDDLVEKLMSAVSATPQPLLKSVVKDTLRRQASHEETEGG 341 (503)
Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 341 (503)
.-+..+-+++-.||.. -||+.-|||.+|.+-+.-....-++.+|.+.|++.+-|..|.+|
T Consensus 460 ~~~~vTEe~VrryL~r-----kPmTTkdLL~kFk~k~~~l~s~~~~~~~~~ILkki~p~~k~~~~ 519 (527)
T PF05793_consen 460 DDSGVTEEEVRRYLRR-----KPMTTKDLLKKFKSKKTGLSSEEDKNVFAQILKKINPIQKTIGG 519 (527)
T ss_dssp --SS--HHHHHHHHHH-----S-B-HHHHHHHTSHHHHSS-HHHHHHHHHHHHHHH--EEEEETT
T ss_pred cccCcCHHHHHHHHhc-----CCccHHHHHHHHHhccCCCCcHHHHHHHHHHHHHhChHheeccC
Confidence 3445567899999984 59999999999999855555567889999999999888877766
No 151
>PRK04325 hypothetical protein; Provisional
Probab=39.44 E-value=33 Score=28.55 Aligned_cols=46 Identities=17% Similarity=0.089 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.|.+-.-+|=-+|.-.|.--=+|+.-|.+|..+|..|+.++..|..
T Consensus 6 ~~e~Ri~~LE~klAfQE~tIe~LN~vv~~Qq~~I~~L~~ql~~L~~ 51 (74)
T PRK04325 6 EMEDRITELEIQLAFQEDLIDGLNATVARQQQTLDLLQAQLRLLYQ 51 (74)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3433444444444444444445666666666666666666655544
No 152
>PHA03264 envelope glycoprotein D; Provisional
Probab=39.41 E-value=4e+02 Score=29.34 Aligned_cols=36 Identities=31% Similarity=0.278 Sum_probs=26.7
Q ss_pred HHHHHHhhhcCchhhHHHHHHHHHhhhhccccccCCCCCCCCC
Q psy9560 306 VEKLMSAVSATPQPLLKSVVKDTLRRQASHEETEGGEPKPEGE 348 (503)
Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (503)
|..|+-..+.-||++.|.||+...||. ||+.++-=.
T Consensus 224 Vd~l~yL~~yyPQ~~Hka~VkyWf~r~-------Gg~vP~~f~ 259 (416)
T PHA03264 224 VLRLTYLTQYYPQEAHKAIVDYWFMRH-------GGVVPPYFE 259 (416)
T ss_pred eeeeecccccCCcHHHHHHHHHHHHHc-------CCCCChhhh
Confidence 344555667789999999999999974 666665433
No 153
>PF07820 TraC: TraC-like protein; InterPro: IPR012930 The members of this family are sequences that are similar to TraC (Q84HT8 from SWISSPROT) from Rhizobium etli. The gene encoding this protein is one of a group of genes found on plasmid p42a of Rhizobium etli (strain CFN 42/ATCC 51251) that are thought to be involved in the process of plasmid self-transmission. Mobilisation of plasmid p42a is of importance as it is required for transfer of plasmid p42d, the symbiotic plasmid which carries most of the genes required for nodulation and nitrogen fixation by this symbiotic bacterium. The predicted protein products of p42a are similar to known transfer proteins of Agrobacterium tumefaciens plasmid pTiC58 []. ; GO: 0000746 conjugation
Probab=39.37 E-value=1.4e+02 Score=26.78 Aligned_cols=48 Identities=23% Similarity=0.331 Sum_probs=36.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHH
Q psy9560 72 AAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYN 122 (503)
Q Consensus 72 AaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdk 122 (503)
-++.|+ ..+.++.||---|+=+++ +.+|+ +++++|+..+.+|-.|+.+
T Consensus 14 Lqe~lk-~~e~keaERigr~AlKaG-L~eie-I~d~eL~~~FeeIa~RFrk 61 (92)
T PF07820_consen 14 LQEQLK-QAETKEAERIGRIALKAG-LGEIE-ISDAELQAAFEEIAARFRK 61 (92)
T ss_pred HHHHHH-HHHHHHHHHHHHHHHHcc-ccccc-CCHHHHHHHHHHHHHHHhc
Confidence 334444 567788888888888896 77887 5899999999998888653
No 154
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=39.16 E-value=97 Score=25.24 Aligned_cols=25 Identities=24% Similarity=0.391 Sum_probs=13.3
Q ss_pred hHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 129 ELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 129 DLE~KVkKQDyEInELniKVnDLRG 153 (503)
+||.++..++.-|++||.-|.+.+.
T Consensus 8 ~LE~~la~qe~~ie~Ln~~v~~Qq~ 32 (69)
T PF04102_consen 8 ELEIKLAFQEDTIEELNDVVTEQQR 32 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555555555555555554443
No 155
>PRK00846 hypothetical protein; Provisional
Probab=39.14 E-value=58 Score=27.90 Aligned_cols=30 Identities=30% Similarity=0.396 Sum_probs=19.5
Q ss_pred hhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 124 EAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 124 EEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+.---|||.++..|+.-|.+||.-|...+-
T Consensus 12 e~Ri~~LE~rlAfQe~tIe~LN~~v~~qq~ 41 (77)
T PRK00846 12 EARLVELETRLSFQEQALTELSEALADARL 41 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333346667777777777777777766543
No 156
>PF15254 CCDC14: Coiled-coil domain-containing protein 14
Probab=39.11 E-value=1.7e+02 Score=34.55 Aligned_cols=46 Identities=15% Similarity=0.060 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 110 GDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 110 QEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.-..+-|+-++...|-|.-.|...|++.|.||+-|+.=...|++-+
T Consensus 507 l~~~k~~q~kLe~sekEN~iL~itlrQrDaEi~RL~eLtR~LQ~Sm 552 (861)
T PF15254_consen 507 LVNVKSLQFKLEASEKENQILGITLRQRDAEIERLRELTRTLQNSM 552 (861)
T ss_pred HHHHHHHhhhHHHHHhhhhHhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3345677777788888888888999999999999998888887743
No 157
>KOG4302|consensus
Probab=38.85 E-value=1.7e+02 Score=33.63 Aligned_cols=25 Identities=8% Similarity=0.209 Sum_probs=22.3
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhH
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYEL 130 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDL 130 (503)
.++|+..|.+||++++.-++++|-+
T Consensus 249 l~~l~~~~~~LWn~l~ts~Ee~~~f 273 (660)
T KOG4302|consen 249 LQDLRTKLLELWNLLDTSDEERQRF 273 (660)
T ss_pred HHHHHHHHHHHHHhccCCHHHHHHH
Confidence 4679999999999999999998877
No 158
>PF08700 Vps51: Vps51/Vps67; InterPro: IPR014812 The VFT tethering complex (also known as GARP complex, Golgi associated retrograde protein complex, Vps53 tethering complex) is a conserved eukaryotic docking complex which is involved in recycling of proteins from endosomes to the late Golgi. Vps51 (also known as Vps67) is a subunit of VFT and interacts with the SNARE Tlg1 [].
Probab=38.76 E-value=1.2e+02 Score=24.38 Aligned_cols=24 Identities=21% Similarity=0.165 Sum_probs=18.0
Q ss_pred CCCHHHHHHHHHHHHHHHHHHhhh
Q psy9560 103 DMAEDEIGDLCQEYWQRIYNLEAI 126 (503)
Q Consensus 103 GLSedELQEkCKELHdRIdkLEEE 126 (503)
+.+..+|+.++++|...|..++++
T Consensus 18 ~~s~~~i~~~~~~L~~~i~~~~~e 41 (87)
T PF08700_consen 18 NSSIKEIRQLENKLRQEIEEKDEE 41 (87)
T ss_pred hCCHHHHHHHHHHHHHHHHHHHHH
Confidence 346778888888888888777765
No 159
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=38.61 E-value=1.2e+02 Score=24.86 Aligned_cols=49 Identities=10% Similarity=0.199 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
++|-.=+..|..+|++|..+-=.|-.-|..-.-|-.--|.||..+-..|
T Consensus 6 d~Ls~dVq~L~~kvdqLs~dv~~lr~~v~~ak~EAaRAN~RlDN~a~sY 54 (56)
T PF04728_consen 6 DQLSSDVQTLNSKVDQLSSDVNALRADVQAAKEEAARANQRLDNIAQSY 54 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhc
Confidence 3444444555566666655555555555555555555555554443333
No 160
>PF01765 RRF: Ribosome recycling factor; InterPro: IPR023584 The ribosome recycling factor or ribosome release factor (RRF) dissociates ribosomes from mRNA after termination of translation, and is essential for bacterial growth []. Thus ribosomes are 'recycled' and ready for another round of protein synthesis. This entry represents a domain found in ribosome recycling factors.; PDB: 3R8N_Y 1Y69_8 1ISE_A 1EK8_A 1DD5_A 1WIH_A 2Z4L_6 2QBE_6 2V46_Y 1EH1_A ....
Probab=37.48 E-value=1.3e+02 Score=27.87 Aligned_cols=36 Identities=22% Similarity=0.181 Sum_probs=22.4
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy9560 54 GFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDE 93 (503)
Q Consensus 54 gKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAE 93 (503)
..||.+||+.| .+.++...++-+..-+.-|+.++..
T Consensus 85 P~~T~E~R~~l----~k~~k~~~E~~k~~iR~iR~~~~~~ 120 (165)
T PF01765_consen 85 PPPTEERRKEL----VKQAKKIAEEAKVSIRNIRRDAMKK 120 (165)
T ss_dssp -SSSHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36899997764 4455555555555666666666654
No 161
>KOG0247|consensus
Probab=37.31 E-value=3.2e+02 Score=32.29 Aligned_cols=110 Identities=15% Similarity=0.191 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHhHHHHhhhhccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHH
Q psy9560 29 LEELERKRAEVRRRMEEASGARRKKGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDE 108 (503)
Q Consensus 29 qeErERKkAEvRKRlEEA~KkKAKKgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedE 108 (503)
.+...+.-.|.++|.....+ + ..++..-|..+..+.......... .+|.++. ++
T Consensus 492 ~~~l~~llee~~~~~~~~~~-~----------~l~~~~~~k~~~~~q~~~~~~~~~----~~~~~~l-----------~~ 545 (809)
T KOG0247|consen 492 KETLDQLLEELEKRILLRTK-E----------ILQNNKSLKEKECRQKLMNAQLES----QMLSSQL-----------ND 545 (809)
T ss_pred HHHHHHHHHHHHHHHHHHHH-H----------HHhcchhhHHHHHHHHHHHHHHHH----HHHHHHH-----------hH
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhh---------HHHHHHHHHHhhCCC--CCCcccccc
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFE---------ITELGKQVNDLRGKF--VRPILKKVS 164 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyE---------InELniKVnDLRGKF--KKPaLKKVR 164 (503)
..+.++++.+.|..+..++-++++++.-++.. ..+|+.+-++|+-+| +++.+++|+
T Consensus 546 kke~i~q~re~~~~~~k~~l~~e~~~~i~E~~~~~~~i~~l~~el~eq~~~~~~~~~~~~~~~~~~~ 612 (809)
T KOG0247|consen 546 KKEQIEQLRDEIERLKKENLTTEYSIEILESTEYEEEIEALDQELEEQKMELQQKFSEKKKAMAKVR 612 (809)
T ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhHHHHHHHHhhhHHHHhhccchhHHHhhhc
No 162
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=37.05 E-value=1.2e+02 Score=23.88 Aligned_cols=34 Identities=26% Similarity=0.287 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHH
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIMLKDFEITELG 145 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELn 145 (503)
.+.+|.++++.|+.+.=+|...+.....+|..|.
T Consensus 27 ~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~ 60 (64)
T PF00170_consen 27 YIEELEEKVEELESENEELKKELEQLKKEIQSLK 60 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444443333333444443
No 163
>PTZ00454 26S protease regulatory subunit 6B-like protein; Provisional
Probab=36.99 E-value=91 Score=32.88 Aligned_cols=49 Identities=12% Similarity=0.108 Sum_probs=41.6
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
....|.+..++|...++.|+.+.-+|+...++...+|+.|+..++.|++
T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (398)
T PTZ00454 16 TERDLYEKLKELEKELEFLDIQEEYIKEEQKNLKRELIRAKEEVKRIQS 64 (398)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 3466777778888889999999999999999999999999999998865
No 164
>KOG3032|consensus
Probab=36.95 E-value=1e+02 Score=31.83 Aligned_cols=8 Identities=25% Similarity=0.605 Sum_probs=5.6
Q ss_pred Ccccc-ccc
Q psy9560 204 KPEWV-KEP 211 (503)
Q Consensus 204 VgDWR-KNi 211 (503)
.-||| ||+
T Consensus 254 a~DWRaKnl 262 (264)
T KOG3032|consen 254 AVDWRAKNL 262 (264)
T ss_pred hhhhhhhhc
Confidence 34999 765
No 165
>PRK04406 hypothetical protein; Provisional
Probab=36.88 E-value=87 Score=26.33 Aligned_cols=27 Identities=22% Similarity=0.308 Sum_probs=15.1
Q ss_pred hhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 127 KYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 127 KYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
-=+|+.-|.+|..+|..|..++..|..
T Consensus 27 Ie~LN~~v~~Qq~~I~~L~~ql~~L~~ 53 (75)
T PRK04406 27 IEELNDALSQQQLLITKMQDQMKYVVG 53 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334555556666666666666555544
No 166
>PTZ00121 MAEBL; Provisional
Probab=36.79 E-value=3.8e+02 Score=34.32 Aligned_cols=6 Identities=17% Similarity=0.373 Sum_probs=2.9
Q ss_pred HHHhhh
Q psy9560 172 KLQKKA 177 (503)
Q Consensus 172 ~L~Khk 177 (503)
.+.+|+
T Consensus 1848 ~~~~~~ 1853 (2084)
T PTZ00121 1848 AFEKHK 1853 (2084)
T ss_pred Hhhhhc
Confidence 344554
No 167
>TIGR00237 xseA exodeoxyribonuclease VII, large subunit. This family consist of exodeoxyribonuclease VII, large subunit XseA which catalyses exonucleolytic cleavage in either the 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. Exonuclease VII consists of one large subunit and four small subunits.
Probab=35.94 E-value=5.7e+02 Score=27.32 Aligned_cols=14 Identities=36% Similarity=0.385 Sum_probs=9.0
Q ss_pred ccCCcHHHHHHhhH
Q psy9560 8 AAETPAEAAAASAE 21 (503)
Q Consensus 8 ~~~~~~~~~~~mad 21 (503)
-+.||+.|+.-...
T Consensus 242 ra~TPtaaae~~~p 255 (432)
T TIGR00237 242 RAPTPSAAAEIVSP 255 (432)
T ss_pred cCCCcHHHHHHhCc
Confidence 46788877655544
No 168
>KOG4661|consensus
Probab=35.86 E-value=1e+02 Score=35.49 Aligned_cols=14 Identities=14% Similarity=0.209 Sum_probs=7.1
Q ss_pred HHhhhhhhHHHHHh
Q psy9560 122 NLEAIKYELDREIM 135 (503)
Q Consensus 122 kLEEEKYDLE~KVk 135 (503)
+|+++|-|-..++-
T Consensus 681 ~ve~eRr~eqeRih 694 (940)
T KOG4661|consen 681 KVEEERRDEQERIH 694 (940)
T ss_pred HHHHhhcchhhhhh
Confidence 44555555554444
No 169
>KOG0979|consensus
Probab=35.83 E-value=9.1e+02 Score=29.66 Aligned_cols=22 Identities=27% Similarity=0.306 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy9560 69 RKKAAEELKREQERRALERKRV 90 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKi 90 (503)
++.+...++++-....++.+.|
T Consensus 253 ~k~~~~r~k~~~r~l~k~~~pi 274 (1072)
T KOG0979|consen 253 YKQAKDRAKKELRKLEKEIKPI 274 (1072)
T ss_pred HHHHHHHHHHHHHHHHHhhhhh
Confidence 3444455554433444444443
No 170
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=35.76 E-value=4.5e+02 Score=31.41 Aligned_cols=45 Identities=22% Similarity=0.257 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|+....+|-..+..+..+-|++...+...+-+|..+..++..+.-
T Consensus 272 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~le~~~~~~~~~~~~~~~ 316 (1163)
T COG1196 272 LKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELEN 316 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444455555555555555555555555555555555443
No 171
>KOG1850|consensus
Probab=35.70 E-value=2.6e+02 Score=30.23 Aligned_cols=21 Identities=14% Similarity=0.297 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhh
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAI 126 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEE 126 (503)
.+.|+++|++|...+.++-+|
T Consensus 75 rsKLeelCRelQr~nk~~keE 95 (391)
T KOG1850|consen 75 RSKLEELCRELQRANKQTKEE 95 (391)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 355666666666655554433
No 172
>PRK11546 zraP zinc resistance protein; Provisional
Probab=35.70 E-value=3.9e+02 Score=25.35 Aligned_cols=74 Identities=12% Similarity=0.156 Sum_probs=45.8
Q ss_pred CCCHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHH
Q psy9560 55 FMTPERKKKLRLLLRKKAA--EELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDR 132 (503)
Q Consensus 55 KMT~eRKkkLKSLLLqKAa--EELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~ 132 (503)
.||++-...+.++.-+... ..|.++...|+.|=+..|+- +.-+.+.++.+.+|+.+--.+|.+.|+-+..
T Consensus 43 ~LT~EQQa~~q~I~~~f~~~t~~LRqqL~aKr~ELnALl~~--------~~pD~~kI~aL~kEI~~Lr~kL~e~r~~~~~ 114 (143)
T PRK11546 43 PLTTEQQAAWQKIHNDFYAQTSALRQQLVSKRYEYNALLTA--------NPPDSSKINAVAKEMENLRQSLDELRVKRDI 114 (143)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC--------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4888888877776533332 33444444444444444332 2225677889999888888888888775555
Q ss_pred HHhh
Q psy9560 133 EIML 136 (503)
Q Consensus 133 KVkK 136 (503)
.+.|
T Consensus 115 ~~~k 118 (143)
T PRK11546 115 AMAE 118 (143)
T ss_pred HHHH
Confidence 5554
No 173
>PRK03963 V-type ATP synthase subunit E; Provisional
Probab=35.01 E-value=3.7e+02 Score=24.91 Aligned_cols=20 Identities=20% Similarity=0.385 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHHHhhhhh
Q psy9560 109 IGDLCQEYWQRIYNLEAIKY 128 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKY 128 (503)
++++..+..++|..+..+.|
T Consensus 83 ~~~v~~~a~~~l~~~~~~~Y 102 (198)
T PRK03963 83 ISEVLEAVRERLAELPEDEY 102 (198)
T ss_pred HHHHHHHHHHHHHhhhhhhH
Confidence 34455555556655555533
No 174
>PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed
Probab=34.27 E-value=5.7e+02 Score=26.83 Aligned_cols=15 Identities=40% Similarity=0.468 Sum_probs=9.9
Q ss_pred ccCCcHHHHHHhhHH
Q psy9560 8 AAETPAEAAAASAED 22 (503)
Q Consensus 8 ~~~~~~~~~~~mad~ 22 (503)
-+.||+.|+.-...+
T Consensus 247 ra~TPtaaae~~~~~ 261 (438)
T PRK00286 247 RAPTPTAAAELAVPD 261 (438)
T ss_pred cCCChHHHHHHhCcc
Confidence 467888877655543
No 175
>KOG3654|consensus
Probab=34.26 E-value=37 Score=38.18 Aligned_cols=46 Identities=41% Similarity=0.722 Sum_probs=0.0
Q ss_pred HHHHHHHHH---HHHHHHHHHHHhHHHHhhhh----ccCCCCCHH--HHHHHHHH
Q psy9560 22 DEAKRLKLE---ELERKRAEVRRRMEEASGAR----RKKGFMTPE--RKKKLRLL 67 (503)
Q Consensus 22 ~ed~kRKqe---ErERKkAEvRKRlEEA~KkK----AKKgKMT~e--RKkkLKSL 67 (503)
+|.+.|||+ |+|-||+|.|+..||.-..+ ++.-|+--+ ||++||.+
T Consensus 405 eear~rkqqleae~e~kreearrkaeeer~~keee~arrefirqey~rrkqlklm 459 (708)
T KOG3654|consen 405 EEARRRKQQLEAEKEQKREEARRKAEEERAPKEEEVARREFIRQEYERRKQLKLM 459 (708)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhhhhhHHHHHHHHHHHHHHHHHH
No 176
>PF05531 NPV_P10: Nucleopolyhedrovirus P10 protein; InterPro: IPR008702 This family consists of several nucleopolyhedrovirus P10 proteins which are thought to be involved in the morphogenesis of the polyhedra [].; GO: 0019028 viral capsid
Probab=34.26 E-value=1.2e+02 Score=26.13 Aligned_cols=47 Identities=17% Similarity=0.338 Sum_probs=34.0
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhh---HHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYE---LDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYD---LE~KVkKQDyEInELniKVnDLRG 153 (503)
..|-.+|-.|..+++.|...--| |+.|+..+...+++|+.+|++++-
T Consensus 14 k~vd~KVdaLq~~V~~l~~~~~~v~~l~~klDa~~~~l~~l~~~V~~I~~ 63 (75)
T PF05531_consen 14 KAVDDKVDALQTQVDDLESNLPDVTELNKKLDAQSAQLTTLNTKVNEIQD 63 (75)
T ss_pred HHHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555666666666777766555 778888888888888888887765
No 177
>PF07851 TMPIT: TMPIT-like protein; InterPro: IPR012926 A number of members of this family are annotated as being transmembrane proteins induced by tumour necrosis factor alpha, but no literature was found to support this. ; GO: 0016021 integral to membrane
Probab=33.91 E-value=1.3e+02 Score=31.87 Aligned_cols=47 Identities=17% Similarity=0.280 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.+||+..+.+-+++..+..--=+.-..+.+|.|.+++|...+..++.
T Consensus 14 q~Lqethr~Y~qKleel~~lQ~~C~ssI~~QkkrLk~L~~sLk~~~~ 60 (330)
T PF07851_consen 14 QELQETHRSYKQKLEELSKLQDKCSSSISHQKKRLKELKKSLKRCKK 60 (330)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 45666666666666666666666666777777777777777776654
No 178
>KOG0962|consensus
Probab=33.41 E-value=2.1e+02 Score=35.38 Aligned_cols=17 Identities=12% Similarity=0.186 Sum_probs=7.6
Q ss_pred HHHHHHHHHHhHHHHhh
Q psy9560 32 LERKRAEVRRRMEEASG 48 (503)
Q Consensus 32 rERKkAEvRKRlEEA~K 48 (503)
.++++.+.|+-+|...+
T Consensus 831 ~~~~~~~~rke~E~~~k 847 (1294)
T KOG0962|consen 831 KQESLDKLRKEIECLQK 847 (1294)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444444544444433
No 179
>KOG0239|consensus
Probab=33.37 E-value=1.6e+02 Score=33.59 Aligned_cols=48 Identities=17% Similarity=0.133 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
...|..+|.-++.||+.|..+-++|..++..-++++..|...++++.-
T Consensus 229 ~~~l~~~~~~~~~~i~~l~~~l~~l~~~~~~l~~~~~~~~~~~~~~~~ 276 (670)
T KOG0239|consen 229 IKPLEGLESTIKKKIQALQQELEELKAELKELNDQVSLLTREVQEALK 276 (670)
T ss_pred hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355777788888888888888888888888888888777777776543
No 180
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=33.27 E-value=3.6e+02 Score=26.22 Aligned_cols=50 Identities=20% Similarity=0.167 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
.+|..+.-+|.++...+...+=+....+.+.+.+++.|+..|.+++-+|.
T Consensus 134 ~~Le~ki~el~~~~~~~~~~ke~~~~ei~~lks~~~~l~~~~~~~e~~F~ 183 (190)
T PF05266_consen 134 KELEMKILELQRQAAKLKEKKEAKDKEISRLKSEAEALKEEIENAELEFQ 183 (190)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555556666666666656666666666677777777777777766553
No 181
>PRK07003 DNA polymerase III subunits gamma and tau; Validated
Probab=33.24 E-value=9.2e+02 Score=28.89 Aligned_cols=44 Identities=14% Similarity=0.093 Sum_probs=27.2
Q ss_pred hhhhcCCCCCCCCCCHHHHHHHHHHHHHHHH-HHhhhhhhHHHHHh
Q psy9560 91 IDERCGEPKDTDDMAEDEIGDLCQEYWQRIY-NLEAIKYELDREIM 135 (503)
Q Consensus 91 LAERcpPPLnIDGLSedELQEkCKELHdRId-kLEEEKYDLE~KVk 135 (503)
|.-|| ..+++..|+.+++.+.++.+...-. .++++-.++-.+..
T Consensus 166 IrSRC-q~f~Fk~Ls~eeIv~~L~~Il~~EgI~id~eAL~lIA~~A 210 (830)
T PRK07003 166 VLSRC-LQFNLKQMPAGHIVSHLERILGEERIAFEPQALRLLARAA 210 (830)
T ss_pred hhhhe-EEEecCCcCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHc
Confidence 45699 4779999999999887776554321 23344444444443
No 182
>PRK00295 hypothetical protein; Provisional
Probab=33.10 E-value=93 Score=25.56 Aligned_cols=25 Identities=20% Similarity=0.258 Sum_probs=13.6
Q ss_pred hhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 128 YELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 128 YDLE~KVkKQDyEInELniKVnDLR 152 (503)
.|||.++..++.-|.+||.-|...+
T Consensus 8 ~~LE~kla~qE~tie~Ln~~v~~Qq 32 (68)
T PRK00295 8 TELESRQAFQDDTIQALNDVLVEQQ 32 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555555555554443
No 183
>KOG2004|consensus
Probab=32.98 E-value=50 Score=38.71 Aligned_cols=49 Identities=29% Similarity=0.525 Sum_probs=36.4
Q ss_pred hhhhHHHHHhhhhhhHHHHHHHHHH-hhCCCCCCccccccccHHHHHHHHhhhhccchhhhccccccc
Q psy9560 126 IKYELDREIMLKDFEITELGKQVND-LRGKFVRPILKKVSKYENKFAKLQKKAAEFNFRSQLKAVKKR 192 (503)
Q Consensus 126 EKYDLE~KVkKQDyEInELniKVnD-LRGKFKKPaLKKVRkSAnm~a~L~KhkvsmDfRANLKqVKKE 192 (503)
.|+.+-..+-+...+++.|.++|+. +.+| ++++....=||-|||..|||
T Consensus 279 ~Rl~~al~llkke~e~~klq~ki~k~vE~k------------------~~~~~r~ylL~eQlk~IKke 328 (906)
T KOG2004|consen 279 KRLEKALELLKKELELAKLQQKIGKEVEEK------------------IKQDHREYLLREQLKAIKKE 328 (906)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhHHHhh------------------hhHHHHHHHHHHHHHHHHHh
Confidence 5777788888889999999998853 3333 33444466788999999997
No 184
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=32.95 E-value=2.6e+02 Score=25.60 Aligned_cols=12 Identities=25% Similarity=0.127 Sum_probs=7.2
Q ss_pred HHHHHHHHHHHH
Q psy9560 66 LLLRKKAAEELK 77 (503)
Q Consensus 66 SLLLqKAaEELE 77 (503)
.-+++|+-+.|-
T Consensus 32 K~~v~k~Ld~L~ 43 (169)
T PF07106_consen 32 KTAVQKALDSLV 43 (169)
T ss_pred HHHHHHHHHHHH
Confidence 345666666665
No 185
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=32.78 E-value=3.4e+02 Score=32.61 Aligned_cols=22 Identities=5% Similarity=0.113 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHhhhh
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIK 127 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEK 127 (503)
..+|+...+.|-..|..++..+
T Consensus 773 I~~l~~~i~~L~~~l~~ie~~r 794 (1201)
T PF12128_consen 773 IQQLKQEIEQLEKELKRIEERR 794 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHhH
Confidence 3555666666667777776554
No 186
>KOG0239|consensus
Probab=32.71 E-value=2.6e+02 Score=32.06 Aligned_cols=47 Identities=19% Similarity=0.233 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK 154 (503)
.++++..+.+...+..|.+..=.|-.+. +.+.+...|+++|.||+|.
T Consensus 269 ~~~~~~~~~~~~~~~~L~~~~~~l~~~~-~e~~~r~kL~N~i~eLkGn 315 (670)
T KOG0239|consen 269 REVQEALKESNTLQSDLESLEENLVEKK-KEKEERRKLHNEILELKGN 315 (670)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcC
Confidence 3444455555555666666666666666 6667788888889999884
No 187
>KOG0250|consensus
Probab=32.68 E-value=4.1e+02 Score=32.41 Aligned_cols=32 Identities=19% Similarity=0.289 Sum_probs=14.3
Q ss_pred HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 122 NLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 122 kLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+++.+.=+|+..|.+....|+.|+.+.++++-
T Consensus 398 e~e~k~~~L~~evek~e~~~~~L~~e~~~~~~ 429 (1074)
T KOG0250|consen 398 ERENKLEQLKKEVEKLEEQINSLREELNEVKE 429 (1074)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333334444444444444445555444443
No 188
>PF12709 Kinetocho_Slk19: Central kinetochore-associated; InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=32.58 E-value=1.4e+02 Score=26.42 Aligned_cols=47 Identities=13% Similarity=0.211 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHHHHH-----Hhhhh----hhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYN-----LEAIK----YELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdk-----LEEEK----YDLE~KVkKQDyEInELniKVnDLRG 153 (503)
..|+.++++||..-.. |-.-| --++.+|.+...+++.|+..|..|++
T Consensus 15 ~~ve~vA~eLh~~YssKHE~KV~~LKksYe~rwek~v~~L~~e~~~l~~E~e~L~~ 70 (87)
T PF12709_consen 15 KAVEKVARELHALYSSKHETKVKALKKSYEARWEKKVDELENENKALKRENEQLKK 70 (87)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567778888764322 22222 12455666666666777776666665
No 189
>PRK14127 cell division protein GpsB; Provisional
Probab=32.53 E-value=1.7e+02 Score=26.46 Aligned_cols=54 Identities=19% Similarity=0.159 Sum_probs=44.8
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 100 DTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 100 nIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.+-|.+.+++.+...++-+-+..|..+.=+|..++.+....|.++..++...+.
T Consensus 19 ~~RGYd~~EVD~FLd~V~~dye~l~~e~~~Lk~e~~~l~~~l~e~~~~~~~~~~ 72 (109)
T PRK14127 19 SMRGYDQDEVDKFLDDVIKDYEAFQKEIEELQQENARLKAQVDELTKQVSVGAS 72 (109)
T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc
Confidence 568999999988888888888888888888888888888888888888876654
No 190
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=32.43 E-value=90 Score=25.58 Aligned_cols=36 Identities=14% Similarity=0.169 Sum_probs=22.9
Q ss_pred HHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 118 QRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 118 dRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.+|++|..+-=.|..+|.+...||+.|+.-|+..++
T Consensus 3 akid~Ls~dVq~L~~kvdqLs~dv~~lr~~v~~ak~ 38 (56)
T PF04728_consen 3 AKIDQLSSDVQTLNSKVDQLSSDVNALRADVQAAKE 38 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356666666666666666666666666666655443
No 191
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=32.23 E-value=4.2e+02 Score=25.28 Aligned_cols=50 Identities=18% Similarity=0.215 Sum_probs=29.8
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK 154 (503)
+...++....++-.++..|......+...+......|.+|+..+.+.+..
T Consensus 57 ~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~~~~~~ 106 (302)
T PF10186_consen 57 EIQQLKREIEELRERLERLRERIERLRKRIEQKRERLEELRESLEQRRSR 106 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33455555566666666666666666666666666666666666555553
No 192
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=32.21 E-value=78 Score=31.51 Aligned_cols=15 Identities=20% Similarity=0.539 Sum_probs=8.3
Q ss_pred hhHHHHHHHHHHhhC
Q psy9560 139 FEITELGKQVNDLRG 153 (503)
Q Consensus 139 yEInELniKVnDLRG 153 (503)
+.|++|+..|.+|||
T Consensus 61 ~ql~~lq~ev~~LrG 75 (263)
T PRK10803 61 QQLSDNQSDIDSLRG 75 (263)
T ss_pred HHHHHHHHHHHHHhh
Confidence 335555555555555
No 193
>PRK06800 fliH flagellar assembly protein H; Validated
Probab=32.09 E-value=2.4e+02 Score=28.49 Aligned_cols=27 Identities=26% Similarity=0.464 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy9560 69 RKKAAEELKREQERRALERKRVIDERC 95 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKiLAERc 95 (503)
|.+-...|..+++..+.+|..+|++|-
T Consensus 50 L~~e~~~l~~eqQ~l~~er~~l~~er~ 76 (228)
T PRK06800 50 LHKELNQLRQEQQKLERERQQLLADRE 76 (228)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555566666666666666655
No 194
>KOG0978|consensus
Probab=31.93 E-value=81 Score=36.35 Aligned_cols=46 Identities=24% Similarity=0.321 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.||.+|-.||.+...+.+..+.|-.+|.-..++.++|...+.|+++
T Consensus 59 ~Lq~~~~~~~~~~~~~~~~~~el~~k~s~~~~~~~e~~~~le~~~~ 104 (698)
T KOG0978|consen 59 KLQNLADHLQEKHATLSEQISELLDKISTAETEVDELEQQLEDLQA 104 (698)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHhHHHHHH
Confidence 3666666666666666666666666666666666666666666665
No 195
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=31.75 E-value=1.2e+02 Score=27.92 Aligned_cols=32 Identities=13% Similarity=0.235 Sum_probs=18.3
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhh
Q psy9560 102 DDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIML 136 (503)
Q Consensus 102 DGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkK 136 (503)
..||.++=+.+..+|-.+|. .++-+++.|+.+
T Consensus 37 Geln~eEak~~vddl~~q~k---~~~~e~e~K~~r 68 (108)
T COG3937 37 GELNAEEAKRFVDDLLRQAK---EAQGELEEKIPR 68 (108)
T ss_pred CCCCHHHHHHHHHHHHHHHH---HHhhhHHHhhhH
Confidence 44667776666666655554 455555555433
No 196
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=31.55 E-value=9.2e+02 Score=28.42 Aligned_cols=49 Identities=16% Similarity=0.205 Sum_probs=32.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHH----HHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELD----REIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE----~KVkKQDyEInELniKVnDLRGKF 155 (503)
..|.+-..++-.+|..|++.|+-.+ ..+.....+|.+|+.+|..|+++.
T Consensus 436 ~~lEea~~eker~~e~l~e~r~~~e~e~~Eele~~~~e~~~lk~~~~~LQ~eL 488 (775)
T PF10174_consen 436 ETLEEALREKERLQERLEEQRERAEKERQEELETYQKELKELKAKLESLQKEL 488 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Confidence 3455556666667777777655544 345667778888888888777753
No 197
>smart00761 HDAC_interact Histone deacetylase (HDAC) interacting. This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.
Probab=31.47 E-value=89 Score=28.09 Aligned_cols=36 Identities=25% Similarity=0.358 Sum_probs=29.4
Q ss_pred HHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHH
Q psy9560 114 QEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVN 149 (503)
Q Consensus 114 KELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVn 149 (503)
.++-+.|.++|+|||++..-+..+..-|+-|..=++
T Consensus 55 NqyEE~Lfr~EDeR~E~D~~ie~~~~ti~~le~l~~ 90 (102)
T smart00761 55 NQYEEALFRCEDERFELDMVIESNSSTIKLLEEILN 90 (102)
T ss_pred cHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 677889999999999999999988887776654333
No 198
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=31.11 E-value=6e+02 Score=26.14 Aligned_cols=44 Identities=9% Similarity=0.053 Sum_probs=26.8
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhh------------------hhhhHHHHHHHHHHhhCCC
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIML------------------KDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkK------------------QDyEInELniKVnDLRGKF 155 (503)
...+|..++..+..++.+...+... ...+|.+|..++.+|+..|
T Consensus 216 ~l~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~l~~~l~~le~~l~~l~~~y 277 (444)
T TIGR03017 216 RLNELSAQLVAAQAQVMDASSKEGGSSGKDALPEVIANPIIQNLKTDIARAESKLAELSQRL 277 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcccchhhhcChHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3456667777777777777665432 3345556666666666655
No 199
>PRK07191 flgK flagellar hook-associated protein FlgK; Validated
Probab=30.74 E-value=85 Score=33.38 Aligned_cols=48 Identities=10% Similarity=0.091 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
.+.+|..|+.--..|+..|.|+...+.-.-.+||.|-.+|.+|..+..
T Consensus 136 a~~la~~~n~~~~~l~~~~~~~~~~i~~~V~~iN~ll~~Ia~LN~~I~ 183 (456)
T PRK07191 136 ANAMALRFNNVNNFIVQQKKSIGQQRDATVKQINSLTRSIADYNQKIL 183 (456)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 366888888888899999999999888888888888888888877553
No 200
>PLN02316 synthase/transferase
Probab=30.74 E-value=61 Score=38.68 Aligned_cols=42 Identities=31% Similarity=0.332 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHhHHHHhhhhcc---CCCCCHHHHHHHHHH
Q psy9560 26 RLKLEELERKRAEVRRRMEEASGARRK---KGFMTPERKKKLRLL 67 (503)
Q Consensus 26 kRKqeErERKkAEvRKRlEEA~KkKAK---KgKMT~eRKkkLKSL 67 (503)
.+.|+||+|..+|+|+|-++....++. -+..+..|+..|+.|
T Consensus 262 a~e~ae~~~~~ee~~r~~~~kaa~~a~~a~akae~~~~~~~~~~~ 306 (1036)
T PLN02316 262 AKEEAERERQAEEQRRREEEKAAMEADRAQAKAEVEKRREKLQNL 306 (1036)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHH
Confidence 334567777777777665554322222 233344444455544
No 201
>PRK03992 proteasome-activating nucleotidase; Provisional
Probab=30.67 E-value=1.4e+02 Score=30.89 Aligned_cols=47 Identities=19% Similarity=0.243 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
..|.+..++|..++.+|....-.|....+....+|..|+.++..|++
T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 50 (389)
T PRK03992 4 EALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKS 50 (389)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 56777778888888889988888888888888999999999988876
No 202
>PRK02119 hypothetical protein; Provisional
Probab=30.54 E-value=89 Score=26.06 Aligned_cols=30 Identities=20% Similarity=0.206 Sum_probs=19.5
Q ss_pred hhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 124 EAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 124 EEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
++---|||.++..|+.-|.+||.-|...+.
T Consensus 8 e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~ 37 (73)
T PRK02119 8 ENRIAELEMKIAFQENLLEELNQALIEQQF 37 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333345667777777777777777766554
No 203
>PF10212 TTKRSYEDQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019348 This entry represents a C-terminal 500 residue region, which contains a conserved TTKRSYEDQ motif. It is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain an N-terminal domain with a KLRAQ motif (IPR019343 from INTERPRO). The function of these proteins is not known.
Probab=30.41 E-value=5.8e+02 Score=28.86 Aligned_cols=62 Identities=21% Similarity=0.350 Sum_probs=42.4
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHHHHhhhhccc
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAKLQKKAAEFN 181 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~L~KhkvsmD 181 (503)
+..-|+-|+.+++..+.+|=.+...++.-...|+-|.....-.|+ -|+.++.+|..|=.+|+
T Consensus 439 f~~Ec~aL~~rL~~aE~ek~~l~eeL~~a~~~i~~LqDEL~TTr~-----------NYE~QLs~MSEHLasmN 500 (518)
T PF10212_consen 439 FYAECRALQKRLESAEKEKESLEEELKEANQNISRLQDELETTRR-----------NYEEQLSMMSEHLASMN 500 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----------hHHHHHHHHHHHHHHHH
Confidence 456899999999999888877766665555555555444433333 35678888888866664
No 204
>PRK00736 hypothetical protein; Provisional
Probab=30.33 E-value=1.1e+02 Score=25.15 Aligned_cols=24 Identities=25% Similarity=0.346 Sum_probs=12.2
Q ss_pred hHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 129 ELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 129 DLE~KVkKQDyEInELniKVnDLR 152 (503)
|||.|+..++.-|.+||.-|...+
T Consensus 9 ~LE~klafqe~tie~Ln~~v~~Qq 32 (68)
T PRK00736 9 ELEIRVAEQEKTIEELSDQLAEQW 32 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555555555555555544433
No 205
>KOG2072|consensus
Probab=30.23 E-value=8.6e+02 Score=29.50 Aligned_cols=8 Identities=25% Similarity=0.517 Sum_probs=3.8
Q ss_pred cchhhhcc
Q psy9560 180 FNFRSQLK 187 (503)
Q Consensus 180 mDfRANLK 187 (503)
.+|..|+|
T Consensus 756 ~~f~e~vk 763 (988)
T KOG2072|consen 756 DKFKEHVK 763 (988)
T ss_pred HHHHHHHh
Confidence 34444544
No 206
>PRK10884 SH3 domain-containing protein; Provisional
Probab=29.97 E-value=1.5e+02 Score=29.10 Aligned_cols=48 Identities=13% Similarity=0.168 Sum_probs=39.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
..+|++...+.-+.|..|+++.=.|...+.....++.+|+.++.+++.
T Consensus 120 ~~~l~~~~~~~~~~~~~L~~~n~~L~~~l~~~~~~~~~l~~~~~~~~~ 167 (206)
T PRK10884 120 TAEMQQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDAANLQLDDKQR 167 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356777777788888888888888888888888888888888888876
No 207
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=29.87 E-value=3.8e+02 Score=23.96 Aligned_cols=67 Identities=19% Similarity=0.209 Sum_probs=44.8
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC----CCCCCccccccccHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG----KFVRPILKKVSKYENKFAKL 173 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG----KFKKPaLKKVRkSAnm~a~L 173 (503)
++|.+-...|++.|..|...-+.|...-....-|...|+.++.++.. +-.+-.-+.++.+-+-|++|
T Consensus 11 ~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~~~~~~~~~~~~~~~~~g~~NL~~L 81 (107)
T PF06156_consen 11 DQLEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEELEQEEEEKEEKKTKKKLGEGRDNLARL 81 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccchHHHHHHH
Confidence 45666667777777777777777777777777888888888888755 12223333455555446665
No 208
>KOG2129|consensus
Probab=29.72 E-value=96 Score=34.40 Aligned_cols=29 Identities=31% Similarity=0.528 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHH
Q psy9560 106 EDEIGDLCQEYWQRIYNLEAIKYELDREI 134 (503)
Q Consensus 106 edELQEkCKELHdRIdkLEEEKYDLE~KV 134 (503)
+++-..|...||.|||+|+.||--|..|+
T Consensus 196 EQEqEalvN~LwKrmdkLe~ekr~Lq~Kl 224 (552)
T KOG2129|consen 196 EQEQEALVNSLWKRMDKLEQEKRYLQKKL 224 (552)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34445689999999999999988777776
No 209
>KOG0933|consensus
Probab=29.51 E-value=3.2e+02 Score=33.35 Aligned_cols=43 Identities=12% Similarity=0.159 Sum_probs=26.7
Q ss_pred HHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 114 QEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 114 KELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
..+-..|..|..+-=+++.+|.....+.+.+...|+|...||+
T Consensus 839 ~~~~~~~~~l~~e~~~l~~kv~~~~~~~~~~~~el~~~k~k~~ 881 (1174)
T KOG0933|consen 839 EQLEKQISSLKSELGNLEAKVDKVEKDVKKAQAELKDQKAKQR 881 (1174)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHhHHHHHHHHHHHHHHHHH
Confidence 3344455566666666677777776666666666666665443
No 210
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=29.49 E-value=1.6e+02 Score=30.14 Aligned_cols=27 Identities=26% Similarity=0.224 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHhhhh
Q psy9560 24 AKRLKLEELERKRAEVRRRMEEASGAR 50 (503)
Q Consensus 24 d~kRKqeErERKkAEvRKRlEEA~KkK 50 (503)
++.|.+.+.+-.++....|.|++..+|
T Consensus 261 ~k~R~~~~~~~~K~~~~~r~E~~~~~k 287 (321)
T PF07946_consen 261 KKNREEEEEKILKEAHQERQEEAQEKK 287 (321)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566666666667777777888776555
No 211
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=29.48 E-value=1.3e+02 Score=23.57 Aligned_cols=28 Identities=29% Similarity=0.275 Sum_probs=15.9
Q ss_pred hhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 126 IKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 126 EKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+.=+++.++.+.+.++.+|+.+|..|++
T Consensus 25 ei~~l~~~i~~l~~e~~~L~~ei~~l~~ 52 (80)
T PF04977_consen 25 EIAELQKEIEELKKENEELKEEIERLKN 52 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3334455555566666666666666633
No 212
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=29.46 E-value=1.7e+02 Score=26.83 Aligned_cols=25 Identities=24% Similarity=0.215 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELD 131 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE 131 (503)
++|++.+.++...+..+.++.|++.
T Consensus 98 ~~l~~~~~~~~~~l~~~~~~~~~~~ 122 (191)
T PF04156_consen 98 DQLQERIQELESELEKLKEDLQELR 122 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 3455555555555555555555444
No 213
>PRK09174 F0F1 ATP synthase subunit B'; Validated
Probab=29.39 E-value=5.4e+02 Score=25.03 Aligned_cols=27 Identities=7% Similarity=0.008 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHh
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIM 135 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVk 135 (503)
..++|-+|..+|-..+-++=|+...|.
T Consensus 173 a~e~A~~I~~Kllg~~~dk~~~~~av~ 199 (204)
T PRK09174 173 AEETAAAIVEQLIGGTADKASVAAAVK 199 (204)
T ss_pred HHHHHHHHHHHHhCcccCHHHHHHHHH
Confidence 455666666666665555555555554
No 214
>KOG0964|consensus
Probab=29.25 E-value=9e+02 Score=29.90 Aligned_cols=134 Identities=12% Similarity=0.178 Sum_probs=0.0
Q ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHhHHHHhhhh-ccCCCCCHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhh
Q psy9560 16 AAASAEDEAKRLKLEELERKRAEVRRRMEEASGAR-RKKGFMTPERKKKLRLLLRKKAAEELK-REQERRALERKRVIDE 93 (503)
Q Consensus 16 ~~~mad~ed~kRKqeErERKkAEvRKRlEEA~KkK-AKKgKMT~eRKkkLKSLLLqKAaEELE-KEKEqKeEEKKKiLAE 93 (503)
+|.|-.+-+.+--|---+-++.|-..+.=...-.. ..--.+-..-++.+-..|..|-....+ +--....+...+++..
T Consensus 690 ~r~~i~~~~~~i~q~~~~~qk~e~~~~~~~~~~~~l~~e~~~~k~e~~~v~~s~~~k~~~Le~i~~~l~~~~~~~~~~e~ 769 (1200)
T KOG0964|consen 690 VRNEIEDIDQKIDQLNNNMQKVENDRNAFKREHEKLKRELNTIKGEKSRVQESLEPKGKELEEIKTSLHKLESQSNYFES 769 (1200)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHhHHH
Q ss_pred hcCCCCCC-CCCCHHH------HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHH-HHHHHHHh
Q psy9560 94 RCGEPKDT-DDMAEDE------IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITE-LGKQVNDL 151 (503)
Q Consensus 94 RcpPPLnI-DGLSedE------LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInE-LniKVnDL 151 (503)
-.+ -++ |.|+.++ |..-+++||.++..|-.+|.||+.+...-++.|+. |..|+++|
T Consensus 770 el~--sel~sqLt~ee~e~l~kLn~eI~~l~~kl~~~~~er~~~~~rk~~le~~l~~kL~~r~~~l 833 (1200)
T KOG0964|consen 770 ELG--SELFSQLTPEELERLSKLNKEINKLSVKLRALREERIDIETRKTALEANLNTKLYKRVNEL 833 (1200)
T ss_pred HHh--HHHHhhcCHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHH
No 215
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=29.24 E-value=1.4e+02 Score=28.58 Aligned_cols=27 Identities=44% Similarity=0.491 Sum_probs=18.2
Q ss_pred hhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 127 KYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 127 KYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
+.+++.++...+.++++|..++++|+-
T Consensus 122 ~~~l~~~i~~L~~e~~~L~~~~~~l~~ 148 (189)
T PF10211_consen 122 KQELEEEIEELEEEKEELEKQVQELKN 148 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566666666666677777777776655
No 216
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=28.94 E-value=4.3e+02 Score=30.46 Aligned_cols=16 Identities=13% Similarity=0.235 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHhhhhc
Q psy9560 80 QERRALERKRVIDERC 95 (503)
Q Consensus 80 KEqKeEEKKKiLAERc 95 (503)
+-....+|+..|..||
T Consensus 601 R~e~a~d~Qe~L~~R~ 616 (717)
T PF10168_consen 601 RYEEAKDKQEKLMKRV 616 (717)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4455555666666665
No 217
>PF13851 GAS: Growth-arrest specific micro-tubule binding
Probab=28.91 E-value=1.5e+02 Score=28.70 Aligned_cols=117 Identities=17% Similarity=0.212 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhh---hHHHHHhhhhhhHHHH
Q psy9560 68 LRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKY---ELDREIMLKDFEITEL 144 (503)
Q Consensus 68 LLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKY---DLE~KVkKQDyEInEL 144 (503)
|++.=++++. +...+.+..++.|.+....-..+.- ....++.-+.+|+..+...+.+|- .+..+++..+.+|.+|
T Consensus 28 lIksLKeei~-emkk~e~~~~k~m~ei~~eN~~L~e-pL~~a~~e~~eL~k~L~~y~kdK~~L~~~k~rl~~~ek~l~~L 105 (201)
T PF13851_consen 28 LIKSLKEEIA-EMKKKEERNEKLMAEISQENKRLSE-PLKKAEEEVEELRKQLKNYEKDKQSLQNLKARLKELEKELKDL 105 (201)
T ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHhhCCCCCCccccccccHHHHHHHHhhhhccchhhhc
Q psy9560 145 GKQVNDLRGKFVRPILKKVSKYENKFAKLQKKAAEFNFRSQL 186 (503)
Q Consensus 145 niKVnDLRGKFKKPaLKKVRkSAnm~a~L~KhkvsmDfRANL 186 (503)
...-..|.=+|.+=---|=..+..--.++....-..+||++|
T Consensus 106 k~e~evL~qr~~kle~ErdeL~~kf~~~i~evqQk~~~kn~l 147 (201)
T PF13851_consen 106 KWEHEVLEQRFEKLEQERDELYRKFESAIQEVQQKTGLKNLL 147 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 218
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=28.77 E-value=1.4e+02 Score=26.06 Aligned_cols=51 Identities=18% Similarity=0.194 Sum_probs=42.5
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
-.++|++.-..|.+.++.+-+.|-.|+....+...|-+.-..|++-|=||+
T Consensus 26 EieELKekn~~L~~e~~~~~~~r~~L~~en~qLk~E~~~WqerLr~LLGkm 76 (79)
T PRK15422 26 EIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQERLQALLGRM 76 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 457788888888888888888888888888888888888888888888875
No 219
>PRK08147 flgK flagellar hook-associated protein FlgK; Validated
Probab=28.68 E-value=91 Score=33.78 Aligned_cols=48 Identities=15% Similarity=0.126 Sum_probs=40.5
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
.+.||..||.--..|+..+-|+...+...-.+||.|-.+|.+|..+..
T Consensus 137 a~~l~~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~l~~~Ia~LN~~I~ 184 (547)
T PRK08147 137 AEGLVNQFKTTDQYLRDQDKGVNTAIGSSVDQINNYAKQIASLNDQIT 184 (547)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466889999988999999999999999888888888888888877543
No 220
>PF03082 MAGSP: Male accessory gland secretory protein; InterPro: IPR004315 The accessory gland of male insects is a genital tissue that secretes many components of the ejaculatory fluid, some of which affect the female's receptivity to courtship and her rate of oviposition. The protein is expressed exclusively in the male accessory glands of adult Drosophila melanogaster. During copulation it is transferred to the female genital tract where it is rapidly altered [].; GO: 0007618 mating, 0005576 extracellular region
Probab=28.60 E-value=1.7e+02 Score=29.99 Aligned_cols=77 Identities=21% Similarity=0.231 Sum_probs=48.6
Q ss_pred HHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCcccccccc
Q psy9560 87 RKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKY 166 (503)
Q Consensus 87 KKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkS 166 (503)
++|.+-||.- -+-+.+.|..=++++ ..-..-|-+..---.|||..++-.+.-|-+||.+++|.|. -.||--||-.+-
T Consensus 130 qqrL~~E~n~-s~~fRN~SV~LM~Ei-e~rK~eIl~~Rq~NldLE~eLndanRkilElNlqlqdaRk-svkpCkKr~~kd 206 (264)
T PF03082_consen 130 QQRLLLEQNN-SFMFRNISVALMKEI-EARKTEILKARQSNLDLELELNDANRKILELNLQLQDARK-SVKPCKKRNSKD 206 (264)
T ss_pred HHHHHHhhcc-chhhhhhHHHHHHHH-HHHHHHHHHHHhhCCceeeehhHHHHHHHHHHHHHHHhhc-ccCCCcCCCccc
Confidence 3445555552 334444454333322 2222334455556678999999999999999999999987 778876665444
No 221
>COG3851 UhpB Signal transduction histidine kinase, glucose-6-phosphate specific [Signal transduction mechanisms]
Probab=28.60 E-value=2.8e+02 Score=30.75 Aligned_cols=66 Identities=18% Similarity=0.226 Sum_probs=45.9
Q ss_pred HHHHHHHHHHHHhhhhcCCCCCCCCCCHHHH-HHHHHHHHHHHHHHhh---------hhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 79 EQERRALERKRVIDERCGEPKDTDDMAEDEI-GDLCQEYWQRIYNLEA---------IKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 79 EKEqKeEEKKKiLAERcpPPLnIDGLSedEL-QEkCKELHdRIdkLEE---------EKYDLE~KVkKQDyEInELniKV 148 (503)
+..+++-.+++.|+++.. + +++++ +++++||||.|-+=-. +|-.=...++.+-.-|+.|..+|
T Consensus 279 qrL~~EL~~~raLaeqLi-----s--tEEsiRk~vARELHDeIGQnITAIr~Qa~ivkR~~~~~q~kqaas~Ie~LslrI 351 (497)
T COG3851 279 QRLQKELARNRALAEQLI-----S--TEESIRKDVARELHDEIGQNITAIRTQAGIVKRAADNAQVKQAASLIEQLSLRI 351 (497)
T ss_pred HHHHHHHHHhHHHHHHHH-----h--hHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHhccCCHhHHhHHHHHHHHHHHH
Confidence 455677888899999883 3 44454 7899999998864211 23333445666777899999998
Q ss_pred HHh
Q psy9560 149 NDL 151 (503)
Q Consensus 149 nDL 151 (503)
-|-
T Consensus 352 ~~s 354 (497)
T COG3851 352 YDS 354 (497)
T ss_pred HHH
Confidence 773
No 222
>PF00769 ERM: Ezrin/radixin/moesin family; InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=28.43 E-value=6e+02 Score=25.29 Aligned_cols=25 Identities=28% Similarity=0.500 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhh
Q psy9560 69 RKKAAEELKREQERRALERKRVIDE 93 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKiLAE 93 (503)
++-.+..|.+++....++++++-..
T Consensus 45 aeeea~~Le~k~~eaee~~~rL~~~ 69 (246)
T PF00769_consen 45 AEEEAEELEQKRQEAEEEKQRLEEE 69 (246)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455555555555555555443
No 223
>cd06225 HAMP Histidine kinase, Adenylyl cyclase, Methyl-accepting protein, and Phosphatase (HAMP) domain. HAMP is a signaling domain which occurs in a wide variety of signaling proteins, many of which are bacterial. The HAMP domain consists of two alpha helices connected by an extended linker. The structure of the HAMP dimer from Archaeoglobus fulgidus has been solved using nuclear magnetic resonance, revealing a parallel four-helix bundle; this structure has been confirmed by cross-linking analysis of HAMP domains from the Escherichia coli aerotaxis receptor Aer. It has been suggested that the four-helix arrangement can rotate between the unusually packed conformation observed in the NMR structure and a canonical coiled-coil arrangement. Such rotation may coincide with signal transduction, but a common mechanism by which HAMP domains relay a variety of input signals has yet to be established.
Probab=28.39 E-value=32 Score=21.40 Aligned_cols=39 Identities=15% Similarity=0.167 Sum_probs=22.2
Q ss_pred HHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 116 YWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 116 LHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
|-+.+..+.+..|.....+.. ..|+..|...++.|+.++
T Consensus 8 l~~~~~~~~~g~~~~~~~~~~-~~e~~~l~~~~~~l~~~~ 46 (48)
T cd06225 8 LAEAAQRIAAGDLDVRLPVTG-RDEIGELARAFNQMAERL 46 (48)
T ss_pred HHHHHHHHhCCCCceeecCCC-CchHHHHHHHHHHHHHHH
Confidence 444455555555555444443 346777777777776543
No 224
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=28.33 E-value=80 Score=25.94 Aligned_cols=28 Identities=21% Similarity=0.181 Sum_probs=19.8
Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHhhh
Q psy9560 99 KDTDDMAEDEIGDLCQEYWQRIYNLEAI 126 (503)
Q Consensus 99 LnIDGLSedELQEkCKELHdRIdkLEEE 126 (503)
.+|+.||.++|.+-+--|..-|..++.+
T Consensus 16 ~dLs~lSv~EL~~RIa~L~aEI~R~~~~ 43 (59)
T PF06698_consen 16 EDLSLLSVEELEERIALLEAEIARLEAA 43 (59)
T ss_pred CCchhcCHHHHHHHHHHHHHHHHHHHHH
Confidence 4789999998887666666666655543
No 225
>PF04120 Iron_permease: Low affinity iron permease ; InterPro: IPR007251 Although originally identified as a low-affinity iron(II) permease [, ], Fet4 has since been shown to import several other transition metal ions, including copper [, ] and zinc []. Copper, cobalt, and cadmium inhibit Fet4 [, ]. Fet4 is an integral protein of the plasma membrane [, ]. FET4 is not essential, not even in fet3 fet4 double mutants []. Over expression of FET4 improves growth under alkaline conditions []. Transcription of FET4 is induced by Aft1 in response to low levels of iron [, , ] or by Zap1 in response to low zinc [, ], but not in response to low copper []. When the high-affinity iron permease component Fet3 is deleted, FET4 is induced by the addition of copper, zinc, cobalt, or manganese []. It is also induced under anaerobic conditions [, , ] and repressed by Rox1 in aerobic conditions [, ]. Rox1 attenuates the activation of FET4 by Aft1 or Zap1 []. ; GO: 0055085 transmembrane transport
Probab=28.15 E-value=3.1e+02 Score=25.56 Aligned_cols=46 Identities=20% Similarity=0.235 Sum_probs=29.0
Q ss_pred HHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhh
Q psy9560 83 RALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKY 128 (503)
Q Consensus 83 KeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKY 128 (503)
|..+=-+++........+|++|++++|.++.+.|++.-..-....|
T Consensus 74 KLdeLi~~~~~a~n~li~iE~l~~~el~~~~~~~~~~~~~~~~~~~ 119 (132)
T PF04120_consen 74 KLDELIRAVKEARNELIDIEDLTEEELEEIRKRYERLAEQARERHD 119 (132)
T ss_pred HHHHHHHHHHHHHHHhCCcccCCHHHHHHHHHHHHHHHHHhhhhcc
Confidence 3333333444333346799999999999988777766555544443
No 226
>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=28.04 E-value=3.2e+02 Score=21.97 Aligned_cols=76 Identities=22% Similarity=0.204 Sum_probs=48.5
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHH
Q psy9560 54 GFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDRE 133 (503)
Q Consensus 54 gKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~K 133 (503)
..+|.+-+..|+.++... ...+..-..+....+..+...-. =+.+|.+.|..+.+++.+.-..+...+.+.-..
T Consensus 40 l~Lt~eQ~~~l~~~~~~~-~~~~~~~r~~~~~~r~~l~~ll~-----~~~~D~~~i~a~~~~~~~~~~~l~~~~~~~~~~ 113 (125)
T PF13801_consen 40 LNLTPEQQAKLRALMDEF-RQEMRALRQELRAARQELRALLA-----APPPDEAAIEALLEEIREAQAELRQERLEHLLE 113 (125)
T ss_dssp S-TTHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHC-----CSSS-HHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHc-----CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 568999999988876443 34444444555556666655433 234577888888888888877777766665544
Q ss_pred Hh
Q psy9560 134 IM 135 (503)
Q Consensus 134 Vk 135 (503)
+.
T Consensus 114 ~~ 115 (125)
T PF13801_consen 114 IR 115 (125)
T ss_dssp HH
T ss_pred HH
Confidence 43
No 227
>KOG4438|consensus
Probab=27.85 E-value=4.9e+02 Score=28.98 Aligned_cols=59 Identities=15% Similarity=0.219 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhh--hhHHHHHhhhhhhHHHHHHH
Q psy9560 78 REQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIK--YELDREIMLKDFEITELGKQ 147 (503)
Q Consensus 78 KEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEK--YDLE~KVkKQDyEInELniK 147 (503)
++.+..-.+|.++|.+.+ +-|+-+-.+|+.-++++.++. ||--.+...-..+|-.|+.+
T Consensus 261 k~~~~~l~~K~~iL~ekv-----------~~~qti~~e~~~~lk~i~~~~~e~d~~Et~~v~lke~~~Le~q 321 (446)
T KOG4438|consen 261 KSAMVELQEKAKILEEKV-----------TNLQTIEKELKALLKKISSDGVEYDSLETKVVELKEILELEDQ 321 (446)
T ss_pred HHHHHHHHHHHHHHHhHh-----------HHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHH
Confidence 345556667777777655 226666777777777777776 44333333333444444443
No 228
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=27.72 E-value=1.7e+02 Score=30.41 Aligned_cols=47 Identities=19% Similarity=0.210 Sum_probs=40.7
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.+|++...+|..+++.+-.+|-.|-..+....-++++++.|+..|+|
T Consensus 131 ~d~ke~~ee~kekl~E~~~EkeeL~~eleele~e~ee~~erlk~le~ 177 (290)
T COG4026 131 MDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEV 177 (290)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45777778888888888899999999999999999999999988887
No 229
>PF04048 Sec8_exocyst: Sec8 exocyst complex component specific domain; InterPro: IPR007191 Sec8 is a component of the exocyst complex involved in the docking of exocystic vesicles with a fusion site on the plasma membrane. The exocyst complex is composed of Sec3, Sec5, Sec6, Sec8, Sec10, Sec15, Exo70 and Exo84.; GO: 0006904 vesicle docking involved in exocytosis, 0015031 protein transport, 0000145 exocyst
Probab=27.49 E-value=2.2e+02 Score=25.75 Aligned_cols=59 Identities=15% Similarity=0.141 Sum_probs=35.5
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC--CCCCccccccccH
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK--FVRPILKKVSKYE 167 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK--FKKPaLKKVRkSA 167 (503)
|+.++.+.|+-+...=..==.|-..+......|..|+..+.+-+.. .+++-|++.....
T Consensus 56 L~~vV~eh~q~Fn~sI~sy~~i~~~i~~sq~~i~~lK~~L~~ak~~L~~~~~eL~~L~~~s 116 (142)
T PF04048_consen 56 LQEVVNEHYQGFNSSIGSYSQILSSISESQERIRELKESLQEAKSLLGCRREELKELWQRS 116 (142)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHH
Confidence 4444444444444433333345566677777788888887776654 5777777776655
No 230
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=27.37 E-value=4.5e+02 Score=24.58 Aligned_cols=9 Identities=22% Similarity=0.741 Sum_probs=3.7
Q ss_pred hhhhccccc
Q psy9560 182 FRSQLKAVK 190 (503)
Q Consensus 182 fRANLKqVK 190 (503)
||+.+-++|
T Consensus 143 lr~~iE~~K 151 (177)
T PF07798_consen 143 LRTEIESLK 151 (177)
T ss_pred HHHHHHHHH
Confidence 444444433
No 231
>PF08295 Sin3_corepress: Sin3 family co-repressor; InterPro: IPR013194 This domain is found on transcriptional regulators. It forms interactions with histone deacetylases [].
Probab=27.35 E-value=1.3e+02 Score=26.72 Aligned_cols=35 Identities=23% Similarity=0.287 Sum_probs=29.3
Q ss_pred HHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 114 QEYWQRIYNLEAIKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 114 KELHdRIdkLEEEKYDLE~KVkKQDyEInELniKV 148 (503)
..+=+.|.++|+|||++...+..+..-|.-|..=.
T Consensus 54 nqyEE~lf~~EDeR~E~D~~ie~~~~tI~~Le~l~ 88 (101)
T PF08295_consen 54 NQYEEALFRCEDERFELDMLIESNRSTIKLLEELQ 88 (101)
T ss_pred hHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 67888999999999999999998888877665433
No 232
>PF03656 Pam16: Pam16; InterPro: IPR005341 The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM) []. In Saccharomyces cerevisiae (Baker's yeast), Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane [].; PDB: 2GUZ_J.
Probab=27.20 E-value=58 Score=29.96 Aligned_cols=56 Identities=13% Similarity=0.220 Sum_probs=33.7
Q ss_pred hhhhcCCCCCCCC-CCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHH-HHHHH
Q psy9560 91 IDERCGEPKDTDD-MAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEIT-ELGKQ 147 (503)
Q Consensus 91 LAERcpPPLnIDG-LSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEIn-ELniK 147 (503)
|.|-| ..|||+. ++.+++++..+.||+.=|.-..=-|=|..||-+-.--|. +|..+
T Consensus 57 l~EA~-~ILnv~~~~~~eeI~k~y~~Lf~~Nd~~kGGSfYLQSKV~rAKErl~~El~~~ 114 (127)
T PF03656_consen 57 LDEAR-QILNVKEELSREEIQKRYKHLFKANDPSKGGSFYLQSKVFRAKERLEQELKEE 114 (127)
T ss_dssp HHHHH-HHHT--G--SHHHHHHHHHHHHHHT-CCCTS-HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHH-HHcCCCCccCHHHHHHHHHHHHhccCCCcCCCHHHHHHHHHHHHHHHHHHHHH
Confidence 33444 3455555 788888888888887766655567888888887655554 44443
No 233
>PRK15396 murein lipoprotein; Provisional
Probab=27.13 E-value=2.5e+02 Score=24.14 Aligned_cols=53 Identities=8% Similarity=0.134 Sum_probs=45.5
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVR 157 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKK 157 (503)
..++|+.-...|..++++|..+-=++...+..-+-|-.--|+||..+-..|+|
T Consensus 26 kvd~LssqV~~L~~kvdql~~dv~~~~~~~~~a~~eA~raN~RlDn~~~sy~k 78 (78)
T PRK15396 26 KIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNQATKYRK 78 (78)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 66788888899999999999998888888888888899999998877776765
No 234
>KOG3866|consensus
Probab=27.11 E-value=93 Score=33.53 Aligned_cols=32 Identities=13% Similarity=0.367 Sum_probs=24.7
Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHh
Q psy9560 99 KDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIM 135 (503)
Q Consensus 99 LnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVk 135 (503)
+|..| |.++| +|.|.-.|-||-.+||--.-+.
T Consensus 220 vnhPG-SkdQL----kEVWEE~DgLdpn~fdPKTFF~ 251 (442)
T KOG3866|consen 220 VNHPG-SKDQL----KEVWEESDGLDPNQFDPKTFFA 251 (442)
T ss_pred CCCCC-cHHHH----HHHHHHhcCCCcccCCcchhee
Confidence 46777 77777 6779999999999998665554
No 235
>PF10264 Stork_head: Winged helix Storkhead-box1 domain; InterPro: IPR019391 In humans the Storkhead-box protein controls polyploidization of extravillus trophoblast and is implicated in pre-eclampsia []. This entry represents the conserved N-terminal winged-helix domain, which is likely to bind DNA.
Probab=27.01 E-value=1.1e+02 Score=26.44 Aligned_cols=57 Identities=28% Similarity=0.369 Sum_probs=43.9
Q ss_pred CCCCCCCCCChhhhhHHhhccccCCCCccHHHHHHHHHHhhhcCchhhHHHHHHHHHh
Q psy9560 273 PTPPSPVTPNPEELWLYLKDTVSASNPVSIDDLVEKLMSAVSATPQPLLKSVVKDTLR 330 (503)
Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (503)
|.+.+--+|-+|-|...+-|.-++.-+|+++.|++.|+.....-..|-. .+|.++|-
T Consensus 3 pi~Q~qfiPL~EvlC~~I~dln~~~~~at~E~l~~~L~~~yp~i~~Ps~-e~l~~~L~ 59 (80)
T PF10264_consen 3 PISQSQFIPLPEVLCWVISDLNAAGQPATQETLREHLRKHYPGIAIPSQ-EVLYNTLG 59 (80)
T ss_pred ccccccceeHHHHHHHHHHHHhccCCcchHHHHHHHHHHhCCCCCCCCH-HHHHHHHH
Confidence 3445556788899999999999999999999999999998876654543 35566653
No 236
>PF08946 Osmo_CC: Osmosensory transporter coiled coil; InterPro: IPR015041 The osmosensory transporter coiled coil is a C-terminal domain found in various bacterial osmoprotective transporters, such as ProP, Proline/betaine transporter, Proline permease 2 and the citrate proton symporters. It adopts an antiparallel coiled-coil structure, and is essential for osmosensory and osmoprotectant transporter function []. ; PDB: 1R48_B.
Probab=27.00 E-value=75 Score=25.36 Aligned_cols=36 Identities=28% Similarity=0.330 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
.+-++|.++-|+. ||.||..-|.+|.+|..|-+.|-
T Consensus 4 ~EAkelLqe~~d~----------IEqkiedid~qIaeLe~KR~~Lv 39 (46)
T PF08946_consen 4 AEAKELLQEHYDN----------IEQKIEDIDEQIAELEAKRQRLV 39 (46)
T ss_dssp -----------TH----------HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHh----------HHHhHHHHHHHHHHHHHHHHHHH
Confidence 4556666776665 45667677777877777755553
No 237
>KOG0978|consensus
Probab=26.92 E-value=7.1e+02 Score=29.14 Aligned_cols=25 Identities=12% Similarity=0.076 Sum_probs=12.8
Q ss_pred hHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 129 ELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 129 DLE~KVkKQDyEInELniKVnDLRG 153 (503)
++-++.++-+-|+..|+.++.+++.
T Consensus 598 ~~~~k~~rleEE~e~L~~kle~~k~ 622 (698)
T KOG0978|consen 598 IEKFKRKRLEEELERLKRKLERLKK 622 (698)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 3334445555555566666555443
No 238
>KOG0243|consensus
Probab=26.88 E-value=5.1e+02 Score=31.62 Aligned_cols=59 Identities=20% Similarity=0.250 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHHHhHHHHhhhhccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q psy9560 29 LEELERKRAEVRRRMEEASGARRKKGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERC 95 (503)
Q Consensus 29 qeErERKkAEvRKRlEEA~KkKAKKgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERc 95 (503)
=.|++|-|.+++.=.| +-.-.|+.+|=... .. ..++..++-++.+.+-+..++.|.+..
T Consensus 410 ~~EIerLK~dl~AaRe------KnGvyisee~y~~~-e~-e~~~~~~~ieele~el~~~~~~l~~~~ 468 (1041)
T KOG0243|consen 410 YEEIERLKRDLAAARE------KNGVYISEERYTQE-EK-EKKEMAEQIEELEEELENLEKQLKDLT 468 (1041)
T ss_pred HHHHHHHHHHHHHhHh------hCceEechHHHHHH-HH-HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3567777666652221 12346666652111 00 122333333355666666666666655
No 239
>PF05276 SH3BP5: SH3 domain-binding protein 5 (SH3BP5); InterPro: IPR007940 The SH3 domain-binding protein inhibits the auto and transphophorylation of BTK and acts as a negative regulator of BTK-related signalling in B cells.
Probab=26.86 E-value=6.8e+02 Score=25.37 Aligned_cols=99 Identities=19% Similarity=0.194 Sum_probs=66.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHhhhhcCCCCCCCC-CCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHH
Q psy9560 68 LRKKAAEELKREQERRALERK-RVIDERCGEPKDTDD-MAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELG 145 (503)
Q Consensus 68 LLqKAaEELEKEKEqKeEEKK-KiLAERcpPPLnIDG-LSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELn 145 (503)
-.|+|+...++-...-...|+ -+|+|.+ +..-++ --+...++++..=-.++...+.+|-+.+..+.+.-...+.+.
T Consensus 78 e~q~Aa~~yerA~~~h~aAKe~v~laEq~--l~~~~~~~~D~~wqEmLn~A~~kVneAE~ek~~ae~eH~~~~~~~~~ae 155 (239)
T PF05276_consen 78 EAQKAALQYERANSMHAAAKEMVALAEQS--LMSDSNWTFDPAWQEMLNHATQKVNEAEQEKTRAEREHQRRARIYNEAE 155 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HhcCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355566555544333333332 2344444 222221 123567899998889999999999999999999999999999
Q ss_pred HHHHHhhCCCCCCccccccccHHH
Q psy9560 146 KQVNDLRGKFVRPILKKVSKYENK 169 (503)
Q Consensus 146 iKVnDLRGKFKKPaLKKVRkSAnm 169 (503)
.+|..|..+.++ +.+|-+-|=++
T Consensus 156 ~~v~~Lek~lkr-~I~KSrPYfe~ 178 (239)
T PF05276_consen 156 QRVQQLEKKLKR-AIKKSRPYFEL 178 (239)
T ss_pred HHHHHHHHHHHH-HHHhhhHHHHH
Confidence 999999997543 55566655443
No 240
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=26.86 E-value=2.1e+02 Score=27.93 Aligned_cols=26 Identities=12% Similarity=0.132 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHH
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREI 134 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KV 134 (503)
|+..++.|-..|+.++.+.=+|+..+
T Consensus 68 L~~~~~~l~~~v~~q~~el~~L~~qi 93 (251)
T PF11932_consen 68 LEVYNEQLERQVASQEQELASLEQQI 93 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444444444433
No 241
>KOG1899|consensus
Probab=26.79 E-value=3.5e+02 Score=31.74 Aligned_cols=21 Identities=38% Similarity=0.436 Sum_probs=11.9
Q ss_pred HHHHHHHHHHhhhhhhHHHHHh
Q psy9560 114 QEYWQRIYNLEAIKYELDREIM 135 (503)
Q Consensus 114 KELHdRIdkLEEEKYDLE~KVk 135 (503)
++||.+ ..||..|.||-..|.
T Consensus 164 qellsr-tsLETqKlDLmaevS 184 (861)
T KOG1899|consen 164 QELLSR-TSLETQKLDLMAEVS 184 (861)
T ss_pred HHHHhh-hhHHHHHhHHHHHHH
Confidence 445444 666666666655443
No 242
>COG2118 DNA-binding protein [General function prediction only]
Probab=26.77 E-value=5.3e+02 Score=24.08 Aligned_cols=16 Identities=13% Similarity=0.360 Sum_probs=12.3
Q ss_pred CCHHHHHHHHHHHHHH
Q psy9560 104 MAEDEIGDLCQEYWQR 119 (503)
Q Consensus 104 LSedELQEkCKELHdR 119 (503)
+++++|.++.+.+...
T Consensus 89 I~e~~lk~IL~~i~~~ 104 (116)
T COG2118 89 IDEEELKEILERISPQ 104 (116)
T ss_pred CCHHHHHHHHHHHhHH
Confidence 6788998888777655
No 243
>KOG2572|consensus
Probab=26.62 E-value=1.8e+02 Score=32.31 Aligned_cols=64 Identities=13% Similarity=0.164 Sum_probs=47.5
Q ss_pred CCCCCHHHHHHHHHHHHHHHH----HHhhhhhhHHHH-----HhhhhhhHHHHHHHHHHhhCCCCCCccccccc
Q psy9560 101 TDDMAEDEIGDLCQEYWQRIY----NLEAIKYELDRE-----IMLKDFEITELGKQVNDLRGKFVRPILKKVSK 165 (503)
Q Consensus 101 IDGLSedELQEkCKELHdRId----kLEEEKYDLE~K-----VkKQDyEInELniKVnDLRGKFKKPaLKKVRk 165 (503)
|+||+..+|-.+|=-|-+.|. ++-.+|.|.-.. +-..|+|||.-.+||.++-| |-.|-|.++--
T Consensus 125 ~~g~~~~dl~~msLglaHslar~Klkfs~dKvDtmIiQaisLLDDLDkeLNtY~mRvrEwYG-wHFPEL~kii~ 197 (498)
T KOG2572|consen 125 ISGLNDSDLAAMSLGLAHSLARYKLKFSPDKVDTMIIQAISLLDDLDKELNTYAMRVKEWYG-WHFPELAKIIQ 197 (498)
T ss_pred hccCChhhhhHHHHHHHHHHHhhhcccCcchhhHHHHHHHHHHHhHHHHHHHHHHHHHHHHh-hhhHHHHHHHH
Confidence 677888888887777776664 334455554332 34578999999999999999 99999988743
No 244
>KOG0994|consensus
Probab=26.56 E-value=1.1e+03 Score=29.99 Aligned_cols=40 Identities=38% Similarity=0.242 Sum_probs=18.9
Q ss_pred CCcHHHHHHhhHHH--HHHHHHHHHHHHHHHHHHhHHHHhhh
Q psy9560 10 ETPAEAAAASAEDE--AKRLKLEELERKRAEVRRRMEEASGA 49 (503)
Q Consensus 10 ~~~~~~~~~mad~e--d~kRKqeErERKkAEvRKRlEEA~Kk 49 (503)
-|-+.-+-.||-+- +-.+|+.|-++-.--||.-+..++++
T Consensus 1407 ~t~A~~A~~~A~~~~~~l~~~~ae~eq~~~~v~ea~~~aseA 1448 (1758)
T KOG0994|consen 1407 VTRAGGALLMAGDADTQLRSKLAEAEQTLSMVREAKLSASEA 1448 (1758)
T ss_pred hcccchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Confidence 34555666676543 33444555444443343333334433
No 245
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=26.32 E-value=38 Score=26.23 Aligned_cols=17 Identities=24% Similarity=0.534 Sum_probs=14.6
Q ss_pred CCCCCCHHHHHHHHHHH
Q psy9560 100 DTDDMAEDEIGDLCQEY 116 (503)
Q Consensus 100 nIDGLSedELQEkCKEL 116 (503)
||+.||.++|+..|+++
T Consensus 2 d~~~LSd~eL~~~L~~~ 18 (44)
T smart00540 2 DVDRLSDAELRAELKQY 18 (44)
T ss_pred chhHcCHHHHHHHHHHc
Confidence 67889999999999875
No 246
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=26.26 E-value=2.4e+02 Score=29.29 Aligned_cols=39 Identities=15% Similarity=0.383 Sum_probs=28.2
Q ss_pred HHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHH
Q psy9560 130 LDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYEN 168 (503)
Q Consensus 130 LE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAn 168 (503)
++..+++...++.+++.||.+++|+...=-+++.|++.+
T Consensus 212 ~~EeL~~~Eke~~e~~~~i~e~~~rl~~l~~~~~~l~k~ 250 (269)
T PF05278_consen 212 LEEELKQKEKEVKEIKERITEMKGRLGELEMESTRLSKT 250 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344667778888899999999998766666666666553
No 247
>KOG0161|consensus
Probab=26.11 E-value=1e+03 Score=30.98 Aligned_cols=42 Identities=17% Similarity=0.174 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
...+|..+++.++..+-||-.+..+...|+.+|..++.+.-.
T Consensus 1246 ~l~elq~k~~~~~~~~~~l~~q~~~l~~E~~~l~~~lee~e~ 1287 (1930)
T KOG0161|consen 1246 QLSELQLKLDEQERLRNDLTAKRSRLQNENEELSRQLEEAEA 1287 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHhhHhHHHHH
Confidence 445566666777777777777777777777777777777655
No 248
>PHA02562 46 endonuclease subunit; Provisional
Probab=26.00 E-value=8e+02 Score=25.93 Aligned_cols=45 Identities=9% Similarity=0.073 Sum_probs=28.5
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|+..+..|.+.|..|-.+.+|++..+++...+|..+...+..+.+
T Consensus 232 l~~~l~~l~~~i~~l~~~i~~~~~~L~~l~~~~~~~~~~l~~~~~ 276 (562)
T PHA02562 232 IKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQK 276 (562)
T ss_pred HHHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444455555555555566666666677677777777777766665
No 249
>PF09789 DUF2353: Uncharacterized coiled-coil protein (DUF2353); InterPro: IPR019179 Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function.
Probab=25.97 E-value=7.5e+02 Score=26.27 Aligned_cols=78 Identities=18% Similarity=0.167 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHh---------------
Q psy9560 60 RKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLE--------------- 124 (503)
Q Consensus 60 RKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLE--------------- 124 (503)
||+.-|+=-|.+-..+|++=+.++.+=|..+ ++|++.|..|-.++..++
T Consensus 2 rKL~SK~eAL~IL~~eLe~cq~ErDqyKlMA----------------EqLqer~q~LKkk~~el~~~~~~~~d~~~~~~~ 65 (319)
T PF09789_consen 2 RKLQSKSEALLILSQELEKCQSERDQYKLMA----------------EQLQERYQALKKKYRELIQEAAGFGDPSIPPEK 65 (319)
T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHhhhhhcccCCccCCccc
Q ss_pred ------hhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 125 ------AIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 125 ------EEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.---|.-...++...||.+|++|+.|++|
T Consensus 66 ~~~~La~lL~~sre~Nk~L~~Ev~~Lrqkl~E~qG 100 (319)
T PF09789_consen 66 ENKNLAQLLSESREQNKKLKEEVEELRQKLNEAQG 100 (319)
T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
No 250
>PF07223 DUF1421: Protein of unknown function (DUF1421); InterPro: IPR010820 This family represents a conserved region approximately 350 residues long within a number of plant proteins of unknown function.
Probab=25.97 E-value=58 Score=34.65 Aligned_cols=30 Identities=30% Similarity=0.463 Sum_probs=22.8
Q ss_pred CCCccHHHHHHHHHHhhhcCchhhHHHHHHHHHhhh
Q psy9560 297 SNPVSIDDLVEKLMSAVSATPQPLLKSVVKDTLRRQ 332 (503)
Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (503)
.|-+.|||||||+.+. -+-+.-|..|+||.
T Consensus 316 ~~~~p~ddvidKv~~M------Gf~rDqV~a~v~rl 345 (358)
T PF07223_consen 316 GNRHPYDDVIDKVASM------GFRRDQVRATVRRL 345 (358)
T ss_pred cccCcHHHHHHHHHHc------CCcHHHHHHHHHHH
Confidence 4778899999999764 24566777788875
No 251
>PRK07739 flgK flagellar hook-associated protein FlgK; Validated
Probab=25.93 E-value=1.1e+02 Score=33.20 Aligned_cols=47 Identities=26% Similarity=0.314 Sum_probs=39.6
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.+.||..|+.--..|+..+.++...+.-.-.+||.|-.+|.+|..+.
T Consensus 148 a~~La~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~ll~~Ia~LN~~I 194 (507)
T PRK07739 148 AQALAETFNYLSQSLTDIQNDLKSEIDVTVKEINSLASQISDLNKQI 194 (507)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36688888888889999999999999888888888888888887654
No 252
>PF12725 DUF3810: Protein of unknown function (DUF3810); InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=25.87 E-value=1.8e+02 Score=30.02 Aligned_cols=72 Identities=18% Similarity=0.213 Sum_probs=42.4
Q ss_pred hhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhh-----------hhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCc
Q psy9560 91 IDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIK-----------YELDREIMLKDFEITELGKQVNDLRGKFVRPI 159 (503)
Q Consensus 91 LAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEK-----------YDLE~KVkKQDyEInELniKVnDLRGKFKKPa 159 (503)
|.++.+ ++.+..|.++|.++|+.|-+++..+..+- ||.+.-.++-..-.+.|..+.-.|.|. -|.
T Consensus 84 l~~~l~--l~~~~~~~~eL~~l~~~li~~~N~l~~~i~~~~~~~~~~~~~~~~i~~~~~~~y~~l~~~~p~l~~~--~~~ 159 (318)
T PF12725_consen 84 LSERLG--LETEEYSTEELKELTEYLIEKANELREQITEDDNGVVDIPYDKEEIFEEAREGYENLAERYPFLSGY--YPS 159 (318)
T ss_pred HHHHcC--CCCCCCCHHHHHHHHHHHHHHHHHHHHhcCCCccccccCCCCHHHHHHHHHHHHHHHHHhCCccCCC--CCC
Confidence 445563 46667899999999999999988874431 333433443334444444444444542 444
Q ss_pred ccccccc
Q psy9560 160 LKKVSKY 166 (503)
Q Consensus 160 LKKVRkS 166 (503)
.|.+=.|
T Consensus 160 ~K~~l~S 166 (318)
T PF12725_consen 160 PKPSLFS 166 (318)
T ss_pred CcHhhcc
Confidence 5555444
No 253
>KOG4643|consensus
Probab=25.78 E-value=1.3e+03 Score=28.71 Aligned_cols=59 Identities=15% Similarity=0.059 Sum_probs=34.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhh
Q psy9560 69 RKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIML 136 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkK 136 (503)
|.....+|..-...+.++|+.|...-+ .-..+...|.+|-+.+..||++.-+|-.+|..
T Consensus 497 L~~r~~elsrl~a~~~elkeQ~kt~~~---------qye~~~~k~eeLe~~l~~lE~ENa~LlkqI~~ 555 (1195)
T KOG4643|consen 497 LNNRDLELSRLHALKNELKEQYKTCDI---------QYELLSNKLEELEELLGNLEEENAHLLKQIQS 555 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 444445555444555555555544322 22456667777777777777777777665543
No 254
>KOG0652|consensus
Probab=25.60 E-value=1.1e+02 Score=32.73 Aligned_cols=55 Identities=13% Similarity=0.172 Sum_probs=42.7
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 102 DDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 102 DGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
|.++++-|.....++..++.-|+.|---|...|.+-+.||+.++.+|.+..-|.|
T Consensus 11 ~~~~~e~~~mste~i~~rtrlldnEirI~~sev~ri~he~~~~~ekIkeN~EkIk 65 (424)
T KOG0652|consen 11 DALDQEILSMSTEEIISRTRLLDNEIRIMKSEVQRINHELQAMKEKIKENTEKIK 65 (424)
T ss_pred hhhhhhhhhccHHHHHHHHHHhhhHHHHHHHHHHHhhhHHHHHHHHHHhhHHHhh
Confidence 3344455555667777888888888888888899999999999999999877543
No 255
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=25.56 E-value=1.4e+02 Score=22.54 Aligned_cols=40 Identities=20% Similarity=0.421 Sum_probs=33.3
Q ss_pred CCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHH
Q psy9560 103 DMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEIT 142 (503)
Q Consensus 103 GLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEIn 142 (503)
+++..+|..+|-++|..+..-+=..|.-.+...+..|+..
T Consensus 25 ~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~ 64 (69)
T PF00505_consen 25 DLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKE 64 (69)
T ss_dssp TSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHH
Confidence 4568899999999999999888889988888887777643
No 256
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=25.42 E-value=1.7e+02 Score=32.55 Aligned_cols=39 Identities=13% Similarity=0.056 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHh-------hhhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 110 GDLCQEYWQRIYNLE-------AIKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 110 QEkCKELHdRIdkLE-------EEKYDLE~KVkKQDyEInELniKV 148 (503)
|.+..+|-.+|+.|. -.+=|++.|++..+-||..|+.++
T Consensus 75 Q~kasELEKqLaaLrqElq~~saq~~dle~KIkeLEaE~~~Lk~Ql 120 (475)
T PRK13729 75 QVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQV 120 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHH
Confidence 345555555555553 122344444444444444444444
No 257
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=25.20 E-value=2.2e+02 Score=23.36 Aligned_cols=39 Identities=26% Similarity=0.353 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELG 145 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELn 145 (503)
++++..+.+|+.++..|...+-+|+..++..+.-+.+|.
T Consensus 1 Qe~~~~~~~l~~~l~~~~~q~~~l~~~~~~~~~~~~eL~ 39 (106)
T PF01920_consen 1 QELQNKFQELNQQLQQLEQQIQQLERQLRELELTLEELE 39 (106)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457778888888888888888887777765554444443
No 258
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=25.18 E-value=4.5e+02 Score=22.76 Aligned_cols=44 Identities=18% Similarity=0.166 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
.++.++...|.++|..++..-=.|+.+.......|.+|..++.+
T Consensus 63 ~~~~e~~~~l~~r~e~ie~~i~~lek~~~~l~~~l~e~q~~l~~ 106 (110)
T TIGR02338 63 TDKEEAIQELKEKKETLELRVKTLQRQEERLREQLKELQEKIQE 106 (110)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34666777778888877655555555444444444444444443
No 259
>PLN00138 large subunit ribosomal protein LP2; Provisional
Probab=25.08 E-value=2.4e+02 Score=25.60 Aligned_cols=60 Identities=17% Similarity=0.215 Sum_probs=41.9
Q ss_pred hhHHhhccccCCCCccHHHHHHHHHHhhhcCchhhHHHHHHHHHhhhhccccccCCCCCCC
Q psy9560 286 LWLYLKDTVSASNPVSIDDLVEKLMSAVSATPQPLLKSVVKDTLRRQASHEETEGGEPKPE 346 (503)
Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (503)
|--||--+++..+.+|-+|| .+|++++.....+..-.+|-+.|.-.--|+-...|.-+-.
T Consensus 4 vaAyll~~l~g~~~pta~dI-~~IL~AaGvevd~~~~~~f~~~L~gK~i~eLIa~G~~kl~ 63 (113)
T PLN00138 4 VAAYLLAVLGGNTCPSAEDL-KDILGSVGADADDDRIELLLSEVKGKDITELIASGREKLA 63 (113)
T ss_pred HHHHHHHHhcCCCCCCHHHH-HHHHHHcCCcccHHHHHHHHHHHcCCCHHHHHHhchhccc
Confidence 45677777777778998886 5788888888887666666677766666666555444443
No 260
>KOG3230|consensus
Probab=25.00 E-value=7.5e+02 Score=25.26 Aligned_cols=62 Identities=19% Similarity=0.253 Sum_probs=42.5
Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHH
Q psy9560 55 FMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQR 119 (503)
Q Consensus 55 KMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdR 119 (503)
+.|+.-.+.--.-.|.+|..+|++|....+.+.+++++|-. +. --.| +++..+=++|+|...
T Consensus 7 ~~tp~e~Lr~nqRal~~a~ReleRer~~le~qeKklvaeIK-k~-AK~g-q~~A~KimAkdLvRt 68 (224)
T KOG3230|consen 7 KKTPAELLRENQRALNKATRELERERQKLELQEKKLVAEIK-KT-AKQG-QMDAVKIMAKDLVRT 68 (224)
T ss_pred CCCHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH-HHcc-cHHHHHHHHHHHHHH
Confidence 44554422222357899999999999998888888888865 32 2233 677787788877643
No 261
>PF05010 TACC: Transforming acidic coiled-coil-containing protein (TACC); InterPro: IPR007707 This family contains the proteins TACC 1, 2 and 3, found concentrated in the centrosomes of eukaryotes which may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1) [].
Probab=24.91 E-value=6.9e+02 Score=24.80 Aligned_cols=63 Identities=14% Similarity=0.206 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHHHHHHHHhh
Q psy9560 113 CQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYENKFAKLQKK 176 (503)
Q Consensus 113 CKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAnm~a~L~Kh 176 (503)
+..+...|..|+..-.||-.++.+....|+.++.--..|+. ...=.+.+|.....+|.+|..|
T Consensus 78 rdq~~~dL~s~E~sfsdl~~ryek~K~vi~~~k~NEE~Lkk-~~~ey~~~l~~~eqry~aLK~h 140 (207)
T PF05010_consen 78 RDQAYADLNSLEKSFSDLHKRYEKQKEVIEGYKKNEETLKK-CIEEYEERLKKEEQRYQALKAH 140 (207)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHHHH
Confidence 34455566666666667777777666666666554344442 3333455666666667666544
No 262
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=24.77 E-value=1.1e+03 Score=26.93 Aligned_cols=44 Identities=25% Similarity=0.413 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCC
Q psy9560 110 GDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVR 157 (503)
Q Consensus 110 QEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKK 157 (503)
..+.+.+++-|..+.=++=||...+. |+..|...||.|.||..|
T Consensus 487 s~Yt~RIlEIv~NI~KQk~eI~KIl~----DTr~lQkeiN~l~gkL~R 530 (594)
T PF05667_consen 487 SAYTRRILEIVKNIRKQKEEIEKILS----DTRELQKEINSLTGKLDR 530 (594)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHh
Confidence 34667777777777777777765553 566777788888885443
No 263
>PF06248 Zw10: Centromere/kinetochore Zw10; InterPro: IPR009361 Zeste white 10 (ZW10) was initially identified as a mitotic checkpoint protein involved in chromosome segregation, and then implicated in targeting cytoplasmic dynein and dynactin to mitotic kinetochores, but it is also important in non-dividing cells. These include cytoplasmic dynein targeting to Golgi and other membranes, and SNARE-mediated ER-Golgi trafficking [, ]. Dominant-negative ZW10, anti-ZW10 antibody, and ZW10 RNA interference (RNAi) cause Golgi dispersal. ZW10 RNAi also disperse endosomes and lysosomes []. Drosophila kinetochore components Rough deal (Rod) and Zw10 are required for the proper functioning of the metaphase checkpoint in flies []. The eukaryotic spindle assembly checkpoint (SAC) monitors microtubule attachment to kinetochores and prevents anaphase onset until all kinetochores are aligned on the metaphase plate. It is an essential surveillance mechanism that ensures high fidelity chromosome segregation during mitosis. In higher eukaryotes, cytoplasmic dynein is involved in silencing the SAC by removing the checkpoint proteins Mad2 and the Rod-Zw10-Zwilch complex (RZZ) from aligned kinetochores [, , ].; GO: 0007067 mitosis, 0000775 chromosome, centromeric region, 0005634 nucleus
Probab=24.51 E-value=2.2e+02 Score=31.10 Aligned_cols=70 Identities=13% Similarity=0.162 Sum_probs=34.1
Q ss_pred CCHHHHHHHHHHHHHHHHHHhhhhhhHHHH--------H---hhhhhhHHHHHHHHHHhhCC-CCCCccccccccHHHHH
Q psy9560 104 MAEDEIGDLCQEYWQRIYNLEAIKYELDRE--------I---MLKDFEITELGKQVNDLRGK-FVRPILKKVSKYENKFA 171 (503)
Q Consensus 104 LSedELQEkCKELHdRIdkLEEEKYDLE~K--------V---kKQDyEInELniKVnDLRGK-FKKPaLKKVRkSAnm~a 171 (503)
|+.++|+..+.+|..+|..+.++=|++-.+ + ..--.++..|...|+|+.-+ ...+.+.+|+.+.+++.
T Consensus 7 l~~edl~~~I~~L~~~i~~~k~eV~~~I~~~y~df~~~~~~~~~L~~~~~~l~~eI~d~l~~~~~~~i~~~l~~a~~e~~ 86 (593)
T PF06248_consen 7 LSKEDLRKSISRLSRRIEELKEEVHSMINKKYSDFSPSLQSAKDLIERSKSLAREINDLLQSEIENEIQPQLRDAAEELQ 86 (593)
T ss_pred CCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHhhccchhHHHHHHHHHHHH
Confidence 455556665565555555555544432111 1 11112233333344444333 55667777777776666
Q ss_pred HH
Q psy9560 172 KL 173 (503)
Q Consensus 172 ~L 173 (503)
.|
T Consensus 87 ~L 88 (593)
T PF06248_consen 87 EL 88 (593)
T ss_pred HH
Confidence 55
No 264
>KOG0992|consensus
Probab=24.40 E-value=2.5e+02 Score=31.97 Aligned_cols=33 Identities=21% Similarity=0.330 Sum_probs=21.5
Q ss_pred HHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 119 RIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 119 RIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
+-+++|...|.|+. .+++.|-|++|++..-+++
T Consensus 153 ~~~q~Esls~~le~-~~~~~~~~~kl~ie~e~~~ 185 (613)
T KOG0992|consen 153 QTQQIESLSEELER-LRPIESVAEKLRIELEQLR 185 (613)
T ss_pred HHHHHHHHHHHHHh-hhhHHHHHHHHHHHHHHHH
Confidence 34566677777766 6666777777777665554
No 265
>PF01690 PLRV_ORF5: Potato leaf roll virus readthrough protein; InterPro: IPR002929 This family consists mainly of the Potato leafroll virus (PLrV) read through protein otherwise known as the minor capsid protein. This is generated via a readthrough of open reading frame 3, the coat protein, allowing transcription of open reading frame 5 to give an extended coat protein with a large C-terminal addition or read through domain []. The read through protein is essential for the circulative aphid transmission of PLrV [] and Beet western yellows virus []. The N-terminal region of the luteovirus readthrough domain determines virus binding to Buchnera GroEL and is essential for virus persistence in the aphid [].; GO: 0019028 viral capsid
Probab=24.40 E-value=43 Score=36.87 Aligned_cols=16 Identities=25% Similarity=0.401 Sum_probs=9.7
Q ss_pred CCCCCCChhhhhHHhh
Q psy9560 276 PSPVTPNPEELWLYLK 291 (503)
Q Consensus 276 ~~~~~~~~~~~~~~~~ 291 (503)
|.|.+.-.+.+|.|.-
T Consensus 23 PePtP~~~~RF~~Y~G 38 (465)
T PF01690_consen 23 PEPTPAKHERFIGYEG 38 (465)
T ss_pred CCCcccCccceEEEec
Confidence 3444445678888854
No 266
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=24.39 E-value=6.4e+02 Score=27.70 Aligned_cols=89 Identities=19% Similarity=0.121 Sum_probs=57.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCH-HHHHHHHHHHHHHHHHHhhhhhhHHH-------HHhhhh
Q psy9560 67 LLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAE-DEIGDLCQEYWQRIYNLEAIKYELDR-------EIMLKD 138 (503)
Q Consensus 67 LLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSe-dELQEkCKELHdRIdkLEEEKYDLE~-------KVkKQD 138 (503)
=.|.-|.++++.-+++....+.+.+.-|- .-..||-... +-+..++-.|-..|..+.-+.--|.. .|...+
T Consensus 242 D~v~~Ae~ev~~Ae~rl~~Ar~aL~~fRn-~~gvlDP~~~a~~~~~lI~~Le~qLa~~~aeL~~L~~~~~p~sPqV~~l~ 320 (434)
T PRK15178 242 ERILWLENDVKSAQENLGAARLELLKIQH-IQKDIDPKETITAIYQLIAGFETQLAEAKAEYAQLMVNGLDQNPLIPRLS 320 (434)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCchhHHH
Confidence 35677888888777777788888888776 3445564333 33445556666666555555444422 344556
Q ss_pred hhHHHHHHHHHHhhCCCC
Q psy9560 139 FEITELGKQVNDLRGKFV 156 (503)
Q Consensus 139 yEInELniKVnDLRGKFK 156 (503)
..|.-|..+|.++++|+.
T Consensus 321 ~rI~aLe~QIa~er~kl~ 338 (434)
T PRK15178 321 AKIKVLEKQIGEQRNRLS 338 (434)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 678888888888888875
No 267
>PRK00295 hypothetical protein; Provisional
Probab=24.39 E-value=4e+02 Score=21.90 Aligned_cols=25 Identities=20% Similarity=0.166 Sum_probs=13.9
Q ss_pred hHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 129 ELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 129 DLE~KVkKQDyEInELniKVnDLRG 153 (503)
+|+.-|.+|..+|..|+.++..|..
T Consensus 23 ~Ln~~v~~Qq~~I~~L~~ql~~L~~ 47 (68)
T PRK00295 23 ALNDVLVEQQRVIERLQLQMAALIK 47 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555666666666655555544
No 268
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=24.33 E-value=5.7e+02 Score=23.61 Aligned_cols=23 Identities=30% Similarity=0.294 Sum_probs=14.2
Q ss_pred ccCCCCCHHHHHHHHHHHHHHHH
Q psy9560 51 RKKGFMTPERKKKLRLLLRKKAA 73 (503)
Q Consensus 51 AKKgKMT~eRKkkLKSLLLqKAa 73 (503)
-|||+||.+--+.+=.=||+.++
T Consensus 34 VkkGeln~eEak~~vddl~~q~k 56 (108)
T COG3937 34 VKKGELNAEEAKRFVDDLLRQAK 56 (108)
T ss_pred HHcCCCCHHHHHHHHHHHHHHHH
Confidence 47899998865555444444443
No 269
>PF09371 Tex_N: Tex-like protein N-terminal domain; InterPro: IPR018974 This presumed domain is found at the N terminus of Q45388 from SWISSPROT. This protein defines a novel family of prokaryotic transcriptional accessory factors []. ; PDB: 2OCE_A 3BZK_A 3BZC_A.
Probab=24.24 E-value=2.1e+02 Score=27.87 Aligned_cols=56 Identities=20% Similarity=0.244 Sum_probs=38.8
Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHH---------HHHhhCCCCC
Q psy9560 101 TDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQ---------VNDLRGKFVR 157 (503)
Q Consensus 101 IDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniK---------VnDLRGKFKK 157 (503)
.+||++.+|+++ .+.|..+..|+.-|=.+-..+..+..=-.+|+.+ |.||=-=||.
T Consensus 36 TG~Lde~~lR~i-~~~~~~~~~L~~Rk~~il~~i~eqgkLt~eL~~~I~~a~tl~elEdlY~PyK~ 100 (193)
T PF09371_consen 36 TGGLDEVQLREI-QDRYEYLRELEKRKESILKSIEEQGKLTPELKQAIENATTLQELEDLYLPYKP 100 (193)
T ss_dssp HTS--HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHTT---HHHHHHHHH--SHHHHHHHHGGGS-
T ss_pred hCCCCHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHcccCCHHHHHHHHhcCCHHHHHHHHhhhcc
Confidence 577899999987 5667778999999999998888888766677766 4455555666
No 270
>KOG4005|consensus
Probab=24.19 E-value=7.5e+02 Score=26.01 Aligned_cols=7 Identities=14% Similarity=0.364 Sum_probs=3.9
Q ss_pred HHHhhHH
Q psy9560 16 AAASAED 22 (503)
Q Consensus 16 ~~~mad~ 22 (503)
-..|.|+
T Consensus 41 ~~~~~~~ 47 (292)
T KOG4005|consen 41 SSNMDDD 47 (292)
T ss_pred ccccCCC
Confidence 3456665
No 271
>TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF.
Probab=24.17 E-value=5.8e+02 Score=28.21 Aligned_cols=115 Identities=18% Similarity=0.196 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHH----HHHHHHhHHHHhhhh---ccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy9560 22 DEAKRLKLEELERK----RAEVRRRMEEASGAR---RKKGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDER 94 (503)
Q Consensus 22 ~ed~kRKqeErERK----kAEvRKRlEEA~KkK---AKKgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAER 94 (503)
+++-+++.+|.++. +.+..+|-+.-.++. .++......|...|.. ..+.-+.++++-.++..+....+.++
T Consensus 57 eeE~~~~R~Ele~el~~~e~rL~qrE~rL~qRee~Lekr~e~Lekre~~Le~--ke~~L~~re~eLee~~~e~~~~~~~~ 134 (514)
T TIGR03319 57 KEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEK--KEKELSNKEKNLDEKEEELEELIAEQ 134 (514)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred cCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHH
Q psy9560 95 CGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITE 143 (503)
Q Consensus 95 cpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInE 143 (503)
.-.+-+++||+.++-++..-+-... +-+.|+...+++...+.++
T Consensus 135 ~~~le~~a~lt~~eak~~l~~~~~~-----~~~~~~~~~~~~~~~~~~~ 178 (514)
T TIGR03319 135 REELERISGLTQEEAKEILLEEVEE-----EARHEAAKLIKEIEEEAKE 178 (514)
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHH
No 272
>PRK07521 flgK flagellar hook-associated protein FlgK; Validated
Probab=24.11 E-value=2e+02 Score=30.86 Aligned_cols=47 Identities=17% Similarity=0.132 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.+.||..||.--..|+..+-|+...+...-.+||.|-.+|.+|..+.
T Consensus 131 a~~L~~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~l~~~Ia~LN~~I 177 (483)
T PRK07521 131 AQDLANSLNDASDAVQSARADADAEIADSVDTLNDLLAQFEDANNAV 177 (483)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46688888888889999999999999888888888888888887654
No 273
>PF06013 WXG100: Proteins of 100 residues with WXG; InterPro: IPR010310 ESAT-6 is a small protein appears to be of fundamental importance in virulence and protective immunity in Mycobacterium tuberculosis. Homologues have been detected in other Gram-positive bacterial species. It may represent a novel secretion system potentially driven by the PF01580 from PFAM domains in the YukA-like proteins []. Members of this protein family include secretion targets for type main variants of type VII secretion systems (T7SS), one found in the Actinobacteria, one found in the Firmicutes. This model was derived through iteration from PF06013 from PFAM. The best characterised member of this family is ESAT-6 from Mycobacterium tuberculosis. Members of this family usually are ~100 amino acids in length but occasionally have long C-terminal extension. ; PDB: 3FAV_A 1WA8_A 3Q4H_B 2KG7_A 2VRZ_B 2VS0_B 3OGI_A 3H6P_B 3GVM_B 3GWK_C ....
Probab=24.03 E-value=2.8e+02 Score=20.94 Aligned_cols=40 Identities=10% Similarity=0.268 Sum_probs=30.7
Q ss_pred CCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCC
Q psy9560 104 MAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVR 157 (503)
Q Consensus 104 LSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKK 157 (503)
++.++|+..++.|......|+ ..+..|+..|+.|.+-++-
T Consensus 4 vd~~~l~~~a~~~~~~~~~l~--------------~~~~~l~~~~~~l~~~W~G 43 (86)
T PF06013_consen 4 VDPEQLRAAAQQLQAQADELQ--------------SQLQQLESSIDSLQASWQG 43 (86)
T ss_dssp SCHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHGGGBTS
T ss_pred ecHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHhhhCCc
Confidence 467889999998888877665 4577888888888776654
No 274
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=24.00 E-value=3.5e+02 Score=28.13 Aligned_cols=25 Identities=24% Similarity=0.319 Sum_probs=16.5
Q ss_pred HHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 130 LDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 130 LE~KVkKQDyEInELniKVnDLRGK 154 (503)
|+.+-.+-...|..+..||..+.||
T Consensus 240 l~~~~~~l~k~~~~~~sKV~kf~~~ 264 (269)
T PF05278_consen 240 LEMESTRLSKTIKSIKSKVEKFHGK 264 (269)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCC
Confidence 3444444455577888888888774
No 275
>KOG0995|consensus
Probab=23.99 E-value=6.4e+02 Score=28.97 Aligned_cols=69 Identities=19% Similarity=0.170 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhcC--CCCCCCCCCHHHH----------HHHHHHHHHHHHHHhhhhhhHHHHHhh
Q psy9560 69 RKKAAEELKREQERRALERKRVIDERCG--EPKDTDDMAEDEI----------GDLCQEYWQRIYNLEAIKYELDREIML 136 (503)
Q Consensus 69 LqKAaEELEKEKEqKeEEKKKiLAERcp--PPLnIDGLSedEL----------QEkCKELHdRIdkLEEEKYDLE~KVkK 136 (503)
|-+.-++|..|.+.+++|.+++=.++-. .-...-|+|..++ +.-..++--.++.|.-+-|+++.....
T Consensus 292 ~~~~l~~l~~Eie~kEeE~e~lq~~~d~Lk~~Ie~Q~iS~~dve~mn~Er~~l~r~l~~i~~~~d~l~k~vw~~~l~~~~ 371 (581)
T KOG0995|consen 292 MEKKLEMLKSEIEEKEEEIEKLQKENDELKKQIELQGISGEDVERMNLERNKLKRELNKIQSELDRLSKEVWELKLEIED 371 (581)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence 4445566677777777777766544321 1112335555543 223333344445555555555554433
Q ss_pred h
Q psy9560 137 K 137 (503)
Q Consensus 137 Q 137 (503)
.
T Consensus 372 ~ 372 (581)
T KOG0995|consen 372 F 372 (581)
T ss_pred H
Confidence 3
No 276
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=23.98 E-value=1.9e+02 Score=25.78 Aligned_cols=51 Identities=27% Similarity=0.314 Sum_probs=0.0
Q ss_pred CCCCCHHHHHH----------------HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHh
Q psy9560 101 TDDMAEDEIGD----------------LCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDL 151 (503)
Q Consensus 101 IDGLSedELQE----------------kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDL 151 (503)
|.+||.++|++ -|+.++..+..|-.+--+|....-...-+|.+|+.+|.++
T Consensus 1 L~~lS~~eL~~Ll~d~~~l~~~v~~l~~~~~~~~~~~~l~~~n~~lAe~nL~~~~~l~~~r~~l~~~ 67 (150)
T PF07200_consen 1 LQDLSTEELQELLSDEEKLDAFVKSLPQVQELQQEREELLAENEELAEQNLSLEPELEELRSQLQEL 67 (150)
T ss_dssp GGS-TTHHHHHHHHH-HHHHHHGGGGS--HHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH
T ss_pred CCcCCHHHHHHHHcCHHHHHHHHHcCHHHHHHHHHHHHHHHHHHHHHHHhcccchHHHHHHHHHHHH
No 277
>PF14389 Lzipper-MIP1: Leucine-zipper of ternary complex factor MIP1
Probab=23.95 E-value=4.6e+02 Score=22.45 Aligned_cols=37 Identities=22% Similarity=0.246 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
..|-..+++|-.-|..||.+-..||.+ |.+|..++++
T Consensus 50 ~~lp~~~keLL~EIA~lE~eV~~LE~~-------v~~L~~~l~~ 86 (88)
T PF14389_consen 50 SSLPKKAKELLEEIALLEAEVAKLEQK-------VLSLYRQLFQ 86 (88)
T ss_pred ccCChHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHh
Confidence 356778888888899998775555544 5566666554
No 278
>COG5293 Predicted ATPase [General function prediction only]
Probab=23.90 E-value=7.4e+02 Score=28.27 Aligned_cols=83 Identities=20% Similarity=0.312 Sum_probs=47.9
Q ss_pred HHHHHhHHHHhhhhccCCCCCHHHHHHHHHHHHHHHHHHHH---HHHHH---HHHHHHHHhhhhcCCCCCCCCCCHHHHH
Q psy9560 37 AEVRRRMEEASGARRKKGFMTPERKKKLRLLLRKKAAEELK---REQER---RALERKRVIDERCGEPKDTDDMAEDEIG 110 (503)
Q Consensus 37 AEvRKRlEEA~KkKAKKgKMT~eRKkkLKSLLLqKAaEELE---KEKEq---KeEEKKKiLAERcpPPLnIDGLSedELQ 110 (503)
.+|.+++|+-..-- .+|+.+|...|++-+-.+ ..+|+ .+... ++++--++|.-| | -..+.+
T Consensus 316 g~Vkk~~e~v~~F~---r~~~e~R~~yl~~ei~~i-~~dLk~~n~~~~~l~~~rae~l~~Lk~~--------g-~~e~y~ 382 (591)
T COG5293 316 GQVKKDFEHVIAFN---RAITEERHDYLQEEIAEI-EGDLKEVNAELDDLGKRRAEGLAFLKNR--------G-VFEKYQ 382 (591)
T ss_pred HHHHHhHHHHHHHH---HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhC--------C-cHHHHH
Confidence 45667777766544 689999999999865333 23333 22222 222222233222 1 246777
Q ss_pred HHHHHHHHHHHHHhhhhhhHHH
Q psy9560 111 DLCQEYWQRIYNLEAIKYELDR 132 (503)
Q Consensus 111 EkCKELHdRIdkLEEEKYDLE~ 132 (503)
-+|.++-+..-.|-+-+|-+|.
T Consensus 383 ~l~ee~~~~~~elae~~~rie~ 404 (591)
T COG5293 383 TLCEEIIALRGELAELEYRIEP 404 (591)
T ss_pred HHHHHHHHHhhhHHHHHHhhhH
Confidence 7888877776666666666553
No 279
>PF10046 BLOC1_2: Biogenesis of lysosome-related organelles complex-1 subunit 2 ; InterPro: IPR019269 This entry represents a family of proteins that play a role in cellular proliferation, as well as in the biogenesis of specialised organelles of the endosomal-lysosomal system [].
Probab=23.85 E-value=2.4e+02 Score=24.28 Aligned_cols=40 Identities=25% Similarity=0.257 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 111 DLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 111 EkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
+-.+.+.++|+.+|.+-=.||..|.+-|.=+..|-.|+..
T Consensus 59 ~~l~~~l~~Id~Ie~~V~~LE~~v~~LD~ysk~LE~k~k~ 98 (99)
T PF10046_consen 59 EELQPYLQQIDQIEEQVTELEQTVYELDEYSKELESKFKK 98 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3335555666666666666666666666666666666544
No 280
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=23.82 E-value=2.2e+02 Score=29.05 Aligned_cols=56 Identities=16% Similarity=0.133 Sum_probs=29.2
Q ss_pred HHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC---CCCCCccccccccHHHHHHHH
Q psy9560 119 RIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG---KFVRPILKKVSKYENKFAKLQ 174 (503)
Q Consensus 119 RIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG---KFKKPaLKKVRkSAnm~a~L~ 174 (503)
.|..|..++=.++.++.....++.+|+..|.++.. +-+.-+-..|+.--+.+..|+
T Consensus 231 ~l~el~~el~~l~~~i~~~~~~k~~l~~eI~e~~~~~~~~r~~t~~Ev~~Lk~~~~~Le 289 (325)
T PF08317_consen 231 ELAELQEELEELEEKIEELEEQKQELLAEIAEAEKIREECRGWTRSEVKRLKAKVDALE 289 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHH
Confidence 33355556666666666666666666666666542 122233333333334455554
No 281
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=23.74 E-value=2.1e+02 Score=26.14 Aligned_cols=39 Identities=23% Similarity=0.320 Sum_probs=22.9
Q ss_pred HHHHHHHHHhhhh-------hhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 115 EYWQRIYNLEAIK-------YELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 115 ELHdRIdkLEEEK-------YDLE~KVkKQDyEInELniKVnDLRG 153 (503)
||..+|..|++|+ .||-.+|+...|.+...+.++..|.+
T Consensus 29 EmkarIa~LEGE~r~~e~l~~dL~rrIkMLE~aLkqER~k~~~~~~ 74 (134)
T PF08232_consen 29 EMKARIAFLEGERRGQENLKKDLKRRIKMLEYALKQERAKYKKLKY 74 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc
Confidence 3445555555554 46666666666666666666655443
No 282
>PF14915 CCDC144C: CCDC144C protein coiled-coil region
Probab=23.73 E-value=6e+02 Score=27.07 Aligned_cols=56 Identities=13% Similarity=0.044 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhH
Q psy9560 66 LLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYEL 130 (503)
Q Consensus 66 SLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDL 130 (503)
+|+|--+..+|.+-+-+..+-+..|..++- + +..-..+-.-|.+||.+|.++--=|
T Consensus 178 tL~lE~~QrdL~Qtq~q~KE~e~m~qne~~-k--------v~k~~~Kqes~eERL~QlqsEN~LL 233 (305)
T PF14915_consen 178 TLALESVQRDLSQTQCQIKEIEHMYQNEQD-K--------VNKYIGKQESLEERLSQLQSENMLL 233 (305)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHH-H--------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355555566666555555555555555443 1 1112222333456666666554433
No 283
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.72 E-value=2e+02 Score=29.73 Aligned_cols=38 Identities=26% Similarity=0.182 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITEL 144 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInEL 144 (503)
++++....++-.+|+++..+-=-|+.++...+..|.+.
T Consensus 62 ~~~~~k~~~~~~~i~~~~~eik~l~~eI~~~~~~I~~r 99 (265)
T COG3883 62 EEIQSKIDELQKEIDQSKAEIKKLQKEIAELKENIVER 99 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444444444444444444444443
No 284
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=23.71 E-value=3.9e+02 Score=22.07 Aligned_cols=37 Identities=8% Similarity=0.236 Sum_probs=19.7
Q ss_pred HHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 116 YWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 116 LHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
|+..|.++-+.-..++.+++-.+..+.+|...|..|+
T Consensus 16 ~~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~ 52 (61)
T PF08826_consen 16 IQEELTKVKSANLAFESKLQEAEKRNRELEQEIERLK 52 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444455555555555555555555555555555544
No 285
>PF09340 NuA4: Histone acetyltransferase subunit NuA4; InterPro: IPR015418 The NuA4 histone acetyltransferase (HAT) multisubunit complex is responsible for acetylation of histone H4 and H2A N-terminal tails in yeast []. NuA4 complexes are highly conserved in eukaryotes and play primary roles in transcription, cellular response to DNA damage, and cell cycle control [].
Probab=23.67 E-value=1.2e+02 Score=25.75 Aligned_cols=29 Identities=14% Similarity=0.172 Sum_probs=23.1
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHh
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIM 135 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVk 135 (503)
.+|...=++|.+.|..||.+-||+|..--
T Consensus 5 ~~l~~~k~~Le~~L~~lE~qIy~~Et~YL 33 (80)
T PF09340_consen 5 KELLQKKKKLEKDLAALEKQIYDKETSYL 33 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35566668899999999999999887654
No 286
>PF08286 Spc24: Spc24 subunit of Ndc80; InterPro: IPR013252 Spc24 is a component of the evolutionarily conserved kinetochore-associated Ndc80 complex and is involved in chromosome segregation [].; PDB: 2VE7_D 2FV4_B 2FTX_B.
Probab=23.63 E-value=27 Score=30.71 Aligned_cols=34 Identities=24% Similarity=0.427 Sum_probs=0.0
Q ss_pred HHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 120 IYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 120 IdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|..||.++|.|-..+...+.+|+.|+..+.+|+.
T Consensus 1 ~~~Ld~~k~~laK~~~~LE~~l~~l~~el~~L~~ 34 (118)
T PF08286_consen 1 IQELDNEKFRLAKELSDLESELESLQSELEELKE 34 (118)
T ss_dssp ----------------------------------
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566677776666655555555555555555544
No 287
>KOG0976|consensus
Probab=23.58 E-value=7.6e+02 Score=30.08 Aligned_cols=43 Identities=12% Similarity=0.013 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVN 149 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVn 149 (503)
+.|.++-++|-.|=.+|..|+-+.|..|+..+.++.+|++-+.
T Consensus 266 k~ieE~m~qlk~kns~L~~ElSqkeelVk~~qeeLd~lkqt~t 308 (1265)
T KOG0976|consen 266 KEIEEKMRQLKAKNSVLGDELSQKEELVKELQEELDTLKQTRT 308 (1265)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444455555555444444444444433
No 288
>PRK02119 hypothetical protein; Provisional
Probab=23.46 E-value=2.2e+02 Score=23.80 Aligned_cols=24 Identities=21% Similarity=0.429 Sum_probs=12.6
Q ss_pred hHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 129 ELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 129 DLE~KVkKQDyEInELniKVnDLR 152 (503)
+|+.-|.+|..+|..|..++..|.
T Consensus 27 ~LN~~v~~Qq~~id~L~~ql~~L~ 50 (73)
T PRK02119 27 ELNQALIEQQFVIDKMQVQLRYMA 50 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444455555555555555554443
No 289
>PF04100 Vps53_N: Vps53-like, N-terminal ; InterPro: IPR007234 Vps53 complexes with Vps52 and Vps54 to form a multi-subunit complex involved in regulating membrane trafficking events [].
Probab=23.33 E-value=90 Score=32.80 Aligned_cols=62 Identities=16% Similarity=0.210 Sum_probs=41.7
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCccccccccHH
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPILKKVSKYEN 168 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPaLKKVRkSAn 168 (503)
+..+.+....+|+.+|..+.+.==.-|..|..-..+|..|-+-=..|.. .==+|||..|--+
T Consensus 58 ~l~~a~~~i~~L~~~i~~ik~kA~~sE~~V~~it~dIk~LD~AKrNLT~--SIT~LkrL~MLv~ 119 (383)
T PF04100_consen 58 DLEEAQEAIQELFEKISEIKSKAEESEQMVQEITRDIKQLDNAKRNLTQ--SITTLKRLQMLVT 119 (383)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH
Confidence 4567778888999999888877777777777777777777555444443 2235666665543
No 290
>PF05983 Med7: MED7 protein; InterPro: IPR009244 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This family consists of several eukaryotic proteins, which are homologues of the yeast MED7 protein. Activation of gene transcription in metazoans is a multistep process that is triggered by factors that recognise transcriptional enhancer sites in DNA. These factors work with co-activators such as MED7 to direct transcriptional initiation by the RNA polymerase II apparatus [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex; PDB: 3FBI_C 3FBN_A 1YKH_A 1YKE_A.
Probab=23.15 E-value=4.1e+02 Score=25.07 Aligned_cols=85 Identities=16% Similarity=0.166 Sum_probs=44.8
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhh--hhhh----
Q psy9560 56 MTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEA--IKYE---- 129 (503)
Q Consensus 56 MT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEE--EKYD---- 129 (503)
-+..||..||.|+...-..-|+ |-+.+.....-..-..++|+-|..-||+-|..+-- -|..
T Consensus 69 ~~~d~~~eLkkL~~sll~nfle-------------Ll~~l~~~P~~~~~ki~~i~~L~~NmhhllNeyRPhQARetLi~~ 135 (162)
T PF05983_consen 69 PSVDRKKELKKLNKSLLLNFLE-------------LLDILSKNPSQYERKIEDIRLLFINMHHLLNEYRPHQARETLIMM 135 (162)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHH-------------HTTSS---CCCHHHHHHHHHHHHHHHHHHHHHTHHHHHHHHHHHH
T ss_pred CCchHHHHHHHHHHHHHHHHHH-------------HHHHHHhCCccHHHHHHHHHHHHHHHHHHHHHhCHHHHHHHHHHH
Confidence 4567888888877665555554 22223111110101235566677777777665432 2444
Q ss_pred HHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 130 LDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 130 LE~KVkKQDyEInELniKVnDLRG 153 (503)
|+..+.+...+|+.|+..+.+.+.
T Consensus 136 me~Ql~~kr~~i~~i~~~~~~~~~ 159 (162)
T PF05983_consen 136 MEEQLEEKREEIEEIRKVCEKARE 159 (162)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555555566666666666665543
No 291
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=23.11 E-value=4.4e+02 Score=29.24 Aligned_cols=45 Identities=18% Similarity=0.161 Sum_probs=34.3
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHH
Q psy9560 100 DTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITEL 144 (503)
Q Consensus 100 nIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInEL 144 (503)
.+.-|...+.++.=+.+++.|..+|..+||+...-.+..-+|.+|
T Consensus 401 ~~~kl~l~eaee~r~~~~eelk~~e~s~~~~~~mk~rM~~~I~~L 445 (446)
T PF07227_consen 401 RYLKLRLNEAEEERKKKFEELKVLENSHRDYDNMKMRMQSEIQDL 445 (446)
T ss_pred HHHhhhhHHHHHHHHHHHHHHHHHHhhccccchHHHHHHHHHHhh
Confidence 345566778888889999999999999999865555555666665
No 292
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=23.10 E-value=3.4e+02 Score=29.77 Aligned_cols=52 Identities=15% Similarity=0.151 Sum_probs=45.7
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
.-.+|++.+.+|-+++..++.++=+|...+.....+..+.+.++.+++.++.
T Consensus 377 ~ysel~e~leel~e~leeie~eq~ei~e~l~~Lrk~E~eAr~kL~~~~~~L~ 428 (569)
T PRK04778 377 AYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLH 428 (569)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4688999999999999999999999999999988888888888888877654
No 293
>COG0536 Obg Predicted GTPase [General function prediction only]
Probab=23.08 E-value=65 Score=34.60 Aligned_cols=51 Identities=16% Similarity=0.136 Sum_probs=34.9
Q ss_pred HHHHHHHHHhhhhcCCCCC--CCCCCHHHHHHHHHHHHHHHHHH--hhhhhhHHH
Q psy9560 82 RRALERKRVIDERCGEPKD--TDDMAEDEIGDLCQEYWQRIYNL--EAIKYELDR 132 (503)
Q Consensus 82 qKeEEKKKiLAERcpPPLn--IDGLSedELQEkCKELHdRIdkL--EEEKYDLE~ 132 (503)
+..++++++|.+.+..... ||.++-+.|++||+.|++.|.+. +.+.++.+.
T Consensus 292 e~~~~~~~~l~~~~~~~~~~~ISa~t~~g~~~L~~~~~~~l~~~~~~~~~~~~~~ 346 (369)
T COG0536 292 EELEELKKALAEALGWEVFYLISALTREGLDELLRALAELLEETKAEAEAAEAEE 346 (369)
T ss_pred HHHHHHHHHHHHhcCCCcceeeehhcccCHHHHHHHHHHHHHHhhhhhccccccc
Confidence 4555666777766643322 88888888999999999988877 455554333
No 294
>PF15233 SYCE1: Synaptonemal complex central element protein 1
Probab=23.03 E-value=3.2e+02 Score=26.02 Aligned_cols=44 Identities=16% Similarity=0.137 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHh
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDL 151 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDL 151 (503)
+-+.++.-||.-+|.|-+|++-|+..++++.--+..|+...++-
T Consensus 31 Ea~~l~eaL~~ELDsL~~EkvhLeeilnkKqe~l~iLqlhcqek 74 (134)
T PF15233_consen 31 EAQALWEALQRELDSLNGEKVHLEEILNKKQETLRILQLHCQEK 74 (134)
T ss_pred HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35678889999999999999999999998766667776665554
No 295
>PF13797 Post_transc_reg: Post-transcriptional regulator
Probab=22.81 E-value=78 Score=27.23 Aligned_cols=48 Identities=21% Similarity=0.512 Sum_probs=31.6
Q ss_pred CChhhhhHHhhccccC-CCCccHHHHHHHHHHhhhcCchhhHHHHHHHHHhh
Q psy9560 281 PNPEELWLYLKDTVSA-SNPVSIDDLVEKLMSAVSATPQPLLKSVVKDTLRR 331 (503)
Q Consensus 281 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (503)
-+.++||-||.+-+-- ..++++-++|..+||. ++.-++.-+-....+.
T Consensus 28 vt~~dlw~yl~~~~WK~~~~~~l~e~V~DIlsl---~~~~~m~ylt~~A~k~ 76 (87)
T PF13797_consen 28 VTEEDLWSYLTEKKWKKKKPPRLHELVNDILSL---KPNDYMNYLTVEAYKE 76 (87)
T ss_pred CCHHHHHHHHHHHHhccCCCcCHHHHHHHHHcC---CHHHHHHHHHHHHHHc
Confidence 3578999999886544 4458999999999863 4444444443333333
No 296
>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein. This model describes a domain found in some proteins associated with polyhydroxyalkanoate (PHA) granules in a subset of species that have PHA inclusion granules. Included are two tandem proteins of Pseudomonas oleovorans, PhaI and PhaF, and their homologs in related species. PhaF proteins have a low-complexity C-terminal region with repeats similar to AAAKP.
Probab=22.79 E-value=5.1e+02 Score=23.15 Aligned_cols=26 Identities=27% Similarity=0.157 Sum_probs=16.3
Q ss_pred ccCCCCCHHHHHHHHHHHHHHHHHHH
Q psy9560 51 RKKGFMTPERKKKLRLLLRKKAAEEL 76 (503)
Q Consensus 51 AKKgKMT~eRKkkLKSLLLqKAaEEL 76 (503)
-++|.++.++-+.+=.-++.+..+++
T Consensus 32 VkkGe~~~ee~k~~~~e~~~~~~e~~ 57 (118)
T TIGR01837 32 VKEGELAEKRGQKRFDESVDAAREEV 57 (118)
T ss_pred HHhccccHHHHHHHHHHHHHHHHHHH
Confidence 36788888876665544555555444
No 297
>KOG3647|consensus
Probab=22.57 E-value=4.2e+02 Score=28.20 Aligned_cols=45 Identities=20% Similarity=0.300 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCCCc
Q psy9560 113 CQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVRPI 159 (503)
Q Consensus 113 CKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKKPa 159 (503)
.+.+.++|..+-.+--.|+.|+.+.+.|++.++.|+--|+. .||.
T Consensus 121 ~q~~~~~Lnnvasdea~L~~Kierrk~ElEr~rkRle~Lqs--iRP~ 165 (338)
T KOG3647|consen 121 LQSSRAQLNNVASDEAALGSKIERRKAELERTRKRLEALQS--IRPA 165 (338)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cchH
Confidence 34555677777778888999999999999999999999988 6665
No 298
>PRK08471 flgK flagellar hook-associated protein FlgK; Validated
Probab=22.52 E-value=1.4e+02 Score=33.35 Aligned_cols=48 Identities=19% Similarity=0.301 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKFV 156 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFK 156 (503)
.+.||..|+.--..|+..+-|+...+...-.+||.|-.+|.+|..+..
T Consensus 141 a~~L~~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~ll~~Ia~LN~qI~ 188 (613)
T PRK08471 141 TETLTNNIKDTRERLDTLQKKVNEELKVTVDEINSLGKQIAEINKQIK 188 (613)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466888888888899999999999998888888888888888877543
No 299
>PF15469 Sec5: Exocyst complex component Sec5
Probab=22.47 E-value=1.6e+02 Score=26.97 Aligned_cols=87 Identities=21% Similarity=0.369 Sum_probs=49.8
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhh------hhHHHHHHHHHHhhCC---CCCCccccccccH---HHHHHHHh
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKD------FEITELGKQVNDLRGK---FVRPILKKVSKYE---NKFAKLQK 175 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQD------yEInELniKVnDLRGK---FKKPaLKKVRkSA---nm~a~L~K 175 (503)
+|+.+|++=|++.-.+-+--=++-..++.+. +-+..|...|+++..+ +-+|.|.+-.+.. +.+..|++
T Consensus 3 ~lk~LV~~Nf~~Fv~~k~tid~i~~~~~~~~~~~~~~~~~~~L~~~l~~~~~~~~~~~~pll~~~~k~~~l~~~l~~l~r 82 (182)
T PF15469_consen 3 DLKSLVKENFDKFVSCKDTIDDIYEEFRNMKTEAQQDSGTEKLEESLNEASSKANSVFKPLLERREKADKLRNALEFLQR 82 (182)
T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHHHHHHH
Confidence 5666777666555433332222223332222 3366677666666543 4567776544444 44566678
Q ss_pred hhhccchhhhccc-cccccc
Q psy9560 176 KAAEFNFRSQLKA-VKKREF 194 (503)
Q Consensus 176 hkvsmDfRANLKq-VKKEe~ 194 (503)
++-=|||=++|+. +++.++
T Consensus 83 ~~flF~LP~~L~~~i~~~dy 102 (182)
T PF15469_consen 83 NRFLFNLPSNLRECIKKGDY 102 (182)
T ss_pred HHHHHHhHHHHHHHHHcCcH
Confidence 8888998888875 466555
No 300
>cd09237 V_ScBro1_like Protein-interacting V-domain of Saccharomyces cerevisiae Bro1 and related domains. This family contains the V-shaped (V) domain of Saccharomyces cerevisiae Bro1, and related domains. It belongs to the V_Alix_like superfamily which also includes the V-domain of Saccharomyces cerevisiae Rim20 (also known as PalA), mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), and related domains. Bro1 interacts with the ESCRT (Endosomal Sorting Complexes Required for Transport) system, and participates in endosomal trafficking. The mammalian Alix V-domain (belonging to a different family) contains a binding site, partially conserved in the superfamily, for the retroviral late assembly (L) domain YPXnL motif. The Alix V-domain is also a dimerization domain. Bro1 also has an N-terminal Bro1-like domain, which binds Snf7, a component of the ESCRT-III complex, and a C-terminal proline-rich
Probab=22.45 E-value=5.4e+02 Score=26.46 Aligned_cols=96 Identities=10% Similarity=-0.015 Sum_probs=69.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHH
Q psy9560 68 LRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQ 147 (503)
Q Consensus 68 LLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniK 147 (503)
+|.-+...|..|...-..-|.+|.. |.... .+..--..|++-++.|..-|..--.----|-.++......|..|..-
T Consensus 88 ~l~~~~~~L~~E~~ed~~~R~k~g~-~Wtr~--~S~~~~~~l~~~~~k~~~~L~~A~~SD~~l~~~~~~~~~~l~lL~~~ 164 (356)
T cd09237 88 EIDSSYNDLDEEMKEIEKMRKKILA-KWTQS--PSSSLTASLREDLVKLKKSLVEASASDEKLFSLVDPVKEDIALLLNG 164 (356)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc-ccccc--cchhhhHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHcCC
Confidence 4577788888888888888999988 55322 23323467888888888888877777777888888888899999776
Q ss_pred HHHhhCCCCCCc--ccccccc
Q psy9560 148 VNDLRGKFVRPI--LKKVSKY 166 (503)
Q Consensus 148 VnDLRGKFKKPa--LKKVRkS 166 (503)
..+|..-|+.|. ...|+..
T Consensus 165 ~~~l~~~~~~p~~~~~~~sll 185 (356)
T cd09237 165 GSLWEELFGFSSSGSPEPSLL 185 (356)
T ss_pred hHHHHHHhcCCCCCCCCCccc
Confidence 667776454555 5554443
No 301
>PF05791 Bacillus_HBL: Bacillus haemolytic enterotoxin (HBL); InterPro: IPR008414 This family consists of several Bacillus haemolytic enterotoxins (HblC, HblD, HblA, NheA, and NheB), which can cause food poisoning in humans []. Haemolysin BL (encoded by HBL) and non-haemolytic enterotoxin (encoded by NHE), represent the major enterotoxins produced by Bacillus cereus. Most of the cytotoxic activity of B. cereus isolates has been attributed to the level of Nhe, which may indicate a highly diarrheic potential []. The exact mechanism by which B. cereus causes diarrhoea is unknown. Hbl, cytotoxin K (CytK) and Nhe are all putative causes. Both Hbl and Nhe are three-component cytotoxins and maximal cytotoxicity of Nhe against epithelia is dependent on all three components. Nhe has haemolytic activity against erythrocytes from a variety of species. It is possible that the common structural and functional properties of these toxins indicate that the Hbl/Nhe and ClyA families of toxins constitute a superfamily of pore-forming cytotoxins []. The high virulence of some strains is thought to be due to the greater cytotoxic activity of CytK-1 compared to CytK-2, and to a high level of cytK expression []. Haemolysin BL and non-haemolytic enterotoxin production are both influenced by pH and micro []. This entry is found in cytotoxic proteins that form part of the enterotoxin complex and bind to erythrocytes. HblA is composed of a binding component, B, and two lytic components, L1 and L2. All three subunits act synergically to cause hemolysis.; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2NRJ_A.
Probab=22.39 E-value=2.4e+02 Score=26.67 Aligned_cols=34 Identities=26% Similarity=0.316 Sum_probs=19.7
Q ss_pred HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 122 NLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 122 kLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.|....-.+...+....-+|.+|+.+|.++++..
T Consensus 146 ~l~~~~~~l~~~l~~~~g~I~~L~~~I~~~~~~I 179 (184)
T PF05791_consen 146 NLKTDVDELQSILAGENGDIPQLQKQIENLNEEI 179 (184)
T ss_dssp HHHHHHHHHHHHHHHTT--HHHHHHHHHHHTGGG
T ss_pred HHHHhHHHHHHHHhcccCCHHHHHHHHHHHHHHH
Confidence 3444445555566666677777777777777643
No 302
>KOG3863|consensus
Probab=22.39 E-value=1.1e+02 Score=34.96 Aligned_cols=72 Identities=24% Similarity=0.331 Sum_probs=41.9
Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHHHHhhhhhh
Q psy9560 53 KGFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIYNLEAIKYE 129 (503)
Q Consensus 53 KgKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRIdkLEEEKYD 129 (503)
|-+|.+.|..+=|+ -.|+ .|+.+-.....+|++.|.||- ...+--+..-++|.+||.+.+++|..=+.-.|-
T Consensus 497 KNkvAAQnCRKRKL--d~I~--nLE~ev~~l~~eKeqLl~Er~-~~d~~L~~~kqqls~L~~~Vf~~lrd~eg~~~s 568 (604)
T KOG3863|consen 497 KNKVAAQNCRKRKL--DCIL--NLEDEVEKLQKEKEQLLRERD-ELDSTLGVMKQQLSELYQEVFQQLRDEEGNPYS 568 (604)
T ss_pred ccchhccchhhhHH--HHHH--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccccCccC
Confidence 34555554443222 2222 345555566777888888887 343333344578888888888887655544443
No 303
>PF08781 DP: Transcription factor DP; InterPro: IPR014889 DP forms a heterodimer with E2F and regulates genes involved in cell cycle progression. The transcriptional activity of E2F is inhibited by the retinoblastoma protein which binds to the E2F-DP heterodimer [] and negatively regulates the G1-S transition. ; PDB: 2AZE_A.
Probab=22.32 E-value=1.6e+02 Score=27.89 Aligned_cols=34 Identities=24% Similarity=0.118 Sum_probs=27.0
Q ss_pred HHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 119 RIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 119 RIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
-|..|+.||-.+..+++++...+.+|-.+...++
T Consensus 2 ~~~~Le~ek~~~~~rI~~K~~~LqEL~~Q~va~k 35 (142)
T PF08781_consen 2 ECEELEEEKQRRRERIKKKKEQLQELILQQVAFK 35 (142)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4677888888888888888888888877766554
No 304
>KOG3054|consensus
Probab=22.27 E-value=4.4e+02 Score=27.74 Aligned_cols=11 Identities=27% Similarity=0.389 Sum_probs=6.2
Q ss_pred HHHHHHHHHHH
Q psy9560 111 DLCQEYWQRIY 121 (503)
Q Consensus 111 EkCKELHdRId 121 (503)
.|..++.+.|.
T Consensus 200 nll~eFv~YIk 210 (299)
T KOG3054|consen 200 NLLSEFVEYIK 210 (299)
T ss_pred HHHHHHHHHHH
Confidence 35566655554
No 305
>PF10018 Med4: Vitamin-D-receptor interacting Mediator subunit 4; InterPro: IPR019258 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Members of this family represent the Med4 subunit of the Mediator (Med) complex [, ]. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=22.20 E-value=2.9e+02 Score=26.16 Aligned_cols=42 Identities=24% Similarity=0.226 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHh
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDL 151 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDL 151 (503)
|.++=++|+..|+.|. +.-++..++.+...+|..|..+|.++
T Consensus 7 L~~~d~~L~~~L~~l~-~hq~~~~~I~~L~~e~~~ld~~i~~~ 48 (188)
T PF10018_consen 7 LIEADDELSSALEELQ-EHQENQARIQQLRAEIEELDEQIRDI 48 (188)
T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444442 33344444444445555555555444
No 306
>PF11945 WASH_WAHD: WAHD domain of WASH complex; InterPro: IPR021854 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. This entry represents the WASH subunit of the WASH complex. WASH genes duplicated to multiple chromosomal ends during primate evolution, with highest copy number reached in humans, whose WASH repertoires probably vary extensively among individuals []. It is therefore difficult to determine which gene is functional or not. The telomeric region of chromosome 9p is paralogous to the pericentromeric regions of chromosome 9 as well as to 2q. Paralogous regions contain 7 transcriptional units. Duplicated WASH genes are also present in the Xq/Yq pseudoautosomal region, as well as on chromosome 1 and 15. The chromosome 16 copy seems to be a pseudogene.
Probab=22.15 E-value=2.3e+02 Score=29.39 Aligned_cols=48 Identities=17% Similarity=0.232 Sum_probs=0.0
Q ss_pred CHHHHHHHHHHHHHHHH-HHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCCCC
Q psy9560 105 AEDEIGDLCQEYWQRIY-NLEAIKYELDREIMLKDFEITELGKQVNDLRGKFVR 157 (503)
Q Consensus 105 SedELQEkCKELHdRId-kLEEEKYDLE~KVkKQDyEInELniKVnDLRGKFKK 157 (503)
+.+.|..++..+|.+|. +|+..|=.|+.--.| |+.++.||+.|+| .+|
T Consensus 26 aL~~L~~v~~diF~rI~~Rv~~~~~~l~~i~~R----i~~~qaKi~~l~g-s~k 74 (297)
T PF11945_consen 26 ALEYLDKVSNDIFSRISARVERNRERLQAIQQR----IEVAQAKIEKLQG-SKK 74 (297)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhC-CCc
No 307
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=22.05 E-value=2.9e+02 Score=23.48 Aligned_cols=45 Identities=20% Similarity=0.269 Sum_probs=34.8
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
.+.+..+-|-++|+.|+...=+|+..+.+...+|+.|+..++.+.
T Consensus 84 ~~~eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~~l~~~~ 128 (129)
T cd00890 84 SLEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEELQQLQ 128 (129)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 345555666689999988888888888888888888888877653
No 308
>PF05178 Kri1: KRI1-like family; InterPro: IPR018034 The Kri1 protein is also known as KRR1-interacting protein 1. The Saccharomyces cerevisiae member of this family is found to be required for the assembly of preribosomal 40S subunits in the nucleolus []. KRR1 is highly expressed in dividing cells and its expression ceases almost completely when cells enter the stationary phase.
Probab=21.99 E-value=45 Score=29.43 Aligned_cols=15 Identities=40% Similarity=0.501 Sum_probs=7.2
Q ss_pred HHHHHHHHHhHHHHh
Q psy9560 33 ERKRAEVRRRMEEAS 47 (503)
Q Consensus 33 ERKkAEvRKRlEEA~ 47 (503)
+||.+|.++|.+|-.
T Consensus 2 eRK~~Ek~~k~eElk 16 (101)
T PF05178_consen 2 ERKEEEKQEKEEELK 16 (101)
T ss_pred cHHHHHHHHHHHHHH
Confidence 344444444554444
No 309
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=21.98 E-value=1.2e+03 Score=26.59 Aligned_cols=35 Identities=14% Similarity=0.164 Sum_probs=13.3
Q ss_pred HHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHH
Q psy9560 114 QEYWQRIYNLEAIKYELDREIMLKDFEITELGKQV 148 (503)
Q Consensus 114 KELHdRIdkLEEEKYDLE~KVkKQDyEInELniKV 148 (503)
..+..+|..++.+.-.++..+.....+|+.|..++
T Consensus 402 ~~~~~~~~~l~~~l~~l~~~~~~~~~~~~~l~~~~ 436 (1164)
T TIGR02169 402 NELKRELDRLQEELQRLSEELADLNAAIAGIEAKI 436 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333333333333333
No 310
>KOG2412|consensus
Probab=21.91 E-value=3.8e+02 Score=30.69 Aligned_cols=28 Identities=29% Similarity=0.262 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHHHhhhh
Q psy9560 23 EAKRLKLEELERKRAEVRRRMEEASGAR 50 (503)
Q Consensus 23 ed~kRKqeErERKkAEvRKRlEEA~KkK 50 (503)
+.++|+.+.++|++.+.+||.|...+.+
T Consensus 204 ~~~q~~eqi~~~~~~~e~kr~Eaerk~~ 231 (591)
T KOG2412|consen 204 REKQRKEQIRERKERSEEKREEAERKRR 231 (591)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHH
Confidence 3577888888888888877777665543
No 311
>PF10226 DUF2216: Uncharacterized conserved proteins (DUF2216); InterPro: IPR019359 Proteins in this entry are found in Metazoa and contain a coiled-coil domain. Some annotation suggests it might be PKR, the Hepatitis delta antigen-interacting protein A, but this could not be confirmed.
Probab=21.75 E-value=2.1e+02 Score=28.58 Aligned_cols=54 Identities=26% Similarity=0.316 Sum_probs=0.0
Q ss_pred CCCCCCHHHHHHHHHH-HHHHHHHHhhhhhhHHHHHhhhhhhHH-HHHHHHHHhhC
Q psy9560 100 DTDDMAEDEIGDLCQE-YWQRIYNLEAIKYELDREIMLKDFEIT-ELGKQVNDLRG 153 (503)
Q Consensus 100 nIDGLSedELQEkCKE-LHdRIdkLEEEKYDLE~KVkKQDyEIn-ELniKVnDLRG 153 (503)
+|+.+++++|...-|| |-.++...|.||.-+-.-....-.|+| -|.+.++++|+
T Consensus 4 ~l~~~sDeell~~skeel~~rLR~~E~ek~~~m~~~g~lm~evNrrlQ~hl~EIR~ 59 (195)
T PF10226_consen 4 DLSKVSDEELLRWSKEELVRRLRRAEAEKMSLMVEHGRLMKEVNRRLQQHLNEIRG 59 (195)
T ss_pred ccccCCHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
No 312
>PLN02316 synthase/transferase
Probab=21.70 E-value=77 Score=37.89 Aligned_cols=35 Identities=29% Similarity=0.303 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHHhHHHHhhhhccCCCCCHH
Q psy9560 25 KRLKLEELERKRAEVRRRMEEASGARRKKGFMTPE 59 (503)
Q Consensus 25 ~kRKqeErERKkAEvRKRlEEA~KkKAKKgKMT~e 59 (503)
+++.++|.+|+|+|.|++|+...|+|+.+.|+.+.
T Consensus 442 ~~~~~~~~~r~k~~~~a~~~ae~k~~~~~~~l~~~ 476 (1036)
T PLN02316 442 ERRLREEAIRAKAEKTARMKAEMKEKTLKMFLLSQ 476 (1036)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc
Confidence 44455666777777777777666555445555533
No 313
>KOG0976|consensus
Probab=21.69 E-value=7.4e+02 Score=30.17 Aligned_cols=23 Identities=9% Similarity=0.079 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy9560 68 LRKKAAEELKREQERRALERKRV 90 (503)
Q Consensus 68 LLqKAaEELEKEKEqKeEEKKKi 90 (503)
+||.-..+|+.+++.....+..+
T Consensus 110 iLQn~c~~lE~ekq~lQ~ti~~~ 132 (1265)
T KOG0976|consen 110 ILQNKCLRLEMEKQKLQDTIQGA 132 (1265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666666665555444444444
No 314
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=21.68 E-value=4.7e+02 Score=28.68 Aligned_cols=20 Identities=25% Similarity=0.229 Sum_probs=8.2
Q ss_pred HHHHHHHHHHHHHHHHHhHH
Q psy9560 25 KRLKLEELERKRAEVRRRME 44 (503)
Q Consensus 25 ~kRKqeErERKkAEvRKRlE 44 (503)
+++|.+++.+||+..++|.|
T Consensus 337 ~~~~~~~k~~~k~~~~~~~~ 356 (429)
T PRK00247 337 TAEKNEAKARKKEIAQKRRA 356 (429)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444433333
No 315
>KOG2689|consensus
Probab=21.66 E-value=4.3e+02 Score=27.93 Aligned_cols=40 Identities=25% Similarity=0.336 Sum_probs=21.6
Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy9560 54 GFMTPERKKKLRLLLRKKAAEELKREQERRALERKRVIDE 93 (503)
Q Consensus 54 gKMT~eRKkkLKSLLLqKAaEELEKEKEqKeEEKKKiLAE 93 (503)
|.=+..-+..|+..-+..+.+..++++....++|+|||..
T Consensus 126 ~~~~~~a~~r~q~~e~~~~~qkRrreK~e~~eaRqRV~~~ 165 (290)
T KOG2689|consen 126 GDEMSAAKRRLQDDEMRRAAQKRRREKAEDEEARQRVLRQ 165 (290)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHH
Confidence 3333333444555555555566565555556666666553
No 316
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=21.66 E-value=1.4e+03 Score=27.35 Aligned_cols=38 Identities=21% Similarity=0.197 Sum_probs=14.8
Q ss_pred HHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 116 YWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 116 LHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
|...|..+.+..-.+..........|..|..++..++-
T Consensus 300 le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (1163)
T COG1196 300 LEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333334444444444443
No 317
>smart00513 SAP Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation.
Probab=21.54 E-value=75 Score=22.49 Aligned_cols=17 Identities=12% Similarity=0.045 Sum_probs=13.6
Q ss_pred CCCCHHHHHHHHHHHHH
Q psy9560 102 DDMAEDEIGDLCQEYWQ 118 (503)
Q Consensus 102 DGLSedELQEkCKELHd 118 (503)
..|+.++|+++|+++.-
T Consensus 2 ~~l~~~~Lk~~l~~~gl 18 (35)
T smart00513 2 AKLKVSELKDELKKRGL 18 (35)
T ss_pred CcCcHHHHHHHHHHcCC
Confidence 56889999999997643
No 318
>PF09302 XLF: XLF (XRCC4-like factor); InterPro: IPR015381 XLF (also called Cernunnos) interacts with the XRCC4-DNA ligase IV complex to promote DNA non-homologous end-joining. It directly interacts with the XRCC4-Ligase IV complex and siRNA-mediated downregulation of XLF in human cell lines leads to radio-sensitivity and impaired DNA non-homologous end-joining []. XLF is homologous to the yeast non-homologous end-joining factor Nej1 []. ; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z56_A 3RWR_D 3Q4F_A 3SR2_H 2R9A_A 2QM4_C.
Probab=21.48 E-value=1.2e+02 Score=27.63 Aligned_cols=42 Identities=21% Similarity=0.203 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
+++++.-|......+...-..|-..++++|.+|..|..++.|
T Consensus 130 ~~~L~~Pll~~~~~l~~~~~~L~~~l~~KD~~i~~l~~~~~~ 171 (171)
T PF09302_consen 130 LSHLNSPLLRMSSALQRQVESLKDLLKEKDKEIEKLRDKLED 171 (171)
T ss_dssp HHHTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTS-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 466777777888888888899999999999999999887543
No 319
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=21.42 E-value=1.6e+02 Score=25.15 Aligned_cols=46 Identities=20% Similarity=0.185 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHH-----HHHHhhC
Q psy9560 108 EIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGK-----QVNDLRG 153 (503)
Q Consensus 108 ELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELni-----KVnDLRG 153 (503)
.+....+.|-+.+..|-..+..|+..++.++.-+++|.. +|.-+-|
T Consensus 3 ~~~~~~q~l~~~~~~l~~~~~~l~~~~~E~~~v~~EL~~l~~d~~vy~~VG 53 (105)
T cd00632 3 EQLAQLQQLQQQLQAYIVQRQKVEAQLNENKKALEELEKLADDAEVYKLVG 53 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcchHHHHhh
Confidence 345556777777777777888888877777766666664 5555555
No 320
>PRK00846 hypothetical protein; Provisional
Probab=21.40 E-value=2e+02 Score=24.72 Aligned_cols=48 Identities=13% Similarity=0.023 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK 154 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK 154 (503)
+.|.+-.-+|=-+|.-.|.--=+|+.-|.+|...|..|+.+|.-|.++
T Consensus 9 ~~le~Ri~~LE~rlAfQe~tIe~LN~~v~~qq~~I~~L~~ql~~L~~r 56 (77)
T PRK00846 9 QALEARLVELETRLSFQEQALTELSEALADARLTGARNAELIRHLLED 56 (77)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444455566666666666666666655553
No 321
>PF13358 DDE_3: DDE superfamily endonuclease
Probab=21.33 E-value=70 Score=26.23 Aligned_cols=24 Identities=33% Similarity=0.560 Sum_probs=20.9
Q ss_pred ChhhhhHHhhccccCCCCccHHHH
Q psy9560 282 NPEELWLYLKDTVSASNPVSIDDL 305 (503)
Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~ 305 (503)
-.|.+|.+||..+.....-|+++|
T Consensus 122 piE~~w~~lk~~~~~~~~~~~~~L 145 (146)
T PF13358_consen 122 PIENVWGYLKRRIRRRRFSSIEEL 145 (146)
T ss_pred HHHHHHHHHHHHHHhcCCCChHHh
Confidence 357899999999999888889887
No 322
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=21.19 E-value=8.1e+02 Score=25.07 Aligned_cols=23 Identities=35% Similarity=0.417 Sum_probs=9.0
Q ss_pred hHHHHHhhhhhhHHHHHHHHHHh
Q psy9560 129 ELDREIMLKDFEITELGKQVNDL 151 (503)
Q Consensus 129 DLE~KVkKQDyEInELniKVnDL 151 (503)
+++.++...+-+|++++.++.+|
T Consensus 234 el~~el~~l~~~i~~~~~~k~~l 256 (325)
T PF08317_consen 234 ELQEELEELEEKIEELEEQKQEL 256 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333344444443333
No 323
>PRK06665 flgK flagellar hook-associated protein FlgK; Validated
Probab=21.11 E-value=2.4e+02 Score=31.62 Aligned_cols=47 Identities=23% Similarity=0.292 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.+.||..|+.--..|+..+-|+...+...-.+||.|-.+|.+|..+.
T Consensus 148 A~~La~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~ll~qIa~LN~qI 194 (627)
T PRK06665 148 AQSLGERIHDRYRSLERIRDMANDEIEITVEEINNILRNIADLNEQI 194 (627)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36688888888889999999999999888888888888888887654
No 324
>PF02646 RmuC: RmuC family; InterPro: IPR003798 This protein contains several bacterial RmuC DNA recombination proteins. The function of the RMUC protein is unknown but it is suspected that it is either a structural protein that protects DNA against nuclease action, or is itself involved in DNA cleavage at the regions of DNA secondary structures []. Proteins in this family are predicted to contain a central endonuclease-like fold domain, surrounded by coiled coils, consistent with a direct role in DNA cleavage [, ].
Probab=20.94 E-value=1.7e+02 Score=29.70 Aligned_cols=44 Identities=11% Similarity=0.132 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
.++.+++.+||+++..+.+.--++...+.+-...++++..++.+
T Consensus 236 ~~I~~~~~~l~~~~~~~~~~~~~l~k~l~~a~~~~~~~~~~~~~ 279 (304)
T PF02646_consen 236 EEIAELAGKLYDRFGKFVEHLEKLGKSLDKAVKSYNKAVGSLEK 279 (304)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666777777777777655545554444444444444444433
No 325
>PRK02793 phi X174 lysis protein; Provisional
Probab=20.90 E-value=4.9e+02 Score=21.62 Aligned_cols=26 Identities=19% Similarity=0.321 Sum_probs=13.8
Q ss_pred hhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 128 YELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 128 YDLE~KVkKQDyEInELniKVnDLRG 153 (503)
=+|+.-|.+|..+|..|..++..|..
T Consensus 25 e~Ln~~v~~Qq~~I~~L~~~l~~L~~ 50 (72)
T PRK02793 25 EELNVTVTAHEMEMAKLRDHLRLLTE 50 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555555555555555555544
No 326
>PF14271 DUF4359: Domain of unknown function (DUF4359)
Probab=20.88 E-value=1.4e+02 Score=26.25 Aligned_cols=48 Identities=23% Similarity=0.396 Sum_probs=33.4
Q ss_pred hhhHHhhcccc--CCCCccHHHHHHHHHHhhhcCchhhHHHHHHHHHhhhh
Q psy9560 285 ELWLYLKDTVS--ASNPVSIDDLVEKLMSAVSATPQPLLKSVVKDTLRRQA 333 (503)
Q Consensus 285 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (503)
.|-.||++-|. ...|--+..++..+...|+.- ++.++.+|....+||-
T Consensus 21 ~l~~~l~~~~c~~~~~p~~l~~~~~~c~~lv~~~-~~~i~~~i~~~t~r~N 70 (107)
T PF14271_consen 21 QLTTYLKKEVCDEKQLPGFLRSLIKNCKRLVDSQ-RPQIEALIDQSTTRQN 70 (107)
T ss_pred HHHHHHHHHHhccccCchHHHHHHHHHHHHHHhh-hHHHHHHHHhhhhhcc
Confidence 45667777777 445544555565555555544 9999999999999983
No 327
>COG3416 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.67 E-value=3.6e+02 Score=27.58 Aligned_cols=13 Identities=54% Similarity=0.728 Sum_probs=11.8
Q ss_pred CCHHHHHHHHHHH
Q psy9560 56 MTPERKKKLRLLL 68 (503)
Q Consensus 56 MT~eRKkkLKSLL 68 (503)
||++-++.|++|+
T Consensus 1 MtpeE~qlle~lf 13 (233)
T COG3416 1 MTPEEKQLLENLF 13 (233)
T ss_pred CCHHHHHHHHHHH
Confidence 8999999999987
No 328
>TIGR02492 flgK_ends flagellar hook-associated protein FlgK. The flagellar hook-associated protein FlgK of bacterial flagella has conserved N- and C-terminal domains. The central region is highly variable in length and sequence, and often contains substantial runs of low-complexity sequence. This model is built from an alignment of FlgK sequences with the central region excised. Note that several other proteins of the flagellar apparatus also are homologous in the N- and C-terminal regions to FlgK, but are excluded from this model.
Probab=20.65 E-value=2.9e+02 Score=28.02 Aligned_cols=47 Identities=23% Similarity=0.275 Sum_probs=37.6
Q ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCCC
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGKF 155 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGKF 155 (503)
.+.|+..|+.--..|+..+.++...+...-.+||.|-.+|.+|..+.
T Consensus 136 a~~l~~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~ll~~Ia~lN~~I 182 (322)
T TIGR02492 136 AQALANSFNQTSNELQDLRKGINAEIKSAVTEINSLLKQIASLNKEI 182 (322)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35677777777778888888898888888888888888888887654
No 329
>PF14282 FlxA: FlxA-like protein
Probab=20.46 E-value=3.1e+02 Score=24.01 Aligned_cols=43 Identities=12% Similarity=0.223 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHHHHHHhhhhh-hHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 107 DEIGDLCQEYWQRIYNLEAIKY-ELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 107 dELQEkCKELHdRIdkLEEEKY-DLE~KVkKQDyEInELniKVnDLRG 153 (503)
..|+.-.+.|-+.|..|..... |-+.|- ..|..|..+|.+|..
T Consensus 22 ~~L~~Qi~~Lq~ql~~l~~~~~~~~e~k~----~q~q~Lq~QI~~Lqa 65 (106)
T PF14282_consen 22 EQLQKQIKQLQEQLQELSQDSDLDAEQKQ----QQIQLLQAQIQQLQA 65 (106)
T ss_pred HHHHHHHHHHHHHHHHHHcccCCCHHHHH----HHHHHHHHHHHHHHH
Confidence 4577777788888888877544 555553 335666666666665
No 330
>KOG4674|consensus
Probab=20.41 E-value=4e+02 Score=34.29 Aligned_cols=63 Identities=16% Similarity=0.119 Sum_probs=52.2
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhCC-CCCCccccccccH
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRGK-FVRPILKKVSKYE 167 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRGK-FKKPaLKKVRkSA 167 (503)
....|++.|+++-.+|.+|+.+.-+.+.+|.-...+...|+....++.-. ..+|+-..+--+.
T Consensus 566 ~~~~~k~~~~~a~e~i~~L~~~l~e~~~~i~sLl~erd~y~e~l~~~e~~~~~k~nss~~~~t~ 629 (1822)
T KOG4674|consen 566 LQNILKETINEASEKIAELEKELEEQEQRIESLLTERDMYKELLAELEDSHQLKPNSSALDQTE 629 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCCchhhcccc
Confidence 34678999999999999999999999999999999999998888888776 4666655554444
No 331
>PF10481 CENP-F_N: Cenp-F N-terminal domain; InterPro: IPR018463 Mitosin or centromere-associated protein-F (Cenp-F) is found bound across the centromere as one of the proteins of the outer layer of the kinetochore []. Most of the kinetochore/centromere functions appear to depend upon binding of the C-terminal part of the molecule, whereas the N-terminal part, here, may be a cytoplasmic player in controlling the function of microtubules and dynein [].
Probab=20.38 E-value=4.6e+02 Score=27.81 Aligned_cols=73 Identities=10% Similarity=0.150 Sum_probs=44.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhcCCCCCCCCCCHHHHHHHHHHHHHHHH-------HHhhhhhhHHHHHhhhhhhHHHH
Q psy9560 72 AAEELKREQERRALERKRVIDERCGEPKDTDDMAEDEIGDLCQEYWQRIY-------NLEAIKYELDREIMLKDFEITEL 144 (503)
Q Consensus 72 AaEELEKEKEqKeEEKKKiLAERcpPPLnIDGLSedELQEkCKELHdRId-------kLEEEKYDLE~KVkKQDyEInEL 144 (503)
-..-|.+.++.-+.+|..|-.=.+ . .+-|.+.|..|-..-+ .=|..-+-||-.+...+|.|.-|
T Consensus 44 lEAaLqKQKqK~e~ek~e~s~LkR------E---nq~l~e~c~~lek~rqKlshdlq~Ke~qv~~lEgQl~s~Kkqie~L 114 (307)
T PF10481_consen 44 LEAALQKQKQKVEEEKNEYSALKR------E---NQSLMESCENLEKTRQKLSHDLQVKESQVNFLEGQLNSCKKQIEKL 114 (307)
T ss_pred HHHHHHHHHHHHHHHhhhhhhhhh------h---hhhHHHHHHHHHHHHHHhhHHHhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 334455555555555555533222 1 1335556655544433 34556778888888888888888
Q ss_pred HHHHHHhhC
Q psy9560 145 GKQVNDLRG 153 (503)
Q Consensus 145 niKVnDLRG 153 (503)
++.|..++.
T Consensus 115 eqelkr~Ks 123 (307)
T PF10481_consen 115 EQELKRCKS 123 (307)
T ss_pred HHHHHHHHH
Confidence 888877766
No 332
>cd01109 HTH_YyaN Helix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB. Putative helix-turn-helix (HTH) MerR-like transcription regulators of Bacillus subtilis, YyaN and YraB, and related proteins; N-terminal domain. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=20.38 E-value=4e+02 Score=22.84 Aligned_cols=11 Identities=18% Similarity=0.368 Sum_probs=5.1
Q ss_pred CCCHHHHHHHH
Q psy9560 103 DMAEDEIGDLC 113 (503)
Q Consensus 103 GLSedELQEkC 113 (503)
|++.++++++.
T Consensus 57 G~sL~eI~~~l 67 (113)
T cd01109 57 GMSIKDIKEYA 67 (113)
T ss_pred CCCHHHHHHHH
Confidence 34555554443
No 333
>PF11214 Med2: Mediator complex subunit 2; InterPro: IPR021017 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This family of mediator complex subunit 2 proteins is conserved in fungi. Cyclin-dependent kinase CDK8 or Srb10 interacts with and phosphorylates Med2. Post-translational modifications of Mediator subunits are important for regulation of gene expression [, ].
Probab=20.36 E-value=1.2e+02 Score=27.64 Aligned_cols=26 Identities=35% Similarity=0.331 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHhhh
Q psy9560 24 AKRLKLEELERKRAEVRRRMEEASGA 49 (503)
Q Consensus 24 d~kRKqeErERKkAEvRKRlEEA~Kk 49 (503)
+-+.+++|...++++.|+|.|+.+++
T Consensus 80 e~~q~~ee~K~keE~E~~rkE~e~~k 105 (105)
T PF11214_consen 80 ELKQKQEEEKIKEEEERKRKEEEEKK 105 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcC
No 334
>COG0011 Uncharacterized conserved protein [Function unknown]
Probab=20.30 E-value=25 Score=31.34 Aligned_cols=53 Identities=11% Similarity=0.098 Sum_probs=36.2
Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhhC
Q psy9560 100 DTDDMAEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLRG 153 (503)
Q Consensus 100 nIDGLSedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLRG 153 (503)
.|.| +.++|.++.|++|+.+......|+-...|+-..--....+..||.-.+.
T Consensus 45 ~iEg-~~del~~~ik~~~Ea~~~~g~~Rv~t~ikId~R~d~~~t~e~Kv~~v~e 97 (100)
T COG0011 45 VIEG-ELDELMEAVKEAHEAVFEKGAPRVSTVIKIDERRDKELTMEEKVKSVEE 97 (100)
T ss_pred EEEe-cHHHHHHHHHHHHHHHHhcCCceEEEEEEeeeecCccccHHHHHHHHHH
Confidence 4677 8999999999999999998777776655554433333444445544443
No 335
>PF05266 DUF724: Protein of unknown function (DUF724); InterPro: IPR007930 This family contains several uncharacterised proteins found exclusively in Arabidopsis thaliana.
Probab=20.29 E-value=8e+02 Score=23.85 Aligned_cols=48 Identities=23% Similarity=0.252 Sum_probs=33.1
Q ss_pred CHHHHHHHHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHHHHHHhh
Q psy9560 105 AEDEIGDLCQEYWQRIYNLEAIKYELDREIMLKDFEITELGKQVNDLR 152 (503)
Q Consensus 105 SedELQEkCKELHdRIdkLEEEKYDLE~KVkKQDyEInELniKVnDLR 152 (503)
..++|.+..++|-.+|..|..+.+-|..+....+.+|..|...++.|.
T Consensus 125 ~~~~~e~~i~~Le~ki~el~~~~~~~~~~ke~~~~ei~~lks~~~~l~ 172 (190)
T PF05266_consen 125 ELKELESEIKELEMKILELQRQAAKLKEKKEAKDKEISRLKSEAEALK 172 (190)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345566677777777777777777777777777777777776666654
No 336
>PF01895 PhoU: PhoU domain; InterPro: IPR008170 This family contains phosphate regulatory proteins including PhoU. PhoU proteins are known to play a role in the regulation of phosphate uptake. The PhoU domain is composed of a three helix bundle []. The PhoU protein contains two copies of this domain. The domain binds to an iron cluster via its conserved E/DXXXD motif. Deletion of PhoU activates constitutive expression of the phosphate ABC transporter and allows phosphate transport, but causes a growth defect; suggesting that the protein has some secondary function []. ; PDB: 2I0M_A 1T72_B 1T8B_A 1SUM_B 1VCT_A 2BKN_A 2BKP_A 2BKO_A.
Probab=20.27 E-value=3.8e+02 Score=20.35 Aligned_cols=42 Identities=29% Similarity=0.410 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHH-HHHhhhhhhHHHHHhhhhhhHHHHHHHHHH
Q psy9560 109 IGDLCQEYWQRI-YNLEAIKYELDREIMLKDFEITELGKQVND 150 (503)
Q Consensus 109 LQEkCKELHdRI-dkLEEEKYDLE~KVkKQDyEInELniKVnD 150 (503)
+-++|.++++.+ .-+.+..-++-.++.+.+.+|+.|..++.+
T Consensus 4 m~~~~~~~l~~~~~~~~~~d~~~a~~i~~~e~~id~~~~~~~~ 46 (88)
T PF01895_consen 4 MGELVEEMLDDAIEAFEERDSELAQEIIQLEEEIDELYREIRR 46 (88)
T ss_dssp HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHcCCHHHHHHHHHhHHHHHHHHHHHHH
Confidence 444455544443 344555556667777777777777776644
No 337
>PF07323 DUF1465: Protein of unknown function (DUF1465); InterPro: IPR010848 This family consists of several hypothetical bacterial proteins of around 180 residues in length. The function of this family is unknown.; PDB: 3CTW_D.
Probab=20.20 E-value=37 Score=32.29 Aligned_cols=22 Identities=18% Similarity=0.430 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHhhhhhhH
Q psy9560 109 IGDLCQEYWQRIYNLEAIKYEL 130 (503)
Q Consensus 109 LQEkCKELHdRIdkLEEEKYDL 130 (503)
|-+-...||++|..||..-|.-
T Consensus 114 Li~rS~rL~~RV~rLD~~~~~~ 135 (156)
T PF07323_consen 114 LIERSERLYERVARLDRMIYEP 135 (156)
T ss_dssp HHHHHHHHHHHHHHHHHHHHT-
T ss_pred HHHHHHHHHHHHHHHHHHhccc
Confidence 3334467899999999887764
No 338
>PTZ00400 DnaK-type molecular chaperone; Provisional
Probab=20.13 E-value=4.1e+02 Score=29.81 Aligned_cols=18 Identities=11% Similarity=0.187 Sum_probs=14.8
Q ss_pred CHHHHHHHHHHHHHHHHH
Q psy9560 105 AEDEIGDLCQEYWQRIYN 122 (503)
Q Consensus 105 SedELQEkCKELHdRIdk 122 (503)
...+|++++.++-+||+.
T Consensus 595 ere~i~~~l~~~~~WL~~ 612 (663)
T PTZ00400 595 DKDELKQKITKLRSTLSS 612 (663)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 346789999999999974
No 339
>KOG1118|consensus
Probab=20.12 E-value=2.5e+02 Score=30.30 Aligned_cols=27 Identities=11% Similarity=0.068 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHhhhhhhHHHHHhhhhh
Q psy9560 113 CQEYWQRIYNLEAIKYELDREIMLKDF 139 (503)
Q Consensus 113 CKELHdRIdkLEEEKYDLE~KVkKQDy 139 (503)
.|++|.+..+||+-|.|+.++.+|+-+
T Consensus 146 lK~i~hh~KKLEgRRldyD~kkkk~~K 172 (366)
T KOG1118|consen 146 LKDIQHHRKKLEGRRLDYDYKKKKQGK 172 (366)
T ss_pred HHHHHHHHHHhhhhhhHHHHHHHHhcc
Confidence 489999999999999999999998855
No 340
>PF10552 ORF6C: ORF6C domain; InterPro: IPR018878 This entry represents the carboxy-terminal domain from ORF6 (Q9B012 from SWISSPROT), an antirepressor protein from Lactococcus phage bIL285 [].
Probab=20.03 E-value=4.2e+02 Score=23.24 Aligned_cols=77 Identities=18% Similarity=0.175 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHHhhhhhhHHHHHhhhhhhHHHHHH----HHHHhhCCCCCCccccccccHHHHHHHH---hhhhccchhh
Q psy9560 112 LCQEYWQRIYNLEAIKYELDREIMLKDFEITELGK----QVNDLRGKFVRPILKKVSKYENKFAKLQ---KKAAEFNFRS 184 (503)
Q Consensus 112 kCKELHdRIdkLEEEKYDLE~KVkKQDyEInELni----KVnDLRGKFKKPaLKKVRkSAnm~a~L~---KhkvsmDfRA 184 (503)
..+++-++|..|+..--||+..+.-..-+-..|+. ||.++-|-...++-++.+.+---|..|- |+.-.++-+.
T Consensus 9 ~~~~~~~ki~~ve~~V~~l~~~~~i~~~q~~~i~~~v~~rv~~~lgg~~s~ay~~~~~~~k~f~~i~~~lk~~F~V~sY~ 88 (116)
T PF10552_consen 9 ATEEHNEKIEEVENRVDDLEENMPIDPGQQKEIQKAVKSRVYELLGGKGSPAYKDKSFRRKLFSDIYRDLKRHFGVPSYK 88 (116)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhccccchhhhhHHhHHHHHHHHHHHHHHhCCchHH
Confidence 34455555555555555555444443333334333 5556655455566666333332244442 3334455555
Q ss_pred hccc
Q psy9560 185 QLKA 188 (503)
Q Consensus 185 NLKq 188 (503)
+|+.
T Consensus 89 ~I~~ 92 (116)
T PF10552_consen 89 DIPR 92 (116)
T ss_pred hhhH
Confidence 5544
Done!