Query psy4154
Match_columns 346
No_of_seqs 224 out of 407
Neff 5.9
Searched_HMMs 46136
Date Fri Aug 16 18:16:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4154.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4154hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03194 LUC7: LUC7 N_terminus 100.0 1.4E-69 3.1E-74 511.7 19.3 217 10-259 12-234 (254)
2 KOG0796|consensus 100.0 4E-66 8.7E-71 492.3 19.3 216 10-258 13-229 (319)
3 COG5200 LUC7 U1 snRNP componen 100.0 3.1E-51 6.6E-56 368.9 16.0 213 10-256 13-226 (258)
4 PF03194 LUC7: LUC7 N_terminus 99.9 1.4E-26 3E-31 219.2 12.0 146 188-337 2-164 (254)
5 KOG0796|consensus 99.9 5.4E-24 1.2E-28 203.8 13.0 142 190-336 5-162 (319)
6 COG5200 LUC7 U1 snRNP componen 99.2 3.2E-11 6.9E-16 110.0 9.4 144 187-337 2-162 (258)
7 PHA02562 46 endonuclease subun 94.6 6.8 0.00015 40.8 23.7 13 216-228 285-297 (562)
8 PF10168 Nup88: Nuclear pore c 92.8 18 0.00039 39.7 21.5 32 149-182 576-607 (717)
9 PF13779 DUF4175: Domain of un 91.4 23 0.00049 39.6 19.9 73 116-188 461-567 (820)
10 PF04012 PspA_IM30: PspA/IM30 91.3 12 0.00026 34.4 20.1 82 116-201 26-107 (221)
11 KOG0994|consensus 88.3 56 0.0012 37.9 21.5 46 152-197 1511-1561(1758)
12 TIGR02302 aProt_lowcomp conser 87.1 4 8.6E-05 45.5 10.1 76 115-190 491-599 (851)
13 KOG0250|consensus 84.3 76 0.0017 36.4 18.2 21 25-45 181-201 (1074)
14 PRK02224 chromosome segregatio 83.9 75 0.0016 35.2 22.0 12 215-226 451-462 (880)
15 PF06008 Laminin_I: Laminin Do 82.8 44 0.00096 31.7 20.2 90 109-199 49-141 (264)
16 PF07361 Cytochrom_B562: Cytoc 80.6 6.4 0.00014 32.5 6.5 31 309-339 69-99 (103)
17 PLN03229 acetyl-coenzyme A car 79.6 1.1E+02 0.0023 34.0 18.5 40 94-133 440-479 (762)
18 TIGR02302 aProt_lowcomp conser 76.4 11 0.00025 42.0 8.7 32 304-335 569-600 (851)
19 KOG0933|consensus 75.4 1.6E+02 0.0034 33.8 17.9 79 260-339 801-884 (1174)
20 KOG0971|consensus 73.8 1.7E+02 0.0036 33.4 20.1 26 113-138 269-294 (1243)
21 COG0419 SbcC ATPase involved i 72.9 1.7E+02 0.0036 33.0 21.5 24 215-239 457-480 (908)
22 PF10805 DUF2730: Protein of u 72.3 34 0.00073 28.4 8.7 72 245-316 23-99 (106)
23 PRK01156 chromosome segregatio 71.9 1.7E+02 0.0037 32.6 22.4 11 217-227 451-461 (895)
24 KOG0996|consensus 71.4 2.1E+02 0.0045 33.4 20.3 24 259-282 923-946 (1293)
25 PF13779 DUF4175: Domain of un 68.5 21 0.00045 39.9 8.5 35 302-336 537-571 (820)
26 KOG0971|consensus 67.3 2.3E+02 0.005 32.3 17.5 51 264-318 504-554 (1243)
27 PRK10884 SH3 domain-containing 66.7 1.1E+02 0.0024 28.4 12.0 74 121-199 94-167 (206)
28 smart00356 ZnF_C3H1 zinc finge 66.2 2.8 6E-05 25.4 0.7 23 28-59 4-26 (27)
29 PRK06835 DNA replication prote 64.4 88 0.0019 31.0 11.3 67 153-223 37-106 (329)
30 PF06160 EzrA: Septation ring 63.0 2.1E+02 0.0046 30.4 19.7 89 112-200 100-212 (560)
31 PF13863 DUF4200: Domain of un 63.0 87 0.0019 25.9 10.1 67 248-318 36-103 (126)
32 PRK04778 septation ring format 62.8 2.1E+02 0.0046 30.4 18.2 43 297-339 511-553 (569)
33 PF13863 DUF4200: Domain of un 60.5 87 0.0019 25.9 9.1 54 115-172 48-101 (126)
34 PF07361 Cytochrom_B562: Cytoc 59.1 72 0.0016 26.3 8.1 40 154-193 55-97 (103)
35 TIGR00570 cdk7 CDK-activating 57.9 2E+02 0.0044 28.5 13.9 34 30-66 25-58 (309)
36 PRK15058 cytochrome b562; Prov 57.4 11 0.00023 32.7 3.0 32 309-340 94-125 (128)
37 PRK04406 hypothetical protein; 56.7 80 0.0017 24.7 7.6 48 124-171 8-58 (75)
38 PF12998 ING: Inhibitor of gro 54.2 1.1E+02 0.0024 24.3 8.6 75 120-195 22-99 (105)
39 KOG4403|consensus 52.8 3E+02 0.0064 28.9 16.0 40 162-201 286-325 (575)
40 KOG4727|consensus 52.5 85 0.0018 28.7 7.9 29 213-244 73-101 (193)
41 PRK09720 cybC cytochrome b562; 52.1 62 0.0013 26.9 6.5 41 152-192 50-93 (100)
42 KOG2391|consensus 51.8 90 0.002 31.4 8.7 67 258-325 226-295 (365)
43 PRK09720 cybC cytochrome b562; 51.7 12 0.00026 31.1 2.3 32 309-340 66-97 (100)
44 PF07544 Med9: RNA polymerase 51.7 81 0.0018 24.9 7.0 58 260-318 24-81 (83)
45 PF04012 PspA_IM30: PspA/IM30 50.9 1.4E+02 0.0031 27.2 9.7 70 260-333 26-95 (221)
46 cd00890 Prefoldin Prefoldin is 50.6 1.4E+02 0.003 24.5 12.1 74 216-314 52-126 (129)
47 KOG2391|consensus 50.4 75 0.0016 32.0 7.9 24 180-203 262-285 (365)
48 KOG3454|consensus 50.0 17 0.00038 32.7 3.2 29 216-245 4-32 (165)
49 PF07544 Med9: RNA polymerase 49.4 87 0.0019 24.7 6.9 56 117-173 25-80 (83)
50 PRK15058 cytochrome b562; Prov 49.3 44 0.00095 29.0 5.4 41 153-193 79-122 (128)
51 TIGR00606 rad50 rad50. This fa 48.3 5.3E+02 0.011 30.5 21.3 34 295-328 798-831 (1311)
52 KOG4674|consensus 47.9 6.2E+02 0.013 31.2 23.2 80 258-337 927-1012(1822)
53 PRK02119 hypothetical protein; 46.9 1.3E+02 0.0028 23.3 7.3 47 124-170 6-55 (73)
54 KOG4117|consensus 46.9 1E+02 0.0022 23.8 6.4 52 258-312 13-64 (73)
55 PF00642 zf-CCCH: Zinc finger 46.5 9.4 0.0002 23.8 0.7 25 27-60 2-27 (27)
56 PRK03947 prefoldin subunit alp 46.3 1.9E+02 0.004 24.6 12.2 79 215-311 58-137 (140)
57 PLN03229 acetyl-coenzyme A car 44.8 4.5E+02 0.0098 29.3 13.3 26 109-134 597-622 (762)
58 PF10475 DUF2450: Protein of u 44.4 3E+02 0.0065 26.5 11.6 83 253-336 63-158 (291)
59 COG1579 Zn-ribbon protein, pos 44.3 2.9E+02 0.0064 26.4 22.2 41 294-338 147-187 (239)
60 KOG0994|consensus 44.1 6.2E+02 0.013 30.1 19.3 30 114-143 1578-1607(1758)
61 PLN02372 violaxanthin de-epoxi 43.8 4E+02 0.0086 27.7 13.3 116 54-203 327-450 (455)
62 PF05008 V-SNARE: Vesicle tran 43.4 1.4E+02 0.0031 22.6 8.9 50 148-201 28-77 (79)
63 PF04102 SlyX: SlyX; InterPro 43.3 1E+02 0.0022 23.5 6.2 33 141-173 21-53 (69)
64 PHA02562 46 endonuclease subun 42.9 4.1E+02 0.0088 27.6 17.5 23 296-318 351-373 (562)
65 PF12854 PPR_1: PPR repeat 42.5 30 0.00065 22.3 2.7 19 168-186 14-32 (34)
66 COG3783 CybC Soluble cytochrom 41.7 42 0.00092 27.7 4.0 27 166-192 67-93 (100)
67 PRK04406 hypothetical protein; 40.8 1.6E+02 0.0036 22.9 7.1 50 268-317 8-60 (75)
68 KOG0250|consensus 40.7 6.4E+02 0.014 29.3 17.8 20 261-280 369-388 (1074)
69 PF12269 zf-CpG_bind_C: CpG bi 40.6 48 0.001 31.6 4.8 53 214-270 157-217 (236)
70 PF12171 zf-C2H2_jaz: Zinc-fin 39.8 13 0.00029 22.7 0.7 25 216-243 2-26 (27)
71 PRK00736 hypothetical protein; 39.7 1.5E+02 0.0033 22.6 6.6 31 141-171 22-52 (68)
72 PRK04325 hypothetical protein; 38.4 1.9E+02 0.0041 22.4 7.1 31 141-171 26-56 (74)
73 TIGR02977 phageshock_pspA phag 38.4 3.2E+02 0.007 25.2 21.1 81 116-200 27-107 (219)
74 PF06825 HSBP1: Heat shock fac 37.9 98 0.0021 22.9 5.0 49 113-168 3-51 (54)
75 PRK10564 maltose regulon perip 37.8 37 0.00079 33.6 3.6 32 307-338 259-290 (303)
76 KOG0977|consensus 37.3 5.5E+02 0.012 27.6 17.6 15 322-336 356-370 (546)
77 PRK02793 phi X174 lysis protei 37.2 1.9E+02 0.0042 22.3 7.2 28 143-170 27-54 (72)
78 COG5136 U1 snRNP-specific prot 37.0 26 0.00057 31.6 2.3 35 216-254 4-38 (188)
79 PF02403 Seryl_tRNA_N: Seryl-t 36.9 1.5E+02 0.0032 24.0 6.7 46 153-200 44-89 (108)
80 COG1340 Uncharacterized archae 36.8 4.3E+02 0.0093 26.1 20.9 49 144-197 33-81 (294)
81 PF14282 FlxA: FlxA-like prote 36.5 1.5E+02 0.0032 24.5 6.6 53 263-315 18-70 (106)
82 PF06220 zf-U1: U1 zinc finger 36.2 10 0.00023 25.8 -0.3 29 216-245 4-32 (38)
83 smart00451 ZnF_U1 U1-like zinc 36.0 8.4 0.00018 24.7 -0.8 26 217-245 5-30 (35)
84 PF03962 Mnd1: Mnd1 family; I 35.6 3E+02 0.0066 25.0 9.1 64 138-201 62-126 (188)
85 PF02151 UVR: UvrB/uvrC motif; 35.6 1.3E+02 0.0028 19.8 5.1 34 159-192 2-35 (36)
86 PF09969 DUF2203: Uncharacteri 35.5 2.8E+02 0.006 23.6 8.3 26 149-174 47-72 (120)
87 KOG0995|consensus 35.1 6.1E+02 0.013 27.4 17.7 70 259-332 310-384 (581)
88 PF14282 FlxA: FlxA-like prote 34.5 1.6E+02 0.0034 24.3 6.5 20 119-138 18-37 (106)
89 PF03980 Nnf1: Nnf1 ; InterPr 34.4 2.5E+02 0.0055 22.8 9.4 68 248-315 18-99 (109)
90 PF09543 DUF2379: Protein of u 34.1 1.5E+02 0.0031 25.6 6.2 24 164-187 77-100 (121)
91 PF03357 Snf7: Snf7; InterPro 34.1 2.9E+02 0.0063 23.6 8.5 51 150-200 13-63 (171)
92 PF06103 DUF948: Bacterial pro 33.9 2.3E+02 0.005 22.2 8.2 19 153-171 41-59 (90)
93 KOG4797|consensus 33.7 1.1E+02 0.0025 25.8 5.4 63 268-336 45-107 (123)
94 PF15397 DUF4618: Domain of un 33.5 3.2E+02 0.007 26.4 9.2 68 248-315 39-107 (258)
95 PRK10698 phage shock protein P 33.5 4E+02 0.0087 24.8 21.1 161 116-317 27-187 (222)
96 PF04102 SlyX: SlyX; InterPro 33.5 1.5E+02 0.0032 22.6 5.7 32 287-318 23-54 (69)
97 PF02388 FemAB: FemAB family; 33.4 2.5E+02 0.0054 28.5 9.0 26 113-138 242-267 (406)
98 PRK00239 rpsT 30S ribosomal pr 33.3 2.6E+02 0.0056 22.5 7.5 35 300-334 23-57 (88)
99 PF09969 DUF2203: Uncharacteri 33.1 3.1E+02 0.0066 23.3 8.4 27 292-318 46-72 (120)
100 TIGR00029 S20 ribosomal protei 32.8 2.6E+02 0.0057 22.5 7.5 35 300-334 23-57 (87)
101 PF05597 Phasin: Poly(hydroxya 32.0 1.7E+02 0.0037 25.4 6.5 21 152-172 109-129 (132)
102 PF12874 zf-met: Zinc-finger o 31.5 11 0.00023 22.4 -0.8 24 217-243 2-25 (25)
103 KOG1040|consensus 30.8 33 0.00072 34.2 2.1 32 25-65 74-105 (325)
104 COG1842 PspA Phage shock prote 30.1 3.2E+02 0.007 25.7 8.5 93 232-334 5-97 (225)
105 PF06248 Zw10: Centromere/kine 30.1 7E+02 0.015 26.5 12.1 23 117-139 11-33 (593)
106 PRK09039 hypothetical protein; 30.0 5.7E+02 0.012 25.5 19.6 11 153-163 54-64 (343)
107 KOG4182|consensus 30.0 7.2E+02 0.016 26.7 17.9 47 149-197 78-124 (828)
108 PF10146 zf-C4H2: Zinc finger- 30.0 4.8E+02 0.01 24.7 10.7 50 151-200 52-103 (230)
109 COG4942 Membrane-bound metallo 30.0 6.5E+02 0.014 26.2 20.1 20 262-281 169-188 (420)
110 PF10805 DUF2730: Protein of u 29.9 3.2E+02 0.0069 22.5 8.1 27 115-141 37-63 (106)
111 PF05667 DUF812: Protein of un 29.5 7.6E+02 0.016 26.8 16.5 40 257-308 551-590 (594)
112 PF06694 Plant_NMP1: Plant nuc 29.5 5.8E+02 0.013 25.5 10.9 139 146-310 169-319 (325)
113 COG2178 Predicted RNA-binding 29.2 4.8E+02 0.01 24.4 16.8 130 151-332 19-148 (204)
114 PF05278 PEARLI-4: Arabidopsis 28.9 5.2E+02 0.011 25.2 9.8 67 239-311 163-236 (269)
115 PRK11637 AmiB activator; Provi 28.9 6.3E+02 0.014 25.6 22.3 198 115-334 49-252 (428)
116 PRK00736 hypothetical protein; 28.3 2.7E+02 0.0058 21.2 6.5 31 287-317 24-54 (68)
117 KOG0977|consensus 28.1 5.7E+02 0.012 27.5 10.7 37 153-194 350-386 (546)
118 PRK02119 hypothetical protein; 28.0 2.9E+02 0.0062 21.4 6.9 49 269-317 7-58 (73)
119 PF02388 FemAB: FemAB family; 27.5 2.4E+02 0.0052 28.6 7.7 48 258-313 243-290 (406)
120 PRK00846 hypothetical protein; 27.4 3.1E+02 0.0068 21.6 7.2 24 147-170 36-59 (77)
121 PRK00295 hypothetical protein; 27.2 2.8E+02 0.0061 21.1 7.1 31 141-171 22-52 (68)
122 KOG0161|consensus 27.2 1.3E+03 0.029 28.8 19.5 19 300-318 1214-1232(1930)
123 PRK04778 septation ring format 27.1 7.8E+02 0.017 26.2 22.8 88 113-200 105-216 (569)
124 PRK00420 hypothetical protein; 27.0 3.8E+02 0.0083 22.6 7.6 13 216-228 24-36 (112)
125 COG2433 Uncharacterized conser 27.0 8.6E+02 0.019 26.6 14.5 22 153-174 416-437 (652)
126 PF06156 DUF972: Protein of un 26.7 2.7E+02 0.0058 23.3 6.6 46 154-199 10-57 (107)
127 COG5152 Uncharacterized conser 26.1 25 0.00055 32.8 0.3 25 26-59 139-164 (259)
128 PRK14011 prefoldin subunit alp 26.0 4.5E+02 0.0098 23.0 10.9 15 214-228 51-65 (144)
129 KOG3032|consensus 25.9 3.7E+02 0.0081 25.8 8.0 33 173-205 199-231 (264)
130 PF05597 Phasin: Poly(hydroxya 25.7 2.8E+02 0.0061 24.0 6.7 40 268-317 91-130 (132)
131 PF05289 BLYB: Borrelia hemoly 25.5 2.1E+02 0.0046 23.9 5.6 59 236-313 27-85 (105)
132 COG4477 EzrA Negative regulato 24.7 9E+02 0.019 26.0 17.2 42 159-200 174-215 (570)
133 PRK09039 hypothetical protein; 24.5 7.1E+02 0.015 24.8 16.7 22 119-140 45-66 (343)
134 PLN00204 CP12 gene family prot 24.5 2.2E+02 0.0048 24.6 5.7 40 156-195 56-96 (126)
135 PF10458 Val_tRNA-synt_C: Valy 24.2 3.1E+02 0.0066 20.5 6.6 55 117-171 8-65 (66)
136 PRK10564 maltose regulon perip 23.7 92 0.002 30.8 3.7 26 165-190 261-286 (303)
137 PF04420 CHD5: CHD5-like prote 23.5 3.2E+02 0.0069 24.2 6.9 51 151-201 39-89 (161)
138 KOG4360|consensus 23.4 9.4E+02 0.02 25.8 11.5 63 111-173 196-261 (596)
139 PF06103 DUF948: Bacterial pro 23.4 3.6E+02 0.0078 21.0 8.8 24 115-138 21-44 (90)
140 PF10538 ITAM_Cys-rich: Immuno 23.3 40 0.00087 20.8 0.7 8 1-8 14-21 (24)
141 PRK04863 mukB cell division pr 23.1 1.4E+03 0.03 27.7 13.9 25 261-285 1237-1261(1486)
142 PF10979 DUF2786: Protein of u 23.0 1.2E+02 0.0027 21.1 3.3 38 301-340 3-40 (43)
143 PRK04325 hypothetical protein; 22.9 3.6E+02 0.0079 20.8 6.7 29 288-316 29-57 (74)
144 PF10458 Val_tRNA-synt_C: Valy 22.9 3.3E+02 0.0071 20.3 7.4 58 259-316 6-66 (66)
145 PF04423 Rad50_zn_hook: Rad50 22.9 2.3E+02 0.0049 20.2 4.8 10 217-226 22-31 (54)
146 PRK00846 hypothetical protein; 22.9 3.9E+02 0.0084 21.1 6.8 48 271-318 13-63 (77)
147 TIGR02977 phageshock_pspA phag 22.8 6E+02 0.013 23.4 9.3 66 260-329 27-92 (219)
148 smart00035 CLa CLUSTERIN alpha 22.6 1.8E+02 0.0038 27.5 5.1 56 216-283 73-128 (216)
149 PF10234 Cluap1: Clusterin-ass 22.3 7.3E+02 0.016 24.1 10.1 9 186-194 226-234 (267)
150 TIGR03185 DNA_S_dndD DNA sulfu 22.3 8.2E+02 0.018 26.4 11.0 16 181-196 479-494 (650)
151 PF12128 DUF3584: Protein of u 22.1 1.3E+03 0.028 27.0 16.2 173 115-312 825-1015(1201)
152 PF04363 DUF496: Protein of un 22.0 3.7E+02 0.0081 22.0 6.2 24 115-138 14-37 (95)
153 KOG2264|consensus 21.9 4.8E+02 0.01 28.4 8.7 36 149-184 104-144 (907)
154 PF05149 Flagellar_rod: Parafl 21.9 7.7E+02 0.017 24.3 15.6 53 264-316 187-239 (289)
155 PRK02793 phi X174 lysis protei 21.6 3.8E+02 0.0082 20.6 6.8 48 270-317 7-57 (72)
156 COG4911 Uncharacterized conser 21.5 4.4E+02 0.0095 22.4 6.7 55 113-173 11-65 (123)
157 KOG0150|consensus 21.5 1.3E+02 0.0028 29.9 4.2 28 216-245 11-38 (336)
158 TIGR00756 PPR pentatricopeptid 21.1 1.8E+02 0.0039 17.2 3.6 27 312-341 7-33 (35)
159 PRK03918 chromosome segregatio 21.0 1.1E+03 0.025 25.9 22.4 182 109-334 521-716 (880)
160 COG2433 Uncharacterized conser 21.0 5.9E+02 0.013 27.8 9.2 35 249-283 427-462 (652)
161 COG4026 Uncharacterized protei 20.8 7E+02 0.015 24.0 8.7 14 186-199 192-205 (290)
162 PF11945 WASH_WAHD: WAHD domai 20.5 3.8E+02 0.0083 26.4 7.3 51 254-318 22-72 (297)
163 PF10367 Vps39_2: Vacuolar sor 20.5 2.6E+02 0.0056 22.0 5.2 12 216-227 79-90 (109)
164 PF04949 Transcrip_act: Transc 20.2 4.8E+02 0.01 23.3 7.1 62 129-190 61-127 (159)
165 PF06825 HSBP1: Heat shock fac 20.2 2.1E+02 0.0046 21.1 4.2 36 278-313 17-52 (54)
No 1
>PF03194 LUC7: LUC7 N_terminus; InterPro: IPR004882 This family consists of several LUC7 protein homologues that are restricted to eukaryotes. LUC7 has been shown to be a U1 snRNA associated protein [] with a role in splice site recognition []. The entry contains human and mouse LUC7 like (LUC7L) proteins [] and human cisplatin resistance-associated overexpressed protein (CROP) [].
Probab=100.00 E-value=1.4e-69 Score=511.67 Aligned_cols=217 Identities=55% Similarity=0.874 Sum_probs=196.3
Q ss_pred ccCCCCCCCCCcccCCCCcchhhhhhcCCChhhhhhhCccCCCCCCccccHHHHHHHHHhhccCCccccccccccCCCch
Q psy4154 10 FSINFPGESNKYNVKFFDSKVCKSFLLTCCPHEILSSTRMDLGECPKIHDLALRADYEKASKNRDYYYDIDRMDLGECPK 89 (346)
Q Consensus 10 ~~~~r~~~~~~~~~~~~D~~VCk~fL~g~CPhdLF~nTk~dlG~C~k~H~~~lk~~ye~~~~~~~~~~~~~r~d~g~c~k 89 (346)
||.+||+...+..++|+||+|||||||||||||||+|||+||||||||||++||++|++
T Consensus 12 MG~~Rn~~~~~~~~~f~D~~VCk~~L~g~CPhdLF~nTK~DLG~C~kiHd~~lk~~Ye~--------------------- 70 (254)
T PF03194_consen 12 MGSNRNGDPSKRKVHFTDPDVCKYFLVGFCPHDLFVNTKSDLGPCPKIHDEALKAEYEK--------------------- 70 (254)
T ss_pred cCCccCCCccccCCCCCCcccCHHHHhCCCcHHHHhhcccccchhhhhcCHHHHHHHHh---------------------
Confidence 89999999877779999999999999999999999999999999999999999999955
Q ss_pred hhhhhhHHHHHHhhhcccch-hhHHHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHH-HHHHHHHHHHHHHHHHHHHH
Q psy4154 90 IHDLALRADYEKASKNRDYY-YDIDAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEV-AEKANAVHQLAEDIGKKLAK 167 (346)
Q Consensus 90 ~h~~~lr~~~e~~~~~~~~~-yE~e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~-~~~~~~i~~l~~~I~~~l~~ 167 (346)
+++..+++ ||++|+++|++||.+|||||+++++||+.++++.+... ....+.|..|+++|++++++
T Consensus 71 ------------~~~~~~~~~YE~e~~~~L~~~i~d~drrI~~~k~RL~~~~~~~~~~~~~~~~~~i~~l~~~I~~ll~~ 138 (254)
T PF03194_consen 71 ------------ASKKEKYGGYEREFLRYLQRLIRDCDRRIERAKERLEQTQEEQAKEADEEKAEKIDELDEKIGELLKE 138 (254)
T ss_pred ------------CccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccccccchhhhHHHHHHHHHHHHHHHHHH
Confidence 44455555 99999999999999999999999999999887765333 23478999999999999999
Q ss_pred HHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH---hhccCCCCCc-hhhhcccccccchhhhcccCcchhhhcccCCccc
Q psy4154 168 AEELGAEGFVEESLKLMTEIEDLRKKKAEAED---TYRNSMPASS-YQQQKLRVCEVCSAYLGIHDNDRRLADHFGGKLH 243 (346)
Q Consensus 168 aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~---~~rn~~~~~~-~~~~k~~vCdVCga~L~~~d~~~R~~dH~~gK~H 243 (346)
||+||++|+||+|+++|.+|+.|++++++|.. .+++..+... +++++|.||+|||+||+++|+++|++|||+||+|
T Consensus 139 aE~LGeeG~VdeA~~~~~~~e~Lk~ek~~le~~~~~~~~~~~~~~~~~~qkl~VCeVCGA~Ls~~D~d~RladH~~GK~H 218 (254)
T PF03194_consen 139 AEELGEEGDVDEAQKLMEEVEKLKEEKEELEKELEEYRNSIENSAQSQQQKLEVCEVCGAFLSVGDNDRRLADHFGGKQH 218 (254)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccccCccchhhhhhHHhccchHHHHHHHhccchh
Confidence 99999999999999999999999999999998 5566555442 5678999999999999999999999999999999
Q ss_pred cchHHHHHHHHHHHHH
Q psy4154 244 LGFIKIREKLAELQAM 259 (346)
Q Consensus 244 ~g~~~lR~~~~El~~~ 259 (346)
+||.+||+.+.+++..
T Consensus 219 lGy~~IR~~l~el~e~ 234 (254)
T PF03194_consen 219 LGYAKIREKLKELKEK 234 (254)
T ss_pred hhHHHHHHHHHHHHHH
Confidence 9999999999988843
No 2
>KOG0796|consensus
Probab=100.00 E-value=4e-66 Score=492.27 Aligned_cols=216 Identities=60% Similarity=0.956 Sum_probs=198.7
Q ss_pred ccCCCCCCCCCcccCCCCcchhhhhhcCCChhhhhhhCccCCCCCCccccHHHHHHHHHhhccCCccccccccccCCCch
Q psy4154 10 FSINFPGESNKYNVKFFDSKVCKSFLLTCCPHEILSSTRMDLGECPKIHDLALRADYEKASKNRDYYYDIDRMDLGECPK 89 (346)
Q Consensus 10 ~~~~r~~~~~~~~~~~~D~~VCk~fL~g~CPhdLF~nTk~dlG~C~k~H~~~lk~~ye~~~~~~~~~~~~~r~d~g~c~k 89 (346)
||+||||+.....+.|+||+||+||||||||||||+|||+|||+||++|+++||++|+
T Consensus 13 MGs~r~~~~~~~~v~~~D~~VC~~fLvg~CPHDlF~nTk~dlg~C~kvHd~~lk~~Ye---------------------- 70 (319)
T KOG0796|consen 13 MGSNRDGDETRQRVKFDDPDVCKSFLVGFCPHDLFQNTKMDLGPCPKVHDEALKADYE---------------------- 70 (319)
T ss_pred hCCCcCCCcccCCCCCCchhHHHHHHhCCCcHHHhhhhhcccCcccchhhHHHHHHHh----------------------
Confidence 8999999988888999999999999999999999999999999999999999999994
Q ss_pred hhhhhhHHHHHHhhhcccchhhHHHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 90 IHDLALRADYEKASKNRDYYYDIDAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAE 169 (346)
Q Consensus 90 ~h~~~lr~~~e~~~~~~~~~yE~e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE 169 (346)
.+++..+++||++|+++|+++|.||+++|+++++||..+.++.+.....++++|..|+++|+.++++||
T Consensus 71 -----------~~~k~~~~~~E~d~~~~l~~~v~d~~rri~~~kerL~e~~ee~~~e~~~k~~~v~~l~e~I~~~l~~~E 139 (319)
T KOG0796|consen 71 -----------RASKERDYGYEWDALEILERFVADVDRRIEKAKERLAETVEERSEEAARKAEKVHELEEKIGKLLEKAE 139 (319)
T ss_pred -----------hchHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466678899999999999999999999999999999999776665555667899999999999999999
Q ss_pred HHhhcCCHHHHHHHHHHHHHHHH-HHHHHHHhhccCCCCCchhhhcccccccchhhhcccCcchhhhcccCCccccchHH
Q psy4154 170 ELGAEGFVEESLKLMTEIEDLRK-KKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRLADHFGGKLHLGFIK 248 (346)
Q Consensus 170 ~LGeeG~VdeA~~~~~~~e~lka-~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~~dH~~gK~H~g~~~ 248 (346)
+||++|+|++||++|.++|.|++ ++......+.+.++..+.++|+|.||+||||||+++|+++|++|||+||+|+||..
T Consensus 140 ~LG~eG~Veeaq~~~~e~E~lk~~e~e~~~~~~~~~~~~~~~~~qkl~VCeVCGa~L~~~D~d~RlaDHf~GKlHlGy~~ 219 (319)
T KOG0796|consen 140 ELGEEGNVEEAQKAMKEVEELKAKEKEEAEESYNTTMPGASAQQQKLRVCEVCGAFLSVNDADRRLADHFGGKLHLGYVL 219 (319)
T ss_pred HHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHccCcchhhhhhhhhhHHHhhhHHHhccchHHHHHHhhcchHHHHHHH
Confidence 99999999999999999999998 66666666666666666788999999999999999999999999999999999999
Q ss_pred HHHHHHHHHH
Q psy4154 249 IREKLAELQA 258 (346)
Q Consensus 249 lR~~~~El~~ 258 (346)
||++|.+|+.
T Consensus 220 iR~~l~eLk~ 229 (319)
T KOG0796|consen 220 IREKLAELKK 229 (319)
T ss_pred HHHHHHHHHH
Confidence 9999998874
No 3
>COG5200 LUC7 U1 snRNP component, mediates U1 snRNP association with cap-binding complex [RNA processing and modification]
Probab=100.00 E-value=3.1e-51 Score=368.93 Aligned_cols=213 Identities=31% Similarity=0.559 Sum_probs=191.5
Q ss_pred ccCCCCCCCCCcccCCCCcchhhhhhcCCChhhhhhhCccCCCCCCccccHHHHHHHHHhhccCCccccccccccCCCch
Q psy4154 10 FSINFPGESNKYNVKFFDSKVCKSFLLTCCPHEILSSTRMDLGECPKIHDLALRADYEKASKNRDYYYDIDRMDLGECPK 89 (346)
Q Consensus 10 ~~~~r~~~~~~~~~~~~D~~VCk~fL~g~CPhdLF~nTk~dlG~C~k~H~~~lk~~ye~~~~~~~~~~~~~r~d~g~c~k 89 (346)
||++|..-...+..||+||+||+||||||||||||+|||++||+||++|.+++|.+|+.
T Consensus 13 mG~~~s~f~~~r~~hftd~kvC~s~Lvg~CP~dlF~nTk~slGkCp~~H~~k~K~~YeR--------------------- 71 (258)
T COG5200 13 MGPERSTFDPCRPTHFTDKKVCISMLVGFCPFDLFRNTKRSLGKCPTSHEEKYKAEYER--------------------- 71 (258)
T ss_pred hCCCcccCCCCCCccccchHHHHHHHHccCcHHHHhccccccCCCcchhHHHHHHHHhh---------------------
Confidence 78888866667788999999999999999999999999999999999999999999955
Q ss_pred hhhhhhHHHHHHhhhcccchhhHHHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 90 IHDLALRADYEKASKNRDYYYDIDAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAE 169 (346)
Q Consensus 90 ~h~~~lr~~~e~~~~~~~~~yE~e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE 169 (346)
+...+.++||++|++.|.++|.+|+..|+.|-.+...|+++-. .+.++.+.+..+++.|+.+-.+.+
T Consensus 72 ------------~~~~~~~~yEweyl~~L~r~V~~cn~~I~~a~~~~~~t~ee~~-kI~e~~e~~~~~d~sIg~lg~e~d 138 (258)
T COG5200 72 ------------NGRERAEEYEWEYLRLLVRIVLSCNDGIRAAGLEDRTTPEEFG-KIKEKEELFNRVDESIGELGMEGD 138 (258)
T ss_pred ------------hhhccchhhhHHHHHHHHHHHHHhcchhhhhhhhccCCHHHHH-hHHHHHHHHHHHHHHHHHHHHhHH
Confidence 4456778999999999999999999999999999988887643 477888889999999999999999
Q ss_pred HHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC-chhhhcccccccchhhhcccCcchhhhcccCCccccchHH
Q psy4154 170 ELGAEGFVEESLKLMTEIEDLRKKKAEAEDTYRNSMPAS-SYQQQKLRVCEVCSAYLGIHDNDRRLADHFGGKLHLGFIK 248 (346)
Q Consensus 170 ~LGeeG~VdeA~~~~~~~e~lka~~deL~~~~rn~~~~~-~~~~~k~~vCdVCga~L~~~d~~~R~~dH~~gK~H~g~~~ 248 (346)
.|...-.|+.|+....+++.|+.++.++.....+++... +..+|||+||+|||+||+..|+++|++|||+||+|+||..
T Consensus 139 alakrk~V~~a~~~f~el~rl~~~rkei~~~v~sm~en~gq~thqklqvC~iCgayLsrlDtdrrladHf~GklHlGy~~ 218 (258)
T COG5200 139 ALAKRKLVERACSAFNELERLREERKEIKEAVYSMVENNGQGTHQKLQVCGICGAYLSRLDTDRRLADHFNGKLHLGYLL 218 (258)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCcchhhhhhhhhhhhhhhHHHhcchhhHHHHHhccchhhhHHH
Confidence 999988999999999999999999999999988765433 3568999999999999999999999999999999999999
Q ss_pred HHHHHHHH
Q psy4154 249 IREKLAEL 256 (346)
Q Consensus 249 lR~~~~El 256 (346)
+|..|..+
T Consensus 219 ~R~dl~~l 226 (258)
T COG5200 219 VRSDLADL 226 (258)
T ss_pred HHHHHHHH
Confidence 99776643
No 4
>PF03194 LUC7: LUC7 N_terminus; InterPro: IPR004882 This family consists of several LUC7 protein homologues that are restricted to eukaryotes. LUC7 has been shown to be a U1 snRNA associated protein [] with a role in splice site recognition []. The entry contains human and mouse LUC7 like (LUC7L) proteins [] and human cisplatin resistance-associated overexpressed protein (CROP) [].
Probab=99.94 E-value=1.4e-26 Score=219.16 Aligned_cols=146 Identities=34% Similarity=0.459 Sum_probs=122.0
Q ss_pred HHHHHHHHHHHHhhccCCCCCchhhhcccccccchhhhcccCcchhhh---cccC--CccccchHHHHHHHH--------
Q psy4154 188 EDLRKKKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRLA---DHFG--GKLHLGFIKIREKLA-------- 254 (346)
Q Consensus 188 e~lka~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~~---dH~~--gK~H~g~~~lR~~~~-------- 254 (346)
+.++++||+|||..||..+.... .+|.+++||++||++||+..-+. ..+| .|+|+. .+|..|+
T Consensus 2 d~~Ra~LdeLMG~~Rn~~~~~~~--~~f~D~~VCk~~L~g~CPhdLF~nTK~DLG~C~kiHd~--~lk~~Ye~~~~~~~~ 77 (254)
T PF03194_consen 2 DEMRAMLDELMGSNRNGDPSKRK--VHFTDPDVCKYFLVGFCPHDLFVNTKSDLGPCPKIHDE--ALKAEYEKASKKEKY 77 (254)
T ss_pred HHHHHHHHHHcCCccCCCccccC--CCCCCcccCHHHHhCCCcHHHHhhcccccchhhhhcCH--HHHHHHHhCcccccc
Confidence 67899999999999998764322 67999999999999999553220 1122 377876 5777765
Q ss_pred ---HHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy4154 255 ---ELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEV-AEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTE 330 (346)
Q Consensus 255 ---El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~-~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~ 330 (346)
|.+++++|++||.||||||++|++||+.++++.+... ...++.|..|+++|++++++||+||++|+|++|+++|.+
T Consensus 78 ~~YE~e~~~~L~~~i~d~drrI~~~k~RL~~~~~~~~~~~~~~~~~~i~~l~~~I~~ll~~aE~LGeeG~VdeA~~~~~~ 157 (254)
T PF03194_consen 78 GGYEREFLRYLQRLIRDCDRRIERAKERLEQTQEEQAKEADEEKAEKIDELDEKIGELLKEAEELGEEGDVDEAQKLMEE 157 (254)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 7889999999999999999999999999988776543 235899999999999999999999999999999999999
Q ss_pred HhhhcCC
Q psy4154 331 QTSSCSP 337 (346)
Q Consensus 331 ~e~l~~~ 337 (346)
|+.|+.+
T Consensus 158 ~e~Lk~e 164 (254)
T PF03194_consen 158 VEKLKEE 164 (254)
T ss_pred HHHHHHH
Confidence 9999764
No 5
>KOG0796|consensus
Probab=99.91 E-value=5.4e-24 Score=203.78 Aligned_cols=142 Identities=39% Similarity=0.486 Sum_probs=120.6
Q ss_pred HHHHHHHHHHhhccCCCCCchhhhcccccccchhhhcccCcchhh----hcccC--CccccchHHHHHHHH---------
Q psy4154 190 LRKKKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRL----ADHFG--GKLHLGFIKIREKLA--------- 254 (346)
Q Consensus 190 lka~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~----~dH~~--gK~H~g~~~lR~~~~--------- 254 (346)
+++++++|||..++..... +-.+|++++||.+||++|||..-+ .| ++ -|+|.. +||..|+
T Consensus 5 mR~mLdqLMGs~r~~~~~~--~~v~~~D~~VC~~fLvg~CPHDlF~nTk~d-lg~C~kvHd~--~lk~~Ye~~~k~~~~~ 79 (319)
T KOG0796|consen 5 MRAMLDQLMGSNRDGDETR--QRVKFDDPDVCKSFLVGFCPHDLFQNTKMD-LGPCPKVHDE--ALKADYERASKERDYG 79 (319)
T ss_pred HHHHHHHHhCCCcCCCccc--CCCCCCchhHHHHHHhCCCcHHHhhhhhcc-cCcccchhhH--HHHHHHhhchHhhhhh
Confidence 8999999999888765433 225899999999999999965322 12 22 388988 5776665
Q ss_pred -HHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhh
Q psy4154 255 -ELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTS 333 (346)
Q Consensus 255 -El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~ 333 (346)
|.++|++|+.+|.||+++|+++++||++|.++.......++++|..|+++|+.++++||+||++|+|++||++|.+++.
T Consensus 80 ~E~d~~~~l~~~v~d~~rri~~~kerL~e~~ee~~~e~~~k~~~v~~l~e~I~~~l~~~E~LG~eG~Veeaq~~~~e~E~ 159 (319)
T KOG0796|consen 80 YEWDALEILERFVADVDRRIEKAKERLAETVEERSEEAARKAEKVHELEEKIGKLLEKAEELGEEGNVEEAQKAMKEVEE 159 (319)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHH
Confidence 8999999999999999999999999999977666655667889999999999999999999999999999999999999
Q ss_pred hcC
Q psy4154 334 SCS 336 (346)
Q Consensus 334 l~~ 336 (346)
|.+
T Consensus 160 lk~ 162 (319)
T KOG0796|consen 160 LKA 162 (319)
T ss_pred HHH
Confidence 986
No 6
>COG5200 LUC7 U1 snRNP component, mediates U1 snRNP association with cap-binding complex [RNA processing and modification]
Probab=99.24 E-value=3.2e-11 Score=110.04 Aligned_cols=144 Identities=17% Similarity=0.143 Sum_probs=116.0
Q ss_pred HHHHHHHHHHHHHhhccCCCCCchhhhcccccccchhhhcccCcchhhhcccC---------CccccchHHHHHH-----
Q psy4154 187 IEDLRKKKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRLADHFG---------GKLHLGFIKIREK----- 252 (346)
Q Consensus 187 ~e~lka~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~~dH~~---------gK~H~g~~~lR~~----- 252 (346)
+.+.+..+++|||..+..-...+. .++.+.+||.+||+++|+. |.|. -++|..|.++..+
T Consensus 2 aaeqRkv~eqLmG~~~s~f~~~r~--~hftd~kvC~s~Lvg~CP~----dlF~nTk~slGkCp~~H~~k~K~~YeR~~~~ 75 (258)
T COG5200 2 AAEQRKVLEQLMGPERSTFDPCRP--THFTDKKVCISMLVGFCPF----DLFRNTKRSLGKCPTSHEEKYKAEYERNGRE 75 (258)
T ss_pred chHHHHHHHHHhCCCcccCCCCCC--ccccchHHHHHHHHccCcH----HHHhccccccCCCcchhHHHHHHHHhhhhhc
Confidence 345678888999987653222112 2488999999999999944 4442 2677776554222
Q ss_pred ---HHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHH
Q psy4154 253 ---LAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMT 329 (346)
Q Consensus 253 ---~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~ 329 (346)
..|.++++.|.++|.+|..+|+.|-.++.+||++-.. +.++++.+..+++.|+.+-.+.+.|...-.|+.|.....
T Consensus 76 ~~~~yEweyl~~L~r~V~~cn~~I~~a~~~~~~t~ee~~k-I~e~~e~~~~~d~sIg~lg~e~dalakrk~V~~a~~~f~ 154 (258)
T COG5200 76 RAEEYEWEYLRLLVRIVLSCNDGIRAAGLEDRTTPEEFGK-IKEKEELFNRVDESIGELGMEGDALAKRKLVERACSAFN 154 (258)
T ss_pred cchhhhHHHHHHHHHHHHHhcchhhhhhhhccCCHHHHHh-HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 2389999999999999999999999999999888766 779999999999999999999999999999999999999
Q ss_pred HHhhhcCC
Q psy4154 330 EQTSSCSP 337 (346)
Q Consensus 330 ~~e~l~~~ 337 (346)
+++.|..+
T Consensus 155 el~rl~~~ 162 (258)
T COG5200 155 ELERLREE 162 (258)
T ss_pred HHHHHHHH
Confidence 99988653
No 7
>PHA02562 46 endonuclease subunit; Provisional
Probab=94.56 E-value=6.8 Score=40.75 Aligned_cols=13 Identities=23% Similarity=0.606 Sum_probs=10.0
Q ss_pred cccccchhhhccc
Q psy4154 216 RVCEVCSAYLGIH 228 (346)
Q Consensus 216 ~vCdVCga~L~~~ 228 (346)
..|+.|+.-+...
T Consensus 285 ~~Cp~C~~~~~~~ 297 (562)
T PHA02562 285 GVCPTCTQQISEG 297 (562)
T ss_pred CCCCCCCCcCCCc
Confidence 5899999877543
No 8
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=92.75 E-value=18 Score=39.73 Aligned_cols=32 Identities=22% Similarity=0.253 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHH
Q psy4154 149 EKANAVHQLAEDIGKKLAKAEELGAEGFVEESLK 182 (346)
Q Consensus 149 ~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~ 182 (346)
....+|.++.+.++.+-..++.|.+ ++++|..
T Consensus 576 ~Ql~~L~~l~e~~~~l~~~ae~Lae--R~e~a~d 607 (717)
T PF10168_consen 576 QQLKELQELQEERKSLRESAEKLAE--RYEEAKD 607 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH
Confidence 4456788888888888888888876 4666643
No 9
>PF13779 DUF4175: Domain of unknown function (DUF4175)
Probab=91.39 E-value=23 Score=39.60 Aligned_cols=73 Identities=23% Similarity=0.362 Sum_probs=48.7
Q ss_pred HHHHHHHHHhhHHHHHHHHHHhhhh------hhhhHHHHHHHHHHH----------------------------HHHHHH
Q psy4154 116 EHLQAFISDCDRRTELAKQRLLETQ------EVLSAEVAEKANAVH----------------------------QLAEDI 161 (346)
Q Consensus 116 ~~L~~~I~dcdRkI~~ak~RLe~~~------~~~~~~~~~~~~~i~----------------------------~l~~~I 161 (346)
.+-+.=+.+.+++++.|+++|.+.- +++..-+.+..+-+. .-...+
T Consensus 461 ~iEdG~ls~A~~~Lr~AQe~L~eAL~~gAs~eEI~rLm~eLR~A~~~ym~~LAeq~~~~~~~~~~p~~~~~~~~~~~~dL 540 (820)
T PF13779_consen 461 RIEDGDLSDAERRLRAAQEALREALERGASDEEIARLMQELREAMQDYMQALAEQAQRNPQQQDQPPDQGNSQMMSQQDL 540 (820)
T ss_pred HhhcCcHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHHHHHhHhCcccccCcccchhhhccCHHHH
Confidence 3334557888999999999997632 222111111111110 125678
Q ss_pred HHHHHHHHHHhhcCCHHHHHHHHHHHH
Q psy4154 162 GKKLAKAEELGAEGFVEESLKLMTEIE 188 (346)
Q Consensus 162 ~~~l~~aE~LGeeG~VdeA~~~~~~~e 188 (346)
+.+|.++++|.++|..++|+.+|.+++
T Consensus 541 ~~mmd~ie~la~~G~~~~A~q~L~qlq 567 (820)
T PF13779_consen 541 QRMMDRIEELARSGRMDEARQLLEQLQ 567 (820)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 899999999999999999999988665
No 10
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=91.25 E-value=12 Score=34.44 Aligned_cols=82 Identities=18% Similarity=0.168 Sum_probs=67.7
Q ss_pred HHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHH
Q psy4154 116 EHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKA 195 (346)
Q Consensus 116 ~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~d 195 (346)
..|+.+|+|++.-|.+++.-+..... .......++.++...|.....+|+..-..|+=|-|..++.....+.....
T Consensus 26 ~~l~q~ird~e~~l~~a~~~~a~~~a----~~~~le~~~~~~~~~~~~~~~~A~~Al~~g~edLAr~al~~k~~~e~~~~ 101 (221)
T PF04012_consen 26 KMLEQAIRDMEEQLRKARQALARVMA----NQKRLERKLDEAEEEAEKWEKQAELALAAGREDLAREALQRKADLEEQAE 101 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHH
Confidence 78899999999999999888865432 22345678999999999999999999999999999999988888888777
Q ss_pred HHHHhh
Q psy4154 196 EAEDTY 201 (346)
Q Consensus 196 eL~~~~ 201 (346)
.+....
T Consensus 102 ~l~~~~ 107 (221)
T PF04012_consen 102 RLEQQL 107 (221)
T ss_pred HHHHHH
Confidence 776644
No 11
>KOG0994|consensus
Probab=88.31 E-value=56 Score=37.91 Aligned_cols=46 Identities=22% Similarity=0.415 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHHHHHH---HH-Hh-hcCCHHHHHHHHHHHHHHHHHHHHH
Q psy4154 152 NAVHQLAEDIGKKLAKA---EE-LG-AEGFVEESLKLMTEIEDLRKKKAEA 197 (346)
Q Consensus 152 ~~i~~l~~~I~~~l~~a---E~-LG-eeG~VdeA~~~~~~~e~lka~~deL 197 (346)
++|..|..+|++.+... +. |- -.|+|..|..|..++.+.+..-+.+
T Consensus 1511 eqi~~L~~~I~e~v~sL~nVd~IL~~T~~di~ra~~L~s~A~~a~~~A~~v 1561 (1758)
T KOG0994|consen 1511 EQIQQLTGEIQERVASLPNVDAILSRTKGDIARAENLQSEAERARSRAEDV 1561 (1758)
T ss_pred HHHHHHHHHHHHHHHhcccHHHHHHhhhhhHHHHHHHHHHHHHHHhHHHHH
Confidence 56666677776665532 22 22 1677777777777776665544433
No 12
>TIGR02302 aProt_lowcomp conserved hypothetical protein TIGR02302. Members of this family are long (~850 residue) bacterial proteins from the alpha Proteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted by TMHMM to be outside the membrane. In Bradyrhizobium japonicum, two tandem reading frames are together homologous the single members found in other species; the cutoffs scores are set low enough that the longer scores above the trusted cutoff and the shorter above the noise cutoff for this model.
Probab=87.05 E-value=4 Score=45.51 Aligned_cols=76 Identities=17% Similarity=0.299 Sum_probs=50.3
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHhhhh------hhhhHHHHHHH-------HHHH-------------------HH-HHHH
Q psy4154 115 MEHLQAFISDCDRRTELAKQRLLETQ------EVLSAEVAEKA-------NAVH-------------------QL-AEDI 161 (346)
Q Consensus 115 ~~~L~~~I~dcdRkI~~ak~RLe~~~------~~~~~~~~~~~-------~~i~-------------------~l-~~~I 161 (346)
+.+-+.=+.+.+++.+.|+++|.+.- ++|..-+.+.. .++. .+ ...+
T Consensus 491 l~iEdG~ls~A~~~Lr~AQ~aL~eAL~~gAsdeEI~~Lm~eLR~Am~~ym~~LAeq~~~~~~~~~~~~~~~~~~l~~~dL 570 (851)
T TIGR02302 491 LGIEDGDLSDAERRLRAAQDALKDALERGASDEEIKQLTDKLRAAMQTYMRQLAQQLRNNPQQLARPLDPNTKVLRQQDL 570 (851)
T ss_pred HHhhcCCHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhhCcccccccCCccccccCHHHH
Confidence 33335557789999999999997642 22221111111 1111 01 3448
Q ss_pred HHHHHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy4154 162 GKKLAKAEELGAEGFVEESLKLMTEIEDL 190 (346)
Q Consensus 162 ~~~l~~aE~LGeeG~VdeA~~~~~~~e~l 190 (346)
+.||..+|+|.++|..++|+.+|++++.+
T Consensus 571 q~Mmd~ieela~~G~~~~A~qlL~qlq~m 599 (851)
T TIGR02302 571 QNMMDQIENLARSGDRDQAKQLLSQLQQM 599 (851)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 89999999999999999999999887644
No 13
>KOG0250|consensus
Probab=84.34 E-value=76 Score=36.38 Aligned_cols=21 Identities=19% Similarity=0.418 Sum_probs=15.2
Q ss_pred CCCcchhhhhhcCCChhhhhh
Q psy4154 25 FFDSKVCKSFLLTCCPHEILS 45 (346)
Q Consensus 25 ~~D~~VCk~fL~g~CPhdLF~ 45 (346)
|=..+-=|.||..+=|-|.|.
T Consensus 181 ~lsQD~aR~FL~~~~p~dkYk 201 (1074)
T KOG0250|consen 181 VLSQDAARSFLANSNPKDKYK 201 (1074)
T ss_pred hhcHHHHHHHHhcCChHHHHH
Confidence 344466789999998988753
No 14
>PRK02224 chromosome segregation protein; Provisional
Probab=83.88 E-value=75 Score=35.18 Aligned_cols=12 Identities=17% Similarity=0.412 Sum_probs=9.0
Q ss_pred ccccccchhhhc
Q psy4154 215 LRVCEVCSAYLG 226 (346)
Q Consensus 215 ~~vCdVCga~L~ 226 (346)
-..|++||.-+.
T Consensus 451 ~~~Cp~C~r~~~ 462 (880)
T PRK02224 451 AGKCPECGQPVE 462 (880)
T ss_pred cccCCCCCCcCC
Confidence 368999998543
No 15
>PF06008 Laminin_I: Laminin Domain I; InterPro: IPR009254 Laminins are glycoproteins that are major constituents of the basement membrane of cells. Laminins are trimeric molecules; laminin-1 is an alpha1 beta1 gamma1 trimer. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure []. Binding to cells via a high affinity receptor, laminin is thought to mediate the attachment, migration and organisation of cells into tissues during embryonic development by interacting with other extracellular matrix components.; GO: 0005102 receptor binding, 0030155 regulation of cell adhesion, 0030334 regulation of cell migration, 0045995 regulation of embryonic development, 0005606 laminin-1 complex
Probab=82.77 E-value=44 Score=31.70 Aligned_cols=90 Identities=11% Similarity=0.192 Sum_probs=55.9
Q ss_pred hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHH
Q psy4154 109 YYDIDAMEHLQAFISDCDRRTELAKQRLLETQEVLS---AEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMT 185 (346)
Q Consensus 109 ~yE~e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~---~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~ 185 (346)
.+|.+ +..|..-+....+++.++..+-........ .........|..+...|..++.++..||..|..-....+-.
T Consensus 49 ~~e~~-l~~L~~d~~~L~~k~~~~~~~~~~l~~~t~~t~~~a~~L~~~i~~l~~~i~~l~~~~~~l~~~~~~~~~~~l~~ 127 (264)
T PF06008_consen 49 PLEKE-LESLEQDVENLQEKATKVSRKAQQLNNNTERTLQRAQDLEQFIQNLQDNIQELIEQVESLNENGDQLPSEDLQR 127 (264)
T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccCCCCHHHHHH
Confidence 45544 555566666667777666666554433222 22345567888999999999999999999544444444444
Q ss_pred HHHHHHHHHHHHHH
Q psy4154 186 EIEDLRKKKAEAED 199 (346)
Q Consensus 186 ~~e~lka~~deL~~ 199 (346)
..+..+.++.++..
T Consensus 128 ~l~ea~~mL~emr~ 141 (264)
T PF06008_consen 128 ALAEAQRMLEEMRK 141 (264)
T ss_pred HHHHHHHHHHHHHh
Confidence 44444455555543
No 16
>PF07361 Cytochrom_B562: Cytochrome b562; InterPro: IPR009155 Cytochrome b562 is a haem-containing protein that is expressed in the periplasm of Escherichia coli. In b-type cytochromes, the haem atom is not covalently attached to the polypeptide. Cytochrome b562 has a four-helical bundle structure that is structurally similar to that found in members of the cytochrome c family (IPR002321 from INTERPRO). Cytochrome b562 has a reduction potential of 167 mV, which sets the energy yield possible in metabolism and is also a key determinant of the rate at which redox reactions proceed [].; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0042597 periplasmic space; PDB: 4ER9_A 3IQ6_G 2QLA_B 3FOO_A 3M79_C 256B_A 3NMI_F 3HNK_A 3NMK_D 2BC5_A ....
Probab=80.62 E-value=6.4 Score=32.51 Aligned_cols=31 Identities=23% Similarity=0.274 Sum_probs=28.3
Q ss_pred HHHHHHHhhcCCHHHHHHHHHHHhhhcCCCC
Q psy4154 309 LAKAEELGAEGFVEESLKLMTEQTSSCSPGH 339 (346)
Q Consensus 309 l~eaE~LGeeG~VdeA~~l~~~~e~l~~~~~ 339 (346)
+..++.+.+.|++++|...+.+++.|..+||
T Consensus 69 id~a~~~~~~G~l~~AK~~l~~l~~lR~eyH 99 (103)
T PF07361_consen 69 IDKAEALAEAGKLDEAKAALKKLDDLRKEYH 99 (103)
T ss_dssp HHHHHHHHHTTHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHh
Confidence 4567889999999999999999999999988
No 17
>PLN03229 acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
Probab=79.59 E-value=1.1e+02 Score=34.02 Aligned_cols=40 Identities=15% Similarity=0.113 Sum_probs=26.9
Q ss_pred hhHHHHHHhhhcccchhhHHHHHHHHHHHHHhhHHHHHHH
Q psy4154 94 ALRADYEKASKNRDYYYDIDAMEHLQAFISDCDRRTELAK 133 (346)
Q Consensus 94 ~lr~~~e~~~~~~~~~yE~e~~~~L~~~I~dcdRkI~~ak 133 (346)
.||...-+++.....--+.-+.+-++++-+++|..+..|-
T Consensus 440 kLk~eilKAk~s~~~~~~~~L~e~IeKLk~E~d~e~S~A~ 479 (762)
T PLN03229 440 KLKEQILKAKESSSKPSELALNEMIEKLKKEIDLEYTEAV 479 (762)
T ss_pred HHHHHHHhcccccCCCCChHHHHHHHHHHHHHHHHHHHhh
Confidence 4555444443333445566788888999999999988773
No 18
>TIGR02302 aProt_lowcomp conserved hypothetical protein TIGR02302. Members of this family are long (~850 residue) bacterial proteins from the alpha Proteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted by TMHMM to be outside the membrane. In Bradyrhizobium japonicum, two tandem reading frames are together homologous the single members found in other species; the cutoffs scores are set low enough that the longer scores above the trusted cutoff and the shorter above the noise cutoff for this model.
Probab=76.39 E-value=11 Score=42.00 Aligned_cols=32 Identities=16% Similarity=0.304 Sum_probs=28.9
Q ss_pred HHHHHHHHHHHHhhcCCHHHHHHHHHHHhhhc
Q psy4154 304 DIGKKLAKAEELGAEGFVEESLKLMTEQTSSC 335 (346)
Q Consensus 304 kI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l~ 335 (346)
-+..||.++|+|-+.|..++|+.++.++..+.
T Consensus 569 dLq~Mmd~ieela~~G~~~~A~qlL~qlq~mm 600 (851)
T TIGR02302 569 DLQNMMDQIENLARSGDRDQAKQLLSQLQQMM 600 (851)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Confidence 37899999999999999999999999987654
No 19
>KOG0933|consensus
Probab=75.41 E-value=1.6e+02 Score=33.85 Aligned_cols=79 Identities=15% Similarity=0.129 Sum_probs=40.1
Q ss_pred HHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHH-----HHhhcCCHHHHHHHHHHHhhh
Q psy4154 260 EHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAE-----ELGAEGFVEESLKLMTEQTSS 334 (346)
Q Consensus 260 r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE-----~LGeeG~VdeA~~l~~~~e~l 334 (346)
..++..=.++.+++ .-.++|..+.++...........+..+..+|..+..++. .-+.+++|++++..+.....-
T Consensus 801 ~~~e~~~~~~ek~~-~e~e~l~lE~e~l~~e~~~~k~~l~~~~~~~~~l~~e~~~l~~kv~~~~~~~~~~~~el~~~k~k 879 (1174)
T KOG0933|consen 801 QRAEESSKELEKRE-NEYERLQLEHEELEKEISSLKQQLEQLEKQISSLKSELGNLEAKVDKVEKDVKKAQAELKDQKAK 879 (1174)
T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHHHHHHHHHHHHH
Confidence 34444444554444 223344433333333333333334444444444443333 346788999999988877766
Q ss_pred cCCCC
Q psy4154 335 CSPGH 339 (346)
Q Consensus 335 ~~~~~ 339 (346)
+..+.
T Consensus 880 ~~~~d 884 (1174)
T KOG0933|consen 880 QRDID 884 (1174)
T ss_pred HHhhh
Confidence 55443
No 20
>KOG0971|consensus
Probab=73.83 E-value=1.7e+02 Score=33.39 Aligned_cols=26 Identities=15% Similarity=0.161 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhh
Q psy4154 113 DAMEHLQAFISDCDRRTELAKQRLLE 138 (346)
Q Consensus 113 e~~~~L~~~I~dcdRkI~~ak~RLe~ 138 (346)
+|-.-|..-+.++.|.|.+++.-..+
T Consensus 269 EfkSkim~qqa~Lqrel~raR~e~ke 294 (1243)
T KOG0971|consen 269 EFKSKIMEQQADLQRELKRARKEAKE 294 (1243)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444445556667777666555443
No 21
>COG0419 SbcC ATPase involved in DNA repair [DNA replication, recombination, and repair]
Probab=72.94 E-value=1.7e+02 Score=32.97 Aligned_cols=24 Identities=21% Similarity=0.444 Sum_probs=13.7
Q ss_pred ccccccchhhhcccCcchhhhcccC
Q psy4154 215 LRVCEVCSAYLGIHDNDRRLADHFG 239 (346)
Q Consensus 215 ~~vCdVCga~L~~~d~~~R~~dH~~ 239 (346)
-..|+|||.-+-..- .+.+.+|..
T Consensus 457 ~~~CPvCg~~l~~~~-~~~~~~~~~ 480 (908)
T COG0419 457 GEKCPVCGQELPEEH-EKELLELYE 480 (908)
T ss_pred CCCCCCCCCCCCcHH-HHHHHHHHH
Confidence 368999995444322 344455544
No 22
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=72.26 E-value=34 Score=28.36 Aligned_cols=72 Identities=17% Similarity=0.204 Sum_probs=45.1
Q ss_pred chHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHh-----HHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 245 GFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAE-----VAEKANAVHQLAEDIGKKLAKAEELG 316 (346)
Q Consensus 245 g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~-----~~~~~~~i~~l~ekI~~ll~eaE~LG 316 (346)
.|..|+..|..-..+..+..-+...|+|+..-..++++.|....-. ..+..-++..++.+|..+-..++-|-
T Consensus 23 ~~~~l~~~~a~~~~~~~l~~~~~~~~~Rl~~lE~~l~~LPt~~dv~~L~l~l~el~G~~~~l~~~l~~v~~~~~lLl 99 (106)
T PF10805_consen 23 FWLWLRRTYAKREDIEKLEERLDEHDRRLQALETKLEHLPTRDDVHDLQLELAELRGELKELSARLQGVSHQLDLLL 99 (106)
T ss_pred HHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 4556777777666677788888899999999999998876543211 12333344444555544444444443
No 23
>PRK01156 chromosome segregation protein; Provisional
Probab=71.93 E-value=1.7e+02 Score=32.62 Aligned_cols=11 Identities=18% Similarity=0.102 Sum_probs=6.9
Q ss_pred ccccchhhhcc
Q psy4154 217 VCEVCSAYLGI 227 (346)
Q Consensus 217 vCdVCga~L~~ 227 (346)
+|+|||..+.-
T Consensus 451 ~~~~Cp~c~~~ 461 (895)
T PRK01156 451 GQSVCPVCGTT 461 (895)
T ss_pred cCCCCCCCCCc
Confidence 67777765543
No 24
>KOG0996|consensus
Probab=71.45 E-value=2.1e+02 Score=33.44 Aligned_cols=24 Identities=29% Similarity=0.414 Sum_probs=13.1
Q ss_pred HHHHHHHHhhhHHHHHHHHHhhhh
Q psy4154 259 MEHLQAFISDCDRRTELAKQRLLE 282 (346)
Q Consensus 259 ~r~L~~~i~d~DRrI~~ak~RL~~ 282 (346)
.+.+..-|.--+|.|.+++..+..
T Consensus 923 i~k~~~~i~~s~~~i~k~q~~l~~ 946 (1293)
T KOG0996|consen 923 IAKLTVAIKTSDRNIAKAQKKLSE 946 (1293)
T ss_pred HHHhHHHHhcCcccHHHHHHHHHH
Confidence 344555555556666666555543
No 25
>PF13779 DUF4175: Domain of unknown function (DUF4175)
Probab=68.49 E-value=21 Score=39.89 Aligned_cols=35 Identities=20% Similarity=0.397 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhhcC
Q psy4154 302 AEDIGKKLAKAEELGAEGFVEESLKLMTEQTSSCS 336 (346)
Q Consensus 302 ~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l~~ 336 (346)
...+..||.++|+|.++|..++|+.++.+++.+..
T Consensus 537 ~~dL~~mmd~ie~la~~G~~~~A~q~L~qlq~mme 571 (820)
T PF13779_consen 537 QQDLQRMMDRIEELARSGRMDEARQLLEQLQQMME 571 (820)
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 45688999999999999999999999999987654
No 26
>KOG0971|consensus
Probab=67.29 E-value=2.3e+02 Score=32.35 Aligned_cols=51 Identities=14% Similarity=0.209 Sum_probs=32.5
Q ss_pred HHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 264 AFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 264 ~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGee 318 (346)
--++++.+||+.+++-+-. ..+.+..-.+.+..|+++|..+..+.+..-++
T Consensus 504 g~~kel~~r~~aaqet~yD----rdqTI~KfRelva~Lqdqlqe~~dq~~Sseee 554 (1243)
T KOG0971|consen 504 GARKELQKRVEAAQETVYD----RDQTIKKFRELVAHLQDQLQELTDQQESSEEE 554 (1243)
T ss_pred hHHHHHHHHHHHHHHHHHh----HHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHH
Confidence 4467777788877776542 23445455556677888888887766555443
No 27
>PRK10884 SH3 domain-containing protein; Provisional
Probab=66.72 E-value=1.1e+02 Score=28.44 Aligned_cols=74 Identities=11% Similarity=0.135 Sum_probs=45.1
Q ss_pred HHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 121 FISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAED 199 (346)
Q Consensus 121 ~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~ 199 (346)
-+..+++.+...+.+|.....+.+...++....+...+..|.++..+-..|-++ =..+-.+++.++++.+.+..
T Consensus 94 rlp~le~el~~l~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~L~~~n~~L~~~-----l~~~~~~~~~l~~~~~~~~~ 167 (206)
T PRK10884 94 RVPDLENQVKTLTDKLNNIDNTWNQRTAEMQQKVAQSDSVINGLKEENQKLKNQ-----LIVAQKKVDAANLQLDDKQR 167 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHH
Confidence 345667777777777777665555555566667777777777777766666322 22344455555555555544
No 28
>smart00356 ZnF_C3H1 zinc finger.
Probab=66.19 E-value=2.8 Score=25.36 Aligned_cols=23 Identities=30% Similarity=0.670 Sum_probs=17.0
Q ss_pred cchhhhhhcCCChhhhhhhCccCCCCCCcccc
Q psy4154 28 SKVCKSFLLTCCPHEILSSTRMDLGECPKIHD 59 (346)
Q Consensus 28 ~~VCk~fL~g~CPhdLF~nTk~dlG~C~k~H~ 59 (346)
+.+|++|+.|.||+. ..|+..|+
T Consensus 4 ~~~C~~~~~g~C~~g---------~~C~~~H~ 26 (27)
T smart00356 4 TELCKFFKRGYCPYG---------DRCKFAHP 26 (27)
T ss_pred CCcCcCccCCCCCCC---------CCcCCCCc
Confidence 358999999999762 24777775
No 29
>PRK06835 DNA replication protein DnaC; Validated
Probab=64.37 E-value=88 Score=31.04 Aligned_cols=67 Identities=25% Similarity=0.306 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCC---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCchhhhcccccccchh
Q psy4154 153 AVHQLAEDIGKKLAKAEELGAEGF---VEESLKLMTEIEDLRKKKAEAEDTYRNSMPASSYQQQKLRVCEVCSA 223 (346)
Q Consensus 153 ~i~~l~~~I~~~l~~aE~LGeeG~---VdeA~~~~~~~e~lka~~deL~~~~rn~~~~~~~~~~k~~vCdVCga 223 (346)
+|.+|+.+|..+--+.-.....|+ -+....+-+++..|+.++.+|.... .-|.+ +- ..-..|+.|+-
T Consensus 37 ~~~~id~~i~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~l~~~~~~lL~~~--g~~~d-yl-~~~y~Cp~C~d 106 (329)
T PRK06835 37 EIAEIDDEIAKLGIKLSRAILKNPDKKEETLKELKEKITDLRVKKAELLVSN--GYPPD-YL-EMKYTCPKCKD 106 (329)
T ss_pred cHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHHHHHHHHc--CCChh-hc-CCCCCCCCCCC
Confidence 455666666666444433333454 3333455567778888777776543 11211 11 11247888874
No 30
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=63.02 E-value=2.1e+02 Score=30.42 Aligned_cols=89 Identities=16% Similarity=0.224 Sum_probs=56.5
Q ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHhhhhh--h-hhHHHHHH---------------------HHHHHHHHHHHHHHHHH
Q psy4154 112 IDAMEHLQAFISDCDRRTELAKQRLLETQE--V-LSAEVAEK---------------------ANAVHQLAEDIGKKLAK 167 (346)
Q Consensus 112 ~e~~~~L~~~I~dcdRkI~~ak~RLe~~~~--~-~~~~~~~~---------------------~~~i~~l~~~I~~~l~~ 167 (346)
...+..++..|.+++..|..-...|..-.. + ....+.+. ...|..-=..|.....+
T Consensus 100 ~~~i~~~~~~l~~~e~~i~~i~~~l~~L~~~e~~nr~~i~~l~~~y~~lrk~ll~~~~~~G~a~~~Le~~L~~ie~~F~~ 179 (560)
T PF06160_consen 100 KQAIKEIEEQLDEIEEDIKEILDELDELLESEEKNREEIEELKEKYRELRKELLAHSFSYGPAIEELEKQLENIEEEFSE 179 (560)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhHHHHHHHHHHHHHHHHH
Confidence 456778888888888888888877764211 1 11111111 02222223345555678
Q ss_pred HHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 168 AEELGAEGFVEESLKLMTEIEDLRKKKAEAEDT 200 (346)
Q Consensus 168 aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~ 200 (346)
.+.|-+.|+..+|...+..++.--..+.++|..
T Consensus 180 f~~lt~~GD~~~A~eil~~l~~~~~~l~~~~e~ 212 (560)
T PF06160_consen 180 FEELTENGDYLEAREILEKLKEETDELEEIMED 212 (560)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 888999999999999988877666666665553
No 31
>PF13863 DUF4200: Domain of unknown function (DUF4200)
Probab=62.98 E-value=87 Score=25.93 Aligned_cols=67 Identities=18% Similarity=0.339 Sum_probs=43.9
Q ss_pred HHHHHHHHHH-HHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 248 KIREKLAELQ-AMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 248 ~lR~~~~El~-~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGee 318 (346)
.|+..-..+. .+.....|+.+++.+..+|..+.+... .....+..+|..|...|+.+..++..+.+.
T Consensus 36 ~L~~~e~~l~~~~~~f~~flken~~k~~rA~k~a~~e~----k~~~~k~~ei~~l~~~l~~l~~~~~k~e~~ 103 (126)
T PF13863_consen 36 ELEKKEQELEEDVIKFDKFLKENEAKRERAEKRAEEEK----KKKEEKEAEIKKLKAELEELKSEISKLEEK 103 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444333333 345778999999999999998887431 223455566777777777777776666554
No 32
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=62.81 E-value=2.1e+02 Score=30.38 Aligned_cols=43 Identities=21% Similarity=0.261 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhhcCCCC
Q psy4154 297 AVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTSSCSPGH 339 (346)
Q Consensus 297 ~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l~~~~~ 339 (346)
+-..-...|+.-+.+|+.|-..|+..+|+.....+=.-..||-
T Consensus 511 Rfr~~~~~V~~~f~~Ae~lF~~~~Y~~al~~~~~alE~vePG~ 553 (569)
T PRK04778 511 RYRSDNEEVAEALNEAERLFREYDYKAALEIIATALEKVEPGV 553 (569)
T ss_pred ccCCCCHHHHHHHHHHHHHHHhCChHHHHHHHHHHHHhhCCcH
Confidence 3334467899999999999999999999999977666666663
No 33
>PF13863 DUF4200: Domain of unknown function (DUF4200)
Probab=60.48 E-value=87 Score=25.93 Aligned_cols=54 Identities=17% Similarity=0.313 Sum_probs=34.9
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 115 MEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELG 172 (346)
Q Consensus 115 ~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LG 172 (346)
...++.||.+++.+-.+|..+.+..... ......+|..|..+|..+-.++..+.
T Consensus 48 ~~~f~~flken~~k~~rA~k~a~~e~k~----~~~k~~ei~~l~~~l~~l~~~~~k~e 101 (126)
T PF13863_consen 48 VIKFDKFLKENEAKRERAEKRAEEEKKK----KEEKEAEIKKLKAELEELKSEISKLE 101 (126)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4467899999999999999888654332 22344556666666666555555543
No 34
>PF07361 Cytochrom_B562: Cytochrome b562; InterPro: IPR009155 Cytochrome b562 is a haem-containing protein that is expressed in the periplasm of Escherichia coli. In b-type cytochromes, the haem atom is not covalently attached to the polypeptide. Cytochrome b562 has a four-helical bundle structure that is structurally similar to that found in members of the cytochrome c family (IPR002321 from INTERPRO). Cytochrome b562 has a reduction potential of 167 mV, which sets the energy yield possible in metabolism and is also a key determinant of the rate at which redox reactions proceed [].; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0042597 periplasmic space; PDB: 4ER9_A 3IQ6_G 2QLA_B 3FOO_A 3M79_C 256B_A 3NMI_F 3HNK_A 3NMK_D 2BC5_A ....
Probab=59.06 E-value=72 Score=26.27 Aligned_cols=40 Identities=33% Similarity=0.541 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHH---HHHHHHhhcCCHHHHHHHHHHHHHHHHH
Q psy4154 154 VHQLAEDIGKKL---AKAEELGAEGFVEESLKLMTEIEDLRKK 193 (346)
Q Consensus 154 i~~l~~~I~~~l---~~aE~LGeeG~VdeA~~~~~~~e~lka~ 193 (346)
+..+.+-++.++ ..++.+.+.|++++|...+.+++.+|.+
T Consensus 55 ~~~Y~~Gl~~li~~id~a~~~~~~G~l~~AK~~l~~l~~lR~e 97 (103)
T PF07361_consen 55 VKDYQEGLDKLIDQIDKAEALAEAGKLDEAKAALKKLDDLRKE 97 (103)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
Confidence 344444444443 4567788999999999999999988754
No 35
>TIGR00570 cdk7 CDK-activating kinase assembly factor MAT1. All proteins in this family for which functions are known are cyclin dependent protein kinases that are components of TFIIH, a complex that is involved in nucleotide excision repair and transcription initiation. Also known as MAT1 (menage a trois 1). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=57.88 E-value=2e+02 Score=28.54 Aligned_cols=34 Identities=21% Similarity=0.522 Sum_probs=24.5
Q ss_pred hhhhhhcCCChhhhhhhCccCCCCCCccccHHHHHHH
Q psy4154 30 VCKSFLLTCCPHEILSSTRMDLGECPKIHDLALRADY 66 (346)
Q Consensus 30 VCk~fL~g~CPhdLF~nTk~dlG~C~k~H~~~lk~~y 66 (346)
.|-|-+|.-|--.+|.. .-|+||--+..--+..|
T Consensus 25 ~CGH~~C~sCv~~l~~~---~~~~CP~C~~~lrk~~f 58 (309)
T TIGR00570 25 VCGHTLCESCVDLLFVR---GSGSCPECDTPLRKNNF 58 (309)
T ss_pred CCCCcccHHHHHHHhcC---CCCCCCCCCCccchhhc
Confidence 79999999999999954 34799955444333343
No 36
>PRK15058 cytochrome b562; Provisional
Probab=57.35 E-value=11 Score=32.71 Aligned_cols=32 Identities=22% Similarity=0.222 Sum_probs=28.7
Q ss_pred HHHHHHHhhcCCHHHHHHHHHHHhhhcCCCCC
Q psy4154 309 LAKAEELGAEGFVEESLKLMTEQTSSCSPGHV 340 (346)
Q Consensus 309 l~eaE~LGeeG~VdeA~~l~~~~e~l~~~~~~ 340 (346)
+.+++.|..+|++++|.....++..+..++|-
T Consensus 94 ID~a~~la~~GkL~eAK~~a~~l~~lR~eYHk 125 (128)
T PRK15058 94 IDGALKLANEGKVKEAQAAAEQLKTTRNAYHK 125 (128)
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHH
Confidence 35678899999999999999999999998884
No 37
>PRK04406 hypothetical protein; Provisional
Probab=56.66 E-value=80 Score=24.70 Aligned_cols=48 Identities=13% Similarity=0.201 Sum_probs=21.4
Q ss_pred HhhHHHHHHHHHHhh---hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 124 DCDRRTELAKQRLLE---TQEVLSAEVAEKANAVHQLAEDIGKKLAKAEEL 171 (346)
Q Consensus 124 dcdRkI~~ak~RLe~---~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~L 171 (346)
..+.||..=..||.. +-++++..+.+...+|+.|..++..+..+...+
T Consensus 8 ~le~Ri~~LE~~lAfQE~tIe~LN~~v~~Qq~~I~~L~~ql~~L~~rl~~~ 58 (75)
T PRK04406 8 QLEERINDLECQLAFQEQTIEELNDALSQQQLLITKMQDQMKYVVGKVKNM 58 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 344444444444432 223344434444455555555555554444433
No 38
>PF12998 ING: Inhibitor of growth proteins N-terminal histone-binding; InterPro: IPR024610 Histones undergo numerous post-translational modifications, including acetylation and methylation, at residues which are then probable docking sites for various chromatin remodelling complexes. Inhibitor of growth proteins (INGs) specifically bind to residues that have been thus modified. INGs carry a well-characterised C-terminal PHD-type zinc-finger domain, binding with lysine 4-tri-methylated histone H3 (H3K4me3), as well as this N-terminal domain that binds unmodified H3 tails. Although these two regions can bind histones independently, together they increase the apparent association of the ING for the H3 tail. This entry represents the N-terminal histone binding domain found in inhibitor proteins.; PDB: 4AFL_A.
Probab=54.19 E-value=1.1e+02 Score=24.30 Aligned_cols=75 Identities=13% Similarity=0.164 Sum_probs=45.2
Q ss_pred HHHHHhhHHHHHHHHHHhhhhhhh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHH
Q psy4154 120 AFISDCDRRTELAKQRLLETQEVL---SAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKA 195 (346)
Q Consensus 120 ~~I~dcdRkI~~ak~RLe~~~~~~---~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~d 195 (346)
..|+++|.++......+...-... ....+........+-.+|...+.++..++.+ +|.-|..+...|+.--..+|
T Consensus 22 ~~irelD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I~~~~~~~~~l~de-Kv~lA~~~~d~v~~hi~rLD 99 (105)
T PF12998_consen 22 TLIRELDAKSQDLLEELDQQIQKFIKNHGSPSLSPEKRRELLKEIQEEYERALELSDE-KVALAQQAYDLVDRHIRRLD 99 (105)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHTCTTS--S-HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHhhcccccCChHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH
Confidence 456677777666665554422211 0000001124556778888889999998887 67778887777776655554
No 39
>KOG4403|consensus
Probab=52.76 E-value=3e+02 Score=28.87 Aligned_cols=40 Identities=28% Similarity=0.209 Sum_probs=26.1
Q ss_pred HHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 162 GKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDTY 201 (346)
Q Consensus 162 ~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~~ 201 (346)
..++.+|-.|-+-..--+.+..+.++|.+|..+.+.+.+.
T Consensus 286 erkl~ea~rl~elreg~e~e~~rkelE~lR~~L~kAEkel 325 (575)
T KOG4403|consen 286 ERKLDEAPRLSELREGVENETSRKELEQLRVALEKAEKEL 325 (575)
T ss_pred HHHHhhhhhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445567677664433445555678899988888777655
No 40
>KOG4727|consensus
Probab=52.45 E-value=85 Score=28.69 Aligned_cols=29 Identities=31% Similarity=0.608 Sum_probs=20.4
Q ss_pred hcccccccchhhhcccCcchhhhcccCCcccc
Q psy4154 213 QKLRVCEVCSAYLGIHDNDRRLADHFGGKLHL 244 (346)
Q Consensus 213 ~k~~vCdVCga~L~~~d~~~R~~dH~~gK~H~ 244 (346)
+-=.-|+||-+. +-|+-. +-||++||.|.
T Consensus 73 ~~GyyCdVCdcv--vKDSin-flDHiNgKkHq 101 (193)
T KOG4727|consen 73 KGGYYCDVCDCV--VKDSIN-FLDHINGKKHQ 101 (193)
T ss_pred cCceeeeeccee--ehhhHH-HHHHhccHHHH
Confidence 444789999853 445444 45999999995
No 41
>PRK09720 cybC cytochrome b562; Provisional
Probab=52.15 E-value=62 Score=26.91 Aligned_cols=41 Identities=17% Similarity=0.253 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHHHH---HHHHHhhcCCHHHHHHHHHHHHHHHH
Q psy4154 152 NAVHQLAEDIGKKLA---KAEELGAEGFVEESLKLMTEIEDLRK 192 (346)
Q Consensus 152 ~~i~~l~~~I~~~l~---~aE~LGeeG~VdeA~~~~~~~e~lka 192 (346)
.++..+..-++.++. .|..|..+|+++||.....++..+|.
T Consensus 50 ~e~K~y~~Gld~lI~qID~A~~La~~GkL~eAK~~a~~l~~~Rn 93 (100)
T PRK09720 50 PEMKDFRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRN 93 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 344455555555544 56679999999999977666665543
No 42
>KOG2391|consensus
Probab=51.77 E-value=90 Score=31.43 Aligned_cols=67 Identities=25% Similarity=0.332 Sum_probs=40.6
Q ss_pred HHHHHHHHHhhhHHH---HHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHH
Q psy4154 258 AMEHLQAFISDCDRR---TELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESL 325 (346)
Q Consensus 258 ~~r~L~~~i~d~DRr---I~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~ 325 (346)
.|..++.+...+-|+ |..++..|....+...+......+.++.|+.++.+-+.++|- -+.-+||+|.
T Consensus 226 eme~~~aeq~slkRt~EeL~~G~~kL~~~~etLEqq~~~L~~niDIL~~k~~eal~~~~n-~~~~~~D~~~ 295 (365)
T KOG2391|consen 226 EMERLQAEQESLKRTEEELNIGKQKLVAMKETLEQQLQSLQKNIDILKSKVREALEKAEN-LEALDIDEAI 295 (365)
T ss_pred HHHHHHHHHHHHHhhHHHHHhhHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcc-CcCCCchhhh
Confidence 466777777777665 666666666554444444445555566666666665555555 4555677664
No 43
>PRK09720 cybC cytochrome b562; Provisional
Probab=51.75 E-value=12 Score=31.10 Aligned_cols=32 Identities=22% Similarity=0.239 Sum_probs=28.2
Q ss_pred HHHHHHHhhcCCHHHHHHHHHHHhhhcCCCCC
Q psy4154 309 LAKAEELGAEGFVEESLKLMTEQTSSCSPGHV 340 (346)
Q Consensus 309 l~eaE~LGeeG~VdeA~~l~~~~e~l~~~~~~ 340 (346)
+.+|..|..+|+++||.....++..+..+.|-
T Consensus 66 ID~A~~La~~GkL~eAK~~a~~l~~~Rn~yHk 97 (100)
T PRK09720 66 IDDALKLANEGKVKEAQAAAEQLKTTRNSYHK 97 (100)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHH
Confidence 35678899999999999999999999988874
No 44
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=51.65 E-value=81 Score=24.90 Aligned_cols=58 Identities=17% Similarity=0.268 Sum_probs=43.4
Q ss_pred HHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 260 EHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 260 r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGee 318 (346)
..+..-+..+-.||.+++.-+...|. ....+.+...+|..|.++|..+-+-...+++.
T Consensus 24 kd~~~~~~~lk~Klq~ar~~i~~lpg-i~~s~eeq~~~i~~Le~~i~~k~~~L~~~~~~ 81 (83)
T PF07544_consen 24 KDLDTATGSLKHKLQKARAAIRELPG-IDRSVEEQEEEIEELEEQIRKKREVLQKFKER 81 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34556666777799999999987665 44446788889999999998887777776653
No 45
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=50.92 E-value=1.4e+02 Score=27.23 Aligned_cols=70 Identities=17% Similarity=0.147 Sum_probs=56.3
Q ss_pred HHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhh
Q psy4154 260 EHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTS 333 (346)
Q Consensus 260 r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~ 333 (346)
..|..+|+|++.-|..++.-++.... .......++..+...|.....+|+.+...|+=+-|...+.+...
T Consensus 26 ~~l~q~ird~e~~l~~a~~~~a~~~a----~~~~le~~~~~~~~~~~~~~~~A~~Al~~g~edLAr~al~~k~~ 95 (221)
T PF04012_consen 26 KMLEQAIRDMEEQLRKARQALARVMA----NQKRLERKLDEAEEEAEKWEKQAELALAAGREDLAREALQRKAD 95 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 67889999999999999888875322 23356678899999999999999999999998888776665443
No 46
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=50.62 E-value=1.4e+02 Score=24.51 Aligned_cols=74 Identities=12% Similarity=0.152 Sum_probs=34.8
Q ss_pred cccccchhhhcccC-cchhhhcccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHH
Q psy4154 216 RVCEVCSAYLGIHD-NDRRLADHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEK 294 (346)
Q Consensus 216 ~vCdVCga~L~~~d-~~~R~~dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~ 294 (346)
-++--|++|.-+.- ++.++-=++|+-.++.. .+...+..+++||..-+.++.. .
T Consensus 52 l~~~g~~~~~~~~i~~~~~v~v~iG~~~~ve~--------------~~~eA~~~l~~r~~~l~~~~~~-----------l 106 (129)
T cd00890 52 LVPLGAGLFVKAEVKDDDKVLVDLGTGVYVEK--------------SLEEAIEFLKKRLETLEKQIEK-----------L 106 (129)
T ss_pred EEecCCceEEEEEECCCCEEEEEecCCEEEEe--------------cHHHHHHHHHHHHHHHHHHHHH-----------H
Confidence 44446666665442 33334344454444442 2233345555566665555432 2
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy4154 295 ANAVHQLAEDIGKKLAKAEE 314 (346)
Q Consensus 295 ~~~i~~l~ekI~~ll~eaE~ 314 (346)
.+.+..+..+|..+...++.
T Consensus 107 ~~~~~~~~~~~~~l~~~l~~ 126 (129)
T cd00890 107 EKQLEKLQDQITELQEELQQ 126 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 23444445555555444443
No 47
>KOG2391|consensus
Probab=50.38 E-value=75 Score=31.99 Aligned_cols=24 Identities=29% Similarity=0.266 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcc
Q psy4154 180 SLKLMTEIEDLRKKKAEAEDTYRN 203 (346)
Q Consensus 180 A~~~~~~~e~lka~~deL~~~~rn 203 (346)
.+.+-+.+|-|+....|..+...|
T Consensus 262 ~~~L~~niDIL~~k~~eal~~~~n 285 (365)
T KOG2391|consen 262 LQSLQKNIDILKSKVREALEKAEN 285 (365)
T ss_pred HHHHHhhhHHHHHHHHHHHhhhcc
Confidence 444555555565555554444444
No 48
>KOG3454|consensus
Probab=50.04 E-value=17 Score=32.66 Aligned_cols=29 Identities=34% Similarity=0.682 Sum_probs=22.9
Q ss_pred cccccchhhhcccCcchhhhcccCCccccc
Q psy4154 216 RVCEVCSAYLGIHDNDRRLADHFGGKLHLG 245 (346)
Q Consensus 216 ~vCdVCga~L~~~d~~~R~~dH~~gK~H~g 245 (346)
.-||.|-.||. .|+..-...|.+|+.|..
T Consensus 4 YyCDYCdt~Lt-hDslsvRK~H~~GrkH~~ 32 (165)
T KOG3454|consen 4 YYCDYCDTYLT-HDSLSVRKTHCGGRKHKD 32 (165)
T ss_pred chhhhhhhhhh-cccHHHHHhhhhhHHHHH
Confidence 46999999987 555555579999999985
No 49
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=49.43 E-value=87 Score=24.73 Aligned_cols=56 Identities=18% Similarity=0.268 Sum_probs=39.0
Q ss_pred HHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 117 HLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 173 (346)
Q Consensus 117 ~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGe 173 (346)
.+..-+..+..||.+|+.-+...+. ++..+.+...+|..|.++|..+-.-...+++
T Consensus 25 d~~~~~~~lk~Klq~ar~~i~~lpg-i~~s~eeq~~~i~~Le~~i~~k~~~L~~~~~ 80 (83)
T PF07544_consen 25 DLDTATGSLKHKLQKARAAIRELPG-IDRSVEEQEEEIEELEEQIRKKREVLQKFKE 80 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCC-ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455667888999999988876543 4434456667888888888777666666554
No 50
>PRK15058 cytochrome b562; Provisional
Probab=49.26 E-value=44 Score=28.99 Aligned_cols=41 Identities=17% Similarity=0.238 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHH---HHHHHHhhcCCHHHHHHHHHHHHHHHHH
Q psy4154 153 AVHQLAEDIGKKL---AKAEELGAEGFVEESLKLMTEIEDLRKK 193 (346)
Q Consensus 153 ~i~~l~~~I~~~l---~~aE~LGeeG~VdeA~~~~~~~e~lka~ 193 (346)
++..+.+-++.++ ..++.|..+|++++|.....++..+|.+
T Consensus 79 e~K~Y~~G~d~Li~qID~a~~la~~GkL~eAK~~a~~l~~lR~e 122 (128)
T PRK15058 79 EMKDFRHGFDILVGQIDGALKLANEGKVKEAQAAAEQLKTTRNA 122 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHH
Confidence 4444444444443 4567799999999999988888877754
No 51
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=48.28 E-value=5.3e+02 Score=30.45 Aligned_cols=34 Identities=12% Similarity=0.116 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHH
Q psy4154 295 ANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLM 328 (346)
Q Consensus 295 ~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~ 328 (346)
..++..+..+|..+..+....|....+++-+...
T Consensus 798 ~~ei~~l~~qie~l~~~l~~~~~~~s~~ele~ei 831 (1311)
T TIGR00606 798 QMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEK 831 (1311)
T ss_pred HHHHHHHHHHHHHHHHHhccccccCCHHHHHHHH
Confidence 3444455555555555555555444444444433
No 52
>KOG4674|consensus
Probab=47.93 E-value=6.2e+02 Score=31.20 Aligned_cols=80 Identities=23% Similarity=0.233 Sum_probs=45.9
Q ss_pred HHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHh------hcCCHHHHHHHHHHH
Q psy4154 258 AMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELG------AEGFVEESLKLMTEQ 331 (346)
Q Consensus 258 ~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LG------eeG~VdeA~~l~~~~ 331 (346)
-++.++.++.-..--+..-+.+|..+.......+....+++-.+.++|..|..+++.|- ..|.=++...+..++
T Consensus 927 ~i~~yqe~~~s~eqsl~~~ks~lde~~~~~ea~ie~~~~k~tslE~~ls~L~~~~~~l~~e~~~~~k~~e~~~~~~~~e~ 1006 (1822)
T KOG4674|consen 927 QIREYQEEYSSLEQSLESVKSELDETRLELEAKIESLHKKITSLEEELSELEKEIENLREELELSTKGKEDKLLDLSREI 1006 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhhHHHHHHHh
Confidence 34445555555556666666666666544444444555556666666666666665554 455555566666666
Q ss_pred hhhcCC
Q psy4154 332 TSSCSP 337 (346)
Q Consensus 332 e~l~~~ 337 (346)
..++.+
T Consensus 1007 ~sl~ne 1012 (1822)
T KOG4674|consen 1007 SSLQNE 1012 (1822)
T ss_pred HHHHHH
Confidence 655543
No 53
>PRK02119 hypothetical protein; Provisional
Probab=46.94 E-value=1.3e+02 Score=23.33 Aligned_cols=47 Identities=11% Similarity=0.145 Sum_probs=20.4
Q ss_pred HhhHHHHHHHHHHhh---hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 124 DCDRRTELAKQRLLE---TQEVLSAEVAEKANAVHQLAEDIGKKLAKAEE 170 (346)
Q Consensus 124 dcdRkI~~ak~RLe~---~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~ 170 (346)
..+.||..=..||.. +-++++..+.+....|+.|..++..+..+...
T Consensus 6 ~~e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~~id~L~~ql~~L~~rl~~ 55 (73)
T PRK02119 6 NLENRIAELEMKIAFQENLLEELNQALIEQQFVIDKMQVQLRYMANKLKD 55 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344455444444432 22333333344444555554444444444333
No 54
>KOG4117|consensus
Probab=46.90 E-value=1e+02 Score=23.76 Aligned_cols=52 Identities=10% Similarity=0.310 Sum_probs=34.2
Q ss_pred HHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 258 AMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKA 312 (346)
Q Consensus 258 ~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~ea 312 (346)
-|..|..+|..+ ....+.|...+....-..+......|+.|+..|..||.++
T Consensus 13 Nmq~LTs~vQ~l---LQq~QDkFQtMSDQII~RiDDM~~riDDLEKnIaDLm~qa 64 (73)
T KOG4117|consen 13 NMQDLTSVVQGL---LQQTQDKFQTMSDQIIGRIDDMSSRIDDLEKNIADLMTQA 64 (73)
T ss_pred cHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHHHHHHHc
Confidence 345556666653 3455667766555544445667778888889999988775
No 55
>PF00642 zf-CCCH: Zinc finger C-x8-C-x5-C-x3-H type (and similar); InterPro: IPR000571 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents C-x8-C-x5-C-x3-H (CCCH) type Zinc finger (Znf) domains. Proteins containing CCCH Znf domains include Znf proteins from eukaryotes involved in cell cycle or growth phase-related regulation, e.g. human TIS11B (butyrate response factor 1), a probable regulatory protein involved in regulating the response to growth factors, and the mouse TTP growth factor-inducible nuclear protein, which has the same function. The mouse TTP protein is induced by growth factors. Another protein containing this domain is the human splicing factor U2AF 35kDa subunit, which plays a critical role in both constitutive and enhancer-dependent splicing by mediating essential protein-protein interactions and protein-RNA interactions required for 3' splice site selection. It has been shown that different CCCH-type Znf proteins interact with the 3'-untranslated region of various mRNA [, ]. This type of Znf is very often present in two copies. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding; PDB: 1M9O_A 1RGO_A 2CQE_A 2FC6_A 2D9M_A 2E5S_A 2RHK_C 2D9N_A 3D2S_A 3D2Q_C ....
Probab=46.53 E-value=9.4 Score=23.76 Aligned_cols=25 Identities=28% Similarity=0.716 Sum_probs=15.7
Q ss_pred Ccchhhhhhc-CCChhhhhhhCccCCCCCCccccH
Q psy4154 27 DSKVCKSFLL-TCCPHEILSSTRMDLGECPKIHDL 60 (346)
Q Consensus 27 D~~VCk~fL~-g~CPhdLF~nTk~dlG~C~k~H~~ 60 (346)
...+|++|+. |.||+. ..|+..|++
T Consensus 2 k~~~C~~f~~~g~C~~G---------~~C~f~H~~ 27 (27)
T PF00642_consen 2 KTKLCRFFMRTGTCPFG---------DKCRFAHGE 27 (27)
T ss_dssp TSSB-HHHHHTS--TTG---------GGSSSBSSG
T ss_pred ccccChhhccCCccCCC---------CCcCccCCC
Confidence 3578999999 999864 257777753
No 56
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=46.26 E-value=1.9e+02 Score=24.63 Aligned_cols=79 Identities=15% Similarity=0.183 Sum_probs=38.9
Q ss_pred ccccccchhhhccc-CcchhhhcccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHH
Q psy4154 215 LRVCEVCSAYLGIH-DNDRRLADHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAE 293 (346)
Q Consensus 215 ~~vCdVCga~L~~~-d~~~R~~dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~ 293 (346)
..++=-+|.|.-+. .+..++--|+|+-.++.+ .+..-+..+++||......+.... .....
T Consensus 58 ~lvplg~~~yv~~~v~~~~kV~v~lG~g~~vE~--------------~~~eA~~~l~~~~~~l~~~~~~l~----~~l~~ 119 (140)
T PRK03947 58 TLVPIGAGSFVKAKVKDKDKVIVSLGAGYSAEK--------------DLDEAIEILDKRKEELEKALEKLE----EALQK 119 (140)
T ss_pred EEEEcCCCcEEEEEecCCCeEEEEcCCCEEEEe--------------cHHHHHHHHHHHHHHHHHHHHHHH----HHHHH
Confidence 44555566665433 244456566777777663 334444555556655555443211 11223
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy4154 294 KANAVHQLAEDIGKKLAK 311 (346)
Q Consensus 294 ~~~~i~~l~ekI~~ll~e 311 (346)
+.+.+..+...+..+.++
T Consensus 120 ~~~~~~~~~~~l~~l~~~ 137 (140)
T PRK03947 120 LASRIAQLAQELQQLQQE 137 (140)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 344444455555544443
No 57
>PLN03229 acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
Probab=44.76 E-value=4.5e+02 Score=29.33 Aligned_cols=26 Identities=12% Similarity=0.047 Sum_probs=19.3
Q ss_pred hhhHHHHHHHHHHHHHhhHHHHHHHH
Q psy4154 109 YYDIDAMEHLQAFISDCDRRTELAKQ 134 (346)
Q Consensus 109 ~yE~e~~~~L~~~I~dcdRkI~~ak~ 134 (346)
....++..-+.++-.+++..|+.+-.
T Consensus 597 ~~~~~lkeki~~~~~Ei~~eie~v~~ 622 (762)
T PLN03229 597 ELDDDLKEKVEKMKKEIELELAGVLK 622 (762)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHh
Confidence 66677788888888888877776643
No 58
>PF10475 DUF2450: Protein of unknown function N-terminal domain (DUF2450) ; InterPro: IPR019515 This entry represents Vacuolar protein sorting-associated protein 54, and is thought to be involved in retrograde transport from early and late endosomes to late Golgi found in eukaryotes, but its function is not known.
Probab=44.39 E-value=3e+02 Score=26.50 Aligned_cols=83 Identities=17% Similarity=0.233 Sum_probs=51.4
Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhH---hH----------HHHHHHHHHHHHHHHHHHHHHHHHhhcC
Q psy4154 253 LAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSA---EV----------AEKANAVHQLAEDIGKKLAKAEELGAEG 319 (346)
Q Consensus 253 ~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~---~~----------~~~~~~i~~l~ekI~~ll~eaE~LGeeG 319 (346)
+.++..+..|+.-+.++-..|..+++.|....+.... .+ ......+..| ..+......++.|-++|
T Consensus 63 ~~a~~~v~el~~~l~~a~~~~~~~R~~L~~~~~~~~~~~L~Il~~~rkr~~l~~ll~~L~~i-~~v~~~~~~l~~ll~~~ 141 (291)
T PF10475_consen 63 FQAMSSVQELQDELEEALVICKNLRRNLKSADENLTKSGLEILRLQRKRQNLKKLLEKLEQI-KTVQQTQSRLQELLEEG 141 (291)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcC
Confidence 3455555556666666666666666666554333211 01 1222223333 46677778899999999
Q ss_pred CHHHHHHHHHHHhhhcC
Q psy4154 320 FVEESLKLMTEQTSSCS 336 (346)
Q Consensus 320 ~VdeA~~l~~~~e~l~~ 336 (346)
+...|..+..++..+-+
T Consensus 142 dy~~Al~li~~~~~~l~ 158 (291)
T PF10475_consen 142 DYPGALDLIEECQQLLE 158 (291)
T ss_pred CHHHHHHHHHHHHHHHH
Confidence 99999999998877653
No 59
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=44.33 E-value=2.9e+02 Score=26.40 Aligned_cols=41 Identities=20% Similarity=0.101 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhhcCCC
Q psy4154 294 KANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTSSCSPG 338 (346)
Q Consensus 294 ~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l~~~~ 338 (346)
...++..+.++.-....+.+.|.+.=+ ..++..-+.+++.+
T Consensus 147 ~e~e~~~i~e~~~~~~~~~~~L~~~l~----~ell~~yeri~~~~ 187 (239)
T COG1579 147 LEEEVAEIREEGQELSSKREELKEKLD----PELLSEYERIRKNK 187 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcC----HHHHHHHHHHHhcC
Confidence 334455555555555555555554432 34555555555544
No 60
>KOG0994|consensus
Probab=44.09 E-value=6.2e+02 Score=30.07 Aligned_cols=30 Identities=37% Similarity=0.325 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHhhHHHHHHHHHHhhhhhhh
Q psy4154 114 AMEHLQAFISDCDRRTELAKQRLLETQEVL 143 (346)
Q Consensus 114 ~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~ 143 (346)
....-+..|...++-|+.++..|.+.+++.
T Consensus 1578 Aq~~a~~ai~~a~~~~~~a~~~l~kv~~~t 1607 (1758)
T KOG0994|consen 1578 AQGEAQDAIQGADRDIRLAQQLLAKVQEET 1607 (1758)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 344557778888888888888887766654
No 61
>PLN02372 violaxanthin de-epoxidase
Probab=43.79 E-value=4e+02 Score=27.75 Aligned_cols=116 Identities=20% Similarity=0.208 Sum_probs=56.9
Q ss_pred CCccccHHHHHHHHHhhccCCcccccccccc--CCCchhhhhhhHHHHHHhhhcccchhhHHHHHHHHHHHHHhhHHHHH
Q psy4154 54 CPKIHDLALRADYEKASKNRDYYYDIDRMDL--GECPKIHDLALRADYEKASKNRDYYYDIDAMEHLQAFISDCDRRTEL 131 (346)
Q Consensus 54 C~k~H~~~lk~~ye~~~~~~~~~~~~~r~d~--g~c~k~h~~~lr~~~e~~~~~~~~~yE~e~~~~L~~~I~dcdRkI~~ 131 (346)
=|..|.+.|.+.- .+.|..+.++++-|- ||-| .+++.|.+-+.+-++.|.+
T Consensus 327 lP~~~~p~L~~Aa---~kvG~df~~F~~tDNsCgpep------------------------~l~~~l~~~~e~~e~~i~~ 379 (455)
T PLN02372 327 LPESIVPELEKAA---KKVGRDFSDFVRTDNTCGPEP------------------------PLLERLEKDVEEGEKTIVK 379 (455)
T ss_pred CChhhhHHHHHHH---HHcCCCHHHheeeCCCCCCCc------------------------hHHHHHHHHHHHHHHHHHH
Confidence 3456888888774 346777777777663 2333 3455555555555555554
Q ss_pred HHHHHhhhhhhhhHHHHHHHH----HHH--HHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q psy4154 132 AKQRLLETQEVLSAEVAEKAN----AVH--QLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDTYRN 203 (346)
Q Consensus 132 ak~RLe~~~~~~~~~~~~~~~----~i~--~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~~rn 203 (346)
--..+++.-+ .++....+ -+. .+.+.+.++-+..+..+- +-+-..|..++.++.+..+.+...+.
T Consensus 380 e~~~~~~e~~---~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~~~f~~----~lskee~~~l~~~~~~~~~vek~f~~ 450 (455)
T PLN02372 380 EARQIEEELE---KEVEKLGKEEESLFKRVALEEGLKELEQDEENFLK----ELSKEEKELLEKLKMEASEVEKLFGR 450 (455)
T ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hhhHHHHHHHHHHHHHHHHHHHHhhh
Confidence 4333322111 11111000 111 244444444444444433 23444555666666666666665543
No 62
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=43.36 E-value=1.4e+02 Score=22.58 Aligned_cols=50 Identities=10% Similarity=0.092 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 148 AEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDTY 201 (346)
Q Consensus 148 ~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~~ 201 (346)
......|++..+-|+.|--++..+ ....-..++..+...+..++.|...+
T Consensus 28 ~~~e~~l~ea~~~l~qMe~E~~~~----p~s~r~~~~~kl~~yr~~l~~lk~~l 77 (79)
T PF05008_consen 28 REIERDLDEAEELLKQMELEVRSL----PPSERNQYKSKLRSYRSELKKLKKEL 77 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHCTS-----HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhC----CHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344455555555555555555444 33556677788888888887776544
No 63
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=43.29 E-value=1e+02 Score=23.46 Aligned_cols=33 Identities=21% Similarity=0.277 Sum_probs=18.0
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 141 EVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 173 (346)
Q Consensus 141 ~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGe 173 (346)
++++..+.+...+|+.|..++..+..++..+..
T Consensus 21 e~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~~ 53 (69)
T PF04102_consen 21 EELNDVVTEQQRQIDRLQRQLRLLRERLRELED 53 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHT------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 344445556667777777777777777776653
No 64
>PHA02562 46 endonuclease subunit; Provisional
Probab=42.86 E-value=4.1e+02 Score=27.59 Aligned_cols=23 Identities=35% Similarity=0.343 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 296 NAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 296 ~~i~~l~ekI~~ll~eaE~LGee 318 (346)
..|..+..++..+-.+++.|...
T Consensus 351 ~~i~~~~~~~~~l~~ei~~l~~~ 373 (562)
T PHA02562 351 QSLITLVDKAKKVKAAIEELQAE 373 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Confidence 33444444445555555555444
No 65
>PF12854 PPR_1: PPR repeat
Probab=42.48 E-value=30 Score=22.35 Aligned_cols=19 Identities=26% Similarity=0.445 Sum_probs=15.8
Q ss_pred HHHHhhcCCHHHHHHHHHH
Q psy4154 168 AEELGAEGFVEESLKLMTE 186 (346)
Q Consensus 168 aE~LGeeG~VdeA~~~~~~ 186 (346)
+..+...|+|++|.+++.+
T Consensus 14 I~~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 14 IDGYCKAGRVDEAFELFDE 32 (34)
T ss_pred HHHHHHCCCHHHHHHHHHh
Confidence 4567889999999998865
No 66
>COG3783 CybC Soluble cytochrome b562 [Energy production and conversion]
Probab=41.75 E-value=42 Score=27.74 Aligned_cols=27 Identities=30% Similarity=0.474 Sum_probs=22.4
Q ss_pred HHHHHHhhcCCHHHHHHHHHHHHHHHH
Q psy4154 166 AKAEELGAEGFVEESLKLMTEIEDLRK 192 (346)
Q Consensus 166 ~~aE~LGeeG~VdeA~~~~~~~e~lka 192 (346)
.++..|..+|++++|......+..+|-
T Consensus 67 D~a~klaqeGnl~eAKaaak~l~d~Rn 93 (100)
T COG3783 67 DKADKLAQEGNLDEAKAAAKTLKDTRN 93 (100)
T ss_pred HHHHHHHHcCChHHHHHHHHHHHHHHH
Confidence 456789999999999999888887663
No 67
>PRK04406 hypothetical protein; Provisional
Probab=40.79 E-value=1.6e+02 Score=22.91 Aligned_cols=50 Identities=12% Similarity=0.201 Sum_probs=28.6
Q ss_pred hhHHHHHHHHHhhhh---hhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 268 DCDRRTELAKQRLLE---TQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 317 (346)
Q Consensus 268 d~DRrI~~ak~RL~~---t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGe 317 (346)
.++.||..=..||+. |-++.+.........|+.|..++..|..+...++.
T Consensus 8 ~le~Ri~~LE~~lAfQE~tIe~LN~~v~~Qq~~I~~L~~ql~~L~~rl~~~~~ 60 (75)
T PRK04406 8 QLEERINDLECQLAFQEQTIEELNDALSQQQLLITKMQDQMKYVVGKVKNMDS 60 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 344455544455543 23333444456666777777777777777766654
No 68
>KOG0250|consensus
Probab=40.71 E-value=6.4e+02 Score=29.30 Aligned_cols=20 Identities=5% Similarity=0.129 Sum_probs=8.6
Q ss_pred HHHHHHhhhHHHHHHHHHhh
Q psy4154 261 HLQAFISDCDRRTELAKQRL 280 (346)
Q Consensus 261 ~L~~~i~d~DRrI~~ak~RL 280 (346)
.+.+-++-+.+.|...++++
T Consensus 369 ~~k~~~d~l~k~I~~~~~~~ 388 (1074)
T KOG0250|consen 369 KLKKEVDRLEKQIADLEKQT 388 (1074)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444
No 69
>PF12269 zf-CpG_bind_C: CpG binding protein zinc finger C terminal domain; InterPro: IPR022056 This domain family is found in eukaryotes, and is approximately 240 amino acids in length. This domain is the zinc finger domain of a CpG binding DNA methyltransferase protein. It contains a CxxC motif which forms the zinc finger and binds to DNA.
Probab=40.56 E-value=48 Score=31.63 Aligned_cols=53 Identities=21% Similarity=0.274 Sum_probs=36.3
Q ss_pred cccccccchhhhccc--CcchhhhcccC------CccccchHHHHHHHHHHHHHHHHHHHHhhhH
Q psy4154 214 KLRVCEVCSAYLGIH--DNDRRLADHFG------GKLHLGFIKIREKLAELQAMEHLQAFISDCD 270 (346)
Q Consensus 214 k~~vCdVCga~L~~~--d~~~R~~dH~~------gK~H~g~~~lR~~~~El~~~r~L~~~i~d~D 270 (346)
+...=+|||+-|+.+ +++ ++|- ...|.+|.++|..--+++.++.+.+|-.-++
T Consensus 157 kv~~~~VCG~Pl~~~vf~~t----~~~C~~~k~~C~kH~~Wek~rraeidlEr~~~~~kl~el~e 217 (236)
T PF12269_consen 157 KVPDTEVCGCPLVRNVFETT----GEFCRLPKKKCKKHYNWEKLRRAEIDLERVRQWLKLDELFE 217 (236)
T ss_pred CCCcccccCCccccccccCC----cccccCcchhhccccchHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344557999999943 332 3331 4889999999888777777777776655443
No 70
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=39.75 E-value=13 Score=22.72 Aligned_cols=25 Identities=28% Similarity=0.716 Sum_probs=18.3
Q ss_pred cccccchhhhcccCcchhhhcccCCccc
Q psy4154 216 RVCEVCSAYLGIHDNDRRLADHFGGKLH 243 (346)
Q Consensus 216 ~vCdVCga~L~~~d~~~R~~dH~~gK~H 243 (346)
..|++|+-. |-+..-+..|+.||.|
T Consensus 2 ~~C~~C~k~---f~~~~~~~~H~~sk~H 26 (27)
T PF12171_consen 2 FYCDACDKY---FSSENQLKQHMKSKKH 26 (27)
T ss_dssp CBBTTTTBB---BSSHHHHHCCTTSHHH
T ss_pred CCcccCCCC---cCCHHHHHHHHccCCC
Confidence 369999955 4455567889988876
No 71
>PRK00736 hypothetical protein; Provisional
Probab=39.68 E-value=1.5e+02 Score=22.55 Aligned_cols=31 Identities=26% Similarity=0.299 Sum_probs=16.4
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 141 EVLSAEVAEKANAVHQLAEDIGKKLAKAEEL 171 (346)
Q Consensus 141 ~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~L 171 (346)
++++..+.+..+.|+.|..++..+..+...+
T Consensus 22 e~Ln~~v~~Qq~~i~~L~~ql~~L~~rl~~~ 52 (68)
T PRK00736 22 EELSDQLAEQWKTVEQMRKKLDALTERFLSL 52 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3344444455556666666655555555443
No 72
>PRK04325 hypothetical protein; Provisional
Probab=38.37 E-value=1.9e+02 Score=22.45 Aligned_cols=31 Identities=16% Similarity=0.225 Sum_probs=16.1
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 141 EVLSAEVAEKANAVHQLAEDIGKKLAKAEEL 171 (346)
Q Consensus 141 ~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~L 171 (346)
++++..+.+...+|+.|..++..+..+...+
T Consensus 26 e~LN~vv~~Qq~~I~~L~~ql~~L~~rl~~~ 56 (74)
T PRK04325 26 DGLNATVARQQQTLDLLQAQLRLLYQQMRDA 56 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3444444445555666665555555555444
No 73
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=38.36 E-value=3.2e+02 Score=25.19 Aligned_cols=81 Identities=14% Similarity=0.108 Sum_probs=59.9
Q ss_pred HHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHH
Q psy4154 116 EHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKA 195 (346)
Q Consensus 116 ~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~d 195 (346)
..|..+|.+++.-|..++.=|..... .......++.++...|.....+|+..-..|+=|-|...+.+.........
T Consensus 27 ~~l~q~irem~~~l~~ar~~lA~~~a----~~k~~e~~~~~~~~~~~~~~~~A~~Al~~G~EdLAr~Al~~k~~~~~~~~ 102 (219)
T TIGR02977 27 KMIRLIIQEMEDTLVEVRTTSARTIA----DKKELERRVSRLEAQVADWQEKAELALSKGREDLARAALIEKQKAQELAE 102 (219)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 46778888888877777776654322 12234578899999999999999999999999999888776666665555
Q ss_pred HHHHh
Q psy4154 196 EAEDT 200 (346)
Q Consensus 196 eL~~~ 200 (346)
.|...
T Consensus 103 ~l~~~ 107 (219)
T TIGR02977 103 ALERE 107 (219)
T ss_pred HHHHH
Confidence 55443
No 74
>PF06825 HSBP1: Heat shock factor binding protein 1; InterPro: IPR009643 Heat shock factor binding protein 1 (HSBP1) appears to be a negative regulator of the heat shock response [].; PDB: 3CI9_A.
Probab=37.90 E-value=98 Score=22.87 Aligned_cols=49 Identities=10% Similarity=0.264 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 113 DAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKA 168 (346)
Q Consensus 113 e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~a 168 (346)
++-.+++.++.++.-|. +.....+-..+.+-...|++|+..|..++.++
T Consensus 3 elt~~v~~lL~qmq~kF-------q~mS~~I~~riDeM~~RIDdLE~si~dl~~qa 51 (54)
T PF06825_consen 3 ELTAFVQNLLQQMQDKF-------QTMSDQILGRIDEMSSRIDDLEKSIADLMTQA 51 (54)
T ss_dssp HHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHCCHHHH-------
T ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhc
Confidence 45556666776664443 33333333333344455566666666555544
No 75
>PRK10564 maltose regulon periplasmic protein; Provisional
Probab=37.82 E-value=37 Score=33.57 Aligned_cols=32 Identities=22% Similarity=0.165 Sum_probs=26.7
Q ss_pred HHHHHHHHHhhcCCHHHHHHHHHHHhhhcCCC
Q psy4154 307 KKLAKAEELGAEGFVEESLKLMTEQTSSCSPG 338 (346)
Q Consensus 307 ~ll~eaE~LGeeG~VdeA~~l~~~~e~l~~~~ 338 (346)
=-...|+++.+.|+|++|+.|+.+++.|=..|
T Consensus 259 Yy~~aI~~AVk~gDi~KAL~LldEAe~LG~~~ 290 (303)
T PRK10564 259 YFNQAIKQAVKKGDVDKALKLLDEAERLGSTS 290 (303)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHhCCch
Confidence 34466778889999999999999999986655
No 76
>KOG0977|consensus
Probab=37.29 E-value=5.5e+02 Score=27.57 Aligned_cols=15 Identities=27% Similarity=0.197 Sum_probs=10.4
Q ss_pred HHHHHHHHHHhhhcC
Q psy4154 322 EESLKLMTEQTSSCS 336 (346)
Q Consensus 322 deA~~l~~~~e~l~~ 336 (346)
++++.+|.+.+.||-
T Consensus 356 eec~~l~~Elq~LlD 370 (546)
T KOG0977|consen 356 EECQQLSVELQKLLD 370 (546)
T ss_pred HHHHHHHHHHHHhhc
Confidence 567777777777763
No 77
>PRK02793 phi X174 lysis protein; Provisional
Probab=37.24 E-value=1.9e+02 Score=22.26 Aligned_cols=28 Identities=14% Similarity=0.188 Sum_probs=12.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 143 LSAEVAEKANAVHQLAEDIGKKLAKAEE 170 (346)
Q Consensus 143 ~~~~~~~~~~~i~~l~~~I~~~l~~aE~ 170 (346)
++..+.+....|+.+..++..+..+...
T Consensus 27 Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~ 54 (72)
T PRK02793 27 LNVTVTAHEMEMAKLRDHLRLLTEKLKA 54 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3333444444555555555444444433
No 78
>COG5136 U1 snRNP-specific protein C [RNA processing and modification]
Probab=36.97 E-value=26 Score=31.59 Aligned_cols=35 Identities=31% Similarity=0.461 Sum_probs=25.7
Q ss_pred cccccchhhhcccCcchhhhcccCCccccchHHHHHHHH
Q psy4154 216 RVCEVCSAYLGIHDNDRRLADHFGGKLHLGFIKIREKLA 254 (346)
Q Consensus 216 ~vCdVCga~L~~~d~~~R~~dH~~gK~H~g~~~lR~~~~ 254 (346)
.-|+.|-.+|. .|..+-...|.+|+.|. .++..|.
T Consensus 4 Y~CeyC~~~Lt-hD~lsvRk~H~~G~~H~---~~~~dYY 38 (188)
T COG5136 4 YFCEYCNKMLT-HDRLSVRKMHCGGAKHG---LMRKDYY 38 (188)
T ss_pred hHHHHHHHHHh-ccHHHHHHHhhhhHHHH---HHHHHHH
Confidence 46999999986 44444457899999997 4666664
No 79
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=36.93 E-value=1.5e+02 Score=23.97 Aligned_cols=46 Identities=20% Similarity=0.359 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 153 AVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDT 200 (346)
Q Consensus 153 ~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~ 200 (346)
+++.+..+-+.+.+++-.+-..| +++..+++++..++.++..++..
T Consensus 44 ~~e~lr~~rN~~sk~I~~~~~~~--~~~~~l~~e~~~lk~~i~~le~~ 89 (108)
T PF02403_consen 44 ELEELRAERNELSKEIGKLKKAG--EDAEELKAEVKELKEEIKELEEQ 89 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHCHTT--CCTHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhHHHHHHHHHhhCc--ccHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444445555555555 44555555555555555555443
No 80
>COG1340 Uncharacterized archaeal coiled-coil protein [Function unknown]
Probab=36.82 E-value=4.3e+02 Score=26.13 Aligned_cols=49 Identities=27% Similarity=0.371 Sum_probs=27.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHH
Q psy4154 144 SAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEA 197 (346)
Q Consensus 144 ~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL 197 (346)
...++....+.++++.+..++..++..|-++ ==.+..+|..+++.++++
T Consensus 33 ~~~~~~~~ekRdeln~kvrE~~e~~~elr~~-----rdeineev~elK~kR~ei 81 (294)
T COG1340 33 RKEASELAEKRDELNAKVRELREKAQELREE-----RDEINEEVQELKEKRDEI 81 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHH
Confidence 3334455566667777777777776666432 223445555555555544
No 81
>PF14282 FlxA: FlxA-like protein
Probab=36.46 E-value=1.5e+02 Score=24.52 Aligned_cols=53 Identities=15% Similarity=0.231 Sum_probs=33.6
Q ss_pred HHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 263 QAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEEL 315 (346)
Q Consensus 263 ~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~L 315 (346)
...|..+.++|..-++.|.............+..++..|..+|..|-.++-.|
T Consensus 18 ~~~I~~L~~Qi~~Lq~ql~~l~~~~~~~~e~k~~q~q~Lq~QI~~LqaQI~ql 70 (106)
T PF14282_consen 18 DSQIEQLQKQIKQLQEQLQELSQDSDLDAEQKQQQIQLLQAQIQQLQAQIAQL 70 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcccCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666777777777777765433222223456667777788887777777655
No 82
>PF06220 zf-U1: U1 zinc finger; InterPro: IPR013085 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. C2H2-type (classical) zinc fingers (Znf) were the first class to be characterised. They contain a short beta hairpin and an alpha helix (beta/beta/alpha structure), where a single zinc atom is held in place by Cys(2)His(2) (C2H2) residues in a tetrahedral array. C2H2 Znf's can be divided into three groups based on the number and pattern of fingers: triple-C2H2 (binds single ligand), multiple-adjacent-C2H2 (binds multiple ligands), and separated paired-C2H2 []. C2H2 Znf's are the most common DNA-binding motifs found in eukaryotic transcription factors, and have also been identified in prokaryotes []. Transcription factors usually contain several Znf's (each with a conserved beta/beta/alpha structure) capable of making multiple contacts along the DNA, where the C2H2 Znf motifs recognise DNA sequences by binding to the major groove of DNA via a short alpha-helix in the Znf, the Znf spanning 3-4 bases of the DNA []. C2H2 Znf's can also bind to RNA and protein targets []. This entry represents a C2H2-type zinc finger motif found in several U1 small nuclear ribonucleoprotein C (U1-C) proteins. Some proteins contain multiple copies of this motif. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' splice site is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' splice site. Binding of TIA-1 in the vicinity of a 5' splice site helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2VRD_A.
Probab=36.20 E-value=10 Score=25.77 Aligned_cols=29 Identities=24% Similarity=0.363 Sum_probs=13.7
Q ss_pred cccccchhhhcccCcchhhhcccCCccccc
Q psy4154 216 RVCEVCSAYLGIHDNDRRLADHFGGKLHLG 245 (346)
Q Consensus 216 ~vCdVCga~L~~~d~~~R~~dH~~gK~H~g 245 (346)
.-|+.|..+|....+.. ...|..|..|..
T Consensus 4 yyCdyC~~~~~~d~~~~-Rk~H~~G~kH~~ 32 (38)
T PF06220_consen 4 YYCDYCKKYLTHDSPSI-RKQHERGWKHKE 32 (38)
T ss_dssp -B-TTT--B-S--SHHH-HHHHT--THHHH
T ss_pred eecccccceecCCChHH-HHHhhccHHHHH
Confidence 46999999986433222 358988988865
No 83
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=35.97 E-value=8.4 Score=24.66 Aligned_cols=26 Identities=23% Similarity=0.592 Sum_probs=18.9
Q ss_pred ccccchhhhcccCcchhhhcccCCccccc
Q psy4154 217 VCEVCSAYLGIHDNDRRLADHFGGKLHLG 245 (346)
Q Consensus 217 vCdVCga~L~~~d~~~R~~dH~~gK~H~g 245 (346)
-|++|+.++. +..-...|+.|+.|..
T Consensus 5 ~C~~C~~~~~---~~~~~~~H~~gk~H~~ 30 (35)
T smart00451 5 YCKLCNVTFT---DEISVEAHLKGKKHKK 30 (35)
T ss_pred EccccCCccC---CHHHHHHHHChHHHHH
Confidence 4999998766 2223479999998854
No 84
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=35.63 E-value=3e+02 Score=25.05 Aligned_cols=64 Identities=20% Similarity=0.281 Sum_probs=0.0
Q ss_pred hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 138 ETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA-EGFVEESLKLMTEIEDLRKKKAEAEDTY 201 (346)
Q Consensus 138 ~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGe-eG~VdeA~~~~~~~e~lka~~deL~~~~ 201 (346)
+.............+++..+..+|..+..+++.+.. .+.-++-..++++++.|+.++.+|....
T Consensus 62 ~~~~~~~~~~~~l~~~~~~~~~~i~~l~~~i~~~~~~r~~~~eR~~~l~~l~~l~~~~~~l~~el 126 (188)
T PF03962_consen 62 QAKQKRQNKLEKLQKEIEELEKKIEELEEKIEEAKKGREESEEREELLEELEELKKELKELKKEL 126 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHHHHHHHHHH
No 85
>PF02151 UVR: UvrB/uvrC motif; InterPro: IPR001943 During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products []. UvrB and UvrC share a common domain of around 35 amino acids, the so called UVR domain. This domain in UvrB can interact with the homologous domain in UvrC throughout a putative coiled coil structure. This interaction is important for the incision of the damaged strand [].; GO: 0003677 DNA binding, 0004518 nuclease activity, 0006289 nucleotide-excision repair; PDB: 3PXG_D 3PXI_C 1E52_B 1QOJ_B 2D7D_B 2NMV_B.
Probab=35.62 E-value=1.3e+02 Score=19.85 Aligned_cols=34 Identities=26% Similarity=0.232 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHH
Q psy4154 159 EDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRK 192 (346)
Q Consensus 159 ~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka 192 (346)
+.|..+..+.+.+.+..+.+.|..+-.++..|+.
T Consensus 2 ~~i~~l~~~m~~a~~~~dfE~Aa~~Rd~i~~l~~ 35 (36)
T PF02151_consen 2 KLIKELEEKMEEAVENEDFEKAARLRDQIKALKK 35 (36)
T ss_dssp HHHHHHHHHHHHHHHTT-CHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHcCHHHHHHHHHHHHHHHc
Confidence 4567777788888888889998887777776654
No 86
>PF09969 DUF2203: Uncharacterized conserved protein (DUF2203); InterPro: IPR018699 This family has no known function.
Probab=35.54 E-value=2.8e+02 Score=23.57 Aligned_cols=26 Identities=23% Similarity=0.382 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 149 EKANAVHQLAEDIGKKLAKAEELGAE 174 (346)
Q Consensus 149 ~~~~~i~~l~~~I~~~l~~aE~LGee 174 (346)
....++..+..+|...+.+++.+|-+
T Consensus 47 ~~~~~~~~~~~~~~~~i~~i~~~Gv~ 72 (120)
T PF09969_consen 47 GLEAELEELEARLRELIDEIEELGVE 72 (120)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHcCcE
Confidence 34556777777777777777777753
No 87
>KOG0995|consensus
Probab=35.09 E-value=6.1e+02 Score=27.40 Aligned_cols=70 Identities=19% Similarity=0.229 Sum_probs=34.0
Q ss_pred HHHHHHHHhhhHHHHHHHHHhhhhhhhhhh---HhHHHHHHHHHHHHHHHHHHHHHHHHHhh--cCCHHHHHHHHHHHh
Q psy4154 259 MEHLQAFISDCDRRTELAKQRLLETQEVLS---AEVAEKANAVHQLAEDIGKKLAKAEELGA--EGFVEESLKLMTEQT 332 (346)
Q Consensus 259 ~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~---~~~~~~~~~i~~l~ekI~~ll~eaE~LGe--eG~VdeA~~l~~~~e 332 (346)
...|+..++++-..|+.. ++. +++.. ....+..+.|..|+.++..+.+++=.+-- ++..++-+....+..
T Consensus 310 ~e~lq~~~d~Lk~~Ie~Q--~iS--~~dve~mn~Er~~l~r~l~~i~~~~d~l~k~vw~~~l~~~~~f~~le~~~~~~~ 384 (581)
T KOG0995|consen 310 IEKLQKENDELKKQIELQ--GIS--GEDVERMNLERNKLKRELNKIQSELDRLSKEVWELKLEIEDFFKELEKKFIDLN 384 (581)
T ss_pred HHHHHHHHHHHHHHHHhc--CCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 345556666665555543 221 12221 22234555566666666666666544433 444444444444443
No 88
>PF14282 FlxA: FlxA-like protein
Probab=34.53 E-value=1.6e+02 Score=24.34 Aligned_cols=20 Identities=15% Similarity=0.278 Sum_probs=9.9
Q ss_pred HHHHHHhhHHHHHHHHHHhh
Q psy4154 119 QAFISDCDRRTELAKQRLLE 138 (346)
Q Consensus 119 ~~~I~dcdRkI~~ak~RLe~ 138 (346)
...|..+.++|..-+..|..
T Consensus 18 ~~~I~~L~~Qi~~Lq~ql~~ 37 (106)
T PF14282_consen 18 DSQIEQLQKQIKQLQEQLQE 37 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34445555555555555543
No 89
>PF03980 Nnf1: Nnf1 ; InterPro: IPR007128 NNF1 is an essential yeast gene required for proper spindle orientation, nucleolar and nuclear envelope structure and mRNA export [].
Probab=34.36 E-value=2.5e+02 Score=22.79 Aligned_cols=68 Identities=19% Similarity=0.230 Sum_probs=41.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhh-------hhhHh-------HHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 248 KIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQE-------VLSAE-------VAEKANAVHQLAEDIGKKLAKAE 313 (346)
Q Consensus 248 ~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~e-------e~~~~-------~~~~~~~i~~l~ekI~~ll~eaE 313 (346)
.++.+|+.+-.-+.+..-++++|.=|..|+.|...... ..++. .+.+...++.|+..+..+..+-.
T Consensus 18 ~~~~Ef~~I~~Er~v~~kLneLd~Li~eA~~r~~~~~~~~~~~~~~l~P~~~i~a~l~~~~~~~~~~L~~~l~~l~~eN~ 97 (109)
T PF03980_consen 18 NCKKEFEEILEERDVVEKLNELDKLIEEAKERKNSGEREKPVWRHSLTPEEDIRAHLAPYKKKEREQLNARLQELEEENE 97 (109)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHhHhccccCCCCCCCCCChHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36777777777777888888888888888888862111 01111 13455555566666665555544
Q ss_pred HH
Q psy4154 314 EL 315 (346)
Q Consensus 314 ~L 315 (346)
.|
T Consensus 98 ~L 99 (109)
T PF03980_consen 98 AL 99 (109)
T ss_pred HH
Confidence 44
No 90
>PF09543 DUF2379: Protein of unknown function (DUF2379); InterPro: IPR011753 This family consists of at least 7 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. The function is unknown.
Probab=34.14 E-value=1.5e+02 Score=25.55 Aligned_cols=24 Identities=17% Similarity=0.259 Sum_probs=17.4
Q ss_pred HHHHHHHHhhcCCHHHHHHHHHHH
Q psy4154 164 KLAKAEELGAEGFVEESLKLMTEI 187 (346)
Q Consensus 164 ~l~~aE~LGeeG~VdeA~~~~~~~ 187 (346)
.+-.+-.+-..|++|.|-+.|..+
T Consensus 77 al~r~~~~~daGD~dgARq~m~dv 100 (121)
T PF09543_consen 77 ALHRMYRLRDAGDLDGARQEMRDV 100 (121)
T ss_pred HHHHHHHHhhccCHHHHHHHHHHH
Confidence 344556677889999998777654
No 91
>PF03357 Snf7: Snf7; InterPro: IPR005024 This is a family of eukaryotic proteins which are variously described as either hypothetical protein, developmental protein or related to yeast SNF7. The family contains human CHMP1. CHMP1 (CHromatin Modifying Protein; CHarged Multivesicular body Protein), is encoded by an alternative open reading frame in the PRSM1 gene [] and is conserved in both complex and simple eukaryotes. CHMP1 contains a predicted bipartite nuclear localisation signal and distributes as distinct forms to the cytoplasm and the nuclear matrix in all cell lines tested. Human CHMP1 is strongly implicated in multivesicular body formation. A multivesicular body is a vesicle-filled endosome that targets proteins to the interior of lysosomes. Immunocytochemistry and biochemical fractionation localise CHMP1 to early endosomes and CHMP1 physically interacts with SKD1/VPS4, a highly conserved protein directly linked to multivesicular body sorting in yeast. Similar to the action of a mutant SKD1 protein, over expression of a fusion derivative of human CHMP1 dilates endosomal compartments and disrupts the normal distribution of several endosomal markers. Genetic studies in Saccharomyces cerevisiae (Baker's yeast) further support a conserved role of CHMP1 in vesicle trafficking. Deletion of CHM1, the budding yeast homologue of CHMP1, results in defective sorting of carboxypeptidases S and Y and produces abnormal, multi-lamellar prevacuolar compartments. This phenotype classifies CHM1 as a member of the class E vacuolar protein sorting genes []. ; GO: 0015031 protein transport; PDB: 2V6X_B 2W2U_D 2GD5_D 3FRT_B 3FRV_A 4ABM_D 3EAB_H 3HTU_D.
Probab=34.07 E-value=2.9e+02 Score=23.59 Aligned_cols=51 Identities=18% Similarity=0.162 Sum_probs=39.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 150 KANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDT 200 (346)
Q Consensus 150 ~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~ 200 (346)
....+..|+.+|..+..++-.+-..|+.+.|..++...-.++..++.+...
T Consensus 13 L~~~~~~le~~i~~~~~~~k~~~~~~~~~~A~~~lk~~k~~~k~~~~~~~~ 63 (171)
T PF03357_consen 13 LEKQIKRLEKKIKKLEKKAKKAIKKGNKERAKIYLKRKKRLEKQLEKLLNQ 63 (171)
T ss_dssp HHHHHHHHHHHHHHCHHHHHHHHCTT-HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCChHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345667788888888888899999999999988888887777777666543
No 92
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=33.85 E-value=2.3e+02 Score=22.15 Aligned_cols=19 Identities=26% Similarity=0.468 Sum_probs=7.7
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy4154 153 AVHQLAEDIGKKLAKAEEL 171 (346)
Q Consensus 153 ~i~~l~~~I~~~l~~aE~L 171 (346)
+++.+..+++.++.++..+
T Consensus 41 ~~~~i~~e~~~ll~~~n~l 59 (90)
T PF06103_consen 41 QVDPITKEINDLLHNTNEL 59 (90)
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444443
No 93
>KOG4797|consensus
Probab=33.71 E-value=1.1e+02 Score=25.83 Aligned_cols=63 Identities=17% Similarity=0.095 Sum_probs=42.2
Q ss_pred hhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhhcC
Q psy4154 268 DCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTSSCS 336 (346)
Q Consensus 268 d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l~~ 336 (346)
-+|-||+.|-.=+. | -.-=.+.++++.|.++|.+|....-.|-.++..-+...--.++..|.+
T Consensus 45 aIDNKIeQAMDLVK-t-----HLmfAVREEVe~Lk~qI~eL~er~~~Le~EN~lLk~~~spe~L~ql~~ 107 (123)
T KOG4797|consen 45 AIDNKIEQAMDLVK-T-----HLMFAVREEVEVLKEQIRELEERNSALERENSLLKTLASPEQLAQLPA 107 (123)
T ss_pred eechHHHHHHHHHH-H-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHH
Confidence 35668887744332 1 112356678888999999999999999888877766655555544443
No 94
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=33.50 E-value=3.2e+02 Score=26.43 Aligned_cols=68 Identities=19% Similarity=0.230 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhHHH-HHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 248 KIREKLAELQAMEHLQAFISDCDRR-TELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEEL 315 (346)
Q Consensus 248 ~lR~~~~El~~~r~L~~~i~d~DRr-I~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~L 315 (346)
+.|..+..++.+...-+.+.--..+ +..|+..|....+...........+++.++.+|.+..++.-.|
T Consensus 39 ~Vr~lLqqy~~~~~~i~~le~~~~~~l~~ak~eLqe~eek~e~~l~~Lq~ql~~l~akI~k~~~el~~L 107 (258)
T PF15397_consen 39 KVRKLLQQYDIYRTAIDILEYSNHKQLQQAKAELQEWEEKEESKLSKLQQQLEQLDAKIQKTQEELNFL 107 (258)
T ss_pred HHHHHHHHHHHHHHHHHHHHccChHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
No 95
>PRK10698 phage shock protein PspA; Provisional
Probab=33.49 E-value=4e+02 Score=24.84 Aligned_cols=161 Identities=14% Similarity=0.079 Sum_probs=87.0
Q ss_pred HHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHH
Q psy4154 116 EHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKA 195 (346)
Q Consensus 116 ~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~d 195 (346)
..|..+|.+++.-+...+.=+.... +.......++..+...|...-.+|+..-..|+=|-|-..+.+-........
T Consensus 27 k~l~q~i~em~~~l~~~r~alA~~~----A~~k~~er~~~~~~~~~~~~e~kA~~Al~~G~EdLAr~AL~~K~~~~~~~~ 102 (222)
T PRK10698 27 KLVRLMIQEMEDTLVEVRSTSARAL----AEKKQLTRRIEQAEAQQVEWQEKAELALRKEKEDLARAALIEKQKLTDLIA 102 (222)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 3555666666555555543333221 112234578889999999999999999999999999886655444444444
Q ss_pred HHHHhhccCCCCCchhhhcccccccchhhhcccCcchhhhcccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHH
Q psy4154 196 EAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRLADHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTEL 275 (346)
Q Consensus 196 eL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~~dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ 275 (346)
.|......... . .+-|. ....+|+..+.+.+.-+. .++ ...+...
T Consensus 103 ~l~~~~~~~~~----~-----------------------~~~L~----~~l~~L~~ki~eak~k~~--~L~--aR~~~A~ 147 (222)
T PRK10698 103 TLEHEVTLVDE----T-----------------------LARMK----KEIGELENKLSETRARQQ--ALM--LRHQAAS 147 (222)
T ss_pred HHHHHHHHHHH----H-----------------------HHHHH----HHHHHHHHHHHHHHHHHH--HHH--HHHHHHH
Confidence 44433211100 0 00000 011245555555443221 111 1123444
Q ss_pred HHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 276 AKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 317 (346)
Q Consensus 276 ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGe 317 (346)
++.++..+-.... ...-....+.+.++|..+..+++..+.
T Consensus 148 a~~~~~~~~~~~~--~~~a~~~f~rmE~ki~~~Ea~aea~~~ 187 (222)
T PRK10698 148 SSRDVRRQLDSGK--LDEAMARFESFERRIDQMEAEAESHGF 187 (222)
T ss_pred HHHHHHHHHhCCC--cchHHHHHHHHHHHHHHHHHHHhHhhc
Confidence 4444443321111 123345667789999999999998865
No 96
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=33.49 E-value=1.5e+02 Score=22.56 Aligned_cols=32 Identities=19% Similarity=0.263 Sum_probs=18.1
Q ss_pred hhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 287 LSAEVAEKANAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 287 ~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGee 318 (346)
.+........+|+.|..++..|..++..+...
T Consensus 23 Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~~~ 54 (69)
T PF04102_consen 23 LNDVVTEQQRQIDRLQRQLRLLRERLRELEDP 54 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHT-------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 33444566677888888888888888877643
No 97
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=33.37 E-value=2.5e+02 Score=28.50 Aligned_cols=26 Identities=19% Similarity=0.330 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhh
Q psy4154 113 DAMEHLQAFISDCDRRTELAKQRLLE 138 (346)
Q Consensus 113 e~~~~L~~~I~dcdRkI~~ak~RLe~ 138 (346)
+++..|+.-+..+..+|..-..+|++
T Consensus 242 ~~~~~l~~~~~~~~~~i~~l~~~l~~ 267 (406)
T PF02388_consen 242 EYLESLQEKLEKLEKEIEKLEEKLEK 267 (406)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33444444444444444444444443
No 98
>PRK00239 rpsT 30S ribosomal protein S20; Reviewed
Probab=33.26 E-value=2.6e+02 Score=22.53 Aligned_cols=35 Identities=17% Similarity=0.122 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhh
Q psy4154 300 QLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTSS 334 (346)
Q Consensus 300 ~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l 334 (346)
..-+.|...+.+++.+.++|+.++|..++..+.++
T Consensus 23 ~~kS~~kT~iKk~~~ai~~~~~~~a~~~~~~a~s~ 57 (88)
T PRK00239 23 SRKSRVRTAIKKVEAAIAAGDKEAAEEALKAAQSK 57 (88)
T ss_pred HHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 45677888888999999999999999998877654
No 99
>PF09969 DUF2203: Uncharacterized conserved protein (DUF2203); InterPro: IPR018699 This family has no known function.
Probab=33.06 E-value=3.1e+02 Score=23.31 Aligned_cols=27 Identities=22% Similarity=0.364 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 292 AEKANAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 292 ~~~~~~i~~l~ekI~~ll~eaE~LGee 318 (346)
.....++..+..+|...+.+++.+|-.
T Consensus 46 ~~~~~~~~~~~~~~~~~i~~i~~~Gv~ 72 (120)
T PF09969_consen 46 NGLEAELEELEARLRELIDEIEELGVE 72 (120)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHcCcE
Confidence 355667888899999999999999853
No 100
>TIGR00029 S20 ribosomal protein S20. This family consists of bacterial (and chloroplast) examples of the bacteria ribosomal small subunit protein S20.
Probab=32.81 E-value=2.6e+02 Score=22.48 Aligned_cols=35 Identities=11% Similarity=0.076 Sum_probs=28.8
Q ss_pred HHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhh
Q psy4154 300 QLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTSS 334 (346)
Q Consensus 300 ~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l 334 (346)
..-+.|...+.+.+.+.++|+.++|..++..+.+.
T Consensus 23 ~~kS~~kT~iKk~~~ai~~~d~~~a~~~l~~a~s~ 57 (87)
T TIGR00029 23 SQKSKMKTIIKKVYAAIAAGDKDKAQEAFKEAAKK 57 (87)
T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 45677888889999999999999999888776543
No 101
>PF05597 Phasin: Poly(hydroxyalcanoate) granule associated protein (phasin); InterPro: IPR008769 Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon.
Probab=31.99 E-value=1.7e+02 Score=25.36 Aligned_cols=21 Identities=29% Similarity=0.376 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy4154 152 NAVHQLAEDIGKKLAKAEELG 172 (346)
Q Consensus 152 ~~i~~l~~~I~~~l~~aE~LG 172 (346)
+.|+.|..+|..|..+++.|.
T Consensus 109 ~dv~~L~~rId~L~~~v~~l~ 129 (132)
T PF05597_consen 109 KDVEALSARIDQLTAQVERLA 129 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 577778888888887777774
No 102
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=31.49 E-value=11 Score=22.41 Aligned_cols=24 Identities=25% Similarity=0.713 Sum_probs=15.9
Q ss_pred ccccchhhhcccCcchhhhcccCCccc
Q psy4154 217 VCEVCSAYLGIHDNDRRLADHFGGKLH 243 (346)
Q Consensus 217 vCdVCga~L~~~d~~~R~~dH~~gK~H 243 (346)
.|+||..-. .+..-+..|+.|+.|
T Consensus 2 ~C~~C~~~f---~s~~~~~~H~~s~~H 25 (25)
T PF12874_consen 2 YCDICNKSF---SSENSLRQHLRSKKH 25 (25)
T ss_dssp EETTTTEEE---SSHHHHHHHHTTHHH
T ss_pred CCCCCCCCc---CCHHHHHHHHCcCCC
Confidence 488887543 344456788888765
No 103
>KOG1040|consensus
Probab=30.78 E-value=33 Score=34.20 Aligned_cols=32 Identities=28% Similarity=0.518 Sum_probs=23.5
Q ss_pred CCCcchhhhhhcCCChhhhhhhCccCCCCCCccccHHHHHH
Q psy4154 25 FFDSKVCKSFLLTCCPHEILSSTRMDLGECPKIHDLALRAD 65 (346)
Q Consensus 25 ~~D~~VCk~fL~g~CPhdLF~nTk~dlG~C~k~H~~~lk~~ 65 (346)
..--.|||+||=|+| +|.| .|+.+|+..+...
T Consensus 74 ~~~~~vcK~~l~glC-------~kgD--~C~Flhe~~~~k~ 105 (325)
T KOG1040|consen 74 SRGKVVCKHWLRGLC-------KKGD--QCEFLHEYDLTKM 105 (325)
T ss_pred cCCceeehhhhhhhh-------hccC--cCcchhhhhhccc
Confidence 555689999999999 3334 5888887655544
No 104
>COG1842 PspA Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]
Probab=30.12 E-value=3.2e+02 Score=25.74 Aligned_cols=93 Identities=18% Similarity=0.103 Sum_probs=0.0
Q ss_pred hhhhcccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHH
Q psy4154 232 RRLADHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAK 311 (346)
Q Consensus 232 ~R~~dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~e 311 (346)
.|+.+.+.+.+|.+-.++-+-. +.|+.+|+|++.-+..++.-++..-....... .+++.+...+.++-.+
T Consensus 5 ~r~~~~~~a~~~~~~dk~EDp~------~~l~Q~ird~~~~l~~ar~~~A~~~a~~k~~e----~~~~~~~~~~~k~e~~ 74 (225)
T COG1842 5 SRLKDLVKANINELLDKAEDPE------KMLEQAIRDMESELAKARQALAQAIARQKQLE----RKLEEAQARAEKLEEK 74 (225)
T ss_pred HHHHHHHHHHHHHHHHhhcCHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHH
Q ss_pred HHHHhhcCCHHHHHHHHHHHhhh
Q psy4154 312 AEELGAEGFVEESLKLMTEQTSS 334 (346)
Q Consensus 312 aE~LGeeG~VdeA~~l~~~~e~l 334 (346)
|+.+-..|+=+=|-..+.+...|
T Consensus 75 A~~Al~~g~E~LAr~al~~~~~l 97 (225)
T COG1842 75 AELALQAGNEDLAREALEEKQSL 97 (225)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHH
No 105
>PF06248 Zw10: Centromere/kinetochore Zw10; InterPro: IPR009361 Zeste white 10 (ZW10) was initially identified as a mitotic checkpoint protein involved in chromosome segregation, and then implicated in targeting cytoplasmic dynein and dynactin to mitotic kinetochores, but it is also important in non-dividing cells. These include cytoplasmic dynein targeting to Golgi and other membranes, and SNARE-mediated ER-Golgi trafficking [, ]. Dominant-negative ZW10, anti-ZW10 antibody, and ZW10 RNA interference (RNAi) cause Golgi dispersal. ZW10 RNAi also disperse endosomes and lysosomes []. Drosophila kinetochore components Rough deal (Rod) and Zw10 are required for the proper functioning of the metaphase checkpoint in flies []. The eukaryotic spindle assembly checkpoint (SAC) monitors microtubule attachment to kinetochores and prevents anaphase onset until all kinetochores are aligned on the metaphase plate. It is an essential surveillance mechanism that ensures high fidelity chromosome segregation during mitosis. In higher eukaryotes, cytoplasmic dynein is involved in silencing the SAC by removing the checkpoint proteins Mad2 and the Rod-Zw10-Zwilch complex (RZZ) from aligned kinetochores [, , ].; GO: 0007067 mitosis, 0000775 chromosome, centromeric region, 0005634 nucleus
Probab=30.06 E-value=7e+02 Score=26.53 Aligned_cols=23 Identities=30% Similarity=0.303 Sum_probs=15.5
Q ss_pred HHHHHHHHhhHHHHHHHHHHhhh
Q psy4154 117 HLQAFISDCDRRTELAKQRLLET 139 (346)
Q Consensus 117 ~L~~~I~dcdRkI~~ak~RLe~~ 139 (346)
.|...|..+.++|.+.+.++...
T Consensus 11 dl~~~I~~L~~~i~~~k~eV~~~ 33 (593)
T PF06248_consen 11 DLRKSISRLSRRIEELKEEVHSM 33 (593)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566777777777777777543
No 106
>PRK09039 hypothetical protein; Validated
Probab=30.04 E-value=5.7e+02 Score=25.49 Aligned_cols=11 Identities=27% Similarity=0.504 Sum_probs=4.4
Q ss_pred HHHHHHHHHHH
Q psy4154 153 AVHQLAEDIGK 163 (346)
Q Consensus 153 ~i~~l~~~I~~ 163 (346)
+++.++.+|..
T Consensus 54 eL~~L~~qIa~ 64 (343)
T PRK09039 54 ALDRLNSQIAE 64 (343)
T ss_pred HHHHHHHHHHH
Confidence 33344444433
No 107
>KOG4182|consensus
Probab=30.01 E-value=7.2e+02 Score=26.68 Aligned_cols=47 Identities=19% Similarity=0.322 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHH
Q psy4154 149 EKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEA 197 (346)
Q Consensus 149 ~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL 197 (346)
........|.+++..++.+.- ..+|..-++++.++.++..+..++..
T Consensus 78 ~Lq~Da~~Lq~kma~il~el~--~aegesadCiAaLaRldn~kQkleaA 124 (828)
T KOG4182|consen 78 ALQADAHRLQEKMAAILLELA--AAEGESADCIAALARLDNKKQKLEAA 124 (828)
T ss_pred HHHHHHHHHHHHHHHHHHHHH--HHhCChHHHHHHHHHhccHHHHHHHH
Confidence 344566677788777776543 45788889999998888777665443
No 108
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=29.98 E-value=4.8e+02 Score=24.67 Aligned_cols=50 Identities=16% Similarity=0.195 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh--cCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 151 ANAVHQLAEDIGKKLAKAEELGA--EGFVEESLKLMTEIEDLRKKKAEAEDT 200 (346)
Q Consensus 151 ~~~i~~l~~~I~~~l~~aE~LGe--eG~VdeA~~~~~~~e~lka~~deL~~~ 200 (346)
.++|..++..|+.+-..+..+-. +-..+.+..++.++..|+.+.+++...
T Consensus 52 ~eeLrqI~~DIn~lE~iIkqa~~er~~~~~~i~r~~eey~~Lk~~in~~R~e 103 (230)
T PF10146_consen 52 VEELRQINQDINTLENIIKQAESERNKRQEKIQRLYEEYKPLKDEINELRKE 103 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555555444433322 123566777777888888777777665
No 109
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=29.96 E-value=6.5e+02 Score=26.15 Aligned_cols=20 Identities=5% Similarity=-0.001 Sum_probs=11.2
Q ss_pred HHHHHhhhHHHHHHHHHhhh
Q psy4154 262 LQAFISDCDRRTELAKQRLL 281 (346)
Q Consensus 262 L~~~i~d~DRrI~~ak~RL~ 281 (346)
=..-+..+...|+.-+..|.
T Consensus 169 ~~~~l~~~~~~iaaeq~~l~ 188 (420)
T COG4942 169 TLKQLAAVRAEIAAEQAELT 188 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34445555556666666555
No 110
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=29.92 E-value=3.2e+02 Score=22.52 Aligned_cols=27 Identities=22% Similarity=0.370 Sum_probs=18.3
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHhhhhh
Q psy4154 115 MEHLQAFISDCDRRTELAKQRLLETQE 141 (346)
Q Consensus 115 ~~~L~~~I~dcdRkI~~ak~RLe~~~~ 141 (346)
+.-|..-+...++|+..-..+++..|.
T Consensus 37 ~~~l~~~~~~~~~Rl~~lE~~l~~LPt 63 (106)
T PF10805_consen 37 IEKLEERLDEHDRRLQALETKLEHLPT 63 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 344556666778888887777776554
No 111
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=29.53 E-value=7.6e+02 Score=26.78 Aligned_cols=40 Identities=20% Similarity=0.386 Sum_probs=19.5
Q ss_pred HHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHH
Q psy4154 257 QAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKK 308 (346)
Q Consensus 257 ~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~l 308 (346)
+++.+|..+-..|..=|.... .+.....+|.+|+++|...
T Consensus 551 kaYK~La~lh~~c~~Li~~v~------------~tG~~~rEirdLe~qI~~e 590 (594)
T PF05667_consen 551 KAYKLLASLHENCSQLIETVE------------ETGTISREIRDLEEQIDTE 590 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHH------------HhhHHHHHHHHHHHHHHHH
Confidence 345555555555554443321 1234445566666666543
No 112
>PF06694 Plant_NMP1: Plant nuclear matrix protein 1 (NMP1); InterPro: IPR010604 This family consists of several plant specific nuclear matrix protein 1 (NMP1) sequences. Nuclear Matrix Protein 1 is a ubiquitously expressed 36 kDa protein, which has no homologues in animals and fungi, but is highly conserved among flowering and non-flowering plants. NMP1 is located both in the cytoplasm and nucleus and that the nuclear fraction is associated with the nuclear matrix. NMP1 is a candidate for a plant-specific structural protein with a function both in the nucleus and cytoplasm [].
Probab=29.51 E-value=5.8e+02 Score=25.46 Aligned_cols=139 Identities=12% Similarity=0.065 Sum_probs=80.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCC---CCCchhhh-cccccccc
Q psy4154 146 EVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDTYRNSM---PASSYQQQ-KLRVCEVC 221 (346)
Q Consensus 146 ~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~~rn~~---~~~~~~~~-k~~vCdVC 221 (346)
.+++....++.++.++..+.+.++.|...+.+.-=+.-...+..|++.+..+....+.-. .......- .|.+++.|
T Consensus 169 D~seLe~~~s~~sk~Lq~lqq~v~~Lask~~y~pd~~~~e~~~~Lr~~L~tflq~~~~F~~~Y~~EIrpWch~~~~P~L~ 248 (325)
T PF06694_consen 169 DVSELEKKASELSKQLQSLQQQVAELASKHPYNPDEEYVEKESQLRLELETFLQTAAGFNHCYEKEIRPWCHMMEVPQLH 248 (325)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhhccCccchhh
Confidence 345677889999999999999999999988776655555677778888877766543211 00011122 24666666
Q ss_pred hhhhcccCcchhhhcccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhh-----hh---hHhHHH
Q psy4154 222 SAYLGIHDNDRRLADHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQE-----VL---SAEVAE 293 (346)
Q Consensus 222 ga~L~~~d~~~R~~dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~e-----e~---~~~~~~ 293 (346)
|.-.+. + ++| ..+..|-++..+++ +|+.+..-+..... +. .+.+.+
T Consensus 249 gLGPAa----~--------Rll-------------e~y~~l~klL~nL~-~lr~Sh~ai~~gs~~t~s~e~~s~~~ivs~ 302 (325)
T PF06694_consen 249 GLGPAA----N--------RLL-------------ELYKMLLKLLGNLA-TLRDSHAAIAAGSSPTTSDEPSSVTQIVSE 302 (325)
T ss_pred cccHHH----H--------HHH-------------HHHHHHHHHHHHHH-HHHHHHHHHhcCCCccccCCCccHHHHHHH
Confidence 632221 1 111 23445555666553 57777776664211 11 122334
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy4154 294 KANAVHQLAEDIGKKLA 310 (346)
Q Consensus 294 ~~~~i~~l~ekI~~ll~ 310 (346)
-.+.+..|+..|+-|-+
T Consensus 303 ~e~al~~lnr~l~il~~ 319 (325)
T PF06694_consen 303 CETALTQLNRSLAILSA 319 (325)
T ss_pred HHHHHHHHHhhhHHHHh
Confidence 45566667766665543
No 113
>COG2178 Predicted RNA-binding protein of the translin family [Translation, ribosomal structure and biogenesis]
Probab=29.20 E-value=4.8e+02 Score=24.41 Aligned_cols=130 Identities=20% Similarity=0.183 Sum_probs=82.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCchhhhcccccccchhhhcccCc
Q psy4154 151 ANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDN 230 (346)
Q Consensus 151 ~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~ 230 (346)
.+++-.++-.|-..-..+=.+--.|++++|...+.++..+-.++..+-+..... |-+++++
T Consensus 19 REE~l~lsRei~r~s~~aI~~~H~~~~eeA~~~l~~a~~~v~~Lk~~l~~~pel-------------------~~ag~~~ 79 (204)
T COG2178 19 REEALKLSREIVRLSGEAIFLLHRGDFEEAEKKLKKASEAVEKLKRLLAGFPEL-------------------YFAGFVT 79 (204)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHhhhHHH-------------------HHHHhhc
Confidence 356666777777777788888889999999998888877666655332211000 0011221
Q ss_pred chhhhcccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHH
Q psy4154 231 DRRLADHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLA 310 (346)
Q Consensus 231 ~~R~~dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~ 310 (346)
+ .++|+-....|-.++.+- ++ ..+++...++ ..=|..+..-||+|..
T Consensus 80 ~--------------------a~QEyvEA~~l~~~l~~~---------~~-ps~~EL~V~~---~~YilGl~D~vGELrR 126 (204)
T COG2178 80 T--------------------ALQEYVEATLLYSILKDG---------RL-PSPEELGVPP---IAYILGLADAVGELRR 126 (204)
T ss_pred c--------------------hHHHHHHHHHHHHHHhcC---------CC-CCHHHcCCCH---HHHHHHHHHHHHHHHH
Confidence 1 133444444555566552 22 3445444322 2357788899999999
Q ss_pred HHHHHhhcCCHHHHHHHHHHHh
Q psy4154 311 KAEELGAEGFVEESLKLMTEQT 332 (346)
Q Consensus 311 eaE~LGeeG~VdeA~~l~~~~e 332 (346)
-+=.++.+|+|++|..+..=.+
T Consensus 127 ~~le~l~~~~~~~Ae~~~~~ME 148 (204)
T COG2178 127 HVLELLRKGSFEEAERFLKFME 148 (204)
T ss_pred HHHHHHHhccHHHHHHHHHHHH
Confidence 9999999999999986544443
No 114
>PF05278 PEARLI-4: Arabidopsis phospholipase-like protein (PEARLI 4); InterPro: IPR007942 This family contains several phospholipase-like proteins from Arabidopsis thaliana and other members of the Streptophyta which are homologous to PEARLI 4.
Probab=28.86 E-value=5.2e+02 Score=25.24 Aligned_cols=67 Identities=28% Similarity=0.412 Sum_probs=38.7
Q ss_pred CCccccchHHHHHHHHHHHH-------HHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHH
Q psy4154 239 GGKLHLGFIKIREKLAELQA-------MEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAK 311 (346)
Q Consensus 239 ~gK~H~g~~~lR~~~~El~~-------~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~e 311 (346)
.-++-+|| ||..+.|+-. .+....-...++|+|...+..|+...++.. ...+++.++..+|.++-.+
T Consensus 163 sa~vkV~W--LR~~L~Ei~Ea~e~~~~~~~~e~eke~~~r~l~~~~~ELe~~~EeL~----~~Eke~~e~~~~i~e~~~r 236 (269)
T PF05278_consen 163 SAKVKVDW--LRSKLEEILEAKEIYDQHETREEEKEEKDRKLELKKEELEELEEELK----QKEKEVKEIKERITEMKGR 236 (269)
T ss_pred HcCcchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHH
Confidence 35778888 5555553322 233444556688888888887776544433 3344555566666544433
No 115
>PRK11637 AmiB activator; Provisional
Probab=28.86 E-value=6.3e+02 Score=25.63 Aligned_cols=198 Identities=13% Similarity=0.116 Sum_probs=0.0
Q ss_pred HHHHHHHHHHhhHHHH---HHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHH
Q psy4154 115 MEHLQAFISDCDRRTE---LAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLR 191 (346)
Q Consensus 115 ~~~L~~~I~dcdRkI~---~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lk 191 (346)
+.-+++-|.+....|. .....++.....+...+.....+|..++.+|+.+-.+++.+-.+ +.++++.+......-
T Consensus 49 l~~l~~qi~~~~~~i~~~~~~~~~~~~~l~~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~~e--I~~~q~~l~~~~~~l 126 (428)
T PRK11637 49 LKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELNAS--IAKLEQQQAAQERLL 126 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHhhccCCCCCchhhhcccccccchhhhcccCcchhhhc---ccCCccccchHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 192 KKKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRLAD---HFGGKLHLGFIKIREKLAELQAMEHLQAFISD 268 (346)
Q Consensus 192 a~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~~d---H~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d 268 (346)
..+-..+-...+..... +|...++..++.. .+.--.......|...-.....+.....-+..
T Consensus 127 ~~rlra~Y~~g~~~~l~---------------vLl~a~~~~~~~r~~~~l~~i~~~d~~~l~~l~~~~~~L~~~k~~le~ 191 (428)
T PRK11637 127 AAQLDAAFRQGEHTGLQ---------------LILSGEESQRGERILAYFGYLNQARQETIAELKQTREELAAQKAELEE 191 (428)
T ss_pred HHHHHHHHHcCCCcHHH---------------HHhcCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred hHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhh
Q psy4154 269 CDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEQTSS 334 (346)
Q Consensus 269 ~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l 334 (346)
--...+..+..++....+......+....+..|...+....+++.+|.. +...+...+..+
T Consensus 192 ~~~~l~~~~~e~~~~k~~L~~~k~e~~~~l~~L~~~~~~~~~~l~~l~~-----~~~~L~~~I~~l 252 (428)
T PRK11637 192 KQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSLQKDQQQLSELRA-----NESRLRDSIARA 252 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHH
No 116
>PRK00736 hypothetical protein; Provisional
Probab=28.33 E-value=2.7e+02 Score=21.20 Aligned_cols=31 Identities=23% Similarity=0.273 Sum_probs=20.0
Q ss_pred hhHhHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 287 LSAEVAEKANAVHQLAEDIGKKLAKAEELGA 317 (346)
Q Consensus 287 ~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGe 317 (346)
.+.......+.|+.|..++..|..+...+..
T Consensus 24 Ln~~v~~Qq~~i~~L~~ql~~L~~rl~~~~~ 54 (68)
T PRK00736 24 LSDQLAEQWKTVEQMRKKLDALTERFLSLEE 54 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 3344455666777777777777777766543
No 117
>KOG0977|consensus
Probab=28.13 E-value=5.7e+02 Score=27.51 Aligned_cols=37 Identities=19% Similarity=0.312 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHH
Q psy4154 153 AVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKK 194 (346)
Q Consensus 153 ~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~ 194 (346)
+|..+.+++..++.+.+.| +|-.+.|=.++...+.++
T Consensus 350 ~i~~mReec~~l~~Elq~L-----lD~ki~Ld~EI~~YRkLL 386 (546)
T KOG0977|consen 350 EIAKMREECQQLSVELQKL-----LDTKISLDAEIAAYRKLL 386 (546)
T ss_pred HHHHHHHHHHHHHHHHHHh-----hchHhHHHhHHHHHHHHh
Confidence 3334444444444444444 333444444444444443
No 118
>PRK02119 hypothetical protein; Provisional
Probab=28.00 E-value=2.9e+02 Score=21.39 Aligned_cols=49 Identities=10% Similarity=0.147 Sum_probs=27.5
Q ss_pred hHHHHHHHHHhhhhh---hhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 269 CDRRTELAKQRLLET---QEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 317 (346)
Q Consensus 269 ~DRrI~~ak~RL~~t---~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGe 317 (346)
++.||..=..||+.. -++.+....+....|+.|..++..|..+...++.
T Consensus 7 ~e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~~id~L~~ql~~L~~rl~~~~~ 58 (73)
T PRK02119 7 LENRIAELEMKIAFQENLLEELNQALIEQQFVIDKMQVQLRYMANKLKDMQP 58 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 344555555555432 2333333455666677777777777777666553
No 119
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=27.52 E-value=2.4e+02 Score=28.63 Aligned_cols=48 Identities=21% Similarity=0.326 Sum_probs=27.7
Q ss_pred HHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 258 AMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAE 313 (346)
Q Consensus 258 ~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE 313 (346)
++..|+.-+..+..+|+.-..+|++.| ....++.+++.++..+.++++
T Consensus 243 ~~~~l~~~~~~~~~~i~~l~~~l~~~~--------k~~~k~~~~~~q~~~~~k~~~ 290 (406)
T PF02388_consen 243 YLESLQEKLEKLEKEIEKLEEKLEKNP--------KKKNKLKELEEQLASLEKRIE 290 (406)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH-T--------HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCc--------chhhHHHHHHHHHHHHHHHHH
Confidence 445666667777778888888887665 222344445555544444443
No 120
>PRK00846 hypothetical protein; Provisional
Probab=27.35 E-value=3.1e+02 Score=21.65 Aligned_cols=24 Identities=21% Similarity=0.073 Sum_probs=10.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 147 VAEKANAVHQLAEDIGKKLAKAEE 170 (346)
Q Consensus 147 ~~~~~~~i~~l~~~I~~~l~~aE~ 170 (346)
+......|+.|..+|..+..+...
T Consensus 36 v~~qq~~I~~L~~ql~~L~~rL~~ 59 (77)
T PRK00846 36 LADARLTGARNAELIRHLLEDLGK 59 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444444444443333
No 121
>PRK00295 hypothetical protein; Provisional
Probab=27.19 E-value=2.8e+02 Score=21.08 Aligned_cols=31 Identities=16% Similarity=0.367 Sum_probs=16.5
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 141 EVLSAEVAEKANAVHQLAEDIGKKLAKAEEL 171 (346)
Q Consensus 141 ~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~L 171 (346)
++++..+.+....|+.|..++..+..+...+
T Consensus 22 e~Ln~~v~~Qq~~I~~L~~ql~~L~~rl~~~ 52 (68)
T PRK00295 22 QALNDVLVEQQRVIERLQLQMAALIKRQEEM 52 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3444444455556666666665555555444
No 122
>KOG0161|consensus
Probab=27.16 E-value=1.3e+03 Score=28.82 Aligned_cols=19 Identities=32% Similarity=0.463 Sum_probs=8.1
Q ss_pred HHHHHHHHHHHHHHHHhhc
Q psy4154 300 QLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 300 ~l~ekI~~ll~eaE~LGee 318 (346)
.++..+..+..+++.+...
T Consensus 1214 ~lq~e~~~l~~ev~~~~~~ 1232 (1930)
T KOG0161|consen 1214 DLQREIADLAAELEQLSSE 1232 (1930)
T ss_pred HHHHHHHHHHHHHHHHhhh
Confidence 3444444444444444433
No 123
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=27.10 E-value=7.8e+02 Score=26.16 Aligned_cols=88 Identities=15% Similarity=0.223 Sum_probs=53.3
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhhhhh---hhhHHHHHHH------------------HHHHHHHH---HHHHHHHHH
Q psy4154 113 DAMEHLQAFISDCDRRTELAKQRLLETQE---VLSAEVAEKA------------------NAVHQLAE---DIGKKLAKA 168 (346)
Q Consensus 113 e~~~~L~~~I~dcdRkI~~ak~RLe~~~~---~~~~~~~~~~------------------~~i~~l~~---~I~~~l~~a 168 (346)
..+..++..|..++..|..-..-|..-.. .....+.+.. ..+..|+. .|.....+.
T Consensus 105 ~~~~~~~~~l~~~e~~~~~i~~~l~~l~~~e~~nr~~v~~l~~~y~~~rk~ll~~~~~~G~a~~~le~~l~~~e~~f~~f 184 (569)
T PRK04778 105 HEINEIESLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLENLEEEFSQF 184 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchHHHHHHHHHHHHHHHHHH
Confidence 44677778888888887777766654211 1111111110 12223343 344556678
Q ss_pred HHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 169 EELGAEGFVEESLKLMTEIEDLRKKKAEAEDT 200 (346)
Q Consensus 169 E~LGeeG~VdeA~~~~~~~e~lka~~deL~~~ 200 (346)
+.|-+.|+..+|...+..++.--..+...|..
T Consensus 185 ~~l~~~Gd~~~A~e~l~~l~~~~~~l~~~~~~ 216 (569)
T PRK04778 185 VELTESGDYVEAREILDQLEEELAALEQIMEE 216 (569)
T ss_pred HHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 88999999999998888777665555555543
No 124
>PRK00420 hypothetical protein; Validated
Probab=27.04 E-value=3.8e+02 Score=22.63 Aligned_cols=13 Identities=31% Similarity=0.434 Sum_probs=8.9
Q ss_pred cccccchhhhccc
Q psy4154 216 RVCEVCSAYLGIH 228 (346)
Q Consensus 216 ~vCdVCga~L~~~ 228 (346)
..|++||+-|..+
T Consensus 24 ~~CP~Cg~pLf~l 36 (112)
T PRK00420 24 KHCPVCGLPLFEL 36 (112)
T ss_pred CCCCCCCCcceec
Confidence 5677777776653
No 125
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=26.96 E-value=8.6e+02 Score=26.61 Aligned_cols=22 Identities=23% Similarity=0.220 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhc
Q psy4154 153 AVHQLAEDIGKKLAKAEELGAE 174 (346)
Q Consensus 153 ~i~~l~~~I~~~l~~aE~LGee 174 (346)
.+..+...|..+-..+|.|.++
T Consensus 416 ei~~~~~~i~~~~~~ve~l~~e 437 (652)
T COG2433 416 EITVYEKRIKKLEETVERLEEE 437 (652)
T ss_pred chhHHHHHHHHHHHHHHHHHHH
Confidence 3445555666666666666554
No 126
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=26.73 E-value=2.7e+02 Score=23.26 Aligned_cols=46 Identities=35% Similarity=0.524 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHHhh--cCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 154 VHQLAEDIGKKLAKAEELGA--EGFVEESLKLMTEIEDLRKKKAEAED 199 (346)
Q Consensus 154 i~~l~~~I~~~l~~aE~LGe--eG~VdeA~~~~~~~e~lka~~deL~~ 199 (346)
|..++++|+.++.+++.|-. ..-+++-..|-.+-+.||..+.++..
T Consensus 10 l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~ 57 (107)
T PF06156_consen 10 LDQLEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEELEQ 57 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 33444444444444444322 12244556666777788887777655
No 127
>COG5152 Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only]
Probab=26.06 E-value=25 Score=32.82 Aligned_cols=25 Identities=28% Similarity=0.612 Sum_probs=19.3
Q ss_pred CCcchhhhh-hcCCChhhhhhhCccCCCCCCcccc
Q psy4154 26 FDSKVCKSF-LLTCCPHEILSSTRMDLGECPKIHD 59 (346)
Q Consensus 26 ~D~~VCk~f-L~g~CPhdLF~nTk~dlG~C~k~H~ 59 (346)
+.|+|||-| +.|.|-|. ..|..+|+
T Consensus 139 ~qpdVCKdyk~TGYCGYG---------DsCKflH~ 164 (259)
T COG5152 139 TQPDVCKDYKETGYCGYG---------DSCKFLHD 164 (259)
T ss_pred cCcccccchhhcccccCC---------chhhhhhh
Confidence 589999955 78999552 36888886
No 128
>PRK14011 prefoldin subunit alpha; Provisional
Probab=26.02 E-value=4.5e+02 Score=23.04 Aligned_cols=15 Identities=13% Similarity=0.073 Sum_probs=9.9
Q ss_pred cccccccchhhhccc
Q psy4154 214 KLRVCEVCSAYLGIH 228 (346)
Q Consensus 214 k~~vCdVCga~L~~~ 228 (346)
..-||=-+|.|+-+.
T Consensus 51 eiLVPLg~s~yV~g~ 65 (144)
T PRK14011 51 EILIPLGPGAFLKAK 65 (144)
T ss_pred eEEEEcCCCcEEeEE
Confidence 456777777777644
No 129
>KOG3032|consensus
Probab=25.94 E-value=3.7e+02 Score=25.82 Aligned_cols=33 Identities=18% Similarity=0.310 Sum_probs=26.3
Q ss_pred hcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCC
Q psy4154 173 AEGFVEESLKLMTEIEDLRKKKAEAEDTYRNSM 205 (346)
Q Consensus 173 eeG~VdeA~~~~~~~e~lka~~deL~~~~rn~~ 205 (346)
+.|.+|+-+.....++.|+.++++|....|...
T Consensus 199 eieeidEQi~~~kkvekl~~qK~ellnkkRe~~ 231 (264)
T KOG3032|consen 199 EIEEIDEQISYKKKVEKLKRQKMELLNKKREKT 231 (264)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 456678888888999999999999988776543
No 130
>PF05597 Phasin: Poly(hydroxyalcanoate) granule associated protein (phasin); InterPro: IPR008769 Polyhydroxyalkanoates (PHAs) are storage polyesters synthesised by various bacteria as intracellular carbon and energy reserve material. PHAs are accumulated as water-insoluble inclusions within the cells. This family consists of the phasins PhaF and PhaI which act as a transcriptional regulator of PHA biosynthesis genes. PhaF has been proposed to repress expression of the phaC1 gene and the phaIF operon.
Probab=25.67 E-value=2.8e+02 Score=24.02 Aligned_cols=40 Identities=30% Similarity=0.290 Sum_probs=29.3
Q ss_pred hhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 268 DCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 317 (346)
Q Consensus 268 d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGe 317 (346)
-.|.|+..+-.||-. | + .++|+.|..+|..|..+++.|-.
T Consensus 91 ~fd~rV~~aL~rLgv-P---s------~~dv~~L~~rId~L~~~v~~l~~ 130 (132)
T PF05597_consen 91 AFDERVARALNRLGV-P---S------RKDVEALSARIDQLTAQVERLAN 130 (132)
T ss_pred HHHHHHHHHHHhcCC-C---C------HHHHHHHHHHHHHHHHHHHHHhc
Confidence 345599999999852 1 1 25788888899999888888753
No 131
>PF05289 BLYB: Borrelia hemolysin accessory protein; InterPro: IPR007953 This entry represents the borrelial prophage-encoded protein BlyB. Originally BlyB and its partner, the membrane-bound protein BlyA, were thought to comprise a haemolysis system. It is now thought, however, that BlyA and BlyB function instead as a holin or holin-like system [].
Probab=25.48 E-value=2.1e+02 Score=23.92 Aligned_cols=59 Identities=19% Similarity=0.261 Sum_probs=36.0
Q ss_pred cccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 236 DHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAE 313 (346)
Q Consensus 236 dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE 313 (346)
|-|+-..|-||..+=+.|..+...- . ..-+||+.+ ..+...+.++-++++|+.+..-+.
T Consensus 27 ~ef~e~ahKGf~Lv~~LYs~y~~IY------k-------~nmerlE~~------~t~tI~~~L~pin~KIn~~Id~VN 85 (105)
T PF05289_consen 27 DEFDEIAHKGFFLVYDLYSHYTLIY------K-------SNMERLENA------LTPTILNTLEPINEKINKLIDLVN 85 (105)
T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHH------H-------HHHHHHHhh------ccHHHHHHHHHHHHHHHHHHHHHc
Confidence 4467788989765555555443211 1 334566532 122456788889999998887664
No 132
>COG4477 EzrA Negative regulator of septation ring formation [Cell division and chromosome partitioning]
Probab=24.71 E-value=9e+02 Score=26.05 Aligned_cols=42 Identities=21% Similarity=0.226 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 159 EDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDT 200 (346)
Q Consensus 159 ~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~ 200 (346)
+.|...+.+-+.|.+.|+.=+|-..+..++..-..+...|..
T Consensus 174 e~i~~~l~qf~~lt~~Gd~ieA~evl~~~ee~~~~L~~~~e~ 215 (570)
T COG4477 174 ENIEEELSQFVELTSSGDYIEAREVLEEAEEHMIALRSIMER 215 (570)
T ss_pred HHHHHHHHHHHHhccCCChhHHHHHHHHHHHHHHHHHHHHHH
Confidence 345556777788999999999999988888766666555553
No 133
>PRK09039 hypothetical protein; Validated
Probab=24.51 E-value=7.1e+02 Score=24.79 Aligned_cols=22 Identities=18% Similarity=0.113 Sum_probs=10.8
Q ss_pred HHHHHHhhHHHHHHHHHHhhhh
Q psy4154 119 QAFISDCDRRTELAKQRLLETQ 140 (346)
Q Consensus 119 ~~~I~dcdRkI~~ak~RLe~~~ 140 (346)
..-|...+..+.+-..++...+
T Consensus 45 s~~i~~~~~eL~~L~~qIa~L~ 66 (343)
T PRK09039 45 SREISGKDSALDRLNSQIAELA 66 (343)
T ss_pred HHHHhhHHHHHHHHHHHHHHHH
Confidence 3444445555555555554433
No 134
>PLN00204 CP12 gene family protein; Provisional
Probab=24.46 E-value=2.2e+02 Score=24.60 Aligned_cols=40 Identities=25% Similarity=0.361 Sum_probs=32.6
Q ss_pred HHHHHHHHHHHHHHHHhhcC-CHHHHHHHHHHHHHHHHHHH
Q psy4154 156 QLAEDIGKKLAKAEELGAEG-FVEESLKLMTEIEDLRKKKA 195 (346)
Q Consensus 156 ~l~~~I~~~l~~aE~LGeeG-~VdeA~~~~~~~e~lka~~d 195 (346)
.|+++|.+.+++|++.-.+| .-.++..+..+||+|.++..
T Consensus 56 ~L~e~Ie~aI~eArevCa~g~~S~eCaaAWDeVEELqAeas 96 (126)
T PLN00204 56 GISEKVEKSIKEAEETCADDPASGECVAAWDEVEELSAAAS 96 (126)
T ss_pred cHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHH
Confidence 47788899999999988877 44577888999999988875
No 135
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=24.25 E-value=3.1e+02 Score=20.46 Aligned_cols=55 Identities=18% Similarity=0.263 Sum_probs=33.0
Q ss_pred HHHHHHHHhhHHHHHHHHHHhhh--hhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 117 HLQAFISDCDRRTELAKQRLLET--QEVLSA-EVAEKANAVHQLAEDIGKKLAKAEEL 171 (346)
Q Consensus 117 ~L~~~I~dcdRkI~~ak~RLe~~--~~~~~~-~~~~~~~~i~~l~~~I~~~l~~aE~L 171 (346)
-|++=+..+...|.+...+|... -...|+ -+.....++.++..+|..+...++.|
T Consensus 8 rL~Kel~kl~~~i~~~~~kL~n~~F~~kAP~eVve~er~kl~~~~~~~~~l~~~l~~L 65 (66)
T PF10458_consen 8 RLEKELEKLEKEIERLEKKLSNENFVEKAPEEVVEKEREKLEELEEELEKLEEALEQL 65 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHCSTTHHHHS-CCHHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHHcCccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 45556677777788888888531 111111 12344567777888888777777665
No 136
>PRK10564 maltose regulon periplasmic protein; Provisional
Probab=23.68 E-value=92 Score=30.84 Aligned_cols=26 Identities=31% Similarity=0.378 Sum_probs=22.2
Q ss_pred HHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy4154 165 LAKAEELGAEGFVEESLKLMTEIEDL 190 (346)
Q Consensus 165 l~~aE~LGeeG~VdeA~~~~~~~e~l 190 (346)
...|++.-+.|+|+.|++|+.|+++|
T Consensus 261 ~~aI~~AVk~gDi~KAL~LldEAe~L 286 (303)
T PRK10564 261 NQAIKQAVKKGDVDKALKLLDEAERL 286 (303)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 35667777899999999999999977
No 137
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=23.54 E-value=3.2e+02 Score=24.17 Aligned_cols=51 Identities=14% Similarity=0.186 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 151 ANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIEDLRKKKAEAEDTY 201 (346)
Q Consensus 151 ~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e~lka~~deL~~~~ 201 (346)
.++..++..+|.++.++....-.+++.-...++-.+++++.++++++.+..
T Consensus 39 ~~~~~~l~~Ei~~l~~E~~~iS~qDeFAkwaKl~Rk~~kl~~el~~~~~~~ 89 (161)
T PF04420_consen 39 SKEQRQLRKEILQLKRELNAISAQDEFAKWAKLNRKLDKLEEELEKLNKSL 89 (161)
T ss_dssp HHHHHHHHHHHHHHHHHHTTS-TTTSHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cHHHHHHHHHHHHHHHHHHcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445667788888888888888888888888888888888888888776543
No 138
>KOG4360|consensus
Probab=23.43 E-value=9.4e+02 Score=25.83 Aligned_cols=63 Identities=16% Similarity=0.157 Sum_probs=43.8
Q ss_pred hHHHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 111 DIDAMEHLQAFISDCDRRTELAKQRLLETQEVLS---AEVAEKANAVHQLAEDIGKKLAKAEELGA 173 (346)
Q Consensus 111 E~e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~---~~~~~~~~~i~~l~~~I~~~l~~aE~LGe 173 (346)
|..++..+.+-++|.+-+|...++-|+.-..+.+ .+++....+|..++++|..+.-+.|+|++
T Consensus 196 eq~~y~~~~KelrdtN~q~~s~~eel~~kt~el~~q~Ee~skLlsql~d~qkk~k~~~~Ekeel~~ 261 (596)
T KOG4360|consen 196 EQQLYGDCVKELRDTNTQARSGQEELQSKTKELSRQQEENSKLLSQLVDLQKKIKYLRHEKEELDE 261 (596)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 4455666777778888888888777765433332 22344557788888888888888888876
No 139
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=23.43 E-value=3.6e+02 Score=21.01 Aligned_cols=24 Identities=4% Similarity=0.277 Sum_probs=12.3
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHhh
Q psy4154 115 MEHLQAFISDCDRRTELAKQRLLE 138 (346)
Q Consensus 115 ~~~L~~~I~dcdRkI~~ak~RLe~ 138 (346)
+.-+.+.++++.+.++.-++.+..
T Consensus 21 l~~l~~~l~~~~~ti~~l~~~~~~ 44 (90)
T PF06103_consen 21 LKKLKKTLDEVNKTIDTLQEQVDP 44 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHH
Confidence 344455555555555555555443
No 140
>PF10538 ITAM_Cys-rich: Immunoreceptor tyrosine-based activation motif; InterPro: IPR012316 Signal transduction by T and B cell antigen receptors and certain receptors for Ig Fc regions involves a conserved sequence motif, termed an immunoreceptor tyrosine-based activation motif (ITAM). It is also found in the cytoplasmic domain of the apoptosis receptor. Phosphorylation of the two ITAM tyrosines is a critical event in signal transduction. All (p)2ITAMs, but not their nonphosphorylated counterparts, induced extensive protein tyrosine phosphorylation in permeabilised cells. After binding of the ligand via an SH2 domain, phosphorylation of the two conserved tyrosines of ITAM creates binding sites for downstream signalling molecules and thus enables the initiation of signalling events. This phosphorylation was found to reflect activation of the src family kinases Lyn and Syk. Different ITAMs may preferentially activate distinct signalling pathways as a consequence of distinct SH2 effector binding preference [, ]. Furthermore, in viruses, ITAMs may play key roles in viral pathogenesis by regulating viral clearance, immune cell activation, immune cell recruitment through binding of cellular kinases and thereby down regulate their function []. This motif can be found in one to three copies and in association with the Ig-like domain. Proteins currently known to contain an ITAM motif are: Mammalian alpha and beta immunoglobulin proteins, TCR gamma receptors, FCR gamma receptors subunits, CD3 chains receptors and NFAT activation molecule. Hantavirus cytoplasmic elements. ; GO: 0007165 signal transduction
Probab=23.32 E-value=40 Score=20.80 Aligned_cols=8 Identities=63% Similarity=1.568 Sum_probs=6.4
Q ss_pred CCceeEEe
Q psy4154 1 KNKCYIFA 8 (346)
Q Consensus 1 ~~~~~~~~ 8 (346)
|+.||||.
T Consensus 14 ksrCyi~~ 21 (24)
T PF10538_consen 14 KSRCYIFT 21 (24)
T ss_pred eeeEEEEE
Confidence 57899984
No 141
>PRK04863 mukB cell division protein MukB; Provisional
Probab=23.12 E-value=1.4e+03 Score=27.73 Aligned_cols=25 Identities=4% Similarity=0.049 Sum_probs=17.8
Q ss_pred HHHHHHhhhHHHHHHHHHhhhhhhh
Q psy4154 261 HLQAFISDCDRRTELAKQRLLETQE 285 (346)
Q Consensus 261 ~L~~~i~d~DRrI~~ak~RL~~t~e 285 (346)
....+...+..||.+++.++.....
T Consensus 1237 s~eEVa~~l~~rI~~a~~~V~~mN~ 1261 (1486)
T PRK04863 1237 SSESVANIIRKTIQREQNRIRMLNQ 1261 (1486)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3566777777788888888876533
No 142
>PF10979 DUF2786: Protein of unknown function (DUF2786); InterPro: IPR024498 This domain is found in proteins that have no known function.
Probab=23.02 E-value=1.2e+02 Score=21.09 Aligned_cols=38 Identities=21% Similarity=0.122 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhhcCCCCC
Q psy4154 301 LAEDIGKKLAKAEELGAEGFVEESLKLMTEQTSSCSPGHV 340 (346)
Q Consensus 301 l~ekI~~ll~eaE~LGeeG~VdeA~~l~~~~e~l~~~~~~ 340 (346)
+-.+|.+|+..++.- .+.-.||...+.++..|...-++
T Consensus 3 il~kI~kLLalA~~~--~~~~~EA~~A~~kAq~Lm~ky~i 40 (43)
T PF10979_consen 3 ILEKIRKLLALAEST--GSNEHEAEAALAKAQRLMAKYGI 40 (43)
T ss_pred HHHHHHHHHHHhhCC--CCCHHHHHHHHHHHHHHHHHhCC
Confidence 456888888887732 26677999999999988876554
No 143
>PRK04325 hypothetical protein; Provisional
Probab=22.93 E-value=3.6e+02 Score=20.84 Aligned_cols=29 Identities=14% Similarity=0.206 Sum_probs=17.7
Q ss_pred hHhHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 288 SAEVAEKANAVHQLAEDIGKKLAKAEELG 316 (346)
Q Consensus 288 ~~~~~~~~~~i~~l~ekI~~ll~eaE~LG 316 (346)
+.........|+.|..++..|..+...++
T Consensus 29 N~vv~~Qq~~I~~L~~ql~~L~~rl~~~~ 57 (74)
T PRK04325 29 NATVARQQQTLDLLQAQLRLLYQQMRDAN 57 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 33344555666667777777766666654
No 144
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=22.91 E-value=3.3e+02 Score=20.30 Aligned_cols=58 Identities=17% Similarity=0.286 Sum_probs=37.9
Q ss_pred HHHHHHHHhhhHHHHHHHHHhhhhh---hhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 259 MEHLQAFISDCDRRTELAKQRLLET---QEVLSAEVAEKANAVHQLAEDIGKKLAKAEELG 316 (346)
Q Consensus 259 ~r~L~~~i~d~DRrI~~ak~RL~~t---~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LG 316 (346)
...|++=+..+...|.+...+|... ..-++..+.....++..+..+|..+...++.|+
T Consensus 6 ~~rL~Kel~kl~~~i~~~~~kL~n~~F~~kAP~eVve~er~kl~~~~~~~~~l~~~l~~Lk 66 (66)
T PF10458_consen 6 IERLEKELEKLEKEIERLEKKLSNENFVEKAPEEVVEKEREKLEELEEELEKLEEALEQLK 66 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHCSTTHHHHS-CCHHHHHHHHHHHHHHHHHHHHHHHHH--
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 3466777777788888888888732 000011134566778888999998888887764
No 145
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=22.90 E-value=2.3e+02 Score=20.20 Aligned_cols=10 Identities=30% Similarity=0.836 Sum_probs=4.9
Q ss_pred ccccchhhhc
Q psy4154 217 VCEVCSAYLG 226 (346)
Q Consensus 217 vCdVCga~L~ 226 (346)
.|+|||.=|.
T Consensus 22 ~CPlC~r~l~ 31 (54)
T PF04423_consen 22 CCPLCGRPLD 31 (54)
T ss_dssp E-TTT--EE-
T ss_pred cCCCCCCCCC
Confidence 8999996543
No 146
>PRK00846 hypothetical protein; Provisional
Probab=22.86 E-value=3.9e+02 Score=21.14 Aligned_cols=48 Identities=21% Similarity=0.078 Sum_probs=27.0
Q ss_pred HHHHHHHHhhhh---hhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 271 RRTELAKQRLLE---TQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 271 RrI~~ak~RL~~---t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGee 318 (346)
.||..=..||+. |-++.+.........|+.|..+|..+..+...+...
T Consensus 13 ~Ri~~LE~rlAfQe~tIe~LN~~v~~qq~~I~~L~~ql~~L~~rL~~~~~s 63 (77)
T PRK00846 13 ARLVELETRLSFQEQALTELSEALADARLTGARNAELIRHLLEDLGKVRST 63 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc
Confidence 344444444443 223333334455667777777777777777776643
No 147
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=22.83 E-value=6e+02 Score=23.38 Aligned_cols=66 Identities=14% Similarity=0.108 Sum_probs=49.7
Q ss_pred HHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHH
Q psy4154 260 EHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMT 329 (346)
Q Consensus 260 r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGeeG~VdeA~~l~~ 329 (346)
.-|..+|++++.-|..++.-|+.... .......++..+...|.....+|+.+-..|+=+-|...+.
T Consensus 27 ~~l~q~irem~~~l~~ar~~lA~~~a----~~k~~e~~~~~~~~~~~~~~~~A~~Al~~G~EdLAr~Al~ 92 (219)
T TIGR02977 27 KMIRLIIQEMEDTLVEVRTTSARTIA----DKKELERRVSRLEAQVADWQEKAELALSKGREDLARAALI 92 (219)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH
Confidence 35678888888888888777764321 1224556788999999999999999999998887776644
No 148
>smart00035 CLa CLUSTERIN alpha chain.
Probab=22.60 E-value=1.8e+02 Score=27.46 Aligned_cols=56 Identities=20% Similarity=0.399 Sum_probs=37.3
Q ss_pred cccccchhhhcccCcchhhhcccCCccccchHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhh
Q psy4154 216 RVCEVCSAYLGIHDNDRRLADHFGGKLHLGFIKIREKLAELQAMEHLQAFISDCDRRTELAKQRLLET 283 (346)
Q Consensus 216 ~vCdVCga~L~~~d~~~R~~dH~~gK~H~g~~~lR~~~~El~~~r~L~~~i~d~DRrI~~ak~RL~~t 283 (346)
.-|+.|..+|++.|+..... -..||.++++ +++..+++-+.=|.-...-+.+++.|
T Consensus 73 dqCEKCqeiLsvDCs~~~P~----------q~~Lr~El~e--AL~LaE~ftqqYd~lL~~~q~~m~nT 128 (216)
T smart00035 73 DQCEKCQEILSVDCSTNNPD----------QPQLRQELDE--SLQLAERFTQQYDQLLQSYQKKMLNT 128 (216)
T ss_pred HHHHHHHHHHhhhccCCCcc----------hHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 57999999999998655430 1257777763 44555566666666666667777655
No 149
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=22.34 E-value=7.3e+02 Score=24.14 Aligned_cols=9 Identities=33% Similarity=0.434 Sum_probs=3.3
Q ss_pred HHHHHHHHH
Q psy4154 186 EIEDLRKKK 194 (346)
Q Consensus 186 ~~e~lka~~ 194 (346)
+-|.+.+++
T Consensus 226 EyEklE~EL 234 (267)
T PF10234_consen 226 EYEKLEEEL 234 (267)
T ss_pred HHHHHHHHH
Confidence 333333333
No 150
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=22.30 E-value=8.2e+02 Score=26.38 Aligned_cols=16 Identities=19% Similarity=0.196 Sum_probs=6.5
Q ss_pred HHHHHHHHHHHHHHHH
Q psy4154 181 LKLMTEIEDLRKKKAE 196 (346)
Q Consensus 181 ~~~~~~~e~lka~~de 196 (346)
...+..++.++..+.+
T Consensus 479 ~~~~~~~~~~~~~l~~ 494 (650)
T TIGR03185 479 ERAITIADKAKKTLKE 494 (650)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333344444444433
No 151
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=22.15 E-value=1.3e+03 Score=27.02 Aligned_cols=173 Identities=13% Similarity=0.196 Sum_probs=0.0
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHH-------HHH
Q psy4154 115 MEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLM-------TEI 187 (346)
Q Consensus 115 ~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~-------~~~ 187 (346)
++.++.-+.+.+.+....+..+.....++..........+..+++.+..+-.-...++.-+.+..+..+- ..+
T Consensus 825 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~le~~~~~~~~~~~~~~~~l~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 904 (1201)
T PF12128_consen 825 LRDLEQELQELEQELNQLQKEVKQRRKELEEELKALEEQLEQLEEQLRRLRDLLEKLAELSEPPNAEDAEGSVDERLRDL 904 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCchhhhhhHHHHHHHH
Q ss_pred HHHHHHHHHHHHhhccCCCCCchhhhcccccccchhhhcccCcchhhhcccCCccccchHHHHHHHH-----------HH
Q psy4154 188 EDLRKKKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRLADHFGGKLHLGFIKIREKLA-----------EL 256 (346)
Q Consensus 188 e~lka~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~~dH~~gK~H~g~~~lR~~~~-----------El 256 (346)
+.+...+..+++..+......... +..|.++-....|...+.... .+
T Consensus 905 ~~~~~~~~~l~~~l~~~~~~f~~~----------------------l~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 962 (1201)
T PF12128_consen 905 EDLLQRRKRLREELKKAVERFKGV----------------------LTKHSGSELAENWEELRSEDSFLSDKGINSDDYR 962 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----------------------HHhccccchHHHHHHHHHhccccccccccchhHH
Q ss_pred HHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 257 QAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKA 312 (346)
Q Consensus 257 ~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~ea 312 (346)
..+..|+.|++.. +...+.+|-..-......+..--..+..++-+|...-..+
T Consensus 963 ~~~~~l~~~~~~~---~~~~~~~l~e~~~~~~~~i~~f~~~l~~~~r~I~~~s~~l 1015 (1201)
T PF12128_consen 963 QWAPDLQELLDVL---IPQQQQALIEQGRNIGNDISNFYGVLEDFDRRIKSQSRRL 1015 (1201)
T ss_pred HHHHHHHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
No 152
>PF04363 DUF496: Protein of unknown function (DUF496); InterPro: IPR007458 Members of this family are uncharacterised proteins.
Probab=21.97 E-value=3.7e+02 Score=21.99 Aligned_cols=24 Identities=21% Similarity=0.314 Sum_probs=20.3
Q ss_pred HHHHHHHHHHhhHHHHHHHHHHhh
Q psy4154 115 MEHLQAFISDCDRRTELAKQRLLE 138 (346)
Q Consensus 115 ~~~L~~~I~dcdRkI~~ak~RLe~ 138 (346)
..-|.+-|.|.++||+-++.|+..
T Consensus 14 KNKl~REi~Dn~kKIRDNqKRV~L 37 (95)
T PF04363_consen 14 KNKLKREIEDNEKKIRDNQKRVLL 37 (95)
T ss_pred HHHHHHHHHHHHHHHhhhHHHHHH
Confidence 345778889999999999999865
No 153
>KOG2264|consensus
Probab=21.92 E-value=4.8e+02 Score=28.43 Aligned_cols=36 Identities=11% Similarity=0.155 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-----HHhhcCCHHHHHHHH
Q psy4154 149 EKANAVHQLAEDIGKKLAKAE-----ELGAEGFVEESLKLM 184 (346)
Q Consensus 149 ~~~~~i~~l~~~I~~~l~~aE-----~LGeeG~VdeA~~~~ 184 (346)
+..-+|.+++.+|.++-+.++ -+.-+|.|++||...
T Consensus 104 el~seI~~~n~kiEelk~~i~~~q~eL~~Lk~~ieqaq~~~ 144 (907)
T KOG2264|consen 104 ELNSEIEEINTKIEELKRLIPQKQLELSALKGEIEQAQRQL 144 (907)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHhHHHHHHHHhHHHHHHHHH
Confidence 344556666666666654333 233466666665433
No 154
>PF05149 Flagellar_rod: Paraflagellar rod protein; InterPro: IPR007824 This family consists of several eukaryotic paraflagellar rod component proteins. The eukaryotic flagellum represents one of the most complex macromolecular structures found in any organism and contains more than 250 proteins []. In addition to its locomotive role, the flagellum is probably involved in nutrient uptake since receptors for host low-density lipoproteins are localised on the flagellar membrane as well as on the flagellar pocket membrane [].; GO: 0005516 calmodulin binding, 0009434 microtubule-based flagellum
Probab=21.86 E-value=7.7e+02 Score=24.26 Aligned_cols=53 Identities=13% Similarity=0.113 Sum_probs=27.8
Q ss_pred HHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy4154 264 AFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELG 316 (346)
Q Consensus 264 ~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LG 316 (346)
.=++.+|+.|+...-.++-..+...|....-.+.+..|..+-..+..++..|+
T Consensus 187 ~rle~l~~~iR~~~~~~E~ameTlDPnAk~y~~~~~dl~~~~~ev~~~I~~l~ 239 (289)
T PF05149_consen 187 RRLEELDRQIRSTHLQQERAMETLDPNAKKYSKEKKDLLEQREEVEQEINLLR 239 (289)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44667788888876666644443444333333444444444444444444443
No 155
>PRK02793 phi X174 lysis protein; Provisional
Probab=21.65 E-value=3.8e+02 Score=20.61 Aligned_cols=48 Identities=19% Similarity=0.152 Sum_probs=26.4
Q ss_pred HHHHHHHHHhhhh---hhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 270 DRRTELAKQRLLE---TQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 317 (346)
Q Consensus 270 DRrI~~ak~RL~~---t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGe 317 (346)
+.||..=..||+. |-++.+.........|+.|..++..|..+...++.
T Consensus 7 e~Ri~~LE~~lafQe~tIe~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~~ 57 (72)
T PRK02793 7 EARLAELESRLAFQEITIEELNVTVTAHEMEMAKLRDHLRLLTEKLKASQP 57 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 3344444444443 22333333445566677777777777777766654
No 156
>COG4911 Uncharacterized conserved protein [Function unknown]
Probab=21.55 E-value=4.4e+02 Score=22.40 Aligned_cols=55 Identities=18% Similarity=0.160 Sum_probs=33.8
Q ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy4154 113 DAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA 173 (346)
Q Consensus 113 e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGe 173 (346)
++.+.|.-++.|.-.+-++-+.+.+.- . +++..-.+.++..+++..+++++++|-
T Consensus 11 e~~~el~P~l~d~~~~~r~~~n~~e~L----~--~qedk~~l~e~e~q~k~~l~~i~e~G~ 65 (123)
T COG4911 11 ETARELLPWLRDRLIQLRKIKNEIELL----L--VQEDKYALQEYESQTKKILDEIIEKGI 65 (123)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh----c--ccccHHHHHHHHHHHHHHHHHHHHcCc
Confidence 566666677775544444444444321 1 112234577889999999999998875
No 157
>KOG0150|consensus
Probab=21.53 E-value=1.3e+02 Score=29.93 Aligned_cols=28 Identities=21% Similarity=0.452 Sum_probs=20.0
Q ss_pred cccccchhhhcccCcchhhhcccCCccccc
Q psy4154 216 RVCEVCSAYLGIHDNDRRLADHFGGKLHLG 245 (346)
Q Consensus 216 ~vCdVCga~L~~~d~~~R~~dH~~gK~H~g 245 (346)
.-|++|..++.-+-+. +..|=.||-|-.
T Consensus 11 kfCdyCKiWi~dN~~S--v~~He~GkrHke 38 (336)
T KOG0150|consen 11 KFCDYCKIWIKDNPAS--VRFHERGKRHKE 38 (336)
T ss_pred hhhhhhhhhhcCChHH--HHhHhhhhHHHH
Confidence 4699999998765544 445667887754
No 158
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=21.11 E-value=1.8e+02 Score=17.19 Aligned_cols=27 Identities=37% Similarity=0.487 Sum_probs=20.8
Q ss_pred HHHHhhcCCHHHHHHHHHHHhhhcCCCCCC
Q psy4154 312 AEELGAEGFVEESLKLMTEQTSSCSPGHVP 341 (346)
Q Consensus 312 aE~LGeeG~VdeA~~l~~~~e~l~~~~~~~ 341 (346)
+..+...|++++|..+..+.. +.|..|
T Consensus 7 i~~~~~~~~~~~a~~~~~~M~---~~g~~p 33 (35)
T TIGR00756 7 IDGLCKAGRVEEALELFKEML---ERGIEP 33 (35)
T ss_pred HHHHHHCCCHHHHHHHHHHHH---HcCCCC
Confidence 345788999999999998764 457666
No 159
>PRK03918 chromosome segregation protein; Provisional
Probab=21.00 E-value=1.1e+03 Score=25.88 Aligned_cols=182 Identities=21% Similarity=0.257 Sum_probs=0.0
Q ss_pred hhhHHHHHHHHHHHHHhhHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHH
Q psy4154 109 YYDIDAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAEGFVEESLKLMTEIE 188 (346)
Q Consensus 109 ~yE~e~~~~L~~~I~dcdRkI~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGeeG~VdeA~~~~~~~e 188 (346)
.....-+..+..-+.+++..+..-..+++... ...........++.++...|..+..+...|+.. .+.........++
T Consensus 521 ~~~~~~~~~l~~~~~~l~~~l~~l~~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~L~~~-~~~~~~~~~~~l~ 598 (880)
T PRK03918 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLE-ELKKKLAELEKKLDELEEELAELLKELEELGFE-SVEELEERLKELE 598 (880)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-hHHHHHHHHHHhh
Q ss_pred HHHHHHHHHHHhhccCCCCCchhhhcccccccchhhhcccCcchhhhcccCCccccchHHHHHHHHHHH-HHHHHHHHHh
Q psy4154 189 DLRKKKAEAEDTYRNSMPASSYQQQKLRVCEVCSAYLGIHDNDRRLADHFGGKLHLGFIKIREKLAELQ-AMEHLQAFIS 267 (346)
Q Consensus 189 ~lka~~deL~~~~rn~~~~~~~~~~k~~vCdVCga~L~~~d~~~R~~dH~~gK~H~g~~~lR~~~~El~-~~r~L~~~i~ 267 (346)
.....-.++........ .++..+.++. .+..++.-|.
T Consensus 599 ~~~~~~~~~~~~~~~l~------------------------------------------~~~~~l~~~~~~l~~~~~~i~ 636 (880)
T PRK03918 599 PFYNEYLELKDAEKELE------------------------------------------REEKELKKLEEELDKAFEELA 636 (880)
T ss_pred hhHHHHHHHHHHHHHHH------------------------------------------HHHHHHHHHHHHHHHHHHHHH
Q ss_pred hhHHHHHHHHHhhhhhh--------hhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----cCCHHHHHHHHHHHhhh
Q psy4154 268 DCDRRTELAKQRLLETQ--------EVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA-----EGFVEESLKLMTEQTSS 334 (346)
Q Consensus 268 d~DRrI~~ak~RL~~t~--------ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGe-----eG~VdeA~~l~~~~e~l 334 (346)
++..+|..-+.+++... +...........++..+...|+.+..+++.+-. +..+.+......+++.+
T Consensus 637 ~l~~~i~~l~~~~~~l~~~~~~~~~~~l~~~~~~l~~~l~~l~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~l 716 (880)
T PRK03918 637 ETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKL 716 (880)
T ss_pred HHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 160
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=20.98 E-value=5.9e+02 Score=27.79 Aligned_cols=35 Identities=17% Similarity=0.239 Sum_probs=19.5
Q ss_pred HHHHHHHHH-HHHHHHHHHhhhHHHHHHHHHhhhhh
Q psy4154 249 IREKLAELQ-AMEHLQAFISDCDRRTELAKQRLLET 283 (346)
Q Consensus 249 lR~~~~El~-~~r~L~~~i~d~DRrI~~ak~RL~~t 283 (346)
++.+.+.++ ....|+..|..+.+.|+.=+.+|+..
T Consensus 427 ~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~ 462 (652)
T COG2433 427 LEETVERLEEENSELKRELEELKREIEKLESELERF 462 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444443 34556666666666666666666544
No 161
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=20.81 E-value=7e+02 Score=23.95 Aligned_cols=14 Identities=43% Similarity=0.544 Sum_probs=7.9
Q ss_pred HHHHHHHHHHHHHH
Q psy4154 186 EIEDLRKKKAEAED 199 (346)
Q Consensus 186 ~~e~lka~~deL~~ 199 (346)
++..|+...++|+.
T Consensus 192 ev~~L~~r~~ELe~ 205 (290)
T COG4026 192 EVYDLKKRWDELEP 205 (290)
T ss_pred HHHHHHHHHHHhcc
Confidence 55566666555543
No 162
>PF11945 WASH_WAHD: WAHD domain of WASH complex; InterPro: IPR021854 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. This entry represents the WASH subunit of the WASH complex. WASH genes duplicated to multiple chromosomal ends during primate evolution, with highest copy number reached in humans, whose WASH repertoires probably vary extensively among individuals []. It is therefore difficult to determine which gene is functional or not. The telomeric region of chromosome 9p is paralogous to the pericentromeric regions of chromosome 9 as well as to 2q. Paralogous regions contain 7 transcriptional units. Duplicated WASH genes are also present in the Xq/Yq pseudoautosomal region, as well as on chromosome 1 and 15. The chromosome 16 copy seems to be a pseudogene.
Probab=20.48 E-value=3.8e+02 Score=26.38 Aligned_cols=51 Identities=22% Similarity=0.219 Sum_probs=34.5
Q ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q psy4154 254 AELQAMEHLQAFISDCDRRTELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGAE 318 (346)
Q Consensus 254 ~El~~~r~L~~~i~d~DRrI~~ak~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE~LGee 318 (346)
.-.+.+.+|++.++|+=.||.. |++ ....+|..|+.+|....+++++|...
T Consensus 22 qi~~aL~~L~~v~~diF~rI~~---Rv~-----------~~~~~l~~i~~Ri~~~qaKi~~l~gs 72 (297)
T PF11945_consen 22 QIADALEYLDKVSNDIFSRISA---RVE-----------RNRERLQAIQQRIEVAQAKIEKLQGS 72 (297)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHH-----------HHHHHHHHHHHHHHHHHHHHHHHhCC
Confidence 3566788888888888777643 332 22345667788888888888877543
No 163
>PF10367 Vps39_2: Vacuolar sorting protein 39 domain 2; InterPro: IPR019453 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 [].
Probab=20.45 E-value=2.6e+02 Score=21.95 Aligned_cols=12 Identities=42% Similarity=0.816 Sum_probs=9.9
Q ss_pred cccccchhhhcc
Q psy4154 216 RVCEVCSAYLGI 227 (346)
Q Consensus 216 ~vCdVCga~L~~ 227 (346)
..|.|||--|..
T Consensus 79 ~~C~vC~k~l~~ 90 (109)
T PF10367_consen 79 TKCSVCGKPLGN 90 (109)
T ss_pred CCccCcCCcCCC
Confidence 679999998764
No 164
>PF04949 Transcrip_act: Transcriptional activator; InterPro: IPR007033 Golgins are a family of coiled-coil proteins associated with the Golgi apparatus necessary for tethering events in membrane fusion and as structural supports for Golgi cisternae []. This entry represents proteins annotated as RAB6-interacting golgins.
Probab=20.20 E-value=4.8e+02 Score=23.35 Aligned_cols=62 Identities=23% Similarity=0.222 Sum_probs=32.2
Q ss_pred HHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----cCCHHHHHHHHHHHHHH
Q psy4154 129 TELAKQRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAEELGA-----EGFVEESLKLMTEIEDL 190 (346)
Q Consensus 129 I~~ak~RLe~~~~~~~~~~~~~~~~i~~l~~~I~~~l~~aE~LGe-----eG~VdeA~~~~~~~e~l 190 (346)
++...+||....+++..-..-..+++..+..+|+..-.+..-||- +-...+|+..+.+...-
T Consensus 61 veEetkrLa~ireeLE~l~dP~RkEv~~vRkkID~vNreLkpl~~~cqKKEkEykealea~nEknke 127 (159)
T PF04949_consen 61 VEEETKRLAEIREELEVLADPMRKEVEMVRKKIDSVNRELKPLGQSCQKKEKEYKEALEAFNEKNKE 127 (159)
T ss_pred HHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444445555444443211111235666777777777776666665 33455565555554433
No 165
>PF06825 HSBP1: Heat shock factor binding protein 1; InterPro: IPR009643 Heat shock factor binding protein 1 (HSBP1) appears to be a negative regulator of the heat shock response [].; PDB: 3CI9_A.
Probab=20.17 E-value=2.1e+02 Score=21.08 Aligned_cols=36 Identities=11% Similarity=0.300 Sum_probs=15.4
Q ss_pred HhhhhhhhhhhHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy4154 278 QRLLETQEVLSAEVAEKANAVHQLAEDIGKKLAKAE 313 (346)
Q Consensus 278 ~RL~~t~ee~~~~~~~~~~~i~~l~ekI~~ll~eaE 313 (346)
.|........-....+-..+|+.|+..|..++.++.
T Consensus 17 ~kFq~mS~~I~~riDeM~~RIDdLE~si~dl~~qag 52 (54)
T PF06825_consen 17 DKFQTMSDQILGRIDEMSSRIDDLEKSIADLMTQAG 52 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHCCHHHH--------
T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhcC
Confidence 344433333333345666677777777777776653
Done!