Query 019170
Match_columns 345
No_of_seqs 284 out of 1077
Neff 5.0
Searched_HMMs 46136
Date Fri Mar 29 07:16:17 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/019170.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/019170hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0774 Transcription factor P 100.0 3.5E-31 7.5E-36 248.8 17.7 207 68-319 26-249 (334)
2 KOG0773 Transcription factor M 99.9 3.1E-28 6.7E-33 237.3 0.9 249 67-323 46-304 (342)
3 PF03791 KNOX2: KNOX2 domain ; 99.9 2.9E-23 6.2E-28 152.9 6.9 50 126-179 3-52 (52)
4 PF03790 KNOX1: KNOX1 domain ; 99.8 1.3E-20 2.9E-25 134.7 3.0 43 71-113 1-43 (45)
5 PF05920 Homeobox_KN: Homeobox 99.7 8E-17 1.7E-21 113.0 4.0 40 276-315 1-40 (40)
6 cd00086 homeodomain Homeodomai 99.5 4.7E-14 1E-18 103.2 6.6 57 260-319 2-58 (59)
7 smart00389 HOX Homeodomain. DN 99.5 8.5E-14 1.8E-18 101.3 6.4 55 260-317 2-56 (56)
8 PF00046 Homeobox: Homeobox do 99.5 6.3E-14 1.4E-18 103.0 5.3 57 259-318 1-57 (57)
9 PF03792 PBC: PBC domain; Int 99.3 1.5E-11 3.3E-16 112.1 12.7 148 68-257 24-190 (191)
10 KOG0775 Transcription factor S 99.3 1.9E-12 4.2E-17 123.4 5.8 50 265-317 183-232 (304)
11 TIGR01565 homeo_ZF_HD homeobox 99.0 1.1E-09 2.3E-14 82.9 5.2 53 258-313 1-57 (58)
12 KOG0843 Transcription factor E 98.9 6.8E-10 1.5E-14 100.8 4.2 61 257-320 101-161 (197)
13 KOG0489 Transcription factor z 98.9 1.1E-09 2.3E-14 104.7 3.6 65 258-325 159-223 (261)
14 KOG0487 Transcription factor A 98.9 9E-10 2E-14 107.4 3.1 59 259-320 236-294 (308)
15 KOG0485 Transcription factor N 98.8 8E-09 1.7E-13 96.3 6.3 59 258-319 104-162 (268)
16 KOG0493 Transcription factor E 98.8 5E-09 1.1E-13 99.9 5.0 62 258-322 246-307 (342)
17 KOG0850 Transcription factor D 98.8 5.9E-09 1.3E-13 97.8 3.9 59 259-320 123-181 (245)
18 KOG0488 Transcription factor B 98.7 7.2E-09 1.6E-13 101.5 4.4 62 258-322 172-233 (309)
19 KOG0842 Transcription factor t 98.7 7.8E-09 1.7E-13 100.9 3.3 59 258-319 153-211 (307)
20 KOG3802 Transcription factor O 98.7 1.1E-08 2.3E-13 102.4 4.1 67 252-321 288-354 (398)
21 KOG0483 Transcription factor H 98.7 1.3E-08 2.8E-13 94.0 4.1 59 258-319 50-108 (198)
22 COG5576 Homeodomain-containing 98.6 3.2E-08 7E-13 88.4 4.2 62 258-322 51-112 (156)
23 KOG0491 Transcription factor B 98.6 2.8E-08 6E-13 89.4 3.2 59 257-318 99-157 (194)
24 KOG0484 Transcription factor P 98.6 2.1E-08 4.6E-13 84.1 1.6 63 254-319 13-75 (125)
25 KOG0492 Transcription factor M 98.5 4.6E-08 9.9E-13 90.8 3.1 67 250-319 136-202 (246)
26 KOG2251 Homeobox transcription 98.5 1.5E-07 3.3E-12 87.9 4.3 59 258-319 37-95 (228)
27 KOG0494 Transcription factor C 98.4 1.9E-07 4E-12 89.3 4.1 55 262-319 145-199 (332)
28 KOG4577 Transcription factor L 98.3 2.7E-07 5.9E-12 89.3 2.5 67 252-321 161-227 (383)
29 KOG0486 Transcription factor P 98.3 4E-07 8.7E-12 88.9 2.8 59 259-320 113-171 (351)
30 KOG0848 Transcription factor C 98.3 2.6E-07 5.6E-12 88.6 1.1 56 262-320 203-258 (317)
31 KOG0847 Transcription factor, 98.0 4.3E-06 9.2E-11 78.5 3.6 60 258-320 167-226 (288)
32 KOG2252 CCAAT displacement pro 98.0 7.1E-06 1.5E-10 85.1 5.3 56 258-316 420-475 (558)
33 KOG0844 Transcription factor E 97.8 5.9E-06 1.3E-10 80.7 1.7 56 262-320 185-240 (408)
34 KOG0490 Transcription factor, 97.8 7.5E-06 1.6E-10 74.7 2.0 62 256-320 58-119 (235)
35 KOG0849 Transcription factor P 97.8 1.4E-05 3.1E-10 79.7 3.6 61 258-321 176-236 (354)
36 KOG1168 Transcription factor A 97.7 1.2E-05 2.7E-10 78.0 1.0 64 257-323 308-371 (385)
37 KOG0773 Transcription factor M 97.4 7.5E-05 1.6E-09 73.4 2.4 64 258-322 95-158 (342)
38 PF03789 ELK: ELK domain ; In 97.1 0.00038 8.2E-09 43.2 2.1 22 236-257 1-22 (22)
39 PF11569 Homez: Homeodomain le 97.0 0.00068 1.5E-08 51.2 3.2 43 270-315 10-52 (56)
40 KOG0490 Transcription factor, 95.5 0.0099 2.1E-07 54.2 2.8 61 258-321 153-213 (235)
41 KOG1146 Homeobox protein [Gene 91.6 0.14 3.1E-06 58.6 3.4 61 258-321 903-963 (1406)
42 PF04218 CENP-B_N: CENP-B N-te 88.4 0.84 1.8E-05 33.6 4.1 47 259-313 1-47 (53)
43 KOG3623 Homeobox transcription 86.0 1.7 3.7E-05 47.8 6.4 52 270-324 568-619 (1007)
44 PF01527 HTH_Tnp_1: Transposas 70.9 5.9 0.00013 29.9 3.6 47 260-313 2-48 (76)
45 cd00569 HTH_Hin_like Helix-tur 70.5 12 0.00026 22.4 4.4 39 263-309 4-42 (42)
46 cd06171 Sigma70_r4 Sigma70, re 69.0 7.5 0.00016 26.1 3.5 46 264-317 10-55 (55)
47 PF08281 Sigma70_r4_2: Sigma-7 63.9 5.9 0.00013 28.2 2.2 44 264-315 10-53 (54)
48 PF04545 Sigma70_r4: Sigma-70, 59.1 8.1 0.00018 27.3 2.2 46 264-317 4-49 (50)
49 PRK06759 RNA polymerase factor 56.7 13 0.00028 31.4 3.4 47 264-318 106-152 (154)
50 PRK09646 RNA polymerase sigma 51.5 15 0.00032 32.8 3.1 50 264-321 142-191 (194)
51 PRK09642 RNA polymerase sigma 49.0 19 0.00041 30.7 3.3 48 264-319 106-153 (160)
52 PF13443 HTH_26: Cro/C1-type H 48.7 14 0.00031 26.9 2.1 24 290-313 12-35 (63)
53 KOG2070 Guanine nucleotide exc 47.9 35 0.00077 36.4 5.5 26 129-154 182-214 (661)
54 PRK09644 RNA polymerase sigma 46.4 21 0.00045 30.8 3.2 49 263-319 107-155 (165)
55 PRK03975 tfx putative transcri 45.3 23 0.00049 31.5 3.2 48 262-318 4-51 (141)
56 TIGR02937 sigma70-ECF RNA poly 45.0 30 0.00065 27.9 3.7 47 264-318 110-156 (158)
57 smart00421 HTH_LUXR helix_turn 42.7 47 0.001 22.6 4.0 47 264-319 3-49 (58)
58 PRK11924 RNA polymerase sigma 42.7 23 0.00051 30.2 2.8 48 264-319 125-172 (179)
59 PRK09652 RNA polymerase sigma 42.1 26 0.00056 30.0 3.0 48 264-319 128-175 (182)
60 PF13518 HTH_28: Helix-turn-he 41.8 25 0.00054 24.4 2.4 24 291-314 15-38 (52)
61 PRK12514 RNA polymerase sigma 41.7 26 0.00056 30.6 3.0 48 264-319 129-176 (179)
62 PRK12540 RNA polymerase sigma 41.5 27 0.00059 31.1 3.2 50 264-321 111-160 (182)
63 KOG4445 Uncharacterized conser 41.1 55 0.0012 33.0 5.4 47 132-180 134-180 (368)
64 TIGR02985 Sig70_bacteroi1 RNA 40.8 43 0.00094 27.9 4.2 47 264-318 113-159 (161)
65 TIGR02983 SigE-fam_strep RNA p 40.7 30 0.00064 29.5 3.2 50 264-321 110-159 (162)
66 PRK00118 putative DNA-binding 40.6 31 0.00068 29.1 3.2 47 264-318 17-63 (104)
67 TIGR02939 RpoE_Sigma70 RNA pol 40.3 22 0.00047 31.0 2.4 49 264-320 138-186 (190)
68 PRK12541 RNA polymerase sigma 40.2 29 0.00063 29.7 3.1 48 263-318 111-158 (161)
69 PRK12547 RNA polymerase sigma 40.1 29 0.00062 30.0 3.0 48 264-319 112-159 (164)
70 TIGR02999 Sig-70_X6 RNA polyme 39.8 31 0.00067 30.0 3.2 48 264-319 134-181 (183)
71 TIGR02989 Sig-70_gvs1 RNA poly 38.9 34 0.00073 28.9 3.2 47 264-318 111-157 (159)
72 PRK06811 RNA polymerase factor 38.0 34 0.00074 30.3 3.2 49 264-320 131-179 (189)
73 PRK12526 RNA polymerase sigma 37.0 33 0.00072 31.0 3.1 49 264-320 153-201 (206)
74 PF11288 DUF3089: Protein of u 36.6 8.7 0.00019 36.2 -0.8 38 58-101 100-137 (207)
75 PRK12523 RNA polymerase sigma 36.6 38 0.00082 29.4 3.3 48 264-319 119-166 (172)
76 PF10668 Phage_terminase: Phag 36.1 28 0.00061 26.7 2.0 21 290-310 24-44 (60)
77 PF00196 GerE: Bacterial regul 33.9 51 0.0011 23.8 3.1 48 264-320 3-50 (58)
78 PF01381 HTH_3: Helix-turn-hel 33.5 33 0.00072 24.1 2.0 21 292-312 13-33 (55)
79 PRK05602 RNA polymerase sigma 32.7 40 0.00086 29.6 2.8 50 264-321 128-177 (186)
80 PRK09648 RNA polymerase sigma 32.5 46 0.001 29.3 3.2 48 264-319 139-186 (189)
81 PRK12546 RNA polymerase sigma 32.2 41 0.00089 30.2 2.8 49 264-320 113-161 (188)
82 PRK12533 RNA polymerase sigma 31.9 42 0.0009 31.1 2.9 51 264-322 134-184 (216)
83 PRK09047 RNA polymerase factor 31.8 53 0.0012 27.7 3.4 48 264-319 106-153 (161)
84 PF13097 CENP-U: CENP-A nucleo 31.8 1.1E+02 0.0025 28.2 5.6 45 131-178 103-148 (175)
85 PRK12512 RNA polymerase sigma 31.6 47 0.001 29.0 3.1 49 264-320 131-179 (184)
86 PRK12532 RNA polymerase sigma 31.4 47 0.001 29.5 3.0 49 264-320 136-184 (195)
87 PF13551 HTH_29: Winged helix- 31.4 2.3E+02 0.0051 22.2 6.9 46 265-310 58-109 (112)
88 PF05190 MutS_IV: MutS family 31.4 81 0.0017 24.3 4.1 26 129-154 1-26 (92)
89 PRK09649 RNA polymerase sigma 31.2 48 0.001 29.4 3.0 48 264-319 130-177 (185)
90 TIGR03070 couple_hipB transcri 31.0 35 0.00076 23.7 1.8 22 292-313 19-40 (58)
91 PRK12536 RNA polymerase sigma 30.9 51 0.0011 28.9 3.2 48 264-319 129-176 (181)
92 PRK13919 putative RNA polymera 30.7 51 0.0011 28.8 3.1 48 264-319 135-182 (186)
93 PRK12520 RNA polymerase sigma 30.6 53 0.0011 29.0 3.2 48 264-319 131-178 (191)
94 PRK12516 RNA polymerase sigma 30.4 50 0.0011 29.5 3.0 50 264-321 116-165 (187)
95 TIGR02954 Sig70_famx3 RNA poly 29.9 54 0.0012 28.3 3.1 48 264-319 119-166 (169)
96 PRK12519 RNA polymerase sigma 29.9 40 0.00086 29.8 2.3 47 264-318 141-187 (194)
97 PF12022 DUF3510: Domain of un 29.8 2.1E+02 0.0046 24.5 6.7 20 161-180 80-99 (125)
98 PRK11511 DNA-binding transcrip 29.7 1.3E+02 0.0029 25.3 5.4 41 268-312 9-49 (127)
99 PRK12530 RNA polymerase sigma 29.6 53 0.0011 29.2 3.1 48 264-319 134-181 (189)
100 PRK12531 RNA polymerase sigma 29.5 54 0.0012 29.2 3.1 50 263-320 140-189 (194)
101 PRK12511 RNA polymerase sigma 29.3 55 0.0012 29.2 3.1 50 264-321 111-160 (182)
102 PRK15369 two component system 29.2 1.2E+02 0.0025 25.4 5.0 48 264-320 149-196 (211)
103 cd01392 HTH_LacI Helix-turn-he 28.9 31 0.00068 24.0 1.2 21 293-313 2-22 (52)
104 cd00131 PAX Paired Box domain 28.7 1.7E+02 0.0036 25.1 5.9 46 264-312 75-127 (128)
105 PRK12524 RNA polymerase sigma 28.5 56 0.0012 29.1 3.0 49 264-320 136-184 (196)
106 PRK12542 RNA polymerase sigma 28.4 57 0.0012 28.6 3.1 49 264-320 122-170 (185)
107 TIGR02959 SigZ RNA polymerase 28.1 62 0.0013 28.2 3.2 49 264-320 100-148 (170)
108 PRK09639 RNA polymerase sigma 28.1 60 0.0013 27.7 3.0 47 264-319 112-158 (166)
109 PRK10072 putative transcriptio 28.0 40 0.00086 28.0 1.8 23 291-313 49-71 (96)
110 PRK12537 RNA polymerase sigma 27.9 60 0.0013 28.5 3.1 47 264-318 133-179 (182)
111 cd06170 LuxR_C_like C-terminal 27.9 1E+02 0.0022 21.0 3.7 45 265-318 1-45 (57)
112 cd00093 HTH_XRE Helix-turn-hel 27.5 50 0.0011 21.5 2.0 21 292-312 16-36 (58)
113 PF13384 HTH_23: Homeodomain-l 27.4 50 0.0011 22.9 2.0 23 291-313 20-42 (50)
114 PRK12529 RNA polymerase sigma 27.4 74 0.0016 27.9 3.6 48 264-319 127-174 (178)
115 PRK12535 RNA polymerase sigma 27.4 62 0.0013 29.2 3.1 52 264-323 133-184 (196)
116 PF04967 HTH_10: HTH DNA bindi 26.7 1.3E+02 0.0028 22.4 4.2 47 265-312 1-47 (53)
117 PF12362 DUF3646: DNA polymera 26.5 45 0.00097 28.7 1.9 27 64-90 82-108 (117)
118 PRK09645 RNA polymerase sigma 26.3 68 0.0015 27.7 3.1 49 264-320 118-166 (173)
119 PF15500 Toxin_39: Putative RN 26.0 1.5E+02 0.0033 24.7 4.8 38 128-172 44-81 (96)
120 PRK06986 fliA flagellar biosyn 25.9 58 0.0013 30.1 2.7 48 264-319 184-231 (236)
121 PRK09413 IS2 repressor TnpA; R 25.8 1.1E+02 0.0025 25.6 4.3 48 260-314 8-55 (121)
122 TIGR02947 SigH_actino RNA poly 25.7 32 0.00069 30.5 0.9 48 264-319 131-178 (193)
123 TIGR02948 SigW_bacill RNA poly 25.2 59 0.0013 28.2 2.5 48 264-319 136-183 (187)
124 PRK12515 RNA polymerase sigma 25.1 76 0.0016 28.0 3.2 48 264-319 131-178 (189)
125 PRK08583 RNA polymerase sigma 24.8 69 0.0015 30.0 3.0 48 264-319 205-252 (257)
126 PRK07037 extracytoplasmic-func 24.7 74 0.0016 27.1 3.0 48 264-319 109-156 (163)
127 PRK09415 RNA polymerase factor 24.7 64 0.0014 28.3 2.6 47 264-318 127-173 (179)
128 PRK12528 RNA polymerase sigma 24.6 83 0.0018 26.8 3.3 46 264-317 113-158 (161)
129 PRK12543 RNA polymerase sigma 24.4 80 0.0017 27.6 3.2 47 264-318 117-163 (179)
130 PRK12539 RNA polymerase sigma 24.4 73 0.0016 28.0 3.0 48 264-319 131-178 (184)
131 PF13411 MerR_1: MerR HTH fami 24.3 54 0.0012 24.1 1.8 18 292-309 4-21 (69)
132 smart00530 HTH_XRE Helix-turn- 24.2 62 0.0013 20.8 2.0 22 291-312 13-34 (56)
133 KOG4040 NADH:ubiquinone oxidor 24.0 47 0.001 30.5 1.6 38 267-304 23-61 (186)
134 TIGR03001 Sig-70_gmx1 RNA poly 24.0 74 0.0016 30.1 3.1 48 264-319 161-208 (244)
135 PRK09636 RNA polymerase sigma 23.7 1.1E+02 0.0024 29.3 4.2 51 264-322 115-165 (293)
136 PRK12513 RNA polymerase sigma 23.4 35 0.00075 30.2 0.7 48 264-319 139-186 (194)
137 TIGR02943 Sig70_famx1 RNA poly 23.1 87 0.0019 27.9 3.2 48 264-319 131-178 (188)
138 PHA01976 helix-turn-helix prot 22.9 61 0.0013 23.8 1.8 22 292-313 19-40 (67)
139 TIGR02941 Sigma_B RNA polymera 22.7 92 0.002 29.1 3.4 47 264-318 205-251 (255)
140 PRK12545 RNA polymerase sigma 22.4 84 0.0018 28.3 3.0 49 264-320 139-187 (201)
141 PF10782 DUF2602: Protein of u 22.4 78 0.0017 24.3 2.3 40 132-177 8-53 (58)
142 PF13730 HTH_36: Helix-turn-he 22.2 2.3E+02 0.005 19.9 4.7 48 264-314 2-51 (55)
143 PRK09647 RNA polymerase sigma 22.2 87 0.0019 28.5 3.1 48 264-319 138-185 (203)
144 PRK12538 RNA polymerase sigma 22.2 70 0.0015 29.9 2.5 48 264-319 171-218 (233)
145 COG2979 Uncharacterized protei 21.9 2.6E+02 0.0057 26.8 6.2 30 128-169 142-171 (225)
146 TIGR02607 antidote_HigA addict 21.6 64 0.0014 24.3 1.8 22 292-313 22-43 (78)
147 cd04761 HTH_MerR-SF Helix-Turn 21.2 78 0.0017 21.5 2.0 19 292-310 4-22 (49)
148 PRK12534 RNA polymerase sigma 21.2 93 0.002 27.2 3.0 47 264-318 137-183 (187)
149 PRK04217 hypothetical protein; 21.1 1.5E+02 0.0033 25.2 4.1 49 263-319 41-89 (110)
150 PRK07670 RNA polymerase sigma 21.1 93 0.002 29.1 3.1 47 264-318 201-247 (251)
151 PRK12527 RNA polymerase sigma 21.1 1E+02 0.0022 26.2 3.1 47 265-319 106-152 (159)
152 TIGR02980 SigBFG RNA polymeras 21.0 99 0.0022 28.2 3.2 47 264-318 178-224 (227)
153 PRK08241 RNA polymerase factor 21.0 98 0.0021 30.1 3.4 50 264-321 153-202 (339)
154 PRK09641 RNA polymerase sigma 20.9 89 0.0019 27.1 2.8 48 264-319 136-183 (187)
155 PRK09637 RNA polymerase sigma 20.6 94 0.002 27.5 2.9 50 264-321 106-155 (181)
156 TIGR02479 FliA_WhiG RNA polyme 20.1 1E+02 0.0022 28.2 3.0 48 264-319 175-222 (224)
No 1
>KOG0774 consensus Transcription factor PBX and related HOX domain proteins [Transcription]
Probab=99.97 E-value=3.5e-31 Score=248.84 Aligned_cols=207 Identities=24% Similarity=0.374 Sum_probs=173.5
Q ss_pred hhhHHHHHHHhhCCChHHHHHHHHhh-----hcccCCCcchhhHHHHHhhhHHHHHhhhhcCCC---------CCCCChH
Q 019170 68 WETVKCKAEIVGHPLYEQLLSAHVSC-----LRIATPVDQLPKIDAQLSRSRDVLAKYSAVANG---------RVLDDKE 133 (345)
Q Consensus 68 ~e~~~lKa~I~sHPlYp~Ll~A~i~C-----~KVgaP~e~~~~ld~~~~~~~~~~~k~~~~~~~---------~~g~dpE 133 (345)
+|.++.|.+|.+||+||.|++++|+. +.|....|.-+. |.++.++++|+...+++|+. ..|+..+
T Consensus 26 Deaqa~K~~lnch~mk~AlfsVLcE~KeKt~lsir~~qdeep~-dpqlmRLDnML~AEGVagPekgga~~~~Asgg~hsd 104 (334)
T KOG0774|consen 26 DEAQARKHALNCHRMKPALFSVLCEIKEKTVLSIRGMQDEEPP-DPQLMRLDNMLLAEGVAGPEKGGARAAAASGGDHSD 104 (334)
T ss_pred chHHhhhhccccccchHHHHHHHHHhhhhheeeeccccccCCC-ChHHHHHHHHHHHhcccCccccchhhhhccCCChHH
Confidence 56669999999999999999999995 555533332221 33444556667677777653 2345678
Q ss_pred HHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCccccccCCCCccCCCCCCCCC
Q 019170 134 LDQFMTHYVLLLYSFKEQLQQHVRVHAMEAVMACWDLEQSLQSLTGVSPGESTGATMSDDDEDQVDSDTNFFDGSLDGPD 213 (345)
Q Consensus 134 LDqFMeaYc~vL~kykEEL~kP~~~~~~EA~~f~~~iE~QL~sL~~~s~~~~~~~~~S~~e~~~~~s~~~~~~g~~~~~d 213 (345)
+.+-+.. +...|++||++ +...|+++.+.+.+|.. +
T Consensus 105 YR~kL~q---iR~iy~~Elek--------yeqaCneftthV~nlL~---------------------------------e 140 (334)
T KOG0774|consen 105 YRAKLLQ---IRQIYHNELEK--------YEQACNEFTTHVMNLLR---------------------------------E 140 (334)
T ss_pred HHHHHHH---HHHHHHHHHHH--------HHHHHHHHHHHHHHHHH---------------------------------H
Confidence 8877766 99999999987 56679999999999995 2
Q ss_pred CCCCCCCCCChhHHH--HHHHHHHHHHHHHhhhhhhHHHHHHHHHhh-hcCCCCCChhHHHHHHHHHHHhcCCCCCCHHH
Q 019170 214 SMGFGPLVPTESERS--LMERVRHELKHELKQGYKEKIVDIREEILR-KRRAGKLPGDTTSLLKAWWLSHAKWPYPTEED 290 (345)
Q Consensus 214 ~~~f~p~~~~~~e~~--~~~~~~~eLk~~l~~ky~~~i~~lr~e~~k-krkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~e 290 (345)
.+.|+||.+.++|+. .|.+.|.-++..|++..|..+..||.++.. +|||++|++.++.+|..||+.|..|||||+++
T Consensus 141 Qsr~RPi~~ke~e~m~~~i~~kF~~iq~~lkqstce~vmiLr~r~ldarRKRRNFsK~aTeiLneyF~~h~~nPYPSee~ 220 (334)
T KOG0774|consen 141 QSRTRPIMPKEIERMVQIISKKFSHIQMQLKQSTCEAVMILRSRFLDARRKRRNFSKQATEILNEYFYSHLSNPYPSEEA 220 (334)
T ss_pred hcccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhHHHHHHHHHHHhcCCCCCcHHH
Confidence 346899999999984 478889999999999999999999999985 56788999999999999999999999999999
Q ss_pred HHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 291 KARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 291 K~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
|+.||++||++..||+|||.|+|.|.||.
T Consensus 221 K~eLAkqCnItvsQvsnwfgnkrIrykK~ 249 (334)
T KOG0774|consen 221 KEELAKQCNITVSQVSNWFGNKRIRYKKN 249 (334)
T ss_pred HHHHHHHcCceehhhccccccceeehhhh
Confidence 99999999999999999999999999984
No 2
>KOG0773 consensus Transcription factor MEIS1 and related HOX domain proteins [Transcription]
Probab=99.94 E-value=3.1e-28 Score=237.30 Aligned_cols=249 Identities=23% Similarity=0.306 Sum_probs=175.2
Q ss_pred hhhhHHHHHHHhhCCChHHHHHHHHhhhcccCCCcchhhHHHHHhhhHHHHHhhhhcCC-----CCCCCChHHHHHHHHH
Q 019170 67 DWETVKCKAEIVGHPLYEQLLSAHVSCLRIATPVDQLPKIDAQLSRSRDVLAKYSAVAN-----GRVLDDKELDQFMTHY 141 (345)
Q Consensus 67 ~~e~~~lKa~I~sHPlYp~Ll~A~i~C~KVgaP~e~~~~ld~~~~~~~~~~~k~~~~~~-----~~~g~dpELDqFMeaY 141 (345)
......+|..+.+||+|..++.||++|+++++|.+.+.+.++...........+...+. ...+..++|+.||..|
T Consensus 46 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~s~~~~~~~~~~~~~~~~~k~ 125 (342)
T KOG0773|consen 46 MVSLASSKYLTAAQELLDEFCSAGLDCLKGKMPYDPVPRSPASLSPPEDKGARRGNATRESATLKAWLEEHRLNPYPSKL 125 (342)
T ss_pred ccccccccccccchhHHhHHhhccccccccccCcCccccccccccCccccccccccccccccccccchhhhhhccCchHH
Confidence 45566789999999999999999999999999999888865543322222222221111 1345689999999999
Q ss_pred HHHHHHHHHHHhHHhhhhHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCccccccC---CCCccCCCCCCCCCCCCCC
Q 019170 142 VLLLYSFKEQLQQHVRVHAMEAVMACWDLEQSLQSLTGVSPGESTGATMSDDDEDQVD---SDTNFFDGSLDGPDSMGFG 218 (345)
Q Consensus 142 c~vL~kykEEL~kP~~~~~~EA~~f~~~iE~QL~sL~~~s~~~~~~~~~S~~e~~~~~---s~~~~~~g~~~~~d~~~f~ 218 (345)
+.+|..+...|+..+. -++|+.++++++..+...++.++......+...+.++... +...... +..+++
T Consensus 126 ~~~ll~~~~~~~~~~~--~~~~~~a~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~------~~~~~~ 197 (342)
T KOG0773|consen 126 EKILLAVITKLTLTQV--STWFANARRRLKKELKMTWGPTPLALDGISRHFSDLEKEKAIGGQLSSSE------ELLGES 197 (342)
T ss_pred HHHHHHHHHHhhhhhH--HHHHHHHHHHHHhccCCCCCCccccccchhhhhhhhhhcccccccccccc------cccccc
Confidence 9999999999999863 3899999999999999999766554433322222111100 0000000 011111
Q ss_pred CCCCChhHHHHH--HHHHHHHHHHHhhhhhhHHHHHHHHHhhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHH
Q 019170 219 PLVPTESERSLM--ERVRHELKHELKQGYKEKIVDIREEILRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQ 296 (345)
Q Consensus 219 p~~~~~~e~~~~--~~~~~eLk~~l~~ky~~~i~~lr~e~~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~ 296 (345)
+......+...+ ......++..+.+.+..++.....+..++|+++.||+.++.+|+.|+.+|+.||||++.+|..||.
T Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~lP~~a~~ilr~Wl~~h~~~PYPse~~K~~La~ 277 (342)
T KOG0773|consen 198 EQDDSEDESGPSGSEPPLRLAKQSLRQQRSAYDGSGGKKQSKWRPQRGLPKEAVSILRAWLFEHLLHPYPSDDEKLMLAK 277 (342)
T ss_pred cccccccccCcccccCCcccccccccccccccccccccccCCCCCCCCCCHHHHHHHHHHHHHhccCCCCcchhccccch
Confidence 111111110000 122345556666666655666666677888899999999999999999999999999999999999
Q ss_pred HhCCChhhhhhhhhhhhhhccccCCCC
Q 019170 297 ETGLQLKQINNWFINQRKRNWHANPSS 323 (345)
Q Consensus 297 ~tgLs~kQV~nWF~N~R~R~kk~~~s~ 323 (345)
+|||+..||+|||||+|+|.|++....
T Consensus 278 ~TGLs~~Qv~NWFINaR~R~w~p~~~~ 304 (342)
T KOG0773|consen 278 QTGLSRPQVSNWFINARVRLWKPMIEE 304 (342)
T ss_pred hcCCCcccCCchhhhcccccCCchHHH
Confidence 999999999999999999999998743
No 3
>PF03791 KNOX2: KNOX2 domain ; InterPro: IPR005541 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=99.89 E-value=2.9e-23 Score=152.92 Aligned_cols=50 Identities=46% Similarity=0.698 Sum_probs=47.3
Q ss_pred CCCCCChHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHHHHHHHhhhhcC
Q 019170 126 GRVLDDKELDQFMTHYVLLLYSFKEQLQQHVRVHAMEAVMACWDLEQSLQSLTG 179 (345)
Q Consensus 126 ~~~g~dpELDqFMeaYc~vL~kykEEL~kP~~~~~~EA~~f~~~iE~QL~sL~~ 179 (345)
.++|+||||||||++||.||+||||||++||+ ||++|||+||+||++||+
T Consensus 3 ~~~~~dpELDqFMeaYc~~L~kykeeL~~p~~----EA~~f~~~ie~qL~~Lt~ 52 (52)
T PF03791_consen 3 SSIGADPELDQFMEAYCDMLVKYKEELQRPFQ----EAMEFCREIEQQLSSLTG 52 (52)
T ss_pred CCCCCCccHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhC
Confidence 46889999999999999999999999999984 999999999999999995
No 4
>PF03790 KNOX1: KNOX1 domain ; InterPro: IPR005540 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=99.80 E-value=1.3e-20 Score=134.72 Aligned_cols=43 Identities=35% Similarity=0.691 Sum_probs=40.0
Q ss_pred HHHHHHHhhCCChHHHHHHHHhhhcccCCCcchhhHHHHHhhh
Q 019170 71 VKCKAEIVGHPLYEQLLSAHVSCLRIATPVDQLPKIDAQLSRS 113 (345)
Q Consensus 71 ~~lKa~I~sHPlYp~Ll~A~i~C~KVgaP~e~~~~ld~~~~~~ 113 (345)
+.|||+|++||+||+||+|||+|+|||||||++++||++.++.
T Consensus 1 e~iKA~I~~HP~Y~~Ll~Ayi~C~KVGAP~e~~~~L~e~~~~~ 43 (45)
T PF03790_consen 1 EAIKAKIASHPLYPRLLAAYIDCQKVGAPPEVVARLDEILAES 43 (45)
T ss_pred ChHHHHHHcCCCcHHHHHHHHHHHhcCCCHHHHHHHHHHHHHh
Confidence 4799999999999999999999999999999999999987654
No 5
>PF05920 Homeobox_KN: Homeobox KN domain; InterPro: IPR008422 This entry represents a homeobox transcription factor KN domain conserved from fungi to human and plants [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3K2A_B 2LK2_A 1X2N_A 2DMN_A.
Probab=99.65 E-value=8e-17 Score=113.02 Aligned_cols=40 Identities=55% Similarity=1.074 Sum_probs=36.5
Q ss_pred HHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhh
Q 019170 276 WWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKR 315 (345)
Q Consensus 276 Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R 315 (345)
||.+|..|||||.+||..||..|||+.+||+|||+|+|+|
T Consensus 1 Wl~~h~~nPYPs~~ek~~L~~~tgls~~Qi~~WF~NaRrR 40 (40)
T PF05920_consen 1 WLLEHLHNPYPSKEEKEELAKQTGLSRKQISNWFINARRR 40 (40)
T ss_dssp HHHHTTTSGS--HHHHHHHHHHHTS-HHHHHHHHHHHHHH
T ss_pred CHHHHCCCCCCCHHHHHHHHHHcCCCHHHHHHHHHHhHcc
Confidence 9999999999999999999999999999999999999998
No 6
>cd00086 homeodomain Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Probab=99.50 E-value=4.7e-14 Score=103.18 Aligned_cols=57 Identities=26% Similarity=0.520 Sum_probs=53.8
Q ss_pred cCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 260 RRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 260 rkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
+++..|++.+..+|+.||.. +|||+..++..||..|||+..||.+||.|+|.+.++.
T Consensus 2 ~~r~~~~~~~~~~Le~~f~~---~~~P~~~~~~~la~~~~l~~~qV~~WF~nrR~~~~~~ 58 (59)
T cd00086 2 RKRTRFTPEQLEELEKEFEK---NPYPSREEREELAKELGLTERQVKIWFQNRRAKLKRS 58 (59)
T ss_pred CCCCcCCHHHHHHHHHHHHh---CCCCCHHHHHHHHHHHCcCHHHHHHHHHHHHHHHhcc
Confidence 56779999999999999999 8999999999999999999999999999999998764
No 7
>smart00389 HOX Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes
Probab=99.47 E-value=8.5e-14 Score=101.30 Aligned_cols=55 Identities=25% Similarity=0.497 Sum_probs=51.5
Q ss_pred cCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcc
Q 019170 260 RRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNW 317 (345)
Q Consensus 260 rkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~k 317 (345)
+.++.|++++..+|+.||.. +|||+.+++..||..+||+..||.+||+|+|+|.+
T Consensus 2 k~r~~~~~~~~~~L~~~f~~---~~~P~~~~~~~la~~~~l~~~qV~~WF~nrR~~~~ 56 (56)
T smart00389 2 RKRTSFTPEQLEELEKEFQK---NPYPSREEREELAAKLGLSERQVKVWFQNRRAKWK 56 (56)
T ss_pred CCCCcCCHHHHHHHHHHHHh---CCCCCHHHHHHHHHHHCcCHHHHHHhHHHHhhccC
Confidence 45678999999999999999 89999999999999999999999999999999864
No 8
>PF00046 Homeobox: Homeobox domain not present here.; InterPro: IPR001356 The homeobox domain was first identified in a number of drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates [, , ]. Hox genes encode homeodomain-containing transcriptional regulators that operate differential genetic programs along the anterior-posterior axis of animal bodies []. The domain binds DNA through a helix-turn-helix (HTH) structure. The HTH motif is characterised by two alpha-helices, which make intimate contacts with the DNA and are joined by a short turn. The second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA []. The first helix helps to stabilise the structure. The motif is very similar in sequence and structure in a wide range of DNA-binding proteins (e.g., cro and repressor proteins, homeotic proteins, etc.). One of the principal differences between HTH motifs in these different proteins arises from the stereo-chemical requirement for glycine in the turn which is needed to avoid steric interference of the beta-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2DA3_A 1LFB_A 2LFB_A 2ECB_A 2DA5_A 3D1N_O 3A03_A 2XSD_C 3CMY_A 1AHD_P ....
Probab=99.47 E-value=6.3e-14 Score=102.98 Aligned_cols=57 Identities=30% Similarity=0.704 Sum_probs=54.1
Q ss_pred hcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 259 KRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 259 krkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
||+|..|+.++..+|+.+|.. +|||+.+++..||..+||+..||.+||+|+|.+.|+
T Consensus 1 kr~r~~~t~~q~~~L~~~f~~---~~~p~~~~~~~la~~l~l~~~~V~~WF~nrR~k~kk 57 (57)
T PF00046_consen 1 KRKRTRFTKEQLKVLEEYFQE---NPYPSKEEREELAKELGLTERQVKNWFQNRRRKEKK 57 (57)
T ss_dssp SSSSSSSSHHHHHHHHHHHHH---SSSCHHHHHHHHHHHHTSSHHHHHHHHHHHHHHHHH
T ss_pred CcCCCCCCHHHHHHHHHHHHH---hccccccccccccccccccccccccCHHHhHHHhCc
Confidence 467889999999999999999 999999999999999999999999999999999874
No 9
>PF03792 PBC: PBC domain; InterPro: IPR005542 Pbx proteins are members of the TALE (three-amino-acid loop extension) family of atypical homeodomain proteins, whose members are characterised by a three-residue insertion in the first helix of the homeodomain involved in their interaction with Hox proteins. Examination of Pbx1 has shown that, in addition to the homeodomain, a short 16-residue C-terminal tail is essential for maximal cooperative interactions with Hox partners as well as for maximal monomeric binding of Pbx1 to DNA. The PBX domain is a bipartite acidic domain [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=99.33 E-value=1.5e-11 Score=112.12 Aligned_cols=148 Identities=21% Similarity=0.289 Sum_probs=111.2
Q ss_pred hhhHHHHHHHhhCCChHHHHHHHHhh-----hcccCCCcchhhHHHHHhhhHHHHHhhhhcCCCCCCC----------C-
Q 019170 68 WETVKCKAEIVGHPLYEQLLSAHVSC-----LRIATPVDQLPKIDAQLSRSRDVLAKYSAVANGRVLD----------D- 131 (345)
Q Consensus 68 ~e~~~lKa~I~sHPlYp~Ll~A~i~C-----~KVgaP~e~~~~ld~~~~~~~~~~~k~~~~~~~~~g~----------d- 131 (345)
+|.+++|.+|.+||+||.|.+++|+- +++..+.+..+. |.++.++++|+.+.++.|+...|. |
T Consensus 24 Deaqa~K~~l~~hr~k~ALfsVLcE~KEkt~LSir~~qee~p~-dpQl~RLDNML~AEGV~gPe~~~~~~~~~~~~~~~~ 102 (191)
T PF03792_consen 24 DEAQARKHALNCHRMKPALFSVLCEIKEKTVLSIRNIQEEDPP-DPQLMRLDNMLLAEGVAGPEKGGRAAAAAAGTAADN 102 (191)
T ss_pred hHHHHhchhhcCCCCchhhHHHHHHHHhhcCccccccCCcCCC-chhhhhhhcchhhhcCcCCCCcccchhhhhccCccc
Confidence 46699999999999999999999995 455554332222 445556677777787777653221 1
Q ss_pred -hHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHHHHHHHhhhhcCCCCCCCCCCCCCCccccccCCCCccCCCCCC
Q 019170 132 -KELDQFMTHYVLLLYSFKEQLQQHVRVHAMEAVMACWDLEQSLQSLTGVSPGESTGATMSDDDEDQVDSDTNFFDGSLD 210 (345)
Q Consensus 132 -pELDqFMeaYc~vL~kykEEL~kP~~~~~~EA~~f~~~iE~QL~sL~~~s~~~~~~~~~S~~e~~~~~s~~~~~~g~~~ 210 (345)
-|-+.|-...-.+...|++||++ +...|+++.+.+.+|..
T Consensus 103 ~~d~~dYr~kL~~ir~~y~~el~k--------ye~ac~eF~~hV~~lLr------------------------------- 143 (191)
T PF03792_consen 103 SIDHSDYRAKLSQIRQIYHSELEK--------YEQACNEFTEHVMNLLR------------------------------- 143 (191)
T ss_pred ccchHHHHHHHHHHHHHHHHHHHH--------HHHHhhhhHHHHHHHHH-------------------------------
Confidence 12344555555588899999987 45678899999989885
Q ss_pred CCCCCCCCCCCCChhHHH--HHHHHHHHHHHHHhhhhhhHHHHHHHHHh
Q 019170 211 GPDSMGFGPLVPTESERS--LMERVRHELKHELKQGYKEKIVDIREEIL 257 (345)
Q Consensus 211 ~~d~~~f~p~~~~~~e~~--~~~~~~~eLk~~l~~ky~~~i~~lr~e~~ 257 (345)
+.+.|+||++.++|+. .+.+.+.-+..+||++.|..+..||..|+
T Consensus 144 --eQs~~RPIs~keiE~m~~~i~~Kf~~iq~qLKQstCEaVm~LRsRfl 190 (191)
T PF03792_consen 144 --EQSEFRPISPKEIERMVNIIHRKFSKIQMQLKQSTCEAVMILRSRFL 190 (191)
T ss_pred --HhcccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 2345999999999984 57889999999999999999999998875
No 10
>KOG0775 consensus Transcription factor SIX and related HOX domain proteins [Transcription]
Probab=99.31 E-value=1.9e-12 Score=123.41 Aligned_cols=50 Identities=40% Similarity=0.883 Sum_probs=47.3
Q ss_pred CChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcc
Q 019170 265 LPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNW 317 (345)
Q Consensus 265 lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~k 317 (345)
|....+.+|++||.. +|||++++|++||+.|||+..||.|||.|+|.|.+
T Consensus 183 FKekSR~~LrewY~~---~~YPsp~eKReLA~aTgLt~tQVsNWFKNRRQRDR 232 (304)
T KOG0775|consen 183 FKEKSRSLLREWYLQ---NPYPSPREKRELAEATGLTITQVSNWFKNRRQRDR 232 (304)
T ss_pred hhHhhHHHHHHHHhc---CCCCChHHHHHHHHHhCCchhhhhhhhhhhhhhhh
Confidence 345789999999997 99999999999999999999999999999999998
No 11
>TIGR01565 homeo_ZF_HD homeobox domain, ZF-HD class. This model represents a class of homoebox domain that differs substantially from the typical homoebox domain described in pfam model pfam00046. It is found in both C4 and C3 plants.
Probab=98.95 E-value=1.1e-09 Score=82.90 Aligned_cols=53 Identities=19% Similarity=0.401 Sum_probs=50.3
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCC----CCHHHHHHHHHHhCCChhhhhhhhhhhh
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPY----PTEEDKARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PY----Ps~~eK~~LA~~tgLs~kQV~nWF~N~R 313 (345)
+||.|+.|+.+++..|+..|.. ++| |+..++..||..+||+..+|.+||+|-+
T Consensus 1 ~kR~RT~Ft~~Q~~~Le~~fe~---~~y~~~~~~~~~r~~la~~lgl~~~vvKVWfqN~k 57 (58)
T TIGR01565 1 KKRRRTKFTAEQKEKMRDFAEK---LGWKLKDKRREEVREFCEEIGVTRKVFKVWMHNNK 57 (58)
T ss_pred CCCCCCCCCHHHHHHHHHHHHH---cCCCCCCCCHHHHHHHHHHhCCCHHHeeeecccCC
Confidence 4778999999999999999999 899 9999999999999999999999999964
No 12
>KOG0843 consensus Transcription factor EMX1 and related HOX domain proteins [Transcription]
Probab=98.93 E-value=6.8e-10 Score=100.75 Aligned_cols=61 Identities=20% Similarity=0.277 Sum_probs=57.2
Q ss_pred hhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 257 LRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 257 ~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
..||.|+.|+.++...|+..|.. +.|-.-.||..||+.++|+..||.+||+|+|.|+|+..
T Consensus 101 ~~kr~RT~ft~~Ql~~LE~~F~~---~~Yvvg~eR~~LA~~L~LsetQVkvWFQNRRtk~kr~~ 161 (197)
T KOG0843|consen 101 RPKRIRTAFTPEQLLKLEHAFEG---NQYVVGAERKQLAQSLSLSETQVKVWFQNRRTKHKRMQ 161 (197)
T ss_pred CCCccccccCHHHHHHHHHHHhc---CCeeechHHHHHHHHcCCChhHhhhhhhhhhHHHHHHH
Confidence 35777899999999999999999 89999999999999999999999999999999999854
No 13
>KOG0489 consensus Transcription factor zerknullt and related HOX domain proteins [General function prediction only]
Probab=98.88 E-value=1.1e-09 Score=104.73 Aligned_cols=65 Identities=23% Similarity=0.269 Sum_probs=59.9
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCCCCc
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPSSST 325 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s~st 325 (345)
.||.|+.|+..|+..|+.-|.- |.|.|+..|.+||..+.|++.||.+||+|+|+|+||.+...+.
T Consensus 159 ~kR~RtayT~~QllELEkEFhf---N~YLtR~RRiEiA~~L~LtErQIKIWFQNRRMK~Kk~~k~~~~ 223 (261)
T KOG0489|consen 159 SKRRRTAFTRYQLLELEKEFHF---NKYLTRSRRIEIAHALNLTERQIKIWFQNRRMKWKKENKAKSS 223 (261)
T ss_pred CCCCCcccchhhhhhhhhhhcc---ccccchHHHHHHHhhcchhHHHHHHHHHHHHHHHHHhhccccc
Confidence 6778899999999999999999 8999999999999999999999999999999999987664443
No 14
>KOG0487 consensus Transcription factor Abd-B, contains HOX domain [Transcription]
Probab=98.88 E-value=9e-10 Score=107.44 Aligned_cols=59 Identities=24% Similarity=0.339 Sum_probs=54.8
Q ss_pred hcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 259 KRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 259 krkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
+|||..++|.|+..|+.-|.- |-|.|++-|.+|++.++||..||+.||||+|+|.||-+
T Consensus 236 RKKRcPYTK~QtlELEkEFlf---N~YitkeKR~ElSr~lNLTeRQVKIWFQNRRMK~KK~~ 294 (308)
T KOG0487|consen 236 RKKRCPYTKHQTLELEKEFLF---NMYITKEKRLELSRTLNLTERQVKIWFQNRRMKEKKVN 294 (308)
T ss_pred ccccCCchHHHHHHHHHHHHH---HHHHhHHHHHHHHHhcccchhheeeeehhhhhHHhhhh
Confidence 455679999999999999998 89999999999999999999999999999999999954
No 15
>KOG0485 consensus Transcription factor NKX-5.1/HMX1, contains HOX domain [Transcription]
Probab=98.79 E-value=8e-09 Score=96.33 Aligned_cols=59 Identities=19% Similarity=0.261 Sum_probs=56.0
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
|||.|+.|+..|+..|+.-|.. ..|.+..||..||..+.|++.||..||||+|.+.|+.
T Consensus 104 KKktRTvFSraQV~qLEs~Fe~---krYLSsaeRa~LA~sLqLTETQVKIWFQNRRnKwKRq 162 (268)
T KOG0485|consen 104 KKKTRTVFSRAQVFQLESTFEL---KRYLSSAERAGLAASLQLTETQVKIWFQNRRNKWKRQ 162 (268)
T ss_pred cccchhhhhHHHHHHHHHHHHH---HhhhhHHHHhHHHHhhhhhhhhhhhhhhhhhHHHHHH
Confidence 6777899999999999999999 7999999999999999999999999999999999874
No 16
>KOG0493 consensus Transcription factor Engrailed, contains HOX domain [General function prediction only]
Probab=98.79 E-value=5e-09 Score=99.90 Aligned_cols=62 Identities=32% Similarity=0.414 Sum_probs=57.9
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPS 322 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s 322 (345)
.||.|+-|+.++.+.|+.-|.. |-|.|+.-|+.||.++||.+.||..||+|+|.+.||.+.+
T Consensus 246 eKRPRTAFtaeQL~RLK~EF~e---nRYlTEqRRQ~La~ELgLNEsQIKIWFQNKRAKiKKsTgs 307 (342)
T KOG0493|consen 246 EKRPRTAFTAEQLQRLKAEFQE---NRYLTEQRRQELAQELGLNESQIKIWFQNKRAKIKKSTGS 307 (342)
T ss_pred hcCccccccHHHHHHHHHHHhh---hhhHHHHHHHHHHHHhCcCHHHhhHHhhhhhhhhhhccCC
Confidence 4677889999999999999999 8999999999999999999999999999999999997543
No 17
>KOG0850 consensus Transcription factor DLX and related proteins with LIM Zn-binding and HOX domains [Transcription]
Probab=98.75 E-value=5.9e-09 Score=97.77 Aligned_cols=59 Identities=19% Similarity=0.277 Sum_probs=54.7
Q ss_pred hcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 259 KRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 259 krkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
+|.|+.++.-+.+.|..-|++ ..|.--.||.+||..+|||.+||..||+|+|-|.||..
T Consensus 123 RKPRTIYSS~QLqaL~rRFQk---TQYLALPERAeLAAsLGLTQTQVKIWFQNrRSK~KKl~ 181 (245)
T KOG0850|consen 123 RKPRTIYSSLQLQALNRRFQQ---TQYLALPERAELAASLGLTQTQVKIWFQNRRSKFKKLK 181 (245)
T ss_pred cCCcccccHHHHHHHHHHHhh---cchhcCcHHHHHHHHhCCchhHhhhhhhhhHHHHHHHH
Confidence 445789999999999999999 99999999999999999999999999999999998743
No 18
>KOG0488 consensus Transcription factor BarH and related HOX domain proteins [General function prediction only]
Probab=98.74 E-value=7.2e-09 Score=101.47 Aligned_cols=62 Identities=19% Similarity=0.320 Sum_probs=56.1
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPS 322 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s 322 (345)
++|.|+-|+..++..|+.-|.. -.|.+..||..||...||+-.||.+||+|+|+|+|+....
T Consensus 172 ~RksRTaFT~~Ql~~LEkrF~~---QKYLS~~DR~~LA~~LgLTdaQVKtWfQNRRtKWKrq~a~ 233 (309)
T KOG0488|consen 172 RRKSRTAFSDHQLFELEKRFEK---QKYLSVADRIELAASLGLTDAQVKTWFQNRRTKWKRQTAE 233 (309)
T ss_pred cccchhhhhHHHHHHHHHHHHH---hhcccHHHHHHHHHHcCCchhhHHHHHhhhhHHHHHHHHh
Confidence 3445779999999999999999 8999999999999999999999999999999999986543
No 19
>KOG0842 consensus Transcription factor tinman/NKX2-3, contains HOX domain [Transcription]
Probab=98.70 E-value=7.8e-09 Score=100.91 Aligned_cols=59 Identities=17% Similarity=0.294 Sum_probs=54.4
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
|||+|--|++-|+-.|+.-|.+ .-|.|..||+.||..++||.+||+.||||+|-|.|+.
T Consensus 153 kRKrRVLFSqAQV~ELERRFrq---QRYLSAPERE~LA~~LrLT~TQVKIWFQNrRYK~KR~ 211 (307)
T KOG0842|consen 153 KRKRRVLFSQAQVYELERRFRQ---QRYLSAPEREHLASSLRLTPTQVKIWFQNRRYKTKRQ 211 (307)
T ss_pred ccccccccchhHHHHHHHHHHh---hhccccHhHHHHHHhcCCCchheeeeeecchhhhhhh
Confidence 4455558999999999999999 8999999999999999999999999999999999874
No 20
>KOG3802 consensus Transcription factor OCT-1, contains POU and HOX domains [Transcription]
Probab=98.70 E-value=1.1e-08 Score=102.36 Aligned_cols=67 Identities=18% Similarity=0.420 Sum_probs=61.2
Q ss_pred HHHHHhhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 252 IREEILRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 252 lr~e~~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
+-.+-+|||||+.+...++..|+..|.. ||-||.+|.-.||+.++|....|++||+|||.|.|+.++
T Consensus 288 i~a~~RkRKKRTSie~~vr~aLE~~F~~---npKPt~qEIt~iA~~L~leKEVVRVWFCNRRQkeKR~~~ 354 (398)
T KOG3802|consen 288 IGAQSRKRKKRTSIEVNVRGALEKHFLK---NPKPTSQEITHIAESLQLEKEVVRVWFCNRRQKEKRITP 354 (398)
T ss_pred hhccccccccccceeHHHHHHHHHHHHh---CCCCCHHHHHHHHHHhccccceEEEEeeccccccccCCC
Confidence 3344456777889999999999999999 999999999999999999999999999999999999887
No 21
>KOG0483 consensus Transcription factor HEX, contains HOX and HALZ domains [Transcription]
Probab=98.69 E-value=1.3e-08 Score=94.05 Aligned_cols=59 Identities=29% Similarity=0.389 Sum_probs=53.9
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.++|+.+|+.++...|+.-|.. +-|-.++.|..||++.||++.||.+||+|+|.|.|.+
T Consensus 50 ~~~kk~Rlt~eQ~~~LE~~F~~---~~~L~p~~K~~LAk~LgL~pRQVavWFQNRRARwK~k 108 (198)
T KOG0483|consen 50 GKGKKRRLTSEQVKFLEKSFES---EKKLEPERKKKLAKELGLQPRQVAVWFQNRRARWKTK 108 (198)
T ss_pred cccccccccHHHHHHhHHhhcc---ccccChHHHHHHHHhhCCChhHHHHHHhhccccccch
Confidence 3456678999999999999999 6889999999999999999999999999999999864
No 22
>COG5576 Homeodomain-containing transcription factor [Transcription]
Probab=98.61 E-value=3.2e-08 Score=88.39 Aligned_cols=62 Identities=18% Similarity=0.333 Sum_probs=57.3
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPS 322 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s 322 (345)
.+++|.+.+..++.+|+.-|.. +|||+..+|..|+..++|+++-|..||+|+|.+.|+....
T Consensus 51 ~~~~r~R~t~~Q~~vL~~~F~i---~p~Ps~~~r~~L~~~lnm~~ksVqIWFQNkR~~~k~~~~~ 112 (156)
T COG5576 51 PKSKRRRTTDEQLMVLEREFEI---NPYPSSITRIKLSLLLNMPPKSVQIWFQNKRAKEKKKRSG 112 (156)
T ss_pred CcccceechHHHHHHHHHHhcc---CCCCCHHHHHHHHHhcCCChhhhhhhhchHHHHHHHhccc
Confidence 5667778999999999999999 9999999999999999999999999999999999986543
No 23
>KOG0491 consensus Transcription factor BSH, contains HOX domain [General function prediction only]
Probab=98.59 E-value=2.8e-08 Score=89.39 Aligned_cols=59 Identities=19% Similarity=0.317 Sum_probs=54.6
Q ss_pred hhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 257 LRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 257 ~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.++|.|+.|+..+...|+.-|+. -.|.+-.|+.+||..++|+++||..||+|+|+++||
T Consensus 99 ~r~K~Rtvfs~~ql~~l~~rFe~---QrYLS~~e~~ELan~L~LS~~QVKTWFQNrRMK~Kk 157 (194)
T KOG0491|consen 99 RRRKARTVFSDPQLSGLEKRFER---QRYLSTPERQELANALSLSETQVKTWFQNRRMKHKK 157 (194)
T ss_pred HhhhhcccccCccccccHHHHhh---hhhcccHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH
Confidence 35556789999999999999998 789999999999999999999999999999999997
No 24
>KOG0484 consensus Transcription factor PHOX2/ARIX, contains HOX domain [Transcription]
Probab=98.56 E-value=2.1e-08 Score=84.14 Aligned_cols=63 Identities=16% Similarity=0.265 Sum_probs=56.9
Q ss_pred HHHhhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 254 EEILRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 254 ~e~~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
+..+.+|-|+.|+..+...|+..|.+ ..||+.-.|++||-...|+...|.+||+|+|.+.+|.
T Consensus 13 ekrKQRRIRTTFTS~QLkELErvF~E---THYPDIYTREEiA~kidLTEARVQVWFQNRRAKfRKQ 75 (125)
T KOG0484|consen 13 EKRKQRRIRTTFTSAQLKELERVFAE---THYPDIYTREEIALKIDLTEARVQVWFQNRRAKFRKQ 75 (125)
T ss_pred HHHHhhhhhhhhhHHHHHHHHHHHHh---hcCCcchhHHHHHHhhhhhHHHHHHHHHhhHHHHHHH
Confidence 44445556789999999999999999 8999999999999999999999999999999999874
No 25
>KOG0492 consensus Transcription factor MSH, contains HOX domain [General function prediction only]
Probab=98.53 E-value=4.6e-08 Score=90.81 Aligned_cols=67 Identities=15% Similarity=0.239 Sum_probs=60.9
Q ss_pred HHHHHHHhhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 250 VDIREEILRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 250 ~~lr~e~~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
..||+-...++.|+.|+..+...|+.-|.+ ..|.+.+|+.+++..+.|+..||..||+|+|.|.|+-
T Consensus 136 C~LrKhk~nRkPRtPFTtqQLlaLErkfre---kqYLSiaEraefSsSL~LTeTqVKIWFQNRRAKaKRl 202 (246)
T KOG0492|consen 136 CTLRKHKPNRKPRTPFTTQQLLALERKFRE---KQYLSIAERAEFSSSLELTETQVKIWFQNRRAKAKRL 202 (246)
T ss_pred chhcccCCCCCCCCCCCHHHHHHHHHHHhH---hhhhhHHHHHhhhhhhhhhhhheehhhhhhhHHHHHH
Confidence 446666667788999999999999999999 8999999999999999999999999999999998863
No 26
>KOG2251 consensus Homeobox transcription factor [Transcription]
Probab=98.45 E-value=1.5e-07 Score=87.92 Aligned_cols=59 Identities=19% Similarity=0.345 Sum_probs=55.0
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.+|.|+.|+..+..+|+.-|.+ .-||+...+++||.+++|.+.+|.+||.|+|.+.++.
T Consensus 37 qRRERTtFtr~QlevLe~LF~k---TqYPDv~~rEelAlklnLpeSrVqVWFKNRRAK~r~q 95 (228)
T KOG2251|consen 37 QRRERTTFTRKQLEVLEALFAK---TQYPDVFMREELALKLNLPESRVQVWFKNRRAKCRRQ 95 (228)
T ss_pred cccccceecHHHHHHHHHHHHh---hcCccHHHHHHHHHHhCCchhhhhhhhccccchhhHh
Confidence 4556889999999999999999 9999999999999999999999999999999988764
No 27
>KOG0494 consensus Transcription factor CHX10 and related HOX domain proteins [General function prediction only]
Probab=98.42 E-value=1.9e-07 Score=89.28 Aligned_cols=55 Identities=20% Similarity=0.261 Sum_probs=52.4
Q ss_pred CCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 262 AGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 262 r~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
|+.|+..+...|+.-|.+ --||+.-.|+.||..|+|.+..|.+||+|+|.+.+|.
T Consensus 145 RTiFT~~Qle~LEkaFke---aHYPDv~Are~la~ktelpEDRIqVWfQNRRAKWRk~ 199 (332)
T KOG0494|consen 145 RTIFTSYQLEELEKAFKE---AHYPDVYAREMLADKTELPEDRIQVWFQNRRAKWRKT 199 (332)
T ss_pred cchhhHHHHHHHHHHHhh---ccCccHHHHHHHhhhccCchhhhhHHhhhhhHHhhhh
Confidence 679999999999999999 8999999999999999999999999999999998874
No 28
>KOG4577 consensus Transcription factor LIM3, contains LIM and HOX domains [Transcription]
Probab=98.31 E-value=2.7e-07 Score=89.27 Aligned_cols=67 Identities=22% Similarity=0.414 Sum_probs=60.7
Q ss_pred HHHHHhhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 252 IREEILRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 252 lr~e~~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
|-.+-..||.|+.++..+...|++-|.. .|-|-+--|++|+.+|||+...|++||+|+|.+.|+--.
T Consensus 161 l~gd~~nKRPRTTItAKqLETLK~AYn~---SpKPARHVREQLsseTGLDMRVVQVWFQNRRAKEKRLKK 227 (383)
T KOG4577|consen 161 LEGDASNKRPRTTITAKQLETLKQAYNT---SPKPARHVREQLSSETGLDMRVVQVWFQNRRAKEKRLKK 227 (383)
T ss_pred cccccccCCCcceeeHHHHHHHHHHhcC---CCchhHHHHHHhhhccCcceeehhhhhhhhhHHHHhhhh
Confidence 4456678999999999999999999988 899999999999999999999999999999999886443
No 29
>KOG0486 consensus Transcription factor PTX1, contains HOX domain [Transcription]
Probab=98.27 E-value=4e-07 Score=88.87 Aligned_cols=59 Identities=19% Similarity=0.318 Sum_probs=54.5
Q ss_pred hcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 259 KRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 259 krkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
+|.|+.|+..+.+.|+.||.+ |-||+.+.|+++|--|+|++..|++||.|+|.+.+|.-
T Consensus 113 rrQrthFtSqqlqele~tF~r---NrypdMstrEEIavwtNlTE~rvrvwfknrrakwrkrE 171 (351)
T KOG0486|consen 113 RRQRTHFTSQQLQELEATFQR---NRYPDMSTREEIAVWTNLTEARVRVWFKNRRAKWRKRE 171 (351)
T ss_pred hhhhhhhHHHHHHHHHHHHhh---ccCCccchhhHHHhhccccchhhhhhcccchhhhhhhh
Confidence 445678999999999999999 89999999999999999999999999999999988753
No 30
>KOG0848 consensus Transcription factor Caudal, contains HOX domain [Transcription]
Probab=98.26 E-value=2.6e-07 Score=88.60 Aligned_cols=56 Identities=23% Similarity=0.306 Sum_probs=51.8
Q ss_pred CCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 262 AGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 262 r~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
|-.++..++-.|+.-|.- .+|.|..-|.+||..+||++.||..||+|+|.|.+|-+
T Consensus 203 RvVYTDhQRLELEKEfh~---SryITirRKSELA~~LgLsERQVKIWFQNRRAKERK~n 258 (317)
T KOG0848|consen 203 RVVYTDHQRLELEKEFHT---SRYITIRRKSELAATLGLSERQVKIWFQNRRAKERKDN 258 (317)
T ss_pred eEEecchhhhhhhhhhcc---ccceeeehhHHHHHhhCccHhhhhHhhhhhhHHHHHHH
Confidence 346899999999999988 89999999999999999999999999999999998754
No 31
>KOG0847 consensus Transcription factor, contains HOX domain [Transcription]
Probab=98.00 E-value=4.3e-06 Score=78.51 Aligned_cols=60 Identities=25% Similarity=0.385 Sum_probs=54.6
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
+|..|.+|+..++..|+.-|.+ ..||--.++.+||...|++..||.+||+|+|.+.+|..
T Consensus 167 rk~srPTf~g~qi~~le~~feq---tkylaG~~ra~lA~~lgmteSqvkVWFQNRRTKWRKkh 226 (288)
T KOG0847|consen 167 RKQSRPTFTGHQIYQLERKFEQ---TKYLAGADRAQLAQELNMTESQVKVWFQNRRTKWRKKH 226 (288)
T ss_pred ccccCCCccchhhhhhhhhhhh---hhcccchhHHHhhccccccHHHHHHHHhcchhhhhhhh
Confidence 4445667999999999999999 89999999999999999999999999999999988864
No 32
>KOG2252 consensus CCAAT displacement protein and related homeoproteins [Transcription]
Probab=97.99 E-value=7.1e-06 Score=85.15 Aligned_cols=56 Identities=23% Similarity=0.342 Sum_probs=52.7
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhc
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRN 316 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~ 316 (345)
.||.|-.|+..+++.|...|.+ ++||+.+.-+.|+.++||....|.|||-|+|+|.
T Consensus 420 ~KKPRlVfTd~QkrTL~aiFke---~~RPS~Emq~tIS~qL~L~~sTV~NfFmNaRRRs 475 (558)
T KOG2252|consen 420 TKKPRLVFTDIQKRTLQAIFKE---NKRPSREMQETISQQLNLELSTVINFFMNARRRS 475 (558)
T ss_pred CCCceeeecHHHHHHHHHHHhc---CCCCCHHHHHHHHHHhCCcHHHHHHHHHhhhhhc
Confidence 3566778999999999999999 9999999999999999999999999999999994
No 33
>KOG0844 consensus Transcription factor EVX1, contains HOX domain [Transcription]
Probab=97.85 E-value=5.9e-06 Score=80.68 Aligned_cols=56 Identities=16% Similarity=0.276 Sum_probs=51.1
Q ss_pred CCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 262 AGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 262 r~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
|+-|+.+|+-.|+.-|++ --|-++-.|.+||..++|.+..|++||||+|+|.|+..
T Consensus 185 RTAFTReQIaRLEKEFyr---ENYVSRprRcELAAaLNLPEtTIKVWFQNRRMKDKRQR 240 (408)
T KOG0844|consen 185 RTAFTREQIARLEKEFYR---ENYVSRPRRCELAAALNLPETTIKVWFQNRRMKDKRQR 240 (408)
T ss_pred HhhhhHHHHHHHHHHHHH---hccccCchhhhHHHhhCCCcceeehhhhhchhhhhhhh
Confidence 467999999999888888 68999999999999999999999999999999998743
No 34
>KOG0490 consensus Transcription factor, contains HOX domain [General function prediction only]
Probab=97.83 E-value=7.5e-06 Score=74.69 Aligned_cols=62 Identities=13% Similarity=0.109 Sum_probs=56.8
Q ss_pred HhhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 256 ILRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 256 ~~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
+.++|.|..|+..+...|+.-|.. .+||+...++.||..+++++..|.+||+|+|.++++..
T Consensus 58 ~~~rr~rt~~~~~ql~~ler~f~~---~h~Pd~~~r~~la~~~~~~e~rVqvwFqnrrak~r~~~ 119 (235)
T KOG0490|consen 58 FSKRCARCKFTISQLDELERAFEK---VHLPCFACRECLALLLTGDEFRVQVWFQNRRAKDRKEE 119 (235)
T ss_pred ccccccCCCCCcCHHHHHHHhhcC---CCcCccchHHHHhhcCCCCeeeeehhhhhhcHhhhhhh
Confidence 446777889999999999999999 69999999999999999999999999999999988654
No 35
>KOG0849 consensus Transcription factor PRD and related proteins, contain PAX and HOX domains [Transcription]
Probab=97.80 E-value=1.4e-05 Score=79.73 Aligned_cols=61 Identities=23% Similarity=0.419 Sum_probs=55.4
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.+|.|+.|+..+...|..+|.. +|||....++.||.++||+...|..||.|+|.|.++-..
T Consensus 176 ~rr~rtsft~~Q~~~le~~f~r---t~yP~i~~Re~La~~i~l~e~riqvwf~nrra~~rr~~~ 236 (354)
T KOG0849|consen 176 GRRNRTSFSPSQLEALEECFQR---TPYPDIVGRETLAKETGLPEPRVQVWFQNRRAKWRRQHR 236 (354)
T ss_pred ccccccccccchHHHHHHHhcC---CCCCchhhHHHHhhhccCCchHHHHHHhhhhhhhhhccc
Confidence 3444678999999999999999 899999999999999999999999999999998887653
No 36
>KOG1168 consensus Transcription factor ACJ6/BRN-3, contains POU and HOX domains [Transcription]
Probab=97.68 E-value=1.2e-05 Score=78.02 Aligned_cols=64 Identities=23% Similarity=0.429 Sum_probs=56.9
Q ss_pred hhhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCCC
Q 019170 257 LRKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPSS 323 (345)
Q Consensus 257 ~kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s~ 323 (345)
-|||||+.+-..-++-|+.+|.. -|-|+.+-...+|+++.|....|++||+|+|.+.|+.--+.
T Consensus 308 ekKRKRTSIAAPEKRsLEayFav---QPRPS~EkIAaIAekLDLKKNVVRVWFCNQRQKQKRm~~Sa 371 (385)
T KOG1168|consen 308 EKKRKRTSIAAPEKRSLEAYFAV---QPRPSGEKIAAIAEKLDLKKNVVRVWFCNQRQKQKRMKRSA 371 (385)
T ss_pred ccccccccccCcccccHHHHhcc---CCCCchhHHHHHHHhhhhhhceEEEEeeccHHHHHHhhhhh
Confidence 37888888888888999999999 89999999999999999999999999999999998844333
No 37
>KOG0773 consensus Transcription factor MEIS1 and related HOX domain proteins [Transcription]
Probab=97.41 E-value=7.5e-05 Score=73.36 Aligned_cols=64 Identities=33% Similarity=0.533 Sum_probs=58.8
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPS 322 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s 322 (345)
..+++++++.+. ..|+.|...|..+|||+..++..|+..++++..||++||+|.|+|.++.+..
T Consensus 95 ~~~~~~n~~~~s-~~~~~~~~~~~~~~~~~k~~~~ll~~~~~~~~~~~~~~~~~a~r~~~~~~~~ 158 (342)
T KOG0773|consen 95 KGARRGNATRES-ATLKAWLEEHRLNPYPSKLEKILLAVITKLTLTQVSTWFANARRRLKKELKM 158 (342)
T ss_pred cccccccccccc-cccccchhhhhhccCchHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhccCC
Confidence 445677899999 9999999999999999999999999999999999999999999999987654
No 38
>PF03789 ELK: ELK domain ; InterPro: IPR005539 This domain is required for the nuclear localisation of these proteins []. All of these proteins are members of the Tale/Knox homeodomain family, a subfamily, containing homeobox IPR001356 from INTERPRO.; GO: 0003677 DNA binding, 0005634 nucleus
Probab=97.08 E-value=0.00038 Score=43.22 Aligned_cols=22 Identities=41% Similarity=0.622 Sum_probs=20.6
Q ss_pred HHHHHHhhhhhhHHHHHHHHHh
Q 019170 236 ELKHELKQGYKEKIVDIREEIL 257 (345)
Q Consensus 236 eLk~~l~~ky~~~i~~lr~e~~ 257 (345)
|||.+|+++|+++|++||+||+
T Consensus 1 ELK~~LlrkY~g~i~~Lr~Ef~ 22 (22)
T PF03789_consen 1 ELKHQLLRKYSGYISSLRQEFS 22 (22)
T ss_pred CHHHHHHHHHhHhHHHHHHHhC
Confidence 6899999999999999999984
No 39
>PF11569 Homez: Homeodomain leucine-zipper encoding, Homez; PDB: 2YS9_A.
Probab=96.99 E-value=0.00068 Score=51.20 Aligned_cols=43 Identities=21% Similarity=0.477 Sum_probs=31.5
Q ss_pred HHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhh
Q 019170 270 TSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKR 315 (345)
Q Consensus 270 ~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R 315 (345)
.+.|+++|.. +.++.+.+-..|+.++||+..||.+||.-++.+
T Consensus 10 ~~pL~~Yy~~---h~~L~E~DL~~L~~kS~ms~qqVr~WFa~~~~e 52 (56)
T PF11569_consen 10 IQPLEDYYLK---HKQLQEEDLDELCDKSRMSYQQVRDWFAERMQE 52 (56)
T ss_dssp -HHHHHHHHH---T----TTHHHHHHHHTT--HHHHHHHHHHHS--
T ss_pred hHHHHHHHHH---cCCccHhhHHHHHHHHCCCHHHHHHHHHHhccc
Confidence 4569999999 699999999999999999999999999877543
No 40
>KOG0490 consensus Transcription factor, contains HOX domain [General function prediction only]
Probab=95.48 E-value=0.0099 Score=54.19 Aligned_cols=61 Identities=25% Similarity=0.468 Sum_probs=53.5
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.++.+..+...+...|..-|.. .+||....+..|+..+|++...|.+||+|.|.+.++...
T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~---~~~P~~~~~~~l~~~~~~~~~~~q~~~~~~~~~~~~~~~ 213 (235)
T KOG0490|consen 153 PRRPRTTFTENQLEVLETVFRA---TPKPDADDREQLAEETGLSERVIQVWFQNRRAKLRKHKR 213 (235)
T ss_pred cCCCccccccchhHhhhhcccC---CCCCchhhHHHHHHhcCCChhhhhhhcccHHHHHHhhcc
Confidence 3445667888888888888888 899999999999999999999999999999999987644
No 41
>KOG1146 consensus Homeobox protein [General function prediction only]
Probab=91.63 E-value=0.14 Score=58.61 Aligned_cols=61 Identities=23% Similarity=0.305 Sum_probs=55.9
Q ss_pred hhcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 258 RKRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 258 kkrkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
+++.|..++..+..+|+..|.. .-||+.++-+.|.+..+|....|..||+|.|.+.++.-.
T Consensus 903 r~a~~~~~~d~qlk~i~~~~~~---q~~~~~~~~E~l~~~~~~~~~~i~vw~qna~~~s~k~~~ 963 (1406)
T KOG1146|consen 903 RRAYRTQESDLQLKIIKACYEA---QRTPTMQECEVLEEPIGLPKRVIQVWFQNARAKSKKAKL 963 (1406)
T ss_pred hhhhccchhHHHHHHHHHHHhh---ccCChHHHHHhhcccccCCcchhHHhhhhhhhhhhhhhh
Confidence 5566778999999999999999 899999999999999999999999999999999998754
No 42
>PF04218 CENP-B_N: CENP-B N-terminal DNA-binding domain; InterPro: IPR006695 Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B dimers either bind two separate DNA molecules or alternatively, they may bind two CENP-B boxes on one DNA molecule, with the intervening stretch of DNA forming a loop structure. The CENP-B DNA-binding domain consists of two repeating domains, RP1 and RP2. This family corresponds to RP1 has been shown to consist of four helices in a helix-turn-helix structure [].; GO: 0003677 DNA binding, 0000775 chromosome, centromeric region; PDB: 1BW6_A 1HLV_A 2ELH_A.
Probab=88.37 E-value=0.84 Score=33.61 Aligned_cols=47 Identities=15% Similarity=0.145 Sum_probs=32.8
Q ss_pred hcCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhh
Q 019170 259 KRRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 259 krkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R 313 (345)
||+|..|+-+..-.+-.-+.. .+ -...||+.+|++..+|.+|..|+.
T Consensus 1 krkR~~LTl~eK~~iI~~~e~---g~-----s~~~ia~~fgv~~sTv~~I~K~k~ 47 (53)
T PF04218_consen 1 KRKRKSLTLEEKLEIIKRLEE---GE-----SKRDIAREFGVSRSTVSTILKNKD 47 (53)
T ss_dssp SSSSSS--HHHHHHHHHHHHC---TT------HHHHHHHHT--CCHHHHHHHCHH
T ss_pred CCCCccCCHHHHHHHHHHHHc---CC-----CHHHHHHHhCCCHHHHHHHHHhHH
Confidence 466778888886666666665 33 478999999999999999999964
No 43
>KOG3623 consensus Homeobox transcription factor SIP1 [Transcription]
Probab=85.96 E-value=1.7 Score=47.75 Aligned_cols=52 Identities=31% Similarity=0.562 Sum_probs=46.3
Q ss_pred HHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCCCC
Q 019170 270 TSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPSSS 324 (345)
Q Consensus 270 ~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s~s 324 (345)
..+|+.+|.. |+.|+.++-..+|...||...-|..||.+.+........+++
T Consensus 568 ~sllkayyal---n~~ps~eelskia~qvglp~~vvk~wfE~~~a~e~sv~rsps 619 (1007)
T KOG3623|consen 568 TSLLKAYYAL---NGLPSEEELSKIAQQVGLPFAVVKAWFEDEEAEEMSVERSPS 619 (1007)
T ss_pred HHHHHHHHHh---cCCCCHHHHHHHHHHhcccHHHHHHHHHhhhhhhhhhccCcc
Confidence 6788999998 999999999999999999999999999999998877654443
No 44
>PF01527 HTH_Tnp_1: Transposase; InterPro: IPR002514 Transposase proteins are necessary for efficient DNA transposition. This family consists of various Escherichia coli insertion elements and other bacterial transposases some of which are members of the IS3 family. This region includes a helix-turn-helix motif (HTH) at the N terminus followed by a leucine zipper (LZ) motif. The LZ motif has been shown to mediate oligomerisation of the transposase components in IS911 []. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated; PDB: 2JN6_A 2RN7_A.
Probab=70.91 E-value=5.9 Score=29.92 Aligned_cols=47 Identities=17% Similarity=0.283 Sum_probs=30.3
Q ss_pred cCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhh
Q 019170 260 RRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 260 rkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R 313 (345)
++++.||.+.+..+-.-+.. .......+|+..|+++.+|.+|-.-.+
T Consensus 2 ~~r~~ys~e~K~~~v~~~~~-------~g~sv~~va~~~gi~~~~l~~W~~~~~ 48 (76)
T PF01527_consen 2 RKRRRYSPEFKLQAVREYLE-------SGESVSEVAREYGISPSTLYNWRKQYR 48 (76)
T ss_dssp -SS----HHHHHHHHHHHHH-------HHCHHHHHHHHHTS-HHHHHHHHHHHH
T ss_pred CCCCCCCHHHHHHHHHHHHH-------CCCceEeeecccccccccccHHHHHHh
Confidence 34567888886655444322 136678999999999999999987776
No 45
>cd00569 HTH_Hin_like Helix-turn-helix domain of Hin and related proteins, a family of DNA-binding domains unique to bacteria and represented by the Hin protein of Salmonella. The basic HTH domain is a simple fold comprised of three core helices that form a right-handed helical bundle. The principal DNA-protein interface is formed by the third helix, the recognition helix, inserting itself into the major groove of the DNA. A diverse array of HTH domains participate in a variety of functions that depend on their DNA-binding properties. HTH_Hin represents one of the simplest versions of the HTH domains; the characterization of homologous relationships between various sequence-diverse HTH domain families remains difficult. The Hin recombinase induces the site-specific inversion of a chromosomal DNA segment containing a promoter, which controls the alternate expression of two genes by reversibly switching orientation. The Hin recombinase consists of a single polypeptide chain containing a D
Probab=70.51 E-value=12 Score=22.35 Aligned_cols=39 Identities=10% Similarity=0.157 Sum_probs=27.7
Q ss_pred CCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhh
Q 019170 263 GKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWF 309 (345)
Q Consensus 263 ~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF 309 (345)
..++.+.+..+..++.. .+ ....+|+.+|++...|.+|.
T Consensus 4 ~~~~~~~~~~i~~~~~~----~~----s~~~ia~~~~is~~tv~~~~ 42 (42)
T cd00569 4 PKLTPEQIEEARRLLAA----GE----SVAEIARRLGVSRSTLYRYL 42 (42)
T ss_pred CcCCHHHHHHHHHHHHc----CC----CHHHHHHHHCCCHHHHHHhC
Confidence 45666666666666543 33 34578899999999999985
No 46
>cd06171 Sigma70_r4 Sigma70, region (SR) 4 refers to the most C-terminal of four conserved domains found in Escherichia coli (Ec) sigma70, the main housekeeping sigma, and related sigma-factors (SFs). A SF is a dissociable subunit of RNA polymerase, it directs bacterial or plastid core RNA polymerase to specific promoter elements located upstream of transcription initiation points. The SR4 of Ec sigma70 and other essential primary SFs contact promoter sequences located 35 base-pairs upstream of the initiation point, recognizing a 6-base-pair -35 consensus TTGACA. Sigma70 related SFs also include SFs which are dispensable for bacterial cell growth for example Ec sigmaS, SFs which activate regulons in response to a specific signal for example heat-shock Ec sigmaH, and a group of SFs which includes the extracytoplasmic function (ECF) SFs and is typified by Ec sigmaE which contains SR2 and -4 only. ECF SFs direct the transcription of genes that regulate various responses including periplas
Probab=69.03 E-value=7.5 Score=26.10 Aligned_cols=46 Identities=11% Similarity=0.125 Sum_probs=34.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNW 317 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~k 317 (345)
.++...+.++...+... ..-..+|..+|++..+|..|....+.+.+
T Consensus 10 ~l~~~~~~~~~~~~~~~--------~~~~~ia~~~~~s~~~i~~~~~~~~~~l~ 55 (55)
T cd06171 10 KLPEREREVILLRFGEG--------LSYEEIAEILGISRSTVRQRLHRALKKLR 55 (55)
T ss_pred hCCHHHHHHHHHHHhcC--------CCHHHHHHHHCcCHHHHHHHHHHHHHHcC
Confidence 46777888887776442 12457899999999999999988877653
No 47
>PF08281 Sigma70_r4_2: Sigma-70, region 4; InterPro: IPR013249 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. Region 4 of sigma-70 like sigma-factors are involved in binding to the -35 promoter element via a helix-turn-helix motif [].; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 2LFW_A 1OR7_B 2H27_D 2O8X_B.
Probab=63.94 E-value=5.9 Score=28.23 Aligned_cols=44 Identities=14% Similarity=0.277 Sum_probs=32.3
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhh
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKR 315 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R 315 (345)
.||+..+.++.-.|... -.-.++|+.+|++...|.+|....|++
T Consensus 10 ~L~~~~r~i~~l~~~~g--------~s~~eIa~~l~~s~~~v~~~l~ra~~~ 53 (54)
T PF08281_consen 10 QLPERQREIFLLRYFQG--------MSYAEIAEILGISESTVKRRLRRARKK 53 (54)
T ss_dssp CS-HHHHHHHHHHHTS-----------HHHHHHHCTS-HHHHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHHHHC--------cCHHHHHHHHCcCHHHHHHHHHHHHhh
Confidence 57888888887655552 234589999999999999999998876
No 48
>PF04545 Sigma70_r4: Sigma-70, region 4; InterPro: IPR007630 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. Region 4 of sigma-70 like sigma-factors is involved in binding to the -35 promoter element via a helix-turn-helix motif []. Due to the way Pfam works, the threshold has been set artificially high to prevent overlaps with other helix-turn-helix families. Therefore there are many false negatives.; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 2P7V_B 3IYD_F 1TLH_B 1KU7_A 1RIO_H 3N97_A 1KU3_A 1RP3_C 1SC5_A 1NR3_A ....
Probab=59.13 E-value=8.1 Score=27.29 Aligned_cols=46 Identities=9% Similarity=0.225 Sum_probs=35.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNW 317 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~k 317 (345)
.||+..+.+|...|.. ++ .-.++|+..|++...|..+...+..+.|
T Consensus 4 ~L~~~er~vi~~~y~~----~~----t~~eIa~~lg~s~~~V~~~~~~al~kLR 49 (50)
T PF04545_consen 4 QLPPREREVIRLRYFE----GL----TLEEIAERLGISRSTVRRILKRALKKLR 49 (50)
T ss_dssp TS-HHHHHHHHHHHTS----T-----SHHHHHHHHTSCHHHHHHHHHHHHHHHH
T ss_pred hCCHHHHHHHHHHhcC----CC----CHHHHHHHHCCcHHHHHHHHHHHHHHhc
Confidence 5888899999887744 22 2457999999999999999998888765
No 49
>PRK06759 RNA polymerase factor sigma-70; Validated
Probab=56.74 E-value=13 Score=31.44 Aligned_cols=47 Identities=9% Similarity=0.071 Sum_probs=37.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++..-|... .+ -.++|+.+|++...|.+|....|++.++
T Consensus 106 ~L~~~~r~ii~l~~~~~--~s------~~EIA~~l~is~~tV~~~~~ra~~~Lr~ 152 (154)
T PRK06759 106 VLDEKEKYIIFERFFVG--KT------MGEIALETEMTYYQVRWIYRQALEKMRN 152 (154)
T ss_pred hCCHHHHHHHHHHHhcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHhh
Confidence 58888999886555442 22 4589999999999999999999998865
No 50
>PRK09646 RNA polymerase sigma factor SigK; Reviewed
Probab=51.54 E-value=15 Score=32.83 Aligned_cols=50 Identities=6% Similarity=0.016 Sum_probs=40.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||+..+.+|.--+... .+ -.++|+.+|++...|.+++...|++.++...
T Consensus 142 ~L~~~~r~vl~l~~~~~--~s------~~EIA~~Lgis~~tVk~~l~ra~~~Lr~~l~ 191 (194)
T PRK09646 142 ALTDTQRESVTLAYYGG--LT------YREVAERLAVPLGTVKTRMRDGLIRLRDCLG 191 (194)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHhCCChHhHHHHHHHHHHHHHHHhc
Confidence 48889999987665553 22 3579999999999999999999999887653
No 51
>PRK09642 RNA polymerase sigma factor SigW; Reviewed
Probab=49.04 E-value=19 Score=30.74 Aligned_cols=48 Identities=17% Similarity=0.280 Sum_probs=38.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.-.+... .+| .++|+.+|++...|.+.+.-+|++.++.
T Consensus 106 ~Lp~~~r~v~~l~~~~g--~s~------~EIA~~lgis~~tV~~~l~Rar~~Lr~~ 153 (160)
T PRK09642 106 ELPENYRDVVLAHYLEE--KSY------QEIALQEKIEVKTVEMKLYRARKWIKKH 153 (160)
T ss_pred hCCHHHHHHHHHHHHhC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 48999999987665552 232 4799999999999999999999988764
No 52
>PF13443 HTH_26: Cro/C1-type HTH DNA-binding domain; PDB: 3TYR_A 3TYS_A 3B7H_A.
Probab=48.74 E-value=14 Score=26.88 Aligned_cols=24 Identities=13% Similarity=0.279 Sum_probs=18.1
Q ss_pred HHHHHHHHhCCChhhhhhhhhhhh
Q 019170 290 DKARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 290 eK~~LA~~tgLs~kQV~nWF~N~R 313 (345)
....||+.+|++..+|+.|+.+..
T Consensus 12 t~~~La~~~gis~~tl~~~~~~~~ 35 (63)
T PF13443_consen 12 TQKDLARKTGISRSTLSRILNGKP 35 (63)
T ss_dssp -HHHHHHHHT--HHHHHHHHTTT-
T ss_pred CHHHHHHHHCcCHHHHHHHHhccc
Confidence 356899999999999999999773
No 53
>KOG2070 consensus Guanine nucleotide exchange factor [Nucleotide transport and metabolism]
Probab=47.94 E-value=35 Score=36.45 Aligned_cols=26 Identities=12% Similarity=0.245 Sum_probs=16.0
Q ss_pred CCChHHHHHHHHHHH-------HHHHHHHHHhH
Q 019170 129 LDDKELDQFMTHYVL-------LLYSFKEQLQQ 154 (345)
Q Consensus 129 g~dpELDqFMeaYc~-------vL~kykEEL~k 154 (345)
..-|++.-..-+||. ||.+|++||++
T Consensus 182 n~ap~mkt~~~aYcanHP~AV~VL~k~~dELek 214 (661)
T KOG2070|consen 182 NLAPQMKTLYLAYCANHPSAVNVLTKHSDELEK 214 (661)
T ss_pred hhhHHHHHHHHHHHhcCchhhhHHHHhHHHHHH
Confidence 445666666666663 66666666655
No 54
>PRK09644 RNA polymerase sigma factor SigM; Provisional
Probab=46.38 E-value=21 Score=30.76 Aligned_cols=49 Identities=14% Similarity=0.061 Sum_probs=40.2
Q ss_pred CCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 263 GKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 263 ~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
..||...++++.-++..+ .+ -.++|..+|++...|.+|..-.|++.++-
T Consensus 107 ~~L~~~~r~v~~l~~~~g--~s------~~eIA~~lgis~~tv~~~l~Rar~~Lr~~ 155 (165)
T PRK09644 107 HTLPVIEAQAILLCDVHE--LT------YEEAASVLDLKLNTYKSHLFRGRKRLKAL 155 (165)
T ss_pred HhCCHHHHHHHHhHHHhc--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 358888999998776663 33 35899999999999999999999998874
No 55
>PRK03975 tfx putative transcriptional regulator; Provisional
Probab=45.26 E-value=23 Score=31.50 Aligned_cols=48 Identities=8% Similarity=0.022 Sum_probs=38.3
Q ss_pred CCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 262 AGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 262 r~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
...||+.++++|.- +.+ .+ .-.++|+.+|++...|.+|...++++.++
T Consensus 4 ~~~Lt~rqreVL~l-r~~----Gl----Tq~EIAe~LGiS~~tVs~ie~ra~kkLr~ 51 (141)
T PRK03975 4 ESFLTERQIEVLRL-RER----GL----TQQEIADILGTSRANVSSIEKRARENIEK 51 (141)
T ss_pred ccCCCHHHHHHHHH-HHc----CC----CHHHHHHHHCCCHHHHHHHHHHHHHHHHH
Confidence 45789999999976 333 22 23589999999999999999999988765
No 56
>TIGR02937 sigma70-ECF RNA polymerase sigma factor, sigma-70 family. Several PFAM models hit segments of these sequences including Sigma-70 region 2 (pfam04542) and Sigma-70, region 4 (pfam04545), but not always above their respective trusted cutoffs.
Probab=45.05 E-value=30 Score=27.91 Aligned_cols=47 Identities=17% Similarity=0.285 Sum_probs=35.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++..-+.. .| .-..+|+.+|++...|.+|....+.+.++
T Consensus 110 ~L~~~~~~ii~~~~~~----g~----s~~eIA~~l~~s~~~v~~~~~~~~~kl~~ 156 (158)
T TIGR02937 110 KLPEREREVLVLRYLE----GL----SYKEIAEILGISVGTVKRRLKRARKKLRE 156 (158)
T ss_pred hCCHHHHHHHhhHHhc----CC----CHHHHHHHHCCCHHHHHHHHHHHHHHHHh
Confidence 4677777777544333 33 23489999999999999999999988775
No 57
>smart00421 HTH_LUXR helix_turn_helix, Lux Regulon. lux regulon (activates the bioluminescence operon
Probab=42.73 E-value=47 Score=22.60 Aligned_cols=47 Identities=11% Similarity=0.147 Sum_probs=34.4
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.|+.....++..+ .. .+ ....+|+.+|++...|..|....+.+..-.
T Consensus 3 ~l~~~e~~i~~~~-~~----g~----s~~eia~~l~is~~tv~~~~~~~~~kl~~~ 49 (58)
T smart00421 3 SLTPREREVLRLL-AE----GL----TNKEIAERLGISEKTVKTHLSNIMRKLGVR 49 (58)
T ss_pred CCCHHHHHHHHHH-Hc----CC----CHHHHHHHHCCCHHHHHHHHHHHHHHHCCC
Confidence 4777777777543 32 22 346899999999999999999887776543
No 58
>PRK11924 RNA polymerase sigma factor; Provisional
Probab=42.65 E-value=23 Score=30.16 Aligned_cols=48 Identities=8% Similarity=0.160 Sum_probs=37.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++...+.. .+ .-..+|+..|++...|.+|..-.|++.++.
T Consensus 125 ~L~~~~r~i~~l~~~~----~~----~~~eIA~~lgis~~tv~~~~~ra~~~lr~~ 172 (179)
T PRK11924 125 ALPVKQREVFLLRYVE----GL----SYREIAEILGVPVGTVKSRLRRARQLLREC 172 (179)
T ss_pred hCCHHHHHHhhHHHHc----CC----CHHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 3777788887665544 22 235899999999999999999999998763
No 59
>PRK09652 RNA polymerase sigma factor RpoE; Provisional
Probab=42.05 E-value=26 Score=30.00 Aligned_cols=48 Identities=8% Similarity=0.141 Sum_probs=37.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.-.|... .+ -..+|+.+|++...|.+|....|++.++.
T Consensus 128 ~L~~~~r~vl~l~~~~~--~s------~~eIA~~lgis~~tV~~~l~ra~~~Lr~~ 175 (182)
T PRK09652 128 SLPEELRTAITLREIEG--LS------YEEIAEIMGCPIGTVRSRIFRAREALRAK 175 (182)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 47888888886654431 23 24789999999999999999999988764
No 60
>PF13518 HTH_28: Helix-turn-helix domain
Probab=41.78 E-value=25 Score=24.37 Aligned_cols=24 Identities=17% Similarity=0.432 Sum_probs=20.9
Q ss_pred HHHHHHHhCCChhhhhhhhhhhhh
Q 019170 291 KARLVQETGLQLKQINNWFINQRK 314 (345)
Q Consensus 291 K~~LA~~tgLs~kQV~nWF~N~R~ 314 (345)
...+|..+|++..+|..|....+.
T Consensus 15 ~~~~a~~~gis~~tv~~w~~~y~~ 38 (52)
T PF13518_consen 15 VREIAREFGISRSTVYRWIKRYRE 38 (52)
T ss_pred HHHHHHHHCCCHhHHHHHHHHHHh
Confidence 346999999999999999987775
No 61
>PRK12514 RNA polymerase sigma factor; Provisional
Probab=41.71 E-value=26 Score=30.57 Aligned_cols=48 Identities=13% Similarity=0.270 Sum_probs=38.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.-.|.. .++ -.++|+.+|++...|.+++...|++.++.
T Consensus 129 ~L~~~~r~i~~l~~~~----g~s----~~eIA~~lgis~~tV~~~l~Rar~~Lr~~ 176 (179)
T PRK12514 129 ELEKDRAAAVRRAYLE----GLS----YKELAERHDVPLNTMRTWLRRSLLKLREC 176 (179)
T ss_pred hCCHHHHHHHHHHHHc----CCC----HHHHHHHHCCChHHHHHHHHHHHHHHHHH
Confidence 4788888888777765 222 35799999999999999999999998764
No 62
>PRK12540 RNA polymerase sigma factor; Provisional
Probab=41.50 E-value=27 Score=31.09 Aligned_cols=50 Identities=18% Similarity=0.058 Sum_probs=40.0
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||+..+.++.-.+... .+ -.++|+.+|++...|.....-+|++.++...
T Consensus 111 ~Lp~~~R~v~~L~~~~g--~s------~~EIA~~Lgis~~tV~~~l~RAr~~Lr~~l~ 160 (182)
T PRK12540 111 KLPQDQREALILVGASG--FS------YEDAAAICGCAVGTIKSRVNRARSKLSALLY 160 (182)
T ss_pred hCCHHHHHHhhHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH
Confidence 47888989887765553 22 3489999999999999999999999987654
No 63
>KOG4445 consensus Uncharacterized conserved protein, contains RWD domain [Function unknown]
Probab=41.10 E-value=55 Score=32.98 Aligned_cols=47 Identities=13% Similarity=0.200 Sum_probs=36.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHHHHHHHhhhhcCC
Q 019170 132 KELDQFMTHYVLLLYSFKEQLQQHVRVHAMEAVMACWDLEQSLQSLTGV 180 (345)
Q Consensus 132 pELDqFMeaYc~vL~kykEEL~kP~~~~~~EA~~f~~~iE~QL~sL~~~ 180 (345)
-+-|.||--||. .+|-.+|.+-+|-.++++..-+..+-.|+..+|.+
T Consensus 134 T~C~Hy~H~~Cl--aRyl~~~~~~lrqe~q~~~~~~qh~~~~~eavcpV 180 (368)
T KOG4445|consen 134 TACDHYMHFACL--ARYLTECLTGLRQEIQDAQKERQHMKEQVEAVCPV 180 (368)
T ss_pred ehhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhhhhHhhhhhH
Confidence 478999999994 45666666656656678888888999999999853
No 64
>TIGR02985 Sig70_bacteroi1 RNA polymerase sigma-70 factor, Bacteroides expansion family 1. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are found primarily in the genus Bacteroides. This family appears to have resulted from a lineage-specific expansion as B. thetaiotaomicron VPI-5482, Bacteroides forsythus ATCC 43037, Bacteroides fragilis YCH46 and Bacteroides fragilis NCTC 9343 contain 25, 12, 24 and 23 members, respectively. There are currentlyonly two known members of this family outside of the Bacteroides, in Rhodopseudomonas and Bradyrhizobium.
Probab=40.75 E-value=43 Score=27.86 Aligned_cols=47 Identities=15% Similarity=0.202 Sum_probs=36.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||...+.++.-.+.. .++ -.++|+.+|++...|.++....|++.++
T Consensus 113 ~L~~~~r~il~l~~~~----~~~----~~eIA~~lgis~~tv~~~~~ra~~~Lr~ 159 (161)
T TIGR02985 113 KLPEQCRKIFILSRFE----GKS----YKEIAEELGISVKTVEYHISKALKELRK 159 (161)
T ss_pred HCCHHHHHHHHHHHHc----CCC----HHHHHHHHCCCHHHHHHHHHHHHHHHHh
Confidence 4778888888765544 332 2368999999999999999999998875
No 65
>TIGR02983 SigE-fam_strep RNA polymerase sigma-70 factor, sigma-E family. This group of similar sigma-70 factors includes the sigE factor from Streptomyces coelicolor. The family appears to include a paralagous expansion in the Streptomycetes lineage, while related Actinomycetales have at most two representatives.
Probab=40.69 E-value=30 Score=29.52 Aligned_cols=50 Identities=10% Similarity=0.229 Sum_probs=39.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||...+.++.--|... ++ -.++|+.+|++...|.++....|++.++...
T Consensus 110 ~L~~~~r~i~~l~~~~g----~s----~~eIA~~lgis~~tV~~~l~ra~~~Lr~~l~ 159 (162)
T TIGR02983 110 RLPARQRAVVVLRYYED----LS----EAQVAEALGISVGTVKSRLSRALARLRELLE 159 (162)
T ss_pred hCCHHHHHHhhhHHHhc----CC----HHHHHHHhCCCHHHHHHHHHHHHHHHHHHhc
Confidence 57888888886655542 22 2478999999999999999999999887544
No 66
>PRK00118 putative DNA-binding protein; Validated
Probab=40.62 E-value=31 Score=29.07 Aligned_cols=47 Identities=11% Similarity=0.193 Sum_probs=38.1
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||..++.++.-++... + .-..+|+.+|++...|.+|....|++.++
T Consensus 17 ~L~ekqRevl~L~y~eg----~----S~~EIAe~lGIS~~TV~r~L~RArkkLr~ 63 (104)
T PRK00118 17 LLTEKQRNYMELYYLDD----Y----SLGEIAEEFNVSRQAVYDNIKRTEKLLED 63 (104)
T ss_pred cCCHHHHHHHHHHHHcC----C----CHHHHHHHHCcCHHHHHHHHHHHHHHHHH
Confidence 57888899987776662 2 23469999999999999999999988775
No 67
>TIGR02939 RpoE_Sigma70 RNA polymerase sigma factor RpoE. A sigma factor is a DNA-binding protein protein that binds to the DNA-directed RNA polymerase core to produce the holoenzyme capable of initiating transcription at specific sites. Different sigma factors act in vegetative growth, heat shock, extracytoplasmic functions (ECF), etc. This model represents the clade of sigma factors called RpoE. This protein may be called sigma-24, sigma-E factor, sigma-H factor, fecI-like sigma factor or alternative sigma factor AlgU.
Probab=40.32 E-value=22 Score=31.04 Aligned_cols=49 Identities=10% Similarity=0.043 Sum_probs=37.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.--|..+ -.-.++|+.+|++...|.++....|++.++..
T Consensus 138 ~L~~~~r~v~~l~~~~~--------~s~~EIA~~lgis~~tv~~~l~rar~~Lr~~l 186 (190)
T TIGR02939 138 ALPEDLRTAITLRELEG--------LSYEDIARIMDCPVGTVRSRIFRAREAIAIRL 186 (190)
T ss_pred cCCHHHhhhhhhhhhcC--------CCHHHHHHHHCcCHHHHHHHHHHHHHHHHHHh
Confidence 46777888886554442 22358999999999999999999999988754
No 68
>PRK12541 RNA polymerase sigma factor; Provisional
Probab=40.18 E-value=29 Score=29.66 Aligned_cols=48 Identities=19% Similarity=0.191 Sum_probs=38.7
Q ss_pred CCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 263 GKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 263 ~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
..||..++.++.-.+... .+ -.++|..+|++...|..+....|++.++
T Consensus 111 ~~L~~~~r~v~~l~~~~~--~s------~~eIA~~lgis~~tv~~~l~Rar~~L~~ 158 (161)
T PRK12541 111 SSLPLERRNVLLLRDYYG--FS------YKEIAEMTGLSLAKVKIELHRGRKETKS 158 (161)
T ss_pred HHCCHHHHHHhhhHHhcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHh
Confidence 358898999987766553 23 2479999999999999999999998875
No 69
>PRK12547 RNA polymerase sigma factor; Provisional
Probab=40.06 E-value=29 Score=30.03 Aligned_cols=48 Identities=15% Similarity=0.021 Sum_probs=38.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..++++.-.+...+ . -.++|+.+|++...|.++..-.|++.+..
T Consensus 112 ~L~~~~r~v~~l~~~~g~--s------~~eIA~~lgis~~tV~~~l~Rar~~Lr~~ 159 (164)
T PRK12547 112 LLSADQREAIILIGASGF--S------YEDAAAICGCAVGTIKSRVSRARNRLQEL 159 (164)
T ss_pred hCCHHHHHHHHHHHHcCC--C------HHHHHHHhCCCHHHHHHHHHHHHHHHHHH
Confidence 588899998877666532 2 34799999999999999999999988754
No 70
>TIGR02999 Sig-70_X6 RNA polymerase sigma factor, TIGR02999 family. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are found in a variety of species including Rhodopirellula baltica which encodes a paralogous group of five.
Probab=39.75 E-value=31 Score=30.04 Aligned_cols=48 Identities=4% Similarity=0.100 Sum_probs=38.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+.++.++.-.+... .+ -.++|+.+|++...|.+.+...|++.++.
T Consensus 134 ~Lp~~~r~v~~l~~~~g--~s------~~EIA~~lgis~~tVk~~l~Rar~~Lr~~ 181 (183)
T TIGR02999 134 QVDPRQAEVVELRFFAG--LT------VEEIAELLGVSVRTVERDWRFARAWLADE 181 (183)
T ss_pred cCCHHHHHHHHHHHHcC--CC------HHHHHHHhCCCHHHHHHHHHHHHHHHHHH
Confidence 48899999987776652 22 24799999999999999999999988763
No 71
>TIGR02989 Sig-70_gvs1 RNA polymerase sigma-70 factor, Rhodopirellula/Verrucomicrobium family. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are abundantly found in the species Rhodopirellula baltica (11), and Verrucomicrobium spinosum (16) and to a lesser extent in Gemmata obscuriglobus (2).
Probab=38.92 E-value=34 Score=28.94 Aligned_cols=47 Identities=19% Similarity=0.233 Sum_probs=37.0
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++.--|.. .++ -.++|+.+|++...|.++..-.|++.+.
T Consensus 111 ~L~~~~r~v~~l~~~~----g~~----~~eIA~~l~is~~tv~~~l~Rar~~Lr~ 157 (159)
T TIGR02989 111 KLPERQRELLQLRYQR----GVS----LTALAEQLGRTVNAVYKALSRLRVRLRD 157 (159)
T ss_pred HCCHHHHHHHHHHHhc----CCC----HHHHHHHhCCCHHHHHHHHHHHHHHHHh
Confidence 5888888888775444 222 3479999999999999999999988765
No 72
>PRK06811 RNA polymerase factor sigma-70; Validated
Probab=37.99 E-value=34 Score=30.34 Aligned_cols=49 Identities=16% Similarity=0.165 Sum_probs=39.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.-.|... .+ -.++|+.+|++...|.+...-.|++.++.-
T Consensus 131 ~L~~~~r~i~~l~~~~g--~s------~~EIAe~lgis~~~V~~~l~Ra~~~Lr~~~ 179 (189)
T PRK06811 131 DLEKLDREIFIRRYLLG--EK------IEEIAKKLGLTRSAIDNRLSRGRKKLQKNK 179 (189)
T ss_pred hCCHHHHHHHHHHHHcc--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHcc
Confidence 68999999987655542 33 357999999999999999999999988753
No 73
>PRK12526 RNA polymerase sigma factor; Provisional
Probab=36.97 E-value=33 Score=31.02 Aligned_cols=49 Identities=14% Similarity=0.208 Sum_probs=38.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.--+... . .-.++|+.+|++...|.+++..+|++.++-.
T Consensus 153 ~L~~~~r~vl~l~~~~g--~------s~~EIA~~lgis~~tV~~~l~Ra~~~Lr~~l 201 (206)
T PRK12526 153 KLPEAQQTVVKGVYFQE--L------SQEQLAQQLNVPLGTVKSRLRLALAKLKVQM 201 (206)
T ss_pred hCCHHHHHHHHHHHHcC--C------CHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 48888989987655442 2 2358999999999999999999999887643
No 74
>PF11288 DUF3089: Protein of unknown function (DUF3089); InterPro: IPR021440 This family of proteins has no known function.
Probab=36.61 E-value=8.7 Score=36.18 Aligned_cols=38 Identities=34% Similarity=0.468 Sum_probs=31.0
Q ss_pred CCCCCcccchhhhHHHHHHHhhCCChHHHHHHHHhhhcccCCCc
Q 019170 58 ASNDNGRAEDWETVKCKAEIVGHPLYEQLLSAHVSCLRIATPVD 101 (345)
Q Consensus 58 ~~~~~~~~~~~e~~~lKa~I~sHPlYp~Ll~A~i~C~KVgaP~e 101 (345)
++||-|...- ...||..|..||++.+||+||+ ||.|.-
T Consensus 100 aGHSQGs~~l--~~LL~e~~~~~pl~~rLVAAYl----iG~~v~ 137 (207)
T PF11288_consen 100 AGHSQGSMHL--LRLLKEEIAGDPLRKRLVAAYL----IGYPVT 137 (207)
T ss_pred EEeChHHHHH--HHHHHHHhcCchHHhhhheeee----cCcccc
Confidence 6777776643 4589999999999999999999 787743
No 75
>PRK12523 RNA polymerase sigma factor; Reviewed
Probab=36.58 E-value=38 Score=29.43 Aligned_cols=48 Identities=13% Similarity=0.222 Sum_probs=38.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+.++.++.-.+... .+| .++|+.+|++...|.++...++++.+..
T Consensus 119 ~Lp~~~r~v~~L~~~~g--~s~------~EIA~~lgis~~tV~~~l~ra~~~~~~~ 166 (172)
T PRK12523 119 KLSSKARAAFLYNRLDG--MGH------AEIAERLGVSVSRVRQYLAQGLRQCYIA 166 (172)
T ss_pred hCCHHHHHHHHHHHHcC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 58888888887665542 333 4799999999999999999999988653
No 76
>PF10668 Phage_terminase: Phage terminase small subunit; InterPro: IPR018925 This entry describes the terminase small subunit from Enterococcus phage phiFL1A, related proteins in other bacteriophage, and prophage regions of bacterial genomes. Packaging of double-stranded viral DNA concatemers requires interaction of the prohead with virus DNA. This process is mediated by a phage-encoded DNA recognition and terminase protein. The terminase enzymes described so far, which are hetero-oligomers composed of a small and a large subunit, do not have a significant level of sequence homology. The small terminase subunit is thought to form a nucleoprotein structure that helps to position the terminase large subunit at the packaging initiation site [].
Probab=36.06 E-value=28 Score=26.73 Aligned_cols=21 Identities=14% Similarity=0.355 Sum_probs=17.7
Q ss_pred HHHHHHHHhCCChhhhhhhhh
Q 019170 290 DKARLVQETGLQLKQINNWFI 310 (345)
Q Consensus 290 eK~~LA~~tgLs~kQV~nWF~ 310 (345)
.-..||+++|++..||..|=.
T Consensus 24 ~lkdIA~~Lgvs~~tIr~WK~ 44 (60)
T PF10668_consen 24 KLKDIAEKLGVSESTIRKWKS 44 (60)
T ss_pred cHHHHHHHHCCCHHHHHHHhh
Confidence 346799999999999999943
No 77
>PF00196 GerE: Bacterial regulatory proteins, luxR family; InterPro: IPR000792 This domain is a DNA-binding, helix-turn-helix (HTH) domain of about 65 amino acids, present in transcription regulators of the LuxR/FixJ family of response regulators. The domain is named after Vibrio fischeri luxR, a transcriptional activator for quorum-sensing control of luminescence. LuxR-type HTH domain proteins occur in a variety of organisms. The DNA-binding HTH domain is usually located in the C-terminal region; the N-terminal region often containing an autoinducer-binding domain or a response regulatory domain. Most luxR-type regulators act as transcription activators, but some can be repressors or have a dual role for different sites. LuxR-type HTH regulators control a wide variety of activities in various biological processes. The luxR-type, DNA-binding HTH domain forms a four-helical bundle structure. The HTH motif comprises the second and third helices, known as the scaffold and recognition helix, respectively. The HTH binds DNA in the major groove, where the N-terminal part of the recognition helix makes most of the DNA contacts. The fourth helix is involved in dimerisation of gerE and traR. Signalling events by one of the four activation mechanisms described below lead to multimerisation of the regulator. The regulators bind DNA as multimers [, , ]. LuxR-type HTH proteins can be activated by one of four different mechanisms: 1) Regulators which belong to a two-component sensory transduction system where the protein is activated by its phosphorylation, generally on an aspartate residue, by a transmembrane kinase [, ]. Some proteins that belong to this category are: Rhizobiaceae fixJ (global regulator inducing expression of nitrogen-fixation genes in microaerobiosis) Escherichia coli and Salmonella typhimurium uhpA (activates hexose phosphate transport gene uhpT) E. coli narL and narP (activate nitrate reductase operon) Enterobacteria rcsB (regulation of exopolysaccharide biosynthesis in enteric and plant pathogenesis) Bordetella pertussis bvgA (virulence factor) Bacillus subtilis coma (involved in expression of late-expressing competence genes) 2) Regulators which are activated, or in very rare cases repressed, when bound to N-acyl homoserine lactones, which are used as quorum sensing molecules in a variety of Gram-negative bacteria []: V. fischeri luxR (activates bioluminescence operon) Agrobacterium tumefaciens traR (regulation of Ti plasmid transfer) Erwinia carotovora carR (control of carbapenem antibiotics biosynthesis) E. carotovora expR (virulence factor for soft rot disease; activates plant tissue macerating enzyme genes) Pseudomonas aeruginosa lasR (activates elastase gene lasB) Erwinia chrysanthemi echR and Erwinia stewartii esaR Pseudomonas chlororaphis phzR (positive regulator of phenazine antibiotic production) Pseudomonas aeruginosa rhlR (activates rhlAB operon and lasB gene) 3) Autonomous effector domain regulators, without a regulatory domain, represented by gerE []. B. subtilis gerE (transcription activator and repressor for the regulation of spore formation) 4) Multiple ligand-binding regulators, exemplified by malT []. E. coli malT (activates maltose operon; MalT binds ATP and maltotriose); GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 3SZT_A 3CLO_A 1H0M_A 1L3L_A 3C57_B 1ZLK_B 1ZLJ_H 3C3W_B 1RNL_A 1ZG1_A ....
Probab=33.92 E-value=51 Score=23.84 Aligned_cols=48 Identities=15% Similarity=0.108 Sum_probs=35.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.|++....+|.-+..- ..-.++|...|++++.|..+..+.++|..-++
T Consensus 3 ~LT~~E~~vl~~l~~G---------~~~~eIA~~l~is~~tV~~~~~~i~~Kl~~~~ 50 (58)
T PF00196_consen 3 SLTERELEVLRLLAQG---------MSNKEIAEELGISEKTVKSHRRRIMKKLGVKN 50 (58)
T ss_dssp SS-HHHHHHHHHHHTT---------S-HHHHHHHHTSHHHHHHHHHHHHHHHHT-SS
T ss_pred ccCHHHHHHHHHHHhc---------CCcchhHHhcCcchhhHHHHHHHHHHHhCCCC
Confidence 4677777877666444 23458999999999999999999999876543
No 78
>PF01381 HTH_3: Helix-turn-helix; InterPro: IPR001387 This is large family of DNA binding helix-turn helix proteins that include a bacterial plasmid copy control protein, bacterial methylases, various bacteriophage transcription control proteins and a vegetative specific protein from Dictyostelium discoideum (Slime mould).; GO: 0043565 sequence-specific DNA binding; PDB: 2AXU_A 2AWI_D 2AXV_D 2AXZ_C 2AW6_A 3KXA_C 3BS3_A 2CRO_A 1ZUG_A 3CRO_R ....
Probab=33.53 E-value=33 Score=24.09 Aligned_cols=21 Identities=14% Similarity=0.189 Sum_probs=18.4
Q ss_pred HHHHHHhCCChhhhhhhhhhh
Q 019170 292 ARLVQETGLQLKQINNWFINQ 312 (345)
Q Consensus 292 ~~LA~~tgLs~kQV~nWF~N~ 312 (345)
..||+.+|+++..|..|..+.
T Consensus 13 ~~la~~~gis~~~i~~~~~g~ 33 (55)
T PF01381_consen 13 KELAEKLGISRSTISRIENGK 33 (55)
T ss_dssp HHHHHHHTS-HHHHHHHHTTS
T ss_pred HHHHHHhCCCcchhHHHhcCC
Confidence 689999999999999999983
No 79
>PRK05602 RNA polymerase sigma factor; Reviewed
Probab=32.70 E-value=40 Score=29.63 Aligned_cols=50 Identities=6% Similarity=0.092 Sum_probs=38.1
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||+..+.++.--|... -+ -.++|+.+|++...|.++...+|++.++...
T Consensus 128 ~L~~~~r~i~~l~~~~g--~s------~~EIA~~lgis~~tV~~~l~Rar~~Lr~~l~ 177 (186)
T PRK05602 128 ALPERQREAIVLQYYQG--LS------NIEAAAVMDISVDALESLLARGRRALRAQLA 177 (186)
T ss_pred hCCHHHHHHhhHHHhcC--CC------HHHHHHHhCcCHHHHHHHHHHHHHHHHHHHH
Confidence 36778888886544442 22 3479999999999999999999999887543
No 80
>PRK09648 RNA polymerase sigma factor SigD; Reviewed
Probab=32.50 E-value=46 Score=29.27 Aligned_cols=48 Identities=10% Similarity=0.014 Sum_probs=38.3
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.--|... + .-.++|+.+|++...|.++...+|++.++.
T Consensus 139 ~L~~~~r~i~~l~~~~g----~----s~~EIA~~lgis~~tV~~~l~Ra~~~Lr~~ 186 (189)
T PRK09648 139 TLPEKQREILILRVVVG----L----SAEETAEAVGSTPGAVRVAQHRALARLRAE 186 (189)
T ss_pred hCCHHHHHHHHHHHHcC----C----CHHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 58888888887655542 1 245899999999999999999999988764
No 81
>PRK12546 RNA polymerase sigma factor; Provisional
Probab=32.24 E-value=41 Score=30.19 Aligned_cols=49 Identities=14% Similarity=0.101 Sum_probs=39.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.-.+... .+ -.++|+.+|++...|.+++.-.|++.++..
T Consensus 113 ~Lp~~~r~v~~L~~~~g--~s------~~EIA~~LgiS~~tVk~~l~Rar~~Lr~~l 161 (188)
T PRK12546 113 QLPDEQREALILVGASG--FS------YEEAAEMCGVAVGTVKSRANRARARLAELL 161 (188)
T ss_pred hCCHHHhHHhhhHHhcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 58899999987776552 22 247899999999999999999999998744
No 82
>PRK12533 RNA polymerase sigma factor; Provisional
Probab=31.90 E-value=42 Score=31.11 Aligned_cols=51 Identities=16% Similarity=0.164 Sum_probs=41.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPS 322 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s 322 (345)
.||...+.++.-.|..+ .+| .++|+.+|++...|.++....|++.++....
T Consensus 134 ~Lp~~~R~v~~L~y~eg--~s~------~EIAe~LgiS~~tVk~~L~RAr~~Lr~~l~~ 184 (216)
T PRK12533 134 KLPVEYREVLVLRELED--MSY------REIAAIADVPVGTVMSRLARARRRLAALLGG 184 (216)
T ss_pred cCCHHHHhHhhhHHhcC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHHHHHcc
Confidence 58888999998877663 232 4789999999999999999999999986543
No 83
>PRK09047 RNA polymerase factor sigma-70; Validated
Probab=31.83 E-value=53 Score=27.75 Aligned_cols=48 Identities=13% Similarity=0.080 Sum_probs=38.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.-.+... .+ -.++|+.+|++...|.+...-.|++.++.
T Consensus 106 ~Lp~~~r~v~~l~~~~g--~s------~~EIA~~lgis~~tV~~~l~ra~~~Lr~~ 153 (161)
T PRK09047 106 KLPARQREAFLLRYWED--MD------VAETAAAMGCSEGSVKTHCSRATHALAKA 153 (161)
T ss_pred hCCHHHHHHHHHHHHhc--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 58889999987755552 23 35799999999999999999999988764
No 84
>PF13097 CENP-U: CENP-A nucleosome associated complex (NAC) subunit
Probab=31.80 E-value=1.1e+02 Score=28.24 Aligned_cols=45 Identities=13% Similarity=0.223 Sum_probs=36.6
Q ss_pred ChHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHH-HHHHHHHHHHhhhhc
Q 019170 131 DKELDQFMTHYVLLLYSFKEQLQQHVRVHAMEA-VMACWDLEQSLQSLT 178 (345)
Q Consensus 131 dpELDqFMeaYc~vL~kykEEL~kP~~~~~~EA-~~f~~~iE~QL~sL~ 178 (345)
=-|||-.+.++-.++..|++.++-.+= .+| ..|+..|..||-.+.
T Consensus 103 ItELDVvL~~FEk~~~eYkq~ieS~~c---r~AI~~F~~~~keqL~~~i 148 (175)
T PF13097_consen 103 ITELDVVLSAFEKTALEYKQSIESKIC---RKAINKFYSNFKEQLIEMI 148 (175)
T ss_pred chHHHHHHHHHHHHHHHHHHhhccHHH---HHHHHHHHHHHHHHHHHHH
Confidence 479999999999999999999976552 233 358899999998776
No 85
>PRK12512 RNA polymerase sigma factor; Provisional
Probab=31.64 E-value=47 Score=29.03 Aligned_cols=49 Identities=4% Similarity=-0.086 Sum_probs=39.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.-.+... .+ -.++|+.+|++...|.+++..+|++.+...
T Consensus 131 ~L~~~~r~v~~l~~~~g--~s------~~eIA~~l~is~~tV~~~l~ra~~~Lr~~l 179 (184)
T PRK12512 131 TLPPRQRDVVQSISVEG--AS------IKETAAKLSMSEGAVRVALHRGLAALAAKF 179 (184)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHhCCCHHHHHHHHHHHHHHHHHHh
Confidence 58888999988765552 22 358999999999999999999999988643
No 86
>PRK12532 RNA polymerase sigma factor; Provisional
Probab=31.43 E-value=47 Score=29.46 Aligned_cols=49 Identities=12% Similarity=0.178 Sum_probs=38.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.-.+... + .-.++|+.+|++...|.++...+|++.++..
T Consensus 136 ~L~~~~r~i~~L~~~~g----~----s~~EIA~~lgis~~tVk~~l~Rar~~Lr~~l 184 (195)
T PRK12532 136 NLPENTARVFTLKEILG----F----SSDEIQQMCGISTSNYHTIMHRARESLRQCL 184 (195)
T ss_pred hCCHHHHHHhhhHHHhC----C----CHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 48888888887655442 2 2358999999999999999999999998744
No 87
>PF13551 HTH_29: Winged helix-turn helix
Probab=31.40 E-value=2.3e+02 Score=22.22 Aligned_cols=46 Identities=17% Similarity=0.052 Sum_probs=29.6
Q ss_pred CChhHHHHHHHHHHHhcCCC--CCCHHHHHH-H-HHHh--CCChhhhhhhhh
Q 019170 265 LPGDTTSLLKAWWLSHAKWP--YPTEEDKAR-L-VQET--GLQLKQINNWFI 310 (345)
Q Consensus 265 lpk~~~~iL~~Wf~~H~~~P--YPs~~eK~~-L-A~~t--gLs~kQV~nWF~ 310 (345)
++.+....|.+++.++.... ..+...-.. | .+.+ .++...|..|+.
T Consensus 58 l~~~~~~~l~~~~~~~p~~g~~~~t~~~l~~~l~~~~~~~~~s~~ti~r~L~ 109 (112)
T PF13551_consen 58 LSEEQRAQLIELLRENPPEGRSRWTLEELAEWLIEEEFGIDVSPSTIRRILK 109 (112)
T ss_pred CCHHHHHHHHHHHHHCCCCCCCcccHHHHHHHHHHhccCccCCHHHHHHHHH
Confidence 89999999999999942221 233343333 4 3333 477788888864
No 88
>PF05190 MutS_IV: MutS family domain IV C-terminus.; InterPro: IPR007861 Mismatch repair contributes to the overall fidelity of DNA replication and is essential for combating the adverse effects of damage to the genome. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex. The post-replicative Mismatch Repair System (MMRS) of Escherichia coli involves MutS (Mutator S), MutL and MutH proteins, and acts to correct point mutations or small insertion/deletion loops produced during DNA replication []. MutS and MutL are involved in preventing recombination between partially homologous DNA sequences. The assembly of MMRS is initiated by MutS, which recognises and binds to mispaired nucleotides and allows further action of MutL and MutH to eliminate a portion of newly synthesized DNA strand containing the mispaired base []. MutS can also collaborate with methyltransferases in the repair of O(6)-methylguanine damage, which would otherwise pair with thymine during replication to create an O(6)mG:T mismatch []. MutS exists as a dimer, where the two monomers have different conformations and form a heterodimer at the structural level []. Only one monomer recognises the mismatch specifically and has ADP bound. Non-specific major groove DNA-binding domains from both monomers embrace the DNA in a clamp-like structure. Mismatch binding induces ATP uptake and a conformational change in the MutS protein, resulting in a clamp that translocates on DNA. MutS is a modular protein with a complex structure [], and is composed of: N-terminal mismatch-recognition domain, which is similar in structure to tRNA endonuclease. Connector domain, which is similar in structure to Holliday junction resolvase ruvC. Core domain, which is composed of two separate subdomains that join together to form a helical bundle; from within the core domain, two helices act as levers that extend towards (but do not touch) the DNA. Clamp domain, which is inserted between the two subdomains of the core domain at the top of the lever helices; the clamp domain has a beta-sheet structure. ATPase domain (connected to the core domain), which has a classical Walker A motif. HTH (helix-turn-helix) domain, which is involved in dimer contacts. The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair. Homologues of MutS have been found in many species including eukaryotes (MSH 1, 2, 3, 4, 5, and 6 proteins), archaea and bacteria, and together these proteins have been grouped into the MutS family. Although many of these proteins have similar activities to the E. coli MutS, there is significant diversity of function among the MutS family members. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein [].This diversity is even seen within species, where many species encode multiple MutS homologues with distinct functions []. Inter-species homologues may have arisen through frequent ancient horizontal gene transfer of MutS (and MutL) from bacteria to archaea and eukaryotes via endosymbiotic ancestors of mitochondria and chloroplasts []. This entry represents the clamp domain (domain 4) found in proteins of the MutS family. The clamp domain is inserted within the core domain at the top of the lever helices. It has a beta-sheet structure [].; GO: 0005524 ATP binding, 0030983 mismatched DNA binding, 0006298 mismatch repair; PDB: 2WTU_A 1OH7_A 1OH5_B 1W7A_B 1NG9_A 1OH8_B 1WBD_A 1WB9_A 3K0S_A 1OH6_A ....
Probab=31.40 E-value=81 Score=24.26 Aligned_cols=26 Identities=27% Similarity=0.319 Sum_probs=20.8
Q ss_pred CCChHHHHHHHHHHHHHHHHHHHHhH
Q 019170 129 LDDKELDQFMTHYVLLLYSFKEQLQQ 154 (345)
Q Consensus 129 g~dpELDqFMeaYc~vL~kykEEL~k 154 (345)
|-|++||.....|..+.....+.+.+
T Consensus 1 g~d~~Ld~~~~~~~~~~~~l~~~~~~ 26 (92)
T PF05190_consen 1 GFDEELDELREEYEEIEEELEELLEE 26 (92)
T ss_dssp TSSHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46899999999999888777666643
No 89
>PRK09649 RNA polymerase sigma factor SigC; Reviewed
Probab=31.16 E-value=48 Score=29.39 Aligned_cols=48 Identities=17% Similarity=0.047 Sum_probs=38.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||..++.++.-.+... .+ -.++|+.+|++...|.++..-.|++.++-
T Consensus 130 ~Lp~~~r~v~~L~~~~g--~s------~~EIA~~lgis~~tVk~~l~Rar~~Lr~~ 177 (185)
T PRK09649 130 DLTTDQREALLLTQLLG--LS------YADAAAVCGCPVGTIRSRVARARDALLAD 177 (185)
T ss_pred hCCHHHhHHhhhHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHhh
Confidence 58888999887665552 22 34799999999999999999999999873
No 90
>TIGR03070 couple_hipB transcriptional regulator, y4mF family. Members of this family belong to a clade of helix-turn-helix DNA-binding proteins, among the larger family pfam01381 (HTH_3; Helix-turn-helix). Members are similar in sequence to the HipB protein of E. coli. Genes for members of the seed alignment for this protein family were found to be closely linked to genes encoding proteins related to HipA. The HibBA operon appears to have some features in common with toxin-antitoxin post-segregational killing systems.
Probab=30.96 E-value=35 Score=23.70 Aligned_cols=22 Identities=23% Similarity=0.223 Sum_probs=19.2
Q ss_pred HHHHHHhCCChhhhhhhhhhhh
Q 019170 292 ARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 292 ~~LA~~tgLs~kQV~nWF~N~R 313 (345)
..||+.+|+++..|..|..+.+
T Consensus 19 ~~lA~~~gvs~~~vs~~e~g~~ 40 (58)
T TIGR03070 19 ADLADLAGVGLRFIRDVENGKP 40 (58)
T ss_pred HHHHHHhCCCHHHHHHHHCCCC
Confidence 5799999999999999997653
No 91
>PRK12536 RNA polymerase sigma factor; Provisional
Probab=30.87 E-value=51 Score=28.92 Aligned_cols=48 Identities=19% Similarity=0.028 Sum_probs=37.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.-.+..+ .+ -.++|+.+|++...|.+.+...|++.++.
T Consensus 129 ~L~~~~r~v~~l~~~~g--~s------~~EIA~~l~is~~tV~~~l~rar~~Lr~~ 176 (181)
T PRK12536 129 QLPDRQRLPIVHVKLEG--LS------VAETAQLTGLSESAVKVGIHRGLKALAAK 176 (181)
T ss_pred HCCHHHHHHHHHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 46888888776555543 22 35899999999999999999999998874
No 92
>PRK13919 putative RNA polymerase sigma E protein; Provisional
Probab=30.72 E-value=51 Score=28.79 Aligned_cols=48 Identities=13% Similarity=0.159 Sum_probs=38.3
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.+|.-.+... .. -.++|+.+|++...|.++....|++.++.
T Consensus 135 ~L~~~~r~vl~l~~~~~--~s------~~eIA~~lgis~~~V~~~l~ra~~~Lr~~ 182 (186)
T PRK13919 135 ALSPEERRVIEVLYYQG--YT------HREAAQLLGLPLGTLKTRARRALSRLKEV 182 (186)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCcCHHHHHHHHHHHHHHHHHH
Confidence 48888999987665542 22 35799999999999999999999988764
No 93
>PRK12520 RNA polymerase sigma factor; Provisional
Probab=30.57 E-value=53 Score=29.03 Aligned_cols=48 Identities=15% Similarity=0.085 Sum_probs=38.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+.++.++.-.+... .+| .++|+.+|++...|.+.....|++.++-
T Consensus 131 ~Lp~~~r~v~~l~~~~g--~s~------~EIA~~lgis~~tV~~~l~Rar~~Lr~~ 178 (191)
T PRK12520 131 RLPPRTGRVFMMREWLE--LET------EEICQELQITATNAWVLLYRARMRLREC 178 (191)
T ss_pred hCCHHHHHHHHHHHHcC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 48888888887665552 333 5799999999999999999999998763
No 94
>PRK12516 RNA polymerase sigma factor; Provisional
Probab=30.41 E-value=50 Score=29.52 Aligned_cols=50 Identities=18% Similarity=0.185 Sum_probs=39.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||...+.++.-.+... .+| .++|+.+|++...|.++....|++.++...
T Consensus 116 ~Lp~~~r~i~~L~~~~g--~s~------~EIA~~Lgis~~tVk~~l~Rar~~Lr~~l~ 165 (187)
T PRK12516 116 QLPDDQREAIILVGASG--FAY------EEAAEICGCAVGTIKSRVNRARQRLQEILQ 165 (187)
T ss_pred hCCHHHHHHHHHHHHcC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHHHHHH
Confidence 47888888887665552 333 478999999999999999999999987544
No 95
>TIGR02954 Sig70_famx3 RNA polymerase sigma-70 factor, TIGR02954 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and one-to-a-genome distribution, to represent a conserved family. This family is found in certain Bacillus and Clostridium species.
Probab=29.93 E-value=54 Score=28.25 Aligned_cols=48 Identities=4% Similarity=0.028 Sum_probs=37.4
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...++++..-+... .+ -..+|+.+|++...|.++..-.|++.++.
T Consensus 119 ~L~~~~r~i~~l~~~~g--~s------~~eiA~~lgis~~tv~~~l~Ra~~~Lr~~ 166 (169)
T TIGR02954 119 TLNDKYQTAIILRYYHD--LT------IKEIAEVMNKPEGTVKTYLHRALKKLKKR 166 (169)
T ss_pred hCCHHHhHHHHHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 47888888886655552 22 34799999999999999999999887753
No 96
>PRK12519 RNA polymerase sigma factor; Provisional
Probab=29.89 E-value=40 Score=29.78 Aligned_cols=47 Identities=11% Similarity=0.196 Sum_probs=36.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++.--+... .+ -.++|+.+|++...|.+|+..+|++.++
T Consensus 141 ~L~~~~~~v~~l~~~~g--~s------~~EIA~~lgis~~tV~~~l~Ra~~~Lr~ 187 (194)
T PRK12519 141 QLPESQRQVLELAYYEG--LS------QSEIAKRLGIPLGTVKARARQGLLKLRE 187 (194)
T ss_pred hCCHHHhhhhhhhhhcC--CC------HHHHHHHhCCCHHHHHHHHHHHHHHHHH
Confidence 57888888886554442 22 3579999999999999999999998875
No 97
>PF12022 DUF3510: Domain of unknown function (DUF3510); InterPro: IPR024603 The COG complex comprises eight proteins (COG1-8) and plays critical roles in Golgi structure and function []. This uncharacterised domain is found in the C-terminal of COG complex subunit 2 proteins.
Probab=29.85 E-value=2.1e+02 Score=24.45 Aligned_cols=20 Identities=25% Similarity=0.268 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHhhhhcCC
Q 019170 161 MEAVMACWDLEQSLQSLTGV 180 (345)
Q Consensus 161 ~EA~~f~~~iE~QL~sL~~~ 180 (345)
.|...-.+++|+.|+-|=..
T Consensus 80 ~evL~sv~KtEeSL~rlkk~ 99 (125)
T PF12022_consen 80 SEVLTSVRKTEESLKRLKKR 99 (125)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 46666778999999988753
No 98
>PRK11511 DNA-binding transcriptional activator MarA; Provisional
Probab=29.65 E-value=1.3e+02 Score=25.28 Aligned_cols=41 Identities=12% Similarity=0.229 Sum_probs=31.4
Q ss_pred hHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhh
Q 019170 268 DTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQ 312 (345)
Q Consensus 268 ~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~ 312 (345)
.....+..|...|+..| ++. ..||+.+|+++.++..+|...
T Consensus 9 ~~i~~~~~~I~~~~~~~-~sl---~~lA~~~g~S~~~l~r~Fk~~ 49 (127)
T PRK11511 9 ITIHSILDWIEDNLESP-LSL---EKVSERSGYSKWHLQRMFKKE 49 (127)
T ss_pred HHHHHHHHHHHHhcCCC-CCH---HHHHHHHCcCHHHHHHHHHHH
Confidence 34567788999987766 454 578888999999998888754
No 99
>PRK12530 RNA polymerase sigma factor; Provisional
Probab=29.60 E-value=53 Score=29.22 Aligned_cols=48 Identities=10% Similarity=0.149 Sum_probs=38.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.++.-.+... .+ -.++|..+|+++..|.++..-+|++.++.
T Consensus 134 ~Lp~~~R~v~~L~~~~g--~s------~~EIA~~lgis~~tVk~~l~RAr~~Lr~~ 181 (189)
T PRK12530 134 HLPAQQARVFMMREYLE--LS------SEQICQECDISTSNLHVLLYRARLQLQAC 181 (189)
T ss_pred hCCHHHHHHHhHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 58888988887665552 22 35899999999999999999999988763
No 100
>PRK12531 RNA polymerase sigma factor; Provisional
Probab=29.54 E-value=54 Score=29.18 Aligned_cols=50 Identities=16% Similarity=0.281 Sum_probs=39.2
Q ss_pred CCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 263 GKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 263 ~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
..||+..+.++.-.+... .+| .++|+.+|++...|.+-+..+|++.++..
T Consensus 140 ~~Lp~~~r~v~~l~~~eg--~s~------~EIA~~lgis~~tVk~rl~ra~~~Lr~~l 189 (194)
T PRK12531 140 DRLPKAQRDVLQAVYLEE--LPH------QQVAEMFDIPLGTVKSRLRLAVEKLRHSM 189 (194)
T ss_pred HhCCHHHHHHHHHHHHcC--CCH------HHHHHHhCcCHHHHHHHHHHHHHHHHHHh
Confidence 358888999987654442 333 47999999999999999999999887643
No 101
>PRK12511 RNA polymerase sigma factor; Provisional
Probab=29.31 E-value=55 Score=29.17 Aligned_cols=50 Identities=10% Similarity=0.066 Sum_probs=40.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||+.++.++.-.+... .+ -.++|+.+|++...|.++..-.|++.++-..
T Consensus 111 ~Lp~~~R~v~~L~~~eg--~s------~~EIA~~lgis~~tV~~~l~Rar~~Lr~~~~ 160 (182)
T PRK12511 111 DLPEEQRAALHLVAIEG--LS------YQEAAAVLGIPIGTLMSRIGRARAALRAFEE 160 (182)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHhCcCHHHHHHHHHHHHHHHHHHHH
Confidence 58999999998766652 22 3579999999999999999999999887554
No 102
>PRK15369 two component system sensor kinase SsrB; Provisional
Probab=29.21 E-value=1.2e+02 Score=25.40 Aligned_cols=48 Identities=13% Similarity=0.185 Sum_probs=38.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+++|.-+ ..+ |. ...+|+..+++.+.|.++..+.|++..-.+
T Consensus 149 ~lt~~e~~vl~l~-~~g----~~----~~~Ia~~l~~s~~tv~~~~~~~~~kl~~~~ 196 (211)
T PRK15369 149 LLTPRERQILKLI-TEG----YT----NRDIAEQLSISIKTVETHRLNMMRKLDVHK 196 (211)
T ss_pred CCCHHHHHHHHHH-HCC----CC----HHHHHHHhCCCHHHHHHHHHHHHHHhCCCC
Confidence 5888889998874 442 22 468899999999999999999999987544
No 103
>cd01392 HTH_LacI Helix-turn-helix (HTH) DNA binding domain of the LacI family of transcriptional regulators. HTH-DNA binding domain of the LacI (lactose operon repressor) family of bacterial transcriptional regulators and their putative homologs found in plants. The LacI family has more than 500 members distributed among almost all bacterial species. The monomeric proteins of the LacI family contain common structural features that include a small DNA-binding domain with a helix-turn-helix motif in the N-terminus, a regulatory ligand-binding domain which exhibits the type I periplasmic binding protein fold in the C-terminus for oligomerization and for effector binding, and an approximately 18-amino acid linker connecting these two functional domains. In LacI-like transcriptional regulators, the ligands are monosaccharides including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars, with a few exceptions. When the C-terminal domain of the LacI family repre
Probab=28.92 E-value=31 Score=23.99 Aligned_cols=21 Identities=5% Similarity=0.265 Sum_probs=19.0
Q ss_pred HHHHHhCCChhhhhhhhhhhh
Q 019170 293 RLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 293 ~LA~~tgLs~kQV~nWF~N~R 313 (345)
.||+.+|++...|+.|+.+.+
T Consensus 2 ~lA~~~gvs~~tvs~~l~g~~ 22 (52)
T cd01392 2 DIARAAGVSVATVSRVLNGKP 22 (52)
T ss_pred cHHHHHCcCHHHHHHHHcCCC
Confidence 589999999999999998873
No 104
>cd00131 PAX Paired Box domain
Probab=28.74 E-value=1.7e+02 Score=25.12 Aligned_cols=46 Identities=11% Similarity=0.082 Sum_probs=32.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCC-------Chhhhhhhhhhh
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGL-------QLKQINNWFINQ 312 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgL-------s~kQV~nWF~N~ 312 (345)
.++......+..+... +|.-|..|-..+-...|+ +..+|+.||.++
T Consensus 75 ~~~~~~~~~i~~~v~~---~p~~Tl~El~~~L~~~gv~~~~~~~s~stI~R~L~~~ 127 (128)
T cd00131 75 VATPEVVKKIEIYKQE---NPGMFAWEIRDRLLQEGVCDKSNVPSVSSINRILRNK 127 (128)
T ss_pred cCCHHHHHHHHHHHHH---CCCCCHHHHHHHHHHcCCcccCCCCCHHHHHHHHHhc
Confidence 4455555666667777 798888887666335577 999999998764
No 105
>PRK12524 RNA polymerase sigma factor; Provisional
Probab=28.48 E-value=56 Score=29.12 Aligned_cols=49 Identities=4% Similarity=0.059 Sum_probs=38.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.--+.. .|+ -.++|+.+|++...|.+++.-+|++.++..
T Consensus 136 ~L~~~~r~i~~L~~~~----g~s----~~eIA~~lgis~~tV~~~l~Ra~~~Lr~~l 184 (196)
T PRK12524 136 ALPERQRQAVVLRHIE----GLS----NPEIAEVMEIGVEAVESLTARGKRALAALL 184 (196)
T ss_pred hCCHHHHHHHHHHHHc----CCC----HHHHHHHHCcCHHHHHHHHHHHHHHHHHHH
Confidence 5888888887665444 222 357999999999999999999999998754
No 106
>PRK12542 RNA polymerase sigma factor; Provisional
Probab=28.41 E-value=57 Score=28.64 Aligned_cols=49 Identities=14% Similarity=0.136 Sum_probs=38.3
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.++.-.+... .+ -.++|+.+|++...|.+.....|++.++..
T Consensus 122 ~L~~~~r~i~~l~~~~g--~s------~~EIA~~lgis~~tVk~~l~Rar~~Lr~~l 170 (185)
T PRK12542 122 ELNESNRQVFKYKVFYN--LT------YQEISSVMGITEANVRKQFERARKRVQNMI 170 (185)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 48888989887644442 22 247999999999999999999999887743
No 107
>TIGR02959 SigZ RNA polymerase sigma factor, SigZ family. This family of RNA polymerase sigma factors is a member of the Sigma-70 subfamily (TIGR02937). One of these is designated as SigZ in B. subtilis (Swiss_Prot: SIGZ_BACSU). Interestingly, this group has a very sporatic distribution, B. subtilis, for instance, being the only sequenced strain of Bacilli with a member. Dechloromonas aromatica RCB appears to have two of these sigma factors. A member appears on a plasmid found in Photobacterium profundum SS9 and Vibrio fischeri ES114 (where a second one is chromosomally encoded).
Probab=28.15 E-value=62 Score=28.22 Aligned_cols=49 Identities=16% Similarity=0.202 Sum_probs=39.3
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.+|.-.+... |+ -.++|+.+|++...|.++..-.|++.++-.
T Consensus 100 ~L~~~~r~v~~l~~~~g----~s----~~eIA~~lgis~~tV~~~l~Rar~~Lr~~l 148 (170)
T TIGR02959 100 ELPDEYREAIRLTELEG----LS----QQEIAEKLGLSLSGAKSRVQRGRKKLKELL 148 (170)
T ss_pred hCCHHHHHHHHHHHHcC----CC----HHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence 58888888887766552 22 357999999999999999999999988744
No 108
>PRK09639 RNA polymerase sigma factor SigX; Provisional
Probab=28.12 E-value=60 Score=27.66 Aligned_cols=47 Identities=13% Similarity=0.249 Sum_probs=37.4
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.+|.-.+ . .+ .-..+|..+|++...|.++....|++.++-
T Consensus 112 ~L~~~~r~il~l~~-~----g~----s~~eIA~~lgis~~tV~~~i~ra~~~Lr~~ 158 (166)
T PRK09639 112 KMTERDRTVLLLRF-S----GY----SYKEIAEALGIKESSVGTTLARAKKKFRKI 158 (166)
T ss_pred cCCHHHHHHHHHHH-c----CC----CHHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 47888888887666 4 22 235799999999999999999999988763
No 109
>PRK10072 putative transcriptional regulator; Provisional
Probab=28.00 E-value=40 Score=27.98 Aligned_cols=23 Identities=9% Similarity=0.289 Sum_probs=20.4
Q ss_pred HHHHHHHhCCChhhhhhhhhhhh
Q 019170 291 KARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 291 K~~LA~~tgLs~kQV~nWF~N~R 313 (345)
...||+.+|++...|++|...+|
T Consensus 49 Q~elA~~lGvS~~TVs~WE~G~r 71 (96)
T PRK10072 49 IDDFARVLGVSVAMVKEWESRRV 71 (96)
T ss_pred HHHHHHHhCCCHHHHHHHHcCCC
Confidence 45899999999999999998765
No 110
>PRK12537 RNA polymerase sigma factor; Provisional
Probab=27.92 E-value=60 Score=28.52 Aligned_cols=47 Identities=15% Similarity=0.232 Sum_probs=36.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++.--|... .+ -.++|+.+|++...|.+|...+|++.++
T Consensus 133 ~L~~~~r~i~~l~~~~~--~s------~~eIA~~lgis~~tV~~~l~ra~~~Lr~ 179 (182)
T PRK12537 133 QLEPARRNCILHAYVDG--CS------HAEIAQRLGAPLGTVKAWIKRSLKALRE 179 (182)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCCChhhHHHHHHHHHHHHHH
Confidence 47888888765554442 33 3579999999999999999999988765
No 111
>cd06170 LuxR_C_like C-terminal DNA-binding domain of LuxR-like proteins. This domain contains a helix-turn-helix motif and binds DNA. Proteins belonging to this group are response regulators; some act as transcriptional activators, others as transcriptional repressors. Many are active as homodimers. Many are two domain proteins in which the DNA binding property of the C-terminal DNA binding domain is modulated by modifications of the N-terminal domain. For example in the case of Lux R which participates in the regulation of gene expression in response to fluctuations in cell-population density (quorum-sensing), a signaling molecule, the pheromone Acyl HSL (N-acyl derivatives of homoserine lactone), binds to the N-terminal domain and leads to LuxR dimerization. For others phophorylation of the N-terminal domain leads to multimerization, for example Escherichia coli NarL and Sinorhizobium melilot FixJ. NarL controls gene expression of many respiratory-related operons when environmental
Probab=27.86 E-value=1e+02 Score=21.04 Aligned_cols=45 Identities=11% Similarity=0.091 Sum_probs=30.5
Q ss_pred CChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 265 LPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 265 lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
|+.....++.-++ . .+ .-..+|+.+|++...|..|..-.+++..-
T Consensus 1 l~~~e~~i~~~~~-~----~~----s~~eia~~l~~s~~tv~~~~~~~~~~l~~ 45 (57)
T cd06170 1 LTPREREVLRLLA-E----GK----TNKEIADILGISEKTVKTHLRNIMRKLGV 45 (57)
T ss_pred CCHHHHHHHHHHH-c----CC----CHHHHHHHHCCCHHHHHHHHHHHHHHhCC
Confidence 3455566664432 2 22 34688999999999999999876666544
No 112
>cd00093 HTH_XRE Helix-turn-helix XRE-family like proteins. Prokaryotic DNA binding proteins belonging to the xenobiotic response element family of transcriptional regulators.
Probab=27.55 E-value=50 Score=21.49 Aligned_cols=21 Identities=14% Similarity=0.180 Sum_probs=18.8
Q ss_pred HHHHHHhCCChhhhhhhhhhh
Q 019170 292 ARLVQETGLQLKQINNWFINQ 312 (345)
Q Consensus 292 ~~LA~~tgLs~kQV~nWF~N~ 312 (345)
..+|+.+|++...|..|..+.
T Consensus 16 ~~~a~~~~~~~~~v~~~~~g~ 36 (58)
T cd00093 16 EELAEKLGVSRSTISRIENGK 36 (58)
T ss_pred HHHHHHHCCCHHHHHHHHcCC
Confidence 489999999999999998875
No 113
>PF13384 HTH_23: Homeodomain-like domain; PDB: 2X48_C.
Probab=27.44 E-value=50 Score=22.92 Aligned_cols=23 Identities=13% Similarity=0.379 Sum_probs=17.0
Q ss_pred HHHHHHHhCCChhhhhhhhhhhh
Q 019170 291 KARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 291 K~~LA~~tgLs~kQV~nWF~N~R 313 (345)
...+|+.+|++...|.+|....+
T Consensus 20 ~~~ia~~lgvs~~Tv~~w~kr~~ 42 (50)
T PF13384_consen 20 IREIAKRLGVSRSTVYRWIKRYR 42 (50)
T ss_dssp HHHHHHHHTS-HHHHHHHHT---
T ss_pred HHHHHHHHCcCHHHHHHHHHHcc
Confidence 45899999999999999976544
No 114
>PRK12529 RNA polymerase sigma factor; Provisional
Probab=27.43 E-value=74 Score=27.95 Aligned_cols=48 Identities=8% Similarity=0.042 Sum_probs=38.4
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+.++.++.-.+... .+ -.++|+.+|++...|.+.+..++.+....
T Consensus 127 ~Lp~~~R~v~~L~~~~g--~s------~~EIA~~lgis~~tVk~~l~rAl~~~~~~ 174 (178)
T PRK12529 127 TLRPRVKQAFLMATLDG--MK------QKDIAQALDIALPTVKKYIHQAYVTCLSL 174 (178)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHh
Confidence 48999999887766553 23 25799999999999999999988887654
No 115
>PRK12535 RNA polymerase sigma factor; Provisional
Probab=27.38 E-value=62 Score=29.20 Aligned_cols=52 Identities=12% Similarity=0.099 Sum_probs=41.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPSS 323 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s~ 323 (345)
.||+..+.++.--+... .+ -.++|+.+|++...|.++....|++.++.....
T Consensus 133 ~Lp~~~r~v~~l~~~~g--~s------~~EIAe~lgis~~tV~~~l~Rar~~Lr~~l~~~ 184 (196)
T PRK12535 133 ALPPERREALILTQVLG--YT------YEEAAKIADVRVGTIRSRVARARADLIAATATG 184 (196)
T ss_pred cCCHHHHHHhhhHHHhC--CC------HHHHHHHhCCCHHHHHHHHHHHHHHHHHHhccc
Confidence 48888888886665553 22 358999999999999999999999999866544
No 116
>PF04967 HTH_10: HTH DNA binding domain; InterPro: IPR007050 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. This entry represents the HTH DNA binding domain found in Halobacterium salinarium (Halobacterium halobium) and described as a putative bacterio-opsin activator.
Probab=26.74 E-value=1.3e+02 Score=22.39 Aligned_cols=47 Identities=17% Similarity=0.187 Sum_probs=35.9
Q ss_pred CChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhh
Q 019170 265 LPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQ 312 (345)
Q Consensus 265 lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~ 312 (345)
|+..++.+|..-+..=+. -+|-...-..||++.|++..-+..=+.+.
T Consensus 1 LT~~Q~e~L~~A~~~GYf-d~PR~~tl~elA~~lgis~st~~~~LRra 47 (53)
T PF04967_consen 1 LTDRQREILKAAYELGYF-DVPRRITLEELAEELGISKSTVSEHLRRA 47 (53)
T ss_pred CCHHHHHHHHHHHHcCCC-CCCCcCCHHHHHHHhCCCHHHHHHHHHHH
Confidence 577788899887776333 34888888999999999999887655443
No 117
>PF12362 DUF3646: DNA polymerase III gamma and tau subunits C terminal; InterPro: IPR022107 This domain family is found in bacteria, and is approximately 120 amino acids in length. The family is found in association with PF00004 from PFAM. The proteins in this family are frequently annotated as the gamma and tau subunits of DNA polymerase III, however there is little accompanying literature to back this up.
Probab=26.46 E-value=45 Score=28.68 Aligned_cols=27 Identities=26% Similarity=0.310 Sum_probs=22.0
Q ss_pred ccchhhhHHHHHHHhhCCChHHHHHHH
Q 019170 64 RAEDWETVKCKAEIVGHPLYEQLLSAH 90 (345)
Q Consensus 64 ~~~~~e~~~lKa~I~sHPlYp~Ll~A~ 90 (345)
+....+.+..++.+..||++...|.+|
T Consensus 82 E~~~~~~~~~~~~a~~~P~V~avL~~F 108 (117)
T PF12362_consen 82 EQEAAAKEARRAAARAHPLVKAVLAAF 108 (117)
T ss_pred HHHHHHHHHHHHHHHhCcHHHHHHHHC
Confidence 344556678899999999999999887
No 118
>PRK09645 RNA polymerase sigma factor SigL; Provisional
Probab=26.30 E-value=68 Score=27.66 Aligned_cols=49 Identities=12% Similarity=0.223 Sum_probs=39.0
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||+..+.+|.--+... .+ -.++|+.+|++...|.+...-.|++.++..
T Consensus 118 ~L~~~~r~vl~L~~~~g--~s------~~EIA~~lgis~~tV~~~l~ra~~~Lr~~l 166 (173)
T PRK09645 118 QLSPEHRAVLVRSYYRG--WS------TAQIAADLGIPEGTVKSRLHYALRALRLAL 166 (173)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCcCHHHHHHHHHHHHHHHHHHh
Confidence 48888999987665552 22 347999999999999999999999888743
No 119
>PF15500 Toxin_39: Putative RNase-like toxin
Probab=26.04 E-value=1.5e+02 Score=24.71 Aligned_cols=38 Identities=21% Similarity=0.156 Sum_probs=26.8
Q ss_pred CCCChHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHHHHHH
Q 019170 128 VLDDKELDQFMTHYVLLLYSFKEQLQQHVRVHAMEAVMACWDLEQ 172 (345)
Q Consensus 128 ~g~dpELDqFMeaYc~vL~kykEEL~kP~~~~~~EA~~f~~~iE~ 172 (345)
++++|+|.+.|+. .+-+++.--++ .++||...+..+|.
T Consensus 44 l~a~p~lk~wne~-vq~~Rk~dp~~------aAdeaakLi~alE~ 81 (96)
T PF15500_consen 44 LAADPALKAWNET-VQAKRKLDPKF------AADEAAKLIQALET 81 (96)
T ss_pred hccCHHHHHHHHH-HHHHHhhchhh------hHHHHHHHHHHHHH
Confidence 5789999999998 44455554444 35788877766664
No 120
>PRK06986 fliA flagellar biosynthesis sigma factor; Validated
Probab=25.87 E-value=58 Score=30.09 Aligned_cols=48 Identities=13% Similarity=0.175 Sum_probs=38.4
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.++..-|... + .-.++|+.+|++...|.++...+|++.++.
T Consensus 184 ~L~~~~r~vl~l~~~~g----~----s~~EIA~~lgis~~tV~~~~~ra~~~Lr~~ 231 (236)
T PRK06986 184 SLPEREQLVLSLYYQEE----L----NLKEIGAVLGVSESRVSQIHSQAIKRLRAR 231 (236)
T ss_pred hCCHHHHHHHHhHhccC----C----CHHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 47888888887665442 2 346899999999999999999999998764
No 121
>PRK09413 IS2 repressor TnpA; Reviewed
Probab=25.77 E-value=1.1e+02 Score=25.63 Aligned_cols=48 Identities=6% Similarity=0.060 Sum_probs=32.4
Q ss_pred cCCCCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhh
Q 019170 260 RRAGKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRK 314 (345)
Q Consensus 260 rkr~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~ 314 (345)
+++++++.+-+...-..... +.+ ....+|++.|++..+|.+|..-.+.
T Consensus 8 ~~rr~ys~EfK~~aV~~~~~---~g~----sv~evA~e~gIs~~tl~~W~r~y~~ 55 (121)
T PRK09413 8 EKRRRRTTQEKIAIVQQSFE---PGM----TVSLVARQHGVAASQLFLWRKQYQE 55 (121)
T ss_pred CCCCCCCHHHHHHHHHHHHc---CCC----CHHHHHHHHCcCHHHHHHHHHHHhh
Confidence 34567888875544333333 232 2457899999999999999776654
No 122
>TIGR02947 SigH_actino RNA polymerase sigma-70 factor, TIGR02947 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and (with the exception of a paralog in Thermobifida fusca YX) one-to-a-genome distribution, to represent a conserved family. This family is restricted to the Actinobacteria and each gene examined is followed by an anti-sigma factor in an apparent operon.
Probab=25.73 E-value=32 Score=30.49 Aligned_cols=48 Identities=10% Similarity=0.098 Sum_probs=36.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.++.-.+... .+ -.++|+.+|++...|.++..-+|++.++.
T Consensus 131 ~Lp~~~r~i~~L~~~~g--~s------~~EIA~~lgis~~tVk~~l~Rar~~Lr~~ 178 (193)
T TIGR02947 131 GLPEEFRQAVYLADVEG--FA------YKEIAEIMGTPIGTVMSRLHRGRKQLRKQ 178 (193)
T ss_pred hCCHHHhhheeehhhcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 47777877775544442 22 35799999999999999999999998864
No 123
>TIGR02948 SigW_bacill RNA polymerase sigma-W factor. This sigma factor is restricted to certain lineages of the order Bacillales.
Probab=25.15 E-value=59 Score=28.23 Aligned_cols=48 Identities=8% Similarity=0.113 Sum_probs=37.1
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.--|... .+ -.++|+.+|++...|.+++...|++.+..
T Consensus 136 ~L~~~~r~v~~l~~~~g--~s------~~eIA~~lgis~~~v~~~l~Rar~~Lr~~ 183 (187)
T TIGR02948 136 ALPPKYRMVIVLKYMED--LS------LKEISEILDLPVGTVKTRIHRGREALRKQ 183 (187)
T ss_pred hCCHHHhHHhhhHHhcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 48888888886644331 22 35789999999999999999999988753
No 124
>PRK12515 RNA polymerase sigma factor; Provisional
Probab=25.08 E-value=76 Score=27.98 Aligned_cols=48 Identities=10% Similarity=0.132 Sum_probs=37.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.+|.--+... . .-.++|+.+|++...|.+-+...|++.++.
T Consensus 131 ~L~~~~r~vl~l~~~~~--~------s~~eIA~~lgis~~tV~~~l~Rar~~Lr~~ 178 (189)
T PRK12515 131 KLSPAHREIIDLVYYHE--K------SVEEVGEIVGIPESTVKTRMFYARKKLAEL 178 (189)
T ss_pred hCCHHHHHHHHHHHHcC--C------CHHHHHHHHCcCHHHHHHHHHHHHHHHHHH
Confidence 47888999986654441 2 235799999999999999999999988764
No 125
>PRK08583 RNA polymerase sigma factor SigB; Validated
Probab=24.76 E-value=69 Score=29.98 Aligned_cols=48 Identities=10% Similarity=0.159 Sum_probs=38.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.+|.-.|... .+ -.++|+.+|++...|.+|....+++.++.
T Consensus 205 ~L~~~~r~vl~l~~~~g--~s------~~eIA~~l~is~~tV~~~~~ra~~kLr~~ 252 (257)
T PRK08583 205 VLSDREKSIIQCTFIEN--LS------QKETGERLGISQMHVSRLQRQAIKKLREA 252 (257)
T ss_pred hCCHHHHHHHHHHHhCC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 48888999998766552 22 26899999999999999999999988753
No 126
>PRK07037 extracytoplasmic-function sigma-70 factor; Validated
Probab=24.75 E-value=74 Score=27.06 Aligned_cols=48 Identities=15% Similarity=0.179 Sum_probs=37.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.--|... .+ -.++|+.+|++...|.....-+|++.++.
T Consensus 109 ~L~~~~r~v~~l~~~~~--~s------~~EIA~~lgis~~tV~~~l~ra~~~lr~~ 156 (163)
T PRK07037 109 ELPARTRYAFEMYRLHG--ET------QKDIARELGVSPTLVNFMIRDALVHCRKC 156 (163)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 58888989886655442 22 35799999999999999998888887764
No 127
>PRK09415 RNA polymerase factor sigma C; Reviewed
Probab=24.71 E-value=64 Score=28.31 Aligned_cols=47 Identities=9% Similarity=0.175 Sum_probs=37.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
+||+..+.++.--+... .+ -.++|+.+|++...|.++..-.|++.++
T Consensus 127 ~L~~~~r~v~~l~~~~g--~s------~~EIA~~l~is~~tv~~~l~Ra~~~Lr~ 173 (179)
T PRK09415 127 SLPIKYREVIYLFYYEE--LS------IKEIAEVTGVNENTVKTRLKKAKELLKK 173 (179)
T ss_pred hCCHHHhhHhHhHHhcC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHH
Confidence 58888989886655552 23 2479999999999999999999988875
No 128
>PRK12528 RNA polymerase sigma factor; Provisional
Probab=24.59 E-value=83 Score=26.83 Aligned_cols=46 Identities=15% Similarity=0.254 Sum_probs=35.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNW 317 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~k 317 (345)
.||+.++.++.-.+... .+| .++|+.+|++...|.++....+++..
T Consensus 113 ~L~~~~r~v~~L~~~~g--~s~------~EIA~~l~is~~tV~~~l~ra~~~~~ 158 (161)
T PRK12528 113 GLPPLVKRAFLLAQVDG--LGY------GEIATELGISLATVKRYLNKAAMRCY 158 (161)
T ss_pred HCCHHHHHHHHHHHHcC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHH
Confidence 48888988886655542 333 47999999999999999998887654
No 129
>PRK12543 RNA polymerase sigma factor; Provisional
Probab=24.38 E-value=80 Score=27.62 Aligned_cols=47 Identities=13% Similarity=0.222 Sum_probs=37.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++.--+..+ .+ -.++|+.+|++...|.+.....|++.++
T Consensus 117 ~Lp~~~r~i~~l~~~e~--~s------~~EIA~~lgis~~tV~~~l~ra~~~Lr~ 163 (179)
T PRK12543 117 KLPYKLRQVIILRYLHD--YS------QEEIAQLLQIPIGTVKSRIHAALKKLRQ 163 (179)
T ss_pred hCCHHHHHHHHHHHHcc--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHH
Confidence 48888888887755443 22 3579999999999999999999999876
No 130
>PRK12539 RNA polymerase sigma factor; Provisional
Probab=24.35 E-value=73 Score=28.01 Aligned_cols=48 Identities=15% Similarity=0.065 Sum_probs=38.6
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.-.+... .+ -.++|+.+|++...|.++....|++.++.
T Consensus 131 ~L~~~~r~v~~l~~~~g--~s------~~eIA~~lgis~~tV~~~l~ra~~~Lr~~ 178 (184)
T PRK12539 131 RLPEKMRLAIQAVKLEG--LS------VAEAATRSGMSESAVKVSVHRGLKALAAL 178 (184)
T ss_pred hCCHHHHHHHHHHHHcC--Cc------HHHHHHHHCcCHHHHHHHHHHHHHHHHHH
Confidence 48999999987655442 22 35799999999999999999999988764
No 131
>PF13411 MerR_1: MerR HTH family regulatory protein; PDB: 2JML_A 3GP4_A 3GPV_B.
Probab=24.34 E-value=54 Score=24.14 Aligned_cols=18 Identities=6% Similarity=0.143 Sum_probs=16.2
Q ss_pred HHHHHHhCCChhhhhhhh
Q 019170 292 ARLVQETGLQLKQINNWF 309 (345)
Q Consensus 292 ~~LA~~tgLs~kQV~nWF 309 (345)
.++|+.+|++..+|..|-
T Consensus 4 ~eva~~~gvs~~tlr~y~ 21 (69)
T PF13411_consen 4 KEVAKLLGVSPSTLRYYE 21 (69)
T ss_dssp HHHHHHTTTTHHHHHHHH
T ss_pred HHHHHHHCcCHHHHHHHH
Confidence 478999999999999993
No 132
>smart00530 HTH_XRE Helix-turn-helix XRE-family like proteins.
Probab=24.17 E-value=62 Score=20.83 Aligned_cols=22 Identities=9% Similarity=0.176 Sum_probs=18.6
Q ss_pred HHHHHHHhCCChhhhhhhhhhh
Q 019170 291 KARLVQETGLQLKQINNWFINQ 312 (345)
Q Consensus 291 K~~LA~~tgLs~kQV~nWF~N~ 312 (345)
...||+.+|++..+|..|..+.
T Consensus 13 ~~~la~~~~i~~~~i~~~~~~~ 34 (56)
T smart00530 13 QEELAEKLGVSRSTLSRIENGK 34 (56)
T ss_pred HHHHHHHhCCCHHHHHHHHCCC
Confidence 4489999999999999997654
No 133
>KOG4040 consensus NADH:ubiquinone oxidoreductase, NDUFB8/ASHI subunit [Energy production and conversion]
Probab=24.04 E-value=47 Score=30.53 Aligned_cols=38 Identities=26% Similarity=0.569 Sum_probs=29.9
Q ss_pred hhHHHHHHHHHHHhcCCCCCC-HHHHHHHHHHhCCChhh
Q 019170 267 GDTTSLLKAWWLSHAKWPYPT-EEDKARLVQETGLQLKQ 304 (345)
Q Consensus 267 k~~~~iL~~Wf~~H~~~PYPs-~~eK~~LA~~tgLs~kQ 304 (345)
....+..-.|...|.-.|||+ ++||..-|++.||-+..
T Consensus 23 ~~g~rt~~gw~kD~kPgpyP~teeER~AAAkKY~lrpEd 61 (186)
T KOG4040|consen 23 PRGPRTFDGWYKDHKPGPYPTTEEERRAAAKKYGLRPED 61 (186)
T ss_pred ccccccccccccccCCCCCCCCHHHHHHHHHHhCCCHhh
Confidence 444566678999999999995 77888889999987653
No 134
>TIGR03001 Sig-70_gmx1 RNA polymerase sigma-70 factor, Myxococcales family 1. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are found in multiple copies in the order Myxococcales. This model supercedes TIGR02233, which has now been retired.
Probab=24.01 E-value=74 Score=30.11 Aligned_cols=48 Identities=15% Similarity=0.270 Sum_probs=38.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.++.-.+... .+ -.++|..+|++...|.++...+|++.++.
T Consensus 161 ~Lp~~~R~v~~L~~~eg--~S------~~EIA~~Lgis~~TVk~rl~RAr~~Lr~~ 208 (244)
T TIGR03001 161 ALSERERHLLRLHFVDG--LS------MDRIGAMYQVHRSTVSRWVAQARERLLER 208 (244)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCcCHHHHHHHHHHHHHHHHHH
Confidence 58888888887766653 22 34799999999999999999999988864
No 135
>PRK09636 RNA polymerase sigma factor SigJ; Provisional
Probab=23.68 E-value=1.1e+02 Score=29.30 Aligned_cols=51 Identities=14% Similarity=0.095 Sum_probs=39.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANPS 322 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~s 322 (345)
.||+.++.++.--+... .+| .++|+.+|++...|.+.+..+|++.++..+.
T Consensus 115 ~L~~~~R~v~~L~~~~g--~s~------~EIA~~lg~s~~tVk~~l~RAr~~Lr~~~~~ 165 (293)
T PRK09636 115 RLSPLERAAFLLHDVFG--VPF------DEIASTLGRSPAACRQLASRARKHVRAARPR 165 (293)
T ss_pred hCCHHHHHHHHHHHHhC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHHhhCCC
Confidence 38888888875544442 343 5799999999999999999999999986653
No 136
>PRK12513 RNA polymerase sigma factor; Provisional
Probab=23.36 E-value=35 Score=30.23 Aligned_cols=48 Identities=8% Similarity=0.098 Sum_probs=36.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||+..+.++.-.+... .+ -.++|+.+|++...|.++...+|++.++.
T Consensus 139 ~L~~~~r~i~~l~~~~g--~s------~~EIA~~lgis~~tV~~~l~ra~~~Lr~~ 186 (194)
T PRK12513 139 TLPDEQREVFLLREHGD--LE------LEEIAELTGVPEETVKSRLRYALQKLREL 186 (194)
T ss_pred hCCHhHhhheeeehccC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 46777777776544331 22 24789999999999999999999998864
No 137
>TIGR02943 Sig70_famx1 RNA polymerase sigma-70 factor, TIGR02943 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and one-to-a-genome distribution, to represent a conserved family.
Probab=23.07 E-value=87 Score=27.87 Aligned_cols=48 Identities=13% Similarity=0.129 Sum_probs=38.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.++.--|... .+ -.++|+.+|++..-|.+...-.|++.++.
T Consensus 131 ~L~~~~r~v~~l~~~~g--~s------~~EIA~~lgis~~tvk~rl~Rar~~Lr~~ 178 (188)
T TIGR02943 131 HLPEQTARVFMMREVLG--FE------SDEICQELEISTSNCHVLLYRARLSLRAC 178 (188)
T ss_pred hCCHHHHHHHHHHHHhC--CC------HHHHHHHhCCCHHHHHHHHHHHHHHHHHH
Confidence 58888888887766553 22 35899999999999999999999998763
No 138
>PHA01976 helix-turn-helix protein
Probab=22.88 E-value=61 Score=23.77 Aligned_cols=22 Identities=18% Similarity=0.377 Sum_probs=18.8
Q ss_pred HHHHHHhCCChhhhhhhhhhhh
Q 019170 292 ARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 292 ~~LA~~tgLs~kQV~nWF~N~R 313 (345)
.+||+.+|++...|.+|....+
T Consensus 19 ~~lA~~~gvs~~~v~~~e~g~~ 40 (67)
T PHA01976 19 PELSRRAGVRHSLIYDFEADKR 40 (67)
T ss_pred HHHHHHhCCCHHHHHHHHcCCC
Confidence 4799999999999999986543
No 139
>TIGR02941 Sigma_B RNA polymerase sigma-B factor. This sigma factor is restricted to certain lineages of the order Bacillales including Staphylococcus, Listeria and Bacillus.
Probab=22.70 E-value=92 Score=29.12 Aligned_cols=47 Identities=6% Similarity=0.055 Sum_probs=38.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||...+.++.-.|... .. -.++|+.+|++...|..+...++++.++
T Consensus 205 ~L~~~~r~ii~l~~~~g--~s------~~eIA~~lgis~~~V~~~~~ra~~~Lr~ 251 (255)
T TIGR02941 205 ILSEREKSIIHCTFEEN--LS------QKETGERLGISQMHVSRLQRQAISKLKE 251 (255)
T ss_pred cCCHHHHHHHHHHHcCC--CC------HHHHHHHHCcCHHHHHHHHHHHHHHHHH
Confidence 58888999887776552 22 2589999999999999999999988775
No 140
>PRK12545 RNA polymerase sigma factor; Provisional
Probab=22.45 E-value=84 Score=28.27 Aligned_cols=49 Identities=12% Similarity=0.112 Sum_probs=38.8
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHAN 320 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~ 320 (345)
.||...+.++.--+... .+ -.++|+.+|++...|.+....+|++.++..
T Consensus 139 ~Lp~~~r~v~~L~~~eg--~s------~~EIA~~lgis~~tVk~~l~RAr~~Lr~~l 187 (201)
T PRK12545 139 HLPEQIGRVFMMREFLD--FE------IDDICTELTLTANHCSVLLYRARTRLRTCL 187 (201)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHHCcCHHHHHHHHHHHHHHHHHHH
Confidence 48888988887665552 23 247899999999999999999999988743
No 141
>PF10782 DUF2602: Protein of unknown function (DUF2602); InterPro: IPR019718 This bacterial family of proteins that has no known function.
Probab=22.39 E-value=78 Score=24.30 Aligned_cols=40 Identities=20% Similarity=0.365 Sum_probs=27.7
Q ss_pred hHHHHHHHHHHH---HHHHHHHHHhHHhhhhHHHHHHHHHH---HHHHhhhh
Q 019170 132 KELDQFMTHYVL---LLYSFKEQLQQHVRVHAMEAVMACWD---LEQSLQSL 177 (345)
Q Consensus 132 pELDqFMeaYc~---vL~kykEEL~kP~~~~~~EA~~f~~~---iE~QL~sL 177 (345)
-|+|+.|++||. +-..++.+..+. -|..||-+ |=.||..|
T Consensus 8 ~~I~~L~dtyC~gC~lk~~~rk~~gk~------~ah~fCI~~CtVGkqik~l 53 (58)
T PF10782_consen 8 IEIDDLQDTYCKGCFLKKHFRKEKGKT------YAHSFCINQCTVGKQIKQL 53 (58)
T ss_pred HHHHHHHHHHhcCCchHHHhHHHhccc------ccchHHHHhCcHhHHHHHH
Confidence 478999999998 335667777774 48888863 34455444
No 142
>PF13730 HTH_36: Helix-turn-helix domain
Probab=22.23 E-value=2.3e+02 Score=19.88 Aligned_cols=48 Identities=8% Similarity=0.109 Sum_probs=31.2
Q ss_pred CCChhHHHHHHHHHHH--hcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhh
Q 019170 264 KLPGDTTSLLKAWWLS--HAKWPYPTEEDKARLVQETGLQLKQINNWFINQRK 314 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~--H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~ 314 (345)
.|+..++.++-.-+.. .....||| ...||+.+|++...|..+...-..
T Consensus 2 ~Ls~~~~~v~~~l~~~~~~~~~~~pS---~~~la~~~g~s~~Tv~~~i~~L~~ 51 (55)
T PF13730_consen 2 NLSPTAKLVYLYLASYANKNGGCFPS---QETLAKDLGVSRRTVQRAIKELEE 51 (55)
T ss_pred CCCHHHHHHHHHHHHhcCCCCCCCcC---HHHHHHHHCcCHHHHHHHHHHHHH
Confidence 4566665555332222 22347887 557999999999999998765443
No 143
>PRK09647 RNA polymerase sigma factor SigE; Reviewed
Probab=22.17 E-value=87 Score=28.47 Aligned_cols=48 Identities=15% Similarity=0.225 Sum_probs=37.1
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.++.--+... .+| .++|+.+|++...|.++..-.|++.++.
T Consensus 138 ~L~~~~r~v~~L~~~~g--~s~------~EIA~~Lgis~~tV~~~l~RArk~Lr~~ 185 (203)
T PRK09647 138 SLPPEFRAAVVLCDIEG--LSY------EEIAATLGVKLGTVRSRIHRGRQQLRAA 185 (203)
T ss_pred hCCHHHHHHHHHHHHcC--CCH------HHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 47777878765544442 333 5799999999999999999999998863
No 144
>PRK12538 RNA polymerase sigma factor; Provisional
Probab=22.16 E-value=70 Score=29.89 Aligned_cols=48 Identities=6% Similarity=0.133 Sum_probs=37.5
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||..++.++.-.|... .+ -.++|+.+|++...|.++....|++.++.
T Consensus 171 ~Lp~~~R~v~~L~~~eg--~s------~~EIA~~Lgis~~tVk~~l~RAr~kLr~~ 218 (233)
T PRK12538 171 RLPEQQRIAVILSYHEN--MS------NGEIAEVMDTTVAAVESLLKRGRQQLRDL 218 (233)
T ss_pred hCCHHHHHHhhhHHhcC--CC------HHHHHHHHCcCHHHHHHHHHHHHHHHHHH
Confidence 47888888876554442 23 35799999999999999999999998864
No 145
>COG2979 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.94 E-value=2.6e+02 Score=26.82 Aligned_cols=30 Identities=23% Similarity=0.261 Sum_probs=23.5
Q ss_pred CCCChHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHHH
Q 019170 128 VLDDKELDQFMTHYVLLLYSFKEQLQQHVRVHAMEAVMACWD 169 (345)
Q Consensus 128 ~g~dpELDqFMeaYc~vL~kykEEL~kP~~~~~~EA~~f~~~ 169 (345)
.|.|||..+|+|+ ||++|++ ..|-..+.++
T Consensus 142 sG~d~e~~~~le~----------El~~PlD--~~~ia~~a~~ 171 (225)
T COG2979 142 SGVDPEAQAFLEQ----------ELEQPLD--PDEIAAAARN 171 (225)
T ss_pred cCCCHHHHHHHHH----------HHhCCCC--HHHHHHHhcC
Confidence 5789999999987 8999986 4565566655
No 146
>TIGR02607 antidote_HigA addiction module antidote protein, HigA family. Members of this family form a distinct clade within the larger family HTH_3 of helix-turn-helix proteins, described by Pfam model pfam01381. Members of this clade are strictly bacterial and nearly always shorter than 110 amino acids. This family includes the characterized member HigA, without which the killer protein HigB cannot be cloned. The hig (host inhibition of growth) system is noted to be unusual in that killer protein is uncoded by the upstream member of the gene pair.
Probab=21.57 E-value=64 Score=24.26 Aligned_cols=22 Identities=14% Similarity=0.240 Sum_probs=19.4
Q ss_pred HHHHHHhCCChhhhhhhhhhhh
Q 019170 292 ARLVQETGLQLKQINNWFINQR 313 (345)
Q Consensus 292 ~~LA~~tgLs~kQV~nWF~N~R 313 (345)
..||+.+|++...|..|+.+.|
T Consensus 22 ~~lA~~~gis~~tis~~~~g~~ 43 (78)
T TIGR02607 22 RALAKALGVSRSTLSRIVNGRR 43 (78)
T ss_pred HHHHHHhCCCHHHHHHHHcCCC
Confidence 5799999999999999997654
No 147
>cd04761 HTH_MerR-SF Helix-Turn-Helix DNA binding domain of transcription regulators from the MerR superfamily. Helix-turn-helix (HTH) transcription regulator MerR superfamily, N-terminal domain. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription of multidrug/metal ion transporter genes and oxidative stress regulons by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=21.22 E-value=78 Score=21.51 Aligned_cols=19 Identities=16% Similarity=0.214 Sum_probs=16.6
Q ss_pred HHHHHHhCCChhhhhhhhh
Q 019170 292 ARLVQETGLQLKQINNWFI 310 (345)
Q Consensus 292 ~~LA~~tgLs~kQV~nWF~ 310 (345)
.++|+.+|++...|..|..
T Consensus 4 ~e~a~~~gv~~~tlr~~~~ 22 (49)
T cd04761 4 GELAKLTGVSPSTLRYYER 22 (49)
T ss_pred HHHHHHHCcCHHHHHHHHH
Confidence 4789999999999999953
No 148
>PRK12534 RNA polymerase sigma factor; Provisional
Probab=21.20 E-value=93 Score=27.22 Aligned_cols=47 Identities=11% Similarity=0.202 Sum_probs=36.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++..-|... .+ -.++|..+|++...|.+-...+|++.+.
T Consensus 137 ~L~~~~r~i~~l~~~~g--~s------~~eIA~~lgis~~~v~~~l~Rar~~Lr~ 183 (187)
T PRK12534 137 ELEPPRSELIRTAFFEG--IT------YEELAARTDTPIGTVKSWIRRGLAKLKA 183 (187)
T ss_pred hCCHHHHHHHHHHHHcC--CC------HHHHHHHhCCChhHHHHHHHHHHHHHHH
Confidence 47888888887666542 22 2479999999999999999999988775
No 149
>PRK04217 hypothetical protein; Provisional
Probab=21.12 E-value=1.5e+02 Score=25.21 Aligned_cols=49 Identities=8% Similarity=-0.034 Sum_probs=39.1
Q ss_pred CCCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 263 GKLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 263 ~~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
..|+.+.+.++..++.+. + .-.++|+.+|++...|.+.+...|++.+..
T Consensus 41 ~~Lt~eereai~l~~~eG----l----S~~EIAk~LGIS~sTV~r~L~RArkkLre~ 89 (110)
T PRK04217 41 IFMTYEEFEALRLVDYEG----L----TQEEAGKRMGVSRGTVWRALTSARKKVAQM 89 (110)
T ss_pred ccCCHHHHHHHHHHHHcC----C----CHHHHHHHHCcCHHHHHHHHHHHHHHHHHH
Confidence 467888888887776552 2 345799999999999999999999988764
No 150
>PRK07670 RNA polymerase sigma factor SigD; Validated
Probab=21.10 E-value=93 Score=29.12 Aligned_cols=47 Identities=9% Similarity=0.065 Sum_probs=37.9
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++.-.|.. .+ .-..+|..+|++...|.+++...|++.++
T Consensus 201 ~L~~~~r~vl~l~~~~----~~----s~~EIA~~lgis~~tV~~~~~ra~~~Lr~ 247 (251)
T PRK07670 201 QLSEKEQLVISLFYKE----EL----TLTEIGQVLNLSTSRISQIHSKALFKLKK 247 (251)
T ss_pred cCCHHHHHHHHHHHhc----CC----CHHHHHHHHCcCHHHHHHHHHHHHHHHHH
Confidence 4788888998776554 22 24689999999999999999999998875
No 151
>PRK12527 RNA polymerase sigma factor; Reviewed
Probab=21.06 E-value=1e+02 Score=26.21 Aligned_cols=47 Identities=13% Similarity=0.164 Sum_probs=38.5
Q ss_pred CChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 265 LPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 265 lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
||...+.++.-.+... .+ -.++|+.+|++..-|.+....+|++.++.
T Consensus 106 L~~~~r~v~~l~~~~~--~s------~~eIA~~lgis~~tv~~~l~ra~~~Lr~~ 152 (159)
T PRK12527 106 LPPACRDSFLLRKLEG--LS------HQQIAEHLGISRSLVEKHIVNAMKHCRVR 152 (159)
T ss_pred CCHHHHHHHHHHHHcC--CC------HHHHHHHhCCCHHHHHHHHHHHHHHHHHH
Confidence 8889999987776653 33 35899999999999999999999888764
No 152
>TIGR02980 SigBFG RNA polymerase sigma-70 factor, sigma-B/F/G subfamily. This group of similar sigma-70 factors includes clades found in Bacilli (including the sporulation factors SigF:TIGR02885 and SigG:TIGR02850 as well as SigB:TIGR02941), and the high GC gram positive bacteria (Actinobacteria) where a variable number of them are found depending on the lineage.
Probab=21.01 E-value=99 Score=28.19 Aligned_cols=47 Identities=11% Similarity=0.251 Sum_probs=38.3
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWH 318 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk 318 (345)
.||+..+.++...|.. . -.-.++|+.+|++...|..|....+++.++
T Consensus 178 ~L~~~~r~vl~l~y~~----~----~s~~eIA~~lgis~~~v~~~~~ra~~~Lr~ 224 (227)
T TIGR02980 178 ALPERERRILLLRFFE----D----KTQSEIAERLGISQMHVSRLLRRALKKLRE 224 (227)
T ss_pred cCCHHHHHHHHHHHhc----C----CCHHHHHHHHCcCHHHHHHHHHHHHHHHHH
Confidence 5889999998877654 2 134689999999999999999999988765
No 153
>PRK08241 RNA polymerase factor sigma-70; Validated
Probab=21.01 E-value=98 Score=30.05 Aligned_cols=50 Identities=16% Similarity=0.213 Sum_probs=39.4
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||...+.++.-.+... .+ -.++|+.+|++...|.+...-.|++.++..+
T Consensus 153 ~Lp~~~R~v~~L~~~~g--~s------~~EIA~~lgis~~tVk~~l~RAr~~Lr~~~~ 202 (339)
T PRK08241 153 HLPPRQRAVLILRDVLG--WS------AAEVAELLDTSVAAVNSALQRARATLAERGP 202 (339)
T ss_pred hCCHHHhhhhhhHHhhC--CC------HHHHHHHhCCCHHHHHHHHHHHHHHHhhcCC
Confidence 47888888886655552 22 2479999999999999999999999998543
No 154
>PRK09641 RNA polymerase sigma factor SigW; Provisional
Probab=20.94 E-value=89 Score=27.09 Aligned_cols=48 Identities=8% Similarity=0.152 Sum_probs=35.7
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.+|.--+... .+ -.++|+.+|++...|.+.....|++.++.
T Consensus 136 ~L~~~~r~il~l~~~~~--~s------~~eIA~~lgis~~~v~~~l~Rar~~Lr~~ 183 (187)
T PRK09641 136 QLPEKYRTVIVLKYIED--LS------LKEISEILDLPVGTVKTRIHRGREALRKQ 183 (187)
T ss_pred hCCHHHHHHhhhHHhhC--CC------HHHHHHHHCCCHHHHHHHHHHHHHHHHHH
Confidence 46777777774433221 22 35799999999999999999999988763
No 155
>PRK09637 RNA polymerase sigma factor SigZ; Provisional
Probab=20.55 E-value=94 Score=27.51 Aligned_cols=50 Identities=16% Similarity=0.142 Sum_probs=39.2
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhccccCC
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHANP 321 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~~~ 321 (345)
.||+..+.++.-.+.. .+| -.++|..+|++...|.+.+..+|++.++...
T Consensus 106 ~L~~~~r~i~~l~~~~----g~~----~~EIA~~lgis~~tV~~~l~Rar~~Lr~~l~ 155 (181)
T PRK09637 106 ALPEKYAEALRLTELE----GLS----QKEIAEKLGLSLSGAKSRVQRGRVKLKELLE 155 (181)
T ss_pred hCCHHHHHHHHHHHhc----CCC----HHHHHHHhCCCHHHHHHHHHHHHHHHHHHHH
Confidence 5788888888665544 222 3579999999999999999999999887543
No 156
>TIGR02479 FliA_WhiG RNA polymerase sigma factor, FliA/WhiG family. Most members of this family are the flagellar operon sigma factor FliA, controlling transcription of bacterial flagellar genes by RNA polymerase. An exception is the sigma factor WhiG in the genus Streptomyces, involved in the production of sporulating aerial mycelium.
Probab=20.13 E-value=1e+02 Score=28.23 Aligned_cols=48 Identities=10% Similarity=0.152 Sum_probs=39.0
Q ss_pred CCChhHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhCCChhhhhhhhhhhhhhcccc
Q 019170 264 KLPGDTTSLLKAWWLSHAKWPYPTEEDKARLVQETGLQLKQINNWFINQRKRNWHA 319 (345)
Q Consensus 264 ~lpk~~~~iL~~Wf~~H~~~PYPs~~eK~~LA~~tgLs~kQV~nWF~N~R~R~kk~ 319 (345)
.||...+.+|...|... + .-..+|+.+|++...|..+...++++.++.
T Consensus 175 ~L~~~~r~il~l~y~~~----~----s~~eIA~~lgis~~tV~~~~~ra~~~Lr~~ 222 (224)
T TIGR02479 175 SLSEREQLVLSLYYYEE----L----NLKEIGEVLGLTESRVSQIHSQALKKLRAK 222 (224)
T ss_pred hCCHHHHHHHHHHHhCC----C----CHHHHHHHhCCCHHHHHHHHHHHHHHHHHH
Confidence 48899999998877552 2 236899999999999999999999887753
Done!