Query 047804
Match_columns 81
No_of_seqs 102 out of 245
Neff 3.8
Searched_HMMs 46136
Date Fri Mar 29 03:22:53 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/047804.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/047804hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03195 DUF260: Protein of un 99.9 4.2E-28 9.2E-33 162.7 6.1 64 1-76 38-101 (101)
2 COG3416 Uncharacterized protei 87.2 2 4.4E-05 33.2 5.7 53 29-81 24-76 (233)
3 PF13334 DUF4094: Domain of un 87.0 0.68 1.5E-05 30.8 2.7 25 51-75 71-95 (95)
4 PRK10265 chaperone-modulator p 83.8 2.6 5.7E-05 27.7 4.3 31 49-79 67-97 (101)
5 PF09006 Surfac_D-trimer: Lung 82.6 2.9 6.2E-05 25.2 3.7 26 55-80 1-26 (46)
6 PF06698 DUF1192: Protein of u 80.8 3.6 7.7E-05 25.6 3.8 24 55-78 23-46 (59)
7 PF03242 LEA_3: Late embryogen 80.6 0.49 1.1E-05 31.6 -0.1 21 37-57 58-78 (93)
8 COG5509 Uncharacterized small 79.0 4.2 9.1E-05 26.0 3.8 23 55-77 27-49 (65)
9 PF12097 DUF3573: Protein of u 74.9 3.7 8.1E-05 33.8 3.4 23 53-75 42-64 (383)
10 PF05308 Mito_fiss_reg: Mitoch 74.3 3.5 7.5E-05 31.7 2.9 17 61-77 123-139 (253)
11 PRK00888 ftsB cell division pr 72.9 6.1 0.00013 26.3 3.5 30 49-78 21-52 (105)
12 PF15483 DUF4641: Domain of un 72.7 4.3 9.4E-05 34.0 3.3 31 49-80 415-445 (445)
13 PF13600 DUF4140: N-terminal d 72.0 8.1 0.00017 24.6 3.8 25 53-77 70-94 (104)
14 PF06696 Strep_SA_rep: Strepto 70.6 10 0.00022 20.0 3.4 23 58-80 3-25 (25)
15 cd04766 HTH_HspR Helix-Turn-He 68.8 12 0.00026 23.5 4.0 32 45-77 58-89 (91)
16 PF04977 DivIC: Septum formati 68.1 13 0.00029 22.0 4.0 24 55-78 26-49 (80)
17 PF13591 MerR_2: MerR HTH fami 66.5 9.4 0.0002 24.1 3.2 24 51-74 61-84 (84)
18 PF10883 DUF2681: Protein of u 66.2 11 0.00024 25.0 3.6 26 55-80 32-57 (87)
19 PF09849 DUF2076: Uncharacteri 63.8 25 0.00054 27.0 5.6 49 29-77 24-72 (247)
20 PF04977 DivIC: Septum formati 63.7 19 0.00041 21.3 4.1 27 53-79 17-43 (80)
21 PF05120 GvpG: Gas vesicle pro 63.3 8 0.00017 25.1 2.5 33 47-79 8-40 (79)
22 PF14282 FlxA: FlxA-like prote 63.3 14 0.00031 24.4 3.8 24 54-77 52-75 (106)
23 PF12325 TMF_TATA_bd: TATA ele 63.2 15 0.00033 25.3 4.0 30 50-79 13-42 (120)
24 PF15397 DUF4618: Domain of un 62.4 12 0.00027 29.1 3.8 27 53-79 81-107 (258)
25 PRK10803 tol-pal system protei 62.2 13 0.00028 28.1 3.8 26 54-79 55-80 (263)
26 KOG4418 Predicted membrane pro 61.8 0.75 1.6E-05 38.3 -3.1 61 21-81 35-95 (438)
27 PF06305 DUF1049: Protein of u 60.7 26 0.00057 20.5 4.3 25 55-79 43-67 (68)
28 TIGR02209 ftsL_broad cell divi 59.2 22 0.00049 21.6 3.9 28 52-79 30-57 (85)
29 PF12808 Mto2_bdg: Micro-tubul 57.6 41 0.00088 20.4 5.6 43 31-74 8-50 (52)
30 PRK09039 hypothetical protein; 57.6 15 0.00033 28.8 3.6 27 52-78 136-162 (343)
31 PF05308 Mito_fiss_reg: Mitoch 57.3 12 0.00027 28.6 3.0 22 52-73 121-142 (253)
32 COG5509 Uncharacterized small 56.4 16 0.00036 23.3 3.0 21 54-74 33-53 (65)
33 PHA02047 phage lambda Rz1-like 56.3 24 0.00052 24.3 4.0 31 50-80 18-54 (101)
34 PRK00295 hypothetical protein; 55.3 30 0.00064 21.5 4.0 27 53-79 26-52 (68)
35 PF07334 IFP_35_N: Interferon- 54.8 25 0.00054 22.9 3.7 23 55-77 2-24 (76)
36 PF11853 DUF3373: Protein of u 54.7 16 0.00036 30.8 3.5 33 46-79 18-50 (489)
37 cd00632 Prefoldin_beta Prefold 54.6 19 0.00042 23.3 3.2 38 43-80 67-104 (105)
38 TIGR01010 BexC_CtrB_KpsE polys 53.3 34 0.00073 26.2 4.8 29 52-80 169-197 (362)
39 PF08227 DASH_Hsk3: DASH compl 53.0 31 0.00067 20.4 3.6 25 54-78 3-27 (45)
40 PRK14127 cell division protein 52.7 29 0.00062 23.8 3.9 26 54-79 45-70 (109)
41 PF07716 bZIP_2: Basic region 52.6 36 0.00079 19.6 3.9 25 55-79 27-51 (54)
42 PRK04406 hypothetical protein; 51.9 37 0.0008 21.5 4.1 27 53-79 32-58 (75)
43 PRK02793 phi X174 lysis protei 51.7 38 0.00082 21.2 4.1 27 53-79 29-55 (72)
44 PF06698 DUF1192: Protein of u 51.7 35 0.00077 21.1 3.9 25 53-77 28-52 (59)
45 PF07106 TBPIP: Tat binding pr 51.5 28 0.00061 24.1 3.8 71 3-80 33-106 (169)
46 KOG4552 Vitamin-D-receptor int 50.4 71 0.0015 25.2 6.2 48 5-71 59-106 (272)
47 PRK00846 hypothetical protein; 50.0 39 0.00084 21.9 4.0 29 52-80 33-61 (77)
48 PF05983 Med7: MED7 protein; 49.7 92 0.002 22.1 6.4 43 3-78 114-156 (162)
49 smart00338 BRLZ basic region l 49.6 46 0.001 19.6 4.1 25 53-77 33-57 (65)
50 PF04102 SlyX: SlyX; InterPro 49.5 29 0.00063 21.3 3.3 26 54-79 26-51 (69)
51 PRK02119 hypothetical protein; 49.5 42 0.00092 21.1 4.1 27 53-79 30-56 (73)
52 PF14282 FlxA: FlxA-like prote 49.4 23 0.00051 23.4 3.0 23 52-74 18-40 (106)
53 PF04340 DUF484: Protein of un 49.4 33 0.00071 24.8 4.0 34 43-79 33-66 (225)
54 PRK00888 ftsB cell division pr 49.3 35 0.00076 22.7 3.9 28 53-80 34-61 (105)
55 PRK00736 hypothetical protein; 49.1 44 0.00096 20.7 4.1 27 53-79 26-52 (68)
56 PF08657 DASH_Spc34: DASH comp 48.7 35 0.00076 26.3 4.2 34 45-78 172-205 (259)
57 PF04728 LPP: Lipoprotein leuc 48.2 44 0.00096 20.6 3.9 20 55-74 12-31 (56)
58 smart00150 SPEC Spectrin repea 46.4 58 0.0012 18.8 4.5 34 45-78 23-56 (101)
59 PRK04325 hypothetical protein; 46.2 50 0.0011 20.7 4.0 27 53-79 30-56 (74)
60 TIGR03021 pilP_fam type IV pil 46.2 42 0.00091 22.9 4.0 24 52-75 4-27 (119)
61 PF10458 Val_tRNA-synt_C: Valy 45.9 42 0.0009 20.2 3.5 23 55-77 6-28 (66)
62 PF13600 DUF4140: N-terminal d 45.2 45 0.00097 21.1 3.8 27 54-80 78-104 (104)
63 PF11333 DUF3135: Protein of u 44.7 86 0.0019 20.3 5.4 30 40-69 53-82 (83)
64 PF06295 DUF1043: Protein of u 43.9 27 0.00058 23.8 2.7 23 55-77 27-49 (128)
65 PF08285 DPM3: Dolichol-phosph 43.2 39 0.00085 22.2 3.3 21 51-71 66-86 (91)
66 PRK14127 cell division protein 42.8 45 0.00097 22.8 3.6 27 54-80 38-64 (109)
67 PF11336 DUF3138: Protein of u 42.1 37 0.00079 29.1 3.7 27 52-78 24-50 (514)
68 PF06295 DUF1043: Protein of u 41.0 51 0.0011 22.4 3.7 21 60-80 25-45 (128)
69 COG3879 Uncharacterized protei 40.8 35 0.00075 26.6 3.2 46 33-80 39-84 (247)
70 KOG4196 bZIP transcription fac 40.6 47 0.001 23.9 3.6 51 29-79 55-107 (135)
71 PF14257 DUF4349: Domain of un 40.5 45 0.00097 24.5 3.6 29 51-79 160-188 (262)
72 PF15300 INT_SG_DDX_CT_C: INTS 40.3 47 0.001 20.9 3.2 26 4-41 24-51 (65)
73 PF11471 Sugarporin_N: Maltopo 39.9 75 0.0016 19.5 4.0 26 54-79 33-58 (60)
74 KOG4841 Dolichol-phosphate man 39.2 51 0.0011 22.5 3.4 22 50-71 69-90 (95)
75 PF08537 NBP1: Fungal Nap bind 39.0 45 0.00097 27.0 3.6 25 55-79 184-208 (323)
76 PF13863 DUF4200: Domain of un 38.9 60 0.0013 21.0 3.7 46 30-80 63-108 (126)
77 PF04420 CHD5: CHD5-like prote 38.8 41 0.00089 23.6 3.1 29 51-79 64-92 (161)
78 PRK09413 IS2 repressor TnpA; R 38.6 64 0.0014 21.3 3.8 23 55-77 80-102 (121)
79 PRK11239 hypothetical protein; 38.5 41 0.0009 25.8 3.2 27 53-79 183-209 (215)
80 PF12718 Tropomyosin_1: Tropom 38.4 54 0.0012 22.8 3.6 25 54-78 36-60 (143)
81 PF14931 IFT20: Intraflagellar 38.1 97 0.0021 21.3 4.8 26 55-80 82-107 (120)
82 PLN02720 complex II 37.9 14 0.0003 26.8 0.5 9 44-52 50-58 (140)
83 cd01111 HTH_MerD Helix-Turn-He 37.5 1E+02 0.0022 20.2 4.6 30 50-79 77-106 (107)
84 cd00890 Prefoldin Prefoldin is 37.4 64 0.0014 20.7 3.6 25 55-79 8-32 (129)
85 PF04012 PspA_IM30: PspA/IM30 37.2 1.1E+02 0.0024 21.8 5.1 40 31-77 8-47 (221)
86 PRK13169 DNA replication intia 37.1 81 0.0018 21.5 4.2 26 52-77 7-32 (110)
87 PF04447 DUF550: Protein of un 36.9 35 0.00075 23.4 2.3 25 45-69 14-38 (100)
88 PRK10963 hypothetical protein; 36.7 55 0.0012 24.0 3.6 35 41-78 28-62 (223)
89 PF04508 Pox_A_type_inc: Viral 36.1 64 0.0014 16.7 2.8 18 55-72 3-20 (23)
90 cd00584 Prefoldin_alpha Prefol 35.9 68 0.0015 21.0 3.6 25 55-79 8-32 (129)
91 KOG4098 Molecular chaperone Pr 35.7 1.7E+02 0.0036 21.3 5.8 63 4-74 50-114 (140)
92 PF02996 Prefoldin: Prefoldin 35.6 71 0.0015 20.3 3.6 25 54-78 85-109 (120)
93 PF05565 Sipho_Gp157: Siphovir 35.4 1.1E+02 0.0024 21.5 4.8 43 30-76 28-70 (162)
94 PLN02523 galacturonosyltransfe 34.8 2E+02 0.0044 24.9 7.1 44 30-78 158-201 (559)
95 TIGR00293 prefoldin, archaeal 34.6 73 0.0016 20.8 3.6 25 55-79 8-32 (126)
96 PF04999 FtsL: Cell division p 34.3 90 0.002 19.6 3.9 27 53-79 42-68 (97)
97 PRK13902 alaS alanyl-tRNA synt 34.2 1.3E+02 0.0029 27.0 6.1 45 35-79 743-787 (900)
98 TIGR03017 EpsF chain length de 34.0 66 0.0014 25.0 3.8 30 51-80 169-198 (444)
99 TIGR02231 conserved hypothetic 33.9 57 0.0012 26.4 3.6 26 54-79 72-97 (525)
100 PRK11677 hypothetical protein; 33.8 47 0.001 23.4 2.7 24 54-77 30-53 (134)
101 PF12518 DUF3721: Protein of u 33.6 16 0.00035 20.6 0.3 14 39-54 6-19 (34)
102 COG4985 ABC-type phosphate tra 33.4 1.5E+02 0.0032 23.8 5.6 39 34-79 209-247 (289)
103 TIGR02894 DNA_bind_RsfA transc 33.2 70 0.0015 23.5 3.6 25 55-79 113-137 (161)
104 PRK15396 murein lipoprotein; P 33.1 47 0.001 21.5 2.4 24 54-77 26-49 (78)
105 PF03623 Focal_AT: Focal adhes 33.0 7.2 0.00016 27.9 -1.5 25 43-67 6-34 (139)
106 PRK10884 SH3 domain-containing 32.9 69 0.0015 23.8 3.6 23 55-77 134-156 (206)
107 PF02970 TBCA: Tubulin binding 32.9 84 0.0018 20.3 3.6 29 52-80 6-34 (90)
108 PRK13922 rod shape-determining 32.9 85 0.0018 23.2 4.1 37 43-79 56-95 (276)
109 PF11461 RILP: Rab interacting 32.7 72 0.0016 19.9 3.1 20 61-80 11-30 (60)
110 PF14197 Cep57_CLD_2: Centroso 32.4 73 0.0016 19.9 3.1 24 57-80 2-25 (69)
111 PRK14872 rod shape-determining 32.4 75 0.0016 25.6 3.9 28 53-80 57-84 (337)
112 PF07820 TraC: TraC-like prote 32.2 91 0.002 21.1 3.8 22 55-76 4-25 (92)
113 PF15294 Leu_zip: Leucine zipp 32.0 69 0.0015 25.3 3.6 33 47-80 120-152 (278)
114 PRK14623 hypothetical protein; 31.8 89 0.0019 21.1 3.7 25 55-79 3-27 (106)
115 PRK14626 hypothetical protein; 31.7 1E+02 0.0022 20.9 4.0 26 53-78 5-30 (110)
116 PRK14625 hypothetical protein; 31.7 90 0.0019 21.2 3.7 24 55-78 4-27 (109)
117 PRK01203 prefoldin subunit alp 31.5 83 0.0018 22.2 3.6 25 55-79 9-33 (130)
118 TIGR00344 alaS alanine--tRNA l 31.5 1.5E+02 0.0032 26.5 5.8 44 36-79 709-752 (851)
119 COG5415 Predicted integral mem 31.2 89 0.0019 24.5 4.0 37 45-81 7-43 (251)
120 PRK11677 hypothetical protein; 31.2 1.1E+02 0.0024 21.5 4.2 21 60-80 29-49 (134)
121 PF07439 DUF1515: Protein of u 30.9 1.9E+02 0.0041 20.3 5.7 47 29-80 21-67 (112)
122 PF08286 Spc24: Spc24 subunit 30.9 16 0.00036 24.3 0.0 25 54-78 14-38 (118)
123 PF06005 DUF904: Protein of un 30.7 79 0.0017 19.9 3.1 21 48-68 13-33 (72)
124 TIGR03683 A-tRNA_syn_arch alan 30.4 1.7E+02 0.0036 26.4 6.0 45 35-79 747-791 (902)
125 PF07223 DUF1421: Protein of u 30.3 49 0.0011 26.9 2.6 15 66-80 14-28 (358)
126 PF06156 DUF972: Protein of un 30.3 1.2E+02 0.0027 20.3 4.2 23 54-76 9-31 (107)
127 KOG3436 60S ribosomal protein 30.2 63 0.0014 23.0 2.8 19 55-73 14-32 (123)
128 PRK03762 hypothetical protein; 30.1 1E+02 0.0022 20.7 3.7 23 56-78 8-30 (103)
129 PRK00587 hypothetical protein; 30.0 1E+02 0.0022 20.6 3.7 24 55-78 3-26 (99)
130 COG0013 AlaS Alanyl-tRNA synth 29.8 1.2E+02 0.0025 27.7 5.0 25 54-78 733-757 (879)
131 PF01025 GrpE: GrpE; InterPro 29.8 98 0.0021 21.0 3.7 25 55-79 20-44 (165)
132 PRK14629 hypothetical protein; 29.7 96 0.0021 20.8 3.6 22 57-78 7-28 (99)
133 PRK14622 hypothetical protein; 29.7 1E+02 0.0022 20.5 3.7 24 55-78 3-26 (103)
134 PF12443 AKNA: AT-hook-contain 29.3 1.9E+02 0.0042 19.9 5.5 59 6-79 4-71 (106)
135 PF06657 Cep57_MT_bd: Centroso 29.3 1.3E+02 0.0029 19.1 4.0 32 49-80 13-44 (79)
136 KOG2185 Predicted RNA-processi 29.2 77 0.0017 27.0 3.6 31 50-80 441-471 (486)
137 PF11570 E2R135: Coiled-coil r 29.0 1.5E+02 0.0033 21.4 4.6 47 34-80 49-104 (136)
138 PF02183 HALZ: Homeobox associ 29.0 1E+02 0.0022 17.8 3.2 20 57-76 23-42 (45)
139 PF12325 TMF_TATA_bd: TATA ele 28.9 1.2E+02 0.0027 20.8 4.1 25 53-77 30-54 (120)
140 COG0255 RpmC Ribosomal protein 28.7 1.4E+02 0.003 18.9 4.0 26 55-80 13-38 (69)
141 PF07106 TBPIP: Tat binding pr 28.4 87 0.0019 21.6 3.3 22 57-78 113-134 (169)
142 KOG4603 TBP-1 interacting prot 28.2 1.7E+02 0.0037 22.3 5.0 25 4-28 41-65 (201)
143 PF00631 G-gamma: GGL domain; 28.2 67 0.0015 19.4 2.4 17 55-71 4-20 (68)
144 PF02375 JmjN: jmjN domain; I 27.9 44 0.00096 18.4 1.4 22 3-28 11-32 (34)
145 PRK14624 hypothetical protein; 27.7 1.1E+02 0.0025 20.9 3.7 27 53-79 6-32 (115)
146 PRK15178 Vi polysaccharide exp 27.1 1.4E+02 0.003 24.9 4.7 39 36-74 266-307 (434)
147 PF03955 Adeno_PIX: Adenovirus 26.9 1.3E+02 0.0029 20.8 3.9 25 54-78 84-108 (109)
148 PRK11546 zraP zinc resistance 26.8 1.3E+02 0.0028 21.6 3.9 22 53-74 89-110 (143)
149 PLN02900 alanyl-tRNA synthetas 26.7 1.9E+02 0.0041 26.3 5.8 31 49-79 786-816 (936)
150 PRK15178 Vi polysaccharide exp 26.3 3.2E+02 0.0069 22.8 6.7 40 41-80 267-306 (434)
151 PF02050 FliJ: Flagellar FliJ 26.3 1.5E+02 0.0034 17.8 4.1 27 51-77 50-76 (123)
152 PF10737 GerPC: Spore germinat 26.1 59 0.0013 24.0 2.2 21 55-75 1-21 (176)
153 PRK11519 tyrosine kinase; Prov 26.1 98 0.0021 26.4 3.8 28 53-80 267-294 (719)
154 TIGR02047 CadR-PbrR Cd(II)/Pb( 26.1 2E+02 0.0042 19.2 4.6 32 46-77 72-103 (127)
155 cd00089 HR1 Protein kinase C-r 25.9 1.1E+02 0.0024 18.4 3.1 28 53-80 42-69 (72)
156 PRK00252 alaS alanyl-tRNA synt 25.8 2.1E+02 0.0046 25.5 5.9 42 38-79 704-745 (865)
157 TIGR00103 DNA_YbaB_EbfC DNA-bi 25.7 1.5E+02 0.0032 19.5 3.9 24 55-78 7-30 (102)
158 COG1730 GIM5 Predicted prefold 25.5 1.3E+02 0.0028 21.5 3.8 25 55-79 15-39 (145)
159 TIGR02044 CueR Cu(I)-responsiv 25.3 2E+02 0.0043 19.0 4.5 28 50-77 76-103 (127)
160 cd01106 HTH_TipAL-Mta Helix-Tu 25.3 1.5E+02 0.0033 18.7 3.8 28 52-79 72-99 (103)
161 PRK00153 hypothetical protein; 25.1 1.3E+02 0.0028 19.6 3.5 24 55-78 5-28 (104)
162 PF12737 Mating_C: C-terminal 24.9 69 0.0015 26.4 2.6 19 59-77 401-419 (419)
163 PF05546 She9_MDM33: She9 / Md 24.6 3.1E+02 0.0068 20.9 5.9 41 35-77 16-56 (207)
164 PF12001 DUF3496: Domain of un 24.5 1.1E+02 0.0023 21.2 3.1 17 57-73 4-20 (111)
165 COG3105 Uncharacterized protei 24.4 1.3E+02 0.0027 21.8 3.6 23 56-78 37-59 (138)
166 PRK10803 tol-pal system protei 24.3 1.1E+02 0.0023 23.1 3.4 24 54-77 62-85 (263)
167 PRK09039 hypothetical protein; 24.1 1.1E+02 0.0023 24.1 3.5 26 55-80 132-157 (343)
168 PRK03947 prefoldin subunit alp 23.8 1.4E+02 0.0031 19.8 3.6 22 56-77 16-37 (140)
169 PF05615 THOC7: Tho complex su 23.7 1.6E+02 0.0034 19.7 3.8 24 54-77 89-112 (139)
170 COG4467 Regulator of replicati 23.6 1.5E+02 0.0033 20.8 3.8 26 53-78 8-33 (114)
171 PF07996 T4SS: Type IV secreti 23.4 1.7E+02 0.0036 20.3 4.0 23 55-77 21-43 (195)
172 PRK14621 hypothetical protein; 23.4 1.6E+02 0.0034 20.0 3.7 24 55-78 6-29 (111)
173 PF06717 DUF1202: Protein of u 23.3 1.2E+02 0.0027 24.5 3.7 26 54-79 153-178 (308)
174 TIGR03007 pepcterm_ChnLen poly 23.3 1.4E+02 0.003 23.7 3.9 28 53-80 161-188 (498)
175 PRK10884 SH3 domain-containing 23.3 1.2E+02 0.0026 22.5 3.4 23 55-77 141-163 (206)
176 cd04784 HTH_CadR-PbrR Helix-Tu 23.2 2.3E+02 0.0049 18.6 4.5 30 47-76 73-102 (127)
177 PRK09841 cryptic autophosphory 22.9 1.2E+02 0.0027 25.9 3.8 29 52-80 266-294 (726)
178 PRK11239 hypothetical protein; 22.7 1.5E+02 0.0032 22.8 3.9 55 15-77 160-214 (215)
179 PF02185 HR1: Hr1 repeat; Int 22.7 1.8E+02 0.004 17.3 3.9 26 54-79 34-59 (70)
180 KOG0999 Microtubule-associated 22.6 1.4E+02 0.003 26.7 4.1 42 34-79 92-133 (772)
181 PRK14627 hypothetical protein; 22.5 1.7E+02 0.0036 19.3 3.7 23 56-78 4-26 (100)
182 COG2919 Septum formation initi 22.5 1.8E+02 0.0039 19.4 3.9 24 52-75 49-72 (117)
183 PF08232 Striatin: Striatin fa 22.4 1.8E+02 0.0039 20.0 4.0 28 53-80 4-38 (134)
184 PF08172 CASP_C: CASP C termin 22.3 1.4E+02 0.003 22.8 3.6 20 58-77 98-117 (248)
185 PF05816 TelA: Toxic anion res 22.3 2.4E+02 0.0051 21.9 5.0 21 57-77 88-108 (333)
186 PF00831 Ribosomal_L29: Riboso 22.3 1.8E+02 0.0039 17.1 3.9 24 55-78 9-32 (58)
187 cd01282 HTH_MerR-like_sg3 Heli 22.1 2.3E+02 0.0051 18.3 4.5 32 47-78 75-106 (112)
188 TIGR02051 MerR Hg(II)-responsi 21.9 2.1E+02 0.0044 18.9 4.1 29 49-77 72-100 (124)
189 PRK14164 heat shock protein Gr 21.8 1.9E+02 0.0041 21.9 4.2 36 45-80 69-104 (218)
190 PF05465 Halo_GVPC: Halobacter 21.6 1.5E+02 0.0033 16.0 2.8 25 55-79 1-25 (32)
191 KOG3478 Prefoldin subunit 6, K 21.6 1.6E+02 0.0035 20.9 3.6 27 54-80 84-110 (120)
192 PRK14149 heat shock protein Gr 21.6 1.7E+02 0.0037 21.7 3.9 28 53-80 43-70 (191)
193 PF05529 Bap31: B-cell recepto 21.5 1.6E+02 0.0035 20.7 3.7 25 53-77 161-185 (192)
194 PHA03386 P10 fibrous body prot 21.5 1.5E+02 0.0033 20.2 3.3 14 54-67 20-33 (94)
195 cd01109 HTH_YyaN Helix-Turn-He 21.5 2.2E+02 0.0047 18.3 4.0 28 51-78 77-104 (113)
196 PF07544 Med9: RNA polymerase 21.4 1.2E+02 0.0027 19.1 2.8 31 50-80 49-79 (83)
197 PRK05771 V-type ATP synthase s 21.4 1.5E+02 0.0032 24.9 3.9 29 51-79 213-241 (646)
198 COG0718 Uncharacterized protei 21.3 2.1E+02 0.0046 19.5 4.0 27 53-79 5-31 (105)
199 PF08172 CASP_C: CASP C termin 21.2 1.5E+02 0.0032 22.6 3.6 27 54-80 108-134 (248)
200 TIGR02338 gimC_beta prefoldin, 21.2 1.5E+02 0.0033 19.3 3.2 66 5-79 39-107 (110)
201 PF06818 Fez1: Fez1; InterPro 21.1 1.6E+02 0.0036 22.2 3.8 24 52-75 9-32 (202)
202 TIGR02231 conserved hypothetic 21.1 1.2E+02 0.0026 24.6 3.3 29 52-80 77-105 (525)
203 PF06810 Phage_GP20: Phage min 21.1 1.6E+02 0.0035 20.7 3.6 19 52-70 50-68 (155)
204 PF04697 Pinin_SDK_N: pinin/SD 20.9 1.6E+02 0.0035 21.2 3.5 24 54-77 4-27 (134)
205 cd04776 HTH_GnyR Helix-Turn-He 20.9 2.3E+02 0.0049 18.7 4.1 26 53-78 80-105 (118)
206 PF05377 FlaC_arch: Flagella a 20.8 2E+02 0.0044 17.6 3.5 19 55-73 16-34 (55)
207 PRK13923 putative spore coat p 20.8 1.6E+02 0.0034 21.7 3.6 27 54-80 112-138 (170)
208 TIGR02977 phageshock_pspA phag 20.6 3.1E+02 0.0067 19.9 5.1 44 31-74 9-52 (219)
209 PF10152 DUF2360: Predicted co 20.5 2.2E+02 0.0048 19.8 4.1 30 50-79 18-47 (148)
210 PF12709 Kinetocho_Slk19: Cent 20.4 2.1E+02 0.0045 19.1 3.8 24 54-77 50-73 (87)
211 KOG4673 Transcription factor T 20.4 1.7E+02 0.0037 26.8 4.2 37 36-78 855-891 (961)
212 PF10066 DUF2304: Uncharacteri 20.1 1.7E+02 0.0037 19.2 3.4 21 54-74 87-107 (115)
213 PF11559 ADIP: Afadin- and alp 20.0 1.8E+02 0.004 19.6 3.6 23 55-77 68-90 (151)
214 PRK10227 DNA-binding transcrip 20.0 2.8E+02 0.0061 18.8 4.5 31 47-77 73-103 (135)
No 1
>PF03195 DUF260: Protein of unknown function DUF260; InterPro: IPR004883 The lateral organ boundaries (LOB) gene is expressed at the adaxial base of initiating lateral organs and encodes a plant-specific protein of unknown function. The N-terminal one half of the LOB protein contains a conserved approximately 100-amino acid domain (the LOB domain) that is present in 42 other Arabidopsis thaliana proteins and in proteins from a variety of other plant species. Genes encoding LOB domain (LBD) proteins are expressed in a variety of temporal- and tissue-specific patterns, suggesting that they may function in diverse processes [] The LOB domain contains conserved blocks of amino acids that identify the LBD gene family. In particular, a conserved C-x(2)-C-x(6)-C-x(3)-C motif, which is defining feature of the LOB domain, is present in all LBD proteins. It is possible that this motif forms a new zinc finger [].
Probab=99.95 E-value=4.2e-28 Score=162.66 Aligned_cols=64 Identities=48% Similarity=0.727 Sum_probs=62.8
Q ss_pred CcchHHHHHhhhhhhhHHHHhhhccccCHHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 1 GMSFFLKLLESFRELQSWVNLATGACRTKRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGL 76 (81)
Q Consensus 1 G~snv~k~l~~~~~~~r~~~~~~g~~~~k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~l 76 (81)
|+|||.|||++++|++| +++|+||+|||++|.+||||||+|+|++||+|+..+++||+.++++|
T Consensus 38 G~sni~k~L~~~~~~~R------------~~a~~Sl~yEA~~R~~dPv~Gc~G~i~~L~~ql~~~~~el~~~~~~l 101 (101)
T PF03195_consen 38 GVSNISKMLQELPPEQR------------EDAMRSLVYEANARARDPVYGCVGIISQLQQQLQQLQAELALVRAQL 101 (101)
T ss_pred chhHHHHHHHhCCccch------------hhHHHHHHHHHHhhccCCCcchHHHHHHHHHHHHHHHHHHHHHHccC
Confidence 89999999999999999 99999999999999999999999999999999999999999999876
No 2
>COG3416 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=87.19 E-value=2 Score=33.19 Aligned_cols=53 Identities=13% Similarity=0.152 Sum_probs=49.3
Q ss_pred HHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccC
Q 047804 29 KRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQS 81 (81)
Q Consensus 29 k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q~ 81 (81)
|++.+..+|-|+-++.-|--|=-+-+|.-+++-|+.++.+++.++.+++.+|.
T Consensus 24 rD~~Ae~lI~~~~~~qP~a~Y~laQ~vliqE~ALk~a~~~i~eLe~ri~~lq~ 76 (233)
T COG3416 24 RDPQAEALIAEAVAKQPDAAYYLAQRVLIQEQALKKASTQIKELEKRIAILQA 76 (233)
T ss_pred CChHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 69999999999999999999999999999999999999999999999987763
No 3
>PF13334 DUF4094: Domain of unknown function (DUF4094)
Probab=87.04 E-value=0.68 Score=30.78 Aligned_cols=25 Identities=40% Similarity=0.445 Sum_probs=21.2
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQAG 75 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~~ 75 (81)
-.-+|..|.+.|..++.||+.++++
T Consensus 71 Th~aIq~LdKtIS~LEMELAaARa~ 95 (95)
T PF13334_consen 71 THEAIQSLDKTISSLEMELAAARAE 95 (95)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3457889999999999999999863
No 4
>PRK10265 chaperone-modulator protein CbpM; Provisional
Probab=83.75 E-value=2.6 Score=27.66 Aligned_cols=31 Identities=13% Similarity=0.190 Sum_probs=27.8
Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 49 YGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 49 ~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+-.++.|.+|-.||+.+++|+..++++|..|
T Consensus 67 ~~gialvl~LLd~i~~Lr~el~~L~~~l~~~ 97 (101)
T PRK10265 67 WPGIAVALTLLDEIAHLKQENRLLRQRLSRF 97 (101)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4467899999999999999999999999766
No 5
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=82.59 E-value=2.9 Score=25.16 Aligned_cols=26 Identities=27% Similarity=0.427 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
|..|.+|+..++.++..+|+-+..|+
T Consensus 1 i~aLrqQv~aL~~qv~~Lq~~fs~yK 26 (46)
T PF09006_consen 1 INALRQQVEALQGQVQRLQAAFSQYK 26 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56789999999999999998887764
No 6
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=80.80 E-value=3.6 Score=25.58 Aligned_cols=24 Identities=33% Similarity=0.494 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
|-.|+..|..|++|++.++++++.
T Consensus 23 v~EL~~RIa~L~aEI~R~~~~~~~ 46 (59)
T PF06698_consen 23 VEELEERIALLEAEIARLEAAIAK 46 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456777777888888777777653
No 7
>PF03242 LEA_3: Late embryogenesis abundant protein; InterPro: IPR004926 Late-embryogenesis abundant (LEA) genes encode a diverse group of proteins that accumulate to high levels during the maturation phase of seed development []. This group includes LEA-5 [], whose expression is induced by salt, drought and heat stress [], and related proteins. ; GO: 0006950 response to stress
Probab=80.57 E-value=0.49 Score=31.62 Aligned_cols=21 Identities=33% Similarity=0.400 Sum_probs=17.1
Q ss_pred HHHHhhhccCCccccHHHHHH
Q 047804 37 DYEANSRIRDPVYGCVGAVCH 57 (81)
Q Consensus 37 vYEA~aR~rDPV~Gc~G~I~~ 57 (81)
-.+-..|.+|||-|+|-...+
T Consensus 58 ~~~~~~W~pDPvTGyyrPen~ 78 (93)
T PF03242_consen 58 SKEKSSWMPDPVTGYYRPENH 78 (93)
T ss_pred cccccccccCCCCccccCCCC
Confidence 567789999999999976543
No 8
>COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown]
Probab=78.99 E-value=4.2 Score=26.03 Aligned_cols=23 Identities=39% Similarity=0.589 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|..|.+.|..||+|++.++++++
T Consensus 27 V~El~eRIalLq~EIeRlkAe~~ 49 (65)
T COG5509 27 VAELEERIALLQAEIERLKAELA 49 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566666666666666666654
No 9
>PF12097 DUF3573: Protein of unknown function (DUF3573); InterPro: IPR021956 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 372 to 530 amino acids in length.
Probab=74.92 E-value=3.7 Score=33.77 Aligned_cols=23 Identities=35% Similarity=0.528 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q 047804 53 GAVCHLQKQVSELQAELAKAQAG 75 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~ 75 (81)
-.|..||+||..||+|+..++.+
T Consensus 42 ~~i~~Lq~QI~~Lq~ei~~l~~~ 64 (383)
T PF12097_consen 42 QEISELQKQIQQLQAEINQLEEQ 64 (383)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 36889999999999999988865
No 10
>PF05308 Mito_fiss_reg: Mitochondrial fission regulator; InterPro: IPR007972 This family consists of several uncharacterised eukaryotic proteins of unknown function.
Probab=74.29 E-value=3.5 Score=31.67 Aligned_cols=17 Identities=35% Similarity=0.599 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHh
Q 047804 61 QVSELQAELAKAQAGLA 77 (81)
Q Consensus 61 qi~~lq~eLa~~~~~la 77 (81)
+|..|+.||+.+++|||
T Consensus 123 KIsALEdELs~LRaQIA 139 (253)
T PF05308_consen 123 KISALEDELSRLRAQIA 139 (253)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34445555555555554
No 11
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=72.93 E-value=6.1 Score=26.35 Aligned_cols=30 Identities=27% Similarity=0.234 Sum_probs=21.3
Q ss_pred cccHH--HHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 49 YGCVG--AVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 49 ~Gc~G--~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
+|-.| ...++++|+..++++++.++++.+.
T Consensus 21 ~g~~G~~~~~~l~~q~~~~~~e~~~l~~~n~~ 52 (105)
T PRK00888 21 FGKNGILDYWRVNDQVAAQQQTNAKLKARNDQ 52 (105)
T ss_pred ccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555 5778888888888888777766543
No 12
>PF15483 DUF4641: Domain of unknown function (DUF4641)
Probab=72.65 E-value=4.3 Score=33.99 Aligned_cols=31 Identities=32% Similarity=0.462 Sum_probs=25.9
Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 49 YGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 49 ~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
-||-.-+ -||++|++|+++|+.+|+-...||
T Consensus 415 qGCpRC~-~LQkEIedLreQLaamqsl~~kfq 445 (445)
T PF15483_consen 415 QGCPRCL-VLQKEIEDLREQLAAMQSLADKFQ 445 (445)
T ss_pred CCCcccH-HHHHHHHHHHHHHHHHHHHHHhhC
Confidence 4666554 599999999999999999888776
No 13
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=72.04 E-value=8.1 Score=24.64 Aligned_cols=25 Identities=24% Similarity=0.353 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-.+..|+.++..++.+++.+++++.
T Consensus 70 ~~~~~l~~~l~~l~~~~~~~~~~~~ 94 (104)
T PF13600_consen 70 PELKELEEELEALEDELAALQDEIQ 94 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3566666666666666665555543
No 14
>PF06696 Strep_SA_rep: Streptococcal surface antigen repeat; InterPro: IPR009578 This family consists of a number of ~25 residue long repeats found commonly in Streptococcal surface antigens although one copy is present in the HPSR2-heavy chain potential motor protein of Giardia lamblia (Giardia intestinalis) (Q24984 from SWISSPROT). This family is often found in conjunction with IPR001899 from INTERPRO.; PDB: 3IOX_A 3IPK_A 2WD6_B 1JMM_A.
Probab=70.60 E-value=10 Score=20.05 Aligned_cols=23 Identities=35% Similarity=0.371 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhhcc
Q 047804 58 LQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 58 Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.|..+...++||+.+|..++.++
T Consensus 3 Yqakla~YqaeLa~vqk~na~~~ 25 (25)
T PF06696_consen 3 YQAKLAQYQAELARVQKANADYE 25 (25)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHhhccC
Confidence 46678888999999998887653
No 15
>cd04766 HTH_HspR Helix-Turn-Helix DNA binding domain of the HspR transcription regulator. Helix-turn-helix (HTH) transcription regulator HspR, N-terminal domain. Heat shock protein regulators (HspR) have been shown to regulate expression of specific regulons in response to high temperature or high osmolarity in Streptomyces and Helicobacter, respectively. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
Probab=68.83 E-value=12 Score=23.49 Aligned_cols=32 Identities=28% Similarity=0.400 Sum_probs=24.6
Q ss_pred cCCccccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 45 RDPVYGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 45 rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
--|.-|. ..+..|..|++.++++|+.+++.|.
T Consensus 58 g~~l~~i-~~~l~l~~~~~~l~~~l~~l~~~~~ 89 (91)
T cd04766 58 GVNLAGV-KRILELEEELAELRAELDELRARLR 89 (91)
T ss_pred CCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 4455555 4455699999999999999998874
No 16
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=68.14 E-value=13 Score=21.99 Aligned_cols=24 Identities=25% Similarity=0.397 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
|..|+.+++.++++...++.++..
T Consensus 26 i~~l~~~i~~l~~e~~~L~~ei~~ 49 (80)
T PF04977_consen 26 IAELQKEIEELKKENEELKEEIER 49 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444443
No 17
>PF13591 MerR_2: MerR HTH family regulatory protein
Probab=66.52 E-value=9.4 Score=24.09 Aligned_cols=24 Identities=17% Similarity=0.343 Sum_probs=21.0
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
.+++|.+|-.+|..+++|+..+++
T Consensus 61 gi~lil~LLd~i~~L~~el~~L~~ 84 (84)
T PF13591_consen 61 GIALILDLLDRIEQLRRELRELRR 84 (84)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhC
Confidence 478999999999999999988763
No 18
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=66.24 E-value=11 Score=24.99 Aligned_cols=26 Identities=27% Similarity=0.275 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
+-.|+.++++++.|.+.+++++..|+
T Consensus 32 ~~kL~~en~qlk~Ek~~~~~qvkn~~ 57 (87)
T PF10883_consen 32 NAKLQKENEQLKTEKAVAETQVKNAK 57 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67788888888888888888887764
No 19
>PF09849 DUF2076: Uncharacterized protein conserved in bacteria (DUF2076); InterPro: IPR018648 This family of hypothetical prokaryotic proteins has no known function but includes putative perimplasmic ligand-binding sensor proteins.
Probab=63.82 E-value=25 Score=27.03 Aligned_cols=49 Identities=18% Similarity=0.135 Sum_probs=45.3
Q ss_pred HHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 29 KRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 29 k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|+--++.+|=++=.|.-|-+|=-+-.|.-++.=|+.+++++..++++|.
T Consensus 24 rD~eAe~lI~~~~~~qP~A~Y~laQ~vlvQE~AL~~a~~ri~eLe~ql~ 72 (247)
T PF09849_consen 24 RDPEAEALIAQALARQPDAPYYLAQTVLVQEQALKQAQARIQELEAQLQ 72 (247)
T ss_pred CCHHHHHHHHHHHHhCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4888999999999999999999999999999999999999999999874
No 20
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=63.71 E-value=19 Score=21.28 Aligned_cols=27 Identities=19% Similarity=0.298 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
..+..+++++..++.+++.++++...+
T Consensus 17 ~~~~~~~~ei~~l~~~i~~l~~e~~~L 43 (80)
T PF04977_consen 17 SRYYQLNQEIAELQKEIEELKKENEEL 43 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346788889999999999888877654
No 21
>PF05120 GvpG: Gas vesicle protein G ; InterPro: IPR007804 Gas vesicles are intracellular, protein-coated, and hollow organelles found in cyanobacteria and halophilic archaea. They are permeable to ambient gases by diffusion and provide buoyancy, enabling cells to move upwards in water to access oxygen and/or light. Proteins containing this family are involved in the formation of gas vesicles [].
Probab=63.35 E-value=8 Score=25.14 Aligned_cols=33 Identities=27% Similarity=0.245 Sum_probs=20.4
Q ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 47 PVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 47 PV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
||.|.+-+.-+++.+...---+-+.++++|+..
T Consensus 8 Pvrgv~wv~e~I~~~Ae~E~~Dp~~i~~~L~~L 40 (79)
T PF05120_consen 8 PVRGVVWVAEQIQEQAERELYDPAAIRRELAEL 40 (79)
T ss_pred hHHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHH
Confidence 666666677777776655444555666666543
No 22
>PF14282 FlxA: FlxA-like protein
Probab=63.26 E-value=14 Score=24.42 Aligned_cols=24 Identities=38% Similarity=0.452 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-+..||.||..|+++|+.++.+.+
T Consensus 52 q~q~Lq~QI~~LqaQI~qlq~q~~ 75 (106)
T PF14282_consen 52 QIQLLQAQIQQLQAQIAQLQSQQA 75 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345566666666666665555543
No 23
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=63.23 E-value=15 Score=25.29 Aligned_cols=30 Identities=40% Similarity=0.392 Sum_probs=23.4
Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 50 GCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 50 Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.-+..|..|+.+|..+..|++.++.+++..
T Consensus 13 ~~~~~ve~L~s~lr~~E~E~~~l~~el~~l 42 (120)
T PF12325_consen 13 PSVQLVERLQSQLRRLEGELASLQEELARL 42 (120)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346778888888888888888888887754
No 24
>PF15397 DUF4618: Domain of unknown function (DUF4618)
Probab=62.37 E-value=12 Score=29.08 Aligned_cols=27 Identities=33% Similarity=0.586 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
-.+..||+|+..+.+.+..++.+|.+.
T Consensus 81 ~~l~~Lq~ql~~l~akI~k~~~el~~L 107 (258)
T PF15397_consen 81 SKLSKLQQQLEQLDAKIQKTQEELNFL 107 (258)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457899999999999999999999874
No 25
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=62.23 E-value=13 Score=28.09 Aligned_cols=26 Identities=19% Similarity=0.275 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
....||+||+.++.|++.++.++...
T Consensus 55 ~~~~l~~ql~~lq~ev~~LrG~~E~~ 80 (263)
T PRK10803 55 LLTQLQQQLSDNQSDIDSLRGQIQEN 80 (263)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHH
Confidence 45688999999999999998887643
No 26
>KOG4418 consensus Predicted membrane protein [Function unknown]
Probab=61.78 E-value=0.75 Score=38.28 Aligned_cols=61 Identities=16% Similarity=0.115 Sum_probs=45.0
Q ss_pred hhhccccCHHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccC
Q 047804 21 LATGACRTKRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQS 81 (81)
Q Consensus 21 ~~~g~~~~k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q~ 81 (81)
+.+|..|++.+-...+.||+.----||+|||-++|..----..---.|+-++.++|.+|+|
T Consensus 35 r~S~~~nv~sa~~~kltde~lL~~l~~i~~~~~~~l~~wtls~ppd~ev~rl~~qLkff~c 95 (438)
T KOG4418|consen 35 RTSGARNVVSAGGQKLTDEELLSSLDQIYPCGHVVLAEWTLSTPPDLEVPRLIQQLKFFAC 95 (438)
T ss_pred cccceeeccccccccccHHHHHhhccccccccceeeeeEeecCCchhhhHhHHHHHHHHHh
Confidence 4678888888888899999999999999999988753222222223666777777777765
No 27
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=60.74 E-value=26 Score=20.46 Aligned_cols=25 Identities=24% Similarity=0.389 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
...+..++..++.+++.++.+++..
T Consensus 43 ~~~~r~~~~~~~k~l~~le~e~~~l 67 (68)
T PF06305_consen 43 RLRLRRRIRRLRKELKKLEKELEQL 67 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3566788888888888888887653
No 28
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=59.25 E-value=22 Score=21.61 Aligned_cols=28 Identities=25% Similarity=0.252 Sum_probs=17.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
...+..++++++.+++|-..++.++..+
T Consensus 30 ~~~~~~~~~~~~~l~~en~~L~~ei~~l 57 (85)
T TIGR02209 30 NNELQKLQLEIDKLQKEWRDLQLEVAEL 57 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445666666666666666666666543
No 29
>PF12808 Mto2_bdg: Micro-tubular organiser Mto1 C-term Mto2-binding region; InterPro: IPR024545 This domain occurs at the C terminus of microtubule organising proteins in both budding and fission fungi. In Schizosaccharomyces pombe it has been shown to interact with the Mto2p protein, an interaction which is critical for anchoring the cytokinetic actin ring to the medial region of the cell and for proper coordination of mitosis with cytokinesis [, ].
Probab=57.63 E-value=41 Score=20.41 Aligned_cols=43 Identities=19% Similarity=0.213 Sum_probs=33.0
Q ss_pred HHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 31 DCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 31 da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
+...-+.-|=+.|..|| .|+--.|..|..+...|+++|+..|.
T Consensus 8 ELe~klkaerE~R~~d~-~~a~~rl~~l~~EN~~Lr~eL~~~r~ 50 (52)
T PF12808_consen 8 ELERKLKAEREARSLDR-SAARKRLSKLEGENRLLRAELERLRS 50 (52)
T ss_pred HHHHHHHHhHHhccCCc-hhHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34445666777888888 57778888888888888888887764
No 30
>PRK09039 hypothetical protein; Validated
Probab=57.55 E-value=15 Score=28.82 Aligned_cols=27 Identities=30% Similarity=0.385 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.-.|..|++||..|+++|+.+++.|+.
T Consensus 136 ~~~V~~L~~qI~aLr~Qla~le~~L~~ 162 (343)
T PRK09039 136 LAQVELLNQQIAALRRQLAALEAALDA 162 (343)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345777777888777777777777764
No 31
>PF05308 Mito_fiss_reg: Mitochondrial fission regulator; InterPro: IPR007972 This family consists of several uncharacterised eukaryotic proteins of unknown function.
Probab=57.35 E-value=12 Score=28.65 Aligned_cols=22 Identities=27% Similarity=0.468 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q 047804 52 VGAVCHLQKQVSELQAELAKAQ 73 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~ 73 (81)
...|+.||.++..|++|+|++-
T Consensus 121 lqKIsALEdELs~LRaQIA~IV 142 (253)
T PF05308_consen 121 LQKISALEDELSRLRAQIAKIV 142 (253)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4579999999999999999764
No 32
>COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown]
Probab=56.43 E-value=16 Score=23.33 Aligned_cols=21 Identities=43% Similarity=0.594 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q 047804 54 AVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
.|-.||.+|..+++|+++-.+
T Consensus 33 RIalLq~EIeRlkAe~~kK~~ 53 (65)
T COG5509 33 RIALLQAEIERLKAELAKKKA 53 (65)
T ss_pred HHHHHHHHHHHHHHHHHhhhc
Confidence 478999999999999987543
No 33
>PHA02047 phage lambda Rz1-like protein
Probab=56.32 E-value=24 Score=24.31 Aligned_cols=31 Identities=19% Similarity=0.176 Sum_probs=21.7
Q ss_pred ccHHHHH------HHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 50 GCVGAVC------HLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 50 Gc~G~I~------~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
|.||.+. .-++..+++.++|+.++.++..||
T Consensus 18 ~~y~~~~~~r~~g~~h~~a~~la~qLE~a~~r~~~~Q 54 (101)
T PHA02047 18 ASYGFVQSYRALGIAHEEAKRQTARLEALEVRYATLQ 54 (101)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555 335677788888888888887776
No 34
>PRK00295 hypothetical protein; Provisional
Probab=55.29 E-value=30 Score=21.45 Aligned_cols=27 Identities=19% Similarity=0.183 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+|...|++|..++.+|..+..+|...
T Consensus 26 ~~v~~Qq~~I~~L~~ql~~L~~rl~~~ 52 (68)
T PRK00295 26 DVLVEQQRVIERLQLQMAALIKRQEEM 52 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 466777788888888888777766543
No 35
>PF07334 IFP_35_N: Interferon-induced 35 kDa protein (IFP 35) N-terminus; InterPro: IPR009938 This entry represents the N terminus of interferon-induced 35 kDa protein (IFP 35) (approximately 80 residues long), which contains a leucine zipper motif in an alpha helical configuration []. This group of proteins also includes N-myc-interactor (Nmi), a homologous interferon-induced protein.
Probab=54.78 E-value=25 Score=22.92 Aligned_cols=23 Identities=26% Similarity=0.446 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|..|+++..+|+.+|.++.++|.
T Consensus 2 i~ei~eEn~~Lk~eiqkle~ELq 24 (76)
T PF07334_consen 2 IHEIQEENARLKEEIQKLEAELQ 24 (76)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Confidence 56777777778877777777665
No 36
>PF11853 DUF3373: Protein of unknown function (DUF3373); InterPro: IPR021803 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Probab=54.74 E-value=16 Score=30.80 Aligned_cols=33 Identities=18% Similarity=0.210 Sum_probs=23.5
Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 46 DPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 46 DPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.|...-.--+..+| ||+.|++||+.+++++..+
T Consensus 18 ~~~~a~~~~~~~~q-kie~L~kql~~Lk~q~~~l 50 (489)
T PF11853_consen 18 LPAAAMADDIDLLQ-KIEALKKQLEELKAQQDDL 50 (489)
T ss_pred cchhhhhhhhHHHH-HHHHHHHHHHHHHHhhccc
Confidence 44444445555566 9999999999999887644
No 37
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=54.57 E-value=19 Score=23.26 Aligned_cols=38 Identities=18% Similarity=0.228 Sum_probs=28.7
Q ss_pred hccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 43 RIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 43 R~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
...+=+-.+-..|..|..++..++.++...+.+|..+|
T Consensus 67 ~Le~~~e~le~~i~~l~~~~~~l~~~~~elk~~l~~~~ 104 (105)
T cd00632 67 ELKERLETIELRIKRLERQEEDLQEKLKELQEKIQQAQ 104 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33444455667788888888888888888888888776
No 38
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=53.28 E-value=34 Score=26.21 Aligned_cols=29 Identities=21% Similarity=0.258 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
......+++|+..++.+|..++..|..||
T Consensus 169 ~~a~~fl~~ql~~~~~~l~~ae~~l~~fr 197 (362)
T TIGR01010 169 KDTIAFAENEVKEAEQRLNATKAELLKYQ 197 (362)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667888888888888888888888776
No 39
>PF08227 DASH_Hsk3: DASH complex subunit Hsk3 like; InterPro: IPR013183 This is a family of fungal proteins of unknown function.
Probab=53.02 E-value=31 Score=20.37 Aligned_cols=25 Identities=36% Similarity=0.482 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
-+.+|..|+..|++-|+.+.+.|..
T Consensus 3 q~s~L~~qL~qL~aNL~~t~~~l~~ 27 (45)
T PF08227_consen 3 QYSHLASQLAQLQANLADTENLLEM 27 (45)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 4678899999999999988887764
No 40
>PRK14127 cell division protein GpsB; Provisional
Probab=52.71 E-value=29 Score=23.76 Aligned_cols=26 Identities=15% Similarity=0.244 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+..|+.++..++.+|+..+.+++.+
T Consensus 45 e~~~Lk~e~~~l~~~l~e~~~~~~~~ 70 (109)
T PRK14127 45 EIEELQQENARLKAQVDELTKQVSVG 70 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 45566666666666666666655543
No 41
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=52.62 E-value=36 Score=19.65 Aligned_cols=25 Identities=24% Similarity=0.323 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..|+.++..|+.+...++++++.+
T Consensus 27 ~~~le~~~~~L~~en~~L~~~i~~L 51 (54)
T PF07716_consen 27 EEELEQEVQELEEENEQLRQEIAQL 51 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455666666666666666655543
No 42
>PRK04406 hypothetical protein; Provisional
Probab=51.94 E-value=37 Score=21.55 Aligned_cols=27 Identities=15% Similarity=0.207 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+|...|++|..++.+|..+..+|...
T Consensus 32 ~~v~~Qq~~I~~L~~ql~~L~~rl~~~ 58 (75)
T PRK04406 32 DALSQQQLLITKMQDQMKYVVGKVKNM 58 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 467777888888888888777776544
No 43
>PRK02793 phi X174 lysis protein; Provisional
Probab=51.74 E-value=38 Score=21.20 Aligned_cols=27 Identities=15% Similarity=0.100 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
-+|...|++|..++.+|..+..+|...
T Consensus 29 ~~v~~Qq~~I~~L~~~l~~L~~rl~~~ 55 (72)
T PRK02793 29 VTVTAHEMEMAKLRDHLRLLTEKLKAS 55 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 356667777777777777777666543
No 44
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=51.68 E-value=35 Score=21.11 Aligned_cols=25 Identities=28% Similarity=0.417 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
..|..|+.+|..++++++.-++..+
T Consensus 28 ~RIa~L~aEI~R~~~~~~~K~a~r~ 52 (59)
T PF06698_consen 28 ERIALLEAEIARLEAAIAKKSASRA 52 (59)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4688999999999999998877654
No 45
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=51.50 E-value=28 Score=24.12 Aligned_cols=71 Identities=20% Similarity=0.192 Sum_probs=41.3
Q ss_pred chHHHHHhhhhhhhHHHHhhhccccC---HHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 3 SFFLKLLESFRELQSWVNLATGACRT---KRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 3 snv~k~l~~~~~~~r~~~~~~g~~~~---k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
..+.|.|.+|-...+-+...||--+| +.+...++.= .+ ...--.-|..|+.++..++.+...++++|+.+
T Consensus 33 ~~v~k~Ld~L~~~g~i~~K~~GKqkiY~~~Q~~~~~~s~-ee------l~~ld~ei~~L~~el~~l~~~~k~l~~eL~~L 105 (169)
T PF07106_consen 33 TAVQKALDSLVEEGKIVEKEYGKQKIYFANQDELEVPSP-EE------LAELDAEIKELREELAELKKEVKSLEAELASL 105 (169)
T ss_pred HHHHHHHHHHHhCCCeeeeeecceEEEeeCccccCCCCc-hh------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34678888888887777777776666 2222111000 00 00112246777777777777777777777654
Q ss_pred c
Q 047804 80 Q 80 (81)
Q Consensus 80 q 80 (81)
.
T Consensus 106 ~ 106 (169)
T PF07106_consen 106 S 106 (169)
T ss_pred h
Confidence 3
No 46
>KOG4552 consensus Vitamin-D-receptor interacting protein complex component [Transcription]
Probab=50.37 E-value=71 Score=25.17 Aligned_cols=48 Identities=25% Similarity=0.309 Sum_probs=30.9
Q ss_pred HHHHHhhhhhhhHHHHhhhccccCHHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHH
Q 047804 5 FLKLLESFRELQSWVNLATGACRTKRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAK 71 (81)
Q Consensus 5 v~k~l~~~~~~~r~~~~~~g~~~~k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~ 71 (81)
|-+||+-.++.+. ++.+|..+- |..-.|| .+|.+||.++...+.-|+.
T Consensus 59 f~~llkla~eq~k-----------~e~~m~~Le--a~VEkrD------~~IQqLqk~LK~aE~iLtt 106 (272)
T KOG4552|consen 59 FKTLLKLAPEQQK-----------REQLMRTLE--AHVEKRD------EVIQQLQKNLKSAEVILTT 106 (272)
T ss_pred HHHHHHHhHhHHH-----------HHHHHHHHH--HHHHHhH------HHHHHHHHHHHHHHHHHHH
Confidence 4455555555443 377888774 4444445 3599999999887666653
No 47
>PRK00846 hypothetical protein; Provisional
Probab=49.95 E-value=39 Score=21.89 Aligned_cols=29 Identities=7% Similarity=-0.019 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
-.+|...|++|..++.+|..+..+|..++
T Consensus 33 N~~v~~qq~~I~~L~~ql~~L~~rL~~~~ 61 (77)
T PRK00846 33 SEALADARLTGARNAELIRHLLEDLGKVR 61 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 35677777888888888887777776654
No 48
>PF05983 Med7: MED7 protein; InterPro: IPR009244 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This family consists of several eukaryotic proteins, which are homologues of the yeast MED7 protein. Activation of gene transcription in metazoans is a multistep process that is triggered by factors that recognise transcriptional enhancer sites in DNA. These factors work with co-activators such as MED7 to direct transcriptional initiation by the RNA polymerase II apparatus [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex; PDB: 3FBI_C 3FBN_A 1YKH_A 1YKE_A.
Probab=49.73 E-value=92 Score=22.13 Aligned_cols=43 Identities=19% Similarity=0.277 Sum_probs=29.9
Q ss_pred chHHHHHhhhhhhhHHHHhhhccccCHHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 3 SFFLKLLESFRELQSWVNLATGACRTKRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 3 snv~k~l~~~~~~~r~~~~~~g~~~~k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.|+--+|++++|+|- ++. .|.-|+.|+++-+++++..+..+..
T Consensus 114 ~NmhhllNeyRPhQA------------Ret---------------------Li~~me~Ql~~kr~~i~~i~~~~~~ 156 (162)
T PF05983_consen 114 INMHHLLNEYRPHQA------------RET---------------------LIMMMEEQLEEKREEIEEIRKVCEK 156 (162)
T ss_dssp HHHHHHHHHTHHHHH------------HHH---------------------HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhCHHHH------------HHH---------------------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 477778888888875 333 3456777888888877777766543
No 49
>smart00338 BRLZ basic region leucin zipper.
Probab=49.56 E-value=46 Score=19.63 Aligned_cols=25 Identities=20% Similarity=0.304 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+.|..|+.+...|+.++..++.++.
T Consensus 33 ~~~~~L~~en~~L~~~~~~l~~e~~ 57 (65)
T smart00338 33 RKVEQLEAENERLKKEIERLRRELE 57 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555555554443
No 50
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=49.55 E-value=29 Score=21.32 Aligned_cols=26 Identities=27% Similarity=0.317 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+|...|++|..++.++..+..+|..+
T Consensus 26 ~v~~Qq~~I~~L~~~l~~L~~rl~~~ 51 (69)
T PF04102_consen 26 VVTEQQRQIDRLQRQLRLLRERLREL 51 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHT----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45566677777777777766666544
No 51
>PRK02119 hypothetical protein; Provisional
Probab=49.54 E-value=42 Score=21.07 Aligned_cols=27 Identities=22% Similarity=0.166 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+|..-|++|..++.+|..+..+|...
T Consensus 30 ~~v~~Qq~~id~L~~ql~~L~~rl~~~ 56 (73)
T PRK02119 30 QALIEQQFVIDKMQVQLRYMANKLKDM 56 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 456677777777777777777766543
No 52
>PF14282 FlxA: FlxA-like protein
Probab=49.44 E-value=23 Score=23.41 Aligned_cols=23 Identities=30% Similarity=0.411 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q 047804 52 VGAVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
-..|..|++||..|+.+|..+..
T Consensus 18 ~~~I~~L~~Qi~~Lq~ql~~l~~ 40 (106)
T PF14282_consen 18 DSQIEQLQKQIKQLQEQLQELSQ 40 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHc
Confidence 66788888888888888877664
No 53
>PF04340 DUF484: Protein of unknown function, DUF484; InterPro: IPR007435 This family consists of several proteins of uncharacterised function.; PDB: 3E98_B.
Probab=49.42 E-value=33 Score=24.79 Aligned_cols=34 Identities=24% Similarity=0.378 Sum_probs=23.4
Q ss_pred hccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 43 RIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 43 R~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+...|- -|+|+-.+.|+..++++...++.+|..+
T Consensus 33 ~~ph~~---~~avSL~erQ~~~LR~~~~~L~~~l~~L 66 (225)
T PF04340_consen 33 RLPHPS---GGAVSLVERQLERLRERNRQLEEQLEEL 66 (225)
T ss_dssp -----------HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCCC---CCcccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455553 3799999999999999999999988743
No 54
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=49.32 E-value=35 Score=22.66 Aligned_cols=28 Identities=18% Similarity=0.148 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.-+..++++++.++++.+.++.++..++
T Consensus 34 ~q~~~~~~e~~~l~~~n~~L~~eI~~L~ 61 (105)
T PRK00888 34 DQVAAQQQTNAKLKARNDQLFAEIDDLK 61 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3456677777777777777777776554
No 55
>PRK00736 hypothetical protein; Provisional
Probab=49.07 E-value=44 Score=20.66 Aligned_cols=27 Identities=11% Similarity=0.172 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+|..-|++|..++.+|..+..+|...
T Consensus 26 ~~v~~Qq~~i~~L~~ql~~L~~rl~~~ 52 (68)
T PRK00736 26 DQLAEQWKTVEQMRKKLDALTERFLSL 52 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 456677777777777777777766544
No 56
>PF08657 DASH_Spc34: DASH complex subunit Spc34 ; InterPro: IPR013966 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=48.70 E-value=35 Score=26.27 Aligned_cols=34 Identities=21% Similarity=0.274 Sum_probs=28.4
Q ss_pred cCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 45 RDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 45 rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
..|+.|.-..|..|.++...+.++++..+++++.
T Consensus 172 vYP~~ga~eki~~Lr~~y~~l~~~i~~lE~~Vae 205 (259)
T PF08657_consen 172 VYPLPGAREKIAALRQRYNQLSNSIAYLEAEVAE 205 (259)
T ss_pred hCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4599999999999999988888888888777763
No 57
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=48.20 E-value=44 Score=20.62 Aligned_cols=20 Identities=25% Similarity=0.408 Sum_probs=8.7
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q 047804 55 VCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~ 74 (81)
|+.|..++.++..+...++.
T Consensus 12 Vq~L~~kvdqLs~dv~~lr~ 31 (56)
T PF04728_consen 12 VQTLNSKVDQLSSDVNALRA 31 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444333
No 58
>smart00150 SPEC Spectrin repeats.
Probab=46.44 E-value=58 Score=18.84 Aligned_cols=34 Identities=21% Similarity=0.169 Sum_probs=27.5
Q ss_pred cCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 45 RDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 45 rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.+|+++....|..+.++...++.++...+..+..
T Consensus 23 ~~~~~~d~~~~~~~~~~~~~~~~e~~~~~~~v~~ 56 (101)
T smart00150 23 SEDLGKDLESVEALLKKHEALEAELEAHEERVEA 56 (101)
T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHhHHHHHH
Confidence 3567789999999999999999999877766653
No 59
>PRK04325 hypothetical protein; Provisional
Probab=46.24 E-value=50 Score=20.73 Aligned_cols=27 Identities=22% Similarity=0.159 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+|..-|++|..++.+|..+..+|...
T Consensus 30 ~vv~~Qq~~I~~L~~ql~~L~~rl~~~ 56 (74)
T PRK04325 30 ATVARQQQTLDLLQAQLRLLYQQMRDA 56 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 466777778888888877777666543
No 60
>TIGR03021 pilP_fam type IV pilus biogenesis protein PilP. Members of this protein family are found in type IV pilus biogenesis loci and include proteins designated PilP.
Probab=46.19 E-value=42 Score=22.95 Aligned_cols=24 Identities=42% Similarity=0.540 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAG 75 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~ 75 (81)
+|.+..||.|...++++++.++++
T Consensus 4 ~~eLe~iQ~et~LleAq~~~akaq 27 (119)
T TIGR03021 4 VGQLEALQSETALLEAQLARAKAQ 27 (119)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 567888999998888777755543
No 61
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=45.91 E-value=42 Score=20.22 Aligned_cols=23 Identities=22% Similarity=0.479 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+..|+.++..++.++..++..|+
T Consensus 6 ~~rL~Kel~kl~~~i~~~~~kL~ 28 (66)
T PF10458_consen 6 IERLEKELEKLEKEIERLEKKLS 28 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHHHHHHHHHc
Confidence 55677777777777777776664
No 62
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=45.20 E-value=45 Score=21.14 Aligned_cols=27 Identities=30% Similarity=0.502 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.+..|+.++..+++++..++++++++|
T Consensus 78 ~l~~l~~~~~~~~~~~~~~~~~~~~L~ 104 (104)
T PF13600_consen 78 ELEALEDELAALQDEIQALEAQIAFLQ 104 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 467899999999999999999999876
No 63
>PF11333 DUF3135: Protein of unknown function (DUF3135); InterPro: IPR021482 This family of proteins with unkown function appears to be restricted to Proteobacteria.
Probab=44.72 E-value=86 Score=20.33 Aligned_cols=30 Identities=27% Similarity=0.386 Sum_probs=25.7
Q ss_pred HhhhccCCccccHHHHHHHHHHHHHHHHHH
Q 047804 40 ANSRIRDPVYGCVGAVCHLQKQVSELQAEL 69 (81)
Q Consensus 40 A~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eL 69 (81)
--.|.++|++-|+-+-.-+..++..++..|
T Consensus 53 ~~~~~knP~~~~~~l~~~m~~~~~~l~~~l 82 (83)
T PF11333_consen 53 QRSRCKNPLHRCVLLSRMMYEQFYKLNDAL 82 (83)
T ss_pred HHHHcCChHHHHHHHHHHHHHHHHHHHHhh
Confidence 346789999999999999999998887655
No 64
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=43.93 E-value=27 Score=23.83 Aligned_cols=23 Identities=13% Similarity=0.309 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
...|+++++..+.||+.-|+++.
T Consensus 27 q~~l~~eL~~~k~el~~yk~~V~ 49 (128)
T PF06295_consen 27 QAKLEQELEQAKQELEQYKQEVN 49 (128)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556666666666665555554
No 65
>PF08285 DPM3: Dolichol-phosphate mannosyltransferase subunit 3 (DPM3); InterPro: IPR013174 This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex [].
Probab=43.15 E-value=39 Score=22.24 Aligned_cols=21 Identities=43% Similarity=0.626 Sum_probs=13.2
Q ss_pred cHHHHHHHHHHHHHHHHHHHH
Q 047804 51 CVGAVCHLQKQVSELQAELAK 71 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~ 71 (81)
|=-.-..||+||++.+++|.+
T Consensus 66 cpeA~~eL~~eI~eAK~dLr~ 86 (91)
T PF08285_consen 66 CPEAAKELQKEIKEAKADLRK 86 (91)
T ss_pred CHHHHHHHHHHHHHHHHHHHH
Confidence 333445677777777777654
No 66
>PRK14127 cell division protein GpsB; Provisional
Probab=42.83 E-value=45 Score=22.81 Aligned_cols=27 Identities=30% Similarity=0.318 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
-+..|..++..|+.++..++.+|+.++
T Consensus 38 dye~l~~e~~~Lk~e~~~l~~~l~e~~ 64 (109)
T PRK14127 38 DYEAFQKEIEELQQENARLKAQVDELT 64 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455788899999999999999998876
No 67
>PF11336 DUF3138: Protein of unknown function (DUF3138); InterPro: IPR021485 This family of proteins with unknown function appear to be restricted to Proteobacteria.
Probab=42.06 E-value=37 Score=29.08 Aligned_cols=27 Identities=30% Similarity=0.335 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.--|..||.|+..||.+...++++|+.
T Consensus 24 a~~i~~L~~ql~aLq~~v~eL~~~laa 50 (514)
T PF11336_consen 24 ADQIKALQAQLQALQDQVNELRAKLAA 50 (514)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 346888999999999999999998874
No 68
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=41.03 E-value=51 Score=22.43 Aligned_cols=21 Identities=24% Similarity=0.382 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHhhcc
Q 047804 60 KQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 60 ~qi~~lq~eLa~~~~~la~~q 80 (81)
++...++.||+.++.+|..||
T Consensus 25 ~~q~~l~~eL~~~k~el~~yk 45 (128)
T PF06295_consen 25 QKQAKLEQELEQAKQELEQYK 45 (128)
T ss_pred hhHHHHHHHHHHHHHHHHHHH
Confidence 345789999999999999886
No 69
>COG3879 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.82 E-value=35 Score=26.60 Aligned_cols=46 Identities=28% Similarity=0.248 Sum_probs=33.1
Q ss_pred HHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 33 KLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 33 ~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
..+--+|...|.||+ =-.-.+..+|++..+|++|...+++.++.++
T Consensus 39 ~q~~k~~~~~~~r~~--~L~~e~~s~Q~~~~~L~~ev~~~~~~~~s~~ 84 (247)
T COG3879 39 FQTSKGESVRRARDL--DLVKELRSLQKKVNTLAAEVEDLENKLDSVR 84 (247)
T ss_pred HhhccCcchhhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445556677777 5667788888888888888888888877664
No 70
>KOG4196 consensus bZIP transcription factor MafK [Transcription]
Probab=40.62 E-value=47 Score=23.95 Aligned_cols=51 Identities=20% Similarity=0.204 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHHhhhccCCc--cccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 29 KRDCKLIMDYEANSRIRDPV--YGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 29 k~da~~SlvYEA~aR~rDPV--~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
|+.+..-==|-++||++-=- ..-=-.=..|++|++.|..|++.++-++..|
T Consensus 55 rRRTLKNRGYA~sCR~KRv~Qk~eLE~~k~~L~qqv~~L~~e~s~~~~E~da~ 107 (135)
T KOG4196|consen 55 RRRTLKNRGYAQSCRVKRVQQKHELEKEKAELQQQVEKLKEENSRLRRELDAY 107 (135)
T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46677777788888865200 0000011235555555555555555555433
No 71
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=40.52 E-value=45 Score=24.54 Aligned_cols=29 Identities=14% Similarity=0.456 Sum_probs=25.3
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
-+.-+..++++|.+++.||..++.++..+
T Consensus 160 ~~~d~l~ie~~L~~v~~eIe~~~~~~~~l 188 (262)
T PF14257_consen 160 TVEDLLEIERELSRVRSEIEQLEGQLKYL 188 (262)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56778899999999999999999988765
No 72
>PF15300 INT_SG_DDX_CT_C: INTS6/SAGE1/DDX26B/CT45 C-terminus
Probab=40.35 E-value=47 Score=20.88 Aligned_cols=26 Identities=23% Similarity=0.126 Sum_probs=19.4
Q ss_pred hHHHHHhhh--hhhhHHHHhhhccccCHHHHHHHHHHHHh
Q 047804 4 FFLKLLESF--RELQSWVNLATGACRTKRDCKLIMDYEAN 41 (81)
Q Consensus 4 nv~k~l~~~--~~~~r~~~~~~g~~~~k~da~~SlvYEA~ 41 (81)
.+.++|+.+ |.+.| ...+..++.||.
T Consensus 24 ~iF~lL~~vqG~~~~r------------~~fv~~~IkEA~ 51 (65)
T PF15300_consen 24 KIFKLLEQVQGPLEVR------------KQFVEMIIKEAA 51 (65)
T ss_pred HHHHHHHHccCCHHHH------------HHHHHHHHHHHH
Confidence 466677766 35556 889999999985
No 73
>PF11471 Sugarporin_N: Maltoporin periplasmic N-terminal extension; InterPro: IPR021570 This N-terminal domain is found in members of the sugar porin family 1.B.3 from TC, They are related to LamB - the well characterised maltoporin of Escherichia coli for which the three-dimensional structures with and without its substrate have been obtained by X-ray diffraction. The protein consists of an 18 beta-stranded beta-barrel in contrast to proteins of the general bacterial porin family (GBP) and the Rhodobacter PorCa Porin (RPP) family which consist of 16 beta-stranded beta-barrels. Although maltoporin contains a wider beta-barrel than the porins of the GBP and RPP families (1.B.1 from TC and 1.B.7 from TC), it exhibits a narrower channel, showing only 5% of the ionic conductance of the latter porins.
Probab=39.93 E-value=75 Score=19.45 Aligned_cols=26 Identities=12% Similarity=0.144 Sum_probs=14.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+-.|+++++..+.++..++.++..+
T Consensus 33 RLa~LE~rL~~ae~ra~~ae~~~~~~ 58 (60)
T PF11471_consen 33 RLAALEQRLQAAEQRAQAAEARAKQA 58 (60)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34556666666666666666555544
No 74
>KOG4841 consensus Dolichol-phosphate mannosyltransferase, subunit 3 [Posttranslational modification, protein turnover, chaperones; Signal transduction mechanisms]
Probab=39.17 E-value=51 Score=22.47 Aligned_cols=22 Identities=41% Similarity=0.646 Sum_probs=17.4
Q ss_pred ccHHHHHHHHHHHHHHHHHHHH
Q 047804 50 GCVGAVCHLQKQVSELQAELAK 71 (81)
Q Consensus 50 Gc~G~I~~Lq~qi~~lq~eLa~ 71 (81)
.|--.-..||+||++.+++|++
T Consensus 69 Dc~eA~veL~~~IkEAr~~L~r 90 (95)
T KOG4841|consen 69 DCEEAAVELQSQIKEARADLAR 90 (95)
T ss_pred CcHHHHHHHHHHHHHHHHHHHH
Confidence 4556667899999999998875
No 75
>PF08537 NBP1: Fungal Nap binding protein NBP1; InterPro: IPR013743 NBP1 is a nuclear protein which has been shown in Saccharomyces cerevisiae (Bakers yeast) to be essential for the G2/M transition of the cell cycle.
Probab=38.98 E-value=45 Score=27.01 Aligned_cols=25 Identities=24% Similarity=0.489 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
|.+|+++|..++.||..++..|.|-
T Consensus 184 ~~~l~~~l~~~~~eL~~~~k~L~fa 208 (323)
T PF08537_consen 184 IDELEERLNDLEKELEITKKDLKFA 208 (323)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666666666666666553
No 76
>PF13863 DUF4200: Domain of unknown function (DUF4200)
Probab=38.92 E-value=60 Score=21.03 Aligned_cols=46 Identities=15% Similarity=0.207 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 30 RDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 30 ~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
..|....--|...+.. ...-|..|..+|..++.+.......+..|+
T Consensus 63 ~rA~k~a~~e~k~~~~-----k~~ei~~l~~~l~~l~~~~~k~e~~l~~~~ 108 (126)
T PF13863_consen 63 ERAEKRAEEEKKKKEE-----KEAEIKKLKAELEELKSEISKLEEKLEEYK 108 (126)
T ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555554443 345677777777777777777777776553
No 77
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=38.82 E-value=41 Score=23.65 Aligned_cols=29 Identities=21% Similarity=0.282 Sum_probs=23.1
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
=+..-.+|+++++.+.+||+..++.+...
T Consensus 64 eFAkwaKl~Rk~~kl~~el~~~~~~~~~~ 92 (161)
T PF04420_consen 64 EFAKWAKLNRKLDKLEEELEKLNKSLSSE 92 (161)
T ss_dssp SHHHHHHHHHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666778999999999999988877643
No 78
>PRK09413 IS2 repressor TnpA; Reviewed
Probab=38.61 E-value=64 Score=21.26 Aligned_cols=23 Identities=13% Similarity=0.150 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|..|+.++.+++.|.+.++..++
T Consensus 80 i~~L~~el~~L~~E~diLKKa~~ 102 (121)
T PRK09413 80 IKELQRLLGKKTMENELLKEAVE 102 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444555554444443
No 79
>PRK11239 hypothetical protein; Provisional
Probab=38.45 E-value=41 Score=25.78 Aligned_cols=27 Identities=26% Similarity=0.300 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+.+..|+.++..|++|++.+++.++.+
T Consensus 183 ~~~~~Le~rv~~Le~eva~L~~~l~~l 209 (215)
T PRK11239 183 AVDGDLQARVEALEIEVAELKQRLDSL 209 (215)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345667888888888888777777654
No 80
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=38.36 E-value=54 Score=22.83 Aligned_cols=25 Identities=32% Similarity=0.387 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
-|..|+..+..+..+|..++..|..
T Consensus 36 EI~sL~~K~~~lE~eld~~~~~l~~ 60 (143)
T PF12718_consen 36 EITSLQKKNQQLEEELDKLEEQLKE 60 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566666666666666666666543
No 81
>PF14931 IFT20: Intraflagellar transport complex B, subunit 20
Probab=38.12 E-value=97 Score=21.32 Aligned_cols=26 Identities=23% Similarity=0.285 Sum_probs=20.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
...||..|..-+.||...+.+...+|
T Consensus 82 ~q~lq~~I~Ek~~eLERl~~E~~sL~ 107 (120)
T PF14931_consen 82 QQQLQALIAEKKMELERLRSEYESLQ 107 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46788888888888888888776543
No 82
>PLN02720 complex II
Probab=37.90 E-value=14 Score=26.78 Aligned_cols=9 Identities=56% Similarity=0.907 Sum_probs=7.4
Q ss_pred ccCCccccH
Q 047804 44 IRDPVYGCV 52 (81)
Q Consensus 44 ~rDPV~Gc~ 52 (81)
+.|||||+.
T Consensus 50 aSDPVyGPq 58 (140)
T PLN02720 50 ASDPVYGPQ 58 (140)
T ss_pred hcCCCcchH
Confidence 579999984
No 83
>cd01111 HTH_MerD Helix-Turn-Helix DNA binding domain of the MerD transcription regulator. Helix-turn-helix (HTH) transcription regulator MerD. The putative secondary regulator of mercury resistance (mer) operons, MerD, has been shown to down-regulate the expression of this operon in gram-negative bacteria. It binds to the same operator DNA as MerR that activates transcription of the operon in the presence of mercury ions. The MerD protein shares the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily, which promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are conserved and contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules such as metal ions, drugs,
Probab=37.52 E-value=1e+02 Score=20.19 Aligned_cols=30 Identities=20% Similarity=0.208 Sum_probs=26.1
Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 50 GCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 50 Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
-|-.....+..+++..+++|+.++..|..+
T Consensus 77 ~~~~~~~~~~~~l~~~~~~L~~l~~~L~~~ 106 (107)
T cd01111 77 QPEACLAQLRQKIEVRRAALNALTTQLAEM 106 (107)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 377889999999999999999999998754
No 84
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=37.40 E-value=64 Score=20.65 Aligned_cols=25 Identities=36% Similarity=0.416 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
...|+++|..+++++..++..+..|
T Consensus 8 ~~~l~~~i~~l~~~~~~l~~~~~e~ 32 (129)
T cd00890 8 LQQLQQQLEALQQQLQKLEAQLTEY 32 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666666666666665544
No 85
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=37.23 E-value=1.1e+02 Score=21.80 Aligned_cols=40 Identities=23% Similarity=0.260 Sum_probs=26.9
Q ss_pred HHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 31 DCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 31 da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+++.+-+-++--+..||. .-|.+.|.+++.+|..++..++
T Consensus 8 ~~~~a~~~~~ld~~EDP~-------~~l~q~ird~e~~l~~a~~~~a 47 (221)
T PF04012_consen 8 TLVKANINELLDKAEDPE-------KMLEQAIRDMEEQLRKARQALA 47 (221)
T ss_pred HHHHHHHHHHHHhhcCHH-------HHHHHHHHHHHHHHHHHHHHHH
Confidence 356666777777888987 5566666666666666665554
No 86
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=37.08 E-value=81 Score=21.53 Aligned_cols=26 Identities=27% Similarity=0.319 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+-.+..|++|+..+.+++..+++.+.
T Consensus 7 fd~l~~le~~l~~l~~el~~LK~~~~ 32 (110)
T PRK13169 7 FDALDDLEQNLGVLLKELGALKKQLA 32 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566677777777777776666654
No 87
>PF04447 DUF550: Protein of unknown function (DUF550); InterPro: IPR007538 This entry represents the N terminus of a protein of unknown function, found in a range of Proteobacteria and a few P22-like dsDNA virus particles.
Probab=36.85 E-value=35 Score=23.45 Aligned_cols=25 Identities=36% Similarity=0.404 Sum_probs=21.3
Q ss_pred cCCccccHHHHHHHHHHHHHHHHHH
Q 047804 45 RDPVYGCVGAVCHLQKQVSELQAEL 69 (81)
Q Consensus 45 rDPV~Gc~G~I~~Lq~qi~~lq~eL 69 (81)
=-|..|++|.+.||.+++.++.++.
T Consensus 14 FGp~~~p~g~lkHl~kE~~E~~~~p 38 (100)
T PF04447_consen 14 FGPGVGPVGPLKHLSKEALEAEAAP 38 (100)
T ss_pred cCCCCCcchHHHHHHHHHHHHHhCC
Confidence 3489999999999999999887654
No 88
>PRK10963 hypothetical protein; Provisional
Probab=36.66 E-value=55 Score=23.99 Aligned_cols=35 Identities=17% Similarity=0.342 Sum_probs=27.6
Q ss_pred hhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 41 NSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 41 ~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
..+.-.|.. |+|+-.+.|++.+++++..++.+|..
T Consensus 28 ~L~lph~~~---gaVSL~ErQ~~~LR~r~~~Le~~l~~ 62 (223)
T PRK10963 28 QMRVPHPVR---GTVSLVEWQMARQRNHIHVLEEEMTL 62 (223)
T ss_pred hccCCCCCC---CeecHHHHHHHHHHHHHHHHHHHHHH
Confidence 456777755 48888899999999999888888764
No 89
>PF04508 Pox_A_type_inc: Viral A-type inclusion protein repeat ; InterPro: IPR007596 The repeat is found in the A-type inclusion protein of the Poxvirus family [].; GO: 0016032 viral reproduction
Probab=36.06 E-value=64 Score=16.73 Aligned_cols=18 Identities=22% Similarity=0.553 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHH
Q 047804 55 VCHLQKQVSELQAELAKA 72 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~ 72 (81)
|..|...|..|+.+|+.-
T Consensus 3 ~~rlr~rI~dLer~L~~C 20 (23)
T PF04508_consen 3 MNRLRNRISDLERQLSEC 20 (23)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 567788888888888754
No 90
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=35.92 E-value=68 Score=21.03 Aligned_cols=25 Identities=36% Similarity=0.461 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
...|+.++..+++++..++..+..|
T Consensus 8 ~~~l~~~i~~l~~~~~~l~~~~~e~ 32 (129)
T cd00584 8 LQVLQQEIEELQQELARLNEAIAEY 32 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666666666666666555544
No 91
>KOG4098 consensus Molecular chaperone Prefoldin, subunit 2 [Posttranslational modification, protein turnover, chaperones]
Probab=35.69 E-value=1.7e+02 Score=21.25 Aligned_cols=63 Identities=16% Similarity=0.204 Sum_probs=36.0
Q ss_pred hHHHHHhhhhhhhHHHHhhhcc--ccCHHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 4 FFLKLLESFRELQSWVNLATGA--CRTKRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 4 nv~k~l~~~~~~~r~~~~~~g~--~~~k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
-|++.|++++|.++-.|..-|+ +|+=.+.+-. ...-.-|-.++|..|-.|+.....||.+-+.
T Consensus 50 lVi~tlk~~dp~RKCfRmIgGvLVErTVkeVlP~--------L~~nke~i~~~i~~l~~qL~~k~kElnkfk~ 114 (140)
T KOG4098|consen 50 LVIETLKDLDPTRKCFRMIGGVLVERTVKEVLPI--------LQTNKENIEKVIKKLTDQLVQKGKELNKFKK 114 (140)
T ss_pred HHHHHHHhcChhhHHHHHhccchhhhhHHHHhHH--------HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3667777777777655544443 3331111111 1122335667788888888888888876654
No 92
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=35.65 E-value=71 Score=20.34 Aligned_cols=25 Identities=32% Similarity=0.434 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.+..|+.++..++.+++.+++++..
T Consensus 85 r~~~l~~~~~~l~~~~~~~~~~~~~ 109 (120)
T PF02996_consen 85 RIKELEEQLEKLEKELAELQAQIEQ 109 (120)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555555555555543
No 93
>PF05565 Sipho_Gp157: Siphovirus Gp157; InterPro: IPR008840 This family contains both viral and bacterial proteins which are related to the Gp157 protein of the Streptococcus thermophilus SFi bacteriophage. It is thought that bacteria possessing the gene coding for this protein have an increased resistance to the bacteriophage [].
Probab=35.39 E-value=1.1e+02 Score=21.47 Aligned_cols=43 Identities=14% Similarity=0.220 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 30 RDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGL 76 (81)
Q Consensus 30 ~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~l 76 (81)
.|++.++ +.-..|.+-|++.+|..++..+..+++|...+++.-
T Consensus 28 ~dtLe~i----~~~~~~K~~~~~~~Ik~~ea~~e~~k~E~krL~~rk 70 (162)
T PF05565_consen 28 ADTLESI----EDEIEEKADNIAKVIKNLEADIEAIKAEIKRLQERK 70 (162)
T ss_pred HHHHHHH----HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 5666663 344567889999999999999999999888777643
No 94
>PLN02523 galacturonosyltransferase
Probab=34.84 E-value=2e+02 Score=24.94 Aligned_cols=44 Identities=18% Similarity=0.173 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 30 RDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 30 ~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
-.+|..++++|.. .|-|..+|.+|...+..++.++..++.+=++
T Consensus 158 ~k~~~~~~~~a~~-----~~d~~~~~~kl~~~~~~~e~~~~~~~~q~~~ 201 (559)
T PLN02523 158 VKVARQMIAESKE-----SFDNQLKIQKLKDTIFAVNEQLTKAKKNGAF 201 (559)
T ss_pred HHHHHHHHHHHHh-----hcchHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6689999999983 4557789999999999999999988876554
No 95
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=34.61 E-value=73 Score=20.77 Aligned_cols=25 Identities=40% Similarity=0.400 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..|+++++.+++.+..+...+..+
T Consensus 8 ~~ql~~~i~~l~~~i~~l~~~i~e~ 32 (126)
T TIGR00293 8 LQILQQQVESLQAQIAALRALIAEL 32 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456666666666666665555543
No 96
>PF04999 FtsL: Cell division protein FtsL; InterPro: IPR007082 In Escherichia coli, nine gene products are known to be essential for assembly of the division septum. One of these, FtsL, is a bitopic membrane protein whose precise function is not understood. It has been proposed that FtsL interacts with the DivIC protein IPR007060 from INTERPRO [], however this interaction may be indirect [].; GO: 0007049 cell cycle, 0016021 integral to membrane
Probab=34.30 E-value=90 Score=19.57 Aligned_cols=27 Identities=22% Similarity=0.330 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
..+..+++++.+++.|-..++.+++.+
T Consensus 42 ~~l~~l~~~~~~l~~e~~~L~lE~~~l 68 (97)
T PF04999_consen 42 YELQQLEKEIDQLQEENERLRLEIATL 68 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 347888888888888877777776654
No 97
>PRK13902 alaS alanyl-tRNA synthetase; Provisional
Probab=34.21 E-value=1.3e+02 Score=27.01 Aligned_cols=45 Identities=20% Similarity=0.087 Sum_probs=32.1
Q ss_pred HHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 35 IMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 35 SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.++.++..-..-|..--.-.|..|+.++..++.++..++.+|+.+
T Consensus 743 ~~l~~~a~~L~~~~~el~~~v~kl~ee~k~l~kei~~l~~~l~~~ 787 (900)
T PRK13902 743 DLLKEAAEILGVPPEQLPKTVERFFEEWKEQKKEIEKLRKELAEL 787 (900)
T ss_pred HHHHHHHHHhCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355555555555555667778888888888888888888877654
No 98
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=33.97 E-value=66 Score=24.97 Aligned_cols=30 Identities=30% Similarity=0.387 Sum_probs=25.9
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
..-.+..|+.|+..++.+|..++..|..|+
T Consensus 169 ~~~~~~fl~~ql~~~~~~l~~ae~~l~~fr 198 (444)
T TIGR03017 169 AQKAALWFVQQIAALREDLARAQSKLSAYQ 198 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334678999999999999999999998875
No 99
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=33.89 E-value=57 Score=26.44 Aligned_cols=26 Identities=31% Similarity=0.394 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.|..|+.+|..++.+++.++++++..
T Consensus 72 ~~~~l~~~l~~l~~~~~~~~~~~~~~ 97 (525)
T TIGR02231 72 RLAELRKQIRELEAELRDLEDRGDAL 97 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67788888888888888777766543
No 100
>PRK11677 hypothetical protein; Provisional
Probab=33.80 E-value=47 Score=23.42 Aligned_cols=24 Identities=25% Similarity=0.234 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
....|++|++..+.||..-++++.
T Consensus 30 ~q~~le~eLe~~k~ele~YkqeV~ 53 (134)
T PRK11677 30 QQQALQYELEKNKAELEEYRQELV 53 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344566666666666665555554
No 101
>PF12518 DUF3721: Protein of unknown function; InterPro: IPR022196 This domain family is found in bacteria and eukaryotes, and is approximately 30 amino acids in length. There is a conserved WMPC sequence motif. There are two completely conserved residues (A and C) that may be functionally important.
Probab=33.64 E-value=16 Score=20.57 Aligned_cols=14 Identities=50% Similarity=0.992 Sum_probs=9.3
Q ss_pred HHhhhccCCccccHHH
Q 047804 39 EANSRIRDPVYGCVGA 54 (81)
Q Consensus 39 EA~aR~rDPV~Gc~G~ 54 (81)
||+.|++. .||.|+
T Consensus 6 eAe~~A~~--~GC~G~ 19 (34)
T PF12518_consen 6 EAEKRAKE--LGCKGA 19 (34)
T ss_pred HHHHHHHH--cCCcch
Confidence 45555544 899986
No 102
>COG4985 ABC-type phosphate transport system, auxiliary component [Inorganic ion transport and metabolism]
Probab=33.36 E-value=1.5e+02 Score=23.76 Aligned_cols=39 Identities=31% Similarity=0.301 Sum_probs=28.6
Q ss_pred HHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 34 LIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 34 ~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.-+.=|+..+. +..-+.||.+...++.||..+++++..+
T Consensus 209 g~Ld~~~q~~~-------~ae~seLq~r~~~l~~~L~~L~~e~~r~ 247 (289)
T COG4985 209 GQLDDEFQQHY-------VAEKSELQKRLAQLQTELDALRAELERQ 247 (289)
T ss_pred ccccHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhhhhhhc
Confidence 33555555543 3456889999999999999999988653
No 103
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=33.18 E-value=70 Score=23.53 Aligned_cols=25 Identities=36% Similarity=0.377 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..|++++..|+.|+..+..++..|
T Consensus 113 ~~~l~~~~e~Le~e~~~L~~~~~~~ 137 (161)
T TIGR02894 113 NESLQKRNEELEKELEKLRQRLSTI 137 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555555555555555433
No 104
>PRK15396 murein lipoprotein; Provisional
Probab=33.09 E-value=47 Score=21.52 Aligned_cols=24 Identities=17% Similarity=0.293 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-|-+|+.|++.|.++...+.+...
T Consensus 26 kvd~LssqV~~L~~kvdql~~dv~ 49 (78)
T PRK15396 26 KIDQLSSDVQTLNAKVDQLSNDVN 49 (78)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566777777777776666665544
No 105
>PF03623 Focal_AT: Focal adhesion targeting region; InterPro: IPR005189 Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signalling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal "focal adhesion targeting" (FAT) region is necessary and sufficient for localizing FAK to focal adhesions. The crystal structure of FAT shows it forms a four-helix bundle that resembles those found in two other proteins involved in cell adhesion, alpha-catenin and vinculin []. The binding of FAT to the focal adhesion protein, paxillin, requires the integrity of the helical bundle, whereas binding to another focal adhesion protein, talin, does not.; GO: 0004713 protein tyrosine kinase activity, 0004871 signal transducer activity, 0006468 protein phosphorylation, 0007172 signal complex assembly, 0005925 focal adhesion; PDB: 1K05_C 1OW8_C 1OW7_C 3S9O_C 3B71_B 1K04_A 1OW6_C 3GM1_B 3GM3_A 3GM2_A ....
Probab=32.95 E-value=7.2 Score=27.92 Aligned_cols=25 Identities=32% Similarity=0.462 Sum_probs=18.4
Q ss_pred hccCCccccH----HHHHHHHHHHHHHHH
Q 047804 43 RIRDPVYGCV----GAVCHLQKQVSELQA 67 (81)
Q Consensus 43 R~rDPV~Gc~----G~I~~Lq~qi~~lq~ 67 (81)
|..||||+++ +.|..|++++..++.
T Consensus 6 R~~D~Vy~at~~VVkaV~~Ls~~v~~~~~ 34 (139)
T PF03623_consen 6 RTNDKVYDATTGVVKAVMQLSNSVQTAKP 34 (139)
T ss_dssp CTTHHHHHHHHHHHHHHHHHHHHTTTSSH
T ss_pred cccChHHHHHHHHHHHHHHHHhhhccCCh
Confidence 7789999874 667778888776543
No 106
>PRK10884 SH3 domain-containing protein; Provisional
Probab=32.92 E-value=69 Score=23.76 Aligned_cols=23 Identities=22% Similarity=0.306 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|..|+++.+++++||+.+++++.
T Consensus 134 ~~~L~~~n~~L~~~l~~~~~~~~ 156 (206)
T PRK10884 134 INGLKEENQKLKNQLIVAQKKVD 156 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 55577777777777777766654
No 107
>PF02970 TBCA: Tubulin binding cofactor A; InterPro: IPR004226 The folding pathway of tubulins includes highly specific interactions with a series of cofactors (A, B, C, D and E) after they are released from the eukaryotic chaperonin CCT. Cofactors A and D capture and stabilise tubulin in a quasi-native conformation. Cofactor E binds to the cofactor D-tubulin complex, and interaction with cofactor C then causes the release of tubulin poypeptides in the native state. This family is the tubulin-specific chaperone A.; GO: 0051082 unfolded protein binding, 0007021 tubulin complex assembly, 0005874 microtubule; PDB: 3MXZ_A 1QSD_A 1H7C_A.
Probab=32.91 E-value=84 Score=20.29 Aligned_cols=29 Identities=28% Similarity=0.368 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
+|+|..|.++....+.|+..-++.+..|.
T Consensus 6 t~~vkRL~KE~~~Y~kE~~~q~~rle~~k 34 (90)
T PF02970_consen 6 TGVVKRLLKEEASYEKEVEEQEARLEKMK 34 (90)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 68999999999999999999888887664
No 108
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=32.85 E-value=85 Score=23.19 Aligned_cols=37 Identities=24% Similarity=0.328 Sum_probs=23.0
Q ss_pred hccCCccccHH---HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 43 RIRDPVYGCVG---AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 43 R~rDPV~Gc~G---~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..+++.+.+. -...|+.+...|++|++.++++++.+
T Consensus 56 ~~~~~~~~~~~~~~~~~~l~~en~~L~~e~~~l~~~~~~~ 95 (276)
T PRK13922 56 APREFVSGVFESLASLFDLREENEELKKELLELESRLQEL 95 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444443 34566777788888888777776644
No 109
>PF11461 RILP: Rab interacting lysosomal protein; InterPro: IPR021563 RILP contains a domain which contains two coiled-coil regions and is found mainly in the cytosol. RILP is recruited onto late endosomal and lysosomal membranes by Rab7 and acts as a downstream effector of Rab7. This recruitment process is important for phagosome maturation and fusion with late endosomes and lysosomes. ; PDB: 1YHN_B.
Probab=32.71 E-value=72 Score=19.88 Aligned_cols=20 Identities=25% Similarity=0.375 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHhhcc
Q 047804 61 QVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 61 qi~~lq~eLa~~~~~la~~q 80 (81)
+=+.|++.+-.++.+|+.||
T Consensus 11 ERNeLK~~v~~leEEL~~yk 30 (60)
T PF11461_consen 11 ERNELKARVFLLEEELAYYK 30 (60)
T ss_dssp HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 44556666666666666665
No 110
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=32.38 E-value=73 Score=19.91 Aligned_cols=24 Identities=13% Similarity=0.235 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 57 HLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 57 ~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.|+.++..|+..|..+..++..|+
T Consensus 2 ~Lea~~~~Lr~rLd~~~rk~~~~~ 25 (69)
T PF14197_consen 2 KLEAEIATLRNRLDSLTRKNSVHE 25 (69)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666667777766666665543
No 111
>PRK14872 rod shape-determining protein MreC; Provisional
Probab=32.37 E-value=75 Score=25.61 Aligned_cols=28 Identities=14% Similarity=0.082 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
+....|+++.+.|++|++.+++++..++
T Consensus 57 ~~y~~L~~EN~~Lk~Ena~L~~~l~~~e 84 (337)
T PRK14872 57 SHALVLETENFLLKERIALLEERLKSYE 84 (337)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7779999999999999999988887654
No 112
>PF07820 TraC: TraC-like protein; InterPro: IPR012930 The members of this family are sequences that are similar to TraC (Q84HT8 from SWISSPROT) from Rhizobium etli. The gene encoding this protein is one of a group of genes found on plasmid p42a of Rhizobium etli (strain CFN 42/ATCC 51251) that are thought to be involved in the process of plasmid self-transmission. Mobilisation of plasmid p42a is of importance as it is required for transfer of plasmid p42d, the symbiotic plasmid which carries most of the genes required for nodulation and nitrogen fixation by this symbiotic bacterium. The predicted protein products of p42a are similar to known transfer proteins of Agrobacterium tumefaciens plasmid pTiC58 []. ; GO: 0000746 conjugation
Probab=32.23 E-value=91 Score=21.07 Aligned_cols=22 Identities=18% Similarity=0.391 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q 047804 55 VCHLQKQVSELQAELAKAQAGL 76 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~l 76 (81)
++.|..||..||++|..+....
T Consensus 4 ~s~I~~eIekLqe~lk~~e~ke 25 (92)
T PF07820_consen 4 SSKIREEIEKLQEQLKQAETKE 25 (92)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 5678888888888887766543
No 113
>PF15294 Leu_zip: Leucine zipper
Probab=32.02 E-value=69 Score=25.30 Aligned_cols=33 Identities=27% Similarity=0.377 Sum_probs=27.0
Q ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 47 PVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 47 PV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
|+.+. |.+--|+.+|..+++|..++++.|...+
T Consensus 120 pl~e~-g~~~ll~kEi~rLq~EN~kLk~rl~~le 152 (278)
T PF15294_consen 120 PLNES-GGSELLNKEIDRLQEENEKLKERLKSLE 152 (278)
T ss_pred ccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45443 8888899999999999999999887654
No 114
>PRK14623 hypothetical protein; Provisional
Probab=31.84 E-value=89 Score=21.13 Aligned_cols=25 Identities=12% Similarity=0.198 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..+-+|.+..|.++..+|++|+..
T Consensus 3 ~~~~mkqaqkmQ~km~~~Qeel~~~ 27 (106)
T PRK14623 3 MMGMMGKLKEAQQKVEATKKRLDTV 27 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 5667778888888888888888753
No 115
>PRK14626 hypothetical protein; Provisional
Probab=31.73 E-value=1e+02 Score=20.85 Aligned_cols=26 Identities=27% Similarity=0.431 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
|-+..+-+|.+..|.++..+|++|+.
T Consensus 5 gn~~~mmkqaq~mQ~km~~~qeeL~~ 30 (110)
T PRK14626 5 GNLAELMKQMQSIKENVEKAKEELKK 30 (110)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34667778888888888888888874
No 116
>PRK14625 hypothetical protein; Provisional
Probab=31.66 E-value=90 Score=21.21 Aligned_cols=24 Identities=46% Similarity=0.418 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
+..+-+|.+..|.++..+|++|+.
T Consensus 4 m~~mmkqaq~mQ~km~~~Q~el~~ 27 (109)
T PRK14625 4 LGGLMKQAQAMQQKLADAQARLAE 27 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Confidence 566777888888888888888874
No 117
>PRK01203 prefoldin subunit alpha; Provisional
Probab=31.47 E-value=83 Score=22.17 Aligned_cols=25 Identities=8% Similarity=0.205 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..|++|++.+++++..++..+..+
T Consensus 9 ~~~~~~q~e~l~~ql~~L~~a~se~ 33 (130)
T PRK01203 9 LNYIESLISSVDSQIDSLNKTLSEV 33 (130)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567777777777777777665544
No 118
>TIGR00344 alaS alanine--tRNA ligase. The model describes alanine--tRNA ligase. This enzyme catalyzes the reaction (tRNAala + L-alanine + ATP = L-alanyl-tRNAala + pyrophosphate + AMP).
Probab=31.46 E-value=1.5e+02 Score=26.51 Aligned_cols=44 Identities=14% Similarity=-0.007 Sum_probs=30.1
Q ss_pred HHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 36 MDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 36 lvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
++-+...-..-|...-...|..|+.++..++.++..++.+|+.+
T Consensus 709 ~l~~l~~~L~~~~~~i~~~v~~l~~e~k~l~k~~~~l~~~l~~~ 752 (851)
T TIGR00344 709 KLKELADILKVTPNELPKKIERFFEEWKALQKELESLKLKIADQ 752 (851)
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444555555556667888888888888888888877643
No 119
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=31.25 E-value=89 Score=24.53 Aligned_cols=37 Identities=30% Similarity=0.343 Sum_probs=29.9
Q ss_pred cCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccC
Q 047804 45 RDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQS 81 (81)
Q Consensus 45 rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q~ 81 (81)
.|-|.--.|-...|+.||++++--+...|+.|...|+
T Consensus 7 K~~~~~~~~~L~rle~qi~q~~~~~~~~qs~l~~~~~ 43 (251)
T COG5415 7 KDFVTKYTADLSRLESQIHQLDVALKKSQSILSQWQS 43 (251)
T ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555666788899999999999999999998887663
No 120
>PRK11677 hypothetical protein; Provisional
Probab=31.18 E-value=1.1e+02 Score=21.51 Aligned_cols=21 Identities=38% Similarity=0.376 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHhhcc
Q 047804 60 KQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 60 ~qi~~lq~eLa~~~~~la~~q 80 (81)
++...++.||..++.+|..||
T Consensus 29 ~~q~~le~eLe~~k~ele~Yk 49 (134)
T PRK11677 29 RQQQALQYELEKNKAELEEYR 49 (134)
T ss_pred hHHHHHHHHHHHHHHHHHHHH
Confidence 467889999999999999886
No 121
>PF07439 DUF1515: Protein of unknown function (DUF1515); InterPro: IPR010889 This family consists of several hypothetical bacterial proteins of around 130 residues in length. Members of this family seem to be found exclusively in Rhizobium species. The function of this family is unknown.
Probab=30.87 E-value=1.9e+02 Score=20.26 Aligned_cols=47 Identities=15% Similarity=0.236 Sum_probs=35.2
Q ss_pred HHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 29 KRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 29 k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
|+|.- +++.|...-=..-++.+-.|...+..++..+..+++.++.++
T Consensus 21 Red~r-----~SEdrsa~SRa~mhrRlDElV~Rv~~lEs~~~~lk~dVsemK 67 (112)
T PF07439_consen 21 REDIR-----RSEDRSAASRASMHRRLDELVERVTTLESSVSTLKADVSEMK 67 (112)
T ss_pred HHHHH-----HHHHHhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHhhHHhcc
Confidence 35555 455555555556788999999999999999999988887664
No 122
>PF08286 Spc24: Spc24 subunit of Ndc80; InterPro: IPR013252 Spc24 is a component of the evolutionarily conserved kinetochore-associated Ndc80 complex and is involved in chromosome segregation [].; PDB: 2VE7_D 2FV4_B 2FTX_B.
Probab=30.87 E-value=16 Score=24.26 Aligned_cols=25 Identities=24% Similarity=0.377 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.+..|+.++..++++|+.++.++..
T Consensus 14 ~~~~LE~~l~~l~~el~~L~~~l~e 38 (118)
T PF08286_consen 14 ELSDLESELESLQSELEELKEELEE 38 (118)
T ss_dssp -------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566777777777777777766654
No 123
>PF06005 DUF904: Protein of unknown function (DUF904); InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=30.75 E-value=79 Score=19.93 Aligned_cols=21 Identities=24% Similarity=0.336 Sum_probs=14.4
Q ss_pred ccccHHHHHHHHHHHHHHHHH
Q 047804 48 VYGCVGAVCHLQKQVSELQAE 68 (81)
Q Consensus 48 V~Gc~G~I~~Lq~qi~~lq~e 68 (81)
|-..+-.|..||.++..++.+
T Consensus 13 i~~aveti~~Lq~e~eeLke~ 33 (72)
T PF06005_consen 13 IQQAVETIALLQMENEELKEK 33 (72)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 334556777788888777775
No 124
>TIGR03683 A-tRNA_syn_arch alanyl-tRNA synthetase. This family of alanyl-tRNA synthetases is limited to the archaea, and is a subset of those sequences identified by the model pfam07973 covering the second additional domain (SAD) of alanyl and threonyl tRNA synthetases.
Probab=30.41 E-value=1.7e+02 Score=26.42 Aligned_cols=45 Identities=16% Similarity=0.116 Sum_probs=30.6
Q ss_pred HHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 35 IMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 35 SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.+..|+..-..-|..--.-.|..|++++..++.++..++.+|+.+
T Consensus 747 ~~l~~~a~~L~~~~~~l~~~v~~l~~e~k~l~k~l~~l~~~l~~~ 791 (902)
T TIGR03683 747 DLLRESSDILKVPPEQLPETVKRFFEEWKEQRKEIERLKKKLAEL 791 (902)
T ss_pred HHHHHHHHHhCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555554545556678888888888888888888777654
No 125
>PF07223 DUF1421: Protein of unknown function (DUF1421); InterPro: IPR010820 This family represents a conserved region approximately 350 residues long within a number of plant proteins of unknown function.
Probab=30.27 E-value=49 Score=26.88 Aligned_cols=15 Identities=40% Similarity=0.485 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHhhcc
Q 047804 66 QAELAKAQAGLAFMQ 80 (81)
Q Consensus 66 q~eLa~~~~~la~~q 80 (81)
|.||+.++.+||++|
T Consensus 14 KQEiaEtQ~qLaKLQ 28 (358)
T PF07223_consen 14 KQEIAETQKQLAKLQ 28 (358)
T ss_pred HHHHHHHHHHHHhhh
Confidence 556666666666554
No 126
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=30.26 E-value=1.2e+02 Score=20.33 Aligned_cols=23 Identities=26% Similarity=0.394 Sum_probs=12.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q 047804 54 AVCHLQKQVSELQAELAKAQAGL 76 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~l 76 (81)
.|..|++|+..+-+++..++..+
T Consensus 9 ~l~~le~~l~~l~~~~~~LK~~~ 31 (107)
T PF06156_consen 9 RLDQLEQQLGQLLEELEELKKQL 31 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555555444
No 127
>KOG3436 consensus 60S ribosomal protein L35 [Translation, ribosomal structure and biogenesis]
Probab=30.17 E-value=63 Score=22.97 Aligned_cols=19 Identities=37% Similarity=0.478 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q 047804 55 VCHLQKQVSELQAELAKAQ 73 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~ 73 (81)
...|+.|+.+++.||+.++
T Consensus 14 ke~L~~ql~dLK~ELa~LR 32 (123)
T KOG3436|consen 14 KEQLLKQLDDLKVELAQLR 32 (123)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3688999999999998776
No 128
>PRK03762 hypothetical protein; Provisional
Probab=30.13 E-value=1e+02 Score=20.74 Aligned_cols=23 Identities=13% Similarity=0.144 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q 047804 56 CHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 56 ~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
..+.+|....|.+++.+|++|+.
T Consensus 8 ~~m~kqaqkmQ~km~~~Q~el~~ 30 (103)
T PRK03762 8 SKLGEMLEQMQKKAKQLEEENAN 30 (103)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Confidence 45666677777777777777764
No 129
>PRK00587 hypothetical protein; Provisional
Probab=29.99 E-value=1e+02 Score=20.58 Aligned_cols=24 Identities=21% Similarity=0.275 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
+..|.+|.+..|.++..+|++|+.
T Consensus 3 ~~~lmkqaqkmQ~km~~~QeeL~~ 26 (99)
T PRK00587 3 FQKLAQQLKKMQNTMEKKQKEFEE 26 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Confidence 356777778888888888888774
No 130
>COG0013 AlaS Alanyl-tRNA synthetase [Translation, ribosomal structure and biogenesis]
Probab=29.83 E-value=1.2e+02 Score=27.69 Aligned_cols=25 Identities=32% Similarity=0.400 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.|..|+.+...++.||+.++.+++.
T Consensus 733 kv~~l~~~~k~lekel~~lk~~~~~ 757 (879)
T COG0013 733 KVERLLEELKELEKELERLKKKLAA 757 (879)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 8999999999999999999886653
No 131
>PF01025 GrpE: GrpE; InterPro: IPR000740 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes the grpE protein. Dimeric GrpE is the co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold []. DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. The X-ray crystal structure of GrpE in complex with the ATPase domain of DnaK revealed that GrpE is an asymmetric homodimer, bent in a manner that favours extensive contacts with only one DnaKATPase monomer []. GrpE does not actively compete for the atomic positions occupied by the nucleotide. GrpE and ADP mutually reduce one another's affinity for DnaK 200-fold, and ATP instantly dissociates GrpE from DnaK.; GO: 0000774 adenyl-nucleotide exchange factor activity, 0042803 protein homodimerization activity, 0051087 chaperone binding, 0006457 protein folding; PDB: 3A6M_A 4ANI_A 1DKG_B.
Probab=29.78 E-value=98 Score=20.95 Aligned_cols=25 Identities=28% Similarity=0.400 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..|+.++..++.++....+++..|
T Consensus 20 l~~l~~~~~~l~~~~~r~~ae~en~ 44 (165)
T PF01025_consen 20 LEELEKEIEELKERLLRLQAEFENY 44 (165)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556777777777777666666544
No 132
>PRK14629 hypothetical protein; Provisional
Probab=29.73 E-value=96 Score=20.77 Aligned_cols=22 Identities=9% Similarity=0.258 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHhh
Q 047804 57 HLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 57 ~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
++-+|.+..|.++..+|++|+.
T Consensus 7 ~~mkqaq~mQ~km~~~Q~eL~~ 28 (99)
T PRK14629 7 DFLKNMSSFKDNIDNIKKEISQ 28 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc
Confidence 4667777777777777777764
No 133
>PRK14622 hypothetical protein; Provisional
Probab=29.67 E-value=1e+02 Score=20.53 Aligned_cols=24 Identities=29% Similarity=0.343 Sum_probs=18.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
+..|-+|.+..|.++..+|++|+.
T Consensus 3 ~~~lmkqaq~mQ~~m~~~q~el~~ 26 (103)
T PRK14622 3 IQYLMRQAKKLEKAMADAKEKLAE 26 (103)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Confidence 456778888888888888888874
No 134
>PF12443 AKNA: AT-hook-containing transcription factor; InterPro: IPR022150 This domain family is found in eukaryotes, and is approximately 110 amino acids in length. This family contains a transcription factor which regulates the expression of the costimulatory molecules on lymphocytes.
Probab=29.32 E-value=1.9e+02 Score=19.87 Aligned_cols=59 Identities=12% Similarity=0.077 Sum_probs=37.8
Q ss_pred HHHHhhhhhhhHHHHhhhccccCHHHHHHHHHHHHhhhccCC---------ccccHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 6 LKLLESFRELQSWVNLATGACRTKRDCKLIMDYEANSRIRDP---------VYGCVGAVCHLQKQVSELQAELAKAQAGL 76 (81)
Q Consensus 6 ~k~l~~~~~~~r~~~~~~g~~~~k~da~~SlvYEA~aR~rDP---------V~Gc~G~I~~Lq~qi~~lq~eLa~~~~~l 76 (81)
+|+...++..+. +..- -||..+|-.-| -.|-+--|+.++.+|..|...+..++.++
T Consensus 4 lk~~~~l~~~~~--------------aL~g-E~~~a~~~l~~t~q~~~s~~sp~~f~~~~ege~~~qkL~eqteeLK~kv 68 (106)
T PF12443_consen 4 LKGFPRLHAAQE--------------ALEG-ESETACRKLSSTSQKDDSSGSPGIFDKIREGEQMIQKLGEQTEELKDKV 68 (106)
T ss_pred hhhhHHHHHHHH--------------Hhhh-HHHHHHHhcCCCccccCCCCCccccCcchHHHHHHHHHHHHHHHHHHHH
Confidence 455556666555 6666 88888875422 23344456777777777777777777766
Q ss_pred hhc
Q 047804 77 AFM 79 (81)
Q Consensus 77 a~~ 79 (81)
..+
T Consensus 69 qe~ 71 (106)
T PF12443_consen 69 QEF 71 (106)
T ss_pred HHH
Confidence 544
No 135
>PF06657 Cep57_MT_bd: Centrosome microtubule-binding domain of Cep57; InterPro: IPR010597 This entry is thought to represent a centrosomal protein of 57 kDa (Cep57-related protein). It is required for spindle microtubule attachment to both kinetochores and centrosomes and functions to tether minus-ends of spindle microtubules to centrosomes. It may act by forming ring-like structures around microtubules, or by serving as a cross-linker or scaffold at the attachment site [].
Probab=29.29 E-value=1.3e+02 Score=19.06 Aligned_cols=32 Identities=16% Similarity=0.122 Sum_probs=24.6
Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 49 YGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 49 ~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
..-.-+|..||.++..++.++..+++.+..++
T Consensus 13 ~~Ls~vl~~LqDE~~hm~~e~~~L~~~~~~~d 44 (79)
T PF06657_consen 13 EALSEVLKALQDEFGHMKMEHQELQDEYKQMD 44 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 33455788899999999999998887776553
No 136
>KOG2185 consensus Predicted RNA-processing protein, contains G-patch domain [RNA processing and modification]
Probab=29.23 E-value=77 Score=27.00 Aligned_cols=31 Identities=26% Similarity=0.221 Sum_probs=26.7
Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 50 GCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 50 Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
|=.-+...||.++..++.+|+.++++.+..|
T Consensus 441 gd~v~s~~lqe~L~ev~~~Lasl~aqea~ls 471 (486)
T KOG2185|consen 441 GDLVVSEALQERLKEVRKALASLLAQEAALS 471 (486)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4555788999999999999999999988766
No 137
>PF11570 E2R135: Coiled-coil receptor-binding R-domain of colicin E2; InterPro: IPR024566 Bacteriocins are protein antibiotics that kill bacteria closely related to the producing species. Colicins are a subgroup of bacteriocins that are produced by and target Escherichia coli. The lethal action of most colicins is exerted either by formation of a pore in the cytoplasmic membrane of the target cell, or by an enzymatic nuclease digestion mechanism. Most colicins are able to translocate the outer membrane by a two-receptor system, where one receptor is used for the initial binding and the second for translocation. The initial binding is to cell surface receptors such as the porins OmpF, FepA, BtuB, Cir and FhuA. The presence of specific periplasmic proteins, such as TolA, TolB, TolC, or TonB, are required for translocation across the membrane []. Colicins are composed of domains with distinct functional roles. In general they contain a central R (receptor) domain that mediates receptor binding, an N-terminal T (translocation) domain that mediates translocation of the protein from the outer membrane receptor to the colicin's target within the cell, and a C-terminal C (catalytic) domain that performs the catalytic cleavage []. This entry represents the central R domain found in colicin-E2 and other colicins.; PDB: 2YSU_B 1UJW_B 2B5U_C 1JCH_A.
Probab=29.02 E-value=1.5e+02 Score=21.40 Aligned_cols=47 Identities=28% Similarity=0.140 Sum_probs=29.9
Q ss_pred HHHHHHHhh-----hccCCccccH-H---HHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 34 LIMDYEANS-----RIRDPVYGCV-G---AVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 34 ~SlvYEA~a-----R~rDPV~Gc~-G---~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
+--+=||+. ++.||-++-- | -+..++..|..-+++|..++..|..++
T Consensus 49 ~~~~~eae~k~~~~~a~~P~~~~~~~wqlkvr~a~~dv~nkq~~l~AA~~~l~~~~ 104 (136)
T PF11570_consen 49 NKKVKEAEIKQDEFFANNPPHEYGRGWQLKVRRAQKDVQNKQNKLKAAQKELNAAD 104 (136)
T ss_dssp HHHHHHHHHHHCCCCTT-TTSSCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhcccccccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 334445543 6778876533 2 355566778888888888888877654
No 138
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=28.99 E-value=1e+02 Score=17.82 Aligned_cols=20 Identities=30% Similarity=0.449 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q 047804 57 HLQKQVSELQAELAKAQAGL 76 (81)
Q Consensus 57 ~Lq~qi~~lq~eLa~~~~~l 76 (81)
.|+++.+.+++|+..+...+
T Consensus 23 ~L~~E~~~L~aev~~L~~kl 42 (45)
T PF02183_consen 23 SLKKENEKLRAEVQELKEKL 42 (45)
T ss_pred HHHHHHHHHHHHHHHHHHhh
Confidence 45566666666666555544
No 139
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=28.87 E-value=1.2e+02 Score=20.78 Aligned_cols=25 Identities=24% Similarity=0.460 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|-+..|+.++..+.++-..+.++|.
T Consensus 30 ~E~~~l~~el~~l~~~r~~l~~Eiv 54 (120)
T PF12325_consen 30 GELASLQEELARLEAERDELREEIV 54 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6777778888777777777777665
No 140
>COG0255 RpmC Ribosomal protein L29 [Translation, ribosomal structure and biogenesis]
Probab=28.71 E-value=1.4e+02 Score=18.89 Aligned_cols=26 Identities=35% Similarity=0.337 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
+-.|..++..++.||..++.+++.-|
T Consensus 13 ~eeL~~~l~eLK~ELf~LR~q~a~g~ 38 (69)
T COG0255 13 VEELEEELRELKKELFNLRFQLATGQ 38 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCC
Confidence 45788899999999999988887544
No 141
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=28.36 E-value=87 Score=21.65 Aligned_cols=22 Identities=23% Similarity=0.442 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhh
Q 047804 57 HLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 57 ~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
.|..+|..++.|+..+.+.|..
T Consensus 113 el~~~i~~l~~e~~~l~~kL~~ 134 (169)
T PF07106_consen 113 ELREEIEELEEEIEELEEKLEK 134 (169)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444443
No 142
>KOG4603 consensus TBP-1 interacting protein [Signal transduction mechanisms]
Probab=28.21 E-value=1.7e+02 Score=22.30 Aligned_cols=25 Identities=12% Similarity=0.058 Sum_probs=20.8
Q ss_pred hHHHHHhhhhhhhHHHHhhhccccC
Q 047804 4 FFLKLLESFRELQSWVNLATGACRT 28 (81)
Q Consensus 4 nv~k~l~~~~~~~r~~~~~~g~~~~ 28 (81)
-|.|.|..|...-+-..-+.|--+|
T Consensus 41 avqk~Ld~La~~Gki~~K~YGKqKI 65 (201)
T KOG4603|consen 41 AVQKTLDQLAQQGKIKEKMYGKQKI 65 (201)
T ss_pred HHHHHHHHHHHcCchhHHhccceee
Confidence 4778888888888888888888888
No 143
>PF00631 G-gamma: GGL domain; InterPro: IPR015898 This entry represents the G protein gamma subunit and the GGL (G protein gamma-like) domain, which are related in sequence and are comprised of an extended alpha-helical polypeptide. The G protein gamma subunit forms a stable dimer with the beta subunit, but it does not make any contact with the alpha subunit, which contacts the opposite face of the beta subunit. The GGL domain is found in several RGS (regulators of G protein signaling) proteins. GGL domains can interact with beta subunits to form novel dimers that prevent gamma subunit binding, and may prevent heterotrimer formation by inhibiting alpha subunit binding. The interaction between G protein beta-5 neuro-specific isoforms and RGS GGL domains may represent a general mode of binding between beta-propeller proteins and their partners [].; GO: 0004871 signal transducer activity, 0007186 G-protein coupled receptor protein signaling pathway, 0005834 heterotrimeric G-protein complex; PDB: 3PSC_G 3SN6_G 1OMW_G 2BCJ_G 1GG2_G 3PVW_G 3PVU_G 3AH8_G 3CIK_G 1GP2_G ....
Probab=28.19 E-value=67 Score=19.36 Aligned_cols=17 Identities=24% Similarity=0.526 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHH
Q 047804 55 VCHLQKQVSELQAELAK 71 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~ 71 (81)
+.+|+++|+.|+.||..
T Consensus 4 ~~~l~~ei~~L~~el~~ 20 (68)
T PF00631_consen 4 KDQLKREIEQLRQELER 20 (68)
T ss_dssp HHHHHHHHHHHHHHHTS
T ss_pred HHHHHHHHHHHHHHHcc
Confidence 56788888888888865
No 144
>PF02375 JmjN: jmjN domain; InterPro: IPR003349 Jumonji protein is required for neural tube formation in mice [].There is evidence of domain swapping within the jumonji family of transcription factors []. This domain is often associated with JmjC (see IPR003347 from INTERPRO).; PDB: 2XML_A 2W2I_C 3DXT_A 3DXU_A 2OX0_B 2OQ6_B 2WWJ_A 2Q8D_A 3PDQ_A 2YBK_A ....
Probab=27.89 E-value=44 Score=18.43 Aligned_cols=22 Identities=18% Similarity=0.265 Sum_probs=13.8
Q ss_pred chHHHHHhhhhhhhHHHHhhhccccC
Q 047804 3 SFFLKLLESFRELQSWVNLATGACRT 28 (81)
Q Consensus 3 snv~k~l~~~~~~~r~~~~~~g~~~~ 28 (81)
.|.++-++++.+.-. -+|+|||
T Consensus 11 ~dp~~yi~~i~~~g~----~~Gi~KI 32 (34)
T PF02375_consen 11 KDPIKYISSIEPEGE----KYGICKI 32 (34)
T ss_dssp S-HHHHHHHHHHTTG----GGSEEEE
T ss_pred hCHHHHHHHHHHHHH----HCCEEEe
Confidence 456667777665333 6899886
No 145
>PRK14624 hypothetical protein; Provisional
Probab=27.71 E-value=1.1e+02 Score=20.92 Aligned_cols=27 Identities=4% Similarity=0.202 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+-+..+-+|.+..|+++..+|++|+..
T Consensus 6 ~nm~~~mkqAq~mQ~km~~~QeeL~~~ 32 (115)
T PRK14624 6 KNMSEALSNMGNIREKMEEVKKRIASI 32 (115)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 336678888888999999999988753
No 146
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=27.05 E-value=1.4e+02 Score=24.91 Aligned_cols=39 Identities=18% Similarity=0.213 Sum_probs=25.5
Q ss_pred HHHHHhhhccCCcc---ccHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 36 MDYEANSRIRDPVY---GCVGAVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 36 lvYEA~aR~rDPV~---Gc~G~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
..|.-....-||.- +-.+.|..||.|+..++++|+.++.
T Consensus 266 ~~fRn~~gvlDP~~~a~~~~~lI~~Le~qLa~~~aeL~~L~~ 307 (434)
T PRK15178 266 LKIQHIQKDIDPKETITAIYQLIAGFETQLAEAKAEYAQLMV 307 (434)
T ss_pred HHHHHhCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 55666666778854 4446677777777777777766543
No 147
>PF03955 Adeno_PIX: Adenovirus hexon-associated protein (IX); InterPro: IPR005641 Hexon (IPR000736 from INTERPRO) is the major coat protein from adenovirus type 2. Hexon forms a homo-trimer, 240 copies of which are present in the capsid, organised so that 12 lie on each of the 20 facets of this structure. The central 9 hexons in a facet are cemented together by 12 copies of protein IX []. Protein IX is not neccessarily required for viral replication, but has been shown to affect several processes including DNA-packaging capacity, thermostability, and the transcriptional activity of several promoters. For more information see [].; GO: 0031423 hexon binding, 0044423 virion part; PDB: 3IYN_T.
Probab=26.89 E-value=1.3e+02 Score=20.83 Aligned_cols=25 Identities=24% Similarity=0.307 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
-+..+|.++..++++|+.+..+|+.
T Consensus 84 ~~~~~~~~l~~l~a~Le~l~~~L~~ 108 (109)
T PF03955_consen 84 NLTALEDKLTALLAQLEALKQQLAE 108 (109)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3566788888888888888888874
No 148
>PRK11546 zraP zinc resistance protein; Provisional
Probab=26.76 E-value=1.3e+02 Score=21.58 Aligned_cols=22 Identities=14% Similarity=0.312 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q 047804 53 GAVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
..|..|.++|..|+.+|...+.
T Consensus 89 ~kI~aL~kEI~~Lr~kL~e~r~ 110 (143)
T PRK11546 89 SKINAVAKEMENLRQSLDELRV 110 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 5688888888888888875544
No 149
>PLN02900 alanyl-tRNA synthetase
Probab=26.74 E-value=1.9e+02 Score=26.26 Aligned_cols=31 Identities=32% Similarity=0.239 Sum_probs=23.9
Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 49 YGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 49 ~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
......|..|+.++..++.++..++.+++.+
T Consensus 786 ~el~~~v~~l~~e~k~l~k~~~~l~~~la~~ 816 (936)
T PLN02900 786 EEIRARVSALQKELRAAQKEAAALRAKLAVA 816 (936)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456678888888888888888888887654
No 150
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=26.35 E-value=3.2e+02 Score=22.79 Aligned_cols=40 Identities=13% Similarity=0.124 Sum_probs=31.0
Q ss_pred hhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 41 NSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 41 ~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.-|.+.=|.-+...+..+...|..|+.||+.++++|+.++
T Consensus 267 ~fRn~~gvlDP~~~a~~~~~lI~~Le~qLa~~~aeL~~L~ 306 (434)
T PRK15178 267 KIQHIQKDIDPKETITAIYQLIAGFETQLAEAKAEYAQLM 306 (434)
T ss_pred HHHHhCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456677888888888888888888888888888887653
No 151
>PF02050 FliJ: Flagellar FliJ protein; InterPro: IPR012823 Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects []. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export. The sequences of fliH, fliI and the adjacent gene, fliJ, have been deduced. FliJ was shown to encode a protein of molecular mass 17,302 Da []. It is a membrane-associated protein that affects chemotactic events, mutations in FliJ result in failure to respond to chemotactic stimuli.; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009288 bacterial-type flagellum, 0016020 membrane, 0044461 bacterial-type flagellum part; PDB: 3AJW_A.
Probab=26.34 E-value=1.5e+02 Score=17.76 Aligned_cols=27 Identities=15% Similarity=0.270 Sum_probs=16.2
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+.+.+..|...|...+.++..++.++.
T Consensus 50 ~~~~~~~l~~~i~~~~~~~~~~~~~~~ 76 (123)
T PF02050_consen 50 YQRYISALEQAIQQQQQELERLEQEVE 76 (123)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345566666666666666666655544
No 152
>PF10737 GerPC: Spore germination protein GerPC; InterPro: IPR019673 GerPC is required for the formation of functionally normal spores. The gerP locus encodes a number of proteins which are thought to be involved in the establishment of normal spore coat structure and/or permeability, which allows the access of germinants to their receptor [].
Probab=26.11 E-value=59 Score=24.00 Aligned_cols=21 Identities=29% Similarity=0.477 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q 047804 55 VCHLQKQVSELQAELAKAQAG 75 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~ 75 (81)
|..|++++..++.|+..++.+
T Consensus 1 I~~LE~~~~~l~~e~~~Lk~~ 21 (176)
T PF10737_consen 1 IQRLEQRLQELQQELEELKQQ 21 (176)
T ss_pred ChHHHHHHHHHHHHHHHHHhC
Confidence 568999999999999888764
No 153
>PRK11519 tyrosine kinase; Provisional
Probab=26.11 E-value=98 Score=26.40 Aligned_cols=28 Identities=21% Similarity=0.358 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
..+..|++|+..++.+|..++..|..|+
T Consensus 267 ~a~~fL~~ql~~l~~~L~~aE~~l~~fr 294 (719)
T PRK11519 267 KSLAFLAQQLPEVRSRLDVAENKLNAFR 294 (719)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4678899999999999999999998876
No 154
>TIGR02047 CadR-PbrR Cd(II)/Pb(II)-responsive transcriptional regulator. This model represents the cadmium(II) and/or lead(II) responsive transcriptional activator of the proteobacterial metal efflux system. This protein is a member of the MerR family of transcriptional activators (pfam00376) and contains a distinctive pattern of cysteine residues in its metal binding loop, Cys-X(6-9)-Cys, as well as a conserved and critical cysteine at the N-terminal end of the dimerization helix.
Probab=26.07 E-value=2e+02 Score=19.17 Aligned_cols=32 Identities=25% Similarity=0.399 Sum_probs=21.4
Q ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 46 DPVYGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 46 DPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+|-..|--....|+.++..++++++.++....
T Consensus 72 ~~~~~~~~~~~~l~~~~~~l~~~i~~L~~~~~ 103 (127)
T TIGR02047 72 KPEKSCSDVNALLDEHISHVRARIIKLQALIE 103 (127)
T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44445666667788888888777777665544
No 155
>cd00089 HR1 Protein kinase C-related kinase homology region 1 domain; also known as the ACC (antiparallel coiled-coil) finger domain or Rho-binding domain. Found in vertebrate PRK1 and yeast PKC1 protein kinases C; those found in rhophilin bind RhoGTP; those in PRK1 bind RhoA and RhoB. Rho family members function as molecular switches, cycling between inactive and active forms, controlling a variety of cellular processes. HR1 repeats often occur in tandem repeat arrangments, seperated by a short linker region.
Probab=25.86 E-value=1.1e+02 Score=18.44 Aligned_cols=28 Identities=11% Similarity=0.151 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
|-....+.++......|..++.+|..|+
T Consensus 42 ~~~~~~~~~l~es~~ki~~Lr~~L~k~~ 69 (72)
T cd00089 42 KLLAEAEQMLRESKQKLELLKMQLEKLK 69 (72)
T ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6778889999999999999999998876
No 156
>PRK00252 alaS alanyl-tRNA synthetase; Reviewed
Probab=25.80 E-value=2.1e+02 Score=25.47 Aligned_cols=42 Identities=24% Similarity=0.193 Sum_probs=29.2
Q ss_pred HHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 38 YEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 38 YEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
=+......-|.....-.|..|+.++..++.++..++.+++.+
T Consensus 704 ~~~~~~l~~~~~~l~~~v~~l~~e~k~l~k~~~~l~~~l~~~ 745 (865)
T PRK00252 704 KELAALLKAKPSELPERVEALLEELKELEKELEQLKAKLAAA 745 (865)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444556667778888888888888888888777654
No 157
>TIGR00103 DNA_YbaB_EbfC DNA-binding protein, YbaB/EbfC family. The function of this protein is unknown, but it has been expressed and crystallized. Its gene nearly always occurs next to recR and/or dnaX. It is restricted to Bacteria and the plant Arabidopsis. The plant form contains an additional N-terminal region that may serve as a transit peptide and shows a close relationship to the cyanobacterial member, suggesting that it is a chloroplast protein. Members of this family are found in a single copy per bacterial genome, but are broadly distributed. A member is present even in the minimal gene complement of Mycoplasm genitalium.
Probab=25.69 E-value=1.5e+02 Score=19.51 Aligned_cols=24 Identities=29% Similarity=0.372 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
+..+-+|.++.|.++..+|.+|+.
T Consensus 7 ~~~m~kqaq~mQ~k~~~~q~eL~~ 30 (102)
T TIGR00103 7 LGELMKQAQQMQEKMKKLQEEIAQ 30 (102)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Confidence 556667777777777777777764
No 158
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=25.47 E-value=1.3e+02 Score=21.46 Aligned_cols=25 Identities=32% Similarity=0.415 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+..|+.|++.+++++..+..-+..+
T Consensus 15 lq~l~~qie~L~~~i~~l~~~~~e~ 39 (145)
T COG1730 15 LQILQSQIESLQAQIAALNAAISEL 39 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455666666666666555554443
No 159
>TIGR02044 CueR Cu(I)-responsive transcriptional regulator. This model represents the copper-, silver- and gold- (I) responsive transcriptional activator of the gamma proteobacterial copper efflux system. This protein is a member of the MerR family of transcriptional activators (pfam00376) and contains a distinctive pattern of cysteine residues in its metal binding loop, Cys-X7-Cys. This family also lacks a conserved cysteine at the N-terminal end of the dimerization helix which is required for the binding of divalent metals such as zinc; here it is replaced by a serine residue.
Probab=25.29 E-value=2e+02 Score=18.98 Aligned_cols=28 Identities=11% Similarity=0.162 Sum_probs=17.8
Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 50 GCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 50 Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
.|......|++++..++++++.++..+.
T Consensus 76 ~~~~~~~~l~~~~~~l~~~i~~L~~~~~ 103 (127)
T TIGR02044 76 TSADVKARTLEKVAEIERKISELQSMRD 103 (127)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455566677777777776666665544
No 160
>cd01106 HTH_TipAL-Mta Helix-Turn-Helix DNA binding domain of the transcription regulators TipAL, Mta, and SkgA. Helix-turn-helix (HTH) TipAL, Mta, and SkgA transcription regulators, and related proteins, N-terminal domain. TipAL regulates resistance to and activation by numerous cyclic thiopeptide antibiotics, such as thiostrepton. Mta is a global transcriptional regulator; the N-terminal DNA-binding domain of Mta interacts directly with the promoters of mta, bmr, blt, and ydfK, and induces transcription of these multidrug-efflux transport genes. SkgA has been shown to control stationary-phase expression of catalase-peroxidase in Caulobacter crescentus. These proteins are comprised of distinct domains that harbor an N-terminal active (DNA-binding) site and a regulatory (effector-binding) site. The conserved N-terminal domain of these transcription regulators contains winged HTH motifs that mediate DNA binding. These proteins share the N-terminal DNA binding domain with other transcrip
Probab=25.26 E-value=1.5e+02 Score=18.75 Aligned_cols=28 Identities=11% Similarity=0.064 Sum_probs=20.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
--....|+++...++++++.+++.++.+
T Consensus 72 ~~~~~~l~~~~~~l~~~i~~l~~~~~~l 99 (103)
T cd01106 72 EDLLEALREQKELLEEKKERLDKLIKTI 99 (103)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3356678888888888888887776654
No 161
>PRK00153 hypothetical protein; Validated
Probab=25.11 E-value=1.3e+02 Score=19.58 Aligned_cols=24 Identities=33% Similarity=0.401 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
+..+-+|.+..|.++..+|.+|+.
T Consensus 5 ~~~m~~qaq~~q~~~~~~q~~l~~ 28 (104)
T PRK00153 5 MQNLMKQAQQMQEKMQKMQEELAQ 28 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhc
Confidence 566777888888888888888764
No 162
>PF12737 Mating_C: C-terminal domain of homeodomain 1; InterPro: IPR024441 Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered to be a novel innovation that could have only evolved once, and is thus unique to the mushroom fungi. This domain is found in the C-terminal of some mating-type proteins.
Probab=24.85 E-value=69 Score=26.45 Aligned_cols=19 Identities=32% Similarity=0.482 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHHHHh
Q 047804 59 QKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 59 q~qi~~lq~eLa~~~~~la 77 (81)
++|++.|+++.+.++++||
T Consensus 401 ~reL~eLeAq~~aL~AELA 419 (419)
T PF12737_consen 401 RRELEELEAQARALRAELA 419 (419)
T ss_pred HHHHHHHHHHHHHHHhhhC
Confidence 5678888888888888885
No 163
>PF05546 She9_MDM33: She9 / Mdm33 family; InterPro: IPR008839 Members of this family are mitochondrial inner membrane proteins with a role in inner mitochondrial membrane organisation and biogenesis []. The yeast Mdm33 protein assembles into an oligomeric complex in the inner membrane where it performs homotypic protein-protein interactions. It has been suggested that Mdm33 plays a distinct role, possibly involved in fission of the mitochondrial inner membrane [].
Probab=24.65 E-value=3.1e+02 Score=20.86 Aligned_cols=41 Identities=27% Similarity=0.401 Sum_probs=31.8
Q ss_pred HHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 35 IMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 35 SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+=+..|..+..| +-| |..|-.|+.+|..++.+|+.++..+.
T Consensus 16 ~~i~~as~~lNd-~TG-Ys~Ie~LK~~i~~~E~~l~~~r~~~~ 56 (207)
T PF05546_consen 16 ETIFTASQALND-VTG-YSEIEKLKKSIEELEDELEAARQEVR 56 (207)
T ss_pred HHHHHHHHHHHh-ccC-hHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345556666655 455 99999999999999999999887764
No 164
>PF12001 DUF3496: Domain of unknown function (DUF3496); InterPro: IPR021885 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 110 amino acids in length.
Probab=24.47 E-value=1.1e+02 Score=21.17 Aligned_cols=17 Identities=24% Similarity=0.599 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHH
Q 047804 57 HLQKQVSELQAELAKAQ 73 (81)
Q Consensus 57 ~Lq~qi~~lq~eLa~~~ 73 (81)
+++..|.+++.||.+++
T Consensus 4 QmElrIkdLeselsk~K 20 (111)
T PF12001_consen 4 QMELRIKDLESELSKMK 20 (111)
T ss_pred HHHHHHHHHHHHHHHhH
Confidence 45556666666666665
No 165
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.43 E-value=1.3e+02 Score=21.84 Aligned_cols=23 Identities=22% Similarity=0.278 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q 047804 56 CHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 56 ~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
..+|.|++.++.+|..-+++|..
T Consensus 37 ~~~q~ELe~~K~~ld~~rqel~~ 59 (138)
T COG3105 37 QKLQYELEKVKAQLDEYRQELVK 59 (138)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 36777777777777777777653
No 166
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=24.34 E-value=1.1e+02 Score=23.13 Aligned_cols=24 Identities=17% Similarity=0.293 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-|..||++|.+|+-++...+.+|.
T Consensus 62 ql~~lq~ev~~LrG~~E~~~~~l~ 85 (263)
T PRK10803 62 QLSDNQSDIDSLRGQIQENQYQLN 85 (263)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHH
Confidence 344455555555444444444443
No 167
>PRK09039 hypothetical protein; Validated
Probab=24.13 E-value=1.1e+02 Score=24.11 Aligned_cols=26 Identities=23% Similarity=0.241 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
+...+.+|..|++|++.++++|+..|
T Consensus 132 ~se~~~~V~~L~~qI~aLr~Qla~le 157 (343)
T PRK09039 132 SARALAQVELLNQQIAALRRQLAALE 157 (343)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 34455556666666666666666543
No 168
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=23.76 E-value=1.4e+02 Score=19.84 Aligned_cols=22 Identities=36% Similarity=0.445 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q 047804 56 CHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 56 ~~Lq~qi~~lq~eLa~~~~~la 77 (81)
..|+++++.+++++..+.+.+.
T Consensus 16 ~~l~~~~~~l~~~~~~l~~~~~ 37 (140)
T PRK03947 16 QALQAQIEALQQQLEELQASIN 37 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444443
No 169
>PF05615 THOC7: Tho complex subunit 7; InterPro: IPR008501 The Tho complex (THOC) is involved in transcription elongation and mRNA export from the nucleus []. This entry represents the subunit THOC7, which is found in higher eukaryotes, and the non-homologous subunit Mft1p found in yeast. The funtions of these subunits are unknown, and it is not known if these subunits are functionally equivalent.
Probab=23.75 E-value=1.6e+02 Score=19.74 Aligned_cols=24 Identities=25% Similarity=0.353 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-|...+.+|..++.+|..++....
T Consensus 89 ~i~~~k~~ie~lk~~L~~ak~~r~ 112 (139)
T PF05615_consen 89 EIEQAKKEIEELKEELEEAKRVRQ 112 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455677777777777777776554
No 170
>COG4467 Regulator of replication initiation timing [Replication, recombination, and repair]
Probab=23.57 E-value=1.5e+02 Score=20.81 Aligned_cols=26 Identities=31% Similarity=0.302 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
-.|..|++||..+-+|+..++++|..
T Consensus 8 d~v~~le~~l~~l~~el~~lK~~l~~ 33 (114)
T COG4467 8 DQVDNLEEQLGVLLAELGGLKQHLGS 33 (114)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34667777777777777777776653
No 171
>PF07996 T4SS: Type IV secretion system proteins; InterPro: IPR014158 This entry contains VirB5, a protein that is involved in the type IV DNA secretion systems typified by the Agrobacterium Ti plasmid vir system where it interacts with several other proteins essential for proper pilus formation []. VirB5 is homologous to the IncN (N-type) conjugation system protein TraC [] as well as the P-type protein TrbJ and the F-type protein TraE [].; PDB: 1R8I_A.
Probab=23.38 E-value=1.7e+02 Score=20.28 Aligned_cols=23 Identities=17% Similarity=0.391 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|-.+.+|++.++.+|..++..+.
T Consensus 21 ~~~~~~q~~q~~~Ql~~~k~q~~ 43 (195)
T PF07996_consen 21 LAQWKQQLEQLKQQLQQAKQQYN 43 (195)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 55566777777777777776665
No 172
>PRK14621 hypothetical protein; Provisional
Probab=23.37 E-value=1.6e+02 Score=20.01 Aligned_cols=24 Identities=17% Similarity=0.239 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
...+-+|.+..|+++..+|++|+.
T Consensus 6 m~~mmkqaq~mQ~km~~~Q~eL~~ 29 (111)
T PRK14621 6 LGDMMKQIQQAGEKMQDVQKQLEK 29 (111)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHc
Confidence 566777888888888888888864
No 173
>PF06717 DUF1202: Protein of unknown function (DUF1202); InterPro: IPR009592 This family consists of several hypothetical bacterial proteins of around 335 residues in length. Members of this family are found exclusively in Escherichia coli and Salmonella species and are often referred to as YggM proteins. The function of this family is unknown.
Probab=23.35 E-value=1.2e+02 Score=24.49 Aligned_cols=26 Identities=19% Similarity=0.262 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
-|-.|+.|++.|+.+...++.++..|
T Consensus 153 ~I~~L~~qisaLdkqi~ai~Kkid~y 178 (308)
T PF06717_consen 153 KIPGLNKQISALDKQIVAINKKIDRY 178 (308)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 35567777777777777777766544
No 174
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=23.32 E-value=1.4e+02 Score=23.72 Aligned_cols=28 Identities=18% Similarity=0.275 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
..+..|++|+..++.+|..++.+|..|+
T Consensus 161 ~~~~fl~~ql~~~~~~L~~ae~~l~~f~ 188 (498)
T TIGR03007 161 SAQRFIDEQIKTYEKKLEAAENRLKAFK 188 (498)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4788899999999999999999998775
No 175
>PRK10884 SH3 domain-containing protein; Provisional
Probab=23.25 E-value=1.2e+02 Score=22.47 Aligned_cols=23 Identities=22% Similarity=0.163 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-..|++|+..+++++..+++++.
T Consensus 141 n~~L~~~l~~~~~~~~~l~~~~~ 163 (206)
T PRK10884 141 NQKLKNQLIVAQKKVDAANLQLD 163 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555555544
No 176
>cd04784 HTH_CadR-PbrR Helix-Turn-Helix DNA binding domain of the CadR and PbrR transcription regulators. Helix-turn-helix (HTH) CadR and PbrR transcription regulators including Pseudomonas aeruginosa CadR and Ralstonia metallidurans PbrR that regulate expression of the cadmium and lead resistance operons, respectively. These proteins are comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the C-terminal domains have three conserved cysteines which form a putative metal binding site. Some members in this group have a histidine-rich C-terminal extension. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements.
Probab=23.22 E-value=2.3e+02 Score=18.60 Aligned_cols=30 Identities=20% Similarity=0.333 Sum_probs=19.7
Q ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 47 PVYGCVGAVCHLQKQVSELQAELAKAQAGL 76 (81)
Q Consensus 47 PV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~l 76 (81)
|-..|-.....|+.++..++++++.++.-.
T Consensus 73 ~~~~~~~~~~~l~~~~~~l~~~i~~L~~~~ 102 (127)
T cd04784 73 PEASCAEVNALIDEHLAHVRARIAELQALE 102 (127)
T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 434456667778888888877776665433
No 177
>PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional
Probab=22.91 E-value=1.2e+02 Score=25.86 Aligned_cols=29 Identities=28% Similarity=0.426 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
...+..|++|+..++.+|..++.+|..|+
T Consensus 266 ~~a~~fL~~qL~~l~~~L~~aE~~l~~fr 294 (726)
T PRK09841 266 SQSLEFLQRQLPEVRSELDQAEEKLNVYR 294 (726)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44678899999999999999999998775
No 178
>PRK11239 hypothetical protein; Provisional
Probab=22.75 E-value=1.5e+02 Score=22.79 Aligned_cols=55 Identities=18% Similarity=0.122 Sum_probs=37.1
Q ss_pred hhHHHHhhhccccCHHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 15 LQSWVNLATGACRTKRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 15 ~~r~~~~~~g~~~~k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+.||..|+-|---. ... +..-..++.-.--+.|..|+.++..|+++++.+.+++.
T Consensus 160 E~Ry~hLl~g~v~~-----~~~---~~~~~~~~~~~Le~rv~~Le~eva~L~~~l~~l~~~~~ 214 (215)
T PRK11239 160 ESRYMHLFSGEVDD-----QPA---VTDMSNAVDGDLQARVEALEIEVAELKQRLDSLLAHLG 214 (215)
T ss_pred hHHHHHHhcCCccc-----ccc---ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 57888888875111 000 01111344444678999999999999999999988764
No 179
>PF02185 HR1: Hr1 repeat; InterPro: IPR000861 The HR1 repeat was first described as a three times repeated homology region of the N-terminal non-catalytic part of protein kinase PRK1(PKN) []. The first two of these repeats were later shown to bind the small G protein rho [, ] known to activate PKN in its GTP-bound form. Similar rho-binding domains also occur in a number of other protein kinases and in the rho-binding proteins rhophilin and rhotekin. Recently, the structure of the N-terminal HR1 repeat complexed with RhoA has been determined by X-ray crystallography []. It forms an antiparallel coiled-coil fold termed an ACC finger. This entry includes domains found within rho-associated protein kinases.; GO: 0007165 signal transduction, 0005622 intracellular; PDB: 1CXZ_B 3O0Z_C 2RMK_B 1URF_A.
Probab=22.67 E-value=1.8e+02 Score=17.34 Aligned_cols=26 Identities=12% Similarity=0.181 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+....+.++......|..++.+|..+
T Consensus 34 ~~~~~~~~l~~s~~kI~~L~~~L~~l 59 (70)
T PF02185_consen 34 VLSEAESQLRESNQKIELLREQLEKL 59 (70)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555555555555555555444
No 180
>KOG0999 consensus Microtubule-associated protein Bicaudal-D [Intracellular trafficking, secretion, and vesicular transport]
Probab=22.65 E-value=1.4e+02 Score=26.70 Aligned_cols=42 Identities=14% Similarity=0.178 Sum_probs=35.2
Q ss_pred HHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 34 LIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 34 ~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.|++-|+-++-.. -.+.|..||.++.+++++|+.+|++...+
T Consensus 92 esLLqESaakE~~----yl~kI~eleneLKq~r~el~~~q~E~erl 133 (772)
T KOG0999|consen 92 ESLLQESAAKEEY----YLQKILELENELKQLRQELTNVQEENERL 133 (772)
T ss_pred HHHHHHHHHhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4688888887764 46899999999999999999999987653
No 181
>PRK14627 hypothetical protein; Provisional
Probab=22.55 E-value=1.7e+02 Score=19.34 Aligned_cols=23 Identities=26% Similarity=0.335 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q 047804 56 CHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 56 ~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
..+-+|.+..|.++..+|.+|+.
T Consensus 4 ~~~mkqaq~mQ~km~~~Q~el~~ 26 (100)
T PRK14627 4 RQLMQMAQQMQRQMQKVQEELAA 26 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhc
Confidence 34556666677777777777664
No 182
>COG2919 Septum formation initiator [Cell division and chromosome partitioning]
Probab=22.48 E-value=1.8e+02 Score=19.40 Aligned_cols=24 Identities=42% Similarity=0.479 Sum_probs=16.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAG 75 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~ 75 (81)
+..+..++.|+...++|++.+.+.
T Consensus 49 ~~~~~~l~~qi~~~~~e~~~L~~~ 72 (117)
T COG2919 49 AADVLQLQRQIAAQQAELEKLSAR 72 (117)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456677777887777777766554
No 183
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=22.41 E-value=1.8e+02 Score=19.99 Aligned_cols=28 Identities=29% Similarity=0.479 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHHH-------HHHHHHHHHhhcc
Q 047804 53 GAVCHLQKQVSELQA-------ELAKAQAGLAFMQ 80 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~-------eLa~~~~~la~~q 80 (81)
||+..||.+-...+. |-+..++.|++.|
T Consensus 4 GVl~fLQ~Ew~r~ErdR~~WeiERaEmkarIa~LE 38 (134)
T PF08232_consen 4 GVLHFLQTEWHRFERDRNQWEIERAEMKARIAFLE 38 (134)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Confidence 777777777665533 3344555555544
No 184
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=22.27 E-value=1.4e+02 Score=22.80 Aligned_cols=20 Identities=10% Similarity=0.283 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q 047804 58 LQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 58 Lq~qi~~lq~eLa~~~~~la 77 (81)
|+.++...+.++..++++++
T Consensus 98 LE~elr~~~~~~~~L~~Ev~ 117 (248)
T PF08172_consen 98 LEEELRKQQQTISSLRREVE 117 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444443
No 185
>PF05816 TelA: Toxic anion resistance protein (TelA); InterPro: IPR008863 This family consists of several prokaryotic TelA like proteins. TelA and KlA are associated with tellurite resistance [] and plasmid fertility inhibition [].
Probab=22.26 E-value=2.4e+02 Score=21.90 Aligned_cols=21 Identities=19% Similarity=0.229 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q 047804 57 HLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 57 ~Lq~qi~~lq~eLa~~~~~la 77 (81)
.+-.+.+.++.+++.+-.+|.
T Consensus 88 ~~~~ky~sv~~qId~I~~~L~ 108 (333)
T PF05816_consen 88 RYFAKYQSVQSQIDKIIAELE 108 (333)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444444443
No 186
>PF00831 Ribosomal_L29: Ribosomal L29 protein; InterPro: IPR001854 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L29 is one of the proteins from the large ribosomal subunit. L29 belongs to a family of ribosomal proteins of 63 to 138 amino-acid residues which, on the basis of sequence similarities [], groups: Red algal L29. Bacterial L29. Mammalian L35 Caenorhabditis elegans L35 (ZK652.4). Yeast L35. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1VSP_W 3MS1_Y 3MRZ_Y 3F1H_2 3PYT_Y 3PYO_Y 3D5D_2 3D5B_2 3PYR_Y 1VSA_W ....
Probab=22.26 E-value=1.8e+02 Score=17.14 Aligned_cols=24 Identities=29% Similarity=0.221 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
...|+.++..++.||..++.+.+.
T Consensus 9 ~~eL~~~l~elk~eL~~Lr~q~~~ 32 (58)
T PF00831_consen 9 DEELQEKLEELKKELFNLRFQKAT 32 (58)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Confidence 356888888888888888776653
No 187
>cd01282 HTH_MerR-like_sg3 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 3). Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=22.09 E-value=2.3e+02 Score=18.33 Aligned_cols=32 Identities=9% Similarity=0.234 Sum_probs=24.7
Q ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 47 PVYGCVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 47 PV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
|-..|......|+.++..+.++++.++.....
T Consensus 75 ~~~~~~~~~~~l~~~~~~l~~~i~~L~~~~~~ 106 (112)
T cd01282 75 TFRPCPDLLAVLRRELARIDRQIADLTRSRDR 106 (112)
T ss_pred cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34667888888999999998888877765543
No 188
>TIGR02051 MerR Hg(II)-responsive transcriptional regulator. This model represents the mercury (II) responsive transcriptional activator of the mer organomercurial resistance operon. This protein is a member of the MerR family of transcriptional activators (pfam00376) and contains a distinctive pattern of cysteine residues in its metal binding loop, Cys-X(8)-Cys-Pro, as well as a conserved and critical cysteine at the N-terminal end of the dimerization helix.
Probab=21.86 E-value=2.1e+02 Score=18.94 Aligned_cols=29 Identities=14% Similarity=0.077 Sum_probs=20.7
Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 49 YGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 49 ~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
..|......|++++..++++++.++.-..
T Consensus 72 ~~~~~~~~~l~~~~~~l~~~i~~L~~~~~ 100 (124)
T TIGR02051 72 THCREMYELASRKLKSVQAKMADLLRIER 100 (124)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34677778888888888877777665443
No 189
>PRK14164 heat shock protein GrpE; Provisional
Probab=21.83 E-value=1.9e+02 Score=21.93 Aligned_cols=36 Identities=22% Similarity=0.288 Sum_probs=29.2
Q ss_pred cCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 45 RDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 45 rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
-||---.-+.+..|+.++..++..+-.++++...|.
T Consensus 69 ~~~~~~~~~~~~~le~el~el~d~llR~~AE~eN~R 104 (218)
T PRK14164 69 VDPELADDGEASTVEAQLAERTEDLQRVTAEYANYR 104 (218)
T ss_pred cCcccCChhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445444558899999999999999999999887763
No 190
>PF05465 Halo_GVPC: Halobacterial gas vesicle protein C (GVPC) repeat; InterPro: IPR008639 This family consists of Halobacterium gas vesicle protein C sequences which are thought to confer stability to the gas vesicle membranes [,].; GO: 0031412 gas vesicle organization, 0031411 gas vesicle
Probab=21.64 E-value=1.5e+02 Score=16.03 Aligned_cols=25 Identities=20% Similarity=0.345 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 55 VCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
|..|.-.|..++++...++..-..|
T Consensus 1 V~~l~a~I~~~r~~f~~~~~aF~aY 25 (32)
T PF05465_consen 1 VSDLLAAIAEFREEFDDTQDAFEAY 25 (32)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3566677777777777777766554
No 191
>KOG3478 consensus Prefoldin subunit 6, KE2 family [Posttranslational modification, protein turnover, chaperones]
Probab=21.62 E-value=1.6e+02 Score=20.86 Aligned_cols=27 Identities=22% Similarity=0.373 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.|..++.+|...+.+..+.+..++.+|
T Consensus 84 Eikr~e~~i~d~q~e~~k~R~~v~k~Q 110 (120)
T KOG3478|consen 84 EIKRLENQIRDSQEEFEKQREAVIKLQ 110 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455667777777777777777776665
No 192
>PRK14149 heat shock protein GrpE; Provisional
Probab=21.61 E-value=1.7e+02 Score=21.70 Aligned_cols=28 Identities=7% Similarity=0.148 Sum_probs=23.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
..+..|+.++..++..+..++|++..|.
T Consensus 43 ~~~~~l~~e~~elkd~~lR~~AefEN~r 70 (191)
T PRK14149 43 EIKEDFELKYKEMHEKYLRVHADFENVK 70 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3578899999999999999998887663
No 193
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=21.53 E-value=1.6e+02 Score=20.66 Aligned_cols=25 Identities=12% Similarity=0.209 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-.|..|+++++..+.|++.++.+..
T Consensus 161 ~ei~~lk~el~~~~~~~~~LkkQ~~ 185 (192)
T PF05529_consen 161 EEIEKLKKELEKKEKEIEALKKQSE 185 (192)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555555554443
No 194
>PHA03386 P10 fibrous body protein; Provisional
Probab=21.49 E-value=1.5e+02 Score=20.15 Aligned_cols=14 Identities=36% Similarity=0.385 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHH
Q 047804 54 AVCHLQKQVSELQA 67 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~ 67 (81)
-|..||.|+..+++
T Consensus 20 KVdaLQ~qV~dv~~ 33 (94)
T PHA03386 20 KVDALQTQLNGLEE 33 (94)
T ss_pred HHHHHHHHHHHHHh
Confidence 45678888887774
No 195
>cd01109 HTH_YyaN Helix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB. Putative helix-turn-helix (HTH) MerR-like transcription regulators of Bacillus subtilis, YyaN and YraB, and related proteins; N-terminal domain. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=21.49 E-value=2.2e+02 Score=18.31 Aligned_cols=28 Identities=25% Similarity=0.337 Sum_probs=20.2
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
|-..+..|+.++..++.+++.+++.++.
T Consensus 77 ~~~~~~~l~~~~~~l~~~i~~l~~~~~~ 104 (113)
T cd01109 77 IPERLELLEEHREELEEQIAELQETLAY 104 (113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556777888888888887777766654
No 196
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=21.43 E-value=1.2e+02 Score=19.09 Aligned_cols=31 Identities=19% Similarity=0.298 Sum_probs=20.8
Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 50 GCVGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 50 Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
|----+-.-+.+|..++.++..-++-|+.|+
T Consensus 49 gi~~s~eeq~~~i~~Le~~i~~k~~~L~~~~ 79 (83)
T PF07544_consen 49 GIDRSVEEQEEEIEELEEQIRKKREVLQKFK 79 (83)
T ss_pred CccCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444455667777777777777777777664
No 197
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=21.40 E-value=1.5e+02 Score=24.90 Aligned_cols=29 Identities=10% Similarity=0.146 Sum_probs=23.6
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 51 CVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 51 c~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+--.+..|+.++++++++++.+++++..+
T Consensus 213 p~~~l~~l~~~l~~l~~~~~~~~~~l~~~ 241 (646)
T PRK05771 213 PSELIREIKEELEEIEKERESLLEELKEL 241 (646)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678889999999999998888888754
No 198
>COG0718 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.31 E-value=2.1e+02 Score=19.48 Aligned_cols=27 Identities=30% Similarity=0.281 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+-+..|-+|.++.|.++...|.+|+..
T Consensus 5 ~~~~~l~kqaqqmQ~~~~~~Q~ela~~ 31 (105)
T COG0718 5 MDMQKLMKQAQQMQKKMQKMQEELAQK 31 (105)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 456777888888888888888888754
No 199
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=21.25 E-value=1.5e+02 Score=22.64 Aligned_cols=27 Identities=22% Similarity=0.458 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.|..|+.+++.++++--++=.++-++|
T Consensus 108 ~~~~L~~Ev~~L~~DN~kLYEKiRylq 134 (248)
T PF08172_consen 108 TISSLRREVESLRADNVKLYEKIRYLQ 134 (248)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334444444444444333333333333
No 200
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=21.22 E-value=1.5e+02 Score=19.29 Aligned_cols=66 Identities=20% Similarity=0.141 Sum_probs=39.6
Q ss_pred HHHHHhhhhhhhHHHHhhhcc---ccCHHHHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 5 FLKLLESFRELQSWVNLATGA---CRTKRDCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 5 v~k~l~~~~~~~r~~~~~~g~---~~~k~da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
+.+-|+.++++.+ |=.+.|- .+.++++...+ .|=+-.+-..|..|+.++..++.++..++..|..+
T Consensus 39 v~~eL~~l~~d~~-vyk~VG~vlv~~~~~e~~~~l--------~~r~e~ie~~i~~lek~~~~l~~~l~e~q~~l~~~ 107 (110)
T TIGR02338 39 ALEELERLPDDTP-VYKSVGNLLVKTDKEEAIQEL--------KEKKETLELRVKTLQRQEERLREQLKELQEKIQEA 107 (110)
T ss_pred HHHHHHcCCCcch-hHHHhchhhheecHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555666665443 4344442 22244444433 22333447788888888888888888888887643
No 201
>PF06818 Fez1: Fez1; InterPro: IPR009638 This family represents the eukaryotic Fez1 protein. Fez1 contains a leucine-zipper region with similarity to the DNA-binding domain of the cAMP-responsive activating-transcription factor 5 []. There is evidence that Fez1 inhibits cancer cell growth through regulation of mitosis, and that its alterations result in abnormal cell growth []. Note that some family members contain more than one copy of this region.; GO: 0005737 cytoplasm, 0016020 membrane
Probab=21.14 E-value=1.6e+02 Score=22.21 Aligned_cols=24 Identities=29% Similarity=0.452 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAG 75 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~ 75 (81)
.|.|+-|.+|+.+.++|++.=-++
T Consensus 9 ~GEIsLLKqQLke~q~E~~~K~~E 32 (202)
T PF06818_consen 9 SGEISLLKQQLKESQAEVNQKDSE 32 (202)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHhH
Confidence 588999999999999888754333
No 202
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=21.12 E-value=1.2e+02 Score=24.60 Aligned_cols=29 Identities=14% Similarity=0.211 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 52 VGAVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
--.|..|+.++..++++++.+++++++.+
T Consensus 77 ~~~l~~l~~~~~~~~~~~~~~~~~~~~l~ 105 (525)
T TIGR02231 77 RKQIRELEAELRDLEDRGDALKALAKFLE 105 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678899999999999999999888764
No 203
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=21.08 E-value=1.6e+02 Score=20.71 Aligned_cols=19 Identities=37% Similarity=0.337 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q 047804 52 VGAVCHLQKQVSELQAELA 70 (81)
Q Consensus 52 ~G~I~~Lq~qi~~lq~eLa 70 (81)
.+-+-.|+.+|..++++..
T Consensus 50 ~~d~eeLk~~i~~lq~~~~ 68 (155)
T PF06810_consen 50 AKDNEELKKQIEELQAKNK 68 (155)
T ss_pred cCCHHHHHHHHHHHHHHHH
Confidence 3445666666666666666
No 204
>PF04697 Pinin_SDK_N: pinin/SDK conserved region; InterPro: IPR006787 This conserved region is found at the N-terminal of the member proteins. It is located adjacent and N-terminal to the pinin/SKD/memA domain IPR006786 from INTERPRO. Members of this family have very varied localisations within the eukaryotic cell. Pinin is known to localise at the desmosomes and is implicated in anchoring intermediate filaments to the desmosomal plaque [, ]. SDK2/3 is a dynamically localised nuclear protein thought to be involved in modulation of alternative pre-mRNA splicing []. MemA is a tumour marker preferentially expressed in human melanoma cell lines. A common feature of the members of this family is that they may all participate in regulating protein-protein interactions [].
Probab=20.88 E-value=1.6e+02 Score=21.21 Aligned_cols=24 Identities=25% Similarity=0.338 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+|..||.|++.++..|..+-.-|.
T Consensus 4 av~~Lq~qlE~Ake~Lk~vDenIk 27 (134)
T PF04697_consen 4 AVRTLQAQLEKAKESLKNVDENIK 27 (134)
T ss_pred hHHHHHHHHHHHHHHhhhhhHHHH
Confidence 577788888888888877765554
No 205
>cd04776 HTH_GnyR Helix-Turn-Helix DNA binding domain of the regulatory protein GnyR. Putative helix-turn-helix (HTH) regulatory protein, GnyR, and other related proteins. GnyR belongs to the gnyRDBHAL cluster, which is involved in acyclic isoprenoid degradation in Pseudomonas aeruginosa. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
Probab=20.86 E-value=2.3e+02 Score=18.72 Aligned_cols=26 Identities=23% Similarity=0.257 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 53 GAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 53 G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
..+..|.+++..++.+++.+++.+..
T Consensus 80 ~~~~~l~~~~~~l~~~~~~l~~~~~~ 105 (118)
T cd04776 80 KMLEKIEKRRAELEQQRRDIDAALAE 105 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455666666666666666555443
No 206
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=20.82 E-value=2e+02 Score=17.62 Aligned_cols=19 Identities=16% Similarity=0.350 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q 047804 55 VCHLQKQVSELQAELAKAQ 73 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~ 73 (81)
|..++.|++.+..++..+.
T Consensus 16 i~tvk~en~~i~~~ve~i~ 34 (55)
T PF05377_consen 16 INTVKKENEEISESVEKIE 34 (55)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4445555555555544443
No 207
>PRK13923 putative spore coat protein regulator protein YlbO; Provisional
Probab=20.80 E-value=1.6e+02 Score=21.73 Aligned_cols=27 Identities=22% Similarity=0.243 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcc
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLAFMQ 80 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la~~q 80 (81)
.+..|++++..++.++..++.++..++
T Consensus 112 e~~kl~~~~e~L~~e~~~L~~~~~~~~ 138 (170)
T PRK13923 112 QIGKLQEEEEKLSWENQTLKQELAITE 138 (170)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 577888899999999998888887654
No 208
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=20.61 E-value=3.1e+02 Score=19.89 Aligned_cols=44 Identities=14% Similarity=0.025 Sum_probs=30.8
Q ss_pred HHHHHHHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHH
Q 047804 31 DCKLIMDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 31 da~~SlvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
+++.+=+-++--.+.||+-.--=.|..++..|..++..|+.+.+
T Consensus 9 ~iv~a~~n~~~dk~EDP~~~l~q~irem~~~l~~ar~~lA~~~a 52 (219)
T TIGR02977 9 DIVNSNLNALLDKAEDPEKMIRLIIQEMEDTLVEVRTTSARTIA 52 (219)
T ss_pred HHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666666777778899877666777777777777777666544
No 209
>PF10152 DUF2360: Predicted coiled-coil domain-containing protein (DUF2360); InterPro: IPR019309 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53.
Probab=20.52 E-value=2.2e+02 Score=19.81 Aligned_cols=30 Identities=20% Similarity=0.373 Sum_probs=24.9
Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Q 047804 50 GCVGAVCHLQKQVSELQAELAKAQAGLAFM 79 (81)
Q Consensus 50 Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~~ 79 (81)
.|=.....+.++|++++..|..+.++|+.+
T Consensus 18 ~cE~kL~~~e~~Lq~~E~~l~iLEaKL~SI 47 (148)
T PF10152_consen 18 VCEEKLSDMEQRLQRLEATLNILEAKLSSI 47 (148)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 355667889999999999999999999764
No 210
>PF12709 Kinetocho_Slk19: Central kinetochore-associated; InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=20.44 E-value=2.1e+02 Score=19.07 Aligned_cols=24 Identities=21% Similarity=0.221 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 54 AVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
-|..|+.++..+..|...++.+|.
T Consensus 50 ~v~~L~~e~~~l~~E~e~L~~~l~ 73 (87)
T PF12709_consen 50 KVDELENENKALKRENEQLKKKLD 73 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666666666666666666554
No 211
>KOG4673 consensus Transcription factor TMF, TATA element modulatory factor [Transcription]
Probab=20.40 E-value=1.7e+02 Score=26.83 Aligned_cols=37 Identities=30% Similarity=0.402 Sum_probs=30.6
Q ss_pred HHHHHhhhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047804 36 MDYEANSRIRDPVYGCVGAVCHLQKQVSELQAELAKAQAGLAF 78 (81)
Q Consensus 36 lvYEA~aR~rDPV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la~ 78 (81)
=-|||--|.|| |.+.|||.+|..+++....+-++|..
T Consensus 855 eh~eall~Qre------GElthlq~e~~~le~~Rs~laeElvk 891 (961)
T KOG4673|consen 855 EHYEALLRQRE------GELTHLQTELASLESIRSSLAEELVK 891 (961)
T ss_pred HHHHHHHHhhc------chHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 47999999987 89999999999998877777666653
No 212
>PF10066 DUF2304: Uncharacterized conserved protein (DUF2304); InterPro: IPR019277 This entry represents hypothetical archaeal and bacterial proteins that have no known function.
Probab=20.14 E-value=1.7e+02 Score=19.21 Aligned_cols=21 Identities=24% Similarity=0.452 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q 047804 54 AVCHLQKQVSELQAELAKAQA 74 (81)
Q Consensus 54 ~I~~Lq~qi~~lq~eLa~~~~ 74 (81)
.+++++.++.++-+|+|..++
T Consensus 87 ~is~le~~i~~L~qeiAl~e~ 107 (115)
T PF10066_consen 87 RISRLEEKIKRLAQEIALLEA 107 (115)
T ss_pred HHHHHHHHHHHHHHHHHHHhh
Confidence 455666666666666655543
No 213
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=20.04 E-value=1.8e+02 Score=19.57 Aligned_cols=23 Identities=30% Similarity=0.450 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q 047804 55 VCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 55 I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
+..|+..+..++.+++.++.+++
T Consensus 68 ~~~l~~~~~rL~~~~~~~ere~~ 90 (151)
T PF11559_consen 68 IERLQNDVERLKEQLEELERELA 90 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444443
No 214
>PRK10227 DNA-binding transcriptional regulator CueR; Provisional
Probab=20.02 E-value=2.8e+02 Score=18.84 Aligned_cols=31 Identities=13% Similarity=0.140 Sum_probs=17.5
Q ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047804 47 PVYGCVGAVCHLQKQVSELQAELAKAQAGLA 77 (81)
Q Consensus 47 PV~Gc~G~I~~Lq~qi~~lq~eLa~~~~~la 77 (81)
|=..|......|++++..++++++.+++...
T Consensus 73 ~~~~~~~~~~~l~~~~~~l~~~i~~L~~~~~ 103 (135)
T PRK10227 73 PQRHSADVKRRTLEKVAEIERHIEELQSMRD 103 (135)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334555555566666666666665555444
Done!