Query psy7799
Match_columns 83
No_of_seqs 45 out of 47
Neff 4.6
Searched_HMMs 46136
Date Fri Aug 16 19:06:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7799.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7799hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04799 Fzo_mitofusin: fzo-li 100.0 3E-36 6.6E-41 216.9 9.2 82 1-82 90-171 (171)
2 KOG0448|consensus 99.8 4.4E-20 9.6E-25 154.3 10.5 82 1-82 668-749 (749)
3 PF00957 Synaptobrevin: Synapt 93.2 1.3 2.8E-05 27.7 9.3 50 21-70 4-53 (89)
4 cd07646 I-BAR_IMD_IRSp53 Inver 90.8 4.4 9.6E-05 30.9 9.7 32 46-77 126-177 (232)
5 PRK10884 SH3 domain-containing 88.6 8.1 0.00017 28.5 10.0 64 14-77 98-168 (206)
6 PF10157 DUF2365: Uncharacteri 87.3 5.9 0.00013 28.1 7.7 50 20-69 95-147 (149)
7 COG5185 HEC1 Protein involved 86.8 7.5 0.00016 33.2 9.2 62 15-76 255-316 (622)
8 PF06705 SF-assemblin: SF-asse 86.4 11 0.00024 27.7 10.3 66 13-78 67-137 (247)
9 PRK05561 DNA topoisomerase IV 85.0 4.8 0.0001 34.8 7.4 53 24-79 430-482 (742)
10 TIGR02338 gimC_beta prefoldin, 84.2 5.2 0.00011 26.2 5.9 44 27-73 63-106 (110)
11 PRK05560 DNA gyrase subunit A; 84.0 5.3 0.00011 34.7 7.3 53 24-79 423-475 (805)
12 TIGR01063 gyrA DNA gyrase, A s 83.7 5.8 0.00013 34.5 7.4 54 24-80 420-473 (800)
13 PF02646 RmuC: RmuC family; I 83.5 16 0.00035 27.9 9.1 66 13-78 6-85 (304)
14 TIGR01061 parC_Gpos DNA topois 83.1 6.1 0.00013 34.2 7.2 53 24-79 420-472 (738)
15 COG4942 Membrane-bound metallo 82.2 22 0.00048 29.3 9.8 64 12-76 41-108 (420)
16 PF08514 STAG: STAG domain ; 82.1 5.7 0.00012 26.7 5.5 53 5-57 64-116 (118)
17 PF06825 HSBP1: Heat shock fac 80.4 5.7 0.00012 23.9 4.5 35 14-48 11-45 (54)
18 PF11932 DUF3450: Protein of u 80.3 21 0.00045 26.3 9.7 10 68-77 111-120 (251)
19 PF06698 DUF1192: Protein of u 79.8 7.6 0.00016 23.8 5.0 34 32-65 22-55 (59)
20 TIGR01062 parC_Gneg DNA topois 79.2 9.2 0.0002 33.3 7.0 53 24-79 417-469 (735)
21 cd07645 I-BAR_IMD_BAIAP2L1 Inv 79.1 26 0.00057 26.7 9.1 18 46-63 124-141 (226)
22 KOG0860|consensus 77.9 20 0.00044 24.7 7.6 39 23-61 32-70 (116)
23 PRK13979 DNA topoisomerase IV 77.3 9.8 0.00021 34.0 6.8 66 11-79 414-491 (957)
24 PF02388 FemAB: FemAB family; 77.1 13 0.00029 29.4 7.0 46 30-75 241-293 (406)
25 PHA02562 46 endonuclease subun 76.9 11 0.00023 30.1 6.4 44 32-75 307-350 (562)
26 PF07798 DUF1640: Protein of u 76.8 23 0.00051 24.9 8.5 30 18-47 78-107 (177)
27 KOG0995|consensus 76.4 19 0.00041 30.9 8.0 63 14-76 218-280 (581)
28 PF09403 FadA: Adhesion protei 75.5 15 0.00031 25.5 6.0 36 20-55 89-124 (126)
29 PF06825 HSBP1: Heat shock fac 75.4 15 0.00033 22.1 5.4 36 19-54 9-44 (54)
30 PRK09343 prefoldin subunit bet 74.8 22 0.00048 23.8 6.7 50 26-78 66-115 (121)
31 PRK14148 heat shock protein Gr 73.5 11 0.00023 27.8 5.2 38 38-78 40-77 (195)
32 PRK08476 F0F1 ATP synthase sub 73.5 16 0.00034 25.0 5.8 34 14-47 101-134 (141)
33 PF09177 Syntaxin-6_N: Syntaxi 73.1 19 0.00041 22.9 5.8 56 18-73 17-81 (97)
34 PF07352 Phage_Mu_Gam: Bacteri 72.1 29 0.00062 23.8 7.4 49 24-78 14-62 (149)
35 TIGR03007 pepcterm_ChnLen poly 71.2 50 0.0011 26.2 9.4 64 15-78 312-378 (498)
36 PF05008 V-SNARE: Vesicle tran 70.8 20 0.00044 21.5 6.5 55 22-78 23-77 (79)
37 PF13747 DUF4164: Domain of un 69.8 27 0.00058 22.5 7.9 18 13-30 8-25 (89)
38 TIGR00414 serS seryl-tRNA synt 68.7 59 0.0013 26.1 9.0 44 35-78 48-99 (418)
39 PF10458 Val_tRNA-synt_C: Valy 68.5 15 0.00032 22.1 4.3 23 35-57 1-23 (66)
40 cd00187 TOP4c DNA Topoisomeras 68.2 18 0.00038 29.6 5.9 48 24-74 398-445 (445)
41 KOG1962|consensus 67.8 30 0.00064 26.2 6.6 51 28-78 152-202 (216)
42 PF07412 Geminin: Geminin; In 67.7 49 0.0011 24.7 8.5 70 6-76 100-170 (200)
43 TIGR01837 PHA_granule_1 poly(h 67.4 32 0.0007 23.0 6.2 28 27-54 81-112 (118)
44 PF07028 DUF1319: Protein of u 67.3 40 0.00087 23.6 8.2 60 13-79 21-84 (126)
45 PF02108 FliH: Flagellar assem 67.3 27 0.00058 22.1 5.6 36 12-47 1-36 (128)
46 TIGR01005 eps_transp_fam exopo 66.8 73 0.0016 26.9 9.4 16 54-69 375-390 (754)
47 PF13864 Enkurin: Calmodulin-b 66.8 6.6 0.00014 25.3 2.6 37 15-51 53-94 (98)
48 COG5420 Uncharacterized conser 66.2 12 0.00027 23.7 3.7 32 46-77 4-35 (71)
49 cd07666 BAR_SNX7 The Bin/Amphi 65.5 54 0.0012 24.9 7.7 17 34-50 180-196 (243)
50 PF05082 Rop-like: Rop-like; 64.6 20 0.00043 22.5 4.3 32 49-81 3-34 (66)
51 PRK11637 AmiB activator; Provi 64.2 70 0.0015 25.3 10.4 13 60-72 108-120 (428)
52 PRK14145 heat shock protein Gr 64.0 20 0.00044 26.5 5.0 45 30-77 37-81 (196)
53 PRK03918 chromosome segregatio 63.9 75 0.0016 26.9 8.9 36 23-58 161-196 (880)
54 PF03938 OmpH: Outer membrane 63.8 41 0.00089 22.4 7.9 41 6-46 29-69 (158)
55 PF12001 DUF3496: Domain of un 63.6 22 0.00047 24.3 4.7 37 34-79 3-39 (111)
56 PRK05431 seryl-tRNA synthetase 63.2 78 0.0017 25.4 8.9 45 34-78 45-96 (425)
57 PF13093 FTA4: Kinetochore com 63.2 37 0.00081 25.1 6.3 56 16-75 138-204 (213)
58 PLN02320 seryl-tRNA synthetase 62.5 86 0.0019 26.3 8.9 58 17-78 97-160 (502)
59 KOG0976|consensus 62.5 64 0.0014 29.6 8.5 64 15-78 370-447 (1265)
60 cd00632 Prefoldin_beta Prefold 62.4 39 0.00084 21.7 6.3 43 27-72 59-101 (105)
61 PF10359 Fmp27_WPPW: RNA pol I 62.4 46 0.001 27.1 7.2 29 50-78 202-230 (475)
62 PRK06798 fliD flagellar cappin 62.3 39 0.00085 27.4 6.8 46 32-77 380-426 (440)
63 PRK09631 DNA topoisomerase IV 61.8 45 0.00097 28.8 7.3 49 24-78 397-445 (635)
64 PF07889 DUF1664: Protein of u 61.6 51 0.0011 22.8 8.8 18 33-50 63-80 (126)
65 PF10549 ORF11CD3: ORF11CD3 do 61.1 34 0.00074 20.7 4.8 36 13-48 16-51 (57)
66 TIGR01837 PHA_granule_1 poly(h 61.0 19 0.00042 24.1 4.1 17 57-73 98-114 (118)
67 PF01920 Prefoldin_2: Prefoldi 59.6 40 0.00086 20.9 6.4 37 30-76 61-97 (106)
68 PF10146 zf-C4H2: Zinc finger- 59.2 74 0.0016 23.9 7.8 32 33-64 48-79 (230)
69 cd01106 HTH_TipAL-Mta Helix-Tu 59.0 44 0.00095 21.2 6.1 45 7-51 55-100 (103)
70 PF07352 Phage_Mu_Gam: Bacteri 58.7 49 0.0011 22.6 6.0 42 14-55 15-56 (149)
71 PRK14143 heat shock protein Gr 58.7 32 0.00068 26.1 5.3 41 35-78 64-104 (238)
72 PRK07737 fliD flagellar cappin 58.4 1E+02 0.0022 25.4 8.6 42 35-76 445-487 (501)
73 PF12732 YtxH: YtxH-like prote 58.0 40 0.00086 20.4 7.6 37 6-42 19-55 (74)
74 PF11690 DUF3287: Protein of u 57.8 58 0.0013 22.2 7.4 54 22-76 40-94 (109)
75 PF07851 TMPIT: TMPIT-like pro 57.7 97 0.0021 24.7 9.0 45 16-60 14-58 (330)
76 PRK14150 heat shock protein Gr 57.1 44 0.00095 24.4 5.7 20 59-78 56-75 (193)
77 PF09798 LCD1: DNA damage chec 56.6 46 0.001 28.9 6.5 42 12-53 7-48 (654)
78 PF12792 CSS-motif: CSS motif 56.6 57 0.0012 21.8 6.6 50 12-61 1-59 (208)
79 TIGR02492 flgK_ends flagellar 55.7 79 0.0017 24.2 7.2 16 58-73 194-209 (322)
80 PRK14160 heat shock protein Gr 55.6 69 0.0015 23.9 6.6 41 35-78 58-98 (211)
81 PF04912 Dynamitin: Dynamitin 55.5 58 0.0013 25.6 6.5 46 27-72 339-385 (388)
82 COG2348 Peptidoglycan interpep 55.0 44 0.00096 27.4 5.9 67 12-78 221-296 (418)
83 PRK13922 rod shape-determining 54.8 53 0.0012 24.2 6.0 34 34-67 72-105 (276)
84 smart00502 BBC B-Box C-termina 54.7 49 0.0011 20.5 9.9 34 11-44 37-70 (127)
85 PRK07521 flgK flagellar hook-a 54.5 66 0.0014 26.1 6.9 58 17-74 133-204 (483)
86 PRK09458 pspB phage shock prot 53.9 28 0.00061 22.4 3.8 27 46-72 40-66 (75)
87 PF10393 Matrilin_ccoil: Trime 52.9 45 0.00098 19.5 4.3 25 28-52 20-44 (47)
88 TIGR02047 CadR-PbrR Cd(II)/Pb( 52.9 60 0.0013 21.6 5.5 31 31-61 79-109 (127)
89 PRK13729 conjugal transfer pil 52.6 87 0.0019 26.3 7.3 7 66-72 108-114 (475)
90 PLN03085 nucleobase:cation sym 52.4 63 0.0014 24.7 6.0 40 12-51 145-199 (221)
91 PF02388 FemAB: FemAB family; 52.4 94 0.002 24.6 7.3 24 37-60 272-295 (406)
92 PRK04654 sec-independent trans 51.9 1E+02 0.0022 23.4 8.5 27 15-41 29-55 (214)
93 TIGR02051 MerR Hg(II)-responsi 51.9 52 0.0011 21.8 5.0 54 5-60 52-105 (124)
94 cd07628 BAR_Atg24p The Bin/Amp 51.8 72 0.0016 22.6 6.1 41 35-78 8-48 (185)
95 KOG0687|consensus 51.8 58 0.0013 26.7 6.1 45 34-78 78-131 (393)
96 PF04350 PilO: Pilus assembly 51.6 16 0.00035 23.8 2.5 12 63-74 31-42 (144)
97 PF03954 Lectin_N: Hepatic lec 51.6 48 0.001 23.6 5.0 50 12-75 58-107 (138)
98 cd04776 HTH_GnyR Helix-Turn-He 51.6 68 0.0015 21.1 5.9 34 28-61 77-110 (118)
99 PF06810 Phage_GP20: Phage min 51.2 83 0.0018 22.1 6.6 38 40-77 43-84 (155)
100 KOG1760|consensus 50.9 47 0.001 23.5 4.8 35 19-57 22-56 (131)
101 PF14335 DUF4391: Domain of un 50.7 94 0.002 22.5 7.4 58 15-73 143-216 (221)
102 COG1579 Zn-ribbon protein, pos 50.2 1.1E+02 0.0024 23.3 9.5 60 17-76 14-73 (239)
103 PRK11637 AmiB activator; Provi 49.7 1.3E+02 0.0028 23.8 10.4 23 53-75 94-116 (428)
104 COG3206 GumC Uncharacterized p 49.7 1.3E+02 0.0028 23.9 9.0 35 35-69 353-387 (458)
105 PF12325 TMF_TATA_bd: TATA ele 49.3 82 0.0018 21.5 10.3 69 5-73 8-86 (120)
106 PF06456 Arfaptin: Arfaptin-li 49.2 70 0.0015 23.8 5.8 37 41-77 33-69 (229)
107 PF05873 Mt_ATP-synt_D: ATP sy 49.1 45 0.00097 23.6 4.6 35 19-53 91-125 (161)
108 PRK07191 flgK flagellar hook-a 48.8 77 0.0017 25.5 6.4 57 17-73 138-208 (456)
109 PF14257 DUF4349: Domain of un 48.1 74 0.0016 23.3 5.8 29 33-61 134-162 (262)
110 PRK09514 zntR zinc-responsive 48.0 73 0.0016 21.6 5.4 58 5-62 54-112 (140)
111 PF13935 Ead_Ea22: Ead/Ea22-li 47.9 87 0.0019 21.3 5.9 44 32-76 68-111 (139)
112 PRK11020 hypothetical protein; 47.9 93 0.002 21.6 6.7 42 14-55 3-55 (118)
113 PF05529 Bap31: B-cell recepto 47.7 96 0.0021 21.8 7.7 33 35-70 158-190 (192)
114 cd07624 BAR_SNX7_30 The Bin/Am 47.5 97 0.0021 22.1 6.2 42 34-78 17-58 (200)
115 PRK08871 flgK flagellar hook-a 46.6 1.2E+02 0.0025 25.9 7.4 18 57-74 192-209 (626)
116 COG5374 Uncharacterized conser 46.6 1.2E+02 0.0027 22.7 7.2 43 33-78 138-180 (192)
117 PF10779 XhlA: Haemolysin XhlA 46.3 66 0.0014 19.5 7.0 25 30-54 26-50 (71)
118 PRK05291 trmE tRNA modificatio 46.3 28 0.00061 27.9 3.5 43 2-44 128-170 (449)
119 PF10168 Nup88: Nuclear pore c 46.1 2E+02 0.0044 25.0 9.5 25 49-73 580-604 (717)
120 smart00721 BAR BAR domain. 46.1 1E+02 0.0022 21.5 8.8 54 25-78 125-186 (239)
121 PF08614 ATG16: Autophagy prot 46.0 60 0.0013 23.0 4.9 63 14-76 79-144 (194)
122 PF05957 DUF883: Bacterial pro 45.4 75 0.0016 19.9 9.9 64 13-76 2-70 (94)
123 cd01108 HTH_CueR Helix-Turn-He 45.4 88 0.0019 20.7 5.4 31 31-61 79-109 (127)
124 PF03285 Paralemmin: Paralemmi 45.3 20 0.00043 28.0 2.5 21 33-53 12-32 (278)
125 TIGR03545 conserved hypothetic 45.2 1.2E+02 0.0025 25.7 7.1 54 13-66 216-273 (555)
126 PRK05724 acetyl-CoA carboxylas 45.2 61 0.0013 25.6 5.2 35 16-50 13-47 (319)
127 PF07798 DUF1640: Protein of u 44.9 1.1E+02 0.0023 21.5 9.3 33 9-41 80-112 (177)
128 PRK14162 heat shock protein Gr 44.8 80 0.0017 23.2 5.5 25 53-77 51-75 (194)
129 PRK14139 heat shock protein Gr 44.6 70 0.0015 23.4 5.1 21 56-76 47-67 (185)
130 PRK12758 DNA topoisomerase IV 43.9 1.1E+02 0.0024 27.6 7.0 49 24-78 418-466 (869)
131 PRK14156 heat shock protein Gr 43.8 91 0.002 22.6 5.6 29 49-77 35-63 (177)
132 PF11285 DUF3086: Protein of u 43.7 40 0.00087 26.5 3.9 42 12-53 28-73 (283)
133 PF04508 Pox_A_type_inc: Viral 43.6 31 0.00067 17.6 2.2 16 58-73 4-19 (23)
134 cd00056 ENDO3c endonuclease II 43.6 68 0.0015 21.3 4.6 69 5-80 4-73 (158)
135 PRK01156 chromosome segregatio 43.5 2.1E+02 0.0046 24.6 9.2 24 43-66 714-737 (895)
136 TIGR00513 accA acetyl-CoA carb 43.1 71 0.0015 25.2 5.3 40 10-50 8-47 (316)
137 PF10234 Cluap1: Clusterin-ass 43.0 1.6E+02 0.0034 22.9 10.0 66 14-79 163-242 (267)
138 PF11855 DUF3375: Protein of u 42.9 62 0.0013 26.4 5.1 34 18-51 124-157 (478)
139 PRK14158 heat shock protein Gr 42.8 83 0.0018 23.1 5.3 39 36-77 38-76 (194)
140 PF15463 ECM11: Extracellular 42.8 1.1E+02 0.0023 20.9 6.2 47 30-76 86-133 (139)
141 KOG1853|consensus 42.5 1.8E+02 0.0038 23.3 7.6 60 13-73 95-158 (333)
142 PRK04406 hypothetical protein; 42.4 78 0.0017 19.8 4.5 13 36-48 9-21 (75)
143 CHL00198 accA acetyl-CoA carbo 42.3 64 0.0014 25.6 4.9 48 9-57 10-58 (322)
144 PF09006 Surfac_D-trimer: Lung 41.7 74 0.0016 18.7 4.2 24 35-58 3-26 (46)
145 smart00338 BRLZ basic region l 41.6 73 0.0016 18.6 5.8 36 39-77 27-62 (65)
146 PRK09343 prefoldin subunit bet 41.6 1.1E+02 0.0023 20.5 5.9 23 55-77 85-107 (121)
147 cd07605 I-BAR_IMD Inverse (I)- 41.4 1.5E+02 0.0032 22.1 8.7 33 47-79 125-176 (223)
148 PRK06664 fliD flagellar hook-a 41.3 2.4E+02 0.0051 24.4 8.6 58 20-77 579-647 (661)
149 KOG3251|consensus 41.3 1.1E+02 0.0024 23.1 5.8 43 27-69 29-71 (213)
150 PF09278 MerR-DNA-bind: MerR, 41.3 69 0.0015 18.3 5.4 29 31-59 36-64 (65)
151 PRK04098 sec-independent trans 41.0 1.4E+02 0.003 21.6 7.5 29 13-41 27-55 (158)
152 PF07544 Med9: RNA polymerase 40.8 88 0.0019 19.6 4.6 15 35-49 56-70 (83)
153 cd01107 HTH_BmrR Helix-Turn-He 40.8 98 0.0021 19.9 5.3 30 30-59 74-103 (108)
154 KOG4117|consensus 40.6 96 0.0021 19.8 5.5 37 16-52 15-55 (73)
155 PRK12765 flagellar capping pro 40.5 1.3E+02 0.0027 25.6 6.6 58 20-77 511-579 (595)
156 COG5481 Uncharacterized conser 40.5 93 0.002 19.5 6.3 53 11-70 2-60 (67)
157 PF02403 Seryl_tRNA_N: Seryl-t 40.3 96 0.0021 19.6 8.3 28 50-77 69-96 (108)
158 PHA02666 hypothetical protein; 40.2 58 0.0013 25.3 4.3 36 35-80 205-240 (287)
159 PF11945 WASH_WAHD: WAHD domai 40.0 1.7E+02 0.0037 22.8 6.9 17 19-35 35-51 (297)
160 PF00015 MCPsignal: Methyl-acc 40.0 1.2E+02 0.0025 20.5 6.7 25 50-74 186-210 (213)
161 PF05103 DivIVA: DivIVA protei 39.9 75 0.0016 20.5 4.3 24 27-50 21-44 (131)
162 PRK08032 fliD flagellar cappin 39.9 2E+02 0.0044 23.3 8.5 43 34-76 409-452 (462)
163 PF10018 Med4: Vitamin-D-recep 39.8 1.4E+02 0.0029 21.2 9.0 40 13-52 2-43 (188)
164 KOG4643|consensus 39.8 2.2E+02 0.0047 26.7 8.2 62 14-83 189-250 (1195)
165 KOG4797|consensus 39.7 1.3E+02 0.0028 21.0 7.3 50 12-61 48-97 (123)
166 PRK07191 flgK flagellar hook-a 39.7 1.6E+02 0.0035 23.7 7.0 15 47-61 165-179 (456)
167 KOG4603|consensus 38.8 1.7E+02 0.0036 22.0 6.5 58 18-76 84-144 (201)
168 cd07644 I-BAR_IMD_BAIAP2L2 Inv 38.6 1.8E+02 0.0038 22.2 9.7 31 47-77 125-166 (215)
169 PF12329 TMF_DNA_bd: TATA elem 38.6 97 0.0021 19.2 5.8 36 37-75 32-67 (74)
170 cd01878 HflX HflX subfamily. 38.5 1E+02 0.0023 20.8 5.0 34 28-61 3-36 (204)
171 PF04375 HemX: HemX; InterPro 38.5 1.9E+02 0.0042 22.7 9.1 10 62-71 123-132 (372)
172 PF10372 YojJ: Bacterial membr 38.5 55 0.0012 20.6 3.3 15 33-47 12-26 (70)
173 PF12277 DUF3618: Protein of u 38.4 78 0.0017 18.0 3.9 23 12-34 6-28 (49)
174 PF10168 Nup88: Nuclear pore c 38.3 2.7E+02 0.0059 24.3 9.5 55 9-63 547-601 (717)
175 COG1729 Uncharacterized protei 38.2 1.1E+02 0.0024 23.6 5.6 21 32-52 57-77 (262)
176 KOG0859|consensus 38.2 52 0.0011 25.0 3.7 47 19-65 124-170 (217)
177 TIGR03545 conserved hypothetic 38.2 2.5E+02 0.0053 23.8 8.5 27 47-73 218-244 (555)
178 KOG0976|consensus 38.1 2.5E+02 0.0054 26.1 8.3 59 16-78 123-185 (1265)
179 PF06013 WXG100: Proteins of 1 38.0 78 0.0017 18.0 10.0 32 48-79 51-82 (86)
180 COG5509 Uncharacterized small 37.8 86 0.0019 19.6 4.0 33 32-64 26-58 (65)
181 TIGR03185 DNA_S_dndD DNA sulfu 37.7 2.4E+02 0.0053 23.6 8.4 35 13-49 186-220 (650)
182 COG2825 HlpA Outer membrane pr 37.7 1.5E+02 0.0032 21.1 9.1 47 12-59 43-89 (170)
183 PF11744 ALMT: Aluminium activ 37.6 94 0.002 25.2 5.3 55 10-64 285-342 (406)
184 PHA02592 52 DNA topisomerase I 37.6 99 0.0021 25.4 5.5 43 25-75 396-438 (439)
185 PF05565 Sipho_Gp157: Siphovir 37.6 1.4E+02 0.0031 20.8 8.9 42 32-76 48-89 (162)
186 TIGR01069 mutS2 MutS2 family p 37.6 2.8E+02 0.0061 24.2 9.4 45 19-63 503-547 (771)
187 PF13747 DUF4164: Domain of un 37.5 1.1E+02 0.0024 19.6 6.3 56 16-75 4-59 (89)
188 PRK10227 DNA-binding transcrip 37.5 99 0.0021 21.0 4.7 32 30-61 78-109 (135)
189 PF09755 DUF2046: Uncharacteri 37.4 2.1E+02 0.0046 22.8 8.7 37 40-76 137-177 (310)
190 PF14932 HAUS-augmin3: HAUS au 37.4 84 0.0018 23.5 4.7 19 51-69 92-110 (256)
191 PRK12704 phosphodiesterase; Pr 37.3 2.4E+02 0.0053 23.5 10.4 18 37-54 92-109 (520)
192 PF04645 DUF603: Protein of un 37.0 1.6E+02 0.0035 21.9 6.0 64 14-77 80-153 (181)
193 PF11629 Mst1_SARAH: C termina 36.9 93 0.002 18.5 4.7 27 22-48 10-36 (49)
194 PRK06975 bifunctional uroporph 36.9 2.6E+02 0.0056 23.9 8.0 12 61-72 413-424 (656)
195 TIGR02043 ZntR Zn(II)-responsi 36.7 1.3E+02 0.0028 20.1 5.3 31 31-61 81-111 (131)
196 PF01166 TSC22: TSC-22/dip/bun 36.7 77 0.0017 19.6 3.6 30 29-58 12-41 (59)
197 PRK06032 fliH flagellar assemb 36.6 1.6E+02 0.0034 21.1 6.5 41 10-50 52-92 (199)
198 PHA03386 P10 fibrous body prot 36.6 1.2E+02 0.0027 20.2 4.9 37 36-72 24-60 (94)
199 PF12761 End3: Actin cytoskele 36.5 1E+02 0.0022 23.0 4.9 39 36-74 101-140 (195)
200 PRK13729 conjugal transfer pil 36.5 1.9E+02 0.0041 24.4 7.0 36 17-52 80-118 (475)
201 PF02994 Transposase_22: L1 tr 36.4 1.1E+02 0.0024 24.2 5.4 36 34-69 147-186 (370)
202 COG1293 Predicted RNA-binding 36.3 1.7E+02 0.0037 24.6 6.8 25 51-75 289-313 (564)
203 cd00890 Prefoldin Prefoldin is 36.3 1.2E+02 0.0025 19.4 5.9 29 34-62 9-37 (129)
204 TIGR02976 phageshock_pspB phag 36.2 70 0.0015 20.2 3.5 20 34-53 45-64 (75)
205 PF10376 Mei5: Double-strand r 36.1 1.8E+02 0.004 21.7 7.8 58 14-75 132-194 (221)
206 cd01282 HTH_MerR-like_sg3 Heli 36.1 1.2E+02 0.0026 19.6 5.2 28 31-58 81-108 (112)
207 PRK15374 pathogenicity island 36.0 2.5E+02 0.0055 24.4 7.7 50 15-71 122-171 (593)
208 TIGR01010 BexC_CtrB_KpsE polys 36.0 2E+02 0.0043 22.0 7.7 31 48-78 170-200 (362)
209 PRK14147 heat shock protein Gr 35.8 83 0.0018 22.5 4.3 23 54-76 31-53 (172)
210 KOG0396|consensus 35.8 2.5E+02 0.0054 23.2 9.4 72 5-80 48-127 (389)
211 cd03569 VHS_Hrs_Vps27p VHS dom 35.7 1.3E+02 0.0028 20.6 5.1 65 3-78 65-129 (142)
212 cd00632 Prefoldin_beta Prefold 35.4 1.2E+02 0.0026 19.4 5.9 27 50-76 72-98 (105)
213 cd04784 HTH_CadR-PbrR Helix-Tu 35.2 1.3E+02 0.0028 19.7 5.6 31 31-61 79-109 (127)
214 PF06667 PspB: Phage shock pro 35.2 81 0.0018 20.0 3.7 23 49-71 43-65 (75)
215 PF10267 Tmemb_cc2: Predicted 35.1 2.5E+02 0.0054 22.9 9.2 66 14-79 206-275 (395)
216 PRK14149 heat shock protein Gr 34.9 85 0.0018 23.1 4.3 29 49-77 44-72 (191)
217 PF12017 Tnp_P_element: Transp 34.9 1.8E+02 0.0039 21.9 6.1 22 36-57 16-37 (236)
218 PRK14144 heat shock protein Gr 34.7 1.1E+02 0.0024 22.6 4.9 24 53-76 57-80 (199)
219 PRK08453 fliD flagellar cappin 34.4 2.9E+02 0.0062 24.2 7.9 61 18-78 597-669 (673)
220 PF14966 DNA_repr_REX1B: DNA r 34.2 1.3E+02 0.0029 19.5 7.2 40 13-52 40-84 (97)
221 PF06810 Phage_GP20: Phage min 34.1 1.6E+02 0.0036 20.5 6.3 46 33-78 29-74 (155)
222 PF14282 FlxA: FlxA-like prote 34.0 1.4E+02 0.003 19.6 5.5 19 34-52 22-40 (106)
223 PF14932 HAUS-augmin3: HAUS au 33.9 1E+02 0.0022 23.0 4.7 29 38-66 68-96 (256)
224 PRK02224 chromosome segregatio 33.9 3E+02 0.0065 23.5 10.2 11 40-50 683-693 (880)
225 cd03561 VHS VHS domain family; 33.9 1.2E+02 0.0026 20.2 4.6 32 2-34 60-91 (133)
226 PF09726 Macoilin: Transmembra 33.8 3.2E+02 0.007 23.8 8.4 56 16-71 548-610 (697)
227 PRK14161 heat shock protein Gr 33.5 1.4E+02 0.0029 21.6 5.1 25 53-77 31-55 (178)
228 KOG4674|consensus 33.5 1.8E+02 0.0039 28.5 7.0 45 15-59 1330-1374(1822)
229 PF09744 Jnk-SapK_ap_N: JNK_SA 33.3 1.8E+02 0.0039 20.7 5.9 28 29-56 80-107 (158)
230 COG5002 VicK Signal transducti 33.2 1.2E+02 0.0026 25.4 5.2 35 16-50 86-120 (459)
231 PF10835 DUF2573: Protein of u 33.1 1.1E+02 0.0024 20.0 4.1 42 17-58 39-80 (82)
232 PF11855 DUF3375: Protein of u 33.1 73 0.0016 26.0 4.0 38 14-51 127-164 (478)
233 PF08653 DASH_Dam1: DASH compl 33.0 1.2E+02 0.0025 18.5 4.8 30 31-60 5-34 (58)
234 cd04770 HTH_HMRTR Helix-Turn-H 32.9 1.4E+02 0.003 19.3 5.4 30 31-60 79-108 (123)
235 PRK09413 IS2 repressor TnpA; R 32.8 1.4E+02 0.0031 19.5 6.9 40 12-51 33-91 (121)
236 COG0188 GyrA Type IIA topoisom 32.8 1.7E+02 0.0037 26.1 6.4 52 24-78 427-478 (804)
237 KOG0612|consensus 32.6 4.5E+02 0.0097 25.1 9.3 72 6-77 568-662 (1317)
238 PF08855 DUF1825: Domain of un 32.5 1.6E+02 0.0035 20.0 7.2 64 11-78 7-82 (108)
239 COG0497 RecN ATPase involved i 32.5 2.6E+02 0.0056 24.0 7.2 51 29-79 323-373 (557)
240 PRK14140 heat shock protein Gr 32.4 1.4E+02 0.003 21.9 5.1 27 51-77 47-73 (191)
241 TIGR00293 prefoldin, archaeal 32.3 1.4E+02 0.0031 19.3 6.2 42 28-72 83-124 (126)
242 PF06160 EzrA: Septation ring 32.2 3E+02 0.0064 22.9 9.5 67 13-79 256-337 (560)
243 PF07439 DUF1515: Protein of u 32.1 1.7E+02 0.0037 20.2 6.2 38 17-54 19-56 (112)
244 PRK15365 type III secretion sy 31.6 1.1E+02 0.0024 20.9 4.1 32 33-64 11-43 (107)
245 PLN02678 seryl-tRNA synthetase 31.6 2.9E+02 0.0064 22.7 8.9 45 34-78 50-101 (448)
246 PF14988 DUF4515: Domain of un 31.6 2.1E+02 0.0045 21.0 9.0 41 15-55 38-78 (206)
247 PRK02793 phi X174 lysis protei 31.5 1.1E+02 0.0025 18.8 3.9 9 55-63 43-51 (72)
248 PTZ00464 SNF-7-like protein; P 31.4 2.2E+02 0.0047 21.1 6.9 16 63-78 69-84 (211)
249 PF14915 CCDC144C: CCDC144C pr 31.3 2.7E+02 0.0059 22.2 9.0 61 12-76 182-242 (305)
250 PRK08032 fliD flagellar cappin 31.3 2.9E+02 0.0062 22.4 8.1 52 17-71 410-461 (462)
251 PF11945 WASH_WAHD: WAHD domai 31.2 1.8E+02 0.004 22.7 5.8 50 17-69 22-71 (297)
252 COG3416 Uncharacterized protei 31.1 71 0.0015 24.5 3.4 21 32-52 56-76 (233)
253 PF02996 Prefoldin: Prefoldin 31.0 1.4E+02 0.0031 18.9 6.2 42 29-73 75-116 (120)
254 PF04108 APG17: Autophagy prot 30.9 2.8E+02 0.006 22.2 8.8 66 11-76 235-304 (412)
255 cd04790 HTH_Cfa-like_unk Helix 30.8 1.9E+02 0.0041 20.3 6.5 27 33-59 76-102 (172)
256 PF10392 COG5: Golgi transport 30.8 1.7E+02 0.0036 19.6 8.4 48 14-61 73-120 (132)
257 PRK14155 heat shock protein Gr 30.7 1.2E+02 0.0026 22.5 4.5 15 62-76 34-48 (208)
258 PRK12714 flgK flagellar hook-a 30.4 3.4E+02 0.0074 23.0 8.7 59 16-74 137-207 (624)
259 PF04111 APG6: Autophagy prote 30.4 2.6E+02 0.0056 21.7 8.1 23 33-55 73-95 (314)
260 TIGR02978 phageshock_pspC phag 30.4 99 0.0022 21.1 3.8 26 28-53 81-106 (121)
261 PRK14127 cell division protein 30.1 1.8E+02 0.0038 19.7 6.4 16 36-51 49-64 (109)
262 PF08372 PRT_C: Plant phosphor 30.1 64 0.0014 23.0 2.9 46 33-78 38-89 (156)
263 TIGR00606 rad50 rad50. This fa 29.9 4.4E+02 0.0096 24.2 9.0 53 24-76 186-238 (1311)
264 KOG2398|consensus 29.7 3.7E+02 0.0079 23.2 8.4 38 16-53 149-186 (611)
265 PF09177 Syntaxin-6_N: Syntaxi 29.7 1.5E+02 0.0032 18.7 6.5 32 22-53 37-68 (97)
266 PF08537 NBP1: Fungal Nap bind 29.7 1.7E+02 0.0036 23.5 5.4 36 35-73 186-221 (323)
267 PF12548 DUF3740: Sulfatase pr 29.6 1.3E+02 0.0029 21.2 4.4 31 31-61 106-136 (145)
268 PRK00409 recombination and DNA 29.5 2.8E+02 0.006 24.3 7.1 19 35-53 524-542 (782)
269 KOG0963|consensus 29.5 3.2E+02 0.0069 23.9 7.4 46 32-77 122-167 (629)
270 KOG0930|consensus 29.4 1.4E+02 0.003 24.3 4.9 48 15-63 16-63 (395)
271 PRK01919 tatB sec-independent 29.4 2.3E+02 0.005 20.7 8.6 28 14-41 28-55 (169)
272 PF02994 Transposase_22: L1 tr 29.4 2.3E+02 0.0049 22.5 6.1 13 37-49 143-155 (370)
273 TIGR03752 conj_TIGR03752 integ 29.3 2.8E+02 0.006 23.4 6.8 30 21-50 99-128 (472)
274 KOG1853|consensus 29.2 3E+02 0.0065 22.0 9.5 70 4-73 7-77 (333)
275 PRK14163 heat shock protein Gr 28.6 1.2E+02 0.0027 22.7 4.3 14 67-80 85-98 (214)
276 PF05911 DUF869: Plant protein 28.6 4.2E+02 0.0092 23.5 8.7 43 31-73 645-691 (769)
277 PRK10697 DNA-binding transcrip 28.5 1.1E+02 0.0023 21.0 3.7 26 28-53 78-103 (118)
278 KOG4005|consensus 28.5 2.2E+02 0.0047 22.5 5.8 43 33-75 92-138 (292)
279 PF08278 DnaG_DnaB_bind: DNA p 28.5 1.7E+02 0.0036 18.8 4.6 30 13-48 82-111 (127)
280 PF06730 FAM92: FAM92 protein; 28.5 1E+02 0.0022 23.4 3.8 28 50-77 34-61 (219)
281 PF06818 Fez1: Fez1; InterPro 28.4 2.6E+02 0.0055 21.0 7.9 37 42-78 67-103 (202)
282 PF05384 DegS: Sensor protein 28.3 2.2E+02 0.0048 20.3 8.1 61 17-77 17-77 (159)
283 PF10915 DUF2709: Protein of u 28.3 35 0.00076 26.1 1.4 19 65-83 166-184 (238)
284 PLN03230 acetyl-coenzyme A car 28.3 1.5E+02 0.0032 24.7 5.0 35 16-50 83-117 (431)
285 cd01109 HTH_YyaN Helix-Turn-He 28.2 1.7E+02 0.0036 18.8 5.6 27 30-56 78-104 (113)
286 KOG0288|consensus 28.2 3.7E+02 0.0079 22.7 8.2 51 27-77 13-63 (459)
287 PF06419 COG6: Conserved oligo 28.1 3.7E+02 0.008 22.7 9.4 66 16-81 41-110 (618)
288 cd04786 HTH_MerR-like_sg7 Heli 28.0 1.8E+02 0.004 19.5 4.8 30 30-59 77-106 (131)
289 PF14817 HAUS5: HAUS augmin-li 28.0 4E+02 0.0087 23.1 8.5 54 15-68 81-134 (632)
290 PRK14151 heat shock protein Gr 28.0 1.3E+02 0.0029 21.6 4.3 14 62-75 41-54 (176)
291 COG1579 Zn-ribbon protein, pos 27.9 2.8E+02 0.006 21.2 6.7 49 28-79 14-62 (239)
292 PRK14141 heat shock protein Gr 27.8 1.3E+02 0.0028 22.4 4.3 15 61-75 51-65 (209)
293 KOG2629|consensus 27.8 3.2E+02 0.0068 21.8 8.7 53 20-75 136-188 (300)
294 PRK15374 pathogenicity island 27.8 4.1E+02 0.0089 23.1 7.7 45 34-78 127-171 (593)
295 KOG4098|consensus 27.7 1.1E+02 0.0025 21.8 3.8 28 51-78 89-116 (140)
296 PRK03100 sec-independent trans 27.5 2.2E+02 0.0048 20.0 6.7 36 16-51 31-70 (136)
297 PF11068 YlqD: YlqD protein; 27.4 2.1E+02 0.0046 19.7 6.3 21 27-47 30-50 (131)
298 PRK05683 flgK flagellar hook-a 27.4 4.1E+02 0.0089 23.0 8.7 58 17-74 138-209 (676)
299 PF06148 COG2: COG (conserved 27.3 1.5E+02 0.0032 19.6 4.2 43 17-59 73-115 (133)
300 cd04785 HTH_CadR-PbrR-like Hel 27.3 1.9E+02 0.0041 19.0 5.0 29 32-60 80-108 (126)
301 COG4694 Uncharacterized protei 27.3 3E+02 0.0066 24.4 6.9 51 14-65 440-493 (758)
302 cd04789 HTH_Cfa Helix-Turn-Hel 27.2 1.7E+02 0.0037 18.6 4.5 28 33-60 73-100 (102)
303 COG4191 Signal transduction hi 27.2 3.8E+02 0.0082 23.4 7.4 50 13-62 324-373 (603)
304 PLN02678 seryl-tRNA synthetase 27.2 3.6E+02 0.0077 22.2 7.4 21 53-73 83-103 (448)
305 PF12128 DUF3584: Protein of u 27.1 4.7E+02 0.01 23.9 8.3 36 20-55 837-872 (1201)
306 PLN02372 violaxanthin de-epoxi 27.1 3.8E+02 0.0083 22.6 7.5 27 22-48 363-389 (455)
307 PF00957 Synaptobrevin: Synapt 27.0 1.6E+02 0.0034 18.1 7.7 39 12-51 6-44 (89)
308 PF05531 NPV_P10: Nucleopolyhe 26.9 1.8E+02 0.0038 18.6 5.2 21 51-71 45-65 (75)
309 PF01442 Apolipoprotein: Apoli 26.9 1.9E+02 0.0041 18.9 10.8 36 12-47 41-76 (202)
310 COG1730 GIM5 Predicted prefold 26.9 2.3E+02 0.005 20.0 6.4 22 29-50 92-113 (145)
311 PRK14159 heat shock protein Gr 26.6 1.6E+02 0.0035 21.3 4.5 27 51-77 33-59 (176)
312 PF11593 Med3: Mediator comple 26.5 93 0.002 25.5 3.6 28 54-81 64-93 (379)
313 PF09766 FimP: Fms-interacting 26.3 3.2E+02 0.007 21.5 7.6 10 65-74 132-141 (355)
314 PF12128 DUF3584: Protein of u 26.2 4.4E+02 0.0096 24.0 8.0 55 17-71 827-881 (1201)
315 PRK14154 heat shock protein Gr 26.2 1.4E+02 0.0031 22.2 4.3 15 61-75 72-86 (208)
316 PF04201 TPD52: Tumour protein 26.2 1.8E+02 0.0039 21.1 4.7 33 33-65 31-67 (162)
317 KOG0977|consensus 26.1 4.2E+02 0.0092 22.7 8.2 24 36-59 167-190 (546)
318 PF08700 Vps51: Vps51/Vps67; 25.9 1.6E+02 0.0034 17.7 8.6 62 15-76 21-86 (87)
319 COG3937 Uncharacterized conser 25.8 1.6E+02 0.0036 20.1 4.2 9 62-70 90-98 (108)
320 PRK14153 heat shock protein Gr 25.4 1.7E+02 0.0037 21.5 4.6 25 53-77 45-69 (194)
321 KOG4302|consensus 25.4 2.9E+02 0.0063 24.2 6.5 69 10-78 40-119 (660)
322 PF03961 DUF342: Protein of un 25.3 3.5E+02 0.0077 21.6 8.5 57 22-78 332-398 (451)
323 PF08232 Striatin: Striatin fa 25.2 2.3E+02 0.005 19.3 8.3 51 26-79 20-70 (134)
324 PF12252 SidE: Dot/Icm substra 25.1 2E+02 0.0043 27.3 5.6 40 25-64 1108-1147(1439)
325 TIGR03185 DNA_S_dndD DNA sulfu 25.1 4.1E+02 0.0089 22.3 9.7 10 36-45 403-412 (650)
326 PRK01203 prefoldin subunit alp 25.1 1.9E+02 0.0042 20.1 4.5 28 34-61 83-110 (130)
327 PRK11820 hypothetical protein; 25.1 1.1E+02 0.0024 23.6 3.6 45 33-77 212-256 (288)
328 PRK04778 septation ring format 25.0 4E+02 0.0087 22.1 8.3 29 50-78 312-340 (569)
329 PHA03332 membrane glycoprotein 24.9 3.3E+02 0.0072 25.8 6.9 47 17-63 916-963 (1328)
330 COG5187 RPN7 26S proteasome re 24.9 1E+02 0.0023 25.2 3.5 33 37-69 92-127 (412)
331 cd03003 PDI_a_ERdj5_N PDIa fam 24.8 48 0.0011 20.1 1.4 26 2-29 22-47 (101)
332 PRK13842 conjugal transfer pro 24.7 3.3E+02 0.0072 21.1 7.1 37 40-76 96-133 (267)
333 PF06103 DUF948: Bacterial pro 24.7 1.8E+02 0.0038 17.9 9.2 16 18-33 24-39 (90)
334 PLN03137 ATP-dependent DNA hel 24.5 1.6E+02 0.0034 27.6 5.0 19 36-54 322-340 (1195)
335 PF06305 DUF1049: Protein of u 24.5 78 0.0017 18.3 2.2 16 36-51 46-61 (68)
336 PF01920 Prefoldin_2: Prefoldi 24.1 1.8E+02 0.0039 17.8 6.3 39 16-58 1-39 (106)
337 PF04568 IATP: Mitochondrial A 24.0 2.2E+02 0.0048 18.9 4.5 25 28-52 73-97 (100)
338 KOG3335|consensus 24.0 2E+02 0.0044 21.3 4.7 27 38-67 106-132 (181)
339 PF05701 WEMBL: Weak chloropla 24.0 4.2E+02 0.009 21.9 10.1 65 12-76 196-263 (522)
340 PRK10203 hypothetical protein; 23.9 2E+02 0.0043 19.8 4.4 37 13-49 61-99 (122)
341 PF05557 MAD: Mitotic checkpoi 23.8 26 0.00057 29.7 0.0 46 34-80 284-329 (722)
342 PF13865 FoP_duplication: C-te 23.8 1.9E+02 0.0041 17.8 4.1 16 61-76 54-69 (74)
343 PF05615 THOC7: Tho complex su 23.7 2.2E+02 0.0047 18.9 4.5 26 22-47 79-104 (139)
344 PF10669 Phage_Gp23: Protein g 23.6 2.5E+02 0.0055 19.3 5.6 63 8-70 38-118 (121)
345 KOG3633|consensus 23.6 1.4E+02 0.0031 22.6 3.8 31 43-73 44-74 (219)
346 PF08702 Fib_alpha: Fibrinogen 23.6 2.6E+02 0.0056 19.4 6.1 34 24-57 22-55 (146)
347 PF15450 DUF4631: Domain of un 23.5 4.8E+02 0.01 22.4 7.7 23 52-74 409-431 (531)
348 COG3850 NarQ Signal transducti 23.5 2E+02 0.0043 24.9 5.1 37 8-44 210-246 (574)
349 PF13600 DUF4140: N-terminal d 23.5 2E+02 0.0042 18.0 4.1 19 33-51 72-90 (104)
350 TIGR00634 recN DNA repair prot 23.4 4.2E+02 0.0092 21.8 7.2 43 29-71 327-369 (563)
351 PRK11519 tyrosine kinase; Prov 23.4 4.7E+02 0.01 22.3 7.7 31 48-78 267-297 (719)
352 TIGR01410 tatB twin arginine-t 23.4 2E+02 0.0044 18.1 5.8 28 14-41 27-54 (80)
353 PRK14157 heat shock protein Gr 23.4 1.8E+02 0.004 22.0 4.4 14 62-75 98-111 (227)
354 PRK10803 tol-pal system protei 23.4 3.2E+02 0.007 20.4 8.6 38 35-72 58-99 (263)
355 PRK11032 hypothetical protein; 23.2 2.9E+02 0.0063 19.8 8.2 62 17-78 3-66 (160)
356 cd07660 BAR_Arfaptin The Bin/A 23.2 2.8E+02 0.0061 20.6 5.3 31 42-72 3-33 (201)
357 cd07629 BAR_Atg20p The Bin/Amp 23.2 2.8E+02 0.0061 19.7 5.7 21 35-55 8-29 (187)
358 PF01763 Herpes_UL6: Herpesvir 23.1 4.9E+02 0.011 22.4 7.5 45 30-77 362-406 (557)
359 smart00434 TOP4c DNA Topoisome 23.0 1.5E+02 0.0032 24.2 4.1 34 24-57 408-441 (445)
360 cd00011 BAR_Arfaptin_like The 22.8 3.1E+02 0.0067 20.4 5.5 32 42-73 3-34 (203)
361 PF08340 DUF1732: Domain of un 22.8 58 0.0013 21.4 1.5 22 35-56 12-33 (87)
362 PRK14146 heat shock protein Gr 22.7 1.8E+02 0.004 21.6 4.3 24 54-77 67-90 (215)
363 PRK06665 flgK flagellar hook-a 22.6 4.8E+02 0.01 22.2 9.6 18 57-74 204-221 (627)
364 TIGR00255 conserved hypothetic 22.6 1.3E+02 0.0028 23.3 3.6 44 33-76 215-258 (291)
365 KOG2662|consensus 22.6 4.5E+02 0.0099 21.9 7.6 52 23-76 197-248 (414)
366 PF07083 DUF1351: Protein of u 22.4 3.1E+02 0.0068 19.9 7.7 53 12-64 63-115 (215)
367 PF01923 Cob_adeno_trans: Coba 22.4 2.7E+02 0.0059 19.2 7.6 61 17-77 27-98 (163)
368 PRK08724 fliD flagellar cappin 22.4 5.4E+02 0.012 22.6 8.8 43 34-76 620-663 (673)
369 PF11126 Phage_DsbA: Transcrip 22.2 1.4E+02 0.0031 18.9 3.1 26 40-65 2-27 (69)
370 PRK04863 mukB cell division pr 22.0 3.8E+02 0.0083 25.6 6.9 65 11-78 415-479 (1486)
371 KOG0250|consensus 22.0 6.6E+02 0.014 23.5 8.9 61 10-73 373-433 (1074)
372 PF10191 COG7: Golgi complex c 21.9 5.4E+02 0.012 22.4 8.7 60 16-75 34-97 (766)
373 PF07795 DUF1635: Protein of u 21.8 3.6E+02 0.0078 20.4 7.3 49 17-65 5-53 (214)
374 COG1561 Uncharacterized stress 21.8 2.6E+02 0.0056 22.1 5.1 46 32-77 213-258 (290)
375 PF04912 Dynamitin: Dynamitin 21.8 4E+02 0.0086 20.9 8.2 42 11-52 341-382 (388)
376 PF03087 DUF241: Arabidopsis p 21.7 3.4E+02 0.0073 20.0 5.7 21 32-52 192-212 (231)
377 PF00430 ATP-synt_B: ATP synth 21.7 2E+02 0.0044 18.2 4.0 29 14-42 93-121 (132)
378 PF05308 Mito_fiss_reg: Mitoch 21.6 99 0.0021 23.6 2.8 25 34-58 125-149 (253)
379 KOG0447|consensus 21.6 4.1E+02 0.0088 23.9 6.7 60 13-74 902-962 (980)
380 PRK12714 flgK flagellar hook-a 21.6 2.9E+02 0.0063 23.4 5.7 13 48-60 166-178 (624)
381 PF07765 KIP1: KIP1-like prote 21.4 2.3E+02 0.0051 18.1 5.2 48 31-78 18-66 (74)
382 PF10267 Tmemb_cc2: Predicted 21.4 4.6E+02 0.0099 21.4 8.6 40 12-51 250-289 (395)
383 PRK11546 zraP zinc resistance 21.3 3.1E+02 0.0067 19.4 6.6 45 21-65 62-106 (143)
384 PF11101 DUF2884: Protein of u 21.2 3.4E+02 0.0075 19.9 7.4 39 33-73 169-207 (229)
385 PF05461 ApoL: Apolipoprotein 21.2 1.2E+02 0.0025 23.8 3.1 24 55-78 287-310 (313)
386 PF00996 GDI: GDP dissociation 21.1 46 0.001 27.2 0.9 15 17-31 233-247 (438)
387 cd04769 HTH_MerR2 Helix-Turn-H 21.1 2.4E+02 0.0053 18.2 6.7 27 33-59 81-107 (116)
388 COG1382 GimC Prefoldin, chaper 21.1 2.9E+02 0.0063 19.0 7.0 39 31-72 70-108 (119)
389 TIGR01554 major_cap_HK97 phage 20.9 3.9E+02 0.0086 20.5 8.5 16 17-32 3-18 (378)
390 TIGR01126 pdi_dom protein disu 20.9 48 0.001 19.5 0.8 22 2-25 17-38 (102)
391 COG4467 Regulator of replicati 20.8 3E+02 0.0064 19.0 7.2 51 16-66 4-54 (114)
392 COG2433 Uncharacterized conser 20.8 5.6E+02 0.012 22.6 7.2 12 64-75 452-463 (652)
393 PRK10869 recombination and rep 20.7 4.8E+02 0.01 21.7 6.7 40 33-72 326-365 (553)
394 PRK14160 heat shock protein Gr 20.6 2.6E+02 0.0056 20.9 4.7 10 71-80 110-119 (211)
395 KOG3433|consensus 20.6 3.8E+02 0.0083 20.2 8.6 16 14-29 82-97 (203)
396 PF09261 Alpha-mann_mid: Alpha 20.6 2.1E+02 0.0046 17.2 4.1 48 32-79 9-56 (80)
397 PF08287 DASH_Spc19: Spc19; I 20.5 3.1E+02 0.0068 19.2 5.9 48 29-76 51-103 (153)
398 KOG0161|consensus 20.5 3.7E+02 0.0081 26.6 6.6 48 26-73 1767-1819(1930)
399 PF09766 FimP: Fms-interacting 20.5 4.3E+02 0.0093 20.8 8.5 55 10-64 98-152 (355)
400 PF00429 TLV_coat: ENV polypro 20.4 3.2E+02 0.0069 23.0 5.7 34 19-52 423-456 (561)
401 PF09753 Use1: Membrane fusion 20.4 3.6E+02 0.0077 19.8 8.7 51 14-64 168-218 (251)
402 PF10398 DUF2443: Protein of u 20.3 2.3E+02 0.0049 18.4 3.8 21 36-56 55-75 (79)
403 KOG4673|consensus 20.3 5.5E+02 0.012 23.5 7.2 48 13-61 715-762 (961)
404 PF09726 Macoilin: Transmembra 20.3 5.9E+02 0.013 22.3 9.4 46 30-75 470-515 (697)
405 COG4896 Uncharacterized protei 20.2 13 0.00029 23.4 -1.9 20 45-64 9-28 (68)
406 COG3352 FlaC Putative archaeal 20.2 3.5E+02 0.0076 19.6 5.8 30 50-79 81-110 (157)
407 PF04899 MbeD_MobD: MbeD/MobD 20.1 2.4E+02 0.0051 17.6 6.6 36 16-51 24-62 (70)
No 1
>PF04799 Fzo_mitofusin: fzo-like conserved region; InterPro: IPR006884 This entry represents the heptad repeat domain which is conserved at the C terminus of Fzo/mitofusion family of GTPases. Fzo is a mediator of mitochondrial fusion during spermatogenesis []. This conserved region is also found in the human mitofusin protein []. This domain forms a dimeric antiparallel coiled coil structure, which has been proposed to act as a mitochodrial tether before vesicle fusion [].; GO: 0003924 GTPase activity, 0006184 GTP catabolic process, 0008053 mitochondrial fusion, 0005741 mitochondrial outer membrane, 0016021 integral to membrane; PDB: 1T3J_A.
Probab=100.00 E-value=3e-36 Score=216.89 Aligned_cols=82 Identities=54% Similarity=0.874 Sum_probs=58.9
Q ss_pred CchhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhcc
Q psy7799 1 MIVDLTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYLV 80 (83)
Q Consensus 1 ~iV~~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~ 80 (83)
+||++||+||||||||||++||+|||++||++++||++||++|+++|++||++++++|+|||||+||++|||+|.++||+
T Consensus 90 ~iv~~tsancs~QVqqeL~~tf~rL~~~Vd~~~~eL~~eI~~L~~~i~~le~~~~~~k~LrnKa~~L~~eL~~F~~~yL~ 169 (171)
T PF04799_consen 90 LIVSFTSANCSHQVQQELSSTFARLCQQVDQTKNELEDEIKQLEKEIQRLEEIQSKSKTLRNKANWLESELERFQEQYLQ 169 (171)
T ss_dssp --------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT-
T ss_pred HHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 58999999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred cC
Q psy7799 81 NM 82 (83)
Q Consensus 81 ~~ 82 (83)
+.
T Consensus 170 ~~ 171 (171)
T PF04799_consen 170 KS 171 (171)
T ss_dssp --
T ss_pred CC
Confidence 63
No 2
>KOG0448|consensus
Probab=99.82 E-value=4.4e-20 Score=154.30 Aligned_cols=82 Identities=37% Similarity=0.561 Sum_probs=79.6
Q ss_pred CchhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhcc
Q psy7799 1 MIVDLTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYLV 80 (83)
Q Consensus 1 ~iV~~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~ 80 (83)
++++.|++||||||+|||++||++||++|+.+..+|+++++.+.++|+.+|.+|+++|.+||||.+++.||++|..+||.
T Consensus 668 ~~~~~h~a~cs~qi~~El~~vfa~l~r~v~~~~~~~~d~~~~~~~~le~lds~q~~~K~~~~ka~~l~~el~mf~~~~L~ 747 (749)
T KOG0448|consen 668 EFDDSHTANCSHQIRQELRTVFAILCRTVDRVEEQMRDDLLATKKQLESLDSIQSRLKFFRSKALKLEYELIMFERQYLS 747 (749)
T ss_pred HHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 36899999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred cC
Q psy7799 81 NM 82 (83)
Q Consensus 81 ~~ 82 (83)
+.
T Consensus 748 ~~ 749 (749)
T KOG0448|consen 748 ID 749 (749)
T ss_pred CC
Confidence 73
No 3
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=93.21 E-value=1.3 Score=27.71 Aligned_cols=50 Identities=20% Similarity=0.322 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHH
Q psy7799 21 TFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKE 70 (83)
Q Consensus 21 tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~e 70 (83)
....+..+|+++++-|.+.|+.+-.+=++||.+..++..|...|..+...
T Consensus 4 kl~~i~~~v~~v~~im~~Ni~~ll~Rge~L~~L~~kt~~L~~~a~~F~k~ 53 (89)
T PF00957_consen 4 KLEQIQEQVEEVKNIMRENIDKLLERGEKLEELEDKTEELSDNAKQFKKN 53 (89)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCchHHHHHHHHHHHHHHhHHHHHH
Confidence 45678899999999999999999999999999888877777666665543
No 4
>cd07646 I-BAR_IMD_IRSp53 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Insulin Receptor tyrosine kinase Substrate p53. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. IRSp53 (Insulin Receptor tyrosine kinase Substrate p53) is also known as BAIAP2 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2). It is a scaffolding protein that takes part in many signaling pathways including Cdc42-induced filopodia formation, Rac-mediated lamellipodia extension, and spine morphogenesis. IRSp53 exists as multiple splicing variants that differ mainly at the C-termini. One variant (T-form) is expressed exclusively in human breast cancer cells. The gene encoding IRSp53 is a putative susceptibility gene for Gilles de la Tourette syndrome. IRSp53 contains an N-terminal IMD, a CRIB (Cdc42 and Rac interactive binding motif), an SH3 domain, and a WASP
Probab=90.83 E-value=4.4 Score=30.94 Aligned_cols=32 Identities=22% Similarity=0.271 Sum_probs=24.7
Q ss_pred HHHHHHHHHHhhHHhhhhHH--------------------HHHHHHHHHHHH
Q psy7799 46 HLKQVEEASNNAKVLRNKAH--------------------YLIKELDMFQEA 77 (83)
Q Consensus 46 ~i~~le~i~~~~K~lrnKa~--------------------~L~~eL~~F~~~ 77 (83)
+-+.||+.++.+|++|.|.. ...+||+.|...
T Consensus 126 k~~sleK~qseLKKlRrKsqg~k~~~ky~~ke~q~~~~~~~~q~ele~f~~~ 177 (232)
T cd07646 126 KGESLEKCQAELKKLRKKSQGSKNPQKYSDKELQYIEAISNKQGELENYVSD 177 (232)
T ss_pred HHHHHHHHHHHHHHHHHhhccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34488999999999998843 456788888764
No 5
>PRK10884 SH3 domain-containing protein; Provisional
Probab=88.59 E-value=8.1 Score=28.52 Aligned_cols=64 Identities=19% Similarity=0.239 Sum_probs=38.7
Q ss_pred HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFAR---LCQLVDRATDDMHEDIKQLDSHLKQV----EEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 14 VqqEL~~tf~r---L~~~Vd~~~~eL~~ei~~L~~~i~~l----e~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+++|+...=++ +.++-+....+|.+.+++.+..|..| +.+..++..++++...|+.+++.-...
T Consensus 98 le~el~~l~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~L~~~n~~L~~~l~~~~~~~~~l~~~~~~~~~~ 168 (206)
T PRK10884 98 LENQVKTLTDKLNNIDNTWNQRTAEMQQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDAANLQLDDKQRT 168 (206)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555544333 33334455556666666666666543 345667777888888888888776543
No 6
>PF10157 DUF2365: Uncharacterized conserved protein (DUF2365); InterPro: IPR019314 This entry is found in a highly conserved family of proteins which have no known function.
Probab=87.28 E-value=5.9 Score=28.12 Aligned_cols=50 Identities=20% Similarity=0.346 Sum_probs=40.2
Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHH
Q psy7799 20 STFARLCQLVDRATDDMH---EDIKQLDSHLKQVEEASNNAKVLRNKAHYLIK 69 (83)
Q Consensus 20 ~tf~rL~~~Vd~~~~eL~---~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~ 69 (83)
.+-.++|-.||.+-+-|. ..-.+|++.+..+..+..+.|.+|.-..+||+
T Consensus 95 ~sv~~~cdsvD~sik~~y~liakceELn~~M~~v~~La~qIK~Ik~~lD~lE~ 147 (149)
T PF10157_consen 95 DSVDKLCDSVDASIKSMYTLIAKCEELNESMKPVYKLAQQIKDIKKLLDLLES 147 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 345678888888877776 56678889999999999999999977777664
No 7
>COG5185 HEC1 Protein involved in chromosome segregation, interacts with SMC proteins [Cell division and chromosome partitioning]
Probab=86.84 E-value=7.5 Score=33.15 Aligned_cols=62 Identities=19% Similarity=0.359 Sum_probs=57.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
-+||..+|....+-+..-...++++.+.|..+|+.-+++-...++|+-|-.-|.+.++.|..
T Consensus 255 e~Elk~~f~~~~~~i~~~i~~lk~~n~~l~e~i~ea~k~s~~i~~l~ek~r~l~~D~nk~~~ 316 (622)
T COG5185 255 EQELKLGFEKFVHIINTDIANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYEN 316 (622)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH
Confidence 47999999999888888889999999999999999999999999999999999999999975
No 8
>PF06705 SF-assemblin: SF-assemblin/beta giardin
Probab=86.36 E-value=11 Score=27.71 Aligned_cols=66 Identities=17% Similarity=0.411 Sum_probs=52.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEA-----SNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i-----~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+-+.+..+..++.+.+..-...+..-++.|..+|..|+.. ..+...+.+++.-|.++|..|...|
T Consensus 67 ~~e~~i~~~~~~v~~~~~~~~~~~~~~l~~L~~ri~~L~~~i~ee~~~r~~~ie~~~~~l~~~l~~l~~~~ 137 (247)
T PF06705_consen 67 KFEEQINNMQERVENQISEKQEQLQSRLDSLNDRIEALEEEIQEEKEERPQDIEELNQELVRELNELQEAF 137 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 45567778888999999999999999999999999999654 3344456678888999999887764
No 9
>PRK05561 DNA topoisomerase IV subunit A; Validated
Probab=84.95 E-value=4.8 Score=34.79 Aligned_cols=53 Identities=19% Similarity=0.293 Sum_probs=43.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
||.+....-...|++|+++|.++|+.|+.+-..-+.+. ..+-.||..+.+.|=
T Consensus 430 ~L~~Lt~le~~kl~~E~~~l~~ei~~l~~iL~s~~~l~---~~i~~eL~~ik~kfg 482 (742)
T PRK05561 430 RLRRLAKLEEIEIRKEQDELRKEIAELEAILASERKLR---KLIKKELKADAKKFG 482 (742)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHH---HHHHHHHHHHHHHhC
Confidence 66777777788999999999999999999977666655 467889999988883
No 10
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=84.20 E-value=5.2 Score=26.20 Aligned_cols=44 Identities=16% Similarity=0.285 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 27 QLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 27 ~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
+.++++..++++.|+.++..|..|++ +.+.++.+..-++..|..
T Consensus 63 ~~~~e~~~~l~~r~e~ie~~i~~lek---~~~~l~~~l~e~q~~l~~ 106 (110)
T TIGR02338 63 TDKEEAIQELKEKKETLELRVKTLQR---QEERLREQLKELQEKIQE 106 (110)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
Confidence 45778888888888888887776654 455667777777776653
No 11
>PRK05560 DNA gyrase subunit A; Validated
Probab=83.98 E-value=5.3 Score=34.69 Aligned_cols=53 Identities=19% Similarity=0.278 Sum_probs=44.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
||.+....-...|..|+++|.++|+.|+.+-+.-+.+. ..+-.||..+.+.|=
T Consensus 423 ~L~~LT~~e~~kL~~E~~~l~~ei~~l~~iL~~~~~l~---~~i~~EL~~ikkkfg 475 (805)
T PRK05560 423 RLQRLTGLERDKIEDEYKELLALIADLKDILASPERLL---EIIKEELLEIKEKFG 475 (805)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHH---HHHHHHHHHHHHHhC
Confidence 55556666678899999999999999999988777766 588899999999984
No 12
>TIGR01063 gyrA DNA gyrase, A subunit. This model describes the common type II DNA topoisomerase (DNA gyrase). Two apparently independently arising families, one in the Proteobacteria and one in Gram-positive lineages, are both designated toposisomerase IV.
Probab=83.68 E-value=5.8 Score=34.45 Aligned_cols=54 Identities=19% Similarity=0.290 Sum_probs=45.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhcc
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYLV 80 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~ 80 (83)
||.+....-...|++|+++|.++|+.|+.+.+.-+.+. ..+-.||....+.|=.
T Consensus 420 ~L~~Lt~~e~~kl~~e~~~l~~ei~~l~~iL~~~~~l~---~vi~~EL~eikkkfg~ 473 (800)
T TIGR01063 420 RLQRLTGLEREKLQEEYKELLELIADLEDILASEERVL---EIIREELEEIKEQFGD 473 (800)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHcCHHHHH---HHHHHHHHHHHHHhCC
Confidence 56666666678999999999999999999988877665 6889999999999843
No 13
>PF02646 RmuC: RmuC family; InterPro: IPR003798 This protein contains several bacterial RmuC DNA recombination proteins. The function of the RMUC protein is unknown but it is suspected that it is either a structural protein that protects DNA against nuclease action, or is itself involved in DNA cleavage at the regions of DNA secondary structures []. Proteins in this family are predicted to contain a central endonuclease-like fold domain, surrounded by coiled coils, consistent with a direct role in DNA cleavage [, ].
Probab=83.45 E-value=16 Score=27.90 Aligned_cols=66 Identities=18% Similarity=0.292 Sum_probs=35.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHhhHHhh-------hhHHHHHHHHHHHHHHh
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV-------EEASNNAKVLR-------NKAHYLIKELDMFQEAY 78 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l-------e~i~~~~K~lr-------nKa~~L~~eL~~F~~~y 78 (83)
++.+-+...+.++...|+....+..++...|..+|..| +.+......|. -+..|=|-.|++--+..
T Consensus 6 ~l~~pl~e~l~~~~~~l~~~~~~~~~~~~~L~~~l~~l~~~~~~~~~l~~~~~~L~~aL~~~k~rG~wGE~~Le~iLe~~ 85 (304)
T PF02646_consen 6 QLLKPLKEQLEKFEKRLEESFEQRSEEFGSLKEQLKQLSEANGEIQQLSQEASNLTSALKNSKTRGNWGEMQLERILEDS 85 (304)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhCCCchhhHHHHHHHHHHHHc
Confidence 34455556666666666666666555555555555333 33333333322 34566666777666654
No 14
>TIGR01061 parC_Gpos DNA topoisomerase IV, A subunit, Gram-positive. Operationally, topoisomerase IV is a type II topoisomerase required for the decatenation of chromosome segregation. Not every bacterium has both a topo II and a topo IV. The topo IV families of the Gram-positive bacteria and the Gram-negative bacteria appear not to represent a single clade among the type II topoisomerases, and are represented by separate models for this reason.
Probab=83.06 E-value=6.1 Score=34.15 Aligned_cols=53 Identities=19% Similarity=0.389 Sum_probs=44.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
||.+....-...|..|+++|.++|+.|+.+-..-..+. ..+-+||..+.+.|=
T Consensus 420 ~l~~lt~~e~~kl~~e~~~l~~~i~~l~~iL~~~~~~~---~~i~~el~~ik~kfg 472 (738)
T TIGR01061 420 RLYRLTNTDIFELKEEQNELEKKIISLEQIIASEKARN---KLLKKQLEEYKKQFA 472 (738)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHH---HHHHHHHHHHHHHhC
Confidence 56667777778999999999999999999987777665 578899999998883
No 15
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=82.17 E-value=22 Score=29.27 Aligned_cols=64 Identities=16% Similarity=0.346 Sum_probs=45.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEE----ASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~----i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
.|+++|+...=.++.. ...-...|+.+|++++++|.+++. .+...+.++.+...++..|+.-..
T Consensus 41 ~q~q~ei~~~~~~i~~-~~~~~~kL~~~lk~~e~~i~~~~~ql~~s~~~l~~~~~~I~~~~~~l~~l~~ 108 (420)
T COG4942 41 KQIQKEIAALEKKIRE-QQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEV 108 (420)
T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHHHHHH
Confidence 3677787776666554 445567789999999999988875 345566777777777777765543
No 16
>PF08514 STAG: STAG domain ; InterPro: IPR013721 STAG domain proteins are subunits of cohesin complex - a protein complex required for sister chromatid cohesion in eukaryotes. The STAG domain is present in Schizosaccharomyces pombe (Fission yeast) mitotic cohesin Psc3, and the meiosis specific cohesin Rec11. Many organisms express a meiosis-specific STAG protein, for example, mice and humans have a meiosis specific variant called STAG3, although budding yeast does not have a meiosis specific version [].
Probab=82.06 E-value=5.7 Score=26.67 Aligned_cols=53 Identities=11% Similarity=0.199 Sum_probs=38.5
Q ss_pred hhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy7799 5 LTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNA 57 (83)
Q Consensus 5 ~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~ 57 (83)
+||+-++.++.-=|..+...|+..++.+.+.|+.|-+.-...=.++|.++++.
T Consensus 64 hTaT~~~l~i~t~L~~v~~~l~~~~~~~~rQl~aE~~k~~~~~~r~~~l~~~i 116 (118)
T PF08514_consen 64 HTATLAALKIMTSLCDVAKSLSEELEKTQRQLEAEKKKKRKNKSRLEELEQKI 116 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHh
Confidence 57777888888888888888888888888877777766555555666665544
No 17
>PF06825 HSBP1: Heat shock factor binding protein 1; InterPro: IPR009643 Heat shock factor binding protein 1 (HSBP1) appears to be a negative regulator of the heat shock response [].; PDB: 3CI9_A.
Probab=80.41 E-value=5.7 Score=23.93 Aligned_cols=35 Identities=17% Similarity=0.368 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLK 48 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~ 48 (83)
+-++|..-|..+..++-.-..||-..|++|++-|.
T Consensus 11 lL~qmq~kFq~mS~~I~~riDeM~~RIDdLE~si~ 45 (54)
T PF06825_consen 11 LLQQMQDKFQTMSDQILGRIDEMSSRIDDLEKSIA 45 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 44556666666665555555555555555444443
No 18
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=80.33 E-value=21 Score=26.28 Aligned_cols=10 Identities=50% Similarity=0.723 Sum_probs=4.5
Q ss_pred HHHHHHHHHH
Q psy7799 68 IKELDMFQEA 77 (83)
Q Consensus 68 ~~eL~~F~~~ 77 (83)
..+|+.|.+.
T Consensus 111 ~~~L~~~v~~ 120 (251)
T PF11932_consen 111 IDELEQFVEL 120 (251)
T ss_pred HHHHHHHHhc
Confidence 3444444443
No 19
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=79.82 E-value=7.6 Score=23.79 Aligned_cols=34 Identities=21% Similarity=0.375 Sum_probs=27.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAH 65 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~ 65 (83)
+-.||++.|..|..+|.+++....+-+.-|+=|.
T Consensus 22 Sv~EL~~RIa~L~aEI~R~~~~~~~K~a~r~AAe 55 (59)
T PF06698_consen 22 SVEELEERIALLEAEIARLEAAIAKKSASRAAAE 55 (59)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4579999999999999999988877776665443
No 20
>TIGR01062 parC_Gneg DNA topoisomerase IV, A subunit, proteobacterial. Operationally, topoisomerase IV is a type II topoisomerase required for the decatenation of chromosome segregation. Not every bacterium has both a topo II and a topo IV. The topo IV families of the Gram-positive bacteria and the Gram-negative bacteria appear not to represent a single clade among the type II topoisomerases, and are represented by separate models for this reason.
Probab=79.25 E-value=9.2 Score=33.27 Aligned_cols=53 Identities=17% Similarity=0.239 Sum_probs=45.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
||.+....-...+..|.++|.++|+.|+.+-..-..++ ..+..||....+.|=
T Consensus 417 rL~~L~~le~~~i~~E~~~l~~e~~~l~~~L~~~~~~~---~~i~~el~~~~~~~g 469 (735)
T TIGR01062 417 RLRHLAKLEEHAIIDEQSELEKERAILEKILKSERELN---QLVKKEIQADATKYG 469 (735)
T ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhCCHHHHH---HHHHHHHHHHHHHhC
Confidence 67777777788999999999999999999988877776 467889999888883
No 21
>cd07645 I-BAR_IMD_BAIAP2L1 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 1. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. BAIAP2L1 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 1) is also known as IRTKS (Insulin Receptor Tyrosine Kinase Substrate). It is widely expressed, serves as a substrate for the insulin receptor, and binds the small GTPase Rac. It plays a role in regulating the actin cytoskeleton and colocalizes with F-actin, cortactin, VASP, and vinculin. BAIAP2L1 expression leads to the formation of short actin bundles, distinct from filopodia-like protrusions induced by the expression of the related protein IRSp53. It contains an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus. The IMD domain of
Probab=79.08 E-value=26 Score=26.74 Aligned_cols=18 Identities=28% Similarity=0.490 Sum_probs=15.3
Q ss_pred HHHHHHHHHHhhHHhhhh
Q psy7799 46 HLKQVEEASNNAKVLRNK 63 (83)
Q Consensus 46 ~i~~le~i~~~~K~lrnK 63 (83)
+.+.||+-++.+|++|.|
T Consensus 124 k~dsLeK~~seLKK~RRK 141 (226)
T cd07645 124 KLDSLEKSQADLKKIRRK 141 (226)
T ss_pred HHHHHHHHHHHHHHHHhc
Confidence 344899999999999977
No 22
>KOG0860|consensus
Probab=77.87 E-value=20 Score=24.75 Aligned_cols=39 Identities=21% Similarity=0.305 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 23 ARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 23 ~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
.++.+|||++..=|.+.|...-.+=++|+.+..++-.|-
T Consensus 32 ~~tq~QvdeVv~IMr~NV~KVlER~ekL~~L~drad~L~ 70 (116)
T KOG0860|consen 32 QQTQAQVDEVVDIMRENVEKVLERGEKLDELDDRADQLQ 70 (116)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHhcchHHHHHHHHHHHH
Confidence 356678888888888877776666666666555444443
No 23
>PRK13979 DNA topoisomerase IV subunit A; Provisional
Probab=77.26 E-value=9.8 Score=34.02 Aligned_cols=66 Identities=21% Similarity=0.310 Sum_probs=52.5
Q ss_pred hHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 11 SHQVEQELSSTFA------------RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 11 shQVqqEL~~tf~------------rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
...+.+.|..-|. ||.+....-...++.|.++|.++|+.|+.|-..-+.++ ..+-.||....+.|
T Consensus 414 ~~~a~~~l~~~f~~s~~qa~aIl~mrL~~Lt~le~~kl~~E~~eL~~~I~~l~~iL~~~~~l~---~vi~~EL~eik~ky 490 (957)
T PRK13979 414 KKDASENLIEKFGFTDEQAEAILELMLYRLTGLEIVAFEKEYKELEKLIKKLTKILSSEKELL---KVIKKELKEVKEKY 490 (957)
T ss_pred HHHHHHHHHHHhCCCHHHHHHHHhCcHHhhhhhHHHHHHHHHHHHHHHHHHHHHHhCCHHHHH---HHHHHHHHHHHHHh
Confidence 3455666655554 56666667778999999999999999999998888877 57889999999988
Q ss_pred c
Q psy7799 79 L 79 (83)
Q Consensus 79 L 79 (83)
=
T Consensus 491 g 491 (957)
T PRK13979 491 G 491 (957)
T ss_pred C
Confidence 4
No 24
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=77.14 E-value=13 Score=29.36 Aligned_cols=46 Identities=20% Similarity=0.371 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-------HHhhHHhhhhHHHHHHHHHHHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEA-------SNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i-------~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
+.....+++++.++.++|+.|+.- .++.+.+.+...-++.++..+.
T Consensus 241 ~~~~~~l~~~~~~~~~~i~~l~~~l~~~~k~~~k~~~~~~q~~~~~k~~~~~~ 293 (406)
T PF02388_consen 241 KEYLESLQEKLEKLEKEIEKLEEKLEKNPKKKNKLKELEEQLASLEKRIEEAE 293 (406)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH-THHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCcchhhHHHHHHHHHHHHHHHHHHHH
Confidence 345566677777777777777664 3444444444444444444433
No 25
>PHA02562 46 endonuclease subunit; Provisional
Probab=76.94 E-value=11 Score=30.14 Aligned_cols=44 Identities=14% Similarity=0.261 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
...+++.+++.+...|..++....+.+.++.+..-+.+++....
T Consensus 307 ~i~~l~~~l~~l~~~i~~~~~~~~~~~~~~~~i~el~~~i~~~~ 350 (562)
T PHA02562 307 KLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNK 350 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555666666666666555555555555555555555554443
No 26
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=76.76 E-value=23 Score=24.90 Aligned_cols=30 Identities=20% Similarity=0.440 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 18 LSSTFARLCQLVDRATDDMHEDIKQLDSHL 47 (83)
Q Consensus 18 L~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i 47 (83)
|..-+.+|.+.++....+|+++|..+...+
T Consensus 78 lr~~~e~L~~eie~l~~~L~~ei~~l~a~~ 107 (177)
T PF07798_consen 78 LRSENEKLQREIEKLRQELREEINKLRAEV 107 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555566666666666666666665544
No 27
>KOG0995|consensus
Probab=76.37 E-value=19 Score=30.88 Aligned_cols=63 Identities=19% Similarity=0.393 Sum_probs=56.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
..+|+...|...-.-+-..-.+|+..-++|..+|+..+..-.+...||.|-.-|.+.++.|..
T Consensus 218 ~~~Elk~~l~~~~~~i~~~ie~l~~~n~~l~e~i~e~ek~~~~~eslre~~~~L~~D~nK~~~ 280 (581)
T KOG0995|consen 218 LEDELKHRLEKYFTSIANEIEDLKKTNRELEEMINEREKDPGKEESLREKKARLQDDVNKFQA 280 (581)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHhHHHHHHH
Confidence 467888888888887778888999999999999999999999999999999999999999974
No 28
>PF09403 FadA: Adhesion protein FadA; InterPro: IPR018543 FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. ; PDB: 3ETZ_B 3ETY_A 2GL2_B 3ETX_C 3ETW_A.
Probab=75.51 E-value=15 Score=25.48 Aligned_cols=36 Identities=8% Similarity=0.255 Sum_probs=28.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 20 STFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASN 55 (83)
Q Consensus 20 ~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~ 55 (83)
..|..|-..-+.+.++|+.+|++.+..|.-+|.+++
T Consensus 89 ~eYk~llk~y~~~~~~L~k~I~~~e~iI~~fe~i~~ 124 (126)
T PF09403_consen 89 DEYKELLKKYKDLLNKLDKEIAEQEQIIDNFEKIQS 124 (126)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 467777788888888888888888888888887764
No 29
>PF06825 HSBP1: Heat shock factor binding protein 1; InterPro: IPR009643 Heat shock factor binding protein 1 (HSBP1) appears to be a negative regulator of the heat shock response [].; PDB: 3CI9_A.
Probab=75.40 E-value=15 Score=22.07 Aligned_cols=36 Identities=11% Similarity=0.292 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 19 SSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEAS 54 (83)
Q Consensus 19 ~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~ 54 (83)
...+..|.........++-..|+++.++|+.||+-.
T Consensus 9 ~~lL~qmq~kFq~mS~~I~~riDeM~~RIDdLE~si 44 (54)
T PF06825_consen 9 QNLLQQMQDKFQTMSDQILGRIDEMSSRIDDLEKSI 44 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH
Confidence 344445555555555666666666666666666543
No 30
>PRK09343 prefoldin subunit beta; Provisional
Probab=74.84 E-value=22 Score=23.83 Aligned_cols=50 Identities=18% Similarity=0.253 Sum_probs=34.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 26 CQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 26 ~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+..+++..++++.++-++.+|..||+ ..+.|+.+..-+++.|......+
T Consensus 66 ~qd~~e~~~~l~~r~E~ie~~ik~lek---q~~~l~~~l~e~q~~l~~ll~~~ 115 (121)
T PRK09343 66 KVDKTKVEKELKERKELLELRSRTLEK---QEKKLREKLKELQAKINEMLSKY 115 (121)
T ss_pred hccHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhc
Confidence 345667777888888887766666554 55667777778888877766654
No 31
>PRK14148 heat shock protein GrpE; Provisional
Probab=73.50 E-value=11 Score=27.85 Aligned_cols=38 Identities=11% Similarity=0.264 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 38 EDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 38 ~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.++..+..+ ++.+...++.|+++.-.+..+.++|.+-.
T Consensus 40 ~e~~~l~~~---l~~l~~e~~elkd~~lR~~Ae~eN~rKR~ 77 (195)
T PRK14148 40 EQLERAKDT---IKELEDSCDQFKDEALRAKAEMENIRKRA 77 (195)
T ss_pred hHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444443 44555666677888888888888887643
No 32
>PRK08476 F0F1 ATP synthase subunit B'; Validated
Probab=73.47 E-value=16 Score=24.97 Aligned_cols=34 Identities=9% Similarity=0.217 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHL 47 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i 47 (83)
+.+++..+-..|..++.....+|+.++..++.+|
T Consensus 101 ~~~~~~~a~~~l~~e~~~~~~~l~~qv~~~~~~~ 134 (141)
T PRK08476 101 LESKYEAFAKQLANQKQELKEQLLSQMPEFKEAL 134 (141)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3444444444444444444444444444444443
No 33
>PF09177 Syntaxin-6_N: Syntaxin 6, N-terminal; InterPro: IPR015260 Members of this entry, which are found in the amino terminus of various SNARE proteins, adopt a structure consisting of an antiparallel three-helix bundle. Their exact function has not been determined, though it is known that they regulate the SNARE motif, as well as mediate various protein-protein interactions involved in membrane-transport []. ; GO: 0048193 Golgi vesicle transport, 0016020 membrane; PDB: 1LVF_B 2C5I_T 2C5J_A 2C5K_T 4DND_A.
Probab=73.13 E-value=19 Score=22.93 Aligned_cols=56 Identities=18% Similarity=0.242 Sum_probs=39.4
Q ss_pred HHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 18 LSSTFARLCQLV---------DRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 18 L~~tf~rL~~~V---------d~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
|.+.|.++.... ..+.++|...+..+..++.-|+..-.-+..=..|++.=+.||..
T Consensus 17 l~~~~~~~~~~~~~~~~~~e~~~~~~eL~~~l~~ie~~L~DL~~aV~ive~np~kF~l~~~Ei~~ 81 (97)
T PF09177_consen 17 LESLYRRWQRLRSDTSSSEELKWLKRELRNALQSIEWDLEDLEEAVRIVEKNPSKFNLSEEEISR 81 (97)
T ss_dssp HHHHHHHHHHHTTHCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHT-HHHHHHH
T ss_pred HHHHHHHHHHhcccCCCcHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccccCCCHHHHHH
Confidence 345555555543 34668888888888888888888877776667777777777754
No 34
>PF07352 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo []. This protein is also found in many bacterial species as part of a suspected prophage. Further studies have shown that Gam is a functional counterpart of the eukaryotic Ku protein, which has key roles in DNA repair and in certain transposition events. Gam displays DNA binding characteristics remarkably similar to those of human Ku []. In addition, Gam can interfere with Ty1 retrotransposition in Saccharomyces cerevisiae (Baker's yeast). These data reveal structural and functional parallels between bacteriophage Gam and eukaryotic Ku and suggest that their functions have been evolutionarily conserved [].; GO: 0003690 double-stranded DNA binding, 0042262 DNA protection; PDB: 2P2U_B.
Probab=72.11 E-value=29 Score=23.78 Aligned_cols=49 Identities=20% Similarity=0.315 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
-|.++++.....|+++|+++.. ..+..++.|+++..+|++-|..|.+..
T Consensus 14 ~l~~~~~~i~~~~~~~I~~i~~------~~~~~~~~l~~~i~~l~~~l~~y~e~~ 62 (149)
T PF07352_consen 14 ELQREIARIEAEANDEIARIKE------WYEAEIAPLQNRIEYLEGLLQAYAEAN 62 (149)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH------HHHHHCHHHHHHHHHHHHHHHHHHHCT
T ss_pred HHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHC
Confidence 3456667777777777777543 456788999999999999999887653
No 35
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=71.19 E-value=50 Score=26.19 Aligned_cols=64 Identities=14% Similarity=0.159 Sum_probs=36.1
Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 15 EQELSSTFARLCQLVDRA---TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~---~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
+++|......+.+++... .+.+..++.++..++..+-..+.++..|.-.+...+...+.|.+.|
T Consensus 312 ~~~l~~~l~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~el~~L~Re~~~~~~~Y~~l~~r~ 378 (498)
T TIGR03007 312 YQQLQIELAEAEAEIASLEARVAELTARIERLESLLRTIPEVEAELTQLNRDYEVNKSNYEQLLTRR 378 (498)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555555444433 2445566666666666666666666666666666666555555544
No 36
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=70.78 E-value=20 Score=21.54 Aligned_cols=55 Identities=16% Similarity=0.276 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 22 FARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 22 f~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
...+.+.|+....+.++-|++|+-++..+.. +.-..+..|..-.-++|+.+...|
T Consensus 23 r~~~i~~~e~~l~ea~~~l~qMe~E~~~~p~--s~r~~~~~kl~~yr~~l~~lk~~l 77 (79)
T PF05008_consen 23 RKSLIREIERDLDEAEELLKQMELEVRSLPP--SERNQYKSKLRSYRSELKKLKKEL 77 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCTS-H--HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCH--HHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4455566777777777777777777776632 333455566666666666665544
No 37
>PF13747 DUF4164: Domain of unknown function (DUF4164)
Probab=69.81 E-value=27 Score=22.52 Aligned_cols=18 Identities=28% Similarity=0.338 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVD 30 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd 30 (83)
++.+.|...+.+|...|+
T Consensus 8 ~al~rL~~aid~LE~~v~ 25 (89)
T PF13747_consen 8 AALTRLEAAIDRLEKAVD 25 (89)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 455555666666655554
No 38
>TIGR00414 serS seryl-tRNA synthetase. This model represents the seryl-tRNA synthetase found in most organisms. This protein is a class II tRNA synthetase, and is recognized by the pfam model tRNA-synt_2b. The seryl-tRNA synthetases of two archaeal species, Methanococcus jannaschii and Methanobacterium thermoautotrophicum, differ considerably and are included in a different model.
Probab=68.74 E-value=59 Score=26.06 Aligned_cols=44 Identities=16% Similarity=0.317 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHHHHH--------HHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 35 DMHEDIKQLDSHLKQ--------VEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 35 eL~~ei~~L~~~i~~--------le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
+|..+.+.++++|.. .+.+..++|.|+++..-|+.++....+.+
T Consensus 48 ~l~~erN~~sk~i~~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~ 99 (418)
T TIGR00414 48 ELQAKRNELSKQIGKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAEL 99 (418)
T ss_pred HHHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355555556666655 45677788888888888888887766654
No 39
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=68.49 E-value=15 Score=22.09 Aligned_cols=23 Identities=22% Similarity=0.505 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNA 57 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~ 57 (83)
|++.||..|.+++..++.-..++
T Consensus 1 D~~~E~~rL~Kel~kl~~~i~~~ 23 (66)
T PF10458_consen 1 DVEAEIERLEKELEKLEKEIERL 23 (66)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667777777776666544443
No 40
>cd00187 TOP4c DNA Topoisomerase, subtype IIA; domain A'; bacterial DNA topoisomerase IV (C subunit, ParC), bacterial DNA gyrases (A subunit, GyrA),mammalian DNA toposiomerases II. DNA topoisomerases are essential enzymes that regulate the conformational changes in DNA topology by catalysing the concerted breakage and rejoining of DNA strands during normal cellular growth.
Probab=68.25 E-value=18 Score=29.65 Aligned_cols=48 Identities=21% Similarity=0.410 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHH
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F 74 (83)
+|.+....-...|++|++++.++|+.|+.+...-+.++ .....||+.|
T Consensus 398 ~L~~LT~~e~~kL~~E~~~l~~ei~~l~~~l~~~~~~~---~~i~~eL~~~ 445 (445)
T cd00187 398 RLRRLTKLEREKLLKELKELEAEIEDLEKILASEERPK---DLWKEELDEF 445 (445)
T ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhcChHHHH---HHHHHHHHhC
Confidence 45555666677899999999999999998886665554 4555666654
No 41
>KOG1962|consensus
Probab=67.80 E-value=30 Score=26.15 Aligned_cols=51 Identities=24% Similarity=0.359 Sum_probs=37.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.-+....|++.=.+++++....||+.++.-..|+..+.-+..|-++-.+.|
T Consensus 152 ~~~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~Kq~e~~~~EydrLlee~ 202 (216)
T KOG1962|consen 152 ENDKLKADLEKLETELEKKQKKLEKAQKKVDALKKQSEGLQDEYDRLLEEY 202 (216)
T ss_pred hHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHH
Confidence 345556666666677777777888888888888888888888877777665
No 42
>PF07412 Geminin: Geminin; InterPro: IPR022786 This family contains the eukaryotic protein geminin (approximately 200 residues long). Geminin inhibits DNA replication by preventing the incorporation of MCM complex into prereplication complex, and is degraded during the mitotic phase of the cell cycle. It has been proposed that geminin inhibits DNA replication during S, G2, and M phases and that geminin destruction at the metaphase-anaphase transition permits replication in the succeeding cell cycle []. ; GO: 0008156 negative regulation of DNA replication; PDB: 1T6F_B 2LP0_B 1UII_B 2WVR_B 2ZXX_B.
Probab=67.69 E-value=49 Score=24.74 Aligned_cols=70 Identities=20% Similarity=0.335 Sum_probs=50.6
Q ss_pred hhhchhHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 6 TSSNCSHQVEQELSSTFA-RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 6 tssncshQVqqEL~~tf~-rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
++.+.|-|-|++|+.=.. -|.-..++... |..+|..+..+|..|-.--..++.|-.-+.+|..-++.+..
T Consensus 100 ~~e~Pse~YWk~lAE~RR~AL~eaL~ENe~-Lh~~ie~~~eEi~~lk~en~~L~elae~~~~la~~ie~l~~ 170 (200)
T PF07412_consen 100 SSEGPSENYWKELAEERRKALEEALEENEK-LHKEIEQKDEEIAKLKEENEELKELAEHVQYLAEVIERLTG 170 (200)
T ss_dssp -SSSCCHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCC-
T ss_pred hcCCChHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 566889999999987543 34444445444 88888888888888888778888887778888877776543
No 43
>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein. This model describes a domain found in some proteins associated with polyhydroxyalkanoate (PHA) granules in a subset of species that have PHA inclusion granules. Included are two tandem proteins of Pseudomonas oleovorans, PhaI and PhaF, and their homologs in related species. PhaF proteins have a low-complexity C-terminal region with repeats similar to AAAKP.
Probab=67.37 E-value=32 Score=23.05 Aligned_cols=28 Identities=21% Similarity=0.442 Sum_probs=17.5
Q ss_pred HHHHHHHHHHH----HHHHHHHHHHHHHHHHH
Q psy7799 27 QLVDRATDDMH----EDIKQLDSHLKQVEEAS 54 (83)
Q Consensus 27 ~~Vd~~~~eL~----~ei~~L~~~i~~le~i~ 54 (83)
.+|..+.+.|. +||++|+.+|+.|+.--
T Consensus 81 ~~v~~~L~~lg~~tk~ev~~L~~RI~~Le~~l 112 (118)
T TIGR01837 81 ERVEQALNRLNIPSREEIEALSAKIEQLAVQV 112 (118)
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHH
Confidence 34555555553 67777777777776543
No 44
>PF07028 DUF1319: Protein of unknown function (DUF1319); InterPro: IPR010746 This entry is represented by Commelina yellow mottle virus, Orf1. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains a number of viral proteins of unknown function approximately 200 residues long. Family members seem to be restricted to badnaviruses.
Probab=67.30 E-value=40 Score=23.59 Aligned_cols=60 Identities=25% Similarity=0.283 Sum_probs=42.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNN----AKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~----~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
++..=|..+|.|+.----. .++.....|..+..|+.. ++.++.+-.-|..||.+..+.||
T Consensus 21 dLahNL~v~~~R~dL~~KV-------~~~~~~~~lk~~~ki~~~Qr~~l~~l~~~l~~l~~eL~~Lr~~~l 84 (126)
T PF07028_consen 21 DLAHNLRVTCYRSDLGSKV-------SQKKLLEELKNLSKIQESQRSELKELKQELDVLSKELQALRKEYL 84 (126)
T ss_pred HHHhhhhhhhhHhhHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444556666665433333 445555667777777776 89999999999999999999997
No 45
>PF02108 FliH: Flagellar assembly protein FliH; InterPro: IPR018035 This entry represents a region found in the flagellar assembly protein FliH, as well as in type III secretion system protein HrpE. Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export []. The sequence of fliH has been deduced and shown to encode a protein of molecular mass of 25,782 Da. Bacterial HrpE proteins are belived to function on the type III secretion system, specifically the secretion of HrpZ (harpinPss) [].
Probab=67.29 E-value=27 Score=22.14 Aligned_cols=36 Identities=14% Similarity=0.328 Sum_probs=31.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHL 47 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i 47 (83)
+|..++|...+..+.........+++.++-+|--.|
T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~l~~~i 36 (128)
T PF02108_consen 1 EQAQAELEQLLEELEEALQELLEELEQELVELALAI 36 (128)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 377899999999999999999999999998887665
No 46
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=66.81 E-value=73 Score=26.87 Aligned_cols=16 Identities=13% Similarity=0.050 Sum_probs=5.8
Q ss_pred HHhhHHhhhhHHHHHH
Q psy7799 54 SNNAKVLRNKAHYLIK 69 (83)
Q Consensus 54 ~~~~K~lrnKa~~L~~ 69 (83)
+.++..|.-.+...+.
T Consensus 375 ~~e~~~L~Re~~~~~~ 390 (754)
T TIGR01005 375 QVDLDALQRDAAAKRQ 390 (754)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3333333333333333
No 47
>PF13864 Enkurin: Calmodulin-binding
Probab=66.79 E-value=6.6 Score=25.28 Aligned_cols=37 Identities=22% Similarity=0.481 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRA-----TDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~-----~~eL~~ei~~L~~~i~~le 51 (83)
|.||..-|.+|...+|+. ..+|+.+++++++.|..++
T Consensus 53 ~~el~~ey~~lp~~~DT~~~~~rK~~lE~~L~qlE~dI~~ls 94 (98)
T PF13864_consen 53 WDELNKEYQKLPFSIDTLRKKRRKEELEKELKQLEKDIKKLS 94 (98)
T ss_pred HHHHHHHHHhCCcccCCHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 555666666665554433 4566666677666666654
No 48
>COG5420 Uncharacterized conserved small protein containing a coiled-coil domain [Function unknown]
Probab=66.21 E-value=12 Score=23.73 Aligned_cols=32 Identities=22% Similarity=0.306 Sum_probs=27.2
Q ss_pred HHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 46 HLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 46 ~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.+..||.+|++.++|..||+.--.+|.+..+.
T Consensus 4 ~ms~l~eiqkKvrkLqsrAg~akm~LhDLAEg 35 (71)
T COG5420 4 EMSSLEEIQKKVRKLQSRAGQAKMELHDLAEG 35 (71)
T ss_pred hHhhHHHHHHHHHHHHHHHHHHHhhHHHHhcc
Confidence 35678999999999999999999888877654
No 49
>cd07666 BAR_SNX7 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 7. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. The specific function of SNX7 is still unknown. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=65.48 E-value=54 Score=24.88 Aligned_cols=17 Identities=24% Similarity=0.503 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy7799 34 DDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~l 50 (83)
..++.||+.++.+++.-
T Consensus 180 ~~~~~ev~~~e~kve~a 196 (243)
T cd07666 180 DLLKEEIEKLEDKVECA 196 (243)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 35666666666665555
No 50
>PF05082 Rop-like: Rop-like; InterPro: IPR007774 This family contains several uncharacterised bacterial proteins. These proteins are found in nitrogen fixation operons, so are likely to play a role in this process.; PDB: 3CSX_A 2JS5_B.
Probab=64.62 E-value=20 Score=22.51 Aligned_cols=32 Identities=28% Similarity=0.301 Sum_probs=25.5
Q ss_pred HHHHHHHhhHHhhhhHHHHHHHHHHHHHHhccc
Q psy7799 49 QVEEASNNAKVLRNKAHYLIKELDMFQEAYLVN 81 (83)
Q Consensus 49 ~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~~ 81 (83)
-++.+..+.|+|+.||+-...+|-+-.+. |+.
T Consensus 3 d~~eLk~evkKL~~~A~~~kmdLHDLaEd-LP~ 34 (66)
T PF05082_consen 3 DIEELKKEVKKLNRKATQAKMDLHDLAED-LPT 34 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHC-TTT
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHc-cch
Confidence 45667777888889999999999888887 653
No 51
>PRK11637 AmiB activator; Provisional
Probab=64.19 E-value=70 Score=25.26 Aligned_cols=13 Identities=15% Similarity=0.079 Sum_probs=4.8
Q ss_pred hhhhHHHHHHHHH
Q psy7799 60 LRNKAHYLIKELD 72 (83)
Q Consensus 60 lrnKa~~L~~eL~ 72 (83)
+..+..-++.+|+
T Consensus 108 l~~eI~~~q~~l~ 120 (428)
T PRK11637 108 LNASIAKLEQQQA 120 (428)
T ss_pred HHHHHHHHHHHHH
Confidence 3333333333333
No 52
>PRK14145 heat shock protein GrpE; Provisional
Probab=63.97 E-value=20 Score=26.47 Aligned_cols=45 Identities=16% Similarity=0.321 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.........++..|..++ +.+..+++.|++|.-.+..++++|.+-
T Consensus 37 ~~~~~~~~~e~~~l~~~l---~~le~e~~el~d~~lR~~AEfeN~rkR 81 (196)
T PRK14145 37 NQPQQQTVDEIEELKQKL---QQKEVEAQEYLDIAQRLKAEFENYRKR 81 (196)
T ss_pred cccccCchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445555666665553 455566677777777777777777654
No 53
>PRK03918 chromosome segregation protein; Provisional
Probab=63.94 E-value=75 Score=26.91 Aligned_cols=36 Identities=14% Similarity=0.283 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy7799 23 ARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAK 58 (83)
Q Consensus 23 ~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K 58 (83)
.++...+....++++.++..+...+..++.+...++
T Consensus 161 ~~~~~~~~~~~~~~~~~~~~l~~~l~~l~~i~~~l~ 196 (880)
T PRK03918 161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIK 196 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444445555555444444444444444
No 54
>PF03938 OmpH: Outer membrane protein (OmpH-like); InterPro: IPR005632 This entry includes outer membrane proteins such as OmpH (Skp) among others. OmpH (outer membrane protein H) is a major structural protein of the outer membrane. In Pasteurella multocida it acts as a channel-forming transmembrane porin []. Porins act as molecular sieves to allow the diffusion of small hydrophilic solutes through the outer membrane and also acts as a receptor for bacteriophages and bacteriocins. Porins are highly immunogenic and are conserved in bacterial families, making them attractive vaccine candidates []. The 17kDa protein (Skp, OmpH) of Escherichia coli is a homotrimeric periplasmic chaperone for newly synthesised outer-membrane proteins, the X-ray structure of which has been reported at resolutions of 2.35 A and 2.30 A [, ]. Three hairpin-shaped alpha-helical extensions reach out by approximately 60 A from a trimerisation domain, which is composed of three intersubunit beta-sheets that wind around a central axis. The alpha-helical extensions approach each other at their distal turns, resulting in a fold that resembles a 'three-pronged grasping forcep'. The overall shape of Skp is reminiscent of the cytosolic chaperone prefoldin (IPR009053 from INTERPRO), although it is based on a radically different topology. The peculiar architecture, with apparent plasticity of the prongs and distinct electrostatic and hydrophobic surface properties, supports the recently proposed biochemical mechanism of this chaperone: formation of a Skp(3)-Omp complex protects the outer membrane protein from aggregation during passage through the bacterial periplasm. The ability of Skp to prevent the aggregation of model substrates in vitro is independent of ATP. Skp can interact directly with membrane lipids and lipopolysaccharide. These interactions are needed for efficient Skp-assisted folding of membrane proteins [].; GO: 0051082 unfolded protein binding; PDB: 1SG2_C 1U2M_C.
Probab=63.79 E-value=41 Score=22.45 Aligned_cols=41 Identities=15% Similarity=0.270 Sum_probs=29.7
Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 6 TSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSH 46 (83)
Q Consensus 6 tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~ 46 (83)
.++.-..+++.+|...+..+..+++.-..+++....++...
T Consensus 29 ~~~~~~k~~~~~l~~~~~~~~~~l~~~~~el~~~~~~l~~~ 69 (158)
T PF03938_consen 29 QESPAGKDAQAKLQEKFKALQKELQAKQKELQKLQQKLQSQ 69 (158)
T ss_dssp HHHHHHHTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTS
T ss_pred HhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555667788888888888888887777777777666553
No 55
>PF12001 DUF3496: Domain of unknown function (DUF3496); InterPro: IPR021885 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 110 amino acids in length.
Probab=63.60 E-value=22 Score=24.30 Aligned_cols=37 Identities=27% Similarity=0.548 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
..|+-.|++|+.++.++-..|..+. ..||+.|.+.|+
T Consensus 3 sQmElrIkdLeselsk~Ktsq~d~~---------~~eLEkYkqly~ 39 (111)
T PF12001_consen 3 SQMELRIKDLESELSKMKTSQEDSN---------KTELEKYKQLYL 39 (111)
T ss_pred hHHHHHHHHHHHHHHHhHhHhhhhh---------HHHHHHHHHHHH
Confidence 3455566666666655554444431 356777777765
No 56
>PRK05431 seryl-tRNA synthetase; Provisional
Probab=63.17 E-value=78 Score=25.44 Aligned_cols=45 Identities=20% Similarity=0.325 Sum_probs=31.0
Q ss_pred HHHHHHHHHHHHHHHH-------HHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 34 DDMHEDIKQLDSHLKQ-------VEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~-------le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
++|..+.+.++++|.. .+.+..+++.|+++..-|+.++....+.+
T Consensus 45 ~~lr~~rn~~sk~i~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~ 96 (425)
T PRK05431 45 EELQAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAEL 96 (425)
T ss_pred HHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555556666666654 45677788888888888888887766554
No 57
>PF13093 FTA4: Kinetochore complex Fta4 of Sim4 subunit, or CENP-50
Probab=63.16 E-value=37 Score=25.09 Aligned_cols=56 Identities=16% Similarity=0.214 Sum_probs=42.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh-----------hhHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR-----------NKAHYLIKELDMFQ 75 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr-----------nKa~~L~~eL~~F~ 75 (83)
.|+...|+.|.... .+|.++.+.+..+++.|..+...+..++ .|.+-|+.||++|.
T Consensus 138 ~e~~~rY~~l~~~L----~~l~~~r~~~~~rl~~lr~L~~lLepf~~~~~~IQ~NLvtr~g~l~~El~rmR 204 (213)
T PF13093_consen 138 PEEAERYAELRERL----IELSEQRQYLQQRLEYLRRLRSLLEPFDSPQENIQPNLVTRDGELEAELERMR 204 (213)
T ss_pred HHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcccccCCccccccccCCCCchHHHHHHHHH
Confidence 45556666666554 4667777888888888888888887666 57789999999985
No 58
>PLN02320 seryl-tRNA synthetase
Probab=62.53 E-value=86 Score=26.31 Aligned_cols=58 Identities=17% Similarity=0.240 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQ------VEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~------le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
++...+..+..+++. |..+.+.+.++|.. .+.+..++|.|+.+..-|+.++....+.+
T Consensus 97 ~ld~~~r~~~~~~~~----lr~ern~~sk~i~~~~~~~~~~~l~~~~k~lk~~i~~le~~~~~~~~~l 160 (502)
T PLN02320 97 ELYENMLALQKEVER----LRAERNAVANKMKGKLEPSERQALVEEGKNLKEGLVTLEEDLVKLTDEL 160 (502)
T ss_pred HHHHHHHHHHHHHHH----HHHHHHHHHHHHHhhhCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444555555544 44444445555543 56777788999999999988887776544
No 59
>KOG0976|consensus
Probab=62.46 E-value=64 Score=29.63 Aligned_cols=64 Identities=16% Similarity=0.163 Sum_probs=49.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------HHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEA--------------SNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i--------------~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
-||...+-.++-+..-.-..++.++|+.|...|-++|.+ ..++..+..--.+++-+|.+|..-|
T Consensus 370 i~e~k~nve~elqsL~~l~aerqeQidelKn~if~~e~~~~dhe~~kneL~~a~ekld~mgthl~mad~Q~s~fk~Lk 447 (1265)
T KOG0976|consen 370 IQEKKENVEEELQSLLELQAERQEQIDELKNHIFRLEQGKKDHEAAKNELQEALEKLDLMGTHLSMADYQLSNFKVLK 447 (1265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccchhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhhHHHHH
Confidence 466677777888888888899999999999999988876 4445555555667777888887665
No 60
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=62.44 E-value=39 Score=21.71 Aligned_cols=43 Identities=19% Similarity=0.370 Sum_probs=23.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHH
Q psy7799 27 QLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 27 ~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~ 72 (83)
..++.+...+++.++.++.+|..++ ...+.+..+..-|..+|.
T Consensus 59 ~~~~ea~~~Le~~~e~le~~i~~l~---~~~~~l~~~~~elk~~l~ 101 (105)
T cd00632 59 QEKEEARTELKERLETIELRIKRLE---RQEEDLQEKLKELQEKIQ 101 (105)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
Confidence 4566666666666666666665443 334444444444444443
No 61
>PF10359 Fmp27_WPPW: RNA pol II promoter Fmp27 protein domain; InterPro: IPR019449 The function of the FMP27 protein is not known. FMP27 is the product of a nuclear encoded gene but it is detected in highly purified mitochondria in high-throughput studies []. This entry represents a domain within FMP27 that contains characteristic HQR and WPPW sequence motifs.
Probab=62.43 E-value=46 Score=27.08 Aligned_cols=29 Identities=17% Similarity=0.377 Sum_probs=24.7
Q ss_pred HHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 50 VEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 50 le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
++.+....+.|+.|..+|++.|+.|....
T Consensus 202 ~~~l~~~~~~l~~~~~~l~~~l~~l~~~~ 230 (475)
T PF10359_consen 202 IEELERHISSLKERIEFLENMLEDLEDSE 230 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence 55677888999999999999999998765
No 62
>PRK06798 fliD flagellar capping protein; Validated
Probab=62.30 E-value=39 Score=27.38 Aligned_cols=46 Identities=11% Similarity=0.266 Sum_probs=34.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEE-ASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~-i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
-...++.+|+.+.++++++|. +..+-..|+.++.-||.-+.....+
T Consensus 380 r~~~l~~~i~~l~~~~~~~e~rl~~~e~~l~~qf~ale~~ms~lnsQ 426 (440)
T PRK06798 380 RSKSIDNRVSKLDLKITDIDTQNKQKQDNIVDKYQKLESTLAALDSQ 426 (440)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345688888888888888885 5666677888888888777766544
No 63
>PRK09631 DNA topoisomerase IV subunit A; Provisional
Probab=61.81 E-value=45 Score=28.76 Aligned_cols=49 Identities=22% Similarity=0.320 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
||.+....-...+++|+++|.++|+.++.+-.. --.++-.||+...+.|
T Consensus 397 rl~rlt~~e~~k~~~e~~~l~~~i~~~~~~L~~------~~~~~~~el~~l~~ky 445 (635)
T PRK09631 397 PIRRISLFDIDKNQKEIRILNKELKSVEKNLKS------IKGYAINFIDKLLAKY 445 (635)
T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHh
Confidence 455555666778899999999999999888666 3357778888887777
No 64
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=61.64 E-value=51 Score=22.80 Aligned_cols=18 Identities=17% Similarity=0.433 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~l 50 (83)
++.|...|+.++.+++..
T Consensus 63 KkhLsqRId~vd~klDe~ 80 (126)
T PF07889_consen 63 KKHLSQRIDRVDDKLDEQ 80 (126)
T ss_pred HHHHHHHHHHHHhhHHHH
Confidence 333444444444443333
No 65
>PF10549 ORF11CD3: ORF11CD3 domain; InterPro: IPR018877 This entry represents the carboxy-terminal domain from ORF11 (Q9XJS9 from SWISSPROT), one of the proteins of Pseudomonas phage D3 (Bacteriophage D3). The function of these proteins are unknown [].
Probab=61.07 E-value=34 Score=20.71 Aligned_cols=36 Identities=17% Similarity=0.293 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLK 48 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~ 48 (83)
.-..++++.++|.=+.--...--++..|+.|..+++
T Consensus 16 ~~~K~~AS~~GrgL~~Wk~~Kp~l~~ki~~l~~~~Q 51 (57)
T PF10549_consen 16 KKEKDIASLCGRGLNRWKWKKPQLEQKIEELEEQLQ 51 (57)
T ss_pred HhHHHHHHHHhHHHHHHHHhhHHHHHHHHHHHHHhh
Confidence 335667777777777777777777777777766654
No 66
>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein. This model describes a domain found in some proteins associated with polyhydroxyalkanoate (PHA) granules in a subset of species that have PHA inclusion granules. Included are two tandem proteins of Pseudomonas oleovorans, PhaI and PhaF, and their homologs in related species. PhaF proteins have a low-complexity C-terminal region with repeats similar to AAAKP.
Probab=60.99 E-value=19 Score=24.12 Aligned_cols=17 Identities=18% Similarity=0.214 Sum_probs=8.2
Q ss_pred hHHhhhhHHHHHHHHHH
Q psy7799 57 AKVLRNKAHYLIKELDM 73 (83)
Q Consensus 57 ~K~lrnKa~~L~~eL~~ 73 (83)
...|+.|..-|+.+|+.
T Consensus 98 v~~L~~RI~~Le~~l~~ 114 (118)
T TIGR01837 98 IEALSAKIEQLAVQVEE 114 (118)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34444555555555443
No 67
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=59.60 E-value=40 Score=20.87 Aligned_cols=37 Identities=27% Similarity=0.557 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+.+..+|+++++.++.+| +.|+++...++.++.....
T Consensus 61 ~~~~~~L~~~~~~~~~~i----------~~l~~~~~~l~~~l~~~~~ 97 (106)
T PF01920_consen 61 EEAIEELEERIEKLEKEI----------KKLEKQLKYLEKKLKELKK 97 (106)
T ss_dssp HHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444 4444455555555555443
No 68
>PF10146 zf-C4H2: Zinc finger-containing protein ; InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=59.15 E-value=74 Score=23.89 Aligned_cols=32 Identities=22% Similarity=0.345 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKA 64 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa 64 (83)
.....+++++...+|..||.+.+.++.=|+|.
T Consensus 48 r~~h~eeLrqI~~DIn~lE~iIkqa~~er~~~ 79 (230)
T PF10146_consen 48 RMAHVEELRQINQDINTLENIIKQAESERNKR 79 (230)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456778888899999999998887766553
No 69
>cd01106 HTH_TipAL-Mta Helix-Turn-Helix DNA binding domain of the transcription regulators TipAL, Mta, and SkgA. Helix-turn-helix (HTH) TipAL, Mta, and SkgA transcription regulators, and related proteins, N-terminal domain. TipAL regulates resistance to and activation by numerous cyclic thiopeptide antibiotics, such as thiostrepton. Mta is a global transcriptional regulator; the N-terminal DNA-binding domain of Mta interacts directly with the promoters of mta, bmr, blt, and ydfK, and induces transcription of these multidrug-efflux transport genes. SkgA has been shown to control stationary-phase expression of catalase-peroxidase in Caulobacter crescentus. These proteins are comprised of distinct domains that harbor an N-terminal active (DNA-binding) site and a regulatory (effector-binding) site. The conserved N-terminal domain of these transcription regulators contains winged HTH motifs that mediate DNA binding. These proteins share the N-terminal DNA binding domain with other transcrip
Probab=59.01 E-value=44 Score=21.19 Aligned_cols=45 Identities=18% Similarity=0.209 Sum_probs=22.5
Q ss_pred hhchhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 7 SSNCSHQVEQELSSTF-ARLCQLVDRATDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 7 ssncshQVqqEL~~tf-~rL~~~Vd~~~~eL~~ei~~L~~~i~~le 51 (83)
..+.+-.--+++-... ..+.+.+..-..+|+.+|++|...++.|+
T Consensus 55 ~~g~~l~~i~~~~~~~~~~~~~~l~~~~~~l~~~i~~l~~~~~~l~ 100 (103)
T cd01106 55 ELGFSLKEIKELLKDPSEDLLEALREQKELLEEKKERLDKLIKTID 100 (103)
T ss_pred HcCCCHHHHHHHHHcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444333 34444555555555555555555555554
No 70
>PF07352 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo []. This protein is also found in many bacterial species as part of a suspected prophage. Further studies have shown that Gam is a functional counterpart of the eukaryotic Ku protein, which has key roles in DNA repair and in certain transposition events. Gam displays DNA binding characteristics remarkably similar to those of human Ku []. In addition, Gam can interfere with Ty1 retrotransposition in Saccharomyces cerevisiae (Baker's yeast). These data reveal structural and functional parallels between bacteriophage Gam and eukaryotic Ku and suggest that their functions have been evolutionarily conserved [].; GO: 0003690 double-stranded DNA binding, 0042262 DNA protection; PDB: 2P2U_B.
Probab=58.72 E-value=49 Score=22.59 Aligned_cols=42 Identities=14% Similarity=0.216 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASN 55 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~ 55 (83)
+++|+...=+-+..+++......+.+++.|..+|+.|+..-.
T Consensus 15 l~~~~~~i~~~~~~~I~~i~~~~~~~~~~l~~~i~~l~~~l~ 56 (149)
T PF07352_consen 15 LQREIARIEAEANDEIARIKEWYEAEIAPLQNRIEYLEGLLQ 56 (149)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 466677777777788888888888899999998888887654
No 71
>PRK14143 heat shock protein GrpE; Provisional
Probab=58.71 E-value=32 Score=26.09 Aligned_cols=41 Identities=20% Similarity=0.307 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
+...++..|..+| +.+..+++.|++|.-.+..++++|.+..
T Consensus 64 ~~~~~~~~l~~el---~~l~~e~~elkd~~lR~~AdfeN~RKR~ 104 (238)
T PRK14143 64 DNAARLAQLEQEL---ESLKQELEELNSQYMRIAADFDNFRKRT 104 (238)
T ss_pred cchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555543 3445556666777777777777776543
No 72
>PRK07737 fliD flagellar capping protein; Validated
Probab=58.44 E-value=1e+02 Score=25.36 Aligned_cols=42 Identities=12% Similarity=0.247 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHHHHHH-HHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 35 DMHEDIKQLDSHLKQVEE-ASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~-i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
.|..+|+.++++|..+|. +...-+.+|.|+.-||.-+.....
T Consensus 445 ~l~~~i~~l~~~i~~~~~rl~~~e~ry~~qf~ale~~~s~mns 487 (501)
T PRK07737 445 AIGKDLNQIETQIDRFQDRLKQIEDRYYKKFSAMEKAIQKANE 487 (501)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666666666666654 333345678888888877766544
No 73
>PF12732 YtxH: YtxH-like protein; InterPro: IPR024623 This family of uncharacterised proteins is found in bacteria. Proteins in this family are typically between 100 and 143 amino acids in length. The N-terminal region is the most conserved.
Probab=57.96 E-value=40 Score=20.40 Aligned_cols=37 Identities=11% Similarity=0.208 Sum_probs=26.7
Q ss_pred hhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 6 TSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQ 42 (83)
Q Consensus 6 tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~ 42 (83)
.+..-+.+.++.|......+...+.....++++.+++
T Consensus 19 ~aP~sG~e~R~~l~~~~~~~~~~~~~~~~~~~~~~k~ 55 (74)
T PF12732_consen 19 FAPKSGKETREKLKDKAEDLKDKAKDLYEEAKEKVKE 55 (74)
T ss_pred hCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445577888888888888888887777776665444
No 74
>PF11690 DUF3287: Protein of unknown function (DUF3287); InterPro: IPR021704 This eukaryotic family of proteins has no known function.
Probab=57.78 E-value=58 Score=22.21 Aligned_cols=54 Identities=7% Similarity=0.155 Sum_probs=42.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh-hhHHHHHHHHHHHHH
Q psy7799 22 FARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR-NKAHYLIKELDMFQE 76 (83)
Q Consensus 22 f~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr-nKa~~L~~eL~~F~~ 76 (83)
+-.+-.-|..-++.+-+|-+.|.++|+.| .|.++...|- +...|+...++.|+.
T Consensus 40 a~~F~~kV~~qH~~~~~e~r~L~kKi~~l-~veRkmr~Les~p~~W~~~gf~~~a~ 94 (109)
T PF11690_consen 40 AYDFIDKVVDQHQRYCDERRKLRKKIQDL-RVERKMRALESHPFDWERRGFDKVAP 94 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhccCChHHHHHhchHHhcc
Confidence 34455667788999999999999999999 6777777765 678888888877753
No 75
>PF07851 TMPIT: TMPIT-like protein; InterPro: IPR012926 A number of members of this family are annotated as being transmembrane proteins induced by tumour necrosis factor alpha, but no literature was found to support this. ; GO: 0016021 integral to membrane
Probab=57.67 E-value=97 Score=24.75 Aligned_cols=45 Identities=18% Similarity=0.152 Sum_probs=29.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHh
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVL 60 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~l 60 (83)
++|..+.+-..+..++...-...-.+.++++..+|..+...+|.+
T Consensus 14 q~Lqethr~Y~qKleel~~lQ~~C~ssI~~QkkrLk~L~~sLk~~ 58 (330)
T PF07851_consen 14 QELQETHRSYKQKLEELSKLQDKCSSSISHQKKRLKELKKSLKRC 58 (330)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 456666666666666666666666666666777777777777777
No 76
>PRK14150 heat shock protein GrpE; Provisional
Probab=57.08 E-value=44 Score=24.37 Aligned_cols=20 Identities=10% Similarity=0.014 Sum_probs=14.8
Q ss_pred HhhhhHHHHHHHHHHHHHHh
Q psy7799 59 VLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 59 ~lrnKa~~L~~eL~~F~~~y 78 (83)
.++++.-.+..+.++|.+..
T Consensus 56 ~~kd~~lR~~AefeN~rkR~ 75 (193)
T PRK14150 56 EERDSVLRARAEVENIRRRA 75 (193)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 46777778888888887653
No 77
>PF09798 LCD1: DNA damage checkpoint protein; InterPro: IPR018622 This is a family of proteins which regulate checkpoint kinases. In Schizosaccharomyces pombe (Fission yeast) this protein is called Rad26 and in Saccharomyces cerevisiae (Baker's yeast) it is called LCD1 [].
Probab=56.62 E-value=46 Score=28.89 Aligned_cols=42 Identities=19% Similarity=0.328 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i 53 (83)
.|.|+|-..=-.++...++.....-++|++.|...+++||.-
T Consensus 7 ~~Lq~ek~~E~~~l~~~~~~lk~~~~~el~~Lk~~vqkLEDE 48 (654)
T PF09798_consen 7 ELLQQEKQKERQALKSSVEELKESHEEELNKLKSEVQKLEDE 48 (654)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH
Confidence 467778888888999999999999999999999999999864
No 78
>PF12792 CSS-motif: CSS motif domain associated with EAL ; InterPro: IPR024744 This domain, with its characteristic highly conserved CSS sequence motif, is found N-terminal to the EAL domain (PF00563 from PFAM), found in many putative cyclic diguanylate phosphodiesterases.
Probab=56.60 E-value=57 Score=21.81 Aligned_cols=50 Identities=18% Similarity=0.337 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHhhHHhh
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQL---------DSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L---------~~~i~~le~i~~~~K~lr 61 (83)
||.++.+.....++-.++|.+..+...-+..+ +..+..|-.+--...-+|
T Consensus 1 ~~~~~~~~~~a~~~~~~~e~~~~~~~~~~~~~~~~~~~~Cs~~~~~~Lr~~~~~~~~ir 59 (208)
T PF12792_consen 1 QQEQRDLDTYAQRALQRIESVLDQARQALDALLPLTGQPCSPAHLNALRQIVARSPYIR 59 (208)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHhchhhh
Confidence 57899999999999999999999988888887 233555555544444444
No 79
>TIGR02492 flgK_ends flagellar hook-associated protein FlgK. The flagellar hook-associated protein FlgK of bacterial flagella has conserved N- and C-terminal domains. The central region is highly variable in length and sequence, and often contains substantial runs of low-complexity sequence. This model is built from an alignment of FlgK sequences with the central region excised. Note that several other proteins of the flagellar apparatus also are homologous in the N- and C-terminal regions to FlgK, but are excluded from this model.
Probab=55.68 E-value=79 Score=24.18 Aligned_cols=16 Identities=31% Similarity=0.424 Sum_probs=8.6
Q ss_pred HHhhhhHHHHHHHHHH
Q psy7799 58 KVLRNKAHYLIKELDM 73 (83)
Q Consensus 58 K~lrnKa~~L~~eL~~ 73 (83)
..|+++=..|..+|-.
T Consensus 194 n~L~DqRD~ll~~LS~ 209 (322)
T TIGR02492 194 NDLLDQRDLLLKELSQ 209 (322)
T ss_pred hHhHHHHHHHHHHHHh
Confidence 3455555555555543
No 80
>PRK14160 heat shock protein GrpE; Provisional
Probab=55.61 E-value=69 Score=23.93 Aligned_cols=41 Identities=12% Similarity=0.317 Sum_probs=26.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.++.++..|..++ +.+......++++.-.+..+.++|.+-.
T Consensus 58 ~l~~e~~~l~~~l---~~l~~e~~elkd~~lR~~AefeN~RKR~ 98 (211)
T PRK14160 58 ELKDENNKLKEEN---KKLENELEALKDRLLRTVAEYDNYRKRT 98 (211)
T ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444555554443 4455666677788888888888887654
No 81
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=55.55 E-value=58 Score=25.58 Aligned_cols=46 Identities=17% Similarity=0.351 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhHHhhhhHHHHHHHHH
Q psy7799 27 QLVDRATDDMHEDIKQLDSHLKQVEE-ASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 27 ~~Vd~~~~eL~~ei~~L~~~i~~le~-i~~~~K~lrnKa~~L~~eL~ 72 (83)
..++....+|..+++..+.-+..++. +....+.+.+...-|+.++.
T Consensus 339 ~~le~~q~~l~~~l~~~~~~L~~ve~~~~~N~~~i~~n~~~le~Ri~ 385 (388)
T PF04912_consen 339 SELESQQSDLQSQLKKWEELLNKVEEKFKENMETIEKNVKKLEERIA 385 (388)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34455555555555555555554442 33444455555555554443
No 82
>COG2348 Peptidoglycan interpeptide bridge formation enzyme [Cell wall/membrane/envelope biogenesis]
Probab=54.95 E-value=44 Score=27.44 Aligned_cols=67 Identities=16% Similarity=0.276 Sum_probs=48.4
Q ss_pred HHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH---HhhhhHHHHHHHHHHHHHHh
Q psy7799 12 HQVEQELSSTFARL------CQLVDRATDDMHEDIKQLDSHLKQVEEASNNAK---VLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 12 hQVqqEL~~tf~rL------~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K---~lrnKa~~L~~eL~~F~~~y 78 (83)
...-+.+..+|+-= +--+++....|+.+++.+..++.++......++ ..+|+-.-|..+++.|...+
T Consensus 221 ~~Y~~~~~d~y~d~a~~~la~l~~~e~~~~l~~~l~~~~~~~~r~~~~l~~~~~~~k~~~~l~~l~~q~~~~~~~~ 296 (418)
T COG2348 221 LSYYENFYDIYKDKAELPLAYLDLDEYLKKLNQELAKLAAEIERVQEALKESPKSEKAQNKLNRLQMQLEAFEERI 296 (418)
T ss_pred HHHHHHHHHHHhhhhhhhhhhcCHHHHHHHHHHHHHHHHhHHHHHHHHhccCcchhhhhhhHHHHHHHHHhhHHHH
Confidence 34445556666543 112346677788888888888888887777665 88899999999999998765
No 83
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=54.84 E-value=53 Score=24.23 Aligned_cols=34 Identities=21% Similarity=0.276 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHH
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYL 67 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L 67 (83)
.++.+|.++|.+++..|+.-...+..++..-..|
T Consensus 72 ~~l~~en~~L~~e~~~l~~~~~~~~~l~~en~~L 105 (276)
T PRK13922 72 FDLREENEELKKELLELESRLQELEQLEAENARL 105 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555555555555544444444333333
No 84
>smart00502 BBC B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains
Probab=54.72 E-value=49 Score=20.48 Aligned_cols=34 Identities=18% Similarity=0.434 Sum_probs=27.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 11 SHQVEQELSSTFARLCQLVDRATDDMHEDIKQLD 44 (83)
Q Consensus 11 shQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~ 44 (83)
...+.+++...|.+|...++.-..+|-.+|+...
T Consensus 37 ~~~~~~~I~~~f~~l~~~L~~~e~~ll~~l~~~~ 70 (127)
T smart00502 37 AADVEAQIKAAFDELRNALNKRKKQLLEDLEEQK 70 (127)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3567888899999999999988888888887744
No 85
>PRK07521 flgK flagellar hook-associated protein FlgK; Validated
Probab=54.50 E-value=66 Score=26.13 Aligned_cols=58 Identities=17% Similarity=0.311 Sum_probs=27.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHH-------HhhHHhhhhHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDI-------KQLDSHLKQVEEAS-------NNAKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei-------~~L~~~i~~le~i~-------~~~K~lrnKa~~L~~eL~~F 74 (83)
.|...|..+..+++..+++...+| ..+-++|..|..-. .....|+++=..|-.||-.+
T Consensus 133 ~L~~~~n~~~~~L~~~~~~~~~~i~~~V~~iN~l~~~Ia~LN~~I~~~~~~g~~~ndL~DqRD~ll~~LS~~ 204 (483)
T PRK07521 133 DLANSLNDASDAVQSARADADAEIADSVDTLNDLLAQFEDANNAVVSGTATGRDASDALDQRDKLLKQISQI 204 (483)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCchhhHHHHHHHHHHHHhh
Confidence 345555555555555544444332 22333333333222 22446677777777776543
No 86
>PRK09458 pspB phage shock protein B; Provisional
Probab=53.89 E-value=28 Score=22.35 Aligned_cols=27 Identities=19% Similarity=0.311 Sum_probs=16.0
Q ss_pred HHHHHHHHHHhhHHhhhhHHHHHHHHH
Q psy7799 46 HLKQVEEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 46 ~i~~le~i~~~~K~lrnKa~~L~~eL~ 72 (83)
+.+.|+.+..+++.++.+..-||+-|+
T Consensus 40 d~~~L~~L~~~A~rm~~RI~tLE~ILD 66 (75)
T PRK09458 40 EQQRLAQLTEKAERMRERIQALEAILD 66 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 345566666666666666666665554
No 87
>PF10393 Matrilin_ccoil: Trimeric coiled-coil oligomerisation domain of matrilin; InterPro: IPR019466 This entry represents a short domain found the matrilin (cartilage matrix) proteins. It forms a coiled coil structure and contains a single cysteine residue at its start which is likely to form a di-sulphide bridge with a corresponding cysteine in an upstream EGF domain (IPR006209 from INTERPRO), thereby spanning the VWA domain of the protein (IPR002035 from INTERPRO).This domain is likely to be responsible for protein trimerisation []. ; PDB: 1AQ5_C.
Probab=52.90 E-value=45 Score=19.49 Aligned_cols=25 Identities=16% Similarity=0.344 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~ 52 (83)
.|......|...+++++++++.||.
T Consensus 20 ~v~~~lq~Lt~kL~~vs~RLe~LEn 44 (47)
T PF10393_consen 20 KVTSALQSLTQKLDAVSKRLEALEN 44 (47)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4444555555556666666666654
No 88
>TIGR02047 CadR-PbrR Cd(II)/Pb(II)-responsive transcriptional regulator. This model represents the cadmium(II) and/or lead(II) responsive transcriptional activator of the proteobacterial metal efflux system. This protein is a member of the MerR family of transcriptional activators (pfam00376) and contains a distinctive pattern of cysteine residues in its metal binding loop, Cys-X(6-9)-Cys, as well as a conserved and critical cysteine at the N-terminal end of the dimerization helix.
Probab=52.88 E-value=60 Score=21.57 Aligned_cols=31 Identities=13% Similarity=0.205 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
....-++..+.+++.+|+.|..+...+..+.
T Consensus 79 ~~~~~l~~~~~~l~~~i~~L~~~~~~L~~~~ 109 (127)
T TIGR02047 79 DVNALLDEHISHVRARIIKLQALIEQLVDLR 109 (127)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444566677777777777777776666554
No 89
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=52.57 E-value=87 Score=26.33 Aligned_cols=7 Identities=14% Similarity=-0.078 Sum_probs=2.5
Q ss_pred HHHHHHH
Q psy7799 66 YLIKELD 72 (83)
Q Consensus 66 ~L~~eL~ 72 (83)
-|+.|+.
T Consensus 108 eLEaE~~ 114 (475)
T PRK13729 108 KLGQDNA 114 (475)
T ss_pred HHHHHHH
Confidence 3333333
No 90
>PLN03085 nucleobase:cation symporter-1; Provisional
Probab=52.45 E-value=63 Score=24.66 Aligned_cols=40 Identities=8% Similarity=0.273 Sum_probs=26.3
Q ss_pred HHHHHHHHHHHHHHHHHHHH---------------HHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDR---------------ATDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~---------------~~~eL~~ei~~L~~~i~~le 51 (83)
-++++|+.+...+|...... ....+.++|++|++.|.++.
T Consensus 145 IeL~KEI~~~i~~lR~~Lr~~w~~~s~~~~~~W~~~~~~~qe~Ir~LNkkIdkYN 199 (221)
T PLN03085 145 VELNKEIRGQINEWRSALKKAWANRCEGDDSQWIEDSRLLQEQLRQINNKVFRYN 199 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCchhHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 36788999888888776554 23345666666666666554
No 91
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=52.39 E-value=94 Score=24.61 Aligned_cols=24 Identities=25% Similarity=0.464 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHh
Q psy7799 37 HEDIKQLDSHLKQVEEASNNAKVL 60 (83)
Q Consensus 37 ~~ei~~L~~~i~~le~i~~~~K~l 60 (83)
+..++++..+++++++-...++.+
T Consensus 272 ~~k~~~~~~q~~~~~k~~~~~~~~ 295 (406)
T PF02388_consen 272 KNKLKELEEQLASLEKRIEEAEEL 295 (406)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555555555554444444
No 92
>PRK04654 sec-independent translocase; Provisional
Probab=51.89 E-value=1e+02 Score=23.35 Aligned_cols=27 Identities=7% Similarity=0.069 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIK 41 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~ 41 (83)
-|-+-..+.++...++.+.+++..|++
T Consensus 29 aRtlGk~irk~R~~~~~vk~El~~El~ 55 (214)
T PRK04654 29 ARFAGLWVRRARMQWDSVKQELERELE 55 (214)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 344555666666666666666665543
No 93
>TIGR02051 MerR Hg(II)-responsive transcriptional regulator. This model represents the mercury (II) responsive transcriptional activator of the mer organomercurial resistance operon. This protein is a member of the MerR family of transcriptional activators (pfam00376) and contains a distinctive pattern of cysteine residues in its metal binding loop, Cys-X(8)-Cys-Pro, as well as a conserved and critical cysteine at the N-terminal end of the dimerization helix.
Probab=51.87 E-value=52 Score=21.75 Aligned_cols=54 Identities=7% Similarity=-0.008 Sum_probs=30.4
Q ss_pred hhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHh
Q psy7799 5 LTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVL 60 (83)
Q Consensus 5 ~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~l 60 (83)
+...++|-.--+++-..... ..++....-++..+..++.+|+.|..+...++.+
T Consensus 52 l~~~G~sl~eI~~~l~~~~~--~~~~~~~~~l~~~~~~l~~~i~~L~~~~~~L~~~ 105 (124)
T TIGR02051 52 AQELGFSLEEIGGLLGLVDG--THCREMYELASRKLKSVQAKMADLLRIERLLEEL 105 (124)
T ss_pred HHHCCCCHHHHHHHHhcccC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444454444444333221 2344556667777788888888777776665544
No 94
>cd07628 BAR_Atg24p The Bin/Amphiphysin/Rvs (BAR) domain of yeast Sorting Nexin Atg24p. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. Atg24p is involved in membrane fusion events at the vacuolar surface during pexophagy. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=51.83 E-value=72 Score=22.65 Aligned_cols=41 Identities=12% Similarity=0.283 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
++++-|+.|+..+..++.+..++ -+|-.-|...+..|...|
T Consensus 8 ei~e~~~~L~~~L~~l~ki~~Rl---~kr~~~l~~d~~efg~~~ 48 (185)
T cd07628 8 EIREKSDKLDENLTKIDKIFAKV---VKRQSDLSVDYADLATQF 48 (185)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
Confidence 45566666666666666555432 234455555555555555
No 95
>KOG0687|consensus
Probab=51.77 E-value=58 Score=26.73 Aligned_cols=45 Identities=29% Similarity=0.492 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHhhHHhhhhHHHH------HHHHHHHHHHh
Q psy7799 34 DDMHEDIKQLDSHLKQVEEA---SNNAKVLRNKAHYL------IKELDMFQEAY 78 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i---~~~~K~lrnKa~~L------~~eL~~F~~~y 78 (83)
..=+++|++|+..|+..|+. ..-+..+++||-++ ++-++.|.++|
T Consensus 78 ~~neeki~eld~~iedaeenlGE~ev~ea~~~kaeYycqigDkena~~~~~~t~ 131 (393)
T KOG0687|consen 78 KANEEKIKELDEKIEDAEENLGESEVREAMLRKAEYYCQIGDKENALEALRKTY 131 (393)
T ss_pred HhhHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHhccHHHHHHHHHHHH
Confidence 33467899999999888877 34456788899876 56677777777
No 96
>PF04350 PilO: Pilus assembly protein, PilO; PDB: 2RJZ_B.
Probab=51.63 E-value=16 Score=23.84 Aligned_cols=12 Identities=17% Similarity=0.285 Sum_probs=4.6
Q ss_pred hHHHHHHHHHHH
Q psy7799 63 KAHYLIKELDMF 74 (83)
Q Consensus 63 Ka~~L~~eL~~F 74 (83)
...-++.+|+.+
T Consensus 31 ~~~~~~~~l~~~ 42 (144)
T PF04350_consen 31 QLEQLEQQLEEL 42 (144)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 333333444333
No 97
>PF03954 Lectin_N: Hepatic lectin, N-terminal domain; InterPro: IPR005640 Animal lectins display a wide variety of architectures. They are classified according to the carbohydrate-recognition domain (CRD) of which there are two main types, S-type and C-type. C-type lectins display a wide range of specificities. They require Ca2+ for their activity They are found predominantly but not exclusively in vertebrates. This entry presents N-terminal domain, which is found in C-type lectins.; GO: 0005529 sugar binding, 0016020 membrane
Probab=51.62 E-value=48 Score=23.56 Aligned_cols=50 Identities=20% Similarity=0.284 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
-|.|.||.+ |..+-..+...++.||+.|..+-.. +..|..-|+++|+.-.
T Consensus 58 ~qlq~dl~t----LretfsNFssst~aEvqaL~S~G~s----------l~~kVtSLea~lEkqq 107 (138)
T PF03954_consen 58 SQLQRDLRT----LRETFSNFSSSTLAEVQALSSQGGS----------LQDKVTSLEAKLEKQQ 107 (138)
T ss_pred HHHHHHHHH----HHHHHhcccHHHHHHHHHHHhcccc----------HHhHcccHHHHHHHHH
Confidence 688888765 6667777888888899988775443 3345555555555443
No 98
>cd04776 HTH_GnyR Helix-Turn-Helix DNA binding domain of the regulatory protein GnyR. Putative helix-turn-helix (HTH) regulatory protein, GnyR, and other related proteins. GnyR belongs to the gnyRDBHAL cluster, which is involved in acyclic isoprenoid degradation in Pseudomonas aeruginosa. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules.
Probab=51.56 E-value=68 Score=21.14 Aligned_cols=34 Identities=6% Similarity=0.240 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
.+......+...+..|..+|+.|+.....+....
T Consensus 77 ~~~~~~~~l~~~~~~l~~~~~~l~~~~~~L~~~~ 110 (118)
T cd04776 77 QLEKMLEKIEKRRAELEQQRRDIDAALAELDAAE 110 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445556677777777777777766665544433
No 99
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=51.25 E-value=83 Score=22.05 Aligned_cols=38 Identities=24% Similarity=0.266 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHhhHHhhhhHH----HHHHHHHHHHHH
Q psy7799 40 IKQLDSHLKQVEEASNNAKVLRNKAH----YLIKELDMFQEA 77 (83)
Q Consensus 40 i~~L~~~i~~le~i~~~~K~lrnKa~----~L~~eL~~F~~~ 77 (83)
|+.|.....-.|.++..+..|+.+.. -.+.+|......
T Consensus 43 i~~Lk~~~~d~eeLk~~i~~lq~~~~~~~~~~e~~l~~~~~~ 84 (155)
T PF06810_consen 43 IKDLKKSAKDNEELKKQIEELQAKNKTAKEEYEAKLAQMKKD 84 (155)
T ss_pred HHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333444556777777776665 555555544433
No 100
>KOG1760|consensus
Probab=50.90 E-value=47 Score=23.48 Aligned_cols=35 Identities=31% Similarity=0.517 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy7799 19 SSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNA 57 (83)
Q Consensus 19 ~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~ 57 (83)
-.-|+||.+.++. ++.+|+....+|+-||.+-...
T Consensus 22 iN~Fsrl~~R~~~----lk~dik~~k~~~enledA~~Ei 56 (131)
T KOG1760|consen 22 INEFSRLNSRKDD----LKADIKEAKTEIENLEDASNEI 56 (131)
T ss_pred HHHHHHHHhhHHH----HHHHHHHHHHHHHHHHHHHhhH
Confidence 3568999888764 5666666677777777665543
No 101
>PF14335 DUF4391: Domain of unknown function (DUF4391)
Probab=50.69 E-value=94 Score=22.53 Aligned_cols=58 Identities=17% Similarity=0.256 Sum_probs=32.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRATD----------------DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~----------------eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
...|...|..|+.++..... +.-++|.+|.++|++|+.-..+.|.+..| .-|..||..
T Consensus 143 ~~~l~~lY~~l~~~i~~~~~~~~~g~~~~~~~~~~~~~~~~i~~L~kei~~L~~~~~kEkq~nrk-veln~elk~ 216 (221)
T PF14335_consen 143 GLNLDALYESLVNQIIALNAAPNTGEFEKTSLWERIERLEQIEKLEKEIAKLKKKIKKEKQFNRK-VELNTELKK 216 (221)
T ss_pred cccHHHHHHHHHHHHhcchhhhhcCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCHHHH-HHHHHHHHH
Confidence 34577777787777655443 23345556666666666655555555533 344444443
No 102
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=50.18 E-value=1.1e+02 Score=23.28 Aligned_cols=60 Identities=18% Similarity=0.202 Sum_probs=40.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+|+.--.||....+....+|+.--.+++.--..++.++.....|+|...-++.++..+.+
T Consensus 14 ~lD~e~~rl~~~~~~~~~~l~k~~~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~ 73 (239)
T COG1579 14 KLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRE 73 (239)
T ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 666666666666666666666666666666666677777777777777777777766654
No 103
>PRK11637 AmiB activator; Provisional
Probab=49.73 E-value=1.3e+02 Score=23.79 Aligned_cols=23 Identities=13% Similarity=0.183 Sum_probs=11.0
Q ss_pred HHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 53 ASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 53 i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
++..++.+..+..-++.++..-.
T Consensus 94 ~~~~i~~~~~ei~~l~~eI~~~q 116 (428)
T PRK11637 94 TQNTLNQLNKQIDELNASIAKLE 116 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444455555555544433
No 104
>COG3206 GumC Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]
Probab=49.67 E-value=1.3e+02 Score=23.87 Aligned_cols=35 Identities=14% Similarity=0.246 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHH
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIK 69 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~ 69 (83)
.|.+++.++..+...+-..+-++..|.-++.-..+
T Consensus 353 ~L~~~~~~l~~~~~~~~~~~~~l~~L~Re~~~~r~ 387 (458)
T COG3206 353 ALEKELAQLKGRLSKLPKLQVQLRELEREAEAARS 387 (458)
T ss_pred HHHHHHHHHHHHHhhchHhhhHHHHHHHHHHHHHH
Confidence 34444444444444444444444444333333333
No 105
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=49.31 E-value=82 Score=21.45 Aligned_cols=69 Identities=19% Similarity=0.244 Sum_probs=46.2
Q ss_pred hhhhchhHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 5 LTSSNCSHQVEQELSSTFARLCQLV----------DRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 5 ~tssncshQVqqEL~~tf~rL~~~V----------d~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
..+++-+-|+---|+++-.|+...+ ......+..||-.|..+.+.+.........|+....-|+.+.+.
T Consensus 8 ~~~~~~~~~~ve~L~s~lr~~E~E~~~l~~el~~l~~~r~~l~~Eiv~l~~~~e~~~~~~~~~~~L~~el~~l~~ry~t 86 (120)
T PF12325_consen 8 TSSGGPSVQLVERLQSQLRRLEGELASLQEELARLEAERDELREEIVKLMEENEELRALKKEVEELEQELEELQQRYQT 86 (120)
T ss_pred cccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445556677777777777776554 44455666777777777777777777777777766666666554
No 106
>PF06456 Arfaptin: Arfaptin-like domain; InterPro: IPR010504 Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils []. The N-terminal region of ICA69 is similar to arfaptin [].; PDB: 1I4D_B 1I4L_B 1I49_B 1I4T_A 4DCN_D.
Probab=49.23 E-value=70 Score=23.84 Aligned_cols=37 Identities=16% Similarity=0.289 Sum_probs=28.5
Q ss_pred HHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 41 KQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 41 ~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.+|+.+|+.++.++..++.|-.-+.-+.+.|..+...
T Consensus 33 ~eL~~kle~l~~~~~~y~~L~~~~~~~~~~l~~l~q~ 69 (229)
T PF06456_consen 33 DELDAKLELLRDTQRTYRGLLKHARAYQNRLQALSQT 69 (229)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4677788888888888888888887777777766554
No 107
>PF05873 Mt_ATP-synt_D: ATP synthase D chain, mitochondrial (ATP5H); InterPro: IPR008689 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit D from the F0 complex in F-ATPases found in mitochondria. The D subunit is part of the peripheral stalk that links the F1 and F0 complexes together, and which acts as a stator to prevent certain subunits from rotating with the central rotary element. The peripheral stalk differs in subunit composition between mitochondrial, chloroplast and bacterial F-ATPases. In mitochondria, the peripheral stalk is composed of one copy each of subunits OSCP (oligomycin sensitivity conferral protein), F6, B and D []. There is no homologue of subunit D in bacterial or chloroplast F-ATPase, whose peripheral stalks are composed of one copy of the delta subunit (homologous to OSCP), and two copies of subunit B in bacteria, or one copy each of subunits B and B' in chloroplasts and photosynthetic bacteria. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0000276 mitochondrial proton-transporting ATP synthase complex, coupling factor F(o); PDB: 2CLY_E 2WSS_U.
Probab=49.07 E-value=45 Score=23.60 Aligned_cols=35 Identities=14% Similarity=0.205 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 19 SSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 19 ~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i 53 (83)
..-.+.+...+..+..+.+.+|.+|+++|..+++.
T Consensus 91 ~~~e~~~~~~~~~~~~~s~~~i~~l~keL~~i~~~ 125 (161)
T PF05873_consen 91 DAQEKEAIKEAKEFEAESKKRIAELEKELANIESA 125 (161)
T ss_dssp HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHT--
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 33344555556666677777777777777776654
No 108
>PRK07191 flgK flagellar hook-associated protein FlgK; Validated
Probab=48.82 E-value=77 Score=25.54 Aligned_cols=57 Identities=14% Similarity=0.227 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHH-------hhHHhhhhHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMH-------EDIKQLDSHLKQVEEASN-------NAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~-------~ei~~L~~~i~~le~i~~-------~~K~lrnKa~~L~~eL~~ 73 (83)
.|...|.++..+++..++++. ++|..+-++|..|..-.. ....|+.+=..|-.||-.
T Consensus 138 ~la~~~n~~~~~l~~~~~~~~~~i~~~V~~iN~ll~~Ia~LN~~I~~~~~~g~~~ndL~DqRD~ll~eLS~ 208 (456)
T PRK07191 138 AMALRFNNVNNFIVQQKKSIGQQRDATVKQINSLTRSIADYNQKILKNRSDGNNISDLLDQRDLQIKKLSG 208 (456)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCchhHHHHHHHHHHHHh
Confidence 344555555555554444443 233333344444433332 234566666666666644
No 109
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=48.06 E-value=74 Score=23.34 Aligned_cols=29 Identities=21% Similarity=0.305 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
-.|++.+|+.+..+.++|..+-.+++++.
T Consensus 134 y~D~~arl~~l~~~~~rl~~ll~ka~~~~ 162 (262)
T PF14257_consen 134 YVDLEARLKNLEAEEERLLELLEKAKTVE 162 (262)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCHH
Confidence 34566666666666666666666655433
No 110
>PRK09514 zntR zinc-responsive transcriptional regulator; Provisional
Probab=47.98 E-value=73 Score=21.61 Aligned_cols=58 Identities=9% Similarity=0.119 Sum_probs=33.2
Q ss_pred hhhhchhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhh
Q psy7799 5 LTSSNCSHQVEQELSSTFAR-LCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRN 62 (83)
Q Consensus 5 ~tssncshQVqqEL~~tf~r-L~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrn 62 (83)
+...++|-.--+++-..... -....+....-|+..+.+++.+|+.|+.+...+..+..
T Consensus 54 lr~~G~sL~eI~~~l~~~~~~~~~~~~~~~~~l~~~~~~l~~~i~~L~~~~~~L~~~~~ 112 (140)
T PRK09514 54 AKQLGFTLEEIRELLSIRLDPEHHTCQEVKGIVDEKLAEVEAKIAELQHMRRSLQRLND 112 (140)
T ss_pred HHHcCCCHHHHHHHHHhcccCCcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555444444432210 01123445566788888888888888887777766654
No 111
>PF13935 Ead_Ea22: Ead/Ea22-like protein
Probab=47.92 E-value=87 Score=21.34 Aligned_cols=44 Identities=9% Similarity=0.198 Sum_probs=30.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+--.|-+|+......|..++.. ...-.++.+..-|..+|+.+.+
T Consensus 68 tvLALLDElE~~~~~i~~~~~~-~e~~~~a~~~~~l~~~Le~ae~ 111 (139)
T PF13935_consen 68 TVLALLDELERAQQRIAELEQE-CENEDIALDVQKLRVELEAAEK 111 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH
Confidence 3345667777788888888877 5555666677777777777654
No 112
>PRK11020 hypothetical protein; Provisional
Probab=47.87 E-value=93 Score=21.64 Aligned_cols=42 Identities=12% Similarity=0.237 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLV-----------DRATDDMHEDIKQLDSHLKQVEEASN 55 (83)
Q Consensus 14 VqqEL~~tf~rL~~~V-----------d~~~~eL~~ei~~L~~~i~~le~i~~ 55 (83)
+.+||...-.||..-= .+.-..+.+||..|+++|++|-..++
T Consensus 3 ~K~Eiq~L~drLD~~~~Klaaa~~rgd~~~i~qf~~E~~~l~k~I~~lk~~~~ 55 (118)
T PRK11020 3 EKNEIKRLSDRLDAIRHKLAAASLRGDAEKYAQFEKEKATLEAEIARLKEVQS 55 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667776666665321 13345678888888888888877654
No 113
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=47.73 E-value=96 Score=21.76 Aligned_cols=33 Identities=24% Similarity=0.458 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHH
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKE 70 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~e 70 (83)
.+++||++|.++++. .......||..+.-|+.|
T Consensus 158 ~~~~ei~~lk~el~~---~~~~~~~LkkQ~~~l~~e 190 (192)
T PF05529_consen 158 KLSEEIEKLKKELEK---KEKEIEALKKQSEGLQKE 190 (192)
T ss_pred hhHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhh
Confidence 345555555555544 555556666665555544
No 114
>cd07624 BAR_SNX7_30 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins 7 and 30. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. This subfamily consists of SNX7, SNX30, and similar proteins. The specific functions of SNX7 and SNX30 have not been elucidated. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=47.50 E-value=97 Score=22.11 Aligned_cols=42 Identities=19% Similarity=0.328 Sum_probs=33.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+|++-|.+|+..|..++.+..++- ++-..|...+..|...|
T Consensus 17 ~e~~eyi~~L~~~l~~~~kv~~Rl~---kr~~el~~~~~efg~~~ 58 (200)
T cd07624 17 DKMNEYLTLFGEKLGTIERISQRIH---KERIEYFDELKEYSPIF 58 (200)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHH---HHHHHHHHHHHHHHHHH
Confidence 5788899999999999998877643 35577888888888777
No 115
>PRK08871 flgK flagellar hook-associated protein FlgK; Validated
Probab=46.63 E-value=1.2e+02 Score=25.94 Aligned_cols=18 Identities=28% Similarity=0.429 Sum_probs=11.0
Q ss_pred hHHhhhhHHHHHHHHHHH
Q psy7799 57 AKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 57 ~K~lrnKa~~L~~eL~~F 74 (83)
...|+++=..|-.||..+
T Consensus 192 pNdLlDqRD~ll~eLS~~ 209 (626)
T PRK08871 192 HNDLMDQHEKLVKELSQY 209 (626)
T ss_pred chhhHHHHHHHHHHHHhh
Confidence 346666666666666544
No 116
>COG5374 Uncharacterized conserved protein [Function unknown]
Probab=46.61 E-value=1.2e+02 Score=22.66 Aligned_cols=43 Identities=28% Similarity=0.346 Sum_probs=33.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+.|+.++..|++++. +.+...+.|+++...|...+|...+-+
T Consensus 138 ~D~~eA~~t~lk~~~~---~~~~~le~Lqkn~~~~~k~~d~~ne~~ 180 (192)
T COG5374 138 IDKMEADSTDLKARLR---KAQILLEGLQKNQEELFKLLDKYNELR 180 (192)
T ss_pred hhhhhcchHHHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhHHH
Confidence 4677777777777776 677778888999999888888776544
No 117
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=46.28 E-value=66 Score=19.48 Aligned_cols=25 Identities=20% Similarity=0.482 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEAS 54 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~ 54 (83)
+.....++.+|+.++++|..++..+
T Consensus 26 E~~~~~~e~~i~~~~~~l~~I~~n~ 50 (71)
T PF10779_consen 26 EKRDAANEKDIKNLNKQLEKIKSNT 50 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444555555554444443
No 118
>PRK05291 trmE tRNA modification GTPase TrmE; Reviewed
Probab=46.27 E-value=28 Score=27.92 Aligned_cols=43 Identities=16% Similarity=0.291 Sum_probs=33.8
Q ss_pred chhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 2 IVDLTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLD 44 (83)
Q Consensus 2 iV~~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~ 44 (83)
|.++..|.=..|.++.|.+.-..|...++.-..+|..-+..++
T Consensus 128 i~~li~a~t~~~~~~al~~l~G~l~~~~~~~r~~l~~~~a~ie 170 (449)
T PRK05291 128 IADLIDAKTEAAARLALRQLQGALSKLINELREELLELLALVE 170 (449)
T ss_pred HHHHHhCCCHHHHHHHHHhcCcHHHHHHHHHHHHHHHHHHHhe
Confidence 4567788888999999999988888888888877766655544
No 119
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=46.10 E-value=2e+02 Score=25.04 Aligned_cols=25 Identities=24% Similarity=0.363 Sum_probs=12.7
Q ss_pred HHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 49 QVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 49 ~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
.|+.++.+.+.|+.+|.-|+..++.
T Consensus 580 ~L~~l~e~~~~l~~~ae~LaeR~e~ 604 (717)
T PF10168_consen 580 ELQELQEERKSLRESAEKLAERYEE 604 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555555555555555544443
No 120
>smart00721 BAR BAR domain.
Probab=46.09 E-value=1e+02 Score=21.49 Aligned_cols=54 Identities=15% Similarity=0.218 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHH--------HHHHHHHHHHHHh
Q psy7799 25 LCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAH--------YLIKELDMFQEAY 78 (83)
Q Consensus 25 L~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~--------~L~~eL~~F~~~y 78 (83)
+.......-.+....|+.++...--+|...++++.++.+.. -.+.+|+...+.|
T Consensus 125 ~~~~~~~~~~~~~~~~kk~~~~~lDyD~~~~kl~~~~~~~~~~~~~kl~~~e~el~~ak~~f 186 (239)
T smart00721 125 LLNFLLGEFKEIKKARKKLERKLLDYDSARHKLKKAKKSKEKKKDEKLAKAEEELRKAKQEF 186 (239)
T ss_pred HHHHHHHHhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhccCChhhhhhhHHHHHHHHHHHH
Confidence 44444444555666666666666667777777777765432 2777777776666
No 121
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=45.98 E-value=60 Score=23.04 Aligned_cols=63 Identities=22% Similarity=0.300 Sum_probs=20.4
Q ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFA---RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 14 VqqEL~~tf~---rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
.+.||+..+. .+.+++-....++..-=+.+...-..|..++.....|+.|...|+.+|..-.+
T Consensus 79 l~~ELael~r~~~el~~~L~~~~~~l~~l~~~~~~~~~~l~~l~~~~~~L~~~~~~l~~~l~ek~k 144 (194)
T PF08614_consen 79 LQEELAELYRSKGELAQQLVELNDELQELEKELSEKERRLAELEAELAQLEEKIKDLEEELKEKNK 144 (194)
T ss_dssp -----------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555543 34444444444443333344444445666677777777777777777765443
No 122
>PF05957 DUF883: Bacterial protein of unknown function (DUF883); InterPro: IPR010279 This family consists of several bacterial proteins of unknown function that include the Escherichia coli genes for ElaB, YgaM and YqjD.
Probab=45.43 E-value=75 Score=19.89 Aligned_cols=64 Identities=19% Similarity=0.235 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhH----HhhhhHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLK-QVEEASNNAK----VLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~-~le~i~~~~K----~lrnKa~~L~~eL~~F~~ 76 (83)
++.+||....+.+...+..+.....+.+.++..++. .++.+..+++ .++.++.......+.|..
T Consensus 2 ~l~~~l~~l~~d~~~l~~~~~~~~~~~~~~~r~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~V~ 70 (94)
T PF05957_consen 2 DLKAELEQLRADLEDLARSAADLAGEKADEARDRAEEALDDARDRAEDAADQAREQAREAAEQTEDYVR 70 (94)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 577888888888888888888888888877777664 3444445544 355555555555555543
No 123
>cd01108 HTH_CueR Helix-Turn-Helix DNA binding domain of CueR-like transcription regulators. Helix-turn-helix (HTH) transcription regulators CueR and ActP, copper efflux regulators. In Bacillus subtilis, copper induced CueR regulates the copZA operon, preventing copper toxicity. In Rhizobium leguminosarum, ActP controls copper homeostasis; it detects cytoplasmic copper stress and activates transcription in response to increasing copper concentrations. These proteins are comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain winged HTH motifs that mediate DNA binding, while the C-terminal domains have two conserved cysteines that define a monovalent copper ion binding site. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements
Probab=45.40 E-value=88 Score=20.67 Aligned_cols=31 Identities=13% Similarity=0.207 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
....-++..+.++..+|..|..+...+..+.
T Consensus 79 ~~~~~l~~~~~~l~~~i~~L~~~~~~l~~~~ 109 (127)
T cd01108 79 DVKALALEHIAELERKIAELQAMRRTLQQLA 109 (127)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445566777777777777766666655544
No 124
>PF03285 Paralemmin: Paralemmin; InterPro: IPR004965 Paralemmin was identified in the chicken lens as a protein with a molecular weight of 65 kDa (isoform 1) and a splice variant of 60 kDa (isoform 2). Isoform 2 is predominant during infancy and levels of isoform 1 increase with age. Paralemmin is localised to the plasma membrane of fibre cells, and was not detected in the annular pad cells. Its localisation to the short side of the fibre cell and the sites of fibre cell interlocking suggests that paralemmin may play a role in the development of such interdigitating processes []. Palmitoylation is important for localising these proteins to the filopodia of dendritic cells where they have been implicated in the regulation of membrane dynamics and process outgrowth. ; GO: 0008360 regulation of cell shape, 0016020 membrane
Probab=45.34 E-value=20 Score=27.99 Aligned_cols=21 Identities=24% Similarity=0.539 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i 53 (83)
.+.|++-|-+|+++|+.||.-
T Consensus 12 tR~LEesI~RLEkEIe~LE~~ 32 (278)
T PF03285_consen 12 TRSLEESIHRLEKEIEALENG 32 (278)
T ss_pred HHHHHHHHHHHHHHHHHhccC
Confidence 467999999999999999973
No 125
>TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair.
Probab=45.22 E-value=1.2e+02 Score=25.70 Aligned_cols=54 Identities=28% Similarity=0.375 Sum_probs=38.4
Q ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhHHhhhhHHH
Q psy7799 13 QVEQELSSTFARLC---QLVDRATDDMHEDIKQLDSHLKQVEEA-SNNAKVLRNKAHY 66 (83)
Q Consensus 13 QVqqEL~~tf~rL~---~~Vd~~~~eL~~ei~~L~~~i~~le~i-~~~~K~lrnKa~~ 66 (83)
|..+|++.+.+-.. +.+....++|+.+.+++..++..|+.. +.-.+.|++|+..
T Consensus 216 ~~~~e~d~lk~e~~~~~~~i~~~~~~l~~~~~~~~~~~~~lk~ap~~D~~~L~~~~~~ 273 (555)
T TIGR03545 216 KIKEEFDKLKKEGKADKQKIKSAKNDLQNDKKQLKADLAELKKAPQNDLKRLENKYAI 273 (555)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccHhHHHHHHHHhCC
Confidence 45555555554444 456666679999999999999988875 5667888877764
No 126
>PRK05724 acetyl-CoA carboxylase carboxyltransferase subunit alpha; Validated
Probab=45.19 E-value=61 Score=25.65 Aligned_cols=35 Identities=23% Similarity=0.484 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
.||..--..|....+...-++.+||..|+.+++.+
T Consensus 13 ~~l~~~i~~l~~~~~~~~~~~~~~~~~l~~~~~~~ 47 (319)
T PRK05724 13 AELEAKIEELRAVAEDSDVDLSEEIERLEKKLEEL 47 (319)
T ss_pred HHHHHHHHHHHhhhccccccHHHHHHHHHHHHHHH
Confidence 34444444444443333445555555555555554
No 127
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=44.85 E-value=1.1e+02 Score=21.51 Aligned_cols=33 Identities=9% Similarity=0.259 Sum_probs=23.2
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 9 NCSHQVEQELSSTFARLCQLVDRATDDMHEDIK 41 (83)
Q Consensus 9 ncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~ 41 (83)
.+..++++|+...-.+|...++....+++-+++
T Consensus 80 ~~~e~L~~eie~l~~~L~~ei~~l~a~~klD~n 112 (177)
T PF07798_consen 80 SENEKLQREIEKLRQELREEINKLRAEVKLDLN 112 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456678888888888888887777775544433
No 128
>PRK14162 heat shock protein GrpE; Provisional
Probab=44.80 E-value=80 Score=23.22 Aligned_cols=25 Identities=20% Similarity=0.117 Sum_probs=13.8
Q ss_pred HHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 53 ASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 53 i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+..++..+++|.-.+..++++|.+-
T Consensus 51 l~~e~~elkd~~lR~~AEfeN~rkR 75 (194)
T PRK14162 51 LKAKNKDLEDKYLRSQAEIQNMQNR 75 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344445556666666666666544
No 129
>PRK14139 heat shock protein GrpE; Provisional
Probab=44.58 E-value=70 Score=23.36 Aligned_cols=21 Identities=14% Similarity=0.063 Sum_probs=10.8
Q ss_pred hhHHhhhhHHHHHHHHHHHHH
Q psy7799 56 NAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 56 ~~K~lrnKa~~L~~eL~~F~~ 76 (83)
++..|++|.-.+..+.++|.+
T Consensus 47 e~~elkd~~lR~~AefeN~rK 67 (185)
T PRK14139 47 KAAELQDSFLRAKAETENVRR 67 (185)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333444555555555555554
No 130
>PRK12758 DNA topoisomerase IV subunit A; Provisional
Probab=43.88 E-value=1.1e+02 Score=27.60 Aligned_cols=49 Identities=16% Similarity=0.326 Sum_probs=35.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
||.+...--...+++|+++|.++|..++.+-.. --.++-.+|....+.|
T Consensus 418 rL~rlt~le~~k~~~e~~~l~~~i~~~~~~L~~------l~~~ii~el~~i~~ky 466 (869)
T PRK12758 418 KIKRISKFDSDKADELIARLEAEIAEVKHHLAH------LTDYAIAYFTNLKKKY 466 (869)
T ss_pred HHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHh
Confidence 455555566677889999999999988887765 3346677777777776
No 131
>PRK14156 heat shock protein GrpE; Provisional
Probab=43.85 E-value=91 Score=22.62 Aligned_cols=29 Identities=24% Similarity=0.210 Sum_probs=18.5
Q ss_pred HHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 49 QVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 49 ~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.++.+..++..|+++.-.+..++++|.+-
T Consensus 35 ~l~~l~~e~~elkd~~lR~~AEfeN~rKR 63 (177)
T PRK14156 35 ELELANERADEFENKYLRAHAEMQNIQRR 63 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555566667777777777777654
No 132
>PF11285 DUF3086: Protein of unknown function (DUF3086); InterPro: IPR021437 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=43.68 E-value=40 Score=26.53 Aligned_cols=42 Identities=24% Similarity=0.366 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFAR----LCQLVDRATDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 12 hQVqqEL~~tf~r----L~~~Vd~~~~eL~~ei~~L~~~i~~le~i 53 (83)
.|+++||.++|++ |...|-+++.=|..-+.+|-.-.++||-+
T Consensus 28 ~ri~~EmrtsFaG~Sq~lA~RVqGFkdYLvGsLQDLa~saEqLeLv 73 (283)
T PF11285_consen 28 ERIEKEMRTSFAGQSQDLAIRVQGFKDYLVGSLQDLAQSAEQLELV 73 (283)
T ss_pred HHHHHHHHcccccchHHHHHHHhhhHHHHHHHHHHHHHHHHhhccC
Confidence 5788999999986 55678888888888888888888877743
No 133
>PF04508 Pox_A_type_inc: Viral A-type inclusion protein repeat ; InterPro: IPR007596 The repeat is found in the A-type inclusion protein of the Poxvirus family [].; GO: 0016032 viral reproduction
Probab=43.57 E-value=31 Score=17.56 Aligned_cols=16 Identities=31% Similarity=0.385 Sum_probs=10.4
Q ss_pred HHhhhhHHHHHHHHHH
Q psy7799 58 KVLRNKAHYLIKELDM 73 (83)
Q Consensus 58 K~lrnKa~~L~~eL~~ 73 (83)
..+|+|...|+++|+.
T Consensus 4 ~rlr~rI~dLer~L~~ 19 (23)
T PF04508_consen 4 NRLRNRISDLERQLSE 19 (23)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3466677777777754
No 134
>cd00056 ENDO3c endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Probab=43.56 E-value=68 Score=21.31 Aligned_cols=69 Identities=22% Similarity=0.262 Sum_probs=42.2
Q ss_pred hhhhchhHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhcc
Q psy7799 5 LTSSNCSHQVEQE-LSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYLV 80 (83)
Q Consensus 5 ~tssncshQVqqE-L~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~ 80 (83)
+.++-|++|+.-+ -..++.||++..+-+- +.|..++ .+.|.++..... .+.||..|-.=.+.|.+.|+.
T Consensus 4 Li~~il~q~~s~~~a~~~~~~l~~~~gpt~----~~l~~~~--~~~l~~~~~~~G-~~~kA~~i~~~a~~~~~~~~~ 73 (158)
T cd00056 4 LVSEILSQQTTDKAVNKAYERLFERYGPTP----EALAAAD--EEELRELIRSLG-YRRKAKYLKELARAIVEGFGG 73 (158)
T ss_pred HHHHHHHhcccHHHHHHHHHHHHHHhCCCH----HHHHCCC--HHHHHHHHHhcC-hHHHHHHHHHHHHHHHHHcCC
Confidence 4567788888877 5678888888874111 1122211 133444444444 467888888888888777753
No 135
>PRK01156 chromosome segregation protein; Provisional
Probab=43.53 E-value=2.1e+02 Score=24.58 Aligned_cols=24 Identities=13% Similarity=0.275 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHhhHHhhhhHHH
Q psy7799 43 LDSHLKQVEEASNNAKVLRNKAHY 66 (83)
Q Consensus 43 L~~~i~~le~i~~~~K~lrnKa~~ 66 (83)
+..++..+.......++++.+..+
T Consensus 714 l~eel~~~~~~~~~l~~~~~~~~~ 737 (895)
T PRK01156 714 LSDRINDINETLESMKKIKKAIGD 737 (895)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 343444444444444444444443
No 136
>TIGR00513 accA acetyl-CoA carboxylase, carboxyl transferase, alpha subunit. The enzyme acetyl-CoA carboxylase contains a biotin carboxyl carrier protein or domain, a biotin carboxylase, and a carboxyl transferase. This model represents the alpha chain of the carboxyl transferase for cases in which the architecture of the protein is as in E. coli, in which the carboxyltransferase portion consists of two non-identical subnits, alpha and beta.
Probab=43.07 E-value=71 Score=25.24 Aligned_cols=40 Identities=18% Similarity=0.292 Sum_probs=22.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 10 CSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 10 cshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
|+..+ .||..--..|....+...-++.+||..|+.++..+
T Consensus 8 fe~~i-~~l~~~~~~l~~~~~~~~~~~~~~~~~l~~~~~~~ 47 (316)
T TIGR00513 8 FEKPI-AELEAKIESLRARSRDEDVDLSEEIERLEKRSVEL 47 (316)
T ss_pred hhHHH-HHHHHHHHHHHhhhhcccccHHHHHHHHHHHHHHH
Confidence 44444 45555555555555444556666666666666655
No 137
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=43.05 E-value=1.6e+02 Score=22.87 Aligned_cols=66 Identities=18% Similarity=0.361 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH----HHHHHHHHhhHHhhh-------hHHHHHHHHHHHHHHhc
Q psy7799 14 VEQELSSTFARLCQLVDRATDD---MHEDIKQLDSHL----KQVEEASNNAKVLRN-------KAHYLIKELDMFQEAYL 79 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~e---L~~ei~~L~~~i----~~le~i~~~~K~lrn-------Ka~~L~~eL~~F~~~yL 79 (83)
|.+.|...-..+..++..+.+. +..+-+.|+.+| ..||-.++++++|++ -.--||.||..-...|+
T Consensus 163 iE~~l~~ai~~~~~~~~~~~~~l~~l~~de~~Le~KIekkk~ELER~qKRL~sLq~vRPAfmdEyEklE~EL~~lY~~Y~ 242 (267)
T PF10234_consen 163 IEKALKEAIKAVQQQLQQTQQQLNNLASDEANLEAKIEKKKQELERNQKRLQSLQSVRPAFMDEYEKLEEELQKLYEIYV 242 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444444444332 233444444444 456777777777762 34456777777666664
No 138
>PF11855 DUF3375: Protein of unknown function (DUF3375); InterPro: IPR021804 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length.
Probab=42.89 E-value=62 Score=26.43 Aligned_cols=34 Identities=24% Similarity=0.468 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 18 LSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 18 L~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le 51 (83)
|.++|..|.+.+..+..|=+..|..|+.+|+.++
T Consensus 124 l~tv~~~l~~la~~~~~Dp~~Ri~~Le~e~~~i~ 157 (478)
T PF11855_consen 124 LNTVFDALRQLAEGTDPDPERRIAELEREIAEID 157 (478)
T ss_pred HHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHH
Confidence 6778888888888887665555555555555544
No 139
>PRK14158 heat shock protein GrpE; Provisional
Probab=42.79 E-value=83 Score=23.13 Aligned_cols=39 Identities=18% Similarity=0.225 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 36 MHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 36 L~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.+.++..++.++ +.+..++..+++|.-.+..++++|.+-
T Consensus 38 ~~~~~~~le~~l---~~le~e~~el~d~~lR~~AefeN~RkR 76 (194)
T PRK14158 38 AADRIKELEEAL---AAKEAEAAANWDKYLRERADLENYRKR 76 (194)
T ss_pred chhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555555443 334455556666777777777777653
No 140
>PF15463 ECM11: Extracellular mutant protein 11
Probab=42.78 E-value=1.1e+02 Score=20.90 Aligned_cols=47 Identities=17% Similarity=0.337 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEE-ASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~-i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
......|++.-..+...++.+|. |..+...|+.|..-|...|+.+..
T Consensus 86 ~~l~~kl~~~R~~~r~~~~~fe~eI~~R~eav~~~~~~l~~kL~~mk~ 133 (139)
T PF15463_consen 86 SELMQKLKEARRKLRKKFAVFEDEINRRAEAVRAQGEQLDRKLEKMKE 133 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444555555566666666654 445778888888888888887765
No 141
>KOG1853|consensus
Probab=42.48 E-value=1.8e+02 Score=23.29 Aligned_cols=60 Identities=20% Similarity=0.330 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLD----SHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~----~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
|...+|+.|.+ .|-+....-++|+.--+.|+ ..|=.||....++..-=-|-.+||+||+.
T Consensus 95 ~Leddlsqt~a-ikeql~kyiReLEQaNDdLErakRati~sleDfeqrLnqAIErnAfLESELdE 158 (333)
T KOG1853|consen 95 QLEDDLSQTHA-IKEQLRKYIRELEQANDDLERAKRATIYSLEDFEQRLNQAIERNAFLESELDE 158 (333)
T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHhccHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHhhH
Confidence 45556666654 34444555555555444443 24556666666666666677788888864
No 142
>PRK04406 hypothetical protein; Provisional
Probab=42.38 E-value=78 Score=19.81 Aligned_cols=13 Identities=31% Similarity=0.611 Sum_probs=5.1
Q ss_pred HHHHHHHHHHHHH
Q psy7799 36 MHEDIKQLDSHLK 48 (83)
Q Consensus 36 L~~ei~~L~~~i~ 48 (83)
|++.|.+|+.+|.
T Consensus 9 le~Ri~~LE~~lA 21 (75)
T PRK04406 9 LEERINDLECQLA 21 (75)
T ss_pred HHHHHHHHHHHHH
Confidence 3334444433333
No 143
>CHL00198 accA acetyl-CoA carboxylase carboxyltransferase alpha subunit; Provisional
Probab=42.34 E-value=64 Score=25.61 Aligned_cols=48 Identities=17% Similarity=0.295 Sum_probs=29.8
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhh
Q psy7799 9 NCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV-EEASNNA 57 (83)
Q Consensus 9 ncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l-e~i~~~~ 57 (83)
.|+..+ .||..--..|....+...-++.+||..|++++..+ .++-+.+
T Consensus 10 ~fe~~i-~el~~~i~~l~~~~~~~~~~~~~~i~~l~~~~~~~~~~~~~~l 58 (322)
T CHL00198 10 DFMKPL-AELESQVEELSKLAPKNDKVINNKLKSFQRKLRILKKEIFYSL 58 (322)
T ss_pred chhhhH-HHHHHHHHHHHhhhcccccCHHHHHHHHHHHHHHHHHHHHhcC
Confidence 345555 56666666666655555567777777777777775 3333333
No 144
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=41.70 E-value=74 Score=18.72 Aligned_cols=24 Identities=13% Similarity=0.374 Sum_probs=14.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhH
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAK 58 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K 58 (83)
.|+.++..|..+++.|...=++.|
T Consensus 3 aLrqQv~aL~~qv~~Lq~~fs~yK 26 (46)
T PF09006_consen 3 ALRQQVEALQGQVQRLQAAFSQYK 26 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666666666666666555554
No 145
>smart00338 BRLZ basic region leucin zipper.
Probab=41.62 E-value=73 Score=18.63 Aligned_cols=36 Identities=22% Similarity=0.473 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 39 DIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 39 ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.+..|+.+++.|+. .-..|+.++..|..|+..+...
T Consensus 27 ~~~~Le~~~~~L~~---en~~L~~~~~~l~~e~~~lk~~ 62 (65)
T smart00338 27 EIEELERKVEQLEA---ENERLKKEIERLRRELEKLKSE 62 (65)
T ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444333 2233555666666666655544
No 146
>PRK09343 prefoldin subunit beta; Provisional
Probab=41.56 E-value=1.1e+02 Score=20.51 Aligned_cols=23 Identities=17% Similarity=0.250 Sum_probs=13.9
Q ss_pred HhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 55 NNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 55 ~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.+.+.|-++..+|+..+..-..+
T Consensus 85 ~~ik~lekq~~~l~~~l~e~q~~ 107 (121)
T PRK09343 85 LRSRTLEKQEKKLREKLKELQAK 107 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 56666666666666666554443
No 147
>cd07605 I-BAR_IMD Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), a dimerization module that binds and bends membranes. Inverse (I)-BAR (or IMD) is a member of the Bin/Amphiphysin/Rvs (BAR) domain family. It is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions in the opposite direction compared to classical BAR and F-BAR domains, which produce membrane invaginations. IMD domains are found in Insulin Receptor tyrosine kinase Substrate p53 (IRSp53), Missing in Metastasis (MIM), and Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-like (BAIAP2L) proteins. These are multi-domain proteins that act as scaffolding proteins and transducers of a variety of signaling pathways that link membrane dynamics and the underlying actin cytoskeleton. Most members contain an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus, exccept for MIM which does not carry an SH3 domain. Some me
Probab=41.44 E-value=1.5e+02 Score=22.15 Aligned_cols=33 Identities=27% Similarity=0.344 Sum_probs=22.5
Q ss_pred HHHHHHHHHhhHHhhhhHHH-------------------HHHHHHHHHHHhc
Q psy7799 47 LKQVEEASNNAKVLRNKAHY-------------------LIKELDMFQEAYL 79 (83)
Q Consensus 47 i~~le~i~~~~K~lrnKa~~-------------------L~~eL~~F~~~yL 79 (83)
-+.|++..+..+++|.|+.- .+.+|+.|..+-+
T Consensus 125 ~~~l~K~~sel~Kl~KKs~~~~~~k~~~~l~~~~e~v~~k~~ele~~~~~~l 176 (223)
T cd07605 125 REDLDKARSELKKLQKKSQKSGTGKYQEKLDQALEELNDKQKELEAFVSQGL 176 (223)
T ss_pred HHHHHHHHHHHHHHHHHHcccCCCcccHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566888888888888542 4577777766543
No 148
>PRK06664 fliD flagellar hook-associated protein FliD; Validated
Probab=41.33 E-value=2.4e+02 Score=24.43 Aligned_cols=58 Identities=14% Similarity=0.212 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHH-HHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 20 STFARLCQLVDRAT----------DDMHEDIKQLDSHLKQVEE-ASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 20 ~tf~rL~~~Vd~~~----------~eL~~ei~~L~~~i~~le~-i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+...||...++... ..|..+|+++.++|+.+|. +...-..||.|+.-|+.-|-....+
T Consensus 579 Gla~~l~~~l~~~t~~~G~i~~r~~~l~~~i~~l~~~i~~~e~rl~~~e~rl~~QFtaME~~msqmnsq 647 (661)
T PRK06664 579 GVAKMLLEYLSPYTQAGGIIYNKVKGLDERIADNNKKIEEYEKKLESKERKLKGKYLTMDQTVKKMKEQ 647 (661)
T ss_pred cHHHHHHHHHHHHHcCCCceehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 149
>KOG3251|consensus
Probab=41.32 E-value=1.1e+02 Score=23.15 Aligned_cols=43 Identities=16% Similarity=0.162 Sum_probs=33.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHH
Q psy7799 27 QLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIK 69 (83)
Q Consensus 27 ~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~ 69 (83)
+.|..+-.++...|+++...+++++..-++-.+=+--|.+|.-
T Consensus 29 ~e~~~v~~~i~~sI~~~~s~~~rl~~~~~~epp~~rq~~rlr~ 71 (213)
T KOG3251|consen 29 QEVSAVENSIQRSIDQYASRCQRLDVLVSKEPPKSRQAARLRV 71 (213)
T ss_pred cchHHHHHHHHHhHHHHHHHHHHHHhHhhcCCCCcHHHHHHHH
Confidence 4777889999999999999999999988776554444555443
No 150
>PF09278 MerR-DNA-bind: MerR, DNA binding; InterPro: IPR015358 This entry represents a family of DNA-binding domains that are predominantly found in the prokaryotic transcriptional regulator MerR. They adopt a structure consisting of a core of three alpha helices, with an architecture that is similar to that of the 'winged helix' fold []. ; PDB: 3QAO_A 1R8D_B 1JBG_A 2VZ4_A 2ZHH_A 2ZHG_A 1Q09_A 1Q08_B 1Q0A_B 1Q07_A ....
Probab=41.31 E-value=69 Score=18.26 Aligned_cols=29 Identities=3% Similarity=0.221 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
....=+...+++++.+|+.|..+...+..
T Consensus 36 ~~~~~l~~~~~~i~~~i~~L~~~~~~L~~ 64 (65)
T PF09278_consen 36 DRRALLEEKLEEIEEQIAELQALRAQLEH 64 (65)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 33455677777777778777777666554
No 151
>PRK04098 sec-independent translocase; Provisional
Probab=41.02 E-value=1.4e+02 Score=21.60 Aligned_cols=29 Identities=14% Similarity=0.334 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIK 41 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~ 41 (83)
.+-+.+...+..+...++.+++++++|++
T Consensus 27 ~~~r~lGk~ir~~K~~~~~~k~~l~~Ei~ 55 (158)
T PRK04098 27 QAMVDIAKFFKAVKKTINDAKSTLDKEIN 55 (158)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 35566777788888888888888888764
No 152
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=40.84 E-value=88 Score=19.63 Aligned_cols=15 Identities=20% Similarity=0.676 Sum_probs=6.2
Q ss_pred HHHHHHHHHHHHHHH
Q psy7799 35 DMHEDIKQLDSHLKQ 49 (83)
Q Consensus 35 eL~~ei~~L~~~i~~ 49 (83)
+-+.+|+.|+.+|..
T Consensus 56 eq~~~i~~Le~~i~~ 70 (83)
T PF07544_consen 56 EQEEEIEELEEQIRK 70 (83)
T ss_pred HHHHHHHHHHHHHHH
Confidence 334444444444433
No 153
>cd01107 HTH_BmrR Helix-Turn-Helix DNA binding domain of the BmrR transcription regulator. Helix-turn-helix (HTH) multidrug-efflux transporter transcription regulator, BmrR and YdfL of Bacillus subtilis, and related proteins; N-terminal domain. Bmr is a membrane protein which causes the efflux of a variety of toxic substances and antibiotics. BmrR is comprised of two distinct domains that harbor a regulatory (effector-binding) site and an active (DNA-binding) site. The conserved N-terminal domain contains a winged HTH motif that mediates DNA binding, while the C-terminal domain binds coactivating, toxic compounds. BmrR shares the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements.
Probab=40.78 E-value=98 Score=19.86 Aligned_cols=30 Identities=10% Similarity=0.236 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
+....-++..+.+|..+|..|+.....++.
T Consensus 74 ~~~~~~l~~~~~~l~~~i~~l~~~~~~l~~ 103 (108)
T cd01107 74 DELRKLLREKLAELEAEIEELQRILRLLED 103 (108)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455555666666666666666665554443
No 154
>KOG4117|consensus
Probab=40.65 E-value=96 Score=19.76 Aligned_cols=37 Identities=24% Similarity=0.402 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHED----IKQLDSHLKQVEE 52 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~e----i~~L~~~i~~le~ 52 (83)
|||.++-..|-+++-.--..|-++ |+.+..+|+-||+
T Consensus 15 q~LTs~vQ~lLQq~QDkFQtMSDQII~RiDDM~~riDDLEK 55 (73)
T KOG4117|consen 15 QDLTSVVQGLLQQTQDKFQTMSDQIIGRIDDMSSRIDDLEK 55 (73)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHH
Confidence 455555555555554443444332 3444444444443
No 155
>PRK12765 flagellar capping protein; Provisional
Probab=40.55 E-value=1.3e+02 Score=25.58 Aligned_cols=58 Identities=14% Similarity=0.236 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHH-HHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 20 STFARLCQLVDR----------ATDDMHEDIKQLDSHLKQVEE-ASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 20 ~tf~rL~~~Vd~----------~~~eL~~ei~~L~~~i~~le~-i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+.|.+|...++. ...-++++|+.|+++++.++. +......+|+++.-|+.-+.....+
T Consensus 511 G~~~~l~~~l~~~~~~~G~l~~~~~~l~~~~~~l~~~~~~~~~rl~~~~~r~~~qf~alD~~i~~l~~t 579 (595)
T PRK12765 511 GIFSKLKDTLQEMTGKDGSLTKYDESLTNEIKSLTTSKESTQELIDTKYETMANKWLQYDSIIAKLEQQ 579 (595)
T ss_pred cHHHHHHHHHHHHhCCCCcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 455555555443 446688888888888888764 5566778888888777766665544
No 156
>COG5481 Uncharacterized conserved small protein containing a coiled-coil domain [Function unknown]
Probab=40.53 E-value=93 Score=19.54 Aligned_cols=53 Identities=25% Similarity=0.332 Sum_probs=31.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHhhHHhhhhHHHHHHH
Q psy7799 11 SHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDS------HLKQVEEASNNAKVLRNKAHYLIKE 70 (83)
Q Consensus 11 shQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~------~i~~le~i~~~~K~lrnKa~~L~~e 70 (83)
+.|=|-|+.-+++||.+. +.|+..-|+.+.. +|+++- ++-=.||.|.+-||..
T Consensus 2 ~Dqdqaeirl~~arLrqe----H~D~DaaInAmi~~~cD~L~iqRmK---kKKLAlKDki~~lED~ 60 (67)
T COG5481 2 SDQDQAEIRLTLARLRQE----HADFDAAINAMIATGCDALRIQRMK---KKKLALKDKITKLEDQ 60 (67)
T ss_pred CcccHHHHHHHHHHHHHH----HhhHHHHHHHHHHhCCcHHHHHHHH---HHHHhHHHHHHHHHHh
Confidence 457788999999999875 5556666665543 233322 2222456666666654
No 157
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=40.27 E-value=96 Score=19.64 Aligned_cols=28 Identities=25% Similarity=0.336 Sum_probs=18.9
Q ss_pred HHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 50 VEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 50 le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.+.+..+++.++.+..-|+.++..-.+.
T Consensus 69 ~~~l~~e~~~lk~~i~~le~~~~~~e~~ 96 (108)
T PF02403_consen 69 AEELKAEVKELKEEIKELEEQLKELEEE 96 (108)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456677777777777777777665544
No 158
>PHA02666 hypothetical protein; Provisional
Probab=40.15 E-value=58 Score=25.35 Aligned_cols=36 Identities=25% Similarity=0.422 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhcc
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYLV 80 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~ 80 (83)
+|.++|-.|+..|+. .+-.|- .-||+||+ |...|++
T Consensus 205 NLQ~DIcTLC~DIEt------QLSALE---KSLESEln-FYrrYIq 240 (287)
T PHA02666 205 NLQSDICTLCHDIET------QLSALE---KSLESELN-FYRRYIQ 240 (287)
T ss_pred chhhHHHHhhhhHHH------HHHHHH---HHHHHHHH-HHHHHHH
Confidence 466666666666643 111121 13677774 6677764
No 159
>PF11945 WASH_WAHD: WAHD domain of WASH complex; InterPro: IPR021854 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. This entry represents the WASH subunit of the WASH complex. WASH genes duplicated to multiple chromosomal ends during primate evolution, with highest copy number reached in humans, whose WASH repertoires probably vary extensively among individuals []. It is therefore difficult to determine which gene is functional or not. The telomeric region of chromosome 9p is paralogous to the pericentromeric regions of chromosome 9 as well as to 2q. Paralogous regions contain 7 transcriptional units. Duplicated WASH genes are also present in the Xq/Yq pseudoautosomal region, as well as on chromosome 1 and 15. The chromosome 16 copy seems to be a pseudogene.
Probab=40.03 E-value=1.7e+02 Score=22.82 Aligned_cols=17 Identities=24% Similarity=0.526 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy7799 19 SSTFARLCQLVDRATDD 35 (83)
Q Consensus 19 ~~tf~rL~~~Vd~~~~e 35 (83)
..+|.|+.+.|...+..
T Consensus 35 ~diF~rI~~Rv~~~~~~ 51 (297)
T PF11945_consen 35 NDIFSRISARVERNRER 51 (297)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34555555555554443
No 160
>PF00015 MCPsignal: Methyl-accepting chemotaxis protein (MCP) signalling domain; InterPro: IPR004089 Methyl-accepting chemotaxis proteins (MCPs) are a family of bacterial receptors that mediate chemotaxis to diverse signals, responding to changes in the concentration of attractants and repellents in the environment by altering swimming behaviour []. Environmental diversity gives rise to diversity in bacterial signalling receptors, and consequently there are many genes encoding MCPs []. For example, there are four well-characterised MCPs found in Escherichia coli: Tar (taxis towards aspartate and maltose, away from nickel and cobalt), Tsr (taxis towards serine, away from leucine, indole and weak acids), Trg (taxis towards galactose and ribose) and Tap (taxis towards dipeptides). MCPs share similar topology and signalling mechanisms. MCPs either bind ligands directly or interact with ligand-binding proteins, transducing the signal to downstream signalling proteins in the cytoplasm. MCPs undergo two covalent modifications: deamidation and reversible methylation at a number of glutamate residues. Attractants increase the level of methylation, while repellents decrease it. The methyl groups are added by the methyl-transferase cheR and are removed by the methylesterase cheB. Most MCPs are homodimers that contain the following organisation: an N-terminal signal sequence that acts as a transmembrane domain in the mature protein; a poorly-conserved periplasmic receptor (ligand-binding) domain; a second transmembrane domain; and a highly-conserved C-terminal cytoplasmic domain that interacts with downstream signalling components. The C-terminal domain contains the glycosylated glutamate residues. This entry represents the signalling domain found in several methyl-accepting chemotaxis proteins. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs.; GO: 0004871 signal transducer activity, 0007165 signal transduction, 0016020 membrane; PDB: 2CH7_A 3ZX6_B 1QU7_A 3G6B_B 3UR1_C 3G67_B.
Probab=40.00 E-value=1.2e+02 Score=20.55 Aligned_cols=25 Identities=32% Similarity=0.348 Sum_probs=12.2
Q ss_pred HHHHHHhhHHhhhhHHHHHHHHHHH
Q psy7799 50 VEEASNNAKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 50 le~i~~~~K~lrnKa~~L~~eL~~F 74 (83)
.+.+...+..|++-+..|...++.|
T Consensus 186 ~~~~~~~~~~l~~~a~~L~~~v~~F 210 (213)
T PF00015_consen 186 SEEIAEAAEELSESAEELQELVDRF 210 (213)
T ss_dssp HHHHHHHHHHHHHHHCCCCHHCHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3444444445555555555555555
No 161
>PF05103 DivIVA: DivIVA protein; InterPro: IPR007793 The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate minicells []. Inactivation of divIVA produces a minicell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype []. These proteins appear to contain coiled-coils.; PDB: 2WUK_C 2WUJ_A.
Probab=39.93 E-value=75 Score=20.46 Aligned_cols=24 Identities=21% Similarity=0.400 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 27 QLVDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 27 ~~Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
.+||.+...+..++..|..++..|
T Consensus 21 ~eVD~fl~~l~~~~~~l~~e~~~L 44 (131)
T PF05103_consen 21 DEVDDFLDELAEELERLQRENAEL 44 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467777777777777776666544
No 162
>PRK08032 fliD flagellar capping protein; Reviewed
Probab=39.89 E-value=2e+02 Score=23.27 Aligned_cols=43 Identities=14% Similarity=0.232 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 34 DDMHEDIKQLDSHLKQVEE-ASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~-i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
..+..+|++++++|+.+|. +...-..|+.++.-||.-+-....
T Consensus 409 ~~l~~~i~~l~~~i~~~~~rl~~~e~rl~~qF~ame~~~s~mns 452 (462)
T PRK08032 409 DGVNKTLKKLTKQYNAVSDSIDATIARYKAQFTQLDKLMTSLNS 452 (462)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4466666666666666543 344455667777766666555543
No 163
>PF10018 Med4: Vitamin-D-receptor interacting Mediator subunit 4; InterPro: IPR019258 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Members of this family represent the Med4 subunit of the Mediator (Med) complex [, ]. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=39.77 E-value=1.4e+02 Score=21.24 Aligned_cols=40 Identities=15% Similarity=0.356 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRA--TDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~--~~eL~~ei~~L~~~i~~le~ 52 (83)
|+-..|-.+=..|+..++.+ +.++..+|.+|..++..+|.
T Consensus 2 ~~~~~L~~~d~~L~~~L~~l~~hq~~~~~I~~L~~e~~~ld~ 43 (188)
T PF10018_consen 2 ELAEDLIEADDELSSALEELQEHQENQARIQQLRAEIEELDE 43 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45556666666666666654 57788889999999888885
No 164
>KOG4643|consensus
Probab=39.76 E-value=2.2e+02 Score=26.68 Aligned_cols=62 Identities=16% Similarity=0.236 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhcccCC
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYLVNMS 83 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~~~~ 83 (83)
.+|||..-| ....+|+.||++|+.+|..|-.-+-..=...+.|..-.+||+.-.+.--++++
T Consensus 189 LrqElEEK~--------enll~lr~eLddleae~~klrqe~~e~l~ea~ra~~yrdeldalre~aer~d~ 250 (1195)
T KOG4643|consen 189 LRQELEEKF--------ENLLRLRNELDDLEAEISKLRQEIEEFLDEAHRADRYRDELDALREQAERPDT 250 (1195)
T ss_pred HHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhhcCCC
No 165
>KOG4797|consensus
Probab=39.75 E-value=1.3e+02 Score=20.97 Aligned_cols=50 Identities=20% Similarity=0.298 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
.-|-|-|+-+=.+|-=.|.+--.-|++.|++|..+...||.--+=+|.+-
T Consensus 48 NKIeQAMDLVKtHLmfAVREEVe~Lk~qI~eL~er~~~Le~EN~lLk~~~ 97 (123)
T KOG4797|consen 48 NKIEQAMDLVKTHLMFAVREEVEVLKEQIRELEERNSALERENSLLKTLA 97 (123)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 45678888888999888999889999999999999999987766666554
No 166
>PRK07191 flgK flagellar hook-associated protein FlgK; Validated
Probab=39.74 E-value=1.6e+02 Score=23.68 Aligned_cols=15 Identities=13% Similarity=0.215 Sum_probs=6.0
Q ss_pred HHHHHHHHHhhHHhh
Q psy7799 47 LKQVEEASNNAKVLR 61 (83)
Q Consensus 47 i~~le~i~~~~K~lr 61 (83)
+..+..+..+...|-
T Consensus 165 V~~iN~ll~~Ia~LN 179 (456)
T PRK07191 165 VKQINSLTRSIADYN 179 (456)
T ss_pred HHHHHHHHHHHHHHH
Confidence 334444444443333
No 167
>KOG4603|consensus
Probab=38.77 E-value=1.7e+02 Score=22.00 Aligned_cols=58 Identities=21% Similarity=0.295 Sum_probs=34.0
Q ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 18 LSSTFARLCQLVD---RATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 18 L~~tf~rL~~~Vd---~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
|+.--.+|...|. .+-..++.||+.|++-+ .+|.+|.....||.-..--...|.+|..
T Consensus 84 ld~~i~~l~ek~q~l~~t~s~veaEik~L~s~L-t~eemQe~i~~L~kev~~~~erl~~~k~ 144 (201)
T KOG4603|consen 84 LDGKIVALTEKVQSLQQTCSYVEAEIKELSSAL-TTEEMQEEIQELKKEVAGYRERLKNIKA 144 (201)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-ChHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444443 23345666777666544 3567777777777666666666776654
No 168
>cd07644 I-BAR_IMD_BAIAP2L2 Inverse (I)-BAR, also known as the IRSp53/MIM homology Domain (IMD), of Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 2. The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs (I-BAR) domain, is a dimerization and lipid-binding module that bends membranes and induces membrane protrusions. This group is composed of uncharacterized proteins known as BAIAP2L2 (Brain-specific Angiogenesis Inhibitor 1-Associated Protein 2-Like 2). They contain an N-terminal IMD, an SH3 domain, and a WASP homology 2 (WH2) actin-binding motif at the C-terminus. The related proteins, BAIAP2L1 and IRSp53, function as regulators of membrane dynamics and the actin cytoskeleton. The IMD domain binds and bundles actin filaments, binds membranes and produces membrane protrusions, and interacts with the small GTPase Rac.
Probab=38.61 E-value=1.8e+02 Score=22.18 Aligned_cols=31 Identities=10% Similarity=0.188 Sum_probs=23.6
Q ss_pred HHHHHHHHHhhHHhhhh-----------HHHHHHHHHHHHHH
Q psy7799 47 LKQVEEASNNAKVLRNK-----------AHYLIKELDMFQEA 77 (83)
Q Consensus 47 i~~le~i~~~~K~lrnK-----------a~~L~~eL~~F~~~ 77 (83)
-+.||+-++.+|++|.| ...|..+++.|-++
T Consensus 125 ~dsleK~~selkk~rrk~qkn~~e~kE~~~~lq~~~~~f~~~ 166 (215)
T cd07644 125 AANLEKCMSELWRMERQRDRNVREMKENVNRLRQSMQAFLKE 166 (215)
T ss_pred hhhHHHHHHHHHHHHHhhcCCchhhHHHHHHHHHHHHHHHHH
Confidence 67899999999998855 35667777777653
No 169
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=38.58 E-value=97 Score=19.18 Aligned_cols=36 Identities=33% Similarity=0.502 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 37 HEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 37 ~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
..-|+.|..++..+|.-. ..++++..-++.+++...
T Consensus 32 ~~~IKKLr~~~~e~e~~~---~~l~~~~~~~e~~~~~l~ 67 (74)
T PF12329_consen 32 NNTIKKLRAKIKELEKQI---KELKKKLEELEKELESLE 67 (74)
T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
Confidence 344555555555555333 233344455555544443
No 170
>cd01878 HflX HflX subfamily. A distinct conserved domain with a glycine-rich segment N-terminal of the GTPase domain characterizes the HflX subfamily. The E. coli HflX has been implicated in the control of the lambda cII repressor proteolysis, but the actual biological functions of these GTPases remain unclear. HflX is widespread, but not universally represented in all three superkingdoms.
Probab=38.50 E-value=1e+02 Score=20.83 Aligned_cols=34 Identities=18% Similarity=0.260 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
|.+.-..-++.+|..+.++|+.+.......+.+|
T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 36 (204)
T cd01878 3 QLETDRRLIRERIAKLRRELEKVKKQRELQRRRR 36 (204)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhh
Confidence 4455566778888888888888888877777666
No 171
>PF04375 HemX: HemX; InterPro: IPR007470 The majority of proteins in this family are annotated as uroporphyrin-III C-methyltransferase (2.1.1.107 from EC) []; however, there is no direct evidence to support this annotation for these proteins, which come from mainly pathogenic Gram-negative organisms. There is some evidence to suggest that the proteins are membrane anchored as they have a predicted N-terminal signal peptide and transmembrane domain and may be involved in haem transport [].
Probab=38.50 E-value=1.9e+02 Score=22.67 Aligned_cols=10 Identities=20% Similarity=0.554 Sum_probs=4.8
Q ss_pred hhHHHHHHHH
Q psy7799 62 NKAHYLIKEL 71 (83)
Q Consensus 62 nKa~~L~~eL 71 (83)
++-.|+-.|-
T Consensus 123 ~~~dW~LaEa 132 (372)
T PF04375_consen 123 SRDDWLLAEA 132 (372)
T ss_pred ChHhHHHHHH
Confidence 4455554443
No 172
>PF10372 YojJ: Bacterial membrane-spanning protein N-terminus; InterPro: IPR019457 This entry is found at the N terminus of a family of putative membrane-spanning bacterial proteins. These proteins often contain IPR003390 from INTERPRO towards the C terminus. ; PDB: 2FB5_A.
Probab=38.48 E-value=55 Score=20.62 Aligned_cols=15 Identities=7% Similarity=0.481 Sum_probs=6.3
Q ss_pred HHHHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHL 47 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i 47 (83)
+.+++..|.++..+|
T Consensus 12 K~~lk~~L~~I~~~~ 26 (70)
T PF10372_consen 12 KEQLKQYLEQIEEEI 26 (70)
T ss_dssp HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHH
Confidence 334444444444433
No 173
>PF12277 DUF3618: Protein of unknown function (DUF3618); InterPro: IPR022062 This domain family is found in bacteria, and is approximately 50 amino acids in length.
Probab=38.43 E-value=78 Score=18.04 Aligned_cols=23 Identities=26% Similarity=0.442 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATD 34 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~ 34 (83)
.+|.+|+..+-++|..+||....
T Consensus 6 ~~ie~dIe~tR~~La~tvd~L~~ 28 (49)
T PF12277_consen 6 DEIERDIERTRAELAETVDELAA 28 (49)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555666665555555554433
No 174
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=38.27 E-value=2.7e+02 Score=24.27 Aligned_cols=55 Identities=9% Similarity=0.102 Sum_probs=31.6
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhh
Q psy7799 9 NCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNK 63 (83)
Q Consensus 9 ncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnK 63 (83)
.+..+--+.+..++..+...|..-...++.++++|..--+..+.++.+++.|..|
T Consensus 547 vlreeYi~~~~~ar~ei~~rv~~Lk~~~e~Ql~~L~~l~e~~~~l~~~ae~LaeR 601 (717)
T PF10168_consen 547 VLREEYIEKQDLAREEIQRRVKLLKQQKEQQLKELQELQEERKSLRESAEKLAER 601 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444422234556666777777777777766666655555555555555555443
No 175
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.25 E-value=1.1e+02 Score=23.63 Aligned_cols=21 Identities=10% Similarity=0.414 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~ 52 (83)
...+|+++|++|..+|.+|.+
T Consensus 57 ~~~~l~~Ql~~l~g~i~~L~~ 77 (262)
T COG1729 57 RLTQLEQQLRQLQGKIEELRG 77 (262)
T ss_pred ccHHHHHHHHHHHhhHHHHHh
Confidence 456677777777777777775
No 176
>KOG0859|consensus
Probab=38.22 E-value=52 Score=25.02 Aligned_cols=47 Identities=21% Similarity=0.263 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHH
Q psy7799 19 SSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAH 65 (83)
Q Consensus 19 ~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~ 65 (83)
.++-+|..+||++++.=|-+.|+..-++=+++|-+-.+---|++++.
T Consensus 124 id~lskvkaqv~evk~vM~eNIekvldRGekiELLVdKTenl~~~s~ 170 (217)
T KOG0859|consen 124 ISKLAKVKAQVTEVKGVMMENIEKVLDRGEKIELLVDKTENLRSKSF 170 (217)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhccCeEEeeechhhhhhhhhH
Confidence 47889999999999999999999988776666544433333333333
No 177
>TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair.
Probab=38.21 E-value=2.5e+02 Score=23.77 Aligned_cols=27 Identities=11% Similarity=0.158 Sum_probs=15.6
Q ss_pred HHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 47 LKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 47 i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
.+.++.+.+..+..++++.-|-++|+.
T Consensus 218 ~~e~d~lk~e~~~~~~~i~~~~~~l~~ 244 (555)
T TIGR03545 218 KEEFDKLKKEGKADKQKIKSAKNDLQN 244 (555)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445556666666666666655555543
No 178
>KOG0976|consensus
Probab=38.11 E-value=2.5e+02 Score=26.08 Aligned_cols=59 Identities=15% Similarity=0.151 Sum_probs=37.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV----EEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l----e~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
|-|..|.+++..+..++..+++.--.++.+.++.| +.|+..-+.||||-. +|..|..+|
T Consensus 123 q~lQ~ti~~~q~d~ke~etelE~~~srlh~le~eLsAk~~eIf~~~~~L~nk~~----~lt~~~~q~ 185 (1265)
T KOG0976|consen 123 QKLQDTIQGAQDDKKENEIEIENLNSRLHKLEDELSAKAHDIFMIGEDLHDKNE----ELNEFNMEF 185 (1265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHhhhhHHHHHHHHHHhhhhh----HHhHHHHHH
Confidence 34666777777776666666665555555555443 578888899999865 344455554
No 179
>PF06013 WXG100: Proteins of 100 residues with WXG; InterPro: IPR010310 ESAT-6 is a small protein appears to be of fundamental importance in virulence and protective immunity in Mycobacterium tuberculosis. Homologues have been detected in other Gram-positive bacterial species. It may represent a novel secretion system potentially driven by the PF01580 from PFAM domains in the YukA-like proteins []. Members of this protein family include secretion targets for type main variants of type VII secretion systems (T7SS), one found in the Actinobacteria, one found in the Firmicutes. This model was derived through iteration from PF06013 from PFAM. The best characterised member of this family is ESAT-6 from Mycobacterium tuberculosis. Members of this family usually are ~100 amino acids in length but occasionally have long C-terminal extension. ; PDB: 3FAV_A 1WA8_A 3Q4H_B 2KG7_A 2VRZ_B 2VS0_B 3OGI_A 3H6P_B 3GVM_B 3GWK_C ....
Probab=38.03 E-value=78 Score=17.97 Aligned_cols=32 Identities=19% Similarity=0.184 Sum_probs=23.2
Q ss_pred HHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 48 KQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 48 ~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
+..+........+.+....+...|......|.
T Consensus 51 ~~~~~~~~~~~~~~~~L~~~~~~l~~~~~~~~ 82 (86)
T PF06013_consen 51 DKFEEWNQAFRQLNEALEELSQALRQAAQNYE 82 (86)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456777777777777888888887777774
No 180
>COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown]
Probab=37.78 E-value=86 Score=19.65 Aligned_cols=33 Identities=18% Similarity=0.342 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKA 64 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa 64 (83)
.-.||++.|.-|..+|++|+.--.+-+.=|.-|
T Consensus 26 sV~El~eRIalLq~EIeRlkAe~~kK~~srsAA 58 (65)
T COG5509 26 SVAELEERIALLQAEIERLKAELAKKKASRSAA 58 (65)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHH
Confidence 356899999999999999876444433334333
No 181
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=37.71 E-value=2.4e+02 Score=23.59 Aligned_cols=35 Identities=17% Similarity=0.313 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQ 49 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~ 49 (83)
+...+|...+.|....+. ..++..++++++.++..
T Consensus 186 ~L~~dl~~~~~~~~~~~~--~~~~~~~~~~le~el~~ 220 (650)
T TIGR03185 186 RLAGDLTNVLRRRKKSEL--PSSILSEIEALEAELKE 220 (650)
T ss_pred HHHHHHHHHHHHHHhccc--chHHHHHHHHHHHHHHH
Confidence 344555555555443332 22334444444444443
No 182
>COG2825 HlpA Outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=37.69 E-value=1.5e+02 Score=21.11 Aligned_cols=47 Identities=19% Similarity=0.323 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
-+|..+|..-|++--.....-..+|+...+.|.+.. .++.+..+.|.
T Consensus 43 k~~~~~le~~f~~~~~~lq~~~~el~~~~~kL~~~~-~~~~~~d~~k~ 89 (170)
T COG2825 43 KKVSADLESEFKKRQKELQKMQKELKAKEAKLQDDG-KMEALSDRAKA 89 (170)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-HhhhhhhHHHH
Confidence 578899999999988888888888888888888765 56665555443
No 183
>PF11744 ALMT: Aluminium activated malate transporter; InterPro: IPR020966 This entry represents an malate transporter which has been is identified as being critical for aluminium tolerance in Arabidopsis thaliana [].; GO: 0010044 response to aluminum ion
Probab=37.63 E-value=94 Score=25.25 Aligned_cols=55 Identities=15% Similarity=0.258 Sum_probs=39.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhhHHhhhhH
Q psy7799 10 CSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLK---QVEEASNNAKVLRNKA 64 (83)
Q Consensus 10 cshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~---~le~i~~~~K~lrnKa 64 (83)
+.+.+++.+.....|+...+..+.+||-..|+.+++.-. .++++..-++.|+++-
T Consensus 285 ~p~~~r~~~~~~~~~~~~e~~kvLrel~~~ik~m~~~~~~~~~~~~~~~A~~~Lq~~l 342 (406)
T PF11744_consen 285 APPELRQKFQEECTRVSSESAKVLRELSNSIKTMTKSSSIDDHVANLKEAAEDLQSKL 342 (406)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCchhHHHHHHHHHHHHHHHH
Confidence 446677777888888888889999999999999876544 3455555555555443
No 184
>PHA02592 52 DNA topisomerase II medium subunit; Provisional
Probab=37.61 E-value=99 Score=25.42 Aligned_cols=43 Identities=9% Similarity=0.287 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 25 LCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 25 L~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
|......-...|++|++++.++|+.|+. -. -......+|+.|.
T Consensus 396 L~~LT~~e~~kL~~e~~~l~~ei~~l~~-------~t-~~~~w~~DL~~~~ 438 (439)
T PHA02592 396 IYSMTSDEREKLQKEAEELEKEHEYWKK-------TT-AKKEYIKDLEELK 438 (439)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHc-------CC-HHHHHHHHHHHHh
Confidence 3333344445566666666666665554 22 2345567788775
No 185
>PF05565 Sipho_Gp157: Siphovirus Gp157; InterPro: IPR008840 This family contains both viral and bacterial proteins which are related to the Gp157 protein of the Streptococcus thermophilus SFi bacteriophage. It is thought that bacteria possessing the gene coding for this protein have an increased resistance to the bacteriophage [].
Probab=37.60 E-value=1.4e+02 Score=20.78 Aligned_cols=42 Identities=21% Similarity=0.333 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+-.+++.+++.+..++.+|.. +.|.+.|++.+|-.-|..+-.
T Consensus 48 ~Ik~~ea~~e~~k~E~krL~~---rkk~~e~~~~~Lk~yL~~~m~ 89 (162)
T PF05565_consen 48 VIKNLEADIEAIKAEIKRLQE---RKKSIENRIDRLKEYLLDAME 89 (162)
T ss_pred HHHHhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 345666777777777777655 456777888888777665543
No 186
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=37.58 E-value=2.8e+02 Score=24.24 Aligned_cols=45 Identities=9% Similarity=0.216 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhh
Q psy7799 19 SSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNK 63 (83)
Q Consensus 19 ~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnK 63 (83)
..........++..-.+|+++..+++.+...++.....++.++.+
T Consensus 503 ~~~~~~~~~~~~~li~~L~~~~~~~e~~~~~~~~~~~e~~~~~~~ 547 (771)
T TIGR01069 503 KTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKE 547 (771)
T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444455555666666666666666665555555555555544
No 187
>PF13747 DUF4164: Domain of unknown function (DUF4164)
Probab=37.54 E-value=1.1e+02 Score=19.60 Aligned_cols=56 Identities=20% Similarity=0.294 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
..|..+..||.+-+|.....+...+....+. +.+..+...|...-.-|+.+|+.+.
T Consensus 4 ~~le~al~rL~~aid~LE~~v~~r~~~~~~~----~~~e~ei~~l~~dr~rLa~eLD~~~ 59 (89)
T PF13747_consen 4 YSLEAALTRLEAAIDRLEKAVDRRLERDRKR----DELEEEIQRLDADRSRLAQELDQAE 59 (89)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHhhhhh----hhHHHHHHHHHhhHHHHHHHHHhHH
Confidence 3577788888888887777777666654433 5555566666666666777776654
No 188
>PRK10227 DNA-binding transcriptional regulator CueR; Provisional
Probab=37.46 E-value=99 Score=20.96 Aligned_cols=32 Identities=9% Similarity=0.292 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
+....-+.+.+.+++.+|+.|+.....+..+.
T Consensus 78 ~~~~~~l~~~~~~l~~~i~~L~~~~~~L~~~~ 109 (135)
T PRK10227 78 ADVKRRTLEKVAEIERHIEELQSMRDQLLALA 109 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344456667777777777777666655433
No 189
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=37.45 E-value=2.1e+02 Score=22.78 Aligned_cols=37 Identities=22% Similarity=0.331 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHH----HHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 40 IKQLDSHLKQVE----EASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 40 i~~L~~~i~~le----~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+..|.++|..|+ ..+..+..||.--+.|++-|+.=.+
T Consensus 137 V~kL~k~i~~Le~e~~~~q~~le~Lr~EKVdlEn~LE~EQE 177 (310)
T PF09755_consen 137 VNKLQKKIERLEKEKSAKQEELERLRREKVDLENTLEQEQE 177 (310)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhHHHHHHHHHH
Confidence 556666666665 4566677777766678888876444
No 190
>PF14932 HAUS-augmin3: HAUS augmin-like complex subunit 3
Probab=37.37 E-value=84 Score=23.46 Aligned_cols=19 Identities=11% Similarity=-0.008 Sum_probs=7.0
Q ss_pred HHHHHhhHHhhhhHHHHHH
Q psy7799 51 EEASNNAKVLRNKAHYLIK 69 (83)
Q Consensus 51 e~i~~~~K~lrnKa~~L~~ 69 (83)
+..+.....+.++...|+.
T Consensus 92 ~~lq~~~~~~~~~~~~l~~ 110 (256)
T PF14932_consen 92 NKLQQLDSSLSQELSELEG 110 (256)
T ss_pred HHHHHHHHHHHHHHHHhHH
Confidence 3333333333333333333
No 191
>PRK12704 phosphodiesterase; Provisional
Probab=37.27 E-value=2.4e+02 Score=23.45 Aligned_cols=18 Identities=22% Similarity=0.342 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy7799 37 HEDIKQLDSHLKQVEEAS 54 (83)
Q Consensus 37 ~~ei~~L~~~i~~le~i~ 54 (83)
...-..|+++.+.|+.-.
T Consensus 92 ~~Ree~Le~r~e~Lekke 109 (520)
T PRK12704 92 LQKEENLDRKLELLEKRE 109 (520)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333333334444333333
No 192
>PF04645 DUF603: Protein of unknown function, DUF603; InterPro: IPR006739 This family includes several uncharacterised proteins from Borrelia species.
Probab=36.99 E-value=1.6e+02 Score=21.86 Aligned_cols=64 Identities=17% Similarity=0.213 Sum_probs=42.8
Q ss_pred HHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hHHhhhhHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQ--------LVDRATDDMHEDIKQLDSHLKQVEEASNN--AKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 14 VqqEL~~tf~rL~~--------~Vd~~~~eL~~ei~~L~~~i~~le~i~~~--~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+.-|++-+|..|.. =-+.-...+..+|.-|+.+|..|.+-.+. -+.+..+-.-|..||+.-.+.
T Consensus 80 ~Kse~~~~r~~L~l~FI~sf~~Y~~leL~s~~~ei~~L~~kI~~L~~~in~~~k~~~n~~i~slk~EL~d~iKe 153 (181)
T PF04645_consen 80 AKSELEMERSNLELSFIDSFNQYKNLELKSIKKEIEILRLKISSLQKEINKNKKKDLNEEIESLKSELNDLIKE 153 (181)
T ss_pred HHhHHHHHHHHHhhHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhHHHHHHHHHHHHHH
Confidence 44566666665543 23445567888999999999888765543 334556777888888876554
No 193
>PF11629 Mst1_SARAH: C terminal SARAH domain of Mst1; InterPro: IPR024205 The SARAH (Sav/Rassf/Hpo) domain is found at the C terminus in three classes of eukaryotic tumour suppressors that give the domain its name. In the Sav (Salvador) and Hpo (Hippo) families, the SARAH domain mediates signal transduction from Hpo via the Sav scaffolding protein to the downstream component Wts (Warts); the phosphorylation of Wts by Hpo triggers cell cycle arrest and apoptosis by down-regulating cyclin E, Diap 1 and other targets []. The SARAH domain is also involved in dimerisation, as in the human Hpo orthologue, Mst1, which homodimerises via its C-terminal SARAH domain. The SARAH domain is found associated with other domains, such as protein kinase domains, WW/rsp5/WWP domain (IPR001202 from INTERPRO), C1 domain (IPR002219 from INTERPRO), LIM domain (IPR001781 from INTERPRO), or the Ras-associating (RA) domain (IPR000159 from INTERPRO).; GO: 0004674 protein serine/threonine kinase activity; PDB: 2JO8_A.
Probab=36.91 E-value=93 Score=18.52 Aligned_cols=27 Identities=19% Similarity=0.393 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 22 FARLCQLVDRATDDMHEDIKQLDSHLK 48 (83)
Q Consensus 22 f~rL~~~Vd~~~~eL~~ei~~L~~~i~ 48 (83)
+.-|.+.......+|+.||.+|..+-+
T Consensus 10 ~~eL~~rl~~LD~~ME~Eieelr~RY~ 36 (49)
T PF11629_consen 10 YEELQQRLASLDPEMEQEIEELRQRYQ 36 (49)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 344555555666788888888766543
No 194
>PRK06975 bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
Probab=36.87 E-value=2.6e+02 Score=23.86 Aligned_cols=12 Identities=33% Similarity=0.830 Sum_probs=7.8
Q ss_pred hhhHHHHHHHHH
Q psy7799 61 RNKAHYLIKELD 72 (83)
Q Consensus 61 rnKa~~L~~eL~ 72 (83)
|++-.|+-.|-+
T Consensus 413 ~~r~dW~laEae 424 (656)
T PRK06975 413 RNRDDWMIAEVE 424 (656)
T ss_pred cChhhhHHHHHH
Confidence 566677776655
No 195
>TIGR02043 ZntR Zn(II)-responsive transcriptional regulator. This model represents the zinc and cadmium (II) responsive transcriptional activator of the gamma proteobacterial zinc efflux system. This protein is a member of the MerR family of transcriptional activators (pfam00376) and contains a distinctive pattern of cysteine residues in its metal binding loop, Cys-Cys-X(8-9)-Cys, as well as a conserved and critical cysteine at the N-terminal end of the dimerization helix.
Probab=36.75 E-value=1.3e+02 Score=20.07 Aligned_cols=31 Identities=10% Similarity=0.230 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
....-|+..+.+++.+|+.|+.+...+..+.
T Consensus 81 ~~~~~l~~~~~~l~~~i~~L~~~~~~L~~~~ 111 (131)
T TIGR02043 81 EVKAIVDAKLELVDEKINELTKIRRSLKKLS 111 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445566777777777777776666655544
No 196
>PF01166 TSC22: TSC-22/dip/bun family; InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include: Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis. Caenorhabditis elegans hypothetical protein T18D3.7. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=36.67 E-value=77 Score=19.57 Aligned_cols=30 Identities=30% Similarity=0.407 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy7799 29 VDRATDDMHEDIKQLDSHLKQVEEASNNAK 58 (83)
Q Consensus 29 Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K 58 (83)
|.+--.-|++.|++|..++.+||.=-+.+|
T Consensus 12 VrEEVevLK~~I~eL~~~n~~Le~EN~~Lk 41 (59)
T PF01166_consen 12 VREEVEVLKEQIAELEERNSQLEEENNLLK 41 (59)
T ss_dssp -TTSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445688889999888888876554444
No 197
>PRK06032 fliH flagellar assembly protein H; Validated
Probab=36.63 E-value=1.6e+02 Score=21.06 Aligned_cols=41 Identities=12% Similarity=0.234 Sum_probs=23.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 10 CSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 10 cshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
|..++++++......|...+......+..-...+..+|-.|
T Consensus 52 A~~~~~~~la~~l~~l~~~~~~l~~~~~~~~~~~~~~lv~L 92 (199)
T PRK06032 52 AAAQSERRLADALERIAQAAAGLLAALAAVEHEMETEAADL 92 (199)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566777766666666666555555555555555444433
No 198
>PHA03386 P10 fibrous body protein; Provisional
Probab=36.58 E-value=1.2e+02 Score=20.24 Aligned_cols=37 Identities=11% Similarity=0.191 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHH
Q psy7799 36 MHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 36 L~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~ 72 (83)
|...++.+....+.||.+...+..|-+|.+.+.+-|+
T Consensus 24 LQ~qV~dv~~n~~~LDa~~~qL~~l~tkV~~Iq~iLn 60 (94)
T PHA03386 24 LQTQLNGLEEDSQPLDGLPAQLTELDTKVSDIQSILT 60 (94)
T ss_pred HHHHHHHHHhcchhhhhHHHHHHHHHHHHHHHHHhcC
Confidence 3444444444466666666677777777776666554
No 199
>PF12761 End3: Actin cytoskeleton-regulatory complex protein END3
Probab=36.54 E-value=1e+02 Score=23.00 Aligned_cols=39 Identities=18% Similarity=0.411 Sum_probs=22.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHh-hHHhhhhHHHHHHHHHHH
Q psy7799 36 MHEDIKQLDSHLKQVEEASNN-AKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 36 L~~ei~~L~~~i~~le~i~~~-~K~lrnKa~~L~~eL~~F 74 (83)
|+.|+.+|+.+|...+..... ...-.++...+-.||+..
T Consensus 101 LkrELa~Le~~l~~~~~~~~~~~~~~~~~~~lvk~e~EqL 140 (195)
T PF12761_consen 101 LKRELAELEEKLSKVEQAAESRRSDTDSKPALVKREFEQL 140 (195)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcccCCcchHHHHHHHHHHH
Confidence 666677777777666666655 223334445555555543
No 200
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=36.52 E-value=1.9e+02 Score=24.36 Aligned_cols=36 Identities=14% Similarity=0.323 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVD---RATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 17 EL~~tf~rL~~~Vd---~~~~eL~~ei~~L~~~i~~le~ 52 (83)
||..-++.|.++++ ....+++.+|++|+.++..|+.
T Consensus 80 ELEKqLaaLrqElq~~saq~~dle~KIkeLEaE~~~Lk~ 118 (475)
T PRK13729 80 QMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAE 118 (475)
T ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHH
Confidence 44444455554444 4444455555555555555543
No 201
>PF02994 Transposase_22: L1 transposable element; InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=36.40 E-value=1.1e+02 Score=24.24 Aligned_cols=36 Identities=28% Similarity=0.504 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHHHH----HHhhHHhhhhHHHHHH
Q psy7799 34 DDMHEDIKQLDSHLKQVEEA----SNNAKVLRNKAHYLIK 69 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i----~~~~K~lrnKa~~L~~ 69 (83)
.++++.|..+++.+..++.. ..+.+.+.+|...|++
T Consensus 147 ~e~Eeris~lEd~~~~i~~~~~~~~k~i~~l~~kl~DlEn 186 (370)
T PF02994_consen 147 DELEERISELEDRIEEIEQAIKELEKRIKKLEDKLDDLEN 186 (370)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHhHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHh
Confidence 34455555555554444332 2233344444444443
No 202
>COG1293 Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]
Probab=36.28 E-value=1.7e+02 Score=24.57 Aligned_cols=25 Identities=24% Similarity=0.270 Sum_probs=12.1
Q ss_pred HHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 51 EEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 51 e~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
+.+.++.+.|++|+.+++.+++.|.
T Consensus 289 ~~l~~~~~kl~~~i~~~~~~~~~~~ 313 (564)
T COG1293 289 KKLEKELKKLENKLEKQEDELEELE 313 (564)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444455555555555554443
No 203
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=36.27 E-value=1.2e+02 Score=19.40 Aligned_cols=29 Identities=10% Similarity=0.343 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHhhh
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAKVLRN 62 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K~lrn 62 (83)
+.+..+|..+...+..+.......+..++
T Consensus 9 ~~l~~~i~~l~~~~~~l~~~~~e~~~~~~ 37 (129)
T cd00890 9 QQLQQQLEALQQQLQKLEAQLTEYEKAKE 37 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555555555555555555444443
No 204
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=36.21 E-value=70 Score=20.25 Aligned_cols=20 Identities=10% Similarity=0.363 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy7799 34 DDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i 53 (83)
++|-...++|+.+|+.||.|
T Consensus 45 ~~L~~~a~rm~eRI~tLE~I 64 (75)
T TIGR02976 45 QELYAKADRLEERIDTLERI 64 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44555555666666666655
No 205
>PF10376 Mei5: Double-strand recombination repair protein ; InterPro: IPR018468 Mei5 is one of a pair of meiosis-specific proteins which facilitate the loading of Dmc1 on to Rad51 on DNA at double-strand breaks during recombination. Recombination is carried out by a large protein complex based around the two RecA homologues, Rad51 and Dmc1 []. This complex may play both a catalytic and a structural role in the interaction between homologous chromosomes during meiosis. Mei5 is seen to contain a coiled-coli region.
Probab=36.11 E-value=1.8e+02 Score=21.69 Aligned_cols=58 Identities=21% Similarity=0.249 Sum_probs=39.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDS-----HLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~-----~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
-++||..--++|..+|+.-.. +++.|.. --..++.++...+.||+-+.-...+|..+.
T Consensus 132 ~~~el~~ek~kL~~q~~e~~e----~lr~L~~~k~~r~Kn~~~~Lq~lI~Kwr~~~q~~l~eL~~~~ 194 (221)
T PF10376_consen 132 KQQELEEEKRKLEKQVDEKEE----ELRRLKLVKQYRSKNDLEQLQSLIKKWRSASQEALYELQSEM 194 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHH----HHHHhHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467888888888888887433 3333322 123456788888889988887777777654
No 206
>cd01282 HTH_MerR-like_sg3 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 3). Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=36.05 E-value=1.2e+02 Score=19.59 Aligned_cols=28 Identities=4% Similarity=0.252 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAK 58 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K 58 (83)
....-+++.+.+++.+|..|......+.
T Consensus 81 ~~~~~l~~~~~~l~~~i~~L~~~~~~L~ 108 (112)
T cd01282 81 DLLAVLRRELARIDRQIADLTRSRDRLD 108 (112)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455667777777777777766665543
No 207
>PRK15374 pathogenicity island 1 effector protein SipB; Provisional
Probab=36.00 E-value=2.5e+02 Score=24.35 Aligned_cols=50 Identities=18% Similarity=0.177 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKEL 71 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL 71 (83)
..||+..|...-..+.+..+. |+..|..++.+++.+....+|.+-|+.++
T Consensus 122 ~~~lS~~ledaL~aaq~~ad~-------l~q~~~~~~~Aq~~l~~aq~~l~~lq~~a 171 (593)
T PRK15374 122 GIQVSKEFQTALGEAQEATDL-------YEASIKKTDTAKSVYDAAEKKLTQAQNKL 171 (593)
T ss_pred HHHHHHHHHHHHHHHHHHHHH-------HHHHHhhhhhHHHHHHHHHHHHHHHHHHH
Confidence 344555555444444444444 44444444444554444444444444443
No 208
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=35.99 E-value=2e+02 Score=22.00 Aligned_cols=31 Identities=19% Similarity=0.115 Sum_probs=22.5
Q ss_pred HHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 48 KQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 48 ~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
+.++-+..+...++.|..-.+.+|..|.+.+
T Consensus 170 ~a~~fl~~ql~~~~~~l~~ae~~l~~fr~~~ 200 (362)
T TIGR01010 170 DTIAFAENEVKEAEQRLNATKAELLKYQIKN 200 (362)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 5666777777777777777777777777765
No 209
>PRK14147 heat shock protein GrpE; Provisional
Probab=35.83 E-value=83 Score=22.53 Aligned_cols=23 Identities=9% Similarity=0.108 Sum_probs=11.6
Q ss_pred HHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 54 SNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 54 ~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
..++..|+++.-.+..++++|.+
T Consensus 31 ~~e~~elkd~~lR~~Ad~eN~rk 53 (172)
T PRK14147 31 RSEIALVKADALRERADLENQRK 53 (172)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334445555555555555554
No 210
>KOG0396|consensus
Probab=35.79 E-value=2.5e+02 Score=23.15 Aligned_cols=72 Identities=21% Similarity=0.246 Sum_probs=45.2
Q ss_pred hhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHH-HHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 5 LTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDS-------HLKQVE-EASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 5 ~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~-------~i~~le-~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+.|.+|.| ..+.-.-+.+|-..|....+-|++-|+..+. +|+.++ .+++.. =.|=-.|=.+.|++|.-
T Consensus 48 ~~sn~~~~--~d~~~~~id~Li~kv~~~krk~e~~iq~e~~~~~~iksRid~m~e~~~~d~--~~~~~~w~r~~l~r~vv 123 (389)
T KOG0396|consen 48 LISNIVPH--LDSTVSLIDRLIRKVQCLKRKLEEYIQSEEEQLKRIKSRIDFMHEEISSDT--PANSRKWPRNKLDRFVV 123 (389)
T ss_pred HhcccChh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC--chHHHHhHHHHHHHHHH
Confidence 44566766 6666677888888888888888877655433 333343 122111 11333566788899988
Q ss_pred Hhcc
Q psy7799 77 AYLV 80 (83)
Q Consensus 77 ~yL~ 80 (83)
.||-
T Consensus 124 dhml 127 (389)
T KOG0396|consen 124 DHML 127 (389)
T ss_pred HHHH
Confidence 8863
No 211
>cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane.
Probab=35.73 E-value=1.3e+02 Score=20.56 Aligned_cols=65 Identities=25% Similarity=0.360 Sum_probs=35.9
Q ss_pred hhhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 3 VDLTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 3 V~~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+.-..||+.-+.+|+++ +.=|...+..+......++++ .-|+-++.=+..++++ ..|..|.+.|
T Consensus 65 Le~~vkNCG~~fh~evas-~~fl~~l~~l~~~~~~~~Vk~-----kil~li~~W~~~f~~~-----~~l~~i~~~y 129 (142)
T cd03569 65 LESCVKNCGTHFHDEVAS-REFMDELKDLIKTTKNEEVRQ-----KILELIQAWALAFRNK-----PQLKYVVDTY 129 (142)
T ss_pred HHHHHHHCCHHHHHHHhh-HHHHHHHHHHHcccCCHHHHH-----HHHHHHHHHHHHhCCC-----cccHHHHHHH
Confidence 456778999999999984 555555444444322222222 2344444444555543 3455555555
No 212
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=35.42 E-value=1.2e+02 Score=19.37 Aligned_cols=27 Identities=26% Similarity=0.267 Sum_probs=16.3
Q ss_pred HHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 50 VEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 50 le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
++.+....+.+.++-..++.++..+..
T Consensus 72 ~e~le~~i~~l~~~~~~l~~~~~elk~ 98 (105)
T cd00632 72 LETIELRIKRLERQEEDLQEKLKELQE 98 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455556666666666666666554
No 213
>cd04784 HTH_CadR-PbrR Helix-Turn-Helix DNA binding domain of the CadR and PbrR transcription regulators. Helix-turn-helix (HTH) CadR and PbrR transcription regulators including Pseudomonas aeruginosa CadR and Ralstonia metallidurans PbrR that regulate expression of the cadmium and lead resistance operons, respectively. These proteins are comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the C-terminal domains have three conserved cysteines which form a putative metal binding site. Some members in this group have a histidine-rich C-terminal extension. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements.
Probab=35.24 E-value=1.3e+02 Score=19.69 Aligned_cols=31 Identities=10% Similarity=0.298 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
....-|+..+.+++.+|..|..+...+..+.
T Consensus 79 ~~~~~l~~~~~~l~~~i~~L~~~~~~L~~~~ 109 (127)
T cd04784 79 EVNALIDEHLAHVRARIAELQALEKQLQALR 109 (127)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445566677777777777776666655544
No 214
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=35.16 E-value=81 Score=20.01 Aligned_cols=23 Identities=17% Similarity=0.270 Sum_probs=9.9
Q ss_pred HHHHHHHhhHHhhhhHHHHHHHH
Q psy7799 49 QVEEASNNAKVLRNKAHYLIKEL 71 (83)
Q Consensus 49 ~le~i~~~~K~lrnKa~~L~~eL 71 (83)
.|+.+..++..+..+..-||+-|
T Consensus 43 ~L~~L~~~a~rm~eRI~tLE~IL 65 (75)
T PF06667_consen 43 RLQELYEQAERMEERIETLERIL 65 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444433
No 215
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=35.14 E-value=2.5e+02 Score=22.93 Aligned_cols=66 Identities=12% Similarity=0.179 Sum_probs=38.2
Q ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 14 VEQELSSTFARLCQ---LVDRATDDMHEDIKQLDSHLK-QVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 14 VqqEL~~tf~rL~~---~Vd~~~~eL~~ei~~L~~~i~-~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
+.++....+..+.. .+......|++++..|..++. .+.-+...+---|-|...||.+|+.+.+-+.
T Consensus 206 ~~~~~~~~l~~~~~el~eik~~~~~L~~~~e~Lk~~~~~e~~~~~~~LqEEr~R~erLEeqlNd~~elHq 275 (395)
T PF10267_consen 206 VSSQQNLGLQKILEELREIKESQSRLEESIEKLKEQYQREYQFILEALQEERYRYERLEEQLNDLTELHQ 275 (395)
T ss_pred ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH
Confidence 33444444444433 333444555666666665544 3344444455557888889999999887654
No 216
>PRK14149 heat shock protein GrpE; Provisional
Probab=34.93 E-value=85 Score=23.11 Aligned_cols=29 Identities=10% Similarity=0.068 Sum_probs=17.9
Q ss_pred HHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 49 QVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 49 ~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.++.++.++..|++|.-.+..++++|.+-
T Consensus 44 ~~~~l~~e~~elkd~~lR~~AefEN~rKR 72 (191)
T PRK14149 44 IKEDFELKYKEMHEKYLRVHADFENVKKR 72 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555556666666666666666666553
No 217
>PF12017 Tnp_P_element: Transposase protein; InterPro: IPR021896 Protein in this family are transposases found in insects. This region is about 230 amino acids in length and is found associated with PF05485 from PFAM.
Probab=34.86 E-value=1.8e+02 Score=21.94 Aligned_cols=22 Identities=14% Similarity=0.341 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhh
Q psy7799 36 MHEDIKQLDSHLKQVEEASNNA 57 (83)
Q Consensus 36 L~~ei~~L~~~i~~le~i~~~~ 57 (83)
++.|++.|..+|..|+.-..++
T Consensus 16 ~~~e~~~Lk~kir~le~~l~~L 37 (236)
T PF12017_consen 16 LKIENKKLKKKIRRLEKELKKL 37 (236)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444443333
No 218
>PRK14144 heat shock protein GrpE; Provisional
Probab=34.67 E-value=1.1e+02 Score=22.63 Aligned_cols=24 Identities=21% Similarity=0.187 Sum_probs=13.2
Q ss_pred HHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 53 ASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 53 i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+..++..+++|.-.+..++++|.+
T Consensus 57 le~e~~elkdk~lR~~AefeN~RK 80 (199)
T PRK14144 57 AEQKAHENWEKSVRALAELENVRR 80 (199)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444555566666666666654
No 219
>PRK08453 fliD flagellar capping protein; Validated
Probab=34.39 E-value=2.9e+02 Score=24.19 Aligned_cols=61 Identities=10% Similarity=0.221 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHH-HHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 18 LSSTFARLCQLVDRAT-----------DDMHEDIKQLDSHLKQVE-EASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 18 L~~tf~rL~~~Vd~~~-----------~eL~~ei~~L~~~i~~le-~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
..+.|.||...++... ..|..+++.+++++.++| .+..+...++.++.-|+..+..-..+|
T Consensus 597 ~~Gi~~rl~~~L~~~i~g~~G~l~~~~~sL~~q~k~L~~q~~~~e~rL~~ry~rl~~qFsAmDs~IsqmNsq~ 669 (673)
T PRK08453 597 QEGIFSKFNQVIANLIDGGNAKLKIYEDSLTRDAKSLTKDKENAQELLKTRYDIMAERFAAYDSQISKANQKF 669 (673)
T ss_pred cCcHHHHHHHHHHHHhcCCCceehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3566777777666543 247778888888887665 355667778888888887776665554
No 220
>PF14966 DNA_repr_REX1B: DNA repair REX1-B
Probab=34.18 E-value=1.3e+02 Score=19.52 Aligned_cols=40 Identities=18% Similarity=0.355 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMH-----EDIKQLDSHLKQVEE 52 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~-----~ei~~L~~~i~~le~ 52 (83)
|...++..-|+.+..+|-...+.|. .++..+=..|+..|+
T Consensus 40 ~~~~~iT~~f~~~S~ei~~ie~~L~~~~~~~~la~~i~~lQ~~Ek 84 (97)
T PF14966_consen 40 QLCHEITQEFSAISKEILAIEAELRDEHERPDLAELIRELQEQEK 84 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccCCHHHHHHHHHHHHHHH
Confidence 4444555555555555555555555 455555555555554
No 221
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=34.08 E-value=1.6e+02 Score=20.53 Aligned_cols=46 Identities=17% Similarity=0.400 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
...++.+|+..+++|..|..-..-...|+.+..-|+.+.....+.|
T Consensus 29 ~~~~k~ql~~~d~~i~~Lk~~~~d~eeLk~~i~~lq~~~~~~~~~~ 74 (155)
T PF06810_consen 29 RDNLKTQLKEADKQIKDLKKSAKDNEELKKQIEELQAKNKTAKEEY 74 (155)
T ss_pred HHHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555555556666665555555566666666666666554444
No 222
>PF14282 FlxA: FlxA-like protein
Probab=34.03 E-value=1.4e+02 Score=19.56 Aligned_cols=19 Identities=26% Similarity=0.646 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy7799 34 DDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~ 52 (83)
..|+..|..|..+|..|..
T Consensus 22 ~~L~~Qi~~Lq~ql~~l~~ 40 (106)
T PF14282_consen 22 EQLQKQIKQLQEQLQELSQ 40 (106)
T ss_pred HHHHHHHHHHHHHHHHHHc
Confidence 3444455555555544433
No 223
>PF14932 HAUS-augmin3: HAUS augmin-like complex subunit 3
Probab=33.94 E-value=1e+02 Score=22.98 Aligned_cols=29 Identities=31% Similarity=0.447 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHhhhhHHH
Q psy7799 38 EDIKQLDSHLKQVEEASNNAKVLRNKAHY 66 (83)
Q Consensus 38 ~ei~~L~~~i~~le~i~~~~K~lrnKa~~ 66 (83)
.+|..|+.+++.|.++......+|||...
T Consensus 68 ~~le~Le~el~~l~~~~~~~~~~~~~lq~ 96 (256)
T PF14932_consen 68 EDLEALEEELEALQEYKELYEQLRNKLQQ 96 (256)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444444444433
No 224
>PRK02224 chromosome segregation protein; Provisional
Probab=33.87 E-value=3e+02 Score=23.49 Aligned_cols=11 Identities=18% Similarity=0.522 Sum_probs=3.9
Q ss_pred HHHHHHHHHHH
Q psy7799 40 IKQLDSHLKQV 50 (83)
Q Consensus 40 i~~L~~~i~~l 50 (83)
|..+...+..+
T Consensus 683 i~~~~~~~e~~ 693 (880)
T PRK02224 683 IGAVENELEEL 693 (880)
T ss_pred HHHHHHHHHHH
Confidence 33333333333
No 225
>cd03561 VHS VHS domain family; The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It has a superhelical structure similar to that of the ARM (Armadillo) repeats and is present at the N-termini of proteins involved in intracellular membrane trafficking. There are four general groups of VHS domain containing proteins based on their association with other domains. The first group consists of proteins of the STAM/EAST/Hbp family which has the domain composition VHS-SH3-ITAM. The second consists of proteins with a FYVE domain C-terminal to VHS. The third consists of GGA proteins with a domain composition VHS-GAT (GGA and TOM)-GAE (gamma-adaptin ear) domain. The fourth consists of proteins with a VHS domain alone or with domains other than those mentioned above. In GGA proteins, VHS domains are involved in cargo recognition in trans-Golgi, thereby having a general me
Probab=33.85 E-value=1.2e+02 Score=20.16 Aligned_cols=32 Identities=22% Similarity=0.388 Sum_probs=22.5
Q ss_pred chhhhhhchhHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 2 IVDLTSSNCSHQVEQELSSTFARLCQLVDRATD 34 (83)
Q Consensus 2 iV~~tssncshQVqqEL~~tf~rL~~~Vd~~~~ 34 (83)
+.+.-+.||+..+.+|+++ +.-|...+.....
T Consensus 60 lLd~~vkNcg~~f~~~i~s-~~fl~~l~~l~~~ 91 (133)
T cd03561 60 LLELLVKNCGKPFHLQVAD-KEFLLELVKIAKN 91 (133)
T ss_pred HHHHHHHhCChHHHHHHhh-HHHHHHHHHHhCC
Confidence 3567789999999999998 5545554444443
No 226
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=33.75 E-value=3.2e+02 Score=23.80 Aligned_cols=56 Identities=16% Similarity=0.294 Sum_probs=36.3
Q ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHH----HHHHHHHHHhhHHhhhhHHHHHHHH
Q psy7799 16 QELSSTFARLCQLV---DRATDDMHEDIKQLDSH----LKQVEEASNNAKVLRNKAHYLIKEL 71 (83)
Q Consensus 16 qEL~~tf~rL~~~V---d~~~~eL~~ei~~L~~~----i~~le~i~~~~K~lrnKa~~L~~eL 71 (83)
++|..=+.+|.... ++...+|+.|+.+|... -...|.+-+.+-.++.|..-||+-|
T Consensus 548 ~~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~~e~~~~~e~L~~aL~amqdk~~~LE~sL 610 (697)
T PF09726_consen 548 RQLESELKKLRRELKQKEEQIRELESELQELRKYEKESEKDTEVLMSALSAMQDKNQHLENSL 610 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhh
Confidence 55666566665443 45555666666555543 2346777777888889998888866
No 227
>PRK14161 heat shock protein GrpE; Provisional
Probab=33.50 E-value=1.4e+02 Score=21.63 Aligned_cols=25 Identities=16% Similarity=0.154 Sum_probs=12.0
Q ss_pred HHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 53 ASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 53 i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+..++..+++|.-.+..++++|.+-
T Consensus 31 l~~e~~elkd~~lR~~AefeN~rkR 55 (178)
T PRK14161 31 LKAEIEELKDKLIRTTAEIDNTRKR 55 (178)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333344445555555555555543
No 228
>KOG4674|consensus
Probab=33.49 E-value=1.8e+02 Score=28.52 Aligned_cols=45 Identities=20% Similarity=0.365 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
-+|+..-|.++..++-.-..+++.+++.+.++|.+++.+..++..
T Consensus 1330 ~~el~~~~~~~q~~~k~qld~l~~e~~~lt~~~~ql~~~~~rL~~ 1374 (1822)
T KOG4674|consen 1330 IAELKKELNRLQEKIKKQLDELNNEKANLTKELEQLEDLKTRLAA 1374 (1822)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 468888899999777777888888888888888888888777653
No 229
>PF09744 Jnk-SapK_ap_N: JNK_SAPK-associated protein-1; InterPro: IPR019143 This entry represents the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have a RhoGEF domain (IPR000219 from INTERPRO) at their C-terminal end.
Probab=33.32 E-value=1.8e+02 Score=20.69 Aligned_cols=28 Identities=25% Similarity=0.395 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy7799 29 VDRATDDMHEDIKQLDSHLKQVEEASNN 56 (83)
Q Consensus 29 Vd~~~~eL~~ei~~L~~~i~~le~i~~~ 56 (83)
........+.+.++|..+|.+|+...+.
T Consensus 80 l~~~Ed~~~~e~k~L~~~v~~Le~e~r~ 107 (158)
T PF09744_consen 80 LLELEDQWRQERKDLQSQVEQLEEENRQ 107 (158)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344556667777777777777665533
No 230
>COG5002 VicK Signal transduction histidine kinase [Signal transduction mechanisms]
Probab=33.23 E-value=1.2e+02 Score=25.39 Aligned_cols=35 Identities=29% Similarity=0.442 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
-||+..|..|.-.+..+++.++.|-++|..=+..+
T Consensus 86 geLa~~fn~Lt~~~~~aq~n~e~Er~kL~SvlayM 120 (459)
T COG5002 86 GELADSFNDLTKRVQEAQANTEQERRKLDSVLAYM 120 (459)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Confidence 48999999999999999999999999988776654
No 231
>PF10835 DUF2573: Protein of unknown function (DUF2573); InterPro: IPR020393 This entry contains proteins with no known function.
Probab=33.12 E-value=1.1e+02 Score=20.04 Aligned_cols=42 Identities=24% Similarity=0.369 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAK 58 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K 58 (83)
+|+-+---|-.-=+++.-|.+++|+++-.+|.+|.+.++..+
T Consensus 39 hiaKsMPpL~kHWN~~~PeaK~~ik~li~~Ik~lNe~~r~~~ 80 (82)
T PF10835_consen 39 HIAKSMPPLAKHWNGTYPEAKEEIKELIEEIKQLNEAHRANK 80 (82)
T ss_pred HHHHhCcHHHHhhcccCchHHHHHHHHHHHHHHHHHHHHhhc
Confidence 334444444444556678889999999999999998876543
No 232
>PF11855 DUF3375: Protein of unknown function (DUF3375); InterPro: IPR021804 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length.
Probab=33.09 E-value=73 Score=26.02 Aligned_cols=38 Identities=16% Similarity=0.253 Sum_probs=31.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le 51 (83)
|...|...-.....-.+.=...|+.+|++++.+|+.|+
T Consensus 127 v~~~l~~la~~~~~Dp~~Ri~~Le~e~~~i~~EI~~l~ 164 (478)
T PF11855_consen 127 VFDALRQLAEGTDPDPERRIAELEREIAEIDAEIDRLE 164 (478)
T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 66677777777777777888899999999999998876
No 233
>PF08653 DASH_Dam1: DASH complex subunit Dam1; InterPro: IPR013962 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=33.01 E-value=1.2e+02 Score=18.45 Aligned_cols=30 Identities=10% Similarity=0.318 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHh
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVL 60 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~l 60 (83)
..-.+|.+++..|.+.+.+|..|+..+..+
T Consensus 5 ~~f~eL~D~~~~L~~n~~~L~~ihesL~~F 34 (58)
T PF08653_consen 5 PQFAELSDSMETLDKNMEQLNQIHESLSDF 34 (58)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345678888888888888888888776653
No 234
>cd04770 HTH_HMRTR Helix-Turn-Helix DNA binding domain of Heavy Metal Resistance transcription regulators. Helix-turn-helix (HTH) heavy metal resistance transcription regulators (HMRTR): MerR1 (mercury), CueR (copper), CadR (cadmium), PbrR (lead), ZntR (zinc), and other related proteins. These transcription regulators mediate responses to heavy metal stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=32.85 E-value=1.4e+02 Score=19.29 Aligned_cols=30 Identities=7% Similarity=0.219 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHh
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVL 60 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~l 60 (83)
....-|+..+.+++.+|+.|..+...+..+
T Consensus 79 ~~~~~l~~~~~~l~~~i~~l~~~~~~l~~~ 108 (123)
T cd04770 79 EVRALLEEKLAEVEAKIAELQALRAELAGL 108 (123)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556666777777777766666554433
No 235
>PRK09413 IS2 repressor TnpA; Reviewed
Probab=32.85 E-value=1.4e+02 Score=19.46 Aligned_cols=40 Identities=15% Similarity=0.108 Sum_probs=25.1
Q ss_pred HHHHHHHH---HHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELS---STFARLCQLVDR----------------ATDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 12 hQVqqEL~---~tf~rL~~~Vd~----------------~~~eL~~ei~~L~~~i~~le 51 (83)
.+|.+|+. +|+.+|..+... ...++++||++|.+++.+|+
T Consensus 33 ~evA~e~gIs~~tl~~W~r~y~~~~~~~~~~~~~~~~~~~~~~~~~ei~~L~~el~~L~ 91 (121)
T PRK09413 33 SLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKT 91 (121)
T ss_pred HHHHHHHCcCHHHHHHHHHHHhhcccccccccccCCchhHHHHHHHHHHHHHHHHHHHH
Confidence 56666664 566667666532 12346777778877777765
No 236
>COG0188 GyrA Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit [DNA replication, recombination, and repair]
Probab=32.81 E-value=1.7e+02 Score=26.09 Aligned_cols=52 Identities=19% Similarity=0.366 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
||.+....-...+..|.++|.++|..|+.+-..-+.+. ..+..||....+.|
T Consensus 427 rL~rLt~le~~~i~~E~~~L~~~i~~l~~iL~s~~~~~---~~i~~eL~~~~~k~ 478 (804)
T COG0188 427 RLRRLTGLEEEKIEKELKELEKEIADLEKILASEERLL---DIIKKELLEIKKKF 478 (804)
T ss_pred hHHHhhcchHHHHHHHHHHHHHHHHHHHHHhcCHHHHH---HHHHHHHHHHHHHh
Confidence 34444445667889999999999999999988877766 46677888877777
No 237
>KOG0612|consensus
Probab=32.60 E-value=4.5e+02 Score=25.10 Aligned_cols=72 Identities=24% Similarity=0.219 Sum_probs=0.0
Q ss_pred hhhchhHHHHHHHH---HHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhh
Q psy7799 6 TSSNCSHQVEQELS---STFARLCQLVDR--------------------ATDDMHEDIKQLDSHLKQVEEASNNAKVLRN 62 (83)
Q Consensus 6 tssncshQVqqEL~---~tf~rL~~~Vd~--------------------~~~eL~~ei~~L~~~i~~le~i~~~~K~lrn 62 (83)
++.+|+.|+++|+. ..-.+++..-.- -..+..+++.++..+|..|+.-+...+.-=-
T Consensus 568 ~~~e~~~~iq~~~e~~~~~~d~l~~le~~k~~ls~~~~~~~~~~e~~~~~~~~~~e~~~~l~~~i~sL~~~~~~~~~~l~ 647 (1317)
T KOG0612|consen 568 HSKELSKQIQQELEENRDLEDKLSLLEESKSKLSKENKKLRSELEKERRQRTEISEIIAELKEEISSLEETLKAGKKELL 647 (1317)
T ss_pred hhhhhhHHHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHhhhhHHH
Q ss_pred hHHHHHHHHHHHHHH
Q psy7799 63 KAHYLIKELDMFQEA 77 (83)
Q Consensus 63 Ka~~L~~eL~~F~~~ 77 (83)
|+-.|..+...-..+
T Consensus 648 k~~el~r~~~e~~~~ 662 (1317)
T KOG0612|consen 648 KVEELKRENQERISD 662 (1317)
T ss_pred HHHHHHHHHHHHHHH
No 238
>PF08855 DUF1825: Domain of unknown function (DUF1825); InterPro: IPR014954 These roteins are uncharacterised and are principally found in cyanobacteria.
Probab=32.50 E-value=1.6e+02 Score=20.00 Aligned_cols=64 Identities=17% Similarity=0.319 Sum_probs=39.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhHHhhhh--------HHHHHHHHHHHHHHh
Q psy7799 11 SHQVEQELSSTFARLCQLVDRATD----DMHEDIKQLDSHLKQVEEASNNAKVLRNK--------AHYLIKELDMFQEAY 78 (83)
Q Consensus 11 shQVqqEL~~tf~rL~~~Vd~~~~----eL~~ei~~L~~~i~~le~i~~~~K~lrnK--------a~~L~~eL~~F~~~y 78 (83)
|.+||.|+...|.-.-...-...+ +.++.+.- |+.++.+..+-|.+=.+ |.+.-.+++.....|
T Consensus 7 SeiVq~e~~~if~~yq~l~~~~~~~~~fd~egK~~~----Id~m~~LidkqkiF~~Rl~LSdD~~Ak~m~~qi~~~~~~f 82 (108)
T PF08855_consen 7 SEIVQDELQDIFEDYQELMQMGSKYGKFDREGKKIH----IDKMEELIDKQKIFYKRLELSDDPEAKDMKEQINAQLNQF 82 (108)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHhhcccCCHHHHHHH----HHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHHHHHc
Confidence 789999999888765544444333 44555444 77777777777776544 344445555555555
No 239
>COG0497 RecN ATPase involved in DNA repair [DNA replication, recombination, and repair]
Probab=32.46 E-value=2.6e+02 Score=24.00 Aligned_cols=51 Identities=18% Similarity=0.229 Sum_probs=40.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 29 VDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 29 Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
+-.....++.|+..|+....+++.+..+.+.++.+.......|-.....|.
T Consensus 323 l~~~~~~~~~el~~L~~~~~~~~~Le~~~~~l~~~~~~~A~~Ls~~R~~~A 373 (557)
T COG0497 323 LLEYLDKIKEELAQLDNSEESLEALEKEVKKLKAELLEAAEALSAIRKKAA 373 (557)
T ss_pred HHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345567888888888888888888999999888888888888877776653
No 240
>PRK14140 heat shock protein GrpE; Provisional
Probab=32.43 E-value=1.4e+02 Score=21.92 Aligned_cols=27 Identities=11% Similarity=0.198 Sum_probs=16.2
Q ss_pred HHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 51 EEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 51 e~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+.+..++..|+++.-.+..++++|.+-
T Consensus 47 ~~l~~ei~elkd~~lR~~Ae~eN~rkR 73 (191)
T PRK14140 47 AELEAKLDELEERYLRLQADFENYKRR 73 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445555666666666677776654
No 241
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=32.32 E-value=1.4e+02 Score=19.30 Aligned_cols=42 Identities=17% Similarity=0.299 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHH
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~ 72 (83)
.++++..-++..++.|.+.+..|+ ..+..++..+.-+...|+
T Consensus 83 ~~~eA~~~l~~~~~~l~~~~~~l~---~~l~~l~~~~~~i~~~l~ 124 (126)
T TIGR00293 83 DAEEAIEFLKKRIEELEKAIEKLQ---EALAELASRAQQLEQEAQ 124 (126)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHh
Confidence 356666667777777666655443 344555556666655554
No 242
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=32.25 E-value=3e+02 Score=22.93 Aligned_cols=67 Identities=25% Similarity=0.380 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQ----LVDRATDDMHEDIKQLDSHL-----------KQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 13 QVqqEL~~tf~rL~~----~Vd~~~~eL~~ei~~L~~~i-----------~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
++...|...-+.|.. .|.....++.++|+.|-..+ ..+..+...+..++.....|..|++.-..+
T Consensus 256 ~i~~~l~~~~~~L~~l~l~~~~~~~~~i~~~Id~lYd~le~E~~Ak~~V~~~~~~l~~~l~~~~~~~~~l~~e~~~v~~s 335 (560)
T PF06160_consen 256 QIEEQLEEALALLKNLELDEVEEENEEIEERIDQLYDILEKEVEAKKYVEKNLKELYEYLEHAKEQNKELKEELERVSQS 335 (560)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334444444444432 34444445555555443322 235555666667778888899999999999
Q ss_pred hc
Q psy7799 78 YL 79 (83)
Q Consensus 78 yL 79 (83)
|.
T Consensus 336 Y~ 337 (560)
T PF06160_consen 336 YT 337 (560)
T ss_pred cC
Confidence 93
No 243
>PF07439 DUF1515: Protein of unknown function (DUF1515); InterPro: IPR010889 This family consists of several hypothetical bacterial proteins of around 130 residues in length. Members of this family seem to be found exclusively in Rhizobium species. The function of this family is unknown.
Probab=32.15 E-value=1.7e+02 Score=20.15 Aligned_cols=38 Identities=18% Similarity=0.268 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEAS 54 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~ 54 (83)
++..-|.|=...++.....|...+++|-+++..||..-
T Consensus 19 ~lRed~r~SEdrsa~SRa~mhrRlDElV~Rv~~lEs~~ 56 (112)
T PF07439_consen 19 ELREDIRRSEDRSAASRASMHRRLDELVERVTTLESSV 56 (112)
T ss_pred HHHHHHHHHHHHhhhhhHHHHHhHHHHHHHHHHHHHHH
Confidence 34444555556666666666677777777777776433
No 244
>PRK15365 type III secretion system chaperone SseA; Provisional
Probab=31.65 E-value=1.1e+02 Score=20.89 Aligned_cols=32 Identities=22% Similarity=0.346 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh-HHhhhhH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNA-KVLRNKA 64 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~-K~lrnKa 64 (83)
..||+..-..|..++..+..++... -.|..||
T Consensus 11 l~DL~~rYs~L~s~lkKfkq~q~~I~q~L~eRA 43 (107)
T PRK15365 11 YRDLEQSYMQLNHCLKKFHQIRAKVSQQLAERA 43 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3456666666666666666666665 5555444
No 245
>PLN02678 seryl-tRNA synthetase
Probab=31.64 E-value=2.9e+02 Score=22.70 Aligned_cols=45 Identities=16% Similarity=0.254 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHHHHH-------HHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 34 DDMHEDIKQLDSHLKQ-------VEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~-------le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+|..+.++++++|.. .+.+..+++.|+.+..-|+.+++...+..
T Consensus 50 e~lr~erN~~sk~I~~~k~~~~~~~~l~~~~~~Lk~ei~~le~~~~~~~~~l 101 (448)
T PLN02678 50 DSLRKEFNKLNKEVAKLKIAKEDATELIAETKELKKEITEKEAEVQEAKAAL 101 (448)
T ss_pred HHHHHHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555555643 34555567777777777777776655543
No 246
>PF14988 DUF4515: Domain of unknown function (DUF4515)
Probab=31.60 E-value=2.1e+02 Score=20.98 Aligned_cols=41 Identities=20% Similarity=0.264 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASN 55 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~ 55 (83)
+++|.+.|+.-......-.-+.+.++..|..+++.|..+..
T Consensus 38 r~~l~s~y~~q~~~Lq~qLlq~~k~~~~l~~eLq~l~~~~~ 78 (206)
T PF14988_consen 38 RQELVSRYAKQTSELQDQLLQKEKEQAKLQQELQALKEFRR 78 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 34556666655555555555555555555555555555443
No 247
>PRK02793 phi X174 lysis protein; Provisional
Probab=31.52 E-value=1.1e+02 Score=18.79 Aligned_cols=9 Identities=22% Similarity=0.541 Sum_probs=3.7
Q ss_pred HhhHHhhhh
Q psy7799 55 NNAKVLRNK 63 (83)
Q Consensus 55 ~~~K~lrnK 63 (83)
...+.|++|
T Consensus 43 ~~l~~L~~r 51 (72)
T PRK02793 43 DHLRLLTEK 51 (72)
T ss_pred HHHHHHHHH
Confidence 334444433
No 248
>PTZ00464 SNF-7-like protein; Provisional
Probab=31.42 E-value=2.2e+02 Score=21.09 Aligned_cols=16 Identities=13% Similarity=0.208 Sum_probs=10.0
Q ss_pred hHHHHHHHHHHHHHHh
Q psy7799 63 KAHYLIKELDMFQEAY 78 (83)
Q Consensus 63 Ka~~L~~eL~~F~~~y 78 (83)
|=-++|.+|+....+.
T Consensus 69 ~KK~~E~ql~~l~~q~ 84 (211)
T PTZ00464 69 QKRMYQNQQDMMMQQQ 84 (211)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3356777777766554
No 249
>PF14915 CCDC144C: CCDC144C protein coiled-coil region
Probab=31.31 E-value=2.7e+02 Score=22.21 Aligned_cols=61 Identities=15% Similarity=0.226 Sum_probs=45.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
-.||++|+.|-+++. +...-...|=..+++-|.+=|.++.++--|.+..-||.-+|+...+
T Consensus 182 E~~QrdL~Qtq~q~K----E~e~m~qne~~kv~k~~~Kqes~eERL~QlqsEN~LLrQQLddA~~ 242 (305)
T PF14915_consen 182 ESVQRDLSQTQCQIK----EIEHMYQNEQDKVNKYIGKQESLEERLSQLQSENMLLRQQLDDAHN 242 (305)
T ss_pred HHHHHHHHHHHHHHH----HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 468888888877654 3334444556677778888888888888888888888888887654
No 250
>PRK08032 fliD flagellar capping protein; Reviewed
Probab=31.29 E-value=2.9e+02 Score=22.43 Aligned_cols=52 Identities=13% Similarity=0.358 Sum_probs=34.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKEL 71 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL 71 (83)
-|...-.+|..+.+.....|+...++|.++-..||.+..++.. -..||...|
T Consensus 410 ~l~~~i~~l~~~i~~~~~rl~~~e~rl~~qF~ame~~~s~mns---~~s~L~~q~ 461 (462)
T PRK08032 410 GVNKTLKKLTKQYNAVSDSIDATIARYKAQFTQLDKLMTSLNS---TSSYLTQQF 461 (462)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhh
Confidence 3555566666677777777777777777777777777666553 446777665
No 251
>PF11945 WASH_WAHD: WAHD domain of WASH complex; InterPro: IPR021854 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. This entry represents the WASH subunit of the WASH complex. WASH genes duplicated to multiple chromosomal ends during primate evolution, with highest copy number reached in humans, whose WASH repertoires probably vary extensively among individuals []. It is therefore difficult to determine which gene is functional or not. The telomeric region of chromosome 9p is paralogous to the pericentromeric regions of chromosome 9 as well as to 2q. Paralogous regions contain 7 transcriptional units. Duplicated WASH genes are also present in the Xq/Yq pseudoautosomal region, as well as on chromosome 1 and 15. The chromosome 16 copy seems to be a pseudogene.
Probab=31.18 E-value=1.8e+02 Score=22.66 Aligned_cols=50 Identities=8% Similarity=0.167 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIK 69 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~ 69 (83)
-+..+...|...++.+-.-+...|.+-.. +|+.|+.+....+.|...|.+
T Consensus 22 qi~~aL~~L~~v~~diF~rI~~Rv~~~~~---~l~~i~~Ri~~~qaKi~~l~g 71 (297)
T PF11945_consen 22 QIADALEYLDKVSNDIFSRISARVERNRE---RLQAIQQRIEVAQAKIEKLQG 71 (297)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhC
Confidence 35677888888888888888887777544 455777777777777776643
No 252
>COG3416 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.11 E-value=71 Score=24.51 Aligned_cols=21 Identities=19% Similarity=0.355 Sum_probs=10.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~ 52 (83)
...-+..+|++|+++|+.|+.
T Consensus 56 ALk~a~~~i~eLe~ri~~lq~ 76 (233)
T COG3416 56 ALKKASTQIKELEKRIAILQA 76 (233)
T ss_pred HHHHHHHHHHHHHHHHHHHhc
Confidence 334444555555555555443
No 253
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=30.99 E-value=1.4e+02 Score=18.87 Aligned_cols=42 Identities=19% Similarity=0.329 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 29 VDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 29 Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
++++..-++..++.|+..++. +...++.++.....++..++.
T Consensus 75 ~~eA~~~l~~r~~~l~~~~~~---l~~~~~~~~~~~~~~~~~l~~ 116 (120)
T PF02996_consen 75 LEEAIEFLKKRIKELEEQLEK---LEKELAELQAQIEQLEQTLQQ 116 (120)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHCHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 455666666667766655443 344555566666666665554
No 254
>PF04108 APG17: Autophagy protein Apg17 ; InterPro: IPR007240 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Autophagy protein 17 (Apg17) is required for activating Apg1 protein kinases. This entry also contains Autophagy protein 11 which is involved in cytoplasm to vacuole transport (Cvt) and pexophagy. ; GO: 0006914 autophagy
Probab=30.92 E-value=2.8e+02 Score=22.18 Aligned_cols=66 Identities=17% Similarity=0.300 Sum_probs=45.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 11 SHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQ----VEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 11 shQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~----le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+.+-.+||-.+-.+=..+|+.+..+|++-+..+..-..+ |......+..+.+-..-+-.+|+.|.+
T Consensus 235 ~~~e~~e~l~Vl~~Da~El~~V~~el~~~~~~~~~~~~~~~k~l~~~~~~~~~~~~~~~~~~~~l~~~~~ 304 (412)
T PF04108_consen 235 SEEERQEMLEVLENDAQELPDVVKELQERLDEMENNEERTKKLLQSQRDHIRELYNALSEALEELRKFGE 304 (412)
T ss_pred ChHHHHHHHHHHHcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556777776666777777788888877777777777 666666666666555555666666665
No 255
>cd04790 HTH_Cfa-like_unk Helix-Turn-Helix DNA binding domain of putative Cfa-like transcription regulators. Putative helix-turn-helix (HTH) MerR-like transcription regulator; conserved, Cfa-like, unknown proteins (~172 a.a.). The N-terminal domain of these proteins appears to be related to the HTH domain of Cfa, a cyclopropane fatty acid synthase. These Cfa-like proteins have a unique C-terminal domain with conserved histidines (motif HXXFX7HXXF). Based on sequence similarity of the N-terminal domains, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domain
Probab=30.76 E-value=1.9e+02 Score=20.26 Aligned_cols=27 Identities=11% Similarity=0.332 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
..-|...+++|+.+|+.|...+..+..
T Consensus 76 ~~~L~~~~~~l~~ei~~L~~~~~~l~~ 102 (172)
T cd04790 76 TDVLRRRLAELNREIQRLRQQQRAIAT 102 (172)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334555555555555555555544443
No 256
>PF10392 COG5: Golgi transport complex subunit 5; InterPro: IPR019465 The conserved oligomeric Golgi (COG) complex is a peripheral membrane complex involved in intra-Golgi protein trafficking. Subunit 5 is located in the smaller, B lobe, together with subunits 6-8, and has been shown to bind subunits 1 and 7 [].
Probab=30.76 E-value=1.7e+02 Score=19.58 Aligned_cols=48 Identities=21% Similarity=0.291 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
+...|..-...|..-++....+.-+--.++......|+.++..+..||
T Consensus 73 ~l~~v~~~v~~L~~s~~RL~~eV~~Py~~~~~~~~~L~rl~~t~~LLR 120 (132)
T PF10392_consen 73 VLQAVRSSVESLQSSYERLRSEVIEPYEKIQKLTSQLERLHQTSDLLR 120 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444444444444444455555555555666666666665
No 257
>PRK14155 heat shock protein GrpE; Provisional
Probab=30.68 E-value=1.2e+02 Score=22.54 Aligned_cols=15 Identities=13% Similarity=0.087 Sum_probs=6.1
Q ss_pred hhHHHHHHHHHHHHH
Q psy7799 62 NKAHYLIKELDMFQE 76 (83)
Q Consensus 62 nKa~~L~~eL~~F~~ 76 (83)
+|.-.+..++++|.+
T Consensus 34 d~~lR~~AefeN~RK 48 (208)
T PRK14155 34 DQALRYAAEAENTKR 48 (208)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333334444444433
No 258
>PRK12714 flgK flagellar hook-associated protein FlgK; Provisional
Probab=30.39 E-value=3.4e+02 Score=23.03 Aligned_cols=59 Identities=15% Similarity=0.343 Sum_probs=33.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHh-----hHHhhhhHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHED-------IKQLDSHLKQVEEASNN-----AKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~e-------i~~L~~~i~~le~i~~~-----~K~lrnKa~~L~~eL~~F 74 (83)
+.|...|..+..+++..+++...+ |..+-++|..|..-... ...|+++=..|-.||-.+
T Consensus 137 ~~La~~f~~~~~~L~~~~~~~n~~i~~~V~~IN~l~~~IA~LN~~I~~~~~~~~ndLlDqRD~ll~eLS~~ 207 (624)
T PRK12714 137 NSLATRFKQLNGQMDSLSNEVNSGLTSSVDEVNRLTQQIAKINGTIGSSAQNAAPDLLDQRDALVSKLVGY 207 (624)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHHHHhh
Confidence 445666666666666666555533 33444444444433332 356777777777777544
No 259
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=30.37 E-value=2.6e+02 Score=21.65 Aligned_cols=23 Identities=13% Similarity=0.608 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASN 55 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~ 55 (83)
..++..||..|..+...++.-..
T Consensus 73 ~~~l~~el~~le~e~~~l~~eE~ 95 (314)
T PF04111_consen 73 REELDQELEELEEELEELDEEEE 95 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555554444
No 260
>TIGR02978 phageshock_pspC phage shock protein C. All members of this protein family are the phage shock protein PspC. These proteins contain a PspC domain, as do other members of the larger family of proteins described by Pfam model pfam04024. The phage shock regulon is restricted to the Proteobacteria and somewhat sparsely distributed there. It is expressed, under positive control of a sigma-54-dependent transcription factor, PspF, which binds and is modulated by PspA. Stresses that induce the psp regulon include phage secretin overexpression, ethanol, heat shock, and protein export defects.
Probab=30.37 E-value=99 Score=21.08 Aligned_cols=26 Identities=15% Similarity=0.368 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~i 53 (83)
......++++.+.++++.+++++|.-
T Consensus 81 ~~~~~l~~~~~~~~~~e~Rl~~mE~y 106 (121)
T TIGR02978 81 SPRQALREVKREFRDLERRLRNMERY 106 (121)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667788889999999999999863
No 261
>PRK14127 cell division protein GpsB; Provisional
Probab=30.13 E-value=1.8e+02 Score=19.66 Aligned_cols=16 Identities=6% Similarity=0.401 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHHH
Q psy7799 36 MHEDIKQLDSHLKQVE 51 (83)
Q Consensus 36 L~~ei~~L~~~i~~le 51 (83)
|++++.+|..++..++
T Consensus 49 Lk~e~~~l~~~l~e~~ 64 (109)
T PRK14127 49 LQQENARLKAQVDELT 64 (109)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4444444444444333
No 262
>PF08372 PRT_C: Plant phosphoribosyltransferase C-terminal; InterPro: IPR013583 This domain is found at the C terminus of phosphoribosyltransferases and phosphoribosyltransferase-like proteins. It contains putative transmembrane regions. It often appears together with calcium-ion dependent C2 domains (IPR000008 from INTERPRO).
Probab=30.10 E-value=64 Score=23.04 Aligned_cols=46 Identities=15% Similarity=0.182 Sum_probs=31.9
Q ss_pred HHHHHHHHHHH------HHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 33 TDDMHEDIKQL------DSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 33 ~~eL~~ei~~L------~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
..|+.+|.+.- +.=-.++|.++.-+....|-.+.+++.+|++.+.+
T Consensus 38 ~deldEEfD~~ps~~~~~~lr~Rydrlr~va~rvQ~vlgd~At~gERl~all 89 (156)
T PF08372_consen 38 PDELDEEFDTFPSSRPPDSLRMRYDRLRSVAGRVQNVLGDVATQGERLQALL 89 (156)
T ss_pred cchhhhhhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 45666665421 11223677777777888899999999999988754
No 263
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=29.92 E-value=4.4e+02 Score=24.20 Aligned_cols=53 Identities=11% Similarity=0.179 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
..-........++..+|+.+...++.|.....++..+|.+..-++.+++.+..
T Consensus 186 k~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~ir~~l~~~q~kie~~~~ 238 (1311)
T TIGR00606 186 KALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSRE 238 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344445555555666666666655555555555555555555554444443
No 264
>KOG2398|consensus
Probab=29.72 E-value=3.7e+02 Score=23.20 Aligned_cols=38 Identities=16% Similarity=0.399 Sum_probs=33.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i 53 (83)
.++...|.++-+.++-++++.+++-.+++.+-+.+|.-
T Consensus 149 ~~~~~~y~~~~~~~~~vr~~w~~~~~~~c~~fQ~~Ee~ 186 (611)
T KOG2398|consen 149 KEAREEYRSLVAKLEKVRKDWEQEMTDLCLKFQEIEES 186 (611)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 57788899999999999999999999999988877753
No 265
>PF09177 Syntaxin-6_N: Syntaxin 6, N-terminal; InterPro: IPR015260 Members of this entry, which are found in the amino terminus of various SNARE proteins, adopt a structure consisting of an antiparallel three-helix bundle. Their exact function has not been determined, though it is known that they regulate the SNARE motif, as well as mediate various protein-protein interactions involved in membrane-transport []. ; GO: 0048193 Golgi vesicle transport, 0016020 membrane; PDB: 1LVF_B 2C5I_T 2C5J_A 2C5K_T 4DND_A.
Probab=29.66 E-value=1.5e+02 Score=18.69 Aligned_cols=32 Identities=19% Similarity=0.304 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 22 FARLCQLVDRATDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 22 f~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i 53 (83)
|..+.........+++..+.+|+.-|...|.-
T Consensus 37 ~~~~~~eL~~~l~~ie~~L~DL~~aV~ive~n 68 (97)
T PF09177_consen 37 LKWLKRELRNALQSIEWDLEDLEEAVRIVEKN 68 (97)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 34444555555555555555555555554443
No 266
>PF08537 NBP1: Fungal Nap binding protein NBP1; InterPro: IPR013743 NBP1 is a nuclear protein which has been shown in Saccharomyces cerevisiae (Bakers yeast) to be essential for the G2/M transition of the cell cycle.
Probab=29.66 E-value=1.7e+02 Score=23.54 Aligned_cols=36 Identities=22% Similarity=0.351 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
+|+.+|.+++.+ |+.++..++--.-|-.||++-|+.
T Consensus 186 ~l~~~l~~~~~e---L~~~~k~L~faqekn~LlqslLdd 221 (323)
T PF08537_consen 186 ELEERLNDLEKE---LEITKKDLKFAQEKNALLQSLLDD 221 (323)
T ss_pred HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhh
Confidence 455555555444 445777888888899999998875
No 267
>PF12548 DUF3740: Sulfatase protein; InterPro: IPR024609 This uncharacterised domain is found in the C-terminal region of extracellular sulphatase proteins.
Probab=29.61 E-value=1.3e+02 Score=21.16 Aligned_cols=31 Identities=19% Similarity=0.314 Sum_probs=26.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
.....++.+|+.|..+|.-|=+|-..+|.=|
T Consensus 106 ~hr~~ID~eIe~Lq~Ki~~LKeiR~hLk~~R 136 (145)
T PF12548_consen 106 DHRLHIDHEIETLQDKIKNLKEIRGHLKKKR 136 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 4567788999999999999999988888655
No 268
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=29.55 E-value=2.8e+02 Score=24.30 Aligned_cols=19 Identities=16% Similarity=0.295 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy7799 35 DMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i 53 (83)
+|+.+..+++.+...++..
T Consensus 524 ~l~~~~~~~e~~~~~~~~~ 542 (782)
T PRK00409 524 SLEELERELEQKAEEAEAL 542 (782)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333
No 269
>KOG0963|consensus
Probab=29.50 E-value=3.2e+02 Score=23.94 Aligned_cols=46 Identities=15% Similarity=0.422 Sum_probs=40.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
=-.+|+.++-.++.++..+++.+.+...|+-+..-++..++.|.++
T Consensus 122 e~~~lk~~lee~~~el~~~k~qq~~v~~l~e~l~k~~~~~~~~ie~ 167 (629)
T KOG0963|consen 122 ENEELKEELEEVNNELADLKTQQVTVRNLKERLRKLEQLLEIFIEN 167 (629)
T ss_pred hHHHHHHHHHHHHHHHhhhhhhHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 3448888889999999999999999999999999999999998874
No 270
>KOG0930|consensus
Probab=29.44 E-value=1.4e+02 Score=24.32 Aligned_cols=48 Identities=21% Similarity=0.343 Sum_probs=35.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhh
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNK 63 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnK 63 (83)
.-||...-.|=.+.+|+.+ -|++||.+...+|+.++.....-.+=|||
T Consensus 16 ~~eL~~ir~rk~qL~deIq-~Lk~Ei~ev~~eid~~~~~e~skt~q~~k 63 (395)
T KOG0930|consen 16 RMELENIRRRKQELLDEIQ-RLKDEIAEVMEEIDNLESTEESKTKQRNK 63 (395)
T ss_pred HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhhccccchhhHHHhH
Confidence 4577777788778888765 48999999999999998876533333444
No 271
>PRK01919 tatB sec-independent translocase; Provisional
Probab=29.41 E-value=2.3e+02 Score=20.74 Aligned_cols=28 Identities=14% Similarity=0.373 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIK 41 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~ 41 (83)
+-+.+-..+.++...++.+..+++.|++
T Consensus 28 ~aRtlGk~i~k~Rr~~~d~K~ev~~E~e 55 (169)
T PRK01919 28 VARTAGALFGRAQRYINDVKAEVSREIE 55 (169)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4556667777788888888888877763
No 272
>PF02994 Transposase_22: L1 transposable element; InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=29.40 E-value=2.3e+02 Score=22.47 Aligned_cols=13 Identities=15% Similarity=0.498 Sum_probs=5.0
Q ss_pred HHHHHHHHHHHHH
Q psy7799 37 HEDIKQLDSHLKQ 49 (83)
Q Consensus 37 ~~ei~~L~~~i~~ 49 (83)
...|++++.+|..
T Consensus 143 ~~Ri~e~Eeris~ 155 (370)
T PF02994_consen 143 NSRIDELEERISE 155 (370)
T ss_dssp -HHHHHHHHHHHH
T ss_pred HHHHHHHHhHHHH
Confidence 3344444444433
No 273
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=29.35 E-value=2.8e+02 Score=23.41 Aligned_cols=30 Identities=27% Similarity=0.374 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 21 TFARLCQLVDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 21 tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
+=.|..+.|+..+.++..+..+|..+++++
T Consensus 99 id~~i~~av~~~~~~~~~~~~ql~~~~~~~ 128 (472)
T TIGR03752 99 IDQQIQQAVQSETQELTKEIEQLKSERQQL 128 (472)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHH
Confidence 334445555555566666666665555544
No 274
>KOG1853|consensus
Probab=29.21 E-value=3e+02 Score=22.02 Aligned_cols=70 Identities=13% Similarity=0.233 Sum_probs=40.4
Q ss_pred hhhhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 4 DLTSSNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLK-QVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 4 ~~tssncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~-~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
+|.|---..|.|.+|+-+|..=-.+..+-.++..+-.++++++++ +|+.+..+.+.|-.....|-.|++.
T Consensus 7 ~fss~~eE~~ywk~l~~~ykq~f~~~reEl~EFQegSrE~EaelesqL~q~etrnrdl~t~nqrl~~E~e~ 77 (333)
T KOG1853|consen 7 TFSSDLEEDQYWKLLHHEYKQHFLQMREELNEFQEGSREIEAELESQLDQLETRNRDLETRNQRLTTEQER 77 (333)
T ss_pred cccchhhHHHHHhhhHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555678999998888876655555544444444444444443 3555555555555555555555543
No 275
>PRK14163 heat shock protein GrpE; Provisional
Probab=28.61 E-value=1.2e+02 Score=22.71 Aligned_cols=14 Identities=7% Similarity=-0.111 Sum_probs=6.3
Q ss_pred HHHHHHHHHHHhcc
Q psy7799 67 LIKELDMFQEAYLV 80 (83)
Q Consensus 67 L~~eL~~F~~~yL~ 80 (83)
...-...|.+.+|+
T Consensus 85 ~~~a~~~~~~~LLp 98 (214)
T PRK14163 85 KEIAVANLLSELLP 98 (214)
T ss_pred HHHHHHHHHHHHhh
Confidence 33344445555443
No 276
>PF05911 DUF869: Plant protein of unknown function (DUF869); InterPro: IPR008587 This family consists of a number of sequences found in plants. The function of this family is unknown.
Probab=28.55 E-value=4.2e+02 Score=23.55 Aligned_cols=43 Identities=19% Similarity=0.267 Sum_probs=26.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH----HHHhhHHhhhhHHHHHHHHHH
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEE----ASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~----i~~~~K~lrnKa~~L~~eL~~ 73 (83)
....-++.+++.+......+|. +....+.+++|..-|+.||+.
T Consensus 645 eS~s~~E~ql~~~~e~~e~le~~~~~~e~E~~~l~~Ki~~Le~Ele~ 691 (769)
T PF05911_consen 645 ESNSLAETQLKAMKESYESLETRLKDLEAEAEELQSKISSLEEELEK 691 (769)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555555544443 366778888888888888764
No 277
>PRK10697 DNA-binding transcriptional activator PspC; Provisional
Probab=28.55 E-value=1.1e+02 Score=21.02 Aligned_cols=26 Identities=12% Similarity=0.270 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEEA 53 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~i 53 (83)
-.....++++.+.+.++.+|.++|.-
T Consensus 78 s~~~~l~~~~~~~~~~e~Rlr~mE~y 103 (118)
T PRK10697 78 SSSELLDEVDRELAAGEQRLREMERY 103 (118)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34566788889999999999999863
No 278
>KOG4005|consensus
Probab=28.52 E-value=2.2e+02 Score=22.48 Aligned_cols=43 Identities=23% Similarity=0.336 Sum_probs=26.8
Q ss_pred HHHHHHHHHHHHHHHHHH----HHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQV----EEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~l----e~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
+.+|+.+|++|..+-+.| |.+...-+-|=+|-..|..+|+-..
T Consensus 92 m~eme~~i~dL~een~~L~~en~~Lr~~n~~L~~~n~el~~~le~~~ 138 (292)
T KOG4005|consen 92 MEEMEYEIKDLTEENEILQNENDSLRAINESLLAKNHELDSELELLR 138 (292)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH
Confidence 457788888887665544 3344444555567777777777433
No 279
>PF08278 DnaG_DnaB_bind: DNA primase DnaG DnaB-binding ; InterPro: IPR013173 Eubacterial DnaG primases interact with several factors to form the replisome. One of these factors is DnaB, a helicase. This domain has been demonstrated to be responsible for the interaction between DnaG and DnaB []. This domain has a multi-helical structure that forms an orthogonal bundle [].; GO: 0003896 DNA primase activity, 0006269 DNA replication, synthesis of RNA primer; PDB: 2HAJ_A 1T3W_B.
Probab=28.52 E-value=1.7e+02 Score=18.82 Aligned_cols=30 Identities=27% Similarity=0.500 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLK 48 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~ 48 (83)
-+.+|..+++.||..+. ++.+|++|..+..
T Consensus 82 ~~~~ef~d~l~~L~~~~------~~~~i~~L~~k~~ 111 (127)
T PF08278_consen 82 DIEQEFQDALARLQEQA------LERRIEELKAKPR 111 (127)
T ss_dssp HHHHHHHHHHHHHHHHH------HHHHHHHHHHHHT
T ss_pred hHHHHHHHHHHHHHHHH------HHHHHHHHHHhhc
Confidence 46778888888887765 5566666655554
No 280
>PF06730 FAM92: FAM92 protein; InterPro: IPR009602 This family consists of several eukaryotic sequences of around 270 residues in length. Members of this family are found in mouse, human and Drosophila melanogaster. The function of this family is unknown.
Probab=28.47 E-value=1e+02 Score=23.38 Aligned_cols=28 Identities=21% Similarity=0.328 Sum_probs=23.3
Q ss_pred HHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 50 VEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 50 le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+-....+.-.||.|+..|...|..|+++
T Consensus 34 ~a~ytRKtArLRDk~D~lak~l~~yA~~ 61 (219)
T PF06730_consen 34 FAAYTRKTARLRDKGDELAKQLQDYANT 61 (219)
T ss_pred HHHHHHHHHHhchhhHHHHHHHHHHHhc
Confidence 3456778888999999999999999875
No 281
>PF06818 Fez1: Fez1; InterPro: IPR009638 This family represents the eukaryotic Fez1 protein. Fez1 contains a leucine-zipper region with similarity to the DNA-binding domain of the cAMP-responsive activating-transcription factor 5 []. There is evidence that Fez1 inhibits cancer cell growth through regulation of mitosis, and that its alterations result in abnormal cell growth []. Note that some family members contain more than one copy of this region.; GO: 0005737 cytoplasm, 0016020 membrane
Probab=28.38 E-value=2.6e+02 Score=20.96 Aligned_cols=37 Identities=27% Similarity=0.373 Sum_probs=24.7
Q ss_pred HHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 42 QLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 42 ~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
+|+.+=..|....+.+..||.|.+-|+.|+-......
T Consensus 67 ELE~ce~ELqr~~~Ea~lLrekl~~le~El~~Lr~~l 103 (202)
T PF06818_consen 67 ELEVCENELQRKKNEAELLREKLGQLEAELAELREEL 103 (202)
T ss_pred hHHHhHHHHHHHhCHHHHhhhhhhhhHHHHHHHHHHH
Confidence 3333444444455566778889999998888776655
No 282
>PF05384 DegS: Sensor protein DegS; InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=28.35 E-value=2.2e+02 Score=20.27 Aligned_cols=61 Identities=11% Similarity=0.244 Sum_probs=38.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
|+-.....-....+...++|++=-.++..-|+..|.+....+.-|+|.......++.|++.
T Consensus 17 qIf~I~E~~R~E~~~l~~EL~evk~~v~~~I~evD~Le~~er~aR~rL~eVS~~f~~ysE~ 77 (159)
T PF05384_consen 17 QIFEIAEQARQEYERLRKELEEVKEEVSEVIEEVDKLEKRERQARQRLAEVSRNFDRYSEE 77 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCHH
Confidence 3333344444455555555555444555567777888888888888888777777766543
No 283
>PF10915 DUF2709: Protein of unknown function (DUF2709); InterPro: IPR024484 Members of this family appear restricted to Chlamydiales. Their function is unknown.
Probab=28.32 E-value=35 Score=26.06 Aligned_cols=19 Identities=26% Similarity=0.426 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHhcccCC
Q psy7799 65 HYLIKELDMFQEAYLVNMS 83 (83)
Q Consensus 65 ~~L~~eL~~F~~~yL~~~~ 83 (83)
+-+..-+++|...||+||+
T Consensus 166 ~s~~avieDF~~~ylk~mt 184 (238)
T PF10915_consen 166 HSKPAVIEDFEKNYLKPMT 184 (238)
T ss_pred cccHHHHHHHHHccccccc
Confidence 3456778999999999984
No 284
>PLN03230 acetyl-coenzyme A carboxylase carboxyl transferase; Provisional
Probab=28.28 E-value=1.5e+02 Score=24.72 Aligned_cols=35 Identities=20% Similarity=0.387 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
.||..--.-|....+...-++.+||..|+.++..+
T Consensus 83 ~ele~ki~el~~~~~~~~~~~~~ei~~l~~~~~~~ 117 (431)
T PLN03230 83 VDLENRIDEVRELANKTGVDFSAQIAELEERYDQV 117 (431)
T ss_pred HHHHHHHHHHHhhhhcccccHHHHHHHHHHHHHHH
Confidence 34444444444444333445555566655555554
No 285
>cd01109 HTH_YyaN Helix-Turn-Helix DNA binding domain of the MerR-like transcription regulators YyaN and YraB. Putative helix-turn-helix (HTH) MerR-like transcription regulators of Bacillus subtilis, YyaN and YraB, and related proteins; N-terminal domain. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=28.18 E-value=1.7e+02 Score=18.76 Aligned_cols=27 Identities=11% Similarity=0.252 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNN 56 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~ 56 (83)
+....-|+..+.+++.+|+.|+.....
T Consensus 78 ~~~~~~l~~~~~~l~~~i~~l~~~~~~ 104 (113)
T cd01109 78 PERLELLEEHREELEEQIAELQETLAY 104 (113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334455666666666666666655544
No 286
>KOG0288|consensus
Probab=28.18 E-value=3.7e+02 Score=22.69 Aligned_cols=51 Identities=16% Similarity=0.091 Sum_probs=30.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 27 QLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 27 ~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+.+.....++-+--++.+.=+.++...--.++.++.|.+..|.+|++.++.
T Consensus 13 qr~~~~~~~laq~~k~~s~~~aq~~~~~a~~~ai~a~~~~~E~~l~~Lq~e 63 (459)
T KOG0288|consen 13 QRLIDLNTELAQCEKAQSRLSAQLVILRAESRAIKAKLQEKELELNRLQEE 63 (459)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333333334444445566666667788888888888888877653
No 287
>PF06419 COG6: Conserved oligomeric complex COG6; InterPro: IPR010490 COG6 is a component of the conserved oligomeric golgi complex, which is composed of eight different subunits and is required for normal golgi morphology and localisation.
Probab=28.09 E-value=3.7e+02 Score=22.71 Aligned_cols=66 Identities=21% Similarity=0.286 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHH---HHHHHHHh-ccc
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKE---LDMFQEAY-LVN 81 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~e---L~~F~~~y-L~~ 81 (83)
+.+...+.++...|+....--++=.++|...=..-..+-.+...|+++-..++.. |..|.+.| |++
T Consensus 41 ~~v~~~l~~~~~~v~~l~~~~~~~~~~l~~~~~~t~~ll~~~~~L~~~~~~~~~k~~ll~~f~~~f~Ls~ 110 (618)
T PF06419_consen 41 SPVNRQLKRLQSDVDKLNSSCDQMQDRLSAAKSETSDLLEEASELREQKEELELKKKLLDAFLERFTLSE 110 (618)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCH
No 288
>cd04786 HTH_MerR-like_sg7 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 7) with a conserved cysteine present in the C-terminal portion of the protein. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic su
Probab=28.03 E-value=1.8e+02 Score=19.51 Aligned_cols=30 Identities=7% Similarity=0.185 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
+....-|+..+.+++.+|..|..+...+..
T Consensus 77 ~~~~~~l~~k~~~i~~~i~~L~~~~~~L~~ 106 (131)
T cd04786 77 DELLAALERKVADIEALEARLAQNKAQLLV 106 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556677777777777777766655443
No 289
>PF14817 HAUS5: HAUS augmin-like complex subunit 5
Probab=28.00 E-value=4e+02 Score=23.09 Aligned_cols=54 Identities=15% Similarity=0.191 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLI 68 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~ 68 (83)
+.||...-.||.+.+.....+++.--.++..+=..++....+....|.|..+|+
T Consensus 81 r~~L~~everLraei~~l~~~I~~~e~e~~~~e~~~~q~~~~~~~~~~k~~LL~ 134 (632)
T PF14817_consen 81 RRELEKEVERLRAEIQELDKEIESREREVSRQEASREQMLDKISDSRHKQLLLE 134 (632)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 446666666776666655555444444444333344444444466777776665
No 290
>PRK14151 heat shock protein GrpE; Provisional
Probab=27.97 E-value=1.3e+02 Score=21.57 Aligned_cols=14 Identities=7% Similarity=0.138 Sum_probs=6.0
Q ss_pred hhHHHHHHHHHHHH
Q psy7799 62 NKAHYLIKELDMFQ 75 (83)
Q Consensus 62 nKa~~L~~eL~~F~ 75 (83)
++.-.+..+.++|.
T Consensus 41 d~~lR~~Ae~eN~r 54 (176)
T PRK14151 41 DQSLRAAADLQNVR 54 (176)
T ss_pred HHHHHHHHHHHHHH
Confidence 33334444444443
No 291
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=27.94 E-value=2.8e+02 Score=21.18 Aligned_cols=49 Identities=20% Similarity=0.219 Sum_probs=29.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
.+|.-...++-.++ ....-|+.++...-.++++..-++-+++.|.+++.
T Consensus 14 ~lD~e~~rl~~~~~---~~~~~l~k~~~e~e~~~~~~~~~~~e~e~le~qv~ 62 (239)
T COG1579 14 KLDLEKDRLEPRIK---EIRKALKKAKAELEALNKALEALEIELEDLENQVS 62 (239)
T ss_pred HHHHHHHHHHHhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444 33445666666667777777777777777777663
No 292
>PRK14141 heat shock protein GrpE; Provisional
Probab=27.85 E-value=1.3e+02 Score=22.41 Aligned_cols=15 Identities=13% Similarity=0.366 Sum_probs=6.4
Q ss_pred hhhHHHHHHHHHHHH
Q psy7799 61 RNKAHYLIKELDMFQ 75 (83)
Q Consensus 61 rnKa~~L~~eL~~F~ 75 (83)
++|.-.+..++++|.
T Consensus 51 kd~~lR~~Ae~eN~R 65 (209)
T PRK14141 51 KDRMLRLAAEMENLR 65 (209)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333344444444444
No 293
>KOG2629|consensus
Probab=27.81 E-value=3.2e+02 Score=21.83 Aligned_cols=53 Identities=13% Similarity=0.191 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 20 STFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 20 ~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
.+-.-|..+|-.+...|...-++|...+.+|.+.+. .+.+--.-|++|++.-.
T Consensus 136 ~~~~~l~~~va~v~q~~~~qq~Els~~L~~l~~~~~---~~s~~~~k~esei~~Ik 188 (300)
T KOG2629|consen 136 KSLNALMDEVAQVSQLLATQQSELSRALASLKNTLV---QLSRNIEKLESEINTIK 188 (300)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HhhhhHHHHHHHHHHHH
Confidence 333445555566666666666677777777666655 44444455666666654
No 294
>PRK15374 pathogenicity island 1 effector protein SipB; Provisional
Probab=27.76 E-value=4.1e+02 Score=23.15 Aligned_cols=45 Identities=9% Similarity=0.086 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.++++-+++-...-+.|+...++...-..+..-+++.|+.-..++
T Consensus 127 ~~ledaL~aaq~~ad~l~q~~~~~~~Aq~~l~~aq~~l~~lq~~a 171 (593)
T PRK15374 127 KEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKLTQAQNKL 171 (593)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHH
Confidence 445555555555566666666667778888888888888887766
No 295
>KOG4098|consensus
Probab=27.73 E-value=1.1e+02 Score=21.77 Aligned_cols=28 Identities=25% Similarity=0.216 Sum_probs=20.9
Q ss_pred HHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 51 EEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 51 e~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
|.|+.-.+.|.+-..-...||+.|.+.|
T Consensus 89 e~i~~~i~~l~~qL~~k~kElnkfk~~h 116 (140)
T KOG4098|consen 89 ENIEKVIKKLTDQLVQKGKELNKFKKDH 116 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4455555666677777889999999988
No 296
>PRK03100 sec-independent translocase; Provisional
Probab=27.49 E-value=2.2e+02 Score=19.95 Aligned_cols=36 Identities=14% Similarity=0.248 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHED----IKQLDSHLKQVE 51 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~e----i~~L~~~i~~le 51 (83)
+-+-.+...+...+.++++++++| ++++.+.++.+.
T Consensus 31 r~lG~~vr~~R~~~~~~~~~~~~elg~e~~dlrk~l~el~ 70 (136)
T PRK03100 31 RWTARALRQARDYASGATSQLREELGPEFDDLRKPLGELQ 70 (136)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 344445555555566666555553 445544444433
No 297
>PF11068 YlqD: YlqD protein; InterPro: IPR021297 This bacterial family of proteins has no known function. ; PDB: 4DCI_C.
Probab=27.44 E-value=2.1e+02 Score=19.71 Aligned_cols=21 Identities=14% Similarity=0.197 Sum_probs=8.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy7799 27 QLVDRATDDMHEDIKQLDSHL 47 (83)
Q Consensus 27 ~~Vd~~~~eL~~ei~~L~~~i 47 (83)
+++|.=...|+-+.+++.+++
T Consensus 30 ~~~d~el~QLefq~kr~~~e~ 50 (131)
T PF11068_consen 30 QQLDQELQQLEFQGKRMIKEI 50 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333333334444444444443
No 298
>PRK05683 flgK flagellar hook-associated protein FlgK; Validated
Probab=27.36 E-value=4.1e+02 Score=23.00 Aligned_cols=58 Identities=12% Similarity=0.294 Sum_probs=28.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHh-------hHHhhhhHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDI-------KQLDSHLKQVEEASNN-------AKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei-------~~L~~~i~~le~i~~~-------~K~lrnKa~~L~~eL~~F 74 (83)
.|...|..+..+++..+++...+| ..+-++|..|..-... ...|+++=..|-.||-.+
T Consensus 138 ~La~~fn~~~~~L~~l~~~vn~qI~~~V~~IN~l~~qIA~LN~qI~~~~~~G~~~NdLlDqRD~Ll~eLS~~ 209 (676)
T PRK05683 138 GLSKRFNSLSSQLNQQNSNINSQLSAMTDQVNNLTTSIASYNKQIAQASASGATPNDLLDARDEAVRQLNEL 209 (676)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCchHhHHHHHHHHHHHHhh
Confidence 344455555555444444444333 3333444444333322 446677777777777544
No 299
>PF06148 COG2: COG (conserved oligomeric Golgi) complex component, COG2; InterPro: IPR024602 This entry represents the uncharacterised N-terminal domain of subunit 2 of the COG complex. The COG complex comprises eight proteins COG1-8 and plays critical roles in Golgi structure and function [].; PDB: 2JQQ_A.
Probab=27.30 E-value=1.5e+02 Score=19.61 Aligned_cols=43 Identities=14% Similarity=0.365 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
+|..-...+..+|..+..++......+...++....+...-..
T Consensus 73 ~l~~~L~~~~~~v~~~~~~l~~~~~~i~~~l~~~~~l~~~k~~ 115 (133)
T PF06148_consen 73 ELRKPLSQFREEVESVRDELDNTQEEIEDKLEERKELREEKAL 115 (133)
T ss_dssp -HHHHHHHHHHHHHHHHHS-STTHHHHHHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444455555555555555555555555555544444333
No 300
>cd04785 HTH_CadR-PbrR-like Helix-Turn-Helix DNA binding domain of the CadR- and PbrR-like transcription regulators. Helix-turn-helix (HTH) CadR- and PbrR-like transcription regulators. CadR and PbrR regulate expression of the cadmium and lead resistance operons, respectively. These proteins are comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the C-terminal domains have three conserved cysteines which comprise a putative metal binding site. Some members in this group have a histidine-rich C-terminal extension. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements.
Probab=27.29 E-value=1.9e+02 Score=19.04 Aligned_cols=29 Identities=7% Similarity=0.156 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHh
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVL 60 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~l 60 (83)
...-++..+.+++.+|+.|..+...++.+
T Consensus 80 ~~~~l~~~~~~l~~~i~~L~~~~~~L~~~ 108 (126)
T cd04785 80 ADAIARAHLADVRARIADLRRLEAELKRM 108 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34446666777777777766666555443
No 301
>COG4694 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.28 E-value=3e+02 Score=24.40 Aligned_cols=51 Identities=18% Similarity=0.330 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHH
Q psy7799 14 VEQELSSTFARLCQLVD---RATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAH 65 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd---~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~ 65 (83)
+-||.. .|+++..-.+ .-..++..||+.|+.+|..+|..-...+..=++.+
T Consensus 440 ~VQe~~-~Y~g~ekk~n~LE~e~kn~~~ev~kls~ei~~ie~~l~~~~~~vke~n 493 (758)
T COG4694 440 DVQEYN-KYCGLEKKINNLEKEIKNNQEEVKKLSNEIKEIEKFLVSIKPIVKEIN 493 (758)
T ss_pred HHHHHH-HHHHHHHHHhHHHHHHHhhHHHHHHHHHHHHHHHHHHhhchhhHHHHH
Confidence 445555 6666654433 33344556666677777777766655555544443
No 302
>cd04789 HTH_Cfa Helix-Turn-Helix DNA binding domain of the Cfa transcription regulator. Putative helix-turn-helix (HTH) MerR-like transcription regulator; the N-terminal domain of Cfa, a cyclopropane fatty acid synthase and other related methyltransferases. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=27.22 E-value=1.7e+02 Score=18.59 Aligned_cols=28 Identities=14% Similarity=0.218 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHh
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVL 60 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~l 60 (83)
...+.+.+.+++.+|..|+.....+..+
T Consensus 73 ~~~l~~~~~~l~~~i~~l~~~~~~l~~~ 100 (102)
T cd04789 73 RSLLLERLSSLAEQIARKQQARDLLAAL 100 (102)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456667777777777776666555443
No 303
>COG4191 Signal transduction histidine kinase regulating C4-dicarboxylate transport system [Signal transduction mechanisms]
Probab=27.18 E-value=3.8e+02 Score=23.38 Aligned_cols=50 Identities=20% Similarity=0.225 Sum_probs=42.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhh
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRN 62 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrn 62 (83)
+.+.++..+.+.|...|.+=++||..--++|..+|.--|..+..++..++
T Consensus 324 ~r~~~~~~a~~eLE~rV~eRTadL~~~n~~l~~EIaer~~ae~~LR~~Qd 373 (603)
T COG4191 324 LRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRRAQD 373 (603)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667888999999999999999988888888999988888888877763
No 304
>PLN02678 seryl-tRNA synthetase
Probab=27.16 E-value=3.6e+02 Score=22.22 Aligned_cols=21 Identities=10% Similarity=0.004 Sum_probs=9.7
Q ss_pred HHHhhHHhhhhHHHHHHHHHH
Q psy7799 53 ASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 53 i~~~~K~lrnKa~~L~~eL~~ 73 (83)
+..+.+.+..+...++.+|..
T Consensus 83 Lk~ei~~le~~~~~~~~~l~~ 103 (448)
T PLN02678 83 LKKEITEKEAEVQEAKAALDA 103 (448)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444455555543
No 305
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=27.11 E-value=4.7e+02 Score=23.87 Aligned_cols=36 Identities=25% Similarity=0.461 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 20 STFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASN 55 (83)
Q Consensus 20 ~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~ 55 (83)
.-+..+...+.....+++++++.+...+..++...+
T Consensus 837 ~~~~~~~~~~~~~~~~le~~~~~~~~~~~~~~~~l~ 872 (1201)
T PF12128_consen 837 QELNQLQKEVKQRRKELEEELKALEEQLEQLEEQLR 872 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444444444444444333
No 306
>PLN02372 violaxanthin de-epoxidase
Probab=27.10 E-value=3.8e+02 Score=22.56 Aligned_cols=27 Identities=19% Similarity=0.407 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 22 FARLCQLVDRATDDMHEDIKQLDSHLK 48 (83)
Q Consensus 22 f~rL~~~Vd~~~~eL~~ei~~L~~~i~ 48 (83)
-.||.-.|++-.+++..|+.++..+|+
T Consensus 363 ~~~l~~~~e~~e~~i~~e~~~~~~e~~ 389 (455)
T PLN02372 363 LERLEKDVEEGEKTIVKEARQIEEELE 389 (455)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 457777777777777777665555555
No 307
>PF00957 Synaptobrevin: Synaptobrevin; InterPro: IPR001388 Synaptobrevin is an intrinsic membrane protein of small synaptic vesicles [], specialised secretory organelles of neurons that actively accumulate neurotransmitters and participate in their calcium-dependent release by exocytosis. Vesicle function is mediated by proteins in their membranes, although the precise nature of the protein-protein interactions underlying this are still uncertain []. Synaptobrevin may play a role in the molecular events underlying neurotransmitter release and vesicle recycling and may be involved in the regulation of membrane flow in the nerve terminal, a process mediated by interaction with low molecular weight GTP-binding proteins []. Synaptic vesicle-associated membrane proteins (VAMPs) from Torpedo californica (Pacific electric ray) and SNC1 from yeast are related to synaptobrevin.; GO: 0016192 vesicle-mediated transport, 0016021 integral to membrane; PDB: 3EGX_C 2NUP_C 3EGD_C 2NUT_C 1IOU_A 1H8M_A 3B5N_A 3ZYM_A 2NPS_A 1SFC_E ....
Probab=27.01 E-value=1.6e+02 Score=18.06 Aligned_cols=39 Identities=10% Similarity=0.306 Sum_probs=25.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le 51 (83)
.|+++++..|-.-+..-++... +=.+.++.|..+-+.|.
T Consensus 6 ~~i~~~v~~v~~im~~Ni~~ll-~Rge~L~~L~~kt~~L~ 44 (89)
T PF00957_consen 6 EQIQEQVEEVKNIMRENIDKLL-ERGEKLEELEDKTEELS 44 (89)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHH-HcCchHHHHHHHHHHHH
Confidence 4788888888888877777665 33455666655555443
No 308
>PF05531 NPV_P10: Nucleopolyhedrovirus P10 protein; InterPro: IPR008702 This family consists of several nucleopolyhedrovirus P10 proteins which are thought to be involved in the morphogenesis of the polyhedra [].; GO: 0019028 viral capsid
Probab=26.95 E-value=1.8e+02 Score=18.61 Aligned_cols=21 Identities=19% Similarity=0.213 Sum_probs=9.7
Q ss_pred HHHHHhhHHhhhhHHHHHHHH
Q psy7799 51 EEASNNAKVLRNKAHYLIKEL 71 (83)
Q Consensus 51 e~i~~~~K~lrnKa~~L~~eL 71 (83)
|....++..|-+|..-+.+-|
T Consensus 45 Da~~~~l~~l~~~V~~I~~iL 65 (75)
T PF05531_consen 45 DAQSAQLTTLNTKVNEIQDIL 65 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 444444444444555444433
No 309
>PF01442 Apolipoprotein: Apolipoprotein A1/A4/E domain; InterPro: IPR000074 Exchangeable apolipoproteins (apoA, apoC and apoE) have the same genomic structure and are members of a multi-gene family that probably evolved from a common ancestral gene. This entry includes the ApoA1, ApoA4 and ApoE proteins. ApoA1 and ApoA4 are part of the APOA1/C3/A4/A5 gene cluster on chromosome 11 []. Apolipoproteins function in lipid transport as structural components of lipoprotein particles, cofactors for enzymes and ligands for cell-surface receptors. In particular, apoA1 is the major protein component of high-density lipoproteins; apoA4 is thought to act primarily in intestinal lipid absorption; and apoE is a blood plasma protein that mediates the transport and uptake of cholesterol and lipid by way of its high affinity interaction with different cellular receptors, including the low-density lipoprotein (LDL) receptor. Recent findings with apoA1 and apoE suggest that the tertiary structures of these two members of the human exchangeable apolipoprotein gene family are related []. The three-dimensional structure of the LDL receptor-binding domain of apoE indicates that the protein forms an unusually elongated four-helix bundle that may be stabilised by a tightly packed hydrophobic core that includes leucine zipper-type interactions and by numerous salt bridges on the mostly charged surface. Basic amino acids important for LDL receptor binding are clustered into a surface patch on one long helix [].; GO: 0008289 lipid binding, 0006869 lipid transport, 0042157 lipoprotein metabolic process, 0005576 extracellular region; PDB: 1YA9_A 3S84_A 1NFN_A 1LE2_A 1B68_A 1BZ4_A 1OEG_A 2L7B_A 1LE4_A 1EA8_A ....
Probab=26.88 E-value=1.9e+02 Score=18.91 Aligned_cols=36 Identities=28% Similarity=0.446 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHL 47 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i 47 (83)
..+..++...-.+|....+.+...+.+.+..+...+
T Consensus 41 ~~l~~~~~~~~~~l~~~~~~~~~~i~~~~~~~~~~l 76 (202)
T PF01442_consen 41 ERLESELEELSDRLEERLDEVKERIEERIEELKNSL 76 (202)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455555555555555555555555555544443
No 310
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=26.86 E-value=2.3e+02 Score=19.95 Aligned_cols=22 Identities=18% Similarity=0.451 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy7799 29 VDRATDDMHEDIKQLDSHLKQV 50 (83)
Q Consensus 29 Vd~~~~eL~~ei~~L~~~i~~l 50 (83)
+|.+..-+++++++|++-++.+
T Consensus 92 ~~eAie~l~k~~~~l~~~~~~l 113 (145)
T COG1730 92 ADEAIEFLKKRIEELEKAIEKL 113 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4555566666666666655553
No 311
>PRK14159 heat shock protein GrpE; Provisional
Probab=26.62 E-value=1.6e+02 Score=21.30 Aligned_cols=27 Identities=11% Similarity=0.151 Sum_probs=14.1
Q ss_pred HHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 51 EEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 51 e~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+.+...++.++++.-.+..+.++|.+.
T Consensus 33 ~~l~~e~~elkd~~lR~~AdfeN~rkR 59 (176)
T PRK14159 33 NKLQKDYDELKDKYMRANAEFENIKKR 59 (176)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444445555555555566655543
No 312
>PF11593 Med3: Mediator complex subunit 3 fungal; InterPro: IPR020998 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents the subunit Med3, which is a physical target for Cyc8-Tup1, a yeast transcriptional co-repressor []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=26.52 E-value=93 Score=25.50 Aligned_cols=28 Identities=18% Similarity=0.265 Sum_probs=24.4
Q ss_pred HHhhHHhhhhHHHHHHHHHHHHHHh--ccc
Q psy7799 54 SNNAKVLRNKAHYLIKELDMFQEAY--LVN 81 (83)
Q Consensus 54 ~~~~K~lrnKa~~L~~eL~~F~~~y--L~~ 81 (83)
+.+.-.+|.|.-.|-+.|..|++.| |+|
T Consensus 64 qeKFl~IR~KlleL~~~lQ~lS~df~~LqP 93 (379)
T PF11593_consen 64 QEKFLLIRSKLLELYNKLQELSSDFQKLQP 93 (379)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhch
Confidence 4556689999999999999999999 776
No 313
>PF09766 FimP: Fms-interacting protein; InterPro: IPR019163 This entry represents Thoc5 which is one of the subunits of the THO complex, which additionally contains: HPR1, Thoc2, Thoc6 and Thoc7. The evolutionarily conserved multisubunit THO complex, which is recruited to actively transcribed genes is required for the efficient expression of genes that have internal tandem repeats. It is suggested that the THO complex functions to rectify aberrant structures that arise during transcription [, ] and is required for cell proliferation and for proper export of heat-shock mRNAs under heat stress []. This entry also identifies the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes [].
Probab=26.29 E-value=3.2e+02 Score=21.46 Aligned_cols=10 Identities=20% Similarity=0.205 Sum_probs=3.7
Q ss_pred HHHHHHHHHH
Q psy7799 65 HYLIKELDMF 74 (83)
Q Consensus 65 ~~L~~eL~~F 74 (83)
.-|...|..|
T Consensus 132 ~~l~~~L~~l 141 (355)
T PF09766_consen 132 DSLPPQLKSL 141 (355)
T ss_pred HHhHHHHHHH
Confidence 3333333333
No 314
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=26.22 E-value=4.4e+02 Score=24.05 Aligned_cols=55 Identities=20% Similarity=0.325 Sum_probs=32.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKEL 71 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL 71 (83)
.+..-+..|.+......++++..+++++.++..++.....+....++...+-..|
T Consensus 827 ~~~~~~~~l~~~~~~~~~~~~~~~~~le~~~~~~~~~~~~~~~~l~~l~~~~~~l 881 (1201)
T PF12128_consen 827 DLEQELQELEQELNQLQKEVKQRRKELEEELKALEEQLEQLEEQLRRLRDLLEKL 881 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 3445556666666666666667777777777666666666665555443333333
No 315
>PRK14154 heat shock protein GrpE; Provisional
Probab=26.19 E-value=1.4e+02 Score=22.20 Aligned_cols=15 Identities=13% Similarity=0.173 Sum_probs=6.0
Q ss_pred hhhHHHHHHHHHHHH
Q psy7799 61 RNKAHYLIKELDMFQ 75 (83)
Q Consensus 61 rnKa~~L~~eL~~F~ 75 (83)
++|.-.+..+.++|.
T Consensus 72 kd~~lRl~ADfeNyR 86 (208)
T PRK14154 72 KTQYLRAQAEMDNLR 86 (208)
T ss_pred HHHHHHHHHHHHHHH
Confidence 333333444444443
No 316
>PF04201 TPD52: Tumour protein D52 family; InterPro: IPR007327 The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family []. These proteins have an N-terminal coiled-coil that allows members to form homo- and heterodimers with each other [].
Probab=26.19 E-value=1.8e+02 Score=21.12 Aligned_cols=33 Identities=6% Similarity=0.368 Sum_probs=22.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHH----HHhhHHhhhhHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEA----SNNAKVLRNKAH 65 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i----~~~~K~lrnKa~ 65 (83)
..+|..|+.+++.+|..|-.+ .+++..||-|-+
T Consensus 31 ~eeLr~EL~KvEeEI~TLrqvL~aKer~~~eLKrkLG 67 (162)
T PF04201_consen 31 REELRSELAKVEEEIQTLRQVLAAKERHCAELKRKLG 67 (162)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHC
Confidence 467777888888887766655 455666666654
No 317
>KOG0977|consensus
Probab=26.05 E-value=4.2e+02 Score=22.71 Aligned_cols=24 Identities=17% Similarity=0.387 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 36 MHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 36 L~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
|++|.+.|.+++.+|..--..++.
T Consensus 167 le~e~~~Lk~en~rl~~~l~~~r~ 190 (546)
T KOG0977|consen 167 LEDELKRLKAENSRLREELARARK 190 (546)
T ss_pred HHHHHHHHHHHhhhhHHHHHHHHH
Confidence 444444444444444444444443
No 318
>PF08700 Vps51: Vps51/Vps67; InterPro: IPR014812 The VFT tethering complex (also known as GARP complex, Golgi associated retrograde protein complex, Vps53 tethering complex) is a conserved eukaryotic docking complex which is involved in recycling of proteins from endosomes to the late Golgi. Vps51 (also known as Vps67) is a subunit of VFT and interacts with the SNARE Tlg1 [].
Probab=25.89 E-value=1.6e+02 Score=17.67 Aligned_cols=62 Identities=16% Similarity=0.229 Sum_probs=35.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 15 EQELSSTFARLCQLVDRATDDMHEDIK----QLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 15 qqEL~~tf~rL~~~Vd~~~~eL~~ei~----~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
-.+|......|.........+|+.-|- ++=.-.+.+..+......|++...-|...+..+.+
T Consensus 21 ~~~i~~~~~~L~~~i~~~~~eLr~~V~~nY~~fI~as~~I~~m~~~~~~l~~~l~~l~~~~~~l~~ 86 (87)
T PF08700_consen 21 IKEIRQLENKLRQEIEEKDEELRKLVYENYRDFIEASDEISSMENDLSELRNLLSELQQSIQSLQE 86 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 355555666666666666666655442 22222333444555666677777777777766654
No 319
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=25.80 E-value=1.6e+02 Score=20.09 Aligned_cols=9 Identities=11% Similarity=0.283 Sum_probs=3.4
Q ss_pred hhHHHHHHH
Q psy7799 62 NKAHYLIKE 70 (83)
Q Consensus 62 nKa~~L~~e 70 (83)
.+...|+.+
T Consensus 90 ~rvd~Lerq 98 (108)
T COG3937 90 ERVDALERQ 98 (108)
T ss_pred HHHHHHHHH
Confidence 333333333
No 320
>PRK14153 heat shock protein GrpE; Provisional
Probab=25.43 E-value=1.7e+02 Score=21.52 Aligned_cols=25 Identities=20% Similarity=0.368 Sum_probs=14.0
Q ss_pred HHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 53 ASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 53 i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
++.++..|+++.-.+..+.++|.+.
T Consensus 45 l~~e~~elkd~~lR~~AEfeN~rKR 69 (194)
T PRK14153 45 CREEIESLKEQLFRLAAEFDNFRKR 69 (194)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444445556666666666666543
No 321
>KOG4302|consensus
Probab=25.35 E-value=2.9e+02 Score=24.22 Aligned_cols=69 Identities=14% Similarity=0.181 Sum_probs=44.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----------HhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 10 CSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEAS-----------NNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 10 cshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~-----------~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
|.-.+-+|....|.|+-.++.+.++.|..+|...++++..|=..- ...++|+..-.-|.-.|+....+|
T Consensus 40 ~l~~le~e~~~~y~~kve~a~~~~~~L~~~ia~~eael~~l~s~l~~~~~~~~~~~k~e~tLke~l~~l~~~le~lr~qk 119 (660)
T KOG4302|consen 40 KLLRLEQECLEIYKRKVEEASESKARLLQEIAVIEAELNDLCSALGEPSIIGEISDKIEGTLKEQLESLKPYLEGLRKQK 119 (660)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccccccccccCccHHHHHHHHHHHHHHHHHHH
Confidence 445677888888888888888888888888888777776553221 234455554444544444444433
No 322
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=25.32 E-value=3.5e+02 Score=21.57 Aligned_cols=57 Identities=19% Similarity=0.390 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 22 FARLCQLVDRATDDMHEDIKQLDSHLKQVEE----------ASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 22 f~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~----------i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
+.+.-.......++++.++.++...|..++. .....+.++.+...|..++....+..
T Consensus 332 l~~~~~~l~~~~~~~~~~l~~l~~~l~~l~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~~ 398 (451)
T PF03961_consen 332 LKEKLEELEEELEELKEELEKLKKNLKKLKKLKKQGKLPPEKKEQLKKLKEKKKELKEELKELKEEL 398 (451)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 323
>PF08232 Striatin: Striatin family; InterPro: IPR013258 This domain is associated with the N terminus of striatin. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (IPR001680 from INTERPRO) repeat domain []. It acts as a scaffold protein [] and is involved in signalling pathways [, ].
Probab=25.17 E-value=2.3e+02 Score=19.33 Aligned_cols=51 Identities=14% Similarity=0.159 Sum_probs=34.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 26 CQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 26 ~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
...=+.=.+||+..|..|+.+...+|.+...+ .-+.-+||-.|-.=...|.
T Consensus 20 R~~WeiERaEmkarIa~LEGE~r~~e~l~~dL---~rrIkMLE~aLkqER~k~~ 70 (134)
T PF08232_consen 20 RNQWEIERAEMKARIAFLEGERRGQENLKKDL---KRRIKMLEYALKQERAKYK 70 (134)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhh
Confidence 34445556677777777777777777766544 3478889988877666664
No 324
>PF12252 SidE: Dot/Icm substrate protein; InterPro: IPR021014 This entry represents bacterial proteins that are typically between 397 and 1543 amino acids in length including SidE protein in the Dot/Icm pathway of Legionella pneumophila bacteria. There is little literature describing the family.
Probab=25.11 E-value=2e+02 Score=27.28 Aligned_cols=40 Identities=10% Similarity=0.297 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhH
Q psy7799 25 LCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKA 64 (83)
Q Consensus 25 L~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa 64 (83)
+.++|....+.--.......+.|..|+.+|...|.|||.-
T Consensus 1108 iTKrIt~LEk~k~~~l~~ikK~ia~lnnlqqElklLRnEK 1147 (1439)
T PF12252_consen 1108 ITKRITDLEKAKLDNLDSIKKAIANLNNLQQELKLLRNEK 1147 (1439)
T ss_pred HHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 3344444444444455666778888889999999999753
No 325
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=25.10 E-value=4.1e+02 Score=22.26 Aligned_cols=10 Identities=20% Similarity=0.674 Sum_probs=3.8
Q ss_pred HHHHHHHHHH
Q psy7799 36 MHEDIKQLDS 45 (83)
Q Consensus 36 L~~ei~~L~~ 45 (83)
++.||++++.
T Consensus 403 ~e~el~~l~~ 412 (650)
T TIGR03185 403 LEEELAEVDK 412 (650)
T ss_pred HHHHHHHHHH
Confidence 3333333333
No 326
>PRK01203 prefoldin subunit alpha; Provisional
Probab=25.09 E-value=1.9e+02 Score=20.09 Aligned_cols=28 Identities=14% Similarity=0.353 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
.++++-|+.|.++|..|+.+...-...+
T Consensus 83 K~~e~kie~L~~~ie~Le~~i~~K~~~l 110 (130)
T PRK01203 83 EERERTIERLKENLEDLKDSIQKLNDQR 110 (130)
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556678888888888887765444333
No 327
>PRK11820 hypothetical protein; Provisional
Probab=25.05 E-value=1.1e+02 Score=23.64 Aligned_cols=45 Identities=27% Similarity=0.456 Sum_probs=32.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
+.|..+||.+|...|+++..+-..-...=.|-.||--|+++=.++
T Consensus 212 K~DI~EEi~RL~sHl~~f~~~L~~~~~vGrkLDFL~QEm~RE~NT 256 (288)
T PRK11820 212 KADIAEELDRLKSHLKEFREILKKGGPVGRKLDFLMQELNREANT 256 (288)
T ss_pred HcchHHHHHHHHHHHHHHHHHHhcCCCCCcchhHHHHHHhHHHHH
Confidence 457788899999998888887766666666777777666654433
No 328
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=24.99 E-value=4e+02 Score=22.09 Aligned_cols=29 Identities=14% Similarity=0.279 Sum_probs=18.3
Q ss_pred HHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 50 VEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 50 le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
...+...+..++.....|..|++.-..+|
T Consensus 312 ~~~l~~~l~~~~e~~~~l~~Ei~~l~~sY 340 (569)
T PRK04778 312 SDTLPDFLEHAKEQNKELKEEIDRVKQSY 340 (569)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 33444455555566667777777777777
No 329
>PHA03332 membrane glycoprotein; Provisional
Probab=24.89 E-value=3.3e+02 Score=25.79 Aligned_cols=47 Identities=15% Similarity=0.331 Sum_probs=20.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhHHhhhh
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLK-QVEEASNNAKVLRNK 63 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~-~le~i~~~~K~lrnK 63 (83)
+|..+-+.+.+..+..-++-+..|..|++++. ++..+....++|+-+
T Consensus 916 klGnti~kisatl~~nI~avNgRIs~Led~VN~r~~~v~~~intLA~q 963 (1328)
T PHA03332 916 KLGDTIAKISATLDNNIRAVNGRVSDLEDQVNLRFLAVATNFNTLATQ 963 (1328)
T ss_pred HhhhHHHHHHHHHHhhHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444444444444442 233444455555433
No 330
>COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones]
Probab=24.85 E-value=1e+02 Score=25.21 Aligned_cols=33 Identities=30% Similarity=0.525 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHH---HHhhHHhhhhHHHHHH
Q psy7799 37 HEDIKQLDSHLKQVEEA---SNNAKVLRNKAHYLIK 69 (83)
Q Consensus 37 ~~ei~~L~~~i~~le~i---~~~~K~lrnKa~~L~~ 69 (83)
++.|++|+++|+..|+. ...+..+||+|.+--.
T Consensus 92 eeki~Elde~i~~~eedngE~e~~ea~~n~aeyY~q 127 (412)
T COG5187 92 EEKIEELDERIREKEEDNGETEGSEADRNIAEYYCQ 127 (412)
T ss_pred HHHHHHHHHHHHHHhhcccchHHHHHHHHHHHHHHH
Confidence 56778888888777654 4678889999987543
No 331
>cd03003 PDI_a_ERdj5_N PDIa family, N-terminal ERdj5 subfamily; ERdj5, also known as JPDI and macrothioredoxin, is a protein containing an N-terminal DnaJ domain and four redox active TRX domains. This subfamily is comprised of the first TRX domain of ERdj5 located after the DnaJ domain at the N-terminal half of the protein. ERdj5 is a ubiquitous protein localized in the endoplasmic reticulum (ER) and is abundant in secretory cells. It's transcription is induced during ER stress. It interacts with BiP through its DnaJ domain in an ATP-dependent manner. BiP, an ER-resident member of the Hsp70 chaperone family, functions in ER-associated degradation and protein translocation.
Probab=24.84 E-value=48 Score=20.09 Aligned_cols=26 Identities=27% Similarity=0.570 Sum_probs=18.6
Q ss_pred chhhhhhchhHHHHHHHHHHHHHHHHHH
Q psy7799 2 IVDLTSSNCSHQVEQELSSTFARLCQLV 29 (83)
Q Consensus 2 iV~~tssncshQVqqEL~~tf~rL~~~V 29 (83)
+|.|++..|++ .++|...|..+....
T Consensus 22 ~v~f~a~wC~~--C~~~~p~~~~~a~~~ 47 (101)
T cd03003 22 FVNFYSPRCSH--CHDLAPTWREFAKEM 47 (101)
T ss_pred EEEEECCCChH--HHHhHHHHHHHHHHh
Confidence 57788899986 566777777766543
No 332
>PRK13842 conjugal transfer protein TrbJ; Provisional
Probab=24.71 E-value=3.3e+02 Score=21.07 Aligned_cols=37 Identities=16% Similarity=0.162 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHH-HHHH
Q psy7799 40 IKQLDSHLKQVEEASNNAKVLRNKAHYLIKELD-MFQE 76 (83)
Q Consensus 40 i~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~-~F~~ 76 (83)
..++...|.+|..+...+.-+..-++-|+..+. .|..
T Consensus 96 w~ql~~~l~~l~~l~~qaq~iay~v~~id~~f~q~y~~ 133 (267)
T PRK13842 96 WGQVENDLNRLRSIVNQGQGIAFSMGNADDVLKQRFQS 133 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccc
Confidence 345667778888888888888888888888887 6644
No 333
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=24.70 E-value=1.8e+02 Score=17.89 Aligned_cols=16 Identities=19% Similarity=0.322 Sum_probs=6.7
Q ss_pred HHHHHHHHHHHHHHHH
Q psy7799 18 LSSTFARLCQLVDRAT 33 (83)
Q Consensus 18 L~~tf~rL~~~Vd~~~ 33 (83)
+..++.++..+++...
T Consensus 24 l~~~l~~~~~ti~~l~ 39 (90)
T PF06103_consen 24 LKKTLDEVNKTIDTLQ 39 (90)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3444444444444333
No 334
>PLN03137 ATP-dependent DNA helicase; Q4-like; Provisional
Probab=24.54 E-value=1.6e+02 Score=27.58 Aligned_cols=19 Identities=26% Similarity=0.368 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy7799 36 MHEDIKQLDSHLKQVEEAS 54 (83)
Q Consensus 36 L~~ei~~L~~~i~~le~i~ 54 (83)
|..+--.|.++|++||.-.
T Consensus 322 ~~~~~~~~~~~~~~~~~~~ 340 (1195)
T PLN03137 322 LRQDRLQLKKQIQQLEIHI 340 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 3334445667777777644
No 335
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=24.51 E-value=78 Score=18.30 Aligned_cols=16 Identities=31% Similarity=0.625 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHHH
Q psy7799 36 MHEDIKQLDSHLKQVE 51 (83)
Q Consensus 36 L~~ei~~L~~~i~~le 51 (83)
++.+++++.++++.+|
T Consensus 46 ~r~~~~~~~k~l~~le 61 (68)
T PF06305_consen 46 LRRRIRRLRKELKKLE 61 (68)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344444444444433
No 336
>PF01920 Prefoldin_2: Prefoldin subunit; InterPro: IPR002777 Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal beta subunit, eukaryotic prefoldin subunits 1, 2, 4 and 6. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 2ZDI_B 3AEI_B 2ZQM_A 1FXK_A.
Probab=24.07 E-value=1.8e+02 Score=17.78 Aligned_cols=39 Identities=28% Similarity=0.507 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAK 58 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K 58 (83)
||+-.-|.+|.++... +...|..|..++..++.+...+.
T Consensus 1 Qe~~~~~~~l~~~l~~----~~~q~~~l~~~~~~~~~~~~eL~ 39 (106)
T PF01920_consen 1 QELQNKFQELNQQLQQ----LEQQIQQLERQLRELELTLEELE 39 (106)
T ss_dssp HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555554433 23334444444444444444433
No 337
>PF04568 IATP: Mitochondrial ATPase inhibitor, IATP; InterPro: IPR007648 ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=24.05 E-value=2.2e+02 Score=18.93 Aligned_cols=25 Identities=12% Similarity=0.421 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 28 LVDRATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 28 ~Vd~~~~eL~~ei~~L~~~i~~le~ 52 (83)
+.......|+.||+.-.++|+.||+
T Consensus 73 qL~~Lk~kl~~e~~~~~k~i~~le~ 97 (100)
T PF04568_consen 73 QLKKLKEKLKEEIEHHRKEIDELEK 97 (100)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444455555555556666665554
No 338
>KOG3335|consensus
Probab=24.05 E-value=2e+02 Score=21.31 Aligned_cols=27 Identities=19% Similarity=0.241 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHhhhhHHHH
Q psy7799 38 EDIKQLDSHLKQVEEASNNAKVLRNKAHYL 67 (83)
Q Consensus 38 ~ei~~L~~~i~~le~i~~~~K~lrnKa~~L 67 (83)
.+|.+|..+++.|+. ..-.++++...|
T Consensus 106 ~e~~elr~~~~~l~~---~i~~~~~~~~~L 132 (181)
T KOG3335|consen 106 QEIMELRLKVEKLEN---AIAELTKFFSQL 132 (181)
T ss_pred HHHHHHHHHHHHHHH---HHHHHHHHHHHH
Confidence 344555555544444 333344444444
No 339
>PF05701 WEMBL: Weak chloroplast movement under blue light; InterPro: IPR008545 This family consists of several plant proteins of unknown function. Several sequences in this family are described as being myosin heavy chain-like.
Probab=23.98 E-value=4.2e+02 Score=21.90 Aligned_cols=65 Identities=17% Similarity=0.185 Sum_probs=45.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRA---TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~---~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
++...|-..++.-..+..... ..+-+.++.+|..++.....+..++...+.....|..||..+..
T Consensus 196 ~eAeee~~~~~~~~~~~~~~~~~~leeae~~l~~L~~e~~~~k~Le~kL~~a~~~l~~Lq~El~~~~~ 263 (522)
T PF05701_consen 196 IEAEEERIEIAAEREQDAEEWEKELEEAEEELEELKEELEAAKDLESKLAEASAELESLQAELEAAKE 263 (522)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455555555554444333333 34456778888888888888888888888888888888888766
No 340
>PRK10203 hypothetical protein; Provisional
Probab=23.90 E-value=2e+02 Score=19.82 Aligned_cols=37 Identities=14% Similarity=0.230 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRATD--DMHEDIKQLDSHLKQ 49 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~--eL~~ei~~L~~~i~~ 49 (83)
+.++|+.+.-.-|....|.... .+..++..+..++..
T Consensus 61 ~LrKE~~~l~~~l~~~~d~~~~~~~~~k~L~~l~lr~~~ 99 (122)
T PRK10203 61 EQRREAIQLLDLLKGIREDDPQYQEVSRRLSLLELKLRQ 99 (122)
T ss_pred HHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHH
Confidence 4556665555555444433221 344444444444443
No 341
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=23.84 E-value=26 Score=29.69 Aligned_cols=46 Identities=22% Similarity=0.343 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhcc
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYLV 80 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL~ 80 (83)
.=++++...|..+++++|.+...+-.++=+-.-|+.++..+.. |++
T Consensus 284 elLeEe~~sLq~kl~~~E~~~~el~~lq~e~~~Le~el~sW~s-l~~ 329 (722)
T PF05557_consen 284 ELLEEEKRSLQRKLERLEELEEELAELQLENEKLEDELNSWES-LLQ 329 (722)
T ss_dssp -----------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHh
Confidence 3466777788888888888888888887777778888877655 443
No 342
>PF13865 FoP_duplication: C-terminal duplication domain of Friend of PRMT1
Probab=23.77 E-value=1.9e+02 Score=17.83 Aligned_cols=16 Identities=25% Similarity=0.272 Sum_probs=11.0
Q ss_pred hhhHHHHHHHHHHHHH
Q psy7799 61 RNKAHYLIKELDMFQE 76 (83)
Q Consensus 61 rnKa~~L~~eL~~F~~ 76 (83)
.+.+.-|..||+.|-+
T Consensus 54 ~~~~~~LD~~Ld~Y~~ 69 (74)
T PF13865_consen 54 SKTKSKLDAELDSYMS 69 (74)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5566777888877654
No 343
>PF05615 THOC7: Tho complex subunit 7; InterPro: IPR008501 The Tho complex (THOC) is involved in transcription elongation and mRNA export from the nucleus []. This entry represents the subunit THOC7, which is found in higher eukaryotes, and the non-homologous subunit Mft1p found in yeast. The funtions of these subunits are unknown, and it is not known if these subunits are functionally equivalent.
Probab=23.72 E-value=2.2e+02 Score=18.94 Aligned_cols=26 Identities=15% Similarity=0.486 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 22 FARLCQLVDRATDDMHEDIKQLDSHL 47 (83)
Q Consensus 22 f~rL~~~Vd~~~~eL~~ei~~L~~~i 47 (83)
|..+...++.....++.+|.+|..++
T Consensus 79 Y~~~~~~i~~~i~~~k~~ie~lk~~L 104 (139)
T PF05615_consen 79 YEQLNEEIEQEIEQAKKEIEELKEEL 104 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444444444444444433
No 344
>PF10669 Phage_Gp23: Protein gp23 (Bacteriophage A118); InterPro: IPR018926 This entry is represented by the major tail subunit protein, Gp23 of Listeria phage A118 and prophage found in Bacilli. The function is currently unknown.
Probab=23.60 E-value=2.5e+02 Score=19.29 Aligned_cols=63 Identities=13% Similarity=0.158 Sum_probs=37.6
Q ss_pred hchhHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHH
Q psy7799 8 SNCSHQVEQELSSTFARLCQ------------------LVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIK 69 (83)
Q Consensus 8 sncshQVqqEL~~tf~rL~~------------------~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~ 69 (83)
=+=+.||.-||..-...-.+ .||...+=|..+-+=+-.+-+.++.+......|-+|...|+.
T Consensus 38 W~~~r~~r~~MKEER~K~E~~~q~r~rES~~Er~K~~~s~~~~q~Lm~rQN~mm~~qqqsidslsksvgklahkvdlleh 117 (121)
T PF10669_consen 38 WHDSRQVRIRMKEERSKKEEKRQKRNRESKRERQKFIWSMNKQQSLMNRQNNMMKQQQQSIDSLSKSVGKLAHKVDLLEH 117 (121)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHhHHhhhhHHHHHHHHHhHHHHHHHHhHHHHHHHHHHHHHHHHHHHh
Confidence 34566777777663333222 333333334444444555667777888888888888888875
Q ss_pred H
Q psy7799 70 E 70 (83)
Q Consensus 70 e 70 (83)
-
T Consensus 118 k 118 (121)
T PF10669_consen 118 K 118 (121)
T ss_pred h
Confidence 4
No 345
>KOG3633|consensus
Probab=23.60 E-value=1.4e+02 Score=22.59 Aligned_cols=31 Identities=16% Similarity=0.179 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 43 LDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 43 L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
-.+-|..||.+..+...||.+|..|+-|=+-
T Consensus 44 s~r~v~~LD~lelrVE~LRk~A~~le~eKe~ 74 (219)
T KOG3633|consen 44 SERFVTILDSLELRVEKLRKDALNLEEEKEY 74 (219)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455788999999999999999999887543
No 346
>PF08702 Fib_alpha: Fibrinogen alpha/beta chain family; InterPro: IPR012290 Fibrinogen plays key roles in both blood clotting and platelet aggregation. During blood clot formation, the conversion of soluble fibrinogen to insoluble fibrin is triggered by thrombin, resulting in the polymerisation of fibrin, which forms a soft clot; this is then converted to a hard clot by factor XIIIA, which cross-links fibrin molecules. Platelet aggregation involves the binding of the platelet protein receptor integrin alpha(IIb)-beta(3) to the C-terminal D domain of fibrinogen []. In addition to platelet aggregation, platelet-fibrinogen interaction mediates both adhesion and fibrin clot retraction. Fibrinogen occurs as a dimer, where each monomer is composed of three non-identical chains, alpha, beta and gamma, linked together by several disulphide bonds []. The N-terminals of all six chains come together to form the centre of the molecule (E domain), from which the monomers extend in opposite directions as coiled coils, followed by C-terminal globular domains (D domains). Therefore, the domain composition is: D-coil-E-coil-D. At each end, the C-terminal of the alpha chain extends beyond the D domain as a protuberance that is important for cross-linking the molecule. During clot formation, the N-terminal fragments of the alpha and beta chains (within the E domain) in fibrinogen are cleaved by thrombin, releasing fibrinopeptides A and B, respectively, and producing fibrin. This cleavage results in the exposure of four binding sites on the E domain, each of which can bind to a D domain from different fibrin molecules. The binding of fibrin molecules produces a polymer consisting of a lattice network of fibrins that form a long, branching, flexible fibre [, ]. Fibrin fibres interact with platelets to increase the size of the clot, as well as with several different proteins and cells, thereby promoting the inflammatory response and concentrating the cells required for wound repair at the site of damage. This entry represents the coiled-coil domain and part of the N-terminal E domain found in all three fibrinogen polypeptides, namely the alpha, beta and gamma chains. More information about these proteins can be found at Protein of the Month: Fibrinogen [].; GO: 0005102 receptor binding, 0030674 protein binding, bridging, 0007165 signal transduction, 0030168 platelet activation, 0051258 protein polymerization, 0005577 fibrinogen complex; PDB: 1LWU_D 1N73_D 1M1J_B 1JY2_R 1JY3_R 1RF0_A 2H43_D 1RE4_D 2XNY_D 2HPC_D ....
Probab=23.57 E-value=2.6e+02 Score=19.41 Aligned_cols=34 Identities=24% Similarity=0.517 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNA 57 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~ 57 (83)
|+.-..+....+....|.+|+..+..++...+.+
T Consensus 22 ~i~~~L~k~~~~v~~~i~~L~~~L~~~~n~t~~~ 55 (146)
T PF08702_consen 22 GIQDFLDKYERDVDKDIQELENLLDQISNSTSEA 55 (146)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHccchHHHHHHHHHHHHHHHHhhhhH
Confidence 3334444444455555555555444444444433
No 347
>PF15450 DUF4631: Domain of unknown function (DUF4631)
Probab=23.54 E-value=4.8e+02 Score=22.44 Aligned_cols=23 Identities=17% Similarity=0.283 Sum_probs=11.4
Q ss_pred HHHHhhHHhhhhHHHHHHHHHHH
Q psy7799 52 EASNNAKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 52 ~i~~~~K~lrnKa~~L~~eL~~F 74 (83)
.+....+.++.|..-|-.+++.-
T Consensus 409 e~~~~l~~v~eKVd~LpqqI~~v 431 (531)
T PF15450_consen 409 EMEKHLKEVQEKVDSLPQQIEEV 431 (531)
T ss_pred HHHHHHHHHHHHHHhhhHHHHHH
Confidence 34445555555555554444433
No 348
>COG3850 NarQ Signal transduction histidine kinase, nitrate/nitrite-specific [Signal transduction mechanisms]
Probab=23.51 E-value=2e+02 Score=24.87 Aligned_cols=37 Identities=27% Similarity=0.374 Sum_probs=30.7
Q ss_pred hchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 8 SNCSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLD 44 (83)
Q Consensus 8 sncshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~ 44 (83)
+.|=.|.-.||...|+.|.+.|.+-+++|+..=..|+
T Consensus 210 ~~~FNqMs~EL~~lY~~LE~rV~eKT~~L~~~Nq~Ls 246 (574)
T COG3850 210 GRAFNQMSGELKKLYADLEQRVEEKTRDLEQKNQRLS 246 (574)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHH
Confidence 4566799999999999999999999998887655543
No 349
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=23.49 E-value=2e+02 Score=18.05 Aligned_cols=19 Identities=16% Similarity=0.522 Sum_probs=9.2
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le 51 (83)
..+|+++|+.+..++..++
T Consensus 72 ~~~l~~~l~~l~~~~~~~~ 90 (104)
T PF13600_consen 72 LKELEEELEALEDELAALQ 90 (104)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3445555555555544433
No 350
>TIGR00634 recN DNA repair protein RecN. All proteins in this family for which functions are known are ATP binding proteins involved in the initiation of recombination and recombinational repair.
Probab=23.45 E-value=4.2e+02 Score=21.80 Aligned_cols=43 Identities=19% Similarity=0.251 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHH
Q psy7799 29 VDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKEL 71 (83)
Q Consensus 29 Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL 71 (83)
+-....+++.+++.+...-..++.++.+...++.+..-+..+|
T Consensus 327 l~~~~~~l~~eL~~l~~~~~~le~L~~el~~l~~~l~~~a~~L 369 (563)
T TIGR00634 327 VLEYAEKIKEELDQLDDSDESLEALEEEVDKLEEELDKAAVAL 369 (563)
T ss_pred HHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444555555554444444444444444444444444333
No 351
>PRK11519 tyrosine kinase; Provisional
Probab=23.43 E-value=4.7e+02 Score=22.32 Aligned_cols=31 Identities=13% Similarity=0.201 Sum_probs=17.9
Q ss_pred HHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 48 KQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 48 ~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
+.++-+..++..+|.+..-.|..|+.|..++
T Consensus 267 ~a~~fL~~ql~~l~~~L~~aE~~l~~fr~~~ 297 (719)
T PRK11519 267 KSLAFLAQQLPEVRSRLDVAENKLNAFRQDK 297 (719)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 3445555555666666666666666665544
No 352
>TIGR01410 tatB twin arginine-targeting protein translocase TatB. This model represents the TatB protein of a Sec-independent system for transporting folded proteins, often with a bound redox cofactor, across the bacterial inner membrane. TatC is the multiple membrane spanning component. TatB, like the related TatA/E proteins, appears to span the membrane one time. The tat system recognizes proteins with an elongated signal sequence containing a conserved R-R in a motif approximated by RRxFLK N-terminal to the transmembrane helix. TIGRFAMs model TIGR01409 describes this twin-Arg signal sequence. A similar system, termed Delta-pH-dependent transport, operates on chloroplast-encoded proteins.
Probab=23.36 E-value=2e+02 Score=18.06 Aligned_cols=28 Identities=14% Similarity=0.331 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIK 41 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~ 41 (83)
+-+.+-..+.++...++..+++++++++
T Consensus 27 ~~r~~G~~i~~~r~~~~~~~~~~~~e~~ 54 (80)
T TIGR01410 27 AIRAVGKFVRRLRGMASDVKNELDEELK 54 (80)
T ss_pred HHHHHHHHHHHHHHhhHhHHHHHHHHhc
Confidence 4456667777777777777777777766
No 353
>PRK14157 heat shock protein GrpE; Provisional
Probab=23.36 E-value=1.8e+02 Score=22.04 Aligned_cols=14 Identities=7% Similarity=-0.014 Sum_probs=5.6
Q ss_pred hhHHHHHHHHHHHH
Q psy7799 62 NKAHYLIKELDMFQ 75 (83)
Q Consensus 62 nKa~~L~~eL~~F~ 75 (83)
++.-.+..|.++|.
T Consensus 98 d~llR~~AEfeNyR 111 (227)
T PRK14157 98 EALQRERAEFINYR 111 (227)
T ss_pred HHHHHHHHHHHHHH
Confidence 33333444444443
No 354
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=23.35 E-value=3.2e+02 Score=20.44 Aligned_cols=38 Identities=8% Similarity=0.274 Sum_probs=15.4
Q ss_pred HHHHHHHHHHHHHHHH----HHHHHhhHHhhhhHHHHHHHHH
Q psy7799 35 DMHEDIKQLDSHLKQV----EEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 35 eL~~ei~~L~~~i~~l----e~i~~~~K~lrnKa~~L~~eL~ 72 (83)
+|..+|++|..+|.+| |..+..+..++.+-..|-.+|+
T Consensus 58 ~l~~ql~~lq~ev~~LrG~~E~~~~~l~~~~~rq~~~y~dld 99 (263)
T PRK10803 58 QLQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQKQIYLQID 99 (263)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444332 3334444444433333333333
No 355
>PRK11032 hypothetical protein; Provisional
Probab=23.23 E-value=2.9e+02 Score=19.81 Aligned_cols=62 Identities=13% Similarity=0.236 Sum_probs=34.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh--HHhhhhHHHHHHHHHHHHHHh
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNA--KVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~--K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+...|.+|-..|.....+-+.+|+++-............+ .-+-.=..||.+.|+.|...|
T Consensus 3 k~~~~Y~~ll~~v~~~l~~~~~~l~~~ve~a~~~~~~~~elT~dEl~lv~~ylkRDL~ef~~~~ 66 (160)
T PRK11032 3 KVAQYYRELVASLTERLRNGERDIDALVESARKRVDAAGELTRDEVDLITRAVRRDLEEFARSY 66 (160)
T ss_pred hHHHHHHHHHHHHHHHHHhCHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHH
Confidence 35566777777777666666666655522222211111111 112223478899999998876
No 356
>cd07660 BAR_Arfaptin The Bin/Amphiphysin/Rvs (BAR) domain of Arfaptin. The BAR domain of Arfaptin-like proteins, also called the Arfaptin domain, is a dimerization and lipid binding module that can detect and drive membrane curvature. Arfaptins are ubiquitously expressed proteins implicated in mediating cross-talk between Rac, a member of the Rho family GTPases, and Arf (ADP-ribosylation factor) small GTPases. Arfaptins bind to GTP-bound Arf1, Arf5, and Arf6, with strongest binding to GTP-Arf1. Arfaptins also bind to Rac-GTP and Rac-GDP with similar affinities. The Arfs are thought to bind to the same surface as Rac, and their binding is mutually exclusive. Mammals contain at least two isoforms of Arfaptin. Arfaptin 1 has been shown to inhibit the activation of Arf-dependent phospholipase D (PLD) and the secretion of matrix metalloproteinase-9 (MMP-9), an enzyme implicated in cancer invasiveness and metastasis. Arfaptin 2 regulates the aggregation of the protein huntingtin, which is im
Probab=23.21 E-value=2.8e+02 Score=20.62 Aligned_cols=31 Identities=10% Similarity=0.225 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHhhHHhhhhHHHHHHHHH
Q psy7799 42 QLDSHLKQVEEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 42 ~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~ 72 (83)
+|+.+|+.|+.++...+.|=+=+.-|.+.+-
T Consensus 3 eLeaklE~L~~~q~~Y~~ll~~~~~l~~~~~ 33 (201)
T cd07660 3 ELEAQIEVLRDTQRKYESVLRLARALASQFY 33 (201)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4666677777777766666544444444443
No 357
>cd07629 BAR_Atg20p The Bin/Amphiphysin/Rvs (BAR) domain of yeast Sorting Nexin Atg20p. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. The function of Atg20p is unknown but it has been shown to interact with Atg11p, which plays a role in linking cargo molecules with vesicle-forming components. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=23.20 E-value=2.8e+02 Score=19.70 Aligned_cols=21 Identities=19% Similarity=0.425 Sum_probs=10.1
Q ss_pred HHHHHHHHHHHHHH-HHHHHHH
Q psy7799 35 DMHEDIKQLDSHLK-QVEEASN 55 (83)
Q Consensus 35 eL~~ei~~L~~~i~-~le~i~~ 55 (83)
+++..+++++.-+. .|+++-.
T Consensus 8 ~~e~~~~~~~~~L~~~l~kv~~ 29 (187)
T cd07629 8 DIEAETKKYEQLLHGGMEKVNR 29 (187)
T ss_pred HHHHHHHHHHHHHcCcHHHHHH
Confidence 44555555555553 4554443
No 358
>PF01763 Herpes_UL6: Herpesvirus UL6 like; InterPro: IPR002660 This family consists of various proteins from the Herpesviridae that are similar to Human herpesvirus 1 (HHV-1) UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome [].; GO: 0006323 DNA packaging
Probab=23.09 E-value=4.9e+02 Score=22.38 Aligned_cols=45 Identities=18% Similarity=0.268 Sum_probs=32.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
..+-.=|++.|.++-+.|+.|+.+- +.|-+|...++.+|.+-...
T Consensus 362 nsI~kcLe~qIn~qf~tIe~Lk~~n---~~~~~kl~~~e~~L~r~~~~ 406 (557)
T PF01763_consen 362 NSINKCLEGQINNQFDTIEDLKEEN---QDLEKKLRELESELSRYREE 406 (557)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHh
Confidence 3444456788888888888887664 45667888888888877654
No 359
>smart00434 TOP4c DNA Topoisomerase IV. Bacterial DNA topoisomerase IV, GyrA, ParC
Probab=23.00 E-value=1.5e+02 Score=24.23 Aligned_cols=34 Identities=18% Similarity=0.439 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy7799 24 RLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNA 57 (83)
Q Consensus 24 rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~ 57 (83)
+|.+....-...|++|++++.++|+.|+++....
T Consensus 408 ~L~~LT~~e~~kL~~e~~~l~~ei~~l~~~l~~~ 441 (445)
T smart00434 408 RLRRLTKLEVEKLEKELKELEKEIEDLEKILASE 441 (445)
T ss_pred HHHHhhhhHHHHHHHHHHHHHHHHHHHHHhccch
Confidence 4555555666778888888888888888775543
No 360
>cd00011 BAR_Arfaptin_like The Bin/Amphiphysin/Rvs (BAR) domain of Arfaptin-like proteins, a dimerization module that binds and bends membranes. The BAR domain of Arfaptin-like proteins, also called the Arfaptin domain, is a dimerization, lipid binding and curvature sensing module present in Arfaptins, PICK1, ICA69, and similar proteins. Arfaptins are ubiquitously expressed proteins implicated in mediating cross-talk between Rac, a member of the Rho family GTPases, and Arf (ADP-ribosylation factor) small GTPases. Arfaptins bind to GTP-bound Arf1, Arf5, and Arf6, with strongest binding to GTP-Arf1. Arfaptins also binds to Rac-GTP and Rac-GDP with similar affinities. The Arfs are thought to bind to the same surface as Rac, and their binding is mutually exclusive. Protein Interacting with C Kinase 1 (PICK1) plays a key role in the trafficking of AMPA receptors, which are critical for regulating synaptic strength and may be important in cellular processes involved in learning and memory. Is
Probab=22.82 E-value=3.1e+02 Score=20.36 Aligned_cols=32 Identities=16% Similarity=0.179 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 42 QLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 42 ~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
+|+++|+.|+.++..++.|=+=+.-+.+.+..
T Consensus 3 eL~akle~l~~~~~~y~~Ll~~~~~~~~~~~~ 34 (203)
T cd00011 3 ELELQLELLRETKRKYESVLQLGRALTAHLYS 34 (203)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56677777777777776665555555444433
No 361
>PF08340 DUF1732: Domain of unknown function (DUF1732); InterPro: IPR013551 This domain of unknown function is found at the C terminus of bacterial proteins, many of which are hypothetical and include proteins of the YicC family.
Probab=22.81 E-value=58 Score=21.37 Aligned_cols=22 Identities=41% Similarity=0.623 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy7799 35 DMHEDIKQLDSHLKQVEEASNN 56 (83)
Q Consensus 35 eL~~ei~~L~~~i~~le~i~~~ 56 (83)
|..+||.+|...++++..+-..
T Consensus 12 DI~EEl~RL~sH~~~f~~~l~~ 33 (87)
T PF08340_consen 12 DISEELVRLKSHLKQFRELLES 33 (87)
T ss_pred chHHHHHHHHHHHHHHHHHHhc
Confidence 4556666666666665555443
No 362
>PRK14146 heat shock protein GrpE; Provisional
Probab=22.75 E-value=1.8e+02 Score=21.62 Aligned_cols=24 Identities=13% Similarity=0.155 Sum_probs=12.9
Q ss_pred HHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 54 SNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 54 ~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
..++..|+++.-.+..++++|.+-
T Consensus 67 ~~e~~el~d~~lR~~AdfeN~rkR 90 (215)
T PRK14146 67 KKEIESLKDSWARERAEFQNFKRR 90 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444555555566666666543
No 363
>PRK06665 flgK flagellar hook-associated protein FlgK; Validated
Probab=22.60 E-value=4.8e+02 Score=22.15 Aligned_cols=18 Identities=17% Similarity=0.346 Sum_probs=11.6
Q ss_pred hHHhhhhHHHHHHHHHHH
Q psy7799 57 AKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 57 ~K~lrnKa~~L~~eL~~F 74 (83)
...|+++=..|-.||-.+
T Consensus 204 ~ndLlDqRD~ll~eLS~~ 221 (627)
T PRK06665 204 PNDLLDRRDLLVDKLSSL 221 (627)
T ss_pred chhhHHHHHHHHHHHHhh
Confidence 445777777777777544
No 364
>TIGR00255 conserved hypothetical protein TIGR00255. The apparent ortholog from Aquifex aeolicus as reported is split into two consecutive reading frames.
Probab=22.57 E-value=1.3e+02 Score=23.29 Aligned_cols=44 Identities=25% Similarity=0.439 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+.|..+||.+|...|++...+-..-...=.|-.||--|+++=.+
T Consensus 215 K~DI~EEl~RL~sHl~~f~~~L~~~~~vGrkLDFL~QEmnRE~N 258 (291)
T TIGR00255 215 RIDIAEEIDRLDSHVKEFYNILKKGEAVGRKLDFMMQELNRESN 258 (291)
T ss_pred hcchHHHHHHHHHHHHHHHHHHhcCCCcCcchhHHHHHHhHHHH
Confidence 34677888888888888877766555555566666666655333
No 365
>KOG2662|consensus
Probab=22.57 E-value=4.5e+02 Score=21.85 Aligned_cols=52 Identities=15% Similarity=0.234 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 23 ARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 23 ~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
..|...+-.+.+||...|..+. +++|=...+++..|=.|+.-+-.||+.+-+
T Consensus 197 ~~Le~~~~~~LdeLt~~is~~n--L~~lr~~k~~Lt~l~~rvqkvRDeLe~LLd 248 (414)
T KOG2662|consen 197 SELETEAYPLLDELTNKISTLN--LERLRILKKRLTELTSRVQKVRDELEELLD 248 (414)
T ss_pred HHHHHHHHHHHHHHhhhhhHHH--HHHHHHHhHHHHHHHHHHHHHHHHHHHHhc
Confidence 3334444444444444444332 445555666666677788888888876643
No 366
>PF07083 DUF1351: Protein of unknown function (DUF1351); InterPro: IPR009785 This entry is represented by Lactobacillus prophage Lj928, Orf309. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacterial and phage proteins of around 230 residues in length. The function of this family is unknown.
Probab=22.43 E-value=3.1e+02 Score=19.95 Aligned_cols=53 Identities=17% Similarity=0.293 Sum_probs=40.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKA 64 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa 64 (83)
-.|.++++.-+.-....+......+++=++.+..+|...|.-++..|.=+-++
T Consensus 63 K~ikk~~~~P~~~Fe~~~K~l~~~i~~~~~~I~~~ik~~Ee~~k~~k~~~i~~ 115 (215)
T PF07083_consen 63 KEIKKEYSKPIKEFEAKIKELIAPIDEASDKIDEQIKEFEEKEKEEKREKIKE 115 (215)
T ss_pred HHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35677777777888888888888888888888888888888887776655443
No 367
>PF01923 Cob_adeno_trans: Cobalamin adenosyltransferase; InterPro: IPR002779 ATP:cob(I)alamin (or ATP:corrinoid) adenosyltransferases (2.5.1.17 from EC), catalyse the conversion of cobalamin (vitamin B12) into its coenzyme form, adenosylcobalamin (coenzyme B12) []. Adenosylcobalamin (AdoCbl) is required for the ativity of certain enzymes. AdoCbl contains an adenosyl moiety liganded to the cobalt ion of cobalamin via a covalent Co-C bond, and its synthesis is unique to certain prokaryotes. ATP:cob(I)alamin adenosyltransferases are classed into three groups: CobA-type [], EutT-type [] and PduO-type []. Each of the three enzyme types appears to be specialised for particular AdoCbl-dependent enzymes or for the de novo synthesis AdoCbl. PduO and EutT are distantly related, sharing short conserved motifs, while CobA is evolutionarily unrelated and is an example of convergent evolution. This entry represents EutT- and PduO-type ATP:cob(I)alamin adenosyltransferases. PduO functions to convert cobalamin to AdoCbl for 1,2-propanediol degradation [], while EutT produces AdoCbl for ethanolamine utilisation []. ; GO: 0005524 ATP binding, 0008817 cob(I)yrinic acid a,c-diamide adenosyltransferase activity, 0009236 cobalamin biosynthetic process; PDB: 2ZHY_A 2ZHZ_B 3KE5_C 3KE4_B 2AH6_C 1NOG_A 2IDX_C 3GAH_A 3CI1_A 3CI3_A ....
Probab=22.41 E-value=2.7e+02 Score=19.22 Aligned_cols=61 Identities=15% Similarity=0.305 Sum_probs=35.2
Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH----HHHHHHHHh------hHHhhhhHHHHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRATDDM-HEDIKQLDSHL----KQVEEASNN------AKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL-~~ei~~L~~~i----~~le~i~~~------~K~lrnKa~~L~~eL~~F~~~ 77 (83)
|+-++..-|++.++.+...+ ..++++.-.+| -.+-..-.. .+.......|||.+.+.+.+.
T Consensus 27 e~~G~lDEl~a~igla~~~~~~~~~~~~L~~iq~~L~~l~~~la~~~~~~~~~i~~~~v~~Le~~i~~~~~~ 98 (163)
T PF01923_consen 27 EAYGTLDELNAFIGLARSEIKEEELREILERIQNELFDLGAELATPEEDEEPEITEEDVQELEEEIDEYSEE 98 (163)
T ss_dssp HHHHHHHHHHHHHHHHHTHCTTHHHHHHHHHHHHHHHHHHHHHHTTTTSSSCS--HHHHHHHHHHHHHHHHH
T ss_pred eeeeeHHHHHHHHHHHHHHcCchhHHHHHHHHHHHHHHHHHHHcCCCcccccccCHHHHHHHHHHHHHHHhc
Confidence 56677777777777777666 24444433333 233222222 223456778888888888876
No 368
>PRK08724 fliD flagellar capping protein; Validated
Probab=22.41 E-value=5.4e+02 Score=22.65 Aligned_cols=43 Identities=12% Similarity=0.095 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 34 DDMHEDIKQLDSHLKQVEEA-SNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i-~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
..|..+|+.+.++|..||.= ...-..++.|++-|+.-+.....
T Consensus 620 ~sL~~~i~~l~dqi~~Le~Rle~~E~Ry~~QFtAMD~~msqMns 663 (673)
T PRK08724 620 KSLREQNYRLNDDQVALDRRMESLEKRTHAKFAAMQDATGKMQG 663 (673)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44666777777776666542 33335677777766655544433
No 369
>PF11126 Phage_DsbA: Transcriptional regulator DsbA; InterPro: IPR020313 DsbA is a double stranded binding protein found in bacteriophage T4 which is involved in transcriptional regulation. DsbA, along with other viral proteins, interacts with the host RNA polymerase core enzyme enabling initiation of transcription. DsbA acts as an enhancer protein of late genes in vitro. The protein consists of mainly alpha helices [].
Probab=22.25 E-value=1.4e+02 Score=18.92 Aligned_cols=26 Identities=27% Similarity=0.419 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHHHhhHHhhhhHH
Q psy7799 40 IKQLDSHLKQVEEASNNAKVLRNKAH 65 (83)
Q Consensus 40 i~~L~~~i~~le~i~~~~K~lrnKa~ 65 (83)
|++-+...-++|..+...|.+|++|-
T Consensus 2 IkeAS~~k~~ie~~~e~IKdik~~AK 27 (69)
T PF11126_consen 2 IKEASDHKTQIESYNEMIKDIKDRAK 27 (69)
T ss_pred chHHHhhHHHHHHHHHHHHHHHHHHH
Confidence 34445556667777888888888875
No 370
>PRK04863 mukB cell division protein MukB; Provisional
Probab=22.02 E-value=3.8e+02 Score=25.57 Aligned_cols=65 Identities=20% Similarity=0.279 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 11 SHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 11 shQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+|-..+--.....||...+-+..+|++.+.....++ +..+.....+..|...++.+++.|.+.|
T Consensus 415 q~qq~i~~Le~~~~~~~~~~~SdEeLe~~LenF~akl---ee~e~qL~elE~kL~~lea~leql~~~~ 479 (1486)
T PRK04863 415 QYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKE---QEATEELLSLEQKLSVAQAAHSQFEQAY 479 (1486)
T ss_pred HHHHHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 371
>KOG0250|consensus
Probab=21.98 E-value=6.6e+02 Score=23.51 Aligned_cols=61 Identities=11% Similarity=0.220 Sum_probs=0.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 10 CSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 10 cshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
|-.+..+.+...-+.+...++.-..+++++++.|..++..+|+ ....||.+-+.+-.++..
T Consensus 373 ~~d~l~k~I~~~~~~~~~~~~~~~~e~e~k~~~L~~evek~e~---~~~~L~~e~~~~~~~~~~ 433 (1074)
T KOG0250|consen 373 EVDRLEKQIADLEKQTNNELGSELEERENKLEQLKKEVEKLEE---QINSLREELNEVKEKAKE 433 (1074)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
No 372
>PF10191 COG7: Golgi complex component 7 (COG7); InterPro: IPR019335 The conserved oligomeric Golgi (COG) complex is an eight-subunit (Cog1-8) peripheral Golgi protein involved in membrane trafficking and glycoconjugate synthesis []. COG7 is required for normal Golgi morphology and trafficking. Mutation in COG7 causes a congenital disorder of glycosylation [].
Probab=21.86 E-value=5.4e+02 Score=22.45 Aligned_cols=60 Identities=18% Similarity=0.235 Sum_probs=46.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLD----SHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~----~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
..+++...+|.-.+.++...+++.+.++- +-+..++.++..+..||++..-+..++....
T Consensus 34 ~~ls~l~~kLql~~qe~~~~le~~~~q~l~~~Pr~~~ev~~l~~ea~~L~~~~~~v~~~~~~~e 97 (766)
T PF10191_consen 34 AHLSSLVMKLQLYSQEVNASLEETSQQALQRVPRVLREVDRLRQEAASLQEQMASVQEEIKAVE 97 (766)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Confidence 36888889999889999888998888776 4556677778888888888877777776543
No 373
>PF07795 DUF1635: Protein of unknown function (DUF1635); InterPro: IPR012862 The members of this family include sequences that are parts of hypothetical proteins expressed by plant species. The region in question is about 170 amino acids long.
Probab=21.82 E-value=3.6e+02 Score=20.39 Aligned_cols=49 Identities=12% Similarity=0.241 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHH
Q psy7799 17 ELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAH 65 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~ 65 (83)
||....-.-....+.+...-++||+.-+.+|.+|-.+-+.+-.=|+.|.
T Consensus 5 ELRq~Ll~TTlELE~~k~~A~EElRk~eeqi~~L~~Ll~~a~~ERDEAr 53 (214)
T PF07795_consen 5 ELRQKLLYTTLELEATKMEANEELRKREEQIAHLKDLLKKAYQERDEAR 53 (214)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444445566777777888999999999999988888888777664
No 374
>COG1561 Uncharacterized stress-induced protein [Function unknown]
Probab=21.79 E-value=2.6e+02 Score=22.10 Aligned_cols=46 Identities=20% Similarity=0.395 Sum_probs=33.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEA 77 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~ 77 (83)
.+.|..+||++|...|...-.+-.+=...=.|-.||--|+++=+++
T Consensus 213 ~K~DI~EEldRL~sHv~~~~~iL~~~g~vGRkLDFl~QE~nREaNT 258 (290)
T COG1561 213 QKADIAEELDRLKSHVKEFRNILEKGGPVGRKLDFLMQEFNREANT 258 (290)
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHhcCCccchhHHHHHHHHhHHHHh
Confidence 3567888999999999888887765555556677777776654443
No 375
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=21.77 E-value=4e+02 Score=20.89 Aligned_cols=42 Identities=17% Similarity=0.463 Sum_probs=31.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 11 SHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 11 shQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~ 52 (83)
-.+.|.+|......|...++.+...|++.+..+...+..||.
T Consensus 341 le~~q~~l~~~l~~~~~~L~~ve~~~~~N~~~i~~n~~~le~ 382 (388)
T PF04912_consen 341 LESQQSDLQSQLKKWEELLNKVEEKFKENMETIEKNVKKLEE 382 (388)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346777888888888888888888888777777777776664
No 376
>PF03087 DUF241: Arabidopsis protein of unknown function; InterPro: IPR004320 This family represents plant proteins of unknown function.
Probab=21.69 E-value=3.4e+02 Score=19.99 Aligned_cols=21 Identities=14% Similarity=0.485 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~ 52 (83)
......+++..|+.+|+.||.
T Consensus 192 ~~~~~~~~Le~LE~~Ie~lE~ 212 (231)
T PF03087_consen 192 EVQNAQKRLEELEECIEELEE 212 (231)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 445566666777777776664
No 377
>PF00430 ATP-synt_B: ATP synthase B/B' CF(0); InterPro: IPR002146 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunits B and B' from the F0 complex in F-ATPases found in chloroplasts and in bacterial plasma membranes. The B subunits are part of the peripheral stalk that links the F1 and F0 complexes together, and which acts as a stator to prevent certain subunits from rotating with the central rotary element. The peripheral stalk differs in subunit composition between mitochondrial, chloroplast and bacterial F-ATPases. In bacterial and chloroplast F-ATPases, the peripheral stalk is composed of one copy of the delta subunit (homologous to OSCP in mitochondria), and two copies of subunit B in bacteria, or one copy each of subunits B and B' in chloroplasts and photosynthetic bacteria []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0045263 proton-transporting ATP synthase complex, coupling factor F(o); PDB: 1L2P_A 2KHK_A 1B9U_A.
Probab=21.69 E-value=2e+02 Score=18.24 Aligned_cols=29 Identities=17% Similarity=0.292 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQ 42 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~ 42 (83)
+.+.+..+...+..+.+.+..++..++..
T Consensus 93 ~~~~~~~a~~~i~~e~~~a~~~l~~~~~~ 121 (132)
T PF00430_consen 93 AERIIEQAEAEIEQEKEKAKKELRQEIVD 121 (132)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHT-----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333334444444333333
No 378
>PF05308 Mito_fiss_reg: Mitochondrial fission regulator; InterPro: IPR007972 This family consists of several uncharacterised eukaryotic proteins of unknown function.
Probab=21.64 E-value=99 Score=23.59 Aligned_cols=25 Identities=8% Similarity=0.407 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhH
Q psy7799 34 DDMHEDIKQLDSHLKQVEEASNNAK 58 (83)
Q Consensus 34 ~eL~~ei~~L~~~i~~le~i~~~~K 58 (83)
..||+|+..|..+|+.+=..|....
T Consensus 125 sALEdELs~LRaQIA~IV~~qe~~~ 149 (253)
T PF05308_consen 125 SALEDELSRLRAQIAKIVAAQEQSN 149 (253)
T ss_pred HHHHHHHHHHHHHHHHHHhcccccc
Confidence 4589999999999998876665443
No 379
>KOG0447|consensus
Probab=21.62 E-value=4.1e+02 Score=23.93 Aligned_cols=60 Identities=18% Similarity=0.275 Sum_probs=37.0
Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHH
Q psy7799 13 QVEQELSSTFARLCQLVDRA-TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMF 74 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~-~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F 74 (83)
-|||=+.-|-.-|.||+..+ .+-|+.||++.-.+-..=+. .+-|.|-+|-..|..||..|
T Consensus 902 RIQ~m~~iT~NaLRQQ~tNtE~RRLeKeiKeVLddfs~d~e--kK~klLTGkRV~LAEel~kv 962 (980)
T KOG0447|consen 902 RIQRMLAITANTLRQQLTNTEVRRLEKNVKEVLEDFAEDGE--KKIKLLTGKRVQLAEDLKKV 962 (980)
T ss_pred HHHHHHHHhHHHHHHHHhhHHHHHHHHHHHHHHHHhhhchh--hhheeeccchhHHHHHHHHH
Confidence 46888888999999998765 35677777765444433222 33445555555555555444
No 380
>PRK12714 flgK flagellar hook-associated protein FlgK; Provisional
Probab=21.57 E-value=2.9e+02 Score=23.44 Aligned_cols=13 Identities=8% Similarity=0.220 Sum_probs=4.9
Q ss_pred HHHHHHHHhhHHh
Q psy7799 48 KQVEEASNNAKVL 60 (83)
Q Consensus 48 ~~le~i~~~~K~l 60 (83)
.++..+..+.-.|
T Consensus 166 ~~IN~l~~~IA~L 178 (624)
T PRK12714 166 DEVNRLTQQIAKI 178 (624)
T ss_pred HHHHHHHHHHHHH
Confidence 3333333333333
No 381
>PF07765 KIP1: KIP1-like protein; InterPro: IPR011684 This is a group of sequences found exclusively in plants. They are similar to kinase interacting protein 1 (KIP1), which has been found to interact with the kinase domain of PRK1, a receptor-like kinase []. This particular region contains two coiled-coils, which are described as motifs involved in protein-protein interactions []. It has also been suggested that the coiled-coils of the protein allow it to dimerise in vivo [].
Probab=21.40 E-value=2.3e+02 Score=18.06 Aligned_cols=48 Identities=21% Similarity=0.324 Sum_probs=32.2
Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 31 RATDDMHEDIKQLDSHLKQ-VEEASNNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~-le~i~~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.+..||++.++.+-+=|+. =|....++.---+|=--|-+-++.|...|
T Consensus 18 ~~l~dmd~kvk~mlklieedgdSfakrAEmyy~kRp~Li~~vee~yr~Y 66 (74)
T PF07765_consen 18 ENLSDMDEKVKAMLKLIEEDGDSFAKRAEMYYKKRPELISLVEEFYRSY 66 (74)
T ss_pred HHHHHHHHHHHHHHHHhccCcchHHHhhHHHhcccHHHHHHHHHHHHHH
Confidence 4566777777776555543 13345555555556668888999999888
No 382
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=21.39 E-value=4.6e+02 Score=21.43 Aligned_cols=40 Identities=23% Similarity=0.277 Sum_probs=24.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 12 HQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVE 51 (83)
Q Consensus 12 hQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le 51 (83)
.|--||-.-=+.||..||++...=-..||-.|..++..+|
T Consensus 250 ~~~LqEEr~R~erLEeqlNd~~elHq~Ei~~LKqeLa~~E 289 (395)
T PF10267_consen 250 LEALQEERYRYERLEEQLNDLTELHQNEIYNLKQELASME 289 (395)
T ss_pred HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 3444455555667777777776666666666666665555
No 383
>PRK11546 zraP zinc resistance protein; Provisional
Probab=21.29 E-value=3.1e+02 Score=19.40 Aligned_cols=45 Identities=9% Similarity=0.072 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHH
Q psy7799 21 TFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAH 65 (83)
Q Consensus 21 tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~ 65 (83)
-...|.++.-.-+.||+.....=.-+=+.+..+.++...||.|..
T Consensus 62 ~t~~LRqqL~aKr~ELnALl~~~~pD~~kI~aL~kEI~~Lr~kL~ 106 (143)
T PRK11546 62 QTSALRQQLVSKRYEYNALLTANPPDSSKINAVAKEMENLRQSLD 106 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHHH
Confidence 334455555555555544333222233445555555556665444
No 384
>PF11101 DUF2884: Protein of unknown function (DUF2884); InterPro: IPR021307 Some members in this bacterial family of proteins are annotated as YggN which currently has no known function.
Probab=21.23 E-value=3.4e+02 Score=19.94 Aligned_cols=39 Identities=21% Similarity=0.344 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDM 73 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~ 73 (83)
...|......|..+|+ ..+..+++.|+.||..|=+.|..
T Consensus 169 l~~~~~~m~~l~~~ie--~~~~~q~~~le~~a~~lC~~l~~ 207 (229)
T PF11101_consen 169 LQAFEQRMEGLQQQIE--QEMEAQAQELEQKAQALCDSLQQ 207 (229)
T ss_pred HHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444 24555666666666655544443
No 385
>PF05461 ApoL: Apolipoprotein L; InterPro: IPR008405 Apo L belongs to the high density lipoprotein family that plays a central role in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as modulating gene transcription and signal transduction both in the adult brain and during neurodevelopment. There are six apo L genes located in close proximity to each other on chromosome 22q12 in humans. 22q12 is a confirmed high-susceptibility locus for schizophrenia and close to the region associated with velocardiofacial syndrome that includes symptoms of schizophrenia []. The various functions of apoL are still not entirely clear. Apolipoprotein L-I has been identified as a trypanolytic agent [] and displays similar phylogenetic distribution to the programmed cell death protein Bcl-2 and BH-3 domain-containing proteins, suggesting a possible role in apoptosis [].; GO: 0008289 lipid binding, 0006869 lipid transport, 0042157 lipoprotein metabolic process, 0005576 extracellular region
Probab=21.21 E-value=1.2e+02 Score=23.80 Aligned_cols=24 Identities=29% Similarity=0.481 Sum_probs=19.7
Q ss_pred HhhHHhhhhHHHHHHHHHHHHHHh
Q psy7799 55 NNAKVLRNKAHYLIKELDMFQEAY 78 (83)
Q Consensus 55 ~~~K~lrnKa~~L~~eL~~F~~~y 78 (83)
.-++.||.+|..||..|+....-|
T Consensus 287 e~Ae~LR~~A~eLE~~L~el~~~~ 310 (313)
T PF05461_consen 287 ESAEELREQAQELEEKLEELTQIY 310 (313)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 446789999999999998887766
No 386
>PF00996 GDI: GDP dissociation inhibitor; InterPro: IPR018203 Rab proteins constitute a family of small GTPases that serve a regulatory role in vesicular membrane traffic [, ]; C-terminal geranylgeranylation is crucial for their membrane association and function. This post-translational modification is catalysed by Rab geranylgeranyl transferase (Rab-GGTase), a multi-subunit enzyme that contains a catalytic heterodimer and an accessory component, termed Rab escort protein (REP)-1 []. REP-1 presents newly- synthesised Rab proteins to the catalytic component, and forms a stable complex with the prenylated proteins following the transfer reaction. The mechanism of REP-1-mediated membrane association of Rab5 is similar to that mediated by Rab GDP dissociation inhibitor (GDI). REP-1 and Rab GDI also share other functional properties, including the ability to inhibit the release of GDP and to remove Rab proteins from membranes. The crystal structure of the bovine alpha-isoform of Rab GDI has been determined to a resolution of 1.81A []. The protein is composed of two main structural units: a large complex multi-sheet domain I, and a smaller alpha-helical domain II. The structural organisation of domain I is closely related to FAD-containing monooxygenases and oxidases []. Conserved regions common to GDI and the choroideraemia gene product, which delivers Rab to catalytic subunits of Rab geranylgeranyltransferase II, are clustered on one face of the domain []. The two most conserved regions form a compact structure at the apex of the molecule; site-directed mutagenesis has shown these regions to play a critical role in the binding of Rab proteins [].; PDB: 1VG9_C 1VG0_A 1LTX_R 3P1W_A 3CPH_H 3CPJ_G 3CPI_H 1UKV_G 2BCG_G 1GND_A ....
Probab=21.12 E-value=46 Score=27.18 Aligned_cols=15 Identities=33% Similarity=0.549 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDR 31 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~ 31 (83)
||.+.|+|+||.-++
T Consensus 233 ELpQ~FcRl~AV~GG 247 (438)
T PF00996_consen 233 ELPQAFCRLSAVYGG 247 (438)
T ss_dssp HHHHHHHHHHHHTT-
T ss_pred cHHHHHHHHhhhcCc
Confidence 899999999997554
No 387
>cd04769 HTH_MerR2 Helix-Turn-Helix DNA binding domain of MerR2-like transcription regulators. Helix-turn-helix (HTH) transcription regulator MerR2 and related proteins. MerR2 in Bacillus cereus RC607 regulates resistance to organomercurials. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=21.10 E-value=2.4e+02 Score=18.16 Aligned_cols=27 Identities=0% Similarity=0.234 Sum_probs=14.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKV 59 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~ 59 (83)
..=|...++.++.+|..|+.....++.
T Consensus 81 ~~~l~~~~~~l~~~i~~l~~~~~~l~~ 107 (116)
T cd04769 81 QQALEDKKQEIRAQITELQQLLARLDA 107 (116)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555566666666655555554443
No 388
>COG1382 GimC Prefoldin, chaperonin cofactor [Posttranslational modification, protein turnover, chaperones]
Probab=21.10 E-value=2.9e+02 Score=19.02 Aligned_cols=39 Identities=23% Similarity=0.413 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHH
Q psy7799 31 RATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 31 ~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~ 72 (83)
.+..+|++.+.-|+.+|..|+.=+ +.++.+..-|.++|.
T Consensus 70 ~~~~eL~er~E~Le~ri~tLekQe---~~l~e~l~eLq~~i~ 108 (119)
T COG1382 70 EAVDELEERKETLELRIKTLEKQE---EKLQERLEELQSEIQ 108 (119)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH
Confidence 445555555555555555554322 233334444444443
No 389
>TIGR01554 major_cap_HK97 phage major capsid protein, HK97 family. This family represents the major capsid protein component of the heads (capsids) of bacteriophage HK97, phi-105, P27, and related phage. This model represents one of several analogous families lacking detectable sequence similarity. The gene encoding this component is typically located in an operon encoding the small and large terminase subunits, the portal protein and the prohead or maturation protease.
Probab=20.95 E-value=3.9e+02 Score=20.53 Aligned_cols=16 Identities=13% Similarity=0.133 Sum_probs=6.3
Q ss_pred HHHHHHHHHHHHHHHH
Q psy7799 17 ELSSTFARLCQLVDRA 32 (83)
Q Consensus 17 EL~~tf~rL~~~Vd~~ 32 (83)
||..-+..+...+...
T Consensus 3 el~~~~~~~~~~~r~l 18 (378)
T TIGR01554 3 ELKEQREEIVAEIRSL 18 (378)
T ss_pred hHHHHHHHHHHHHHHH
Confidence 3344444444443333
No 390
>TIGR01126 pdi_dom protein disulfide-isomerase domain. This model describes a domain of eukaryotic protein disulfide isomerases, generally found in two copies. The high cutoff for total score reflects the expectation of finding both copies. The domain is similar to thioredoxin but the redox-active disulfide region motif is APWCGHCK.
Probab=20.92 E-value=48 Score=19.48 Aligned_cols=22 Identities=23% Similarity=0.628 Sum_probs=14.5
Q ss_pred chhhhhhchhHHHHHHHHHHHHHH
Q psy7799 2 IVDLTSSNCSHQVEQELSSTFARL 25 (83)
Q Consensus 2 iV~~tssncshQVqqEL~~tf~rL 25 (83)
+|.++++.|.+ .+.+...|.++
T Consensus 17 ~i~f~~~~C~~--c~~~~~~~~~~ 38 (102)
T TIGR01126 17 LVEFYAPWCGH--CKNLAPEYEKL 38 (102)
T ss_pred EEEEECCCCHH--HHhhChHHHHH
Confidence 67889999988 44444444444
No 391
>COG4467 Regulator of replication initiation timing [Replication, recombination, and repair]
Probab=20.82 E-value=3e+02 Score=19.04 Aligned_cols=51 Identities=12% Similarity=0.089 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHH
Q psy7799 16 QELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHY 66 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~ 66 (83)
.|+=..+.+|..+++++.+++.+=-+.|..=|+..-.++-.-..||.+.+.
T Consensus 4 keiFd~v~~le~~l~~l~~el~~lK~~l~~lvEEN~~L~lENe~LR~RL~~ 54 (114)
T COG4467 4 KEIFDQVDNLEEQLGVLLAELGGLKQHLGSLVEENTALRLENEKLRERLGE 54 (114)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHhhHHHHHHHhCC
Confidence 577788889999999988887766666666666666777777888877665
No 392
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=20.76 E-value=5.6e+02 Score=22.62 Aligned_cols=12 Identities=33% Similarity=0.445 Sum_probs=5.0
Q ss_pred HHHHHHHHHHHH
Q psy7799 64 AHYLIKELDMFQ 75 (83)
Q Consensus 64 a~~L~~eL~~F~ 75 (83)
.-.|+++|+.|.
T Consensus 452 ie~L~~~l~~~~ 463 (652)
T COG2433 452 IEKLESELERFR 463 (652)
T ss_pred HHHHHHHHHHHH
Confidence 333444444443
No 393
>PRK10869 recombination and repair protein; Provisional
Probab=20.66 E-value=4.8e+02 Score=21.72 Aligned_cols=40 Identities=18% Similarity=0.212 Sum_probs=20.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHH
Q psy7799 33 TDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELD 72 (83)
Q Consensus 33 ~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~ 72 (83)
..+++.|++.|+.--..++.++.+.+.++.+..-+..+|-
T Consensus 326 ~~~l~~eL~~L~~~e~~l~~Le~e~~~l~~~l~~~A~~LS 365 (553)
T PRK10869 326 HQQLLEEQQQLDDQEDDLETLALAVEKHHQQALETAQKLH 365 (553)
T ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555544444455555555555555544444444
No 394
>PRK14160 heat shock protein GrpE; Provisional
Probab=20.61 E-value=2.6e+02 Score=20.87 Aligned_cols=10 Identities=10% Similarity=0.006 Sum_probs=4.5
Q ss_pred HHHHHHHhcc
Q psy7799 71 LDMFQEAYLV 80 (83)
Q Consensus 71 L~~F~~~yL~ 80 (83)
...|....|+
T Consensus 110 ~e~~~~~LLp 119 (211)
T PRK14160 110 CEDVLKELLP 119 (211)
T ss_pred HHHHHHHHhh
Confidence 4444444443
No 395
>KOG3433|consensus
Probab=20.59 E-value=3.8e+02 Score=20.22 Aligned_cols=16 Identities=44% Similarity=0.254 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHH
Q psy7799 14 VEQELSSTFARLCQLV 29 (83)
Q Consensus 14 VqqEL~~tf~rL~~~V 29 (83)
|.|||....++..+-.
T Consensus 82 ~~qeLe~~L~~~~qk~ 97 (203)
T KOG3433|consen 82 VLQELESQLATGSQKK 97 (203)
T ss_pred HHHHHHHHHHHhhhhH
Confidence 5566666666655443
No 396
>PF09261 Alpha-mann_mid: Alpha mannosidase, middle domain; InterPro: IPR015341 Members of this entry belong to the glycosyl hydrolase family 38, This domain, which is found in the central region adopts a structure consisting of three alpha helices, in an immunoglobulin/albumin-binding domain-like fold. The domain is predominantly found in the enzyme alpha-mannosidase []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0008270 zinc ion binding; PDB: 1O7D_C 3LVT_A 3CZN_A 2FYV_A 3D50_A 3EJU_A 3EJS_A 3DX3_A 3BVX_A 3BUQ_A ....
Probab=20.58 E-value=2.1e+02 Score=17.24 Aligned_cols=48 Identities=23% Similarity=0.284 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 32 ATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 32 ~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
++..+|.-++.++..+..+|.+..-+....+...+...+|+..-+.-+
T Consensus 9 Sr~~~K~~~r~~e~~L~~~E~l~~~~~~~~~~~~~~~~~l~~~w~~l~ 56 (80)
T PF09261_consen 9 SRPDIKQLNRRAENLLRAAEPLAALAALLGNGGDYPQEELEKAWKALL 56 (80)
T ss_dssp STHHHHHHHHHHHHHHHCHHHHHHHCHHH--HSG-HHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhHhccCccccHHHHHHHHHHHH
Confidence 355788888899999999999999888885444788888888877654
No 397
>PF08287 DASH_Spc19: Spc19; InterPro: IPR013251 Spc19 is a component of the DASH complex. The DASH complex associates with the spindle pole body and is important for spindle and kinetochore integrity during cell division [, ].
Probab=20.54 E-value=3.1e+02 Score=19.22 Aligned_cols=48 Identities=17% Similarity=0.335 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHH----HHHHHHH-HHHHHHHHhhHHhhhhHHHHHHHHHHHHH
Q psy7799 29 VDRATDDMHEDI----KQLDSHL-KQVEEASNNAKVLRNKAHYLIKELDMFQE 76 (83)
Q Consensus 29 Vd~~~~eL~~ei----~~L~~~i-~~le~i~~~~K~lrnKa~~L~~eL~~F~~ 76 (83)
+..++..+.+|| +.|-.++ ..|+.++++-..|+.|+...+..|.+...
T Consensus 51 l~~Aq~~l~~EI~P~I~~Ll~k~e~~l~kL~Rr~~tL~ak~EL~~~RL~~~~~ 103 (153)
T PF08287_consen 51 LQAAQQSLRDEIEPQINHLLDKAEKHLEKLQRREETLKAKCELQQGRLSNYES 103 (153)
T ss_pred HHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc
Confidence 445555555554 4554444 46889999999999999999999987654
No 398
>KOG0161|consensus
Probab=20.53 E-value=3.7e+02 Score=26.58 Aligned_cols=48 Identities=23% Similarity=0.328 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----hHHhhhhHHHHHHHHHH
Q psy7799 26 CQLVDRATDDMHEDIKQLDSHLKQVEEASNN-----AKVLRNKAHYLIKELDM 73 (83)
Q Consensus 26 ~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~-----~K~lrnKa~~L~~eL~~ 73 (83)
++..+.....|+..+++|..+++-+|..... .-+|..|...||++|+.
T Consensus 1767 ~~~le~~k~~LE~~~kdLq~rL~e~E~~a~~~~k~~i~~Learir~LE~~l~~ 1819 (1930)
T KOG0161|consen 1767 SQKLERLKKSLERQVKDLQLRLDEAEQAALKGGKKQIAKLEARIRELESELEG 1819 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHhH
No 399
>PF09766 FimP: Fms-interacting protein; InterPro: IPR019163 This entry represents Thoc5 which is one of the subunits of the THO complex, which additionally contains: HPR1, Thoc2, Thoc6 and Thoc7. The evolutionarily conserved multisubunit THO complex, which is recruited to actively transcribed genes is required for the efficient expression of genes that have internal tandem repeats. It is suggested that the THO complex functions to rectify aberrant structures that arise during transcription [, ] and is required for cell proliferation and for proper export of heat-shock mRNAs under heat stress []. This entry also identifies the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes [].
Probab=20.48 E-value=4.3e+02 Score=20.77 Aligned_cols=55 Identities=20% Similarity=0.176 Sum_probs=25.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhH
Q psy7799 10 CSHQVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKA 64 (83)
Q Consensus 10 cshQVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa 64 (83)
|+.+.+++|......|.+.-.....+.+..-+.|..==..|+.+...++.++...
T Consensus 98 ~EL~~Rk~L~~~~~el~~~k~~l~~~~~~k~~~L~~l~~~L~~l~~a~~plq~~l 152 (355)
T PF09766_consen 98 FELEQRKRLEEQLKELEQRKKKLQQENKKKKKFLDSLPPQLKSLKKAAKPLQEYL 152 (355)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHh
Confidence 4444555555555555554444444443333333333334444455555555443
No 400
>PF00429 TLV_coat: ENV polyprotein (coat polyprotein); InterPro: IPR018154 Enveloped viruses such as Human immunodeficiency virus 1, influenza virus, and Ebola virus sp. express a surface glycoprotein that mediates both cell attachment and fusion of viral and cellular membranes. The ENV polyprotein (coat polyprotein) usually contains two coat proteins which differ depending on the source. The structure of a number of the ENV polyprotein domains have been determined: The crystal structure of an extraviral segment of the Moloney murine leukemia virus (MoMuLV) transmembrane (TM) subunit has been determined to 1.7-A resolution. This segment contains a trimeric coiled coil, with a hydrophobic cluster at its base and a strand that packs in an antiparallel orientation against the coiled coil. This structure serves as a model for a wide range of viral fusion proteins; key residues in this structure are conserved among C- and D-type retroviruses and the filovirus ebola []. An essential step in retrovirus infection is the binding of the virus to its receptor on a target cell. The structure of the receptor-binding domain of the envelope glycoprotein from Friend murine leukemia virus (F-MuLV) has been determined determined to 2.0-A resolution. The core of the domain is an antiparallel beta sandwich, with two interstrand loops forming a helical subdomain atop the sandwich. The residues in the helical region, but not in the beta sandwich, are highly variable among mammalian C-type retroviruses with distinct tropisms, indicating that the helical subdomain determines the receptor specificity of the virus []. ; PDB: 1LCS_B 1MOF_A 1XNL_A 2XZ3_A 1AOL_A 1Y4M_C.
Probab=20.42 E-value=3.2e+02 Score=23.05 Aligned_cols=34 Identities=15% Similarity=0.408 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy7799 19 SSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEE 52 (83)
Q Consensus 19 ~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~ 52 (83)
...|..|.++++...+++++-|+.|.+++..|-+
T Consensus 423 ~~~~~~L~~~~~~d~~~~~~~i~~l~~~~~sl~~ 456 (561)
T PF00429_consen 423 TQQYRQLSNALEEDLQALEDSISALQEQLTSLAE 456 (561)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3568888899999999999999998888888765
No 401
>PF09753 Use1: Membrane fusion protein Use1; InterPro: IPR019150 This entry represents a family of proteins, approximately 300 residues in length, involved in vesicle transport. They have a single C-terminal transmembrane domain and a SNARE [soluble NSF (N-ethylmaleimide-sensitive fusion protein) attachment protein receptor] domain of approximately 60 residues. The SNARE domains are essential for membrane fusion and are conserved from yeasts to humans. Use1 is one of the three protein subunits that make up the SNARE complex and it is specifically required for Golgi-endoplasmic reticulum retrograde transport [].
Probab=20.38 E-value=3.6e+02 Score=19.80 Aligned_cols=51 Identities=18% Similarity=0.117 Sum_probs=40.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhH
Q psy7799 14 VEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKA 64 (83)
Q Consensus 14 VqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa 64 (83)
+..||.+....|.+.+-.+.+-|+++.+-|+.-...+|.-....+.-..|.
T Consensus 168 L~~em~~La~~LK~~s~~~~~~l~~D~~~L~~~~~~~d~n~~~l~~~~~rl 218 (251)
T PF09753_consen 168 LTEEMLSLARQLKENSLAFSQILKEDNKVLDRTEEGLDRNLSSLKRESKRL 218 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 667899999999999999999999999888887777776666655555443
No 402
>PF10398 DUF2443: Protein of unknown function (DUF2443); InterPro: IPR019469 This entry represents a small group of highly conserved proteins from bacteria, in particular Helicobacter species. The structure is a bundle of alpha helices. The function is not known. ; PDB: 1ZKE_F.
Probab=20.31 E-value=2.3e+02 Score=18.42 Aligned_cols=21 Identities=14% Similarity=0.536 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy7799 36 MHEDIKQLDSHLKQVEEASNN 56 (83)
Q Consensus 36 L~~ei~~L~~~i~~le~i~~~ 56 (83)
+.+++..|...|+.|.++.++
T Consensus 55 ideeV~~LKe~IdaLNK~KkE 75 (79)
T PF10398_consen 55 IDEEVEKLKEHIDALNKIKKE 75 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHGG
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 445566666666666666544
No 403
>KOG4673|consensus
Probab=20.31 E-value=5.5e+02 Score=23.45 Aligned_cols=48 Identities=17% Similarity=0.163 Sum_probs=36.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhh
Q psy7799 13 QVEQELSSTFARLCQLVDRATDDMHEDIKQLDSHLKQVEEASNNAKVLR 61 (83)
Q Consensus 13 QVqqEL~~tf~rL~~~Vd~~~~eL~~ei~~L~~~i~~le~i~~~~K~lr 61 (83)
|.++.|.....|+|---++.. -+.+|+..|..++.+||....+.|.+-
T Consensus 715 ~l~~~le~e~nr~~~~~~e~~-~~qeE~~~l~~r~~~le~e~r~~k~~~ 762 (961)
T KOG4673|consen 715 QLSKSLEKERNRAAENRQEYL-AAQEEADTLEGRANQLEVEIRELKRKH 762 (961)
T ss_pred HHHHHHHHHHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 778888888888886655543 467788888888888888887777653
No 404
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=20.30 E-value=5.9e+02 Score=22.26 Aligned_cols=46 Identities=9% Similarity=0.138 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhhHHHHHHHHHHHH
Q psy7799 30 DRATDDMHEDIKQLDSHLKQVEEASNNAKVLRNKAHYLIKELDMFQ 75 (83)
Q Consensus 30 d~~~~eL~~ei~~L~~~i~~le~i~~~~K~lrnKa~~L~~eL~~F~ 75 (83)
|..++.+..=......+-+.|..+.++++..|.+-..||.+|..=.
T Consensus 470 e~Lq~Kl~~L~~aRq~DKq~l~~LEkrL~eE~~~R~~lEkQL~eEr 515 (697)
T PF09726_consen 470 EQLQNKLQNLVQARQQDKQSLQQLEKRLAEERRQRASLEKQLQEER 515 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333344445566677888888888888888886543
No 405
>COG4896 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.18 E-value=13 Score=23.44 Aligned_cols=20 Identities=25% Similarity=0.471 Sum_probs=14.7
Q ss_pred HHHHHHHHHHHhhHHhhhhH
Q psy7799 45 SHLKQVEEASNNAKVLRNKA 64 (83)
Q Consensus 45 ~~i~~le~i~~~~K~lrnKa 64 (83)
+.+.-+|.-+..+|.||||-
T Consensus 9 d~v~~iD~rt~~tKrLrN~P 28 (68)
T COG4896 9 DRVDEIDNRTFKTKRLRNKP 28 (68)
T ss_pred cceeeecchhHHHHHhhCCC
Confidence 34556677788899999873
No 406
>COG3352 FlaC Putative archaeal flagellar protein C [Cell motility and secretion]
Probab=20.15 E-value=3.5e+02 Score=19.64 Aligned_cols=30 Identities=17% Similarity=0.188 Sum_probs=14.2
Q ss_pred HHHHHHhhHHhhhhHHHHHHHHHHHHHHhc
Q psy7799 50 VEEASNNAKVLRNKAHYLIKELDMFQEAYL 79 (83)
Q Consensus 50 le~i~~~~K~lrnKa~~L~~eL~~F~~~yL 79 (83)
||.+.++.|.|..=...+.++++-|.+.|.
T Consensus 81 lerLe~~iKdl~~lye~Vs~d~Npf~s~~~ 110 (157)
T COG3352 81 LERLEENIKDLVSLYELVSRDFNPFMSKTP 110 (157)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhHHhhhH
Confidence 333334444444444455555555555554
No 407
>PF04899 MbeD_MobD: MbeD/MobD like ; InterPro: IPR006983 The MbeD and MobD proteins are plasmid encoded, and are involved in the plasmid mobilisation and transfer in the presence of conjugative plasmids [].
Probab=20.07 E-value=2.4e+02 Score=17.60 Aligned_cols=36 Identities=19% Similarity=0.526 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH
Q psy7799 16 QELSSTFARLCQLVDRATDD---MHEDIKQLDSHLKQVE 51 (83)
Q Consensus 16 qEL~~tf~rL~~~Vd~~~~e---L~~ei~~L~~~i~~le 51 (83)
++..+.|+.|...-+.+.++ |...++.|..+++.|.
T Consensus 24 ~~Wq~sy~~Lq~~~~~t~~~~a~L~~qv~~Ls~qv~~Ls 62 (70)
T PF04899_consen 24 QEWQSSYADLQHMFEQTSQENAALSEQVNNLSQQVQRLS 62 (70)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555555544432 3444444444444443
Done!