Query psy8735
Match_columns 115
No_of_seqs 108 out of 140
Neff 6.3
Searched_HMMs 46136
Date Fri Aug 16 17:52:45 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8735.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8735hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4603|consensus 100.0 1.2E-29 2.7E-34 186.4 12.6 102 2-105 100-201 (201)
2 PF07106 TBPIP: Tat binding pr 99.8 1.4E-18 2.9E-23 126.9 10.0 76 3-80 94-169 (169)
3 PF07061 Swi5: Swi5; InterPro 97.2 0.0086 1.9E-07 39.5 9.9 78 23-106 5-82 (83)
4 PF03962 Mnd1: Mnd1 family; I 93.6 2.3 4.9E-05 31.8 11.9 75 23-106 108-185 (188)
5 PF05812 Herpes_BLRF2: Herpesv 92.6 0.62 1.4E-05 32.7 6.3 49 17-67 2-55 (118)
6 PHA03162 hypothetical protein; 92.5 0.94 2E-05 32.4 7.2 50 16-67 11-65 (135)
7 PHA03155 hypothetical protein; 92.2 1 2.2E-05 31.5 6.9 50 16-67 6-56 (115)
8 PF10552 ORF6C: ORF6C domain; 89.0 5.6 0.00012 27.2 8.4 65 24-91 7-83 (116)
9 PF05008 V-SNARE: Vesicle tran 87.2 5.4 0.00012 25.0 9.7 73 5-80 6-78 (79)
10 PF01486 K-box: K-box region; 86.5 3.7 8.1E-05 27.2 6.2 56 20-77 14-70 (100)
11 PF14282 FlxA: FlxA-like prote 86.3 4.4 9.5E-05 27.5 6.5 43 3-45 27-71 (106)
12 PF06698 DUF1192: Protein of u 85.4 2.1 4.7E-05 26.5 4.1 30 15-44 18-47 (59)
13 PF05377 FlaC_arch: Flagella a 81.7 8.3 0.00018 23.6 5.6 39 4-44 2-40 (55)
14 PF05837 CENP-H: Centromere pr 80.2 15 0.00034 24.8 9.0 76 5-84 6-86 (106)
15 PF08317 Spc7: Spc7 kinetochor 78.5 13 0.00027 29.8 7.3 20 47-68 269-288 (325)
16 COG5200 LUC7 U1 snRNP componen 76.0 23 0.0005 27.6 7.7 83 17-102 106-189 (258)
17 PF12808 Mto2_bdg: Micro-tubul 74.5 16 0.00035 22.0 5.6 44 3-46 5-50 (52)
18 PF10376 Mei5: Double-strand r 74.4 33 0.00072 26.4 8.4 83 18-106 131-216 (221)
19 KOG3433|consensus 73.8 37 0.0008 25.8 8.2 49 53-106 150-198 (203)
20 PRK11546 zraP zinc resistance 73.8 9.3 0.0002 27.6 4.9 41 26-69 62-102 (143)
21 PF15188 CCDC-167: Coiled-coil 73.2 20 0.00043 23.7 6.0 45 24-70 4-50 (85)
22 KOG4302|consensus 72.7 21 0.00046 31.8 7.7 65 20-86 56-133 (660)
23 PF02344 Myc-LZ: Myc leucine z 72.5 10 0.00023 20.6 3.7 25 23-47 6-30 (32)
24 PF07106 TBPIP: Tat binding pr 72.5 33 0.00071 24.6 7.9 59 10-75 73-131 (169)
25 PRK10132 hypothetical protein; 70.2 32 0.00069 23.6 7.3 30 16-45 3-32 (108)
26 COG5124 Protein predicted to b 70.0 46 0.001 25.3 10.5 26 82-107 178-203 (209)
27 PF12958 DUF3847: Protein of u 69.4 10 0.00022 25.1 4.0 28 19-46 2-29 (86)
28 COG5509 Uncharacterized small 69.1 9.9 0.00022 23.8 3.6 30 15-44 22-51 (65)
29 PRK02119 hypothetical protein; 68.5 21 0.00046 22.7 5.2 44 1-44 1-49 (73)
30 PF04977 DivIC: Septum formati 66.4 23 0.00051 21.6 5.1 25 21-45 27-51 (80)
31 PRK13729 conjugal transfer pil 66.2 14 0.00031 31.6 5.2 42 4-45 78-124 (475)
32 PF10805 DUF2730: Protein of u 66.2 13 0.00028 25.2 4.1 41 5-45 52-92 (106)
33 PF06810 Phage_GP20: Phage min 65.1 34 0.00074 24.7 6.4 23 24-46 26-48 (155)
34 PRK11020 hypothetical protein; 64.4 21 0.00046 25.0 4.9 44 3-46 6-52 (118)
35 PF10234 Cluap1: Clusterin-ass 63.9 56 0.0012 25.9 7.8 83 5-91 172-259 (267)
36 PRK13182 racA polar chromosome 63.6 29 0.00064 25.7 5.9 42 4-45 101-145 (175)
37 PF09744 Jnk-SapK_ap_N: JNK_SA 63.1 57 0.0012 23.8 8.3 62 21-85 92-153 (158)
38 smart00338 BRLZ basic region l 61.9 33 0.00071 20.7 5.0 26 20-45 28-53 (65)
39 KOG4001|consensus 61.7 60 0.0013 25.3 7.4 72 13-86 180-254 (259)
40 PF15290 Syntaphilin: Golgi-lo 60.5 18 0.00038 29.2 4.4 29 15-43 114-142 (305)
41 PF08614 ATG16: Autophagy prot 59.6 55 0.0012 24.1 6.8 22 54-77 165-186 (194)
42 PF11932 DUF3450: Protein of u 58.9 18 0.00039 27.7 4.2 49 3-53 57-105 (251)
43 KOG4603|consensus 57.4 80 0.0017 23.9 7.2 66 24-97 92-157 (201)
44 KOG4302|consensus 57.0 1.5E+02 0.0032 26.7 9.9 78 20-107 112-203 (660)
45 PF07716 bZIP_2: Basic region 56.8 34 0.00073 20.0 4.3 27 19-45 26-52 (54)
46 KOG3231|consensus 56.6 41 0.00088 25.3 5.5 57 49-107 113-175 (208)
47 PRK04325 hypothetical protein; 56.5 42 0.00092 21.3 5.0 26 19-44 24-49 (74)
48 COG1093 SUI2 Translation initi 56.1 26 0.00056 27.9 4.7 40 65-105 90-129 (269)
49 PRK09039 hypothetical protein; 55.7 1.1E+02 0.0024 24.8 9.6 26 20-45 139-164 (343)
50 PF13801 Metal_resist: Heavy-m 55.5 39 0.00085 21.6 4.9 60 26-91 60-119 (125)
51 cd00176 SPEC Spectrin repeats, 55.1 68 0.0015 22.2 7.4 80 5-86 17-109 (213)
52 KOG4797|consensus 54.5 27 0.00059 24.4 4.0 40 23-64 65-108 (123)
53 PF14389 Lzipper-MIP1: Leucine 54.5 27 0.00059 22.9 3.9 28 19-46 55-82 (88)
54 PF05557 MAD: Mitotic checkpoi 54.0 96 0.0021 27.5 8.4 62 24-87 565-644 (722)
55 PF10224 DUF2205: Predicted co 53.4 61 0.0013 21.1 5.5 43 4-48 25-67 (80)
56 smart00338 BRLZ basic region l 52.7 38 0.00083 20.4 4.2 36 4-41 28-63 (65)
57 PRK10884 SH3 domain-containing 51.8 44 0.00096 25.3 5.2 20 26-45 133-152 (206)
58 TIGR03185 DNA_S_dndD DNA sulfu 51.7 1.3E+02 0.0029 26.2 8.8 25 21-45 226-250 (650)
59 PF12252 SidE: Dot/Icm substra 51.4 90 0.0019 29.9 7.8 69 3-75 1041-1111(1439)
60 PF04156 IncA: IncA protein; 51.3 90 0.0019 22.5 7.4 60 21-83 91-150 (191)
61 PRK15178 Vi polysaccharide exp 51.2 86 0.0019 26.6 7.3 41 3-43 294-339 (434)
62 PF14282 FlxA: FlxA-like prote 51.1 54 0.0012 22.1 5.1 35 22-57 16-50 (106)
63 COG5185 HEC1 Protein involved 50.9 1.6E+02 0.0035 25.8 8.8 82 20-104 332-429 (622)
64 smart00787 Spc7 Spc7 kinetocho 50.5 1.3E+02 0.0027 24.3 7.9 20 48-69 265-284 (312)
65 COG3132 Uncharacterized protei 49.8 27 0.00058 26.6 3.7 27 20-46 187-213 (215)
66 PRK12901 secA preprotein trans 48.9 1E+02 0.0022 29.3 7.8 66 4-75 27-97 (1112)
67 cd04779 HTH_MerR-like_sg4 Heli 48.4 51 0.0011 23.2 4.8 40 18-57 74-113 (134)
68 PTZ00446 vacuolar sorting prot 47.2 1E+02 0.0023 23.2 6.6 59 22-89 31-89 (191)
69 COG1579 Zn-ribbon protein, pos 47.1 1.4E+02 0.003 23.4 8.3 82 4-91 33-120 (239)
70 PF10147 CR6_interact: Growth 47.0 68 0.0015 24.7 5.6 48 60-107 128-175 (217)
71 TIGR02263 benz_CoA_red_C benzo 46.9 58 0.0013 26.5 5.6 44 27-74 137-180 (380)
72 PF00170 bZIP_1: bZIP transcri 46.7 58 0.0013 19.6 4.3 24 21-44 29-52 (64)
73 PF11365 DUF3166: Protein of u 46.5 44 0.00095 22.6 4.0 25 22-46 19-43 (96)
74 PF06305 DUF1049: Protein of u 46.4 37 0.0008 20.4 3.4 32 14-45 37-68 (68)
75 PF01726 LexA_DNA_bind: LexA D 46.2 54 0.0012 20.2 4.1 31 74-104 7-38 (65)
76 PF07334 IFP_35_N: Interferon- 45.9 59 0.0013 21.1 4.4 26 21-46 3-28 (76)
77 PRK12750 cpxP periplasmic repr 45.8 1.2E+02 0.0026 22.2 6.6 25 47-73 133-157 (170)
78 COG3167 PilO Tfp pilus assembl 45.4 39 0.00086 25.9 4.0 44 1-44 48-92 (211)
79 PF13514 AAA_27: AAA domain 45.3 1.2E+02 0.0025 28.4 7.8 81 5-90 892-973 (1111)
80 PF03938 OmpH: Outer membrane 45.2 44 0.00095 23.3 4.1 51 22-74 40-94 (158)
81 PRK00888 ftsB cell division pr 44.6 51 0.0011 22.2 4.2 25 20-44 29-53 (105)
82 PF06156 DUF972: Protein of un 43.6 83 0.0018 21.4 5.1 77 4-84 10-94 (107)
83 KOG2916|consensus 43.5 53 0.0011 26.5 4.6 50 48-107 87-136 (304)
84 cd07627 BAR_Vps5p The Bin/Amph 43.4 1.2E+02 0.0027 22.5 6.6 61 25-94 143-203 (216)
85 PF07544 Med9: RNA polymerase 43.3 64 0.0014 20.7 4.4 28 19-46 53-80 (83)
86 PF15469 Sec5: Exocyst complex 42.9 52 0.0011 23.7 4.3 50 50-101 96-148 (182)
87 PF09457 RBD-FIP: FIP domain ; 42.5 72 0.0016 18.8 4.4 26 19-44 1-33 (48)
88 TIGR00606 rad50 rad50. This fa 42.2 2.9E+02 0.0063 26.4 10.0 12 88-99 294-305 (1311)
89 PF00038 Filament: Intermediat 41.7 95 0.0021 24.1 5.9 36 10-47 5-40 (312)
90 PF09006 Surfac_D-trimer: Lung 41.6 74 0.0016 18.7 4.3 25 21-45 2-26 (46)
91 PF00261 Tropomyosin: Tropomyo 41.4 1.6E+02 0.0034 22.4 7.7 41 4-44 143-188 (237)
92 PF04065 Not3: Not1 N-terminal 40.8 1.5E+02 0.0033 23.0 6.7 11 94-104 221-231 (233)
93 PF06667 PspB: Phage shock pro 40.1 74 0.0016 20.5 4.2 32 14-46 32-63 (75)
94 COG3007 Uncharacterized paraqu 39.8 27 0.00059 28.8 2.6 30 78-107 344-380 (398)
95 PF10211 Ax_dynein_light: Axon 39.7 1.2E+02 0.0027 22.4 6.0 33 14-46 116-148 (189)
96 PRK04406 hypothetical protein; 39.5 1E+02 0.0022 19.7 5.1 26 19-44 26-51 (75)
97 PF06013 WXG100: Proteins of 1 39.4 83 0.0018 18.7 5.7 28 18-45 4-31 (86)
98 PRK15396 murein lipoprotein; P 38.9 70 0.0015 20.7 4.0 36 11-46 18-53 (78)
99 PHA00728 hypothetical protein 38.7 55 0.0012 23.4 3.7 22 25-46 5-26 (151)
100 PF05308 Mito_fiss_reg: Mitoch 38.6 50 0.0011 25.9 3.8 29 16-44 113-141 (253)
101 KOG3647|consensus 38.5 2.2E+02 0.0047 23.2 8.8 28 20-47 135-162 (338)
102 PF07303 Occludin_ELL: Occludi 38.5 1.2E+02 0.0026 20.3 6.7 59 22-84 26-87 (101)
103 TIGR01010 BexC_CtrB_KpsE polys 38.1 1.6E+02 0.0035 23.4 6.8 40 4-43 223-267 (362)
104 PF06216 RTBV_P46: Rice tungro 37.7 1.2E+02 0.0026 24.5 5.8 57 4-64 73-131 (389)
105 PF07586 HXXSHH: Protein of un 37.6 1.4E+02 0.0031 23.3 6.3 42 4-46 164-205 (302)
106 PF07798 DUF1640: Protein of u 37.4 1.1E+02 0.0023 22.2 5.3 38 4-42 60-97 (177)
107 PF06156 DUF972: Protein of un 37.4 1.1E+02 0.0024 20.8 5.0 23 23-45 6-28 (107)
108 cd04790 HTH_Cfa-like_unk Helix 37.2 1.6E+02 0.0035 21.3 8.3 25 83-107 138-163 (172)
109 TIGR02169 SMC_prok_A chromosom 37.1 2.2E+02 0.0047 25.9 8.1 19 73-91 1014-1032(1164)
110 TIGR03017 EpsF chain length de 37.0 68 0.0015 26.1 4.5 45 11-55 164-208 (444)
111 PF07757 AdoMet_MTase: Predict 37.0 66 0.0014 22.4 3.8 24 80-104 20-43 (112)
112 PF13991 BssS: BssS protein fa 37.0 57 0.0012 21.0 3.2 23 17-39 41-63 (73)
113 PF07028 DUF1319: Protein of u 36.9 84 0.0018 22.3 4.4 34 24-57 59-95 (126)
114 PF08826 DMPK_coil: DMPK coile 36.6 1E+02 0.0023 19.0 5.1 25 21-45 35-59 (61)
115 PF08654 DASH_Dad2: DASH compl 36.3 1.3E+02 0.0028 20.4 5.1 41 4-50 6-46 (103)
116 PF08838 DUF1811: Protein of u 36.1 60 0.0013 22.3 3.4 23 15-37 4-26 (102)
117 PRK11239 hypothetical protein; 36.1 69 0.0015 24.8 4.1 28 19-46 184-211 (215)
118 PF01923 Cob_adeno_trans: Coba 35.9 1.6E+02 0.0035 21.0 8.1 63 4-68 32-95 (163)
119 TIGR00606 rad50 rad50. This fa 35.6 2.1E+02 0.0044 27.4 7.9 44 3-46 479-522 (1311)
120 PF08340 DUF1732: Domain of un 35.6 91 0.002 20.7 4.2 40 5-44 47-86 (87)
121 PF14265 DUF4355: Domain of un 35.1 1.1E+02 0.0024 20.6 4.8 21 23-43 47-67 (125)
122 PF12329 TMF_DNA_bd: TATA elem 34.7 1.2E+02 0.0026 19.1 4.9 39 4-44 14-52 (74)
123 PF04867 DUF643: Protein of un 34.6 1.2E+02 0.0026 21.0 4.7 40 47-88 24-63 (114)
124 PRK10363 cpxP periplasmic repr 34.6 1.7E+02 0.0037 21.6 5.9 25 48-74 121-145 (166)
125 COG4026 Uncharacterized protei 34.6 84 0.0018 24.8 4.4 39 7-45 117-162 (290)
126 PF07739 TipAS: TipAS antibiot 34.6 58 0.0013 21.2 3.2 70 26-101 29-98 (118)
127 PRK13169 DNA replication intia 34.5 1.2E+02 0.0027 20.8 4.9 75 6-86 19-93 (110)
128 TIGR03190 benz_CoA_bzdN benzoy 34.3 1.8E+02 0.0038 23.7 6.5 47 24-74 130-176 (377)
129 PHA02562 46 endonuclease subun 34.0 1.5E+02 0.0033 24.7 6.3 7 7-13 311-317 (562)
130 PRK14127 cell division protein 33.9 89 0.0019 21.5 4.1 24 22-45 41-64 (109)
131 TIGR01843 type_I_hlyD type I s 33.9 2.4E+02 0.0052 22.3 7.7 33 25-57 165-199 (423)
132 PF07820 TraC: TraC-like prote 33.8 76 0.0017 21.3 3.6 24 19-42 3-26 (92)
133 PF07373 CAMP_factor: CAMP fac 33.4 2.3E+02 0.005 22.1 6.6 55 27-84 127-181 (228)
134 PRK11637 AmiB activator; Provi 33.3 2.7E+02 0.0059 22.8 9.4 32 60-91 114-150 (428)
135 PRK06342 transcription elongat 33.2 1.1E+02 0.0025 22.1 4.8 41 5-45 37-84 (160)
136 PRK10265 chaperone-modulator p 33.2 82 0.0018 20.9 3.7 24 21-44 74-97 (101)
137 PF06717 DUF1202: Protein of u 33.0 2.6E+02 0.0055 22.8 7.0 58 2-61 138-198 (308)
138 COG4942 Membrane-bound metallo 33.0 3.1E+02 0.0067 23.3 8.8 82 4-88 40-126 (420)
139 PF08898 DUF1843: Domain of un 32.9 95 0.0021 18.8 3.6 21 20-40 33-53 (53)
140 PF08663 HalX: HalX domain; I 32.9 1.3E+02 0.0028 19.0 5.1 42 4-45 13-56 (71)
141 KOG4674|consensus 32.4 1.2E+02 0.0026 30.4 5.9 48 21-75 1246-1293(1822)
142 PRK11637 AmiB activator; Provi 32.4 2.8E+02 0.0062 22.8 9.4 27 60-86 107-133 (428)
143 KOG4196|consensus 32.2 1.5E+02 0.0033 21.3 5.1 39 6-46 78-116 (135)
144 PF09958 DUF2192: Uncharacteri 32.0 1E+02 0.0022 24.1 4.5 42 69-110 3-51 (231)
145 TIGR01837 PHA_granule_1 poly(h 31.8 78 0.0017 21.7 3.6 20 26-45 97-116 (118)
146 PRK05431 seryl-tRNA synthetase 31.8 1.4E+02 0.003 24.9 5.6 23 22-44 70-92 (425)
147 PF12729 4HB_MCP_1: Four helix 31.7 1.6E+02 0.0034 19.6 10.8 79 6-92 60-138 (181)
148 PF12711 Kinesin-relat_1: Kine 31.7 1.2E+02 0.0026 20.1 4.2 67 2-74 3-69 (86)
149 cd01040 globin Globins are hem 31.6 84 0.0018 20.6 3.7 42 52-99 1-42 (140)
150 KOG2391|consensus 31.2 1.5E+02 0.0033 24.6 5.5 47 3-51 226-272 (365)
151 PF13412 HTH_24: Winged helix- 31.1 91 0.002 17.2 3.3 26 80-105 6-31 (48)
152 PF02637 GatB_Yqey: GatB domai 31.1 68 0.0015 22.4 3.2 50 48-104 37-86 (148)
153 TIGR02209 ftsL_broad cell divi 30.9 1.2E+02 0.0025 18.9 4.1 23 22-44 28-50 (85)
154 COG1579 Zn-ribbon protein, pos 30.8 2.6E+02 0.0057 21.8 6.8 14 88-101 170-183 (239)
155 PF09726 Macoilin: Transmembra 30.6 2.8E+02 0.006 25.0 7.5 25 21-45 456-480 (697)
156 PF10239 DUF2465: Protein of u 30.3 2.5E+02 0.0055 22.7 6.7 30 49-80 183-212 (318)
157 PF09304 Cortex-I_coil: Cortex 29.9 1.7E+02 0.0037 20.2 4.9 40 5-46 12-51 (107)
158 PF08657 DASH_Spc34: DASH comp 29.8 1.5E+02 0.0033 23.3 5.2 32 4-37 182-213 (259)
159 PF03993 DUF349: Domain of Unk 29.8 1.3E+02 0.0029 18.1 4.2 21 61-81 3-23 (77)
160 cd00490 Met_repressor_MetJ Met 29.7 29 0.00064 23.4 1.1 20 88-107 82-101 (103)
161 PRK00295 hypothetical protein; 29.6 1.2E+02 0.0026 18.9 3.8 25 20-44 21-45 (68)
162 PF05381 Peptidase_C21: Tymovi 29.4 34 0.00074 23.5 1.4 27 81-107 20-48 (104)
163 PF13680 DUF4152: Protein of u 29.1 35 0.00075 25.9 1.5 27 80-106 106-132 (227)
164 TIGR00255 conserved hypothetic 29.1 1.5E+02 0.0032 23.7 5.2 41 4-44 250-290 (291)
165 PF06657 Cep57_MT_bd: Centroso 29.0 1.6E+02 0.0035 18.8 5.0 29 17-45 9-37 (79)
166 PF04102 SlyX: SlyX; InterPro 28.9 1.2E+02 0.0026 18.7 3.8 25 20-44 20-44 (69)
167 PTZ00007 (NAP-L) nucleosome as 28.8 3.1E+02 0.0067 22.5 7.0 36 55-92 63-112 (337)
168 PRK11820 hypothetical protein; 28.7 1.5E+02 0.0033 23.6 5.2 41 4-44 247-287 (288)
169 PF11616 EZH2_WD-Binding: WD r 28.7 87 0.0019 16.8 2.6 17 60-76 10-26 (30)
170 PF09602 PhaP_Bmeg: Polyhydrox 28.6 2.5E+02 0.0054 20.8 8.7 72 19-92 79-150 (165)
171 PF03670 UPF0184: Uncharacteri 28.4 1.6E+02 0.0035 19.4 4.4 35 9-45 26-60 (83)
172 PF03357 Snf7: Snf7; InterPro 28.0 2.1E+02 0.0045 19.8 5.7 23 22-44 12-34 (171)
173 PF03082 MAGSP: Male accessory 27.8 2.1E+02 0.0046 22.4 5.6 85 1-87 151-248 (264)
174 PRK04863 mukB cell division pr 27.7 6.1E+02 0.013 25.1 9.9 104 3-108 919-1053(1486)
175 PF12718 Tropomyosin_1: Tropom 27.6 2.3E+02 0.0049 20.1 6.3 18 24-41 79-96 (143)
176 PLN02678 seryl-tRNA synthetase 27.5 1.7E+02 0.0038 24.8 5.5 21 22-42 75-95 (448)
177 PF09032 Siah-Interact_N: Siah 27.5 40 0.00086 22.0 1.4 36 4-39 5-47 (79)
178 PRK00736 hypothetical protein; 27.3 1.4E+02 0.003 18.6 3.8 39 4-44 7-45 (68)
179 PF11853 DUF3373: Protein of u 27.2 70 0.0015 27.6 3.2 22 23-44 29-50 (489)
180 PF12718 Tropomyosin_1: Tropom 27.2 2.3E+02 0.005 20.0 5.5 14 4-17 16-29 (143)
181 COG2433 Uncharacterized conser 27.2 1.9E+02 0.0041 25.9 5.8 26 20-45 438-463 (652)
182 cd07596 BAR_SNX The Bin/Amphip 27.0 2.3E+02 0.0051 20.1 10.5 40 4-45 126-165 (218)
183 PF14915 CCDC144C: CCDC144C pr 26.6 2.3E+02 0.005 23.0 5.8 66 6-76 67-132 (305)
184 PF09602 PhaP_Bmeg: Polyhydrox 26.5 2.7E+02 0.0059 20.6 6.6 76 23-100 46-133 (165)
185 PF07262 DUF1436: Protein of u 26.5 37 0.0008 24.8 1.2 9 47-55 55-63 (158)
186 PF14989 CCDC32: Coiled-coil d 26.5 1.4E+02 0.003 21.7 4.2 25 33-57 57-81 (148)
187 PF00846 Hanta_nucleocap: Hant 26.5 2.9E+02 0.0063 23.4 6.5 25 83-109 66-90 (428)
188 PRK09973 putative outer membra 26.4 1.6E+02 0.0035 19.4 4.1 36 11-46 17-52 (85)
189 KOG4074|consensus 26.4 3.7E+02 0.0081 22.2 8.4 75 6-83 158-232 (383)
190 PF00570 HRDC: HRDC domain Blo 26.1 1.5E+02 0.0032 17.4 3.7 24 81-104 30-53 (68)
191 COG5185 HEC1 Protein involved 26.0 4.6E+02 0.0099 23.1 9.0 105 2-107 494-616 (622)
192 PF02697 DUF217: Uncharacteriz 26.0 97 0.0021 19.6 3.0 36 72-108 17-52 (71)
193 COG5570 Uncharacterized small 26.0 1.6E+02 0.0035 17.9 4.7 38 7-46 17-54 (57)
194 PF04728 LPP: Lipoprotein leuc 25.9 1.6E+02 0.0036 17.9 4.3 18 27-44 5-22 (56)
195 PF14712 Snapin_Pallidin: Snap 25.8 1.8E+02 0.0039 18.4 5.2 41 4-44 37-83 (92)
196 PF09325 Vps5: Vps5 C terminal 25.8 2.7E+02 0.0058 20.3 6.1 39 6-46 146-184 (236)
197 COG5457 Uncharacterized conser 25.1 1.8E+02 0.0039 18.1 4.7 32 68-107 21-52 (63)
198 PRK00846 hypothetical protein; 25.0 2E+02 0.0043 18.6 4.9 22 24-45 40-61 (77)
199 PRK14549 50S ribosomal protein 24.9 1.8E+02 0.0039 18.1 7.5 54 18-77 12-66 (69)
200 TIGR02894 DNA_bind_RsfA transc 24.8 2.9E+02 0.0063 20.4 7.2 12 60-71 136-147 (161)
201 PF06102 DUF947: Domain of unk 24.8 1.2E+02 0.0026 22.2 3.6 15 3-17 64-78 (168)
202 COG3783 CybC Soluble cytochrom 24.4 1.9E+02 0.0041 19.7 4.2 51 6-56 3-53 (100)
203 PRK02793 phi X174 lysis protei 24.4 1.6E+02 0.0036 18.4 3.8 25 20-44 24-48 (72)
204 COG3027 zapA Cell division pro 24.4 2.2E+02 0.0047 19.3 4.6 15 2-16 61-75 (105)
205 PRK03987 translation initiatio 24.1 1.4E+02 0.0029 23.5 4.1 38 68-106 90-127 (262)
206 TIGR02168 SMC_prok_B chromosom 23.8 4.2E+02 0.0091 23.9 7.6 22 80-101 1031-1052(1179)
207 PRK13169 DNA replication intia 23.7 2.5E+02 0.0054 19.3 5.1 25 21-45 25-49 (110)
208 KOG0994|consensus 23.7 4.6E+02 0.01 25.9 7.8 45 28-74 1197-1243(1758)
209 PRK01203 prefoldin subunit alp 23.6 1.5E+02 0.0033 21.0 3.9 38 6-45 4-41 (130)
210 PF00831 Ribosomal_L29: Riboso 23.6 1.7E+02 0.0038 17.4 6.9 48 17-70 6-53 (58)
211 PF03938 OmpH: Outer membrane 23.5 2.5E+02 0.0055 19.3 8.6 15 17-31 72-86 (158)
212 PF14335 DUF4391: Domain of un 23.5 62 0.0013 24.4 2.0 22 23-44 180-201 (221)
213 COG1644 RPB10 DNA-directed RNA 23.3 1.1E+02 0.0025 19.1 2.8 17 86-104 25-41 (63)
214 PRK03918 chromosome segregatio 23.1 5.4E+02 0.012 22.9 8.2 98 2-101 659-756 (880)
215 PF15463 ECM11: Extracellular 23.1 1.9E+02 0.0041 20.2 4.3 28 22-49 109-136 (139)
216 PRK10884 SH3 domain-containing 23.1 2.3E+02 0.005 21.4 5.0 42 4-45 102-145 (206)
217 PF12325 TMF_TATA_bd: TATA ele 23.0 2.7E+02 0.0058 19.3 7.7 22 23-44 21-42 (120)
218 PHA01750 hypothetical protein 22.9 2.1E+02 0.0047 18.2 4.1 17 25-41 56-72 (75)
219 PF10458 Val_tRNA-synt_C: Valy 22.9 1.9E+02 0.0041 17.5 8.0 24 22-45 1-24 (66)
220 PF12824 MRP-L20: Mitochondria 22.9 3.1E+02 0.0067 20.0 6.3 62 17-81 85-155 (164)
221 PF08946 Osmo_CC: Osmosensory 22.7 1.8E+02 0.0038 17.1 3.7 22 23-44 17-38 (46)
222 PF07407 Seadorna_VP6: Seadorn 22.7 2E+02 0.0043 24.0 4.8 26 21-46 35-60 (420)
223 PRK00306 50S ribosomal protein 22.7 1.9E+02 0.0042 17.6 7.2 53 17-75 8-60 (66)
224 PF02954 HTH_8: Bacterial regu 22.6 1.4E+02 0.0031 16.3 2.9 26 80-105 7-32 (42)
225 PF01340 MetJ: Met Apo-repress 22.6 44 0.00096 22.6 0.9 20 88-107 82-101 (104)
226 KOG4191|consensus 22.6 1.5E+02 0.0032 25.7 4.2 73 5-89 404-487 (516)
227 PF06428 Sec2p: GDP/GTP exchan 22.5 2.5E+02 0.0055 18.9 5.6 39 5-44 18-56 (100)
228 smart00502 BBC B-Box C-termina 22.4 2.2E+02 0.0048 18.2 6.6 28 60-87 43-70 (127)
229 TIGR02905 spore_yutH spore coa 22.4 2.1E+02 0.0045 22.4 4.8 55 36-94 97-151 (313)
230 COG4079 Uncharacterized protei 22.4 4.1E+02 0.0088 21.3 6.3 33 60-94 187-219 (293)
231 PF10359 Fmp27_WPPW: RNA pol I 22.3 1.9E+02 0.0041 24.5 4.8 23 22-44 197-219 (475)
232 PF02183 HALZ: Homeobox associ 22.3 1.7E+02 0.0037 16.8 4.2 26 21-46 15-40 (45)
233 TIGR02976 phageshock_pspB phag 22.3 1.7E+02 0.0036 18.8 3.5 26 21-46 38-63 (75)
234 PRK12423 LexA repressor; Provi 22.2 1.8E+02 0.0039 21.4 4.3 31 73-103 6-37 (202)
235 PRK13848 conjugal transfer pro 22.2 1.6E+02 0.0035 20.0 3.5 51 19-74 4-62 (98)
236 PF01166 TSC22: TSC-22/dip/bun 22.2 2.1E+02 0.0045 17.7 4.4 25 22-46 18-42 (59)
237 PF10360 DUF2433: Protein of u 22.2 1.5E+02 0.0033 21.1 3.6 27 63-89 12-38 (132)
238 PRK09841 cryptic autophosphory 22.1 5.6E+02 0.012 22.8 8.8 42 13-54 262-303 (726)
239 TIGR02912 sulfite_red_C sulfit 22.0 1E+02 0.0022 24.3 3.1 55 49-107 247-306 (314)
240 PF10211 Ax_dynein_light: Axon 22.0 3.3E+02 0.0072 20.1 6.0 22 23-44 132-153 (189)
241 COG3937 Uncharacterized conser 21.8 2.1E+02 0.0045 19.8 4.1 21 21-41 86-106 (108)
242 PF04508 Pox_A_type_inc: Viral 21.7 1.3E+02 0.0028 15.2 2.7 17 27-43 3-19 (23)
243 KOG4571|consensus 21.6 2E+02 0.0043 23.3 4.5 34 68-101 238-271 (294)
244 COG4575 ElaB Uncharacterized c 21.6 2.8E+02 0.006 19.0 6.0 30 26-55 9-38 (104)
245 PF01466 Skp1: Skp1 family, di 21.6 1.7E+02 0.0038 18.2 3.6 17 91-107 45-61 (78)
246 PRK04863 mukB cell division pr 21.6 6.6E+02 0.014 24.9 8.7 66 4-73 385-455 (1486)
247 PF11853 DUF3373: Protein of u 21.6 2.3E+02 0.005 24.5 5.2 33 14-47 14-46 (489)
248 TIGR00634 recN DNA repair prot 21.3 5.3E+02 0.011 22.1 9.2 67 17-86 265-331 (563)
249 PRK04778 septation ring format 21.2 4.2E+02 0.0092 22.8 6.8 93 3-101 356-450 (569)
250 PF15294 Leu_zip: Leucine zipp 21.0 1.7E+02 0.0037 23.5 4.0 25 21-45 128-152 (278)
251 PLN02320 seryl-tRNA synthetase 21.0 2.8E+02 0.0062 24.0 5.7 42 3-44 108-156 (502)
252 TIGR03185 DNA_S_dndD DNA sulfu 20.9 1.1E+02 0.0024 26.7 3.2 16 3-18 399-414 (650)
253 PRK14160 heat shock protein Gr 20.9 3.9E+02 0.0084 20.4 8.3 23 22-44 58-80 (211)
254 COG3078 Uncharacterized protei 20.7 65 0.0014 23.8 1.5 13 95-107 139-151 (169)
255 PF03961 DUF342: Protein of un 20.7 5E+02 0.011 21.6 7.7 24 21-44 330-353 (451)
256 PF04899 MbeD_MobD: MbeD/MobD 20.5 2.3E+02 0.0049 17.9 3.8 22 23-44 47-68 (70)
257 PF12795 MscS_porin: Mechanose 20.5 3.8E+02 0.0083 20.2 7.3 57 5-65 102-159 (240)
258 PF13600 DUF4140: N-terminal d 20.5 1.9E+02 0.0041 18.7 3.7 26 20-45 72-97 (104)
259 PF11855 DUF3375: Protein of u 20.4 2.2E+02 0.0048 24.1 4.9 37 19-55 138-175 (478)
260 PF04012 PspA_IM30: PspA/IM30 20.2 3.3E+02 0.0072 20.0 5.4 24 21-44 115-138 (221)
261 COG4879 Uncharacterized protei 20.1 1.6E+02 0.0034 22.8 3.6 33 69-101 5-38 (243)
262 COG1842 PspA Phage shock prote 20.1 3.3E+02 0.0071 20.9 5.4 41 4-44 94-139 (225)
No 1
>KOG4603|consensus
Probab=99.97 E-value=1.2e-29 Score=186.38 Aligned_cols=102 Identities=30% Similarity=0.512 Sum_probs=99.8
Q ss_pred chHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHH
Q psy8735 2 DDKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTE 81 (115)
Q Consensus 2 ~~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~d 81 (115)
+.++.+++|++.|.|.||+++|+..|..|..+|..+.+||..+++|.++|+|+|+. .+.+.|.+++++||+|||||++
T Consensus 100 ~t~s~veaEik~L~s~Lt~eemQe~i~~L~kev~~~~erl~~~k~g~~~vtpedk~--~v~~~y~~~~~~wrk~krmf~e 177 (201)
T KOG4603|consen 100 QTCSYVEAEIKELSSALTTEEMQEEIQELKKEVAGYRERLKNIKAGTNHVTPEDKE--QVYREYQKYCKEWRKRKRMFRE 177 (201)
T ss_pred HHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCHHHHH--HHHHHHHHHHHHHHHHHHHHHH
Confidence 46788999999999999999999999999999999999999999999999999999 9999999999999999999999
Q ss_pred HHHHHHhcCcCcHHHHHHHhcccc
Q psy8735 82 MLEAILENYPKSKKILLEEVGIET 105 (115)
Q Consensus 82 i~d~I~E~~p~~kk~L~EelGiEt 105 (115)
||++++||+|+++++|+|++|||+
T Consensus 178 i~d~~~e~~pk~ksel~eelGIE~ 201 (201)
T KOG4603|consen 178 IIDKLLEGLPKKKSELYEELGIEA 201 (201)
T ss_pred HHHHHHcCCcchHHHHHHHhCcCC
Confidence 999999999999999999999995
No 2
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=99.78 E-value=1.4e-18 Score=126.93 Aligned_cols=76 Identities=33% Similarity=0.515 Sum_probs=74.0
Q ss_pred hHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHH
Q psy8735 3 DKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCT 80 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~ 80 (115)
+++.|+++|++|.+.||+++|...|.+|+.+|..+++||.+|++++..|||++++ ++.+.|.+|+++|++|||||+
T Consensus 94 ~~k~l~~eL~~L~~~~t~~el~~~i~~l~~e~~~l~~kL~~l~~~~~~vs~ee~~--~~~~~~~~~~k~w~kRKri~k 169 (169)
T PF07106_consen 94 EVKSLEAELASLSSEPTNEELREEIEELEEEIEELEEKLEKLRSGSKPVSPEEKE--KLEKEYKKWRKEWKKRKRICK 169 (169)
T ss_pred HHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHH--HHHHHHHHHHHHHHHHHHHhC
Confidence 5789999999999999999999999999999999999999999999999999999 999999999999999999995
No 3
>PF07061 Swi5: Swi5; InterPro: IPR010760 This entry represents Swi5 and is involved in meiotic DNA repair synthesis and meiotic joint molecule formation []. It is known to interact with Swi2, Rhp51 and Swi6 [].
Probab=97.23 E-value=0.0086 Score=39.50 Aligned_cols=78 Identities=19% Similarity=0.354 Sum_probs=58.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhc
Q psy8735 23 AKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVG 102 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelG 102 (115)
+...+..|+.++.++...+..+.+.- +..-. .+.+.|-+...+|..=|-++.-+++.|++.=+-+-+++++++|
T Consensus 5 l~~~~~~L~~~~~~l~~~i~~~~~~l----~~~~~--~~v~~hI~lLheYNeiKD~gQ~Lig~iA~~rgvt~~~v~~e~g 78 (83)
T PF07061_consen 5 LEAEIQELKEQIEQLEKEISELEAEL----IEDPE--KIVKRHIKLLHEYNEIKDIGQGLIGLIADQRGVTVKDVYEEFG 78 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhc----ccCHH--HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHcCCcHHHHHHHcC
Confidence 34445555555555555555554321 23344 7888999999999999999999999999999999999999999
Q ss_pred cccc
Q psy8735 103 IETD 106 (115)
Q Consensus 103 iEtD 106 (115)
++.+
T Consensus 79 l~~~ 82 (83)
T PF07061_consen 79 LDMN 82 (83)
T ss_pred CCCC
Confidence 9875
No 4
>PF03962 Mnd1: Mnd1 family; InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=93.62 E-value=2.3 Score=31.77 Aligned_cols=75 Identities=23% Similarity=0.332 Sum_probs=40.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHH---HHHHHHhcCcCcHHHHHH
Q psy8735 23 AKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTE---MLEAILENYPKSKKILLE 99 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~d---i~d~I~E~~p~~kk~L~E 99 (115)
+...+.+|+.++..++..|..+... +|+... ++...-..++...- |+-.+ |..-|.--++.+.++|-.
T Consensus 108 ~l~~l~~l~~~~~~l~~el~~~~~~----Dp~~i~--~~~~~~~~~~~~an---rwTDNI~~l~~~~~~k~~~~~~~i~k 178 (188)
T PF03962_consen 108 LLEELEELKKELKELKKELEKYSEN----DPEKIE--KLKEEIKIAKEAAN---RWTDNIFSLKSYLKKKFGMDEEDIRK 178 (188)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc----CHHHHH--HHHHHHHHHHHHHH---HHHhhHHHHHHHHHHhcCCCHHHHHH
Confidence 4455566666777777777655432 444444 33333322222211 11112 223334557888999999
Q ss_pred Hhccccc
Q psy8735 100 EVGIETD 106 (115)
Q Consensus 100 elGiEtD 106 (115)
++||..|
T Consensus 179 ~f~Ip~d 185 (188)
T PF03962_consen 179 EFGIPED 185 (188)
T ss_pred HcCCccc
Confidence 9999754
No 5
>PF05812 Herpes_BLRF2: Herpesvirus BLRF2 protein; InterPro: IPR008642 This family consists of several herpes virus BLRF2 tegument proteins.; PDB: 2OA5_B 2H3R_D.
Probab=92.61 E-value=0.62 Score=32.72 Aligned_cols=49 Identities=20% Similarity=0.234 Sum_probs=36.7
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHhh-----hcCCCCChhhhcHHhHHHHHHH
Q psy8735 17 QMTEAEAKQKKAELQAELEEKKSKLEKLS-----RNVNVISEGEVDKKTITDTKEK 67 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e~~~l~~rL~~l~-----~~~~~Vs~ee~~~~~v~~~~~k 67 (115)
.+|-+||.+++.+|+-||..|..+|..-- .....+||.+++ -+...|..
T Consensus 2 ~~t~EeLaaeL~kLqmENk~LKkkl~~~~~p~~~p~~~~LTp~qKe--~~I~s~~~ 55 (118)
T PF05812_consen 2 DMTMEELAAELQKLQMENKALKKKLRQSVGPGPSPDDEVLTPAQKE--AMITSAVS 55 (118)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHHHTT---S-TT--B--HHHHH--HHHHHHHH
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHccCCCCCCCCccccChHHHH--HHHHHHHH
Confidence 36789999999999999999999998754 234788999999 77776643
No 6
>PHA03162 hypothetical protein; Provisional
Probab=92.47 E-value=0.94 Score=32.43 Aligned_cols=50 Identities=16% Similarity=0.166 Sum_probs=40.5
Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHhh-----hcCCCCChhhhcHHhHHHHHHH
Q psy8735 16 QQMTEAEAKQKKAELQAELEEKKSKLEKLS-----RNVNVISEGEVDKKTITDTKEK 67 (115)
Q Consensus 16 s~lt~~el~~~i~~L~~e~~~l~~rL~~l~-----~~~~~Vs~ee~~~~~v~~~~~k 67 (115)
..+|.+||.+++.+|+-||..|..+|..=- .+...+||.+++ .+...|..
T Consensus 11 ~~~tmEeLaaeL~kLqmENK~LKkkl~~~~~~~~~p~d~~LTp~qKe--a~I~s~v~ 65 (135)
T PHA03162 11 AQPTMEDLAAEIAKLQLENKALKKKIKEGTDDDPLPGDPILTPAAKE--AMIGAATA 65 (135)
T ss_pred cCCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCCccCCHHHHH--HHHHHHHH
Confidence 578999999999999999999999995432 134568999999 87777743
No 7
>PHA03155 hypothetical protein; Provisional
Probab=92.21 E-value=1 Score=31.47 Aligned_cols=50 Identities=16% Similarity=0.175 Sum_probs=40.7
Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHh-hhcCCCCChhhhcHHhHHHHHHH
Q psy8735 16 QQMTEAEAKQKKAELQAELEEKKSKLEKL-SRNVNVISEGEVDKKTITDTKEK 67 (115)
Q Consensus 16 s~lt~~el~~~i~~L~~e~~~l~~rL~~l-~~~~~~Vs~ee~~~~~v~~~~~k 67 (115)
..+|.+||.+++..|+-||+.|..+|..= ......+||.+++ -+...|..
T Consensus 6 ~~~tvEeLaaeL~kL~~ENK~LKkkl~~~~~p~d~~LT~~qKe--a~I~s~v~ 56 (115)
T PHA03155 6 ACADVEELEKELQKLKIENKALKKKLLQHGNPEDELLTPAQKD--AIINSLVN 56 (115)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHccCCCCccccCHHHHH--HHHHHHHH
Confidence 46899999999999999999999999552 2355789999998 77766643
No 8
>PF10552 ORF6C: ORF6C domain; InterPro: IPR018878 This entry represents the carboxy-terminal domain from ORF6 (Q9B012 from SWISSPROT), an antirepressor protein from Lactococcus phage bIL285 [].
Probab=89.01 E-value=5.6 Score=27.18 Aligned_cols=65 Identities=17% Similarity=0.341 Sum_probs=47.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHH---------HH---HHHHHHHHHHHHHhcCc
Q psy8735 24 KQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEY---------NK---RKRMCTEMLEAILENYP 91 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w---------~k---RKRi~~di~d~I~E~~p 91 (115)
-..+.++..++..++.++..|.. ..++++.+.. .+.+.-......+ +. |++.|.+||..+-..++
T Consensus 7 ~~~~~~~~~ki~~ve~~V~~l~~-~~~i~~~q~~--~i~~~v~~rv~~~lgg~~s~ay~~~~~~~k~f~~i~~~lk~~F~ 83 (116)
T PF10552_consen 7 MQATEEHNEKIEEVENRVDDLEE-NMPIDPGQQK--EIQKAVKSRVYELLGGKGSPAYKDKSFRRKLFSDIYRDLKRHFG 83 (116)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh-cCCCCHHHHH--HHHHHHHHHHHHHHhccccchhhhhHHhHHHHHHHHHHHHHHhC
Confidence 45567788888888989988884 4678888877 7776655444444 44 99999999988855444
No 9
>PF05008 V-SNARE: Vesicle transport v-SNARE protein N-terminus; InterPro: IPR007705 V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this []. This domain is the N-terminal half of the V-Snare proteins. ; GO: 0006886 intracellular protein transport, 0016020 membrane; PDB: 2V8S_V 1VCS_A 3ONL_C 3ONJ_A 2QYW_A.
Probab=87.25 E-value=5.4 Score=24.98 Aligned_cols=73 Identities=19% Similarity=0.277 Sum_probs=58.6
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHH
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCT 80 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~ 80 (115)
..+.+.+..+.+... ++-...|..++..+.+..+-|..|.--...+++.++. .+......|+..+.+=++=++
T Consensus 6 ~~i~~~l~~~~~~~~-~~r~~~i~~~e~~l~ea~~~l~qMe~E~~~~p~s~r~--~~~~kl~~yr~~l~~lk~~l~ 78 (79)
T PF05008_consen 6 AEIKSKLERIKNLSG-EQRKSLIREIERDLDEAEELLKQMELEVRSLPPSERN--QYKSKLRSYRSELKKLKKELK 78 (79)
T ss_dssp HHHHHHHHHGGGS-C-HHHHHHHHHHHHHHHHHHHHHHHHHHHHCTS-HHHHH--HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhhccCh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHH--HHHHHHHHHHHHHHHHHHHhc
Confidence 456667777776555 8999999999999999999999998666677889999 999999999999888766543
No 10
>PF01486 K-box: K-box region; InterPro: IPR002487 MADS genes in plants encode key developmental regulators of vegetative and reproductive development. The majority of the plant MADS proteins share a stereotypical MIKC structure. It comprises (from N- to C-terminal) an N-terminal domain, which is, however, present only in a minority of proteins; a MADS domain (see PDOC00302 from PROSITEDOC, IPR002100 from INTERPRO), which is the major determinant of DNA-binding but which also performs dimerisation and accessory factor binding functions; a weakly conserved intervening (I) domain, which constitutes a key molecular determinant for the selective formation of DNA-binding dimers; a keratin-like (K-box) domain, which promotes protein dimerisation; and a C-terminal (C) domain, which is involved in transcriptional activation or in the formation of ternary or quaternary protein complexes. The 80-amino acid K-box domain was originally identified as a region with low but significant similarity to a region of keratin, which is part of the coiled-coil sequence constituting the central rod-shaped domain of keratin [, , ]. The K-box protein-protein interaction domain which mediates heterodimerization of MIKC-type MADS proteins contains several heptad repeats in which the first and the fourth positions are occupied by hydrophobic amino acids suggesting that the K-box domain forms three amphipathic alpha-helices referred to as K1, K2, and K3 [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=86.55 E-value=3.7 Score=27.19 Aligned_cols=56 Identities=25% Similarity=0.327 Sum_probs=48.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh-cCCCCChhhhcHHhHHHHHHHHHHHHHHHHH
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLSR-NVNVISEGEVDKKTITDTKEKMVKEYNKRKR 77 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~~-~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKR 77 (115)
...+...+..|..++..++..+..+.+ .-..+|..|.. .++.....+...-|.||.
T Consensus 14 ~e~~~~e~~~L~~~~~~L~~~~R~~~GedL~~Ls~~eL~--~LE~~Le~aL~~VR~rK~ 70 (100)
T PF01486_consen 14 HEELQQEIAKLRKENESLQKELRHLMGEDLESLSLKELQ--QLEQQLESALKRVRSRKD 70 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccchHHHH--HHHHhhhhhHHHHHHHHH
Confidence 456778888999999999988888886 44789999999 999999999999888885
No 11
>PF14282 FlxA: FlxA-like protein
Probab=86.34 E-value=4.4 Score=27.48 Aligned_cols=43 Identities=21% Similarity=0.433 Sum_probs=34.7
Q ss_pred hHHHHHHHHHHHHc--cCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 3 DKEKVKSKLKQLSQ--QMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 3 ~~~~l~seL~~L~s--~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.|..|..+|..|.+ .++.++-..++..|..+|..|+..|..+.
T Consensus 27 Qi~~Lq~ql~~l~~~~~~~~e~k~~q~q~Lq~QI~~LqaQI~qlq 71 (106)
T PF14282_consen 27 QIKQLQEQLQELSQDSDLDAEQKQQQIQLLQAQIQQLQAQIAQLQ 71 (106)
T ss_pred HHHHHHHHHHHHHcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 47778888888887 36888888888888888888888888774
No 12
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=85.40 E-value=2.1 Score=26.49 Aligned_cols=30 Identities=20% Similarity=0.257 Sum_probs=26.8
Q ss_pred HccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 15 SQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 15 ~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
-+.+|.+||...|+.|+.||..++..+..=
T Consensus 18 Ls~lSv~EL~~RIa~L~aEI~R~~~~~~~K 47 (59)
T PF06698_consen 18 LSLLSVEELEERIALLEAEIARLEAAIAKK 47 (59)
T ss_pred chhcCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467899999999999999999999988764
No 13
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=81.67 E-value=8.3 Score=23.58 Aligned_cols=39 Identities=13% Similarity=0.387 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
+.++++++..+.+++ .-++.++.++...+..+++.+..|
T Consensus 2 i~elEn~~~~~~~~i--~tvk~en~~i~~~ve~i~envk~l 40 (55)
T PF05377_consen 2 IDELENELPRIESSI--NTVKKENEEISESVEKIEENVKDL 40 (55)
T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 567788888887773 334555555555555555555443
No 14
>PF05837 CENP-H: Centromere protein H (CENP-H); InterPro: IPR008426 Chromosome segregation in eukaryotes requires the kinetochore, a multi-protein structure that assembles on centromeric DNA, and which acts to link chromosomes to spindle microtubules. Kinetochore structure and composition is highly conserved among vertebrates. The inner kinetochore is essential for kinetochore assembly, and is involved in chromosome segregation via regulation of the spindle. Inner kinetochore components include the multi-subunit CENP-H/I complex, which may function, in part, in directing centromere protein A (CENP-A) deposition to centromeres, where CENP-A is a centromere-specific histone H3 variant required for the organisation of centromeric chromatin during interphase. The CENP-H/I complex contains three functional classes of proteins [, ]: CENP-H class (includes CENP-H, -I, -K, -L) CENP-M class (includes CENP-M) CENP-O class (includes CENP-O, -P, -Q, -R, -50) CENP-H is required for the localisation of CENP-C, but not CENP-A, to the centromere. However, it may be involved in the incorporation of newly synthesised CENP-A into centromeres via its interaction with the CENP-A/CENP-HI complex. CENP-H contains a coiled-coil structure and a nuclear localisation signal. CENP-H is specifically and constitutively localised in kinetochores throughout the cell cycle, and may play a role in kinetochore organisation and function throughout the cell cycle []. Studies show that CENP-H may be associated with certain human cancers [, ]. This entry also includes Kinetochore protein Fta3 which is a subunit of the Sim4 complex. This complex is required for loading the DASH complex onto the kinetochore via interaction with dad1. Fta2, Fta3 and Fta4 associate with the central core and inner repeat region of the centromere [].; GO: 0043515 kinetochore binding, 0007059 chromosome segregation, 0051301 cell division, 0000777 condensed chromosome kinetochore, 0005634 nucleus
Probab=80.22 E-value=15 Score=24.76 Aligned_cols=76 Identities=11% Similarity=0.227 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC-----CCChhhhcHHhHHHHHHHHHHHHHHHHHHH
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVN-----VISEGEVDKKTITDTKEKMVKEYNKRKRMC 79 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~-----~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~ 79 (115)
-.+.++...+...+ .+++...-.+...+.++-..+..|..... .-....+. .+...+...+..|+-=|.+|
T Consensus 6 ~~~~~~~~~l~~~L--~~v~~~~l~l~~~n~el~~el~~l~~~~~~~~~~~~~~~~l~--~~~~~lk~~r~~~~v~k~v~ 81 (106)
T PF05837_consen 6 LNLQQESRSLQEKL--SDVEKKRLRLKRRNQELAQELLELAEKQKSQREDEELSEKLE--KLEKELKKSRQRWRVMKNVF 81 (106)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccchHHHHHHH--HHHHHHHHHHHHHHHHHHHH
Confidence 34444555555544 33445555566666666666666643221 22335677 78888888888888888888
Q ss_pred HHHHH
Q psy8735 80 TEMLE 84 (115)
Q Consensus 80 ~di~d 84 (115)
..|+-
T Consensus 82 q~lI~ 86 (106)
T PF05837_consen 82 QALIV 86 (106)
T ss_pred HHHHH
Confidence 87763
No 15
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=78.49 E-value=13 Score=29.80 Aligned_cols=20 Identities=10% Similarity=0.199 Sum_probs=12.2
Q ss_pred cCCCCChhhhcHHhHHHHHHHH
Q psy8735 47 NVNVISEGEVDKKTITDTKEKM 68 (115)
Q Consensus 47 ~~~~Vs~ee~~~~~v~~~~~k~ 68 (115)
.....|..|+. .+...|...
T Consensus 269 ~~r~~t~~Ev~--~Lk~~~~~L 288 (325)
T PF08317_consen 269 ECRGWTRSEVK--RLKAKVDAL 288 (325)
T ss_pred HhcCCCHHHHH--HHHHHHHHH
Confidence 34566777777 666666444
No 16
>COG5200 LUC7 U1 snRNP component, mediates U1 snRNP association with cap-binding complex [RNA processing and modification]
Probab=76.03 E-value=23 Score=27.64 Aligned_cols=83 Identities=14% Similarity=0.157 Sum_probs=64.0
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHhh-hcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHH
Q psy8735 17 QMTEAEAKQKKAELQAELEEKKSKLEKLS-RNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKK 95 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e~~~l~~rL~~l~-~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk 95 (115)
--++.+....|.+++..+.-+...+..+- ++..++-....+ .....|.+....--+||++...++... |+.+.+..
T Consensus 106 ~~~t~ee~~kI~e~~e~~~~~d~sIg~lg~e~dalakrk~V~--~a~~~f~el~rl~~~rkei~~~v~sm~-en~gq~th 182 (258)
T COG5200 106 DRTTPEEFGKIKEKEELFNRVDESIGELGMEGDALAKRKLVE--RACSAFNELERLREERKEIKEAVYSMV-ENNGQGTH 182 (258)
T ss_pred ccCCHHHHHhHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh-hCcchhhh
Confidence 35677888899999999999999988884 566777778888 899999999999999999999988644 55555433
Q ss_pred HHHHHhc
Q psy8735 96 ILLEEVG 102 (115)
Q Consensus 96 ~L~EelG 102 (115)
+=++-.|
T Consensus 183 qklqvC~ 189 (258)
T COG5200 183 QKLQVCG 189 (258)
T ss_pred hhhhhhh
Confidence 3333333
No 17
>PF12808 Mto2_bdg: Micro-tubular organiser Mto1 C-term Mto2-binding region; InterPro: IPR024545 This domain occurs at the C terminus of microtubule organising proteins in both budding and fission fungi. In Schizosaccharomyces pombe it has been shown to interact with the Mto2p protein, an interaction which is critical for anchoring the cytokinetic actin ring to the medial region of the cell and for proper coordination of mitosis with cytokinesis [, ].
Probab=74.45 E-value=16 Score=22.00 Aligned_cols=44 Identities=23% Similarity=0.373 Sum_probs=34.0
Q ss_pred hHHHHHHHHHHHH--ccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 3 DKEKVKSKLKQLS--QQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 3 ~~~~l~seL~~L~--s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
-+.+|+..+++=. -.+....+...|.+|+.++..|...|..+++
T Consensus 5 Rl~ELe~klkaerE~R~~d~~~a~~rl~~l~~EN~~Lr~eL~~~r~ 50 (52)
T PF12808_consen 5 RLEELERKLKAEREARSLDRSAARKRLSKLEGENRLLRAELERLRS 50 (52)
T ss_pred HHHHHHHHHHHhHHhccCCchhHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3455565555544 5567788999999999999999999998764
No 18
>PF10376 Mei5: Double-strand recombination repair protein ; InterPro: IPR018468 Mei5 is one of a pair of meiosis-specific proteins which facilitate the loading of Dmc1 on to Rad51 on DNA at double-strand breaks during recombination. Recombination is carried out by a large protein complex based around the two RecA homologues, Rad51 and Dmc1 []. This complex may play both a catalytic and a structural role in the interaction between homologous chromosomes during meiosis. Mei5 is seen to contain a coiled-coli region.
Probab=74.43 E-value=33 Score=26.35 Aligned_cols=83 Identities=18% Similarity=0.341 Sum_probs=50.7
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHh--cCcCcH
Q psy8735 18 MTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRK-RMCTEMLEAILE--NYPKSK 94 (115)
Q Consensus 18 lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRK-Ri~~di~d~I~E--~~p~~k 94 (115)
+...+|..+-.+|+.++...++.|..|......=+..+.. .+......|+.-. ..+.++...+.+ |=..+-
T Consensus 131 ~~~~el~~ek~kL~~q~~e~~e~lr~L~~~k~~r~Kn~~~------~Lq~lI~Kwr~~~q~~l~eL~~~~~~~e~~~~TM 204 (221)
T PF10376_consen 131 LKQQELEEEKRKLEKQVDEKEEELRRLKLVKQYRSKNDLE------QLQSLIKKWRSASQEALYELQSEMSEEEGEKFTM 204 (221)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHhhccHH------HHHHHHHHHHHHHHHHHHHHHHHHhhccccCccH
Confidence 3456777788888889999888888886422222333222 2223333443332 234455555555 333467
Q ss_pred HHHHHHhccccc
Q psy8735 95 KILLEEVGIETD 106 (115)
Q Consensus 95 k~L~EelGiEtD 106 (115)
.+|+..+||+.+
T Consensus 205 ~eL~~~l~ID~~ 216 (221)
T PF10376_consen 205 GELIKRLGIDYD 216 (221)
T ss_pred HHHHHHhCCCcc
Confidence 999999999876
No 19
>KOG3433|consensus
Probab=73.84 E-value=37 Score=25.83 Aligned_cols=49 Identities=10% Similarity=0.181 Sum_probs=35.2
Q ss_pred hhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhccccc
Q psy8735 53 EGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVGIETD 106 (115)
Q Consensus 53 ~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiEtD 106 (115)
-+.+. ...+.|....+.|..-= --+++-|.-.+|..+.++-..+||..|
T Consensus 150 ~~k~~--~~~K~~~eaanrwtDnI---~il~dy~~rkf~~e~nqi~~~fgIPed 198 (203)
T KOG3433|consen 150 FEKKV--HLEKTMAEAANRWTDNI---FILIDYLYRKFGLEPNQIRKEFGIPED 198 (203)
T ss_pred HHHHH--HHHHHHHHHHhhhhhhH---HHHHHHHHHhcCCCHHHHhHhcCCCcc
Confidence 34455 66777777777775432 235566678889999999999999876
No 20
>PRK11546 zraP zinc resistance protein; Provisional
Probab=73.79 E-value=9.3 Score=27.65 Aligned_cols=41 Identities=15% Similarity=0.283 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHH
Q psy8735 26 KKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMV 69 (115)
Q Consensus 26 ~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~ 69 (115)
+...|+.++-.....|..|-.+ ...+++-+. ++.++-...+
T Consensus 62 ~t~~LRqqL~aKr~ELnALl~~-~~pD~~kI~--aL~kEI~~Lr 102 (143)
T PRK11546 62 QTSALRQQLVSKRYEYNALLTA-NPPDSSKIN--AVAKEMENLR 102 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHcC-CCCCHHHHH--HHHHHHHHHH
Confidence 3444444444444444444322 233333344 4444444433
No 21
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=73.25 E-value=20 Score=23.75 Aligned_cols=45 Identities=11% Similarity=0.210 Sum_probs=33.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh--cCCCCChhhhcHHhHHHHHHHHHH
Q psy8735 24 KQKKAELQAELEEKKSKLEKLSR--NVNVISEGEVDKKTITDTKEKMVK 70 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~~--~~~~Vs~ee~~~~~v~~~~~k~~~ 70 (115)
..+|..++..+.....+|+.+-. -+..+||+.+. .++++.....+
T Consensus 4 ~~eId~lEekl~~cr~~le~ve~rL~~~eLs~e~R~--~lE~E~~~l~~ 50 (85)
T PF15188_consen 4 AKEIDGLEEKLAQCRRRLEAVESRLRRRELSPEARR--SLEKELNELKE 50 (85)
T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHcccCCChHHHH--HHHHHHHHHHH
Confidence 46778888888888888887743 56888999888 88877654443
No 22
>KOG4302|consensus
Probab=72.71 E-value=21 Score=31.78 Aligned_cols=65 Identities=22% Similarity=0.329 Sum_probs=45.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh--cC-CCC---------Ch-hhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLSR--NV-NVI---------SE-GEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAI 86 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~~--~~-~~V---------s~-ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I 86 (115)
.+++....+.|..++...++.|..|.+ +. ..+ |- +... .|......++.++..|+.=|.+|.++|
T Consensus 56 ve~a~~~~~~L~~~ia~~eael~~l~s~l~~~~~~~~~~~k~e~tLke~l~--~l~~~le~lr~qk~eR~~ef~el~~qi 133 (660)
T KOG4302|consen 56 VEEASESKARLLQEIAVIEAELNDLCSALGEPSIIGEISDKIEGTLKEQLE--SLKPYLEGLRKQKDERRAEFKELYHQI 133 (660)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccccccccccCccHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355555666667777777777776642 11 111 11 4567 788888999999999999999999887
No 23
>PF02344 Myc-LZ: Myc leucine zipper domain; InterPro: IPR003327 This family consists of the leucine zipper dimerisation domain found in both cellular c-Myc proto-oncogenes and viral v-Myc oncogenes. Dimerisation via the leucine zipper motif with other basic helix-loop-helix-leucine zipper (b/HLH/lz) proteins is required for efficient DNA binding []. The Myc-Max dimer is a transactivating complex activating expression of growth related genes promoting cell proliferation. The dimerisation is facilitated via interdigitating leucine residues every 7th position of the alpha helix. Like charge repulsion of adjacent residues in this region preturbs the formation of homodimers with heterodimers being promoted by opposing charge attractions. It has been demonstrated that in transgenic mice the balance between oncogene-induced proliferation and apoptosis in a given tissue can be a critical determinant in the initiation and maintenance of the tumor [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1NKP_D 1A93_A 2A93_A.
Probab=72.55 E-value=10 Score=20.65 Aligned_cols=25 Identities=32% Similarity=0.396 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhc
Q psy8735 23 AKQKKAELQAELEEKKSKLEKLSRN 47 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l~~~ 47 (115)
+.....+|+.....|..+|+.|+++
T Consensus 6 L~sekeqLrrr~eqLK~kLeqlrnS 30 (32)
T PF02344_consen 6 LISEKEQLRRRREQLKHKLEQLRNS 30 (32)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH---
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 5667889999999999999999864
No 24
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=72.50 E-value=33 Score=24.61 Aligned_cols=59 Identities=22% Similarity=0.338 Sum_probs=35.2
Q ss_pred HHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHH
Q psy8735 10 KLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKR 75 (115)
Q Consensus 10 eL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kR 75 (115)
++..|... ..+++.++.+|+.++..+++.|..|.+ ..|.+++. ........-...-..|
T Consensus 73 el~~ld~e--i~~L~~el~~l~~~~k~l~~eL~~L~~---~~t~~el~--~~i~~l~~e~~~l~~k 131 (169)
T PF07106_consen 73 ELAELDAE--IKELREELAELKKEVKSLEAELASLSS---EPTNEELR--EEIEELEEEIEELEEK 131 (169)
T ss_pred hHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhc---CCCHHHHH--HHHHHHHHHHHHHHHH
Confidence 34445444 566677777777777777777777763 45556665 4444444444444444
No 25
>PRK10132 hypothetical protein; Provisional
Probab=70.25 E-value=32 Score=23.57 Aligned_cols=30 Identities=10% Similarity=0.247 Sum_probs=24.2
Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 16 QQMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 16 s~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+.+++.++..++++|..++..+-+-++.|-
T Consensus 3 ~~~~~~~~~~q~e~L~~Dl~~L~~~le~ll 32 (108)
T PRK10132 3 NRPNRNDVDDGVQDIQNDVNQLADSLESVL 32 (108)
T ss_pred CCCccchhhhHHHHHHHHHHHHHHHHHHHH
Confidence 456677888888999999988888888774
No 26
>COG5124 Protein predicted to be involved in meiotic recombination [Cell division and chromosome partitioning / General function prediction only]
Probab=70.04 E-value=46 Score=25.30 Aligned_cols=26 Identities=15% Similarity=0.319 Sum_probs=22.2
Q ss_pred HHHHHHhcCcCcHHHHHHHhcccccc
Q psy8735 82 MLEAILENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 82 i~d~I~E~~p~~kk~L~EelGiEtDe 107 (115)
++|-++-.+|..+.++-+++||..|=
T Consensus 178 lidy~c~kf~~~~~qir~~fgIPedl 203 (209)
T COG5124 178 LIDYLCKKFFLKPEQIRKEFGIPEDL 203 (209)
T ss_pred HHHHHHHHcCCCHHHHHHhcCCCcch
Confidence 55667889999999999999998774
No 27
>PF12958 DUF3847: Protein of unknown function (DUF3847); InterPro: IPR024215 This entry represents a family of uncharacterised proteins that were found by clustering human gut metagenomic sequences [].
Probab=69.41 E-value=10 Score=25.14 Aligned_cols=28 Identities=21% Similarity=0.336 Sum_probs=23.2
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
|.+++.+++...+.++...+.++..|.+
T Consensus 2 ~Le~l~~e~e~~~~kl~q~e~~~k~L~n 29 (86)
T PF12958_consen 2 TLEELQAEIEKAEKKLEQAEHKIKQLEN 29 (86)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5678888888888888888888888865
No 28
>COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown]
Probab=69.10 E-value=9.9 Score=23.81 Aligned_cols=30 Identities=37% Similarity=0.388 Sum_probs=26.2
Q ss_pred HccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 15 SQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 15 ~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
-+-++..|+.+.|+-|+.+|..++..+.+=
T Consensus 22 LsllsV~El~eRIalLq~EIeRlkAe~~kK 51 (65)
T COG5509 22 LSLLSVAELEERIALLQAEIERLKAELAKK 51 (65)
T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 466789999999999999999999988763
No 29
>PRK02119 hypothetical protein; Provisional
Probab=68.51 E-value=21 Score=22.67 Aligned_cols=44 Identities=16% Similarity=0.265 Sum_probs=26.2
Q ss_pred CchHHHHHHHHHHHHccC-----CHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 1 MDDKEKVKSKLKQLSQQM-----TEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 1 ~~~~~~l~seL~~L~s~l-----t~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
|.+++.++..+..|...+ |.++|...|.+...++..+..+|..|
T Consensus 1 ~~~~~~~e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~~id~L~~ql~~L 49 (73)
T PRK02119 1 MQIQQNLENRIAELEMKIAFQENLLEELNQALIEQQFVIDKMQVQLRYM 49 (73)
T ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455666666666666554 34556666666666666666555555
No 30
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=66.40 E-value=23 Score=21.63 Aligned_cols=25 Identities=40% Similarity=0.522 Sum_probs=15.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.++..++++++.++..++.+++.|+
T Consensus 27 ~~l~~~i~~l~~e~~~L~~ei~~l~ 51 (80)
T PF04977_consen 27 AELQKEIEELKKENEELKEEIERLK 51 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3455556666666666666666663
No 31
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=66.20 E-value=14 Score=31.58 Aligned_cols=42 Identities=5% Similarity=0.272 Sum_probs=30.4
Q ss_pred HHHHHHHHHHHHccCC-----HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 4 KEKVKSKLKQLSQQMT-----EAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt-----~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.++|+.+|..|++.+- ..++...|..|+.++..|+.+++.+.
T Consensus 78 asELEKqLaaLrqElq~~saq~~dle~KIkeLEaE~~~Lk~Ql~a~~ 124 (475)
T PRK13729 78 AAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALG 124 (475)
T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhh
Confidence 4567777777765432 45678888888889999988887654
No 32
>PF10805 DUF2730: Protein of unknown function (DUF2730); InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=66.19 E-value=13 Score=25.17 Aligned_cols=41 Identities=27% Similarity=0.415 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+.+++++..|=+.-...+|+-.+++++-+++.+..+|.++.
T Consensus 52 ~~lE~~l~~LPt~~dv~~L~l~l~el~G~~~~l~~~l~~v~ 92 (106)
T PF10805_consen 52 QALETKLEHLPTRDDVHDLQLELAELRGELKELSARLQGVS 92 (106)
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence 33444444333333344455666667777777777777664
No 33
>PF06810 Phage_GP20: Phage minor structural protein GP20; InterPro: IPR009636 This family consists of several phage minor structural protein Gp20 sequences and prophage sequences of around 180 residues in length. The function of this family is unknown.; GO: 0005198 structural molecule activity
Probab=65.07 E-value=34 Score=24.69 Aligned_cols=23 Identities=22% Similarity=0.566 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 24 KQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
..++..++.++......|..|+.
T Consensus 26 ~~e~~~~k~ql~~~d~~i~~Lk~ 48 (155)
T PF06810_consen 26 KEERDNLKTQLKEADKQIKDLKK 48 (155)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 55666666777777777777765
No 34
>PRK11020 hypothetical protein; Provisional
Probab=64.39 E-value=21 Score=24.99 Aligned_cols=44 Identities=14% Similarity=0.378 Sum_probs=31.2
Q ss_pred hHHHHHHHHHHHHccCCHHHHH---HHHHHHHHHHHHHHHHHHHhhh
Q psy8735 3 DKEKVKSKLKQLSQQMTEAEAK---QKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt~~el~---~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
||+.|...|..++..+.....+ ..|.++..|+..+..++..|++
T Consensus 6 Eiq~L~drLD~~~~Klaaa~~rgd~~~i~qf~~E~~~l~k~I~~lk~ 52 (118)
T PRK11020 6 EIKRLSDRLDAIRHKLAAASLRGDAEKYAQFEKEKATLEAEIARLKE 52 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6777777777777766655543 4567777888888888877763
No 35
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=63.93 E-value=56 Score=25.94 Aligned_cols=83 Identities=25% Similarity=0.362 Sum_probs=47.0
Q ss_pred HHHHHHHHHHHccCCH-----HHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHH
Q psy8735 5 EKVKSKLKQLSQQMTE-----AEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMC 79 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~-----~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~ 79 (115)
+.+..++..+...+.+ ..|..+|..-+.+....+.||..|++- .+-=-+|-+ +++.+..+.-..|..|=|-
T Consensus 172 ~~~~~~~~~~~~~l~~l~~de~~Le~KIekkk~ELER~qKRL~sLq~v-RPAfmdEyE--klE~EL~~lY~~Y~~kfRN- 247 (267)
T PF10234_consen 172 KAVQQQLQQTQQQLNNLASDEANLEAKIEKKKQELERNQKRLQSLQSV-RPAFMDEYE--KLEEELQKLYEIYVEKFRN- 247 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-ChHHHHHHH--HHHHHHHHHHHHHHHHHHh-
Confidence 3344444444444333 456677888888888888888888752 222225666 6666666666666555442
Q ss_pred HHHHHHHHhcCc
Q psy8735 80 TEMLEAILENYP 91 (115)
Q Consensus 80 ~di~d~I~E~~p 91 (115)
-+-+..-++.+.
T Consensus 248 l~yLe~qle~~~ 259 (267)
T PF10234_consen 248 LDYLEHQLEEYN 259 (267)
T ss_pred HHHHHHHHHHHH
Confidence 244444444443
No 36
>PRK13182 racA polar chromosome segregation protein; Reviewed
Probab=63.57 E-value=29 Score=25.65 Aligned_cols=42 Identities=17% Similarity=0.373 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHccCCHHHH---HHHHHHHHHHHHHHHHHHHHhh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEA---KQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el---~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
|.+|+..|+.+.--.-.=+| +.+|+++-+.+..|+.++..+.
T Consensus 101 i~eLe~~l~~kad~vvsYqll~hr~e~ee~~~~l~~le~~~~~~e 145 (175)
T PRK13182 101 LDELERQLQQKADDVVSYQLLQHRREMEEMLERLQKLEARLKKLE 145 (175)
T ss_pred HHHHHHHHHHHHhhhhhHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444333333333 3456666666666666666654
No 37
>PF09744 Jnk-SapK_ap_N: JNK_SAPK-associated protein-1; InterPro: IPR019143 This entry represents the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have a RhoGEF domain (IPR000219 from INTERPRO) at their C-terminal end.
Probab=63.06 E-value=57 Score=23.81 Aligned_cols=62 Identities=16% Similarity=0.203 Sum_probs=35.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEA 85 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~ 85 (115)
.++..+|.+|+.++..|..++..+..-...+.+. -. .+...|+..+...+.==+-.++.++.
T Consensus 92 k~L~~~v~~Le~e~r~L~~~~~~~~~q~~rlee~-e~--~l~~e~~~l~er~~e~l~~~~e~ver 153 (158)
T PF09744_consen 92 KDLQSQVEQLEEENRQLELKLKNLSDQSSRLEER-EA--ELKKEYNRLHERERELLRKLKEHVER 153 (158)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhccccchh-HH--HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567777777777777777776654432223222 22 34556666666555544445555554
No 38
>smart00338 BRLZ basic region leucin zipper.
Probab=61.90 E-value=33 Score=20.70 Aligned_cols=26 Identities=35% Similarity=0.432 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
..+|...+..|+.++..|..++..|.
T Consensus 28 ~~~Le~~~~~L~~en~~L~~~~~~l~ 53 (65)
T smart00338 28 IEELERKVEQLEAENERLKKEIERLR 53 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555555555555555555553
No 39
>KOG4001|consensus
Probab=61.69 E-value=60 Score=25.26 Aligned_cols=72 Identities=18% Similarity=0.210 Sum_probs=51.7
Q ss_pred HHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh---cCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 13 QLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR---NVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAI 86 (115)
Q Consensus 13 ~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~---~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I 86 (115)
.|...---..+...+..|+.+...|+.++..++. .+..++.++++ -.++.|+.-..--++-.|+.+.-+.-|
T Consensus 180 ALqae~ek~~~~~~~k~le~~k~~Le~~ia~~k~K~e~~e~r~~E~r~--ieEkk~~eei~fLk~tN~qLKaQLegI 254 (259)
T KOG4001|consen 180 ALQAENEKTRATTEWKVLEDKKKELELKIAQLKKKLETDEIRSEEERE--IEEKKMKEEIEFLKETNRQLKAQLEGI 254 (259)
T ss_pred HHHHhhhhhHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 3444444567778888888888888888888853 56778888888 888888777666666666665554433
No 40
>PF15290 Syntaphilin: Golgi-localised syntaxin-1-binding clamp
Probab=60.52 E-value=18 Score=29.22 Aligned_cols=29 Identities=24% Similarity=0.369 Sum_probs=19.3
Q ss_pred HccCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 15 SQQMTEAEAKQKKAELQAELEEKKSKLEK 43 (115)
Q Consensus 15 ~s~lt~~el~~~i~~L~~e~~~l~~rL~~ 43 (115)
.++|...|++.+|.+|++-|..+.+.|..
T Consensus 114 EAQLALKEARkEIkQLkQvieTmrssL~e 142 (305)
T PF15290_consen 114 EAQLALKEARKEIKQLKQVIETMRSSLAE 142 (305)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhch
Confidence 35566667777777777777777666643
No 41
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=59.63 E-value=55 Score=24.07 Aligned_cols=22 Identities=14% Similarity=0.428 Sum_probs=18.1
Q ss_pred hhhcHHhHHHHHHHHHHHHHHHHH
Q psy8735 54 GEVDKKTITDTKEKMVKEYNKRKR 77 (115)
Q Consensus 54 ee~~~~~v~~~~~k~~~~w~kRKR 77 (115)
+... ++..++...+..|-.||-
T Consensus 165 ~k~~--~l~~En~~Lv~Rwm~~k~ 186 (194)
T PF08614_consen 165 EKLR--KLEEENRELVERWMQRKA 186 (194)
T ss_dssp HHHH--HHHHHHHHHHHHHHHHHH
T ss_pred HHHH--HHHHHHHHHHHHHHHHHH
Confidence 3455 899999999999998874
No 42
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=58.88 E-value=18 Score=27.65 Aligned_cols=49 Identities=10% Similarity=0.254 Sum_probs=30.5
Q ss_pred hHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCh
Q psy8735 3 DKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISE 53 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ 53 (115)
++..+..++..|... +..+...+..++.++..++.+++.+......+.|
T Consensus 57 e~~~l~~e~e~L~~~--~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p 105 (251)
T PF11932_consen 57 EYRQLEREIENLEVY--NEQLERQVASQEQELASLEQQIEQIEETRQELVP 105 (251)
T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455566666666554 5666677777777777777777776544444444
No 43
>KOG4603|consensus
Probab=57.43 E-value=80 Score=23.88 Aligned_cols=66 Identities=18% Similarity=0.213 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHH
Q psy8735 24 KQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKIL 97 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L 97 (115)
.+++..|...|.-+++.|..|.+ .+|-++++ +-..+..+-++.+++|=.-|+.-. ....|..+++.
T Consensus 92 ~ek~q~l~~t~s~veaEik~L~s---~Lt~eemQ--e~i~~L~kev~~~~erl~~~k~g~---~~vtpedk~~v 157 (201)
T KOG4603|consen 92 TEKVQSLQQTCSYVEAEIKELSS---ALTTEEMQ--EEIQELKKEVAGYRERLKNIKAGT---NHVTPEDKEQV 157 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH---hcChHHHH--HHHHHHHHHHHHHHHHHHHHHHhc---ccCCHHHHHHH
Confidence 44455555666666666666653 56778888 777888888888888865555443 34455544433
No 44
>KOG4302|consensus
Probab=57.03 E-value=1.5e+02 Score=26.66 Aligned_cols=78 Identities=23% Similarity=0.292 Sum_probs=55.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh--cCC------------CCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLSR--NVN------------VISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEA 85 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~~--~~~------------~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~ 85 (115)
.++|+.+-.+-..+...+..+++.|.. |+. .+|-+.++ .+......+++++..|.+-+.++..+
T Consensus 112 le~lr~qk~eR~~ef~el~~qie~l~~~l~g~~~~~~~~~~D~~dlsl~kLe--elr~~L~~L~~ek~~Rlekv~~~~~~ 189 (660)
T KOG4302|consen 112 LEGLRKQKDERRAEFKELYHQIEKLCEELGGPEDLPSFLIADESDLSLEKLE--ELREHLNELQKEKSDRLEKVLELKEE 189 (660)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCCcccccCcccccHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355666666666677777777777753 222 23335566 77788889999999999988888877
Q ss_pred HHhcCcCcHHHHHHHhcccccc
Q psy8735 86 ILENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 86 I~E~~p~~kk~L~EelGiEtDe 107 (115)
| +.|++.||++...
T Consensus 190 I--------~~l~~~Lg~~~~~ 203 (660)
T KOG4302|consen 190 I--------KSLCSVLGLDFSM 203 (660)
T ss_pred H--------HHHHHHhCCCccc
Confidence 6 4677888887763
No 45
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=56.76 E-value=34 Score=20.04 Aligned_cols=27 Identities=30% Similarity=0.430 Sum_probs=20.8
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
-..+|...+..|..++..|..++..|.
T Consensus 26 ~~~~le~~~~~L~~en~~L~~~i~~L~ 52 (54)
T PF07716_consen 26 REEELEQEVQELEEENEQLRQEIAQLE 52 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 356777888888888888888887764
No 46
>KOG3231|consensus
Probab=56.62 E-value=41 Score=25.34 Aligned_cols=57 Identities=18% Similarity=0.267 Sum_probs=37.1
Q ss_pred CCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcH------HHHHHHhcccccc
Q psy8735 49 NVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSK------KILLEEVGIETDE 107 (115)
Q Consensus 49 ~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~k------k~L~EelGiEtDe 107 (115)
..+.|+-.- ..-..|.+..--------|.+|-+|-|+++.+.-. .+++++||||.-.
T Consensus 113 k~M~pek~~--~tmr~FQ~anmKMemTeEMiNDTLDdild~sgDeeEs~aiVNqVLDEIGIEisg 175 (208)
T KOG3231|consen 113 KKMDPEKTL--QTMRNFQKANMKMEMTEEMINDTLDDILDGSGDEEESQAIVNQVLDEIGIEISG 175 (208)
T ss_pred ccCCHHHHH--HHHHHHHHHHHHhhhHHHHHHhhHHHHhcCCCcHHHHHHHHHHHHHHhhhhhcc
Confidence 444555444 44444444433333345688899999999988632 4788999999876
No 47
>PRK04325 hypothetical protein; Provisional
Probab=56.50 E-value=42 Score=21.28 Aligned_cols=26 Identities=19% Similarity=0.111 Sum_probs=14.3
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
|.++|-..|.+...++..|..+|..|
T Consensus 24 tIe~LN~vv~~Qq~~I~~L~~ql~~L 49 (74)
T PRK04325 24 LIDGLNATVARQQQTLDLLQAQLRLL 49 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556555555555555555555444
No 48
>COG1093 SUI2 Translation initiation factor 2, alpha subunit (eIF-2alpha) [Translation, ribosomal structure and biogenesis]
Probab=56.07 E-value=26 Score=27.92 Aligned_cols=40 Identities=28% Similarity=0.395 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhcccc
Q psy8735 65 KEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVGIET 105 (115)
Q Consensus 65 ~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiEt 105 (115)
..+-..+|++.+|.. -++..+++-++++.+++|+++|..-
T Consensus 90 ~~~k~~~wk~~qka~-klle~aaekl~~~~ee~~~~vg~~L 129 (269)
T COG1093 90 RRKKIQEWKKEQKAD-KLLELAAEKLGKDLEEAYEEVGWKL 129 (269)
T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHhCCCHHHHHHHHhHHH
Confidence 346677788777765 4788899999999999999998653
No 49
>PRK09039 hypothetical protein; Validated
Probab=55.66 E-value=1.1e+02 Score=24.79 Aligned_cols=26 Identities=19% Similarity=0.184 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
..-|..+|+.|+.++..+++.|..+.
T Consensus 139 V~~L~~qI~aLr~Qla~le~~L~~ae 164 (343)
T PRK09039 139 VELLNQQIAALRRQLAALEAALDASE 164 (343)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555555555555555554443
No 50
>PF13801 Metal_resist: Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=55.47 E-value=39 Score=21.64 Aligned_cols=60 Identities=17% Similarity=0.160 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc
Q psy8735 26 KKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYP 91 (115)
Q Consensus 26 ~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p 91 (115)
....+..++..+...|..+-. +..++++... ....+.........+..-+.+-.+...++
T Consensus 60 ~~~~~r~~~~~~r~~l~~ll~-~~~~D~~~i~-----a~~~~~~~~~~~l~~~~~~~~~~~~~~Lt 119 (125)
T PF13801_consen 60 EMRALRQELRAARQELRALLA-APPPDEAAIE-----ALLEEIREAQAELRQERLEHLLEIRAVLT 119 (125)
T ss_dssp HHHHHHHHHHHHHHHHHHHHC-CSSS-HHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHHHHHHHHHHHHHc-CCCCCHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHcCC
Confidence 334444444444444444432 3455555444 33444444444444444444444444443
No 51
>cd00176 SPEC Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Probab=55.14 E-value=68 Score=22.19 Aligned_cols=80 Identities=15% Similarity=0.207 Sum_probs=47.1
Q ss_pred HHHHHHHHHHHccCCHHHHHHHH---HHHHHHHHHHHHHHHHhhh-cCCCCC---------hhhhcHHhHHHHHHHHHHH
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKK---AELQAELEEKKSKLEKLSR-NVNVIS---------EGEVDKKTITDTKEKMVKE 71 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i---~~L~~e~~~l~~rL~~l~~-~~~~Vs---------~ee~~~~~v~~~~~k~~~~ 71 (115)
...+..|.......+..++...+ ..+..++......+..+.. +..++. ..... .+...|...+..
T Consensus 17 ~~~e~~l~~~~~~~d~~~~~~~l~~~~~~~~e~~~~~~~~~~l~~~~~~L~~~~~~~~~~i~~~~~--~l~~~w~~l~~~ 94 (213)
T cd00176 17 SEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLE--ELNQRWEELREL 94 (213)
T ss_pred HHHHHHhcCcccCCCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHhcCCCChHHHHHHHH--HHHHHHHHHHHH
Confidence 34455555554444444555444 5555566666666666642 111111 13345 788889999999
Q ss_pred HHHHHHHHHHHHHHH
Q psy8735 72 YNKRKRMCTEMLEAI 86 (115)
Q Consensus 72 w~kRKRi~~di~d~I 86 (115)
+..|.+-....++..
T Consensus 95 ~~~r~~~L~~~~~~~ 109 (213)
T cd00176 95 AEERRQRLEEALDLQ 109 (213)
T ss_pred HHHHHHHHHHHHHHH
Confidence 999988777766543
No 52
>KOG4797|consensus
Probab=54.52 E-value=27 Score=24.40 Aligned_cols=40 Identities=13% Similarity=0.258 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcC----CCCChhhhcHHhHHHH
Q psy8735 23 AKQKKAELQAELEEKKSKLEKLSRNV----NVISEGEVDKKTITDT 64 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l~~~~----~~Vs~ee~~~~~v~~~ 64 (115)
.+++++-|+.+|.+|.+|...|...+ ..+||+..+ .+...
T Consensus 65 VREEVe~Lk~qI~eL~er~~~Le~EN~lLk~~~spe~L~--ql~~~ 108 (123)
T KOG4797|consen 65 VREEVEVLKEQIRELEERNSALERENSLLKTLASPEQLA--QLPAQ 108 (123)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCHHHHH--HHHHh
Confidence 57788889999999999999886422 678998888 66544
No 53
>PF14389 Lzipper-MIP1: Leucine-zipper of ternary complex factor MIP1
Probab=54.48 E-value=27 Score=22.88 Aligned_cols=28 Identities=29% Similarity=0.281 Sum_probs=25.0
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
.+.++..+|+.++.+|..++.++..|+.
T Consensus 55 ~~keLL~EIA~lE~eV~~LE~~v~~L~~ 82 (88)
T PF14389_consen 55 KAKELLEEIALLEAEVAKLEQKVLSLYR 82 (88)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5678999999999999999999999864
No 54
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=54.05 E-value=96 Score=27.46 Aligned_cols=62 Identities=19% Similarity=0.318 Sum_probs=42.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcC-CC---C-------Ch-------hhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 24 KQKKAELQAELEEKKSKLEKLSRNV-NV---I-------SE-------GEVDKKTITDTKEKMVKEYNKRKRMCTEMLEA 85 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~~~~-~~---V-------s~-------ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~ 85 (115)
...+..|++||..|..+|..|..++ .. + .. .++. .+++...++...|.+.-.=|++++-.
T Consensus 565 ~~~l~~L~~En~~L~~~l~~le~~~~~~~~~~p~~~~~~~~~e~~~l~~~~~--~~ekr~~RLkevf~~ks~eFr~av~~ 642 (722)
T PF05557_consen 565 KSTLEALQAENEDLLARLRSLEEGNSQPVDAVPTSSLESQEKEIAELKAELA--SAEKRNQRLKEVFKAKSQEFREAVYS 642 (722)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHTTTT----------------HHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHhcccCCCCCcccccchhhhhhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4778999999999999998885432 11 1 11 2334 66777777888887777777777766
Q ss_pred HH
Q psy8735 86 IL 87 (115)
Q Consensus 86 I~ 87 (115)
++
T Consensus 643 ll 644 (722)
T PF05557_consen 643 LL 644 (722)
T ss_dssp HH
T ss_pred Hh
Confidence 64
No 55
>PF10224 DUF2205: Predicted coiled-coil protein (DUF2205); InterPro: IPR019357 This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown.
Probab=53.45 E-value=61 Score=21.14 Aligned_cols=43 Identities=21% Similarity=0.270 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNV 48 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~ 48 (115)
|..|...|..|.+. .+..+.....|..++.-|+..+..|.+.+
T Consensus 25 i~~LQ~sL~~L~~R--ve~Vk~E~~kL~~EN~~Lq~YI~nLm~~s 67 (80)
T PF10224_consen 25 ILELQDSLEALSDR--VEEVKEENEKLESENEYLQQYIGNLMSSS 67 (80)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44455555555444 35566677777777777777887775433
No 56
>smart00338 BRLZ basic region leucin zipper.
Probab=52.68 E-value=38 Score=20.41 Aligned_cols=36 Identities=33% Similarity=0.466 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKL 41 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL 41 (115)
+..|+.++..|.+. +.+|...+..|..++..+...+
T Consensus 28 ~~~Le~~~~~L~~e--n~~L~~~~~~l~~e~~~lk~~~ 63 (65)
T smart00338 28 IEELERKVEQLEAE--NERLKKEIERLRRELEKLKSEL 63 (65)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh
Confidence 45566666555553 4455555555555555555443
No 57
>PRK10884 SH3 domain-containing protein; Provisional
Probab=51.76 E-value=44 Score=25.33 Aligned_cols=20 Identities=20% Similarity=0.106 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHhh
Q psy8735 26 KKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 26 ~i~~L~~e~~~l~~rL~~l~ 45 (115)
.+.+|+.+++++...|..++
T Consensus 133 ~~~~L~~~n~~L~~~l~~~~ 152 (206)
T PRK10884 133 VINGLKEENQKLKNQLIVAQ 152 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34445555555555555554
No 58
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=51.67 E-value=1.3e+02 Score=26.18 Aligned_cols=25 Identities=40% Similarity=0.516 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+.+...+..++.++..++.++..++
T Consensus 226 e~l~~~i~~l~~ele~a~~~l~~l~ 250 (650)
T TIGR03185 226 EDLAQEIAHLRNELEEAQRSLESLE 250 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444555555555555555443
No 59
>PF12252 SidE: Dot/Icm substrate protein; InterPro: IPR021014 This entry represents bacterial proteins that are typically between 397 and 1543 amino acids in length including SidE protein in the Dot/Icm pathway of Legionella pneumophila bacteria. There is little literature describing the family.
Probab=51.43 E-value=90 Score=29.91 Aligned_cols=69 Identities=23% Similarity=0.431 Sum_probs=51.9
Q ss_pred hHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh--cCCCCChhhhcHHhHHHHHHHHHHHHHHH
Q psy8735 3 DKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR--NVNVISEGEVDKKTITDTKEKMVKEYNKR 75 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~--~~~~Vs~ee~~~~~v~~~~~k~~~~w~kR 75 (115)
+++....-|..|++-|+..||-.--..|+.+|..+...|++|+. .+.+|+.++ ++...|......-.+|
T Consensus 1041 eKqnIa~AL~ALn~IPSdKEms~Is~eLReQIq~~KQ~LesLQRAV~TPVvtd~e----Kvr~rYe~LI~~iTKr 1111 (1439)
T PF12252_consen 1041 EKQNIAGALQALNNIPSDKEMSKISSELREQIQSVKQDLESLQRAVVTPVVTDAE----KVRVRYETLITDITKR 1111 (1439)
T ss_pred HhhhHHHHHHHHhcCCchhhhhhhhHHHHHHHHHHHHHHHHHHHhhcccccccHH----HHHHHHHHHHHHHHHH
Confidence 45667788999999999999988447899999999999999974 345555554 4466676666655555
No 60
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=51.25 E-value=90 Score=22.46 Aligned_cols=60 Identities=15% Similarity=0.305 Sum_probs=37.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEML 83 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~ 83 (115)
.++..++.+++..+..+...+..++..-.... ++++ ........+...|....+-|.++.
T Consensus 91 ~~l~~el~~l~~~~~~~~~~l~~~~~~~~~~~-~~~~--~~~~~l~~l~~~~~~~~~e~~~l~ 150 (191)
T PF04156_consen 91 QQLQEELDQLQERIQELESELEKLKEDLQELR-ELLK--SVEERLDSLDESIKELEKEIRELQ 150 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH-HHHH--HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456666666666666666666653221111 5566 677777777777777777777777
No 61
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=51.19 E-value=86 Score=26.61 Aligned_cols=41 Identities=17% Similarity=0.221 Sum_probs=32.8
Q ss_pred hHHHHHHHHHHHHc-----cCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 3 DKEKVKSKLKQLSQ-----QMTEAEAKQKKAELQAELEEKKSKLEK 43 (115)
Q Consensus 3 ~~~~l~seL~~L~s-----~lt~~el~~~i~~L~~e~~~l~~rL~~ 43 (115)
++-.++++|+.|.+ +|....++..|..|++++.....+|..
T Consensus 294 qLa~~~aeL~~L~~~~~p~sPqV~~l~~rI~aLe~QIa~er~kl~~ 339 (434)
T PRK15178 294 QLAEAKAEYAQLMVNGLDQNPLIPRLSAKIKVLEKQIGEQRNRLSN 339 (434)
T ss_pred HHHHHHHHHHHHHhhcCCCCCchhHHHHHHHHHHHHHHHHHHHhhc
Confidence 45567788888865 677888889999999999988888863
No 62
>PF14282 FlxA: FlxA-like protein
Probab=51.14 E-value=54 Score=22.08 Aligned_cols=35 Identities=14% Similarity=0.327 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhc
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVD 57 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~ 57 (115)
.....|..|..++..|..+|..|..+. ..+++++.
T Consensus 16 ~~~~~I~~L~~Qi~~Lq~ql~~l~~~~-~~~~e~k~ 50 (106)
T PF14282_consen 16 SSDSQIEQLQKQIKQLQEQLQELSQDS-DLDAEQKQ 50 (106)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHccc-CCCHHHHH
Confidence 347899999999999999999998753 44777765
No 63
>COG5185 HEC1 Protein involved in chromosome segregation, interacts with SMC proteins [Cell division and chromosome partitioning]
Probab=50.94 E-value=1.6e+02 Score=25.79 Aligned_cols=82 Identities=13% Similarity=0.230 Sum_probs=52.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh--cCCCCChhhhcHHhHHHH-------H-------HHHHHHHHHHHHHHHHHH
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLSR--NVNVISEGEVDKKTITDT-------K-------EKMVKEYNKRKRMCTEML 83 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~~--~~~~Vs~ee~~~~~v~~~-------~-------~k~~~~w~kRKRi~~di~ 83 (115)
...|...|+.-+.+++.|+++.+.|+. ....||+++.+ ....+ . .+.++.-..|++-+..++
T Consensus 332 l~kl~~eie~kEeei~~L~~~~d~L~~q~~kq~Is~e~fe--~mn~Ere~L~reL~~i~~~~~~L~k~V~~~~leaq~~~ 409 (622)
T COG5185 332 LEKLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFE--LMNQEREKLTRELDKINIQSDKLTKSVKSRKLEAQGIF 409 (622)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHhcCCCHHHHH--HHHHHHHHHHHHHHHhcchHHHHHHHHHhHHHHHHHHH
Confidence 356667777777777777777777763 44778888765 33333 2 345667778888888777
Q ss_pred HHHHhcCcCcHHHHHHHhccc
Q psy8735 84 EAILENYPKSKKILLEEVGIE 104 (115)
Q Consensus 84 d~I~E~~p~~kk~L~EelGiE 104 (115)
+.+..-+-+ -+-|...||+-
T Consensus 410 ~slek~~~~-~~sl~~~i~~~ 429 (622)
T COG5185 410 KSLEKTLRQ-YDSLIQNITRS 429 (622)
T ss_pred HHHHHHHHH-HHHHHHHhccc
Confidence 766433332 45566777765
No 64
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=50.48 E-value=1.3e+02 Score=24.30 Aligned_cols=20 Identities=5% Similarity=0.134 Sum_probs=12.1
Q ss_pred CCCCChhhhcHHhHHHHHHHHH
Q psy8735 48 VNVISEGEVDKKTITDTKEKMV 69 (115)
Q Consensus 48 ~~~Vs~ee~~~~~v~~~~~k~~ 69 (115)
....|..|.. ++...+..+.
T Consensus 265 ~r~~t~~Ei~--~Lk~~~~~Le 284 (312)
T smart00787 265 CRGFTFKEIE--KLKEQLKLLQ 284 (312)
T ss_pred cCCCCHHHHH--HHHHHHHHHH
Confidence 4666666666 6666665443
No 65
>COG3132 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=49.79 E-value=27 Score=26.62 Aligned_cols=27 Identities=19% Similarity=0.389 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
..++.+.+++|+.++.++..||+.+-+
T Consensus 187 s~dlearv~aLe~eva~L~~rld~ll~ 213 (215)
T COG3132 187 SSDLEARVEALEQEVAELRARLDSLLG 213 (215)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 356999999999999999999998743
No 66
>PRK12901 secA preprotein translocase subunit SecA; Reviewed
Probab=48.93 E-value=1e+02 Score=29.32 Aligned_cols=66 Identities=20% Similarity=0.286 Sum_probs=47.8
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHH----HHHHHHHhhh-cCCCCChhhhcHHhHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEE----KKSKLEKLSR-NVNVISEGEVDKKTITDTKEKMVKEYNKR 75 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~----l~~rL~~l~~-~~~~Vs~ee~~~~~v~~~~~k~~~~w~kR 75 (115)
|.+++.++ +.+|+++|+.+..+++..++. ...++..|+. .....+.++++ .+..+.....++|.+.
T Consensus 27 In~le~~~----~~LSDeeLr~kT~efk~rl~~~~~~~~~~~~~l~~~~~~~~~~~~~e--~~~~~id~~~~~~~~~ 97 (1112)
T PRK12901 27 IKAEYPEL----EALSNDELRAKTDEFKQYIKEAVADIDAKIEELKAEAIESLDIDERE--DIYAQIDKLEKEAYEI 97 (1112)
T ss_pred HHHhHHHH----HhCCHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhhccccchhhhh--HHHHHHHHHHHHHHHH
Confidence 44444444 568999999999999998877 5667777764 33447778888 8888887777777654
No 67
>cd04779 HTH_MerR-like_sg4 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 4). Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=48.40 E-value=51 Score=23.17 Aligned_cols=40 Identities=10% Similarity=0.113 Sum_probs=26.4
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhc
Q psy8735 18 MTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVD 57 (115)
Q Consensus 18 lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~ 57 (115)
.+..++...+..+...+..++.++..|......+..+++.
T Consensus 74 ~~~~~~~~~~~~l~~~i~~Le~~l~~L~~~~~~l~~~~~~ 113 (134)
T cd04779 74 KEQREVAQEVQLVCDQIDGLEHRLKQLKPIASQTDRAQRM 113 (134)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456677777888888888888887776544444444444
No 68
>PTZ00446 vacuolar sorting protein SNF7-like; Provisional
Probab=47.22 E-value=1e+02 Score=23.17 Aligned_cols=59 Identities=19% Similarity=0.293 Sum_probs=32.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILEN 89 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~ 89 (115)
.|+.++..|+.....++.+++.....+ +. .+.......-...-+|||++..-++.+...
T Consensus 31 ~Lk~~~~~L~krq~~Le~kIe~e~~~A-------k~--~~~~~kk~~Al~~LkrKK~~E~ql~q~~~q 89 (191)
T PTZ00446 31 KNREAIDALEKKQVQVEKKIKQLEIEA-------KQ--KVEQNQMSNAKILLKRKKLYEQEIENILNN 89 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-------HH--HHHcccHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444433211 22 344444445667889999999888777543
No 69
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=47.10 E-value=1.4e+02 Score=23.38 Aligned_cols=82 Identities=15% Similarity=0.321 Sum_probs=49.3
Q ss_pred HHHHHHHHHHHHccC-----CHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC-ChhhhcHHhHHHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQM-----TEAEAKQKKAELQAELEEKKSKLEKLSRNVNVI-SEGEVDKKTITDTKEKMVKEYNKRKR 77 (115)
Q Consensus 4 ~~~l~seL~~L~s~l-----t~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~V-s~ee~~~~~v~~~~~k~~~~w~kRKR 77 (115)
...+.+++..++..+ ..+++..++.+++.++..+.+|+...+..-..| +..+.. . +..-...|..|..
T Consensus 33 l~k~~~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e~kl~~v~~~~e~~--a----L~~E~~~ak~r~~ 106 (239)
T COG1579 33 LKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELR--A----LNIEIQIAKERIN 106 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHH--H----HHHHHHHHHHHHH
Confidence 344555555555543 456677888888888888888888776432333 444444 4 4444556677776
Q ss_pred HHHHHHHHHHhcCc
Q psy8735 78 MCTEMLEAILENYP 91 (115)
Q Consensus 78 i~~di~d~I~E~~p 91 (115)
-..+=+..+.+-..
T Consensus 107 ~le~el~~l~~~~~ 120 (239)
T COG1579 107 SLEDELAELMEEIE 120 (239)
T ss_pred HHHHHHHHHHHHHH
Confidence 66655555544433
No 70
>PF10147 CR6_interact: Growth arrest and DNA-damage-inducible proteins-interacting protein 1; InterPro: IPR018472 Members of this family of proteins act as negative regulators of G1 to S cell cycle phase progression by inhibiting cyclin-dependent kinases. Inhibitory effects are additive with GADD45 proteins but occur also in the absence of GADD45 proteins. Furthermore, they act as a repressor of the orphan nuclear receptor NR4A1 by inhibiting AB domain-mediated transcriptional activity []. They may be involved in the hormone-mediated regulation of NR4A1 transcriptional activity.; GO: 0007049 cell cycle, 0005634 nucleus
Probab=46.95 E-value=68 Score=24.74 Aligned_cols=48 Identities=15% Similarity=0.108 Sum_probs=28.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhcccccc
Q psy8735 60 TITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 60 ~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiEtDe 107 (115)
.--..+-.|...|+.+.+.-...-.+--+.-..--.+.-+.+|...|.
T Consensus 128 ~nM~Kmpk~i~e~~~~~~kk~~~~~~~k~rkerl~eEvre~fGy~vDp 175 (217)
T PF10147_consen 128 KNMAKMPKWIAEWKAKIAKKEAKAQAAKERKERLIEEVREHFGYKVDP 175 (217)
T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCCC
Confidence 444566777777877776665444433332222234556677888776
No 71
>TIGR02263 benz_CoA_red_C benzoyl-CoA reductase, subunit C. This model describes C subunit of benzoyl-CoA reductase, a 4-subunit enzyme. Many aromatic compounds are metabolized by way of benzoyl-CoA. This enzyme acts under anaerobic conditions.
Probab=46.93 E-value=58 Score=26.54 Aligned_cols=44 Identities=23% Similarity=0.129 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHH
Q psy8735 27 KAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNK 74 (115)
Q Consensus 27 i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~k 74 (115)
+.=+..|+..+...|++.. +..+|++.+. +.-+.|++.++.|++
T Consensus 137 ~~Y~~~el~~l~~~LE~~~--G~~it~e~L~--~aI~~~N~~R~~~~~ 180 (380)
T TIGR02263 137 GEFYTAELNELCEGLEHLS--GKKITDDAIR--ASIAVFNDNRKLIQA 180 (380)
T ss_pred HHHHHHHHHHHHHHHHHHH--CCCCCHHHHH--HHHHHHHHHHHHHHH
Confidence 4445555555555555543 2456666666 666666666555544
No 72
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=46.69 E-value=58 Score=19.55 Aligned_cols=24 Identities=46% Similarity=0.563 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
.+|...+..|+.++..|...+..|
T Consensus 29 ~~Le~~~~~L~~en~~L~~~~~~L 52 (64)
T PF00170_consen 29 EELEEKVEELESENEELKKELEQL 52 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444444
No 73
>PF11365 DUF3166: Protein of unknown function (DUF3166); InterPro: IPR021507 This eukaryotic family of proteins has no known function.
Probab=46.53 E-value=44 Score=22.59 Aligned_cols=25 Identities=24% Similarity=0.299 Sum_probs=21.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
=|+..+.+++.++..+...|.++++
T Consensus 19 LlRRkl~ele~eN~~l~~EL~kyk~ 43 (96)
T PF11365_consen 19 LLRRKLSELEDENKQLTEELNKYKS 43 (96)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3678889999999999999999975
No 74
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=46.36 E-value=37 Score=20.36 Aligned_cols=32 Identities=22% Similarity=0.324 Sum_probs=22.6
Q ss_pred HHccCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 14 LSQQMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 14 L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+-+.++--..+.++..++++++.++.+++++|
T Consensus 37 l~~~~~~~~~r~~~~~~~k~l~~le~e~~~lr 68 (68)
T PF06305_consen 37 LLSLPSRLRLRRRIRRLRKELKKLEKELEQLR 68 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 33444445667778888888888888888764
No 75
>PF01726 LexA_DNA_bind: LexA DNA binding domain; InterPro: IPR006199 This is the DNA binding domain of the LexA SOS regulon repressor which prevents expression of DNA repair proteins in bacteria. The aligned region contains a variant form of the helix-turn-helix DNA binding motif []. This domain usually at the N terminus is found associated with IPR006198 from INTERPRO the auto-proteolytic domain of LexA 3.4.21.88 from EC.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1LEA_A 1JHH_A 3JSP_A 1JHF_A 3JSO_B 1LEB_A 3K2Z_A.
Probab=46.15 E-value=54 Score=20.18 Aligned_cols=31 Identities=23% Similarity=0.434 Sum_probs=20.5
Q ss_pred HHHHHHHHHHHHHH-hcCcCcHHHHHHHhccc
Q psy8735 74 KRKRMCTEMLEAIL-ENYPKSKKILLEEVGIE 104 (115)
Q Consensus 74 kRKRi~~di~d~I~-E~~p~~kk~L~EelGiE 104 (115)
+.++++.=|.+.+. -|+|.+-.++.+.+|+-
T Consensus 7 rQ~~vL~~I~~~~~~~G~~Pt~rEIa~~~g~~ 38 (65)
T PF01726_consen 7 RQKEVLEFIREYIEENGYPPTVREIAEALGLK 38 (65)
T ss_dssp HHHHHHHHHHHHHHHHSS---HHHHHHHHTSS
T ss_pred HHHHHHHHHHHHHHHcCCCCCHHHHHHHhCCC
Confidence 34555555666565 48999999999999996
No 76
>PF07334 IFP_35_N: Interferon-induced 35 kDa protein (IFP 35) N-terminus; InterPro: IPR009938 This entry represents the N terminus of interferon-induced 35 kDa protein (IFP 35) (approximately 80 residues long), which contains a leucine zipper motif in an alpha helical configuration []. This group of proteins also includes N-myc-interactor (Nmi), a homologous interferon-induced protein.
Probab=45.87 E-value=59 Score=21.09 Aligned_cols=26 Identities=23% Similarity=0.452 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
.+|.++..+|+.++..+++.|+.++.
T Consensus 3 ~ei~eEn~~Lk~eiqkle~ELq~~~~ 28 (76)
T PF07334_consen 3 HEIQEENARLKEEIQKLEAELQQNKR 28 (76)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 56777788888888888888887754
No 77
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=45.82 E-value=1.2e+02 Score=22.25 Aligned_cols=25 Identities=4% Similarity=0.247 Sum_probs=17.6
Q ss_pred cCCCCChhhhcHHhHHHHHHHHHHHHH
Q psy8735 47 NVNVISEGEVDKKTITDTKEKMVKEYN 73 (115)
Q Consensus 47 ~~~~Vs~ee~~~~~v~~~~~k~~~~w~ 73 (115)
--.+.||+++. ++..-+......|-
T Consensus 133 ~~~vLTpEQRa--k~~e~~~~r~~~~~ 157 (170)
T PRK12750 133 MLSILTPEQKA--KFQELQQERMQECQ 157 (170)
T ss_pred HHHhCCHHHHH--HHHHHHHHHHHHHH
Confidence 45788999999 77777655555543
No 78
>COG3167 PilO Tfp pilus assembly protein PilO [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=45.40 E-value=39 Score=25.86 Aligned_cols=44 Identities=16% Similarity=0.351 Sum_probs=34.4
Q ss_pred CchHHHHHHHHHHHHccCCHHHHHHH-HHHHHHHHHHHHHHHHHh
Q psy8735 1 MDDKEKVKSKLKQLSQQMTEAEAKQK-KAELQAELEEKKSKLEKL 44 (115)
Q Consensus 1 ~~~~~~l~seL~~L~s~lt~~el~~~-i~~L~~e~~~l~~rL~~l 44 (115)
++++..+++|=.+|+|+......+.. ...|+++..++++++..|
T Consensus 48 ~eel~~~~~eEe~LKs~~q~K~~~aanL~~lr~Ql~emee~~~~l 92 (211)
T COG3167 48 LEELEELEAEEEELKSTYQQKAIQAANLEALRAQLAEMEERFDIL 92 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHH
Confidence 46778888888888888877666554 478888888888888877
No 79
>PF13514 AAA_27: AAA domain
Probab=45.26 E-value=1.2e+02 Score=28.36 Aligned_cols=81 Identities=26% Similarity=0.450 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC-ChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVI-SEGEVDKKTITDTKEKMVKEYNKRKRMCTEML 83 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~V-s~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~ 83 (115)
..+..++..+.+.+ +++...+..+..++..++.+|..|.+++... -..++. .+..........|... +++..++
T Consensus 892 ~~l~~~l~~l~~~l--~~l~~~~~~l~~~~~~~~~~l~~l~~~~~~a~l~~e~e--~~~a~l~~~~~~~~~~-~la~~lL 966 (1111)
T PF13514_consen 892 DELEAELEELEEEL--EELEEELEELQEERAELEQELEALEGDDDAAELEQERE--EAEAELEELAEEWAAL-RLAAELL 966 (1111)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhCCchHHHHHHHHH--HHHHHHHHHHHHHHHH-HHHHHHH
Confidence 34444444444433 2245555555555556666666654433222 124556 7777778888888654 4555555
Q ss_pred HHHHhcC
Q psy8735 84 EAILENY 90 (115)
Q Consensus 84 d~I~E~~ 90 (115)
....+.|
T Consensus 967 ~~a~~~~ 973 (1111)
T PF13514_consen 967 EEAIERY 973 (1111)
T ss_pred HHHHHHH
Confidence 5544444
No 80
>PF03938 OmpH: Outer membrane protein (OmpH-like); InterPro: IPR005632 This entry includes outer membrane proteins such as OmpH (Skp) among others. OmpH (outer membrane protein H) is a major structural protein of the outer membrane. In Pasteurella multocida it acts as a channel-forming transmembrane porin []. Porins act as molecular sieves to allow the diffusion of small hydrophilic solutes through the outer membrane and also acts as a receptor for bacteriophages and bacteriocins. Porins are highly immunogenic and are conserved in bacterial families, making them attractive vaccine candidates []. The 17kDa protein (Skp, OmpH) of Escherichia coli is a homotrimeric periplasmic chaperone for newly synthesised outer-membrane proteins, the X-ray structure of which has been reported at resolutions of 2.35 A and 2.30 A [, ]. Three hairpin-shaped alpha-helical extensions reach out by approximately 60 A from a trimerisation domain, which is composed of three intersubunit beta-sheets that wind around a central axis. The alpha-helical extensions approach each other at their distal turns, resulting in a fold that resembles a 'three-pronged grasping forcep'. The overall shape of Skp is reminiscent of the cytosolic chaperone prefoldin (IPR009053 from INTERPRO), although it is based on a radically different topology. The peculiar architecture, with apparent plasticity of the prongs and distinct electrostatic and hydrophobic surface properties, supports the recently proposed biochemical mechanism of this chaperone: formation of a Skp(3)-Omp complex protects the outer membrane protein from aggregation during passage through the bacterial periplasm. The ability of Skp to prevent the aggregation of model substrates in vitro is independent of ATP. Skp can interact directly with membrane lipids and lipopolysaccharide. These interactions are needed for efficient Skp-assisted folding of membrane proteins [].; GO: 0051082 unfolded protein binding; PDB: 1SG2_C 1U2M_C.
Probab=45.18 E-value=44 Score=23.26 Aligned_cols=51 Identities=25% Similarity=0.327 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh----hhcCCCCChhhhcHHhHHHHHHHHHHHHHH
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKL----SRNVNVISEGEVDKKTITDTKEKMVKEYNK 74 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l----~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~k 74 (115)
.+.......+.++..+...|..+ .+....+|++++. ....++.........
T Consensus 40 ~l~~~~~~~~~~l~~~~~el~~~~~~l~~~~~~ls~~~~~--~~~~~l~~~~~~l~~ 94 (158)
T PF03938_consen 40 KLQEKFKALQKELQAKQKELQKLQQKLQSQKATLSEEERQ--KRQQELQQKEQELQQ 94 (158)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHTTS----SSHHHH--HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhHHH--HHHHHHHHHHHHHHH
Confidence 33444444444444444444444 3445678888877 666666555444443
No 81
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=44.56 E-value=51 Score=22.23 Aligned_cols=25 Identities=16% Similarity=0.098 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
..+++.+++++++++++++.+-..|
T Consensus 29 ~~~l~~q~~~~~~e~~~l~~~n~~L 53 (105)
T PRK00888 29 YWRVNDQVAAQQQTNAKLKARNDQL 53 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444444
No 82
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=43.56 E-value=83 Score=21.44 Aligned_cols=77 Identities=18% Similarity=0.350 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHH-------HHHHHHHHHHHHHHHHhhh-cCCCCChhhhcHHhHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKK-------AELQAELEEKKSKLEKLSR-NVNVISEGEVDKKTITDTKEKMVKEYNKR 75 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i-------~~L~~e~~~l~~rL~~l~~-~~~~Vs~ee~~~~~v~~~~~k~~~~w~kR 75 (115)
+..++..+..|...+ .+|+..+ +.|+-|+..|..+|..+.. ......+.... .+-..|....+.|..=
T Consensus 10 l~~le~~l~~l~~~~--~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~~~~~~~~~~~~~--~~~~g~~NL~~LY~EG 85 (107)
T PF06156_consen 10 LDQLEQQLGQLLEEL--EELKKQLQELLEENARLRIENEHLRERLEELEQEEEEKEEKKTKK--KLGEGRDNLARLYQEG 85 (107)
T ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc--cccchHHHHHHHHhcC
Confidence 444555554444432 3444444 4455566666666666543 11222233333 4455555555555544
Q ss_pred HHHHHHHHH
Q psy8735 76 KRMCTEMLE 84 (115)
Q Consensus 76 KRi~~di~d 84 (115)
=.||...++
T Consensus 86 FHICn~~yG 94 (107)
T PF06156_consen 86 FHICNVHYG 94 (107)
T ss_pred eeeCcHHhC
Confidence 445554443
No 83
>KOG2916|consensus
Probab=43.54 E-value=53 Score=26.46 Aligned_cols=50 Identities=12% Similarity=0.163 Sum_probs=42.2
Q ss_pred CCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhcccccc
Q psy8735 48 VNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 48 ~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiEtDe 107 (115)
-.-|||+|+. +-+..|.+.. +..-|+-.++|.++..-.+|+..+|+-.+.
T Consensus 87 krrVs~ed~~--kC~Er~~ksK--------~v~sIlrhvAe~~g~~LeeLY~~igW~~~r 136 (304)
T KOG2916|consen 87 KRRVSPEDKE--KCEERFAKSK--------LVYSILRHVAEKLGYELEELYQRIGWPLER 136 (304)
T ss_pred hccCCHHHHH--HHHHHHHHhH--------HHHHHHHHHHHHhCCcHHHHHHHhCCchhh
Confidence 3679999999 8888887765 445888899999998899999999998776
No 84
>cd07627 BAR_Vps5p The Bin/Amphiphysin/Rvs (BAR) domain of yeast Sorting Nexin Vps5p. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. Vsp5p is the yeast counterpart of human SNX1 and is part of the retromer complex, which functions in the endosome-to-Golgi retrieval of vacuolar protein sorting receptor Vps10p, the Golgi-resident membrane protein A-ALP, and endopeptidase Kex2. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in
Probab=43.38 E-value=1.2e+02 Score=22.54 Aligned_cols=61 Identities=26% Similarity=0.330 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcH
Q psy8735 25 QKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSK 94 (115)
Q Consensus 25 ~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~k 94 (115)
..+..++.++.+++.+....+..-..||. .+..+...+. +.|..-|++++...+++.-..-
T Consensus 143 ~K~~~~~~ei~~~e~~~~~a~~~~e~is~------~~k~El~rF~---~~r~~dfk~~l~~~~e~~ie~~ 203 (216)
T cd07627 143 EKLNSLLSELEEAERRASELKKEFEEVSE------LIKSELERFE---RERVEDFRNSVEIYLESAIESQ 203 (216)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH
Confidence 45666666666666666665542222222 2222222222 6677777888877777766533
No 85
>PF07544 Med9: RNA polymerase II transcription mediator complex subunit 9; InterPro: IPR011425 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. This entry represents subunit Med9 of the Mediator complex. Subunit Med9 is part of the middle module of the Mediator complex []; this associates with the core polymerase subunits to form the RNA polymerase II holoenzyme. Med9 alternatively known as the chromosome segregation protein, CSE2 (P33308 from SWISSPROT) is required, along with CSE1 (P33307 from SWISSPROT) for accurate mitotic chromosome segregation in Saccharomyces cerevisiae (Baker's yeast) [].; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=43.32 E-value=64 Score=20.75 Aligned_cols=28 Identities=21% Similarity=0.432 Sum_probs=17.8
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
|.++-..+|..|+.++....+-|..|++
T Consensus 53 s~eeq~~~i~~Le~~i~~k~~~L~~~~~ 80 (83)
T PF07544_consen 53 SVEEQEEEIEELEEQIRKKREVLQKFKE 80 (83)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5566666666666666666666666653
No 86
>PF15469 Sec5: Exocyst complex component Sec5
Probab=42.88 E-value=52 Score=23.69 Aligned_cols=50 Identities=16% Similarity=0.246 Sum_probs=36.4
Q ss_pred CCChhhhcHHhHHHHHHHHHHHHHHHH---HHHHHHHHHHHhcCcCcHHHHHHHh
Q psy8735 50 VISEGEVDKKTITDTKEKMVKEYNKRK---RMCTEMLEAILENYPKSKKILLEEV 101 (115)
Q Consensus 50 ~Vs~ee~~~~~v~~~~~k~~~~w~kRK---Ri~~di~d~I~E~~p~~kk~L~Eel 101 (115)
.+...+-. .+-..|.+++..+...+ .+|..||..|-..+..=+..||+.|
T Consensus 96 ~i~~~dy~--~~i~dY~kak~l~~~~~~~~~vf~~v~~eve~ii~~~r~~l~~~L 148 (182)
T PF15469_consen 96 CIKKGDYD--QAINDYKKAKSLFEKYKQQVPVFQKVWSEVEKIIEEFREKLWEKL 148 (182)
T ss_pred HHHcCcHH--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556667 88888989888888865 7888888888666655555666655
No 87
>PF09457 RBD-FIP: FIP domain ; InterPro: IPR019018 The Rab11 GTPase regulates recycling of internalized plasma membrane receptors and is essential for completion of cytokinesis. A family of Rab11 interacting proteins (FIPs) that conserve a C-terminal Rab-binding domain (RBD) selectively recognise the active form of Rab11. FIPs are diverse in sequence length and composition toward their N-termini, presumably a feature that underpins their specific roles in Rab11-mediated vesicle trafficking. They have been divided into three subfamilies (classe I, II, and III)on the basis of domain architecture. Class I FIPs comprises a subfamily of three proteins (Rip11/pp75/FIP5, Rab-coupling protein (RCP), and FIP2) that possess an N- terminal C2 domain, localize to recycling endosomes, and regulate plasma membrane recycling. The class II subfamily consists of two proteins (FIP3/eferin/arfophilin and FIP4) with tandem EF hands and a proline-rich region. Class II FIPs localize to recycling endosomes, the trans-Golgi network, and have been implicated in the regulation of membrane trafficking during cytokinesis. The class III subfamily consists of a single protein, FIP1, which does not contain obvious homology domains or motifs other than the FIP-RBD [, , , ]. The FIP-RBD domain is also found in Rab6-interacting protein Erc1/Elks. Erc1 is the regulatory subunit of the IKK complex and probably recruits IkappaBalpha/NFKBIA to the complex []. It may be involved in the organisation of the cytomatrix at the nerve terminals active zone (CAZ) which regulates neurotransmitter release. It may also be involved in vesicle trafficking at the CAZ, as well as in Rab-6 regulated endosomes to Golgi transport []. The FIB-RBD domain consists of an N-terminal long alpha-helix, followed by a 90 degrees bend at a conserved proline residue, a 3(10) helix and a C-terminal short beta-strand, adopting an "L" shape. The long alpha-helix forms a parallel coiled-coil homodimer that symmetrically interacts with two Rab11 molecules on both sides, forming a quaternary Rab11-(FIP)2-Rab11 complex. The Rab11-interacting region of FIP-RBD is confined to the C-terminal 24 amino acids, which cover the C-terminal half of the long alpha-helix and the short beta-strand [, , , ]. This entry represents the FIP-RBD domain.; PDB: 2HV8_E 2D7C_D 2K6S_B 2GZD_D 2GZH_B.
Probab=42.52 E-value=72 Score=18.81 Aligned_cols=26 Identities=12% Similarity=0.104 Sum_probs=17.4
Q ss_pred CHHHHHHHHHHHHHHHHH-------HHHHHHHh
Q psy8735 19 TEAEAKQKKAELQAELEE-------KKSKLEKL 44 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~-------l~~rL~~l 44 (115)
|-+++...+.+.+.++.. |+..++.|
T Consensus 1 s~eeL~~~l~~~e~~~~~k~~~v~eLe~YiD~L 33 (48)
T PF09457_consen 1 SREELISLLKKQEEENARKDSRVRELEDYIDNL 33 (48)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456777777777777777 55555554
No 88
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=42.16 E-value=2.9e+02 Score=26.35 Aligned_cols=12 Identities=8% Similarity=0.219 Sum_probs=7.3
Q ss_pred hcCcCcHHHHHH
Q psy8735 88 ENYPKSKKILLE 99 (115)
Q Consensus 88 E~~p~~kk~L~E 99 (115)
+-+|.+..+|.+
T Consensus 294 ~~l~~s~eEL~~ 305 (1311)
T TIGR00606 294 KVFQGTDEQLND 305 (1311)
T ss_pred ccCCCCHHHHHH
Confidence 457777666553
No 89
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=41.67 E-value=95 Score=24.05 Aligned_cols=36 Identities=25% Similarity=0.342 Sum_probs=22.3
Q ss_pred HHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Q psy8735 10 KLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRN 47 (115)
Q Consensus 10 eL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~ 47 (115)
+|..||..+ +.-...|..|+.++..|+.+|..++..
T Consensus 5 eL~~LNdRl--a~YIekVr~LE~~N~~Le~~i~~~~~~ 40 (312)
T PF00038_consen 5 ELQSLNDRL--ASYIEKVRFLEQENKRLESEIEELREK 40 (312)
T ss_dssp HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH----
T ss_pred HHHHHHHHH--HHHHHHHHHHHHHhhhhHHHHHHHHhc
Confidence 444555543 344667788888888888888888754
No 90
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=41.63 E-value=74 Score=18.74 Aligned_cols=25 Identities=12% Similarity=0.274 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+.|+.++..|+.++..|+..+...+
T Consensus 2 ~aLrqQv~aL~~qv~~Lq~~fs~yK 26 (46)
T PF09006_consen 2 NALRQQVEALQGQVQRLQAAFSQYK 26 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3566777777777777777766654
No 91
>PF00261 Tropomyosin: Tropomyosin; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=41.45 E-value=1.6e+02 Score=22.38 Aligned_cols=41 Identities=15% Similarity=0.381 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHccCCHH-----HHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMTEA-----EAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~-----el~~~i~~L~~e~~~l~~rL~~l 44 (115)
|..|+.+|..+.+.+-.- .+-.....++..+..|..+|...
T Consensus 143 i~eLE~el~~~~~~lk~lE~~~~~~~~re~~~e~~i~~L~~~lkea 188 (237)
T PF00261_consen 143 IKELEEELKSVGNNLKSLEASEEKASEREDEYEEKIRDLEEKLKEA 188 (237)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 566677777766666442 22333445555555555555554
No 92
>PF04065 Not3: Not1 N-terminal domain, CCR4-Not complex component ; InterPro: IPR007207 The Ccr4-Not complex (Not1, Not2, Not3, Not4 and Not5) is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID []. This domain is the N-terminal region of the Not proteins.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=40.83 E-value=1.5e+02 Score=23.01 Aligned_cols=11 Identities=0% Similarity=0.419 Sum_probs=6.9
Q ss_pred HHHHHHHhccc
Q psy8735 94 KKILLEEVGIE 104 (115)
Q Consensus 94 kk~L~EelGiE 104 (115)
...|+++||++
T Consensus 221 de~iYddl~Ld 231 (233)
T PF04065_consen 221 DEDIYDDLNLD 231 (233)
T ss_pred hHhHhhccCCC
Confidence 35666776665
No 93
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=40.09 E-value=74 Score=20.48 Aligned_cols=32 Identities=31% Similarity=0.472 Sum_probs=25.8
Q ss_pred HHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 14 LSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 14 L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
..+.+|.+| ...+.+|-...+.|++|++.|.+
T Consensus 32 ~~~gLs~~d-~~~L~~L~~~a~rm~eRI~tLE~ 63 (75)
T PF06667_consen 32 SSQGLSEED-EQRLQELYEQAERMEERIETLER 63 (75)
T ss_pred cCCCCCHHH-HHHHHHHHHHHHHHHHHHHHHHH
Confidence 356677655 77889999999999999999864
No 94
>COG3007 Uncharacterized paraquat-inducible protein B [Function unknown]
Probab=39.81 E-value=27 Score=28.75 Aligned_cols=30 Identities=27% Similarity=0.481 Sum_probs=24.6
Q ss_pred HHHHHHHHH-------HhcCcCcHHHHHHHhcccccc
Q psy8735 78 MCTEMLEAI-------LENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 78 i~~di~d~I-------~E~~p~~kk~L~EelGiEtDe 107 (115)
-|+++|++| +-.|.+=|++|+.-+|+|.|.
T Consensus 344 ~v~~lw~qvt~eN~~qLtDy~gyK~EFl~LfGFe~dg 380 (398)
T COG3007 344 QVRELWDQVTNENLYQLTDYAGYKREFLNLFGFEIDG 380 (398)
T ss_pred HHHHHHHhcCHHHHHHHhhhhhhHHHHHHhcCccccC
Confidence 477889988 445667889999999999886
No 95
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=39.68 E-value=1.2e+02 Score=22.41 Aligned_cols=33 Identities=24% Similarity=0.373 Sum_probs=24.6
Q ss_pred HHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 14 LSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 14 L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
|.......++...|..|+.++..++.++..++.
T Consensus 116 l~~e~~~~~l~~~i~~L~~e~~~L~~~~~~l~~ 148 (189)
T PF10211_consen 116 LQAEQGKQELEEEIEELEEEKEELEKQVQELKN 148 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444678888888888888888888888864
No 96
>PRK04406 hypothetical protein; Provisional
Probab=39.49 E-value=1e+02 Score=19.66 Aligned_cols=26 Identities=12% Similarity=0.143 Sum_probs=13.5
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
|.++|...|.+...++..|..+|..|
T Consensus 26 tIe~LN~~v~~Qq~~I~~L~~ql~~L 51 (75)
T PRK04406 26 TIEELNDALSQQQLLITKMQDQMKYV 51 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555555555555555555444
No 97
>PF06013 WXG100: Proteins of 100 residues with WXG; InterPro: IPR010310 ESAT-6 is a small protein appears to be of fundamental importance in virulence and protective immunity in Mycobacterium tuberculosis. Homologues have been detected in other Gram-positive bacterial species. It may represent a novel secretion system potentially driven by the PF01580 from PFAM domains in the YukA-like proteins []. Members of this protein family include secretion targets for type main variants of type VII secretion systems (T7SS), one found in the Actinobacteria, one found in the Firmicutes. This model was derived through iteration from PF06013 from PFAM. The best characterised member of this family is ESAT-6 from Mycobacterium tuberculosis. Members of this family usually are ~100 amino acids in length but occasionally have long C-terminal extension. ; PDB: 3FAV_A 1WA8_A 3Q4H_B 2KG7_A 2VRZ_B 2VS0_B 3OGI_A 3H6P_B 3GVM_B 3GWK_C ....
Probab=39.36 E-value=83 Score=18.65 Aligned_cols=28 Identities=25% Similarity=0.420 Sum_probs=16.1
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 18 MTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 18 lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.+.++|...+..+......+...+..|.
T Consensus 4 vd~~~l~~~a~~~~~~~~~l~~~~~~l~ 31 (86)
T PF06013_consen 4 VDPEQLRAAAQQLQAQADELQSQLQQLE 31 (86)
T ss_dssp SCHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455566666666665555555555554
No 98
>PRK15396 murein lipoprotein; Provisional
Probab=38.95 E-value=70 Score=20.73 Aligned_cols=36 Identities=11% Similarity=0.173 Sum_probs=21.9
Q ss_pred HHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 11 LKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 11 L~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
|....+++-.++|..++..|..++..+..-...++.
T Consensus 18 LaGCAs~~kvd~LssqV~~L~~kvdql~~dv~~~~~ 53 (78)
T PRK15396 18 LAGCSSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRS 53 (78)
T ss_pred HHHcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455556665566666666666666666666666554
No 99
>PHA00728 hypothetical protein
Probab=38.73 E-value=55 Score=23.39 Aligned_cols=22 Identities=36% Similarity=0.415 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhh
Q psy8735 25 QKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 25 ~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
..+++|++++.++..+|..|.+
T Consensus 5 teveql~keneelkkkla~lea 26 (151)
T PHA00728 5 TEVEQLKKENEELKKKLAELEA 26 (151)
T ss_pred hHHHHHHHhHHHHHHHHHHHHH
Confidence 4678889999999998888764
No 100
>PF05308 Mito_fiss_reg: Mitochondrial fission regulator; InterPro: IPR007972 This family consists of several uncharacterised eukaryotic proteins of unknown function.
Probab=38.58 E-value=50 Score=25.89 Aligned_cols=29 Identities=24% Similarity=0.337 Sum_probs=25.8
Q ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 16 QQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 16 s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
..+.++++.++|..|+.|...|.+++..+
T Consensus 113 ~~~~~~~AlqKIsALEdELs~LRaQIA~I 141 (253)
T PF05308_consen 113 DLPANEAALQKISALEDELSRLRAQIAKI 141 (253)
T ss_pred ccCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56778888999999999999999999888
No 101
>KOG3647|consensus
Probab=38.47 E-value=2.2e+02 Score=23.18 Aligned_cols=28 Identities=36% Similarity=0.390 Sum_probs=23.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhc
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLSRN 47 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~~~ 47 (115)
.+.|-++|+.-+.+...+..||+.|++-
T Consensus 135 ea~L~~Kierrk~ElEr~rkRle~Lqsi 162 (338)
T KOG3647|consen 135 EAALGSKIERRKAELERTRKRLEALQSI 162 (338)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 3567888888889999999999999863
No 102
>PF07303 Occludin_ELL: Occludin homology domain; InterPro: IPR010844 This represents a conserved region approximately 100 residues long within eukaryotic occludin proteins and the RNA polymerase II elongation factor ELL. Occludin is an integral membrane protein that localises to tight junctions [], while ELL is an elongation factor that can increase the catalytic rate of RNA polymerase II transcription by suppressing transient pausing by polymerase at multiple sites along the DNA []. This shared domain is thought to mediate protein interactions [].; PDB: 1WPA_A 3G7C_A 1XAW_A.
Probab=38.45 E-value=1.2e+02 Score=20.31 Aligned_cols=59 Identities=17% Similarity=0.291 Sum_probs=36.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHH---HHHHHHHHHHHHHHH
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMV---KEYNKRKRMCTEMLE 84 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~---~~w~kRKRi~~di~d 84 (115)
+|.+.|.........|...|..+..|+ -.-..+. +|..+|++.. -.|...|+-|.-+=+
T Consensus 26 ~L~~~v~~v~~~f~~L~~~l~~l~~~s--~ey~~i~--~I~~eY~k~Kk~~p~y~~~K~Rc~yL~~ 87 (101)
T PF07303_consen 26 ELHAEVDAVSRRFQELDSELKRLPPGS--QEYKRIA--QILQEYNKKKKRDPNYQEKKKRCEYLHN 87 (101)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHS-TTS--HHHHHHH-----HHHHHHHHTSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCC--cHHHHHH--HHHHHHHHHHhcCccHHHHHHHHHHHHH
Confidence 456677777778888888887776522 1223344 5669998884 677777777776543
No 103
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=38.11 E-value=1.6e+02 Score=23.44 Aligned_cols=40 Identities=18% Similarity=0.324 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHHc-----cCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQ-----QMTEAEAKQKKAELQAELEEKKSKLEK 43 (115)
Q Consensus 4 ~~~l~seL~~L~s-----~lt~~el~~~i~~L~~e~~~l~~rL~~ 43 (115)
+..++++|+.+.+ +|..-.++.++..|+.++.+...++..
T Consensus 223 l~~~~~~l~~l~~~~~~~~P~v~~l~~~i~~l~~~i~~e~~~i~~ 267 (362)
T TIGR01010 223 LIRVQAQLAQLRSITPEQNPQVPSLQARIKSLRKQIDEQRNQLSG 267 (362)
T ss_pred HHHHHHHHHHHHhhCCCCCCchHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4456666666654 555666677777777776666655543
No 104
>PF06216 RTBV_P46: Rice tungro bacilliform virus P46 protein; InterPro: IPR009347 This family consists of several Rice tungro bacilliform virus P46 proteins. The function of this family is unknown.
Probab=37.75 E-value=1.2e+02 Score=24.49 Aligned_cols=57 Identities=11% Similarity=0.281 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh--cCCCCChhhhcHHhHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR--NVNVISEGEVDKKTITDT 64 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~--~~~~Vs~ee~~~~~v~~~ 64 (115)
++.-..|--+|+.++|+ |+.++..+++++.++.+.|+.|+. ....-|..+++ .+..-
T Consensus 73 ~~sy~~e~~~l~~qvs~--l~~~~~~~r~~~~~~~~~~eglrep~kkpiyttqdke--~lr~f 131 (389)
T PF06216_consen 73 RQSYSNEWISLNDQVSH--LQHQNSEQRQQIREMREIIEGLREPVKKPIYTTQDKE--RLRNF 131 (389)
T ss_pred HHHHHhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhhhhhhcCCccccccHH--HHHHH
Confidence 44444455555555443 677788888899999999999986 33444666666 55433
No 105
>PF07586 HXXSHH: Protein of unknown function (DUF1552); InterPro: IPR011447 This is a family of proteins identified in Rhodopirellula baltica.
Probab=37.57 E-value=1.4e+02 Score=23.31 Aligned_cols=42 Identities=17% Similarity=0.313 Sum_probs=36.9
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
++.+...++.|++.++.+| +.++.+--.-+.+++.||+....
T Consensus 164 LD~v~~d~~~L~~~Lg~~D-r~kLd~yl~sireiE~rl~~~~~ 205 (302)
T PF07586_consen 164 LDLVREDAKSLRRRLGAED-RQKLDQYLDSIREIEKRLQQAEA 205 (302)
T ss_pred HHHHHHHHHHHHhhcCHHH-HHHHHHHHHHHHHHHHHHHhhhh
Confidence 3567788999999999999 88899999999999999988764
No 106
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=37.40 E-value=1.1e+02 Score=22.23 Aligned_cols=38 Identities=24% Similarity=0.380 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLE 42 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~ 42 (115)
+++|++++..+..+ -.+.++...+.|+.++..+..+|.
T Consensus 60 ~~eLr~el~~~~k~-~~~~lr~~~e~L~~eie~l~~~L~ 97 (177)
T PF07798_consen 60 IAELRSELQNSRKS-EFAELRSENEKLQREIEKLRQELR 97 (177)
T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444433222 124455555666666666555543
No 107
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=37.36 E-value=1.1e+02 Score=20.81 Aligned_cols=23 Identities=22% Similarity=0.300 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 23 AKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+-..+.+++.++..+-+.+..|+
T Consensus 6 l~~~l~~le~~l~~l~~~~~~LK 28 (107)
T PF06156_consen 6 LFDRLDQLEQQLGQLLEELEELK 28 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444
No 108
>cd04790 HTH_Cfa-like_unk Helix-Turn-Helix DNA binding domain of putative Cfa-like transcription regulators. Putative helix-turn-helix (HTH) MerR-like transcription regulator; conserved, Cfa-like, unknown proteins (~172 a.a.). The N-terminal domain of these proteins appears to be related to the HTH domain of Cfa, a cyclopropane fatty acid synthase. These Cfa-like proteins have a unique C-terminal domain with conserved histidines (motif HXXFX7HXXF). Based on sequence similarity of the N-terminal domains, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domain
Probab=37.23 E-value=1.6e+02 Score=21.33 Aligned_cols=25 Identities=28% Similarity=0.530 Sum_probs=16.2
Q ss_pred HHHHHhcC-cCcHHHHHHHhcccccc
Q psy8735 83 LEAILENY-PKSKKILLEEVGIETDE 107 (115)
Q Consensus 83 ~d~I~E~~-p~~kk~L~EelGiEtDe 107 (115)
|.+-.|.. |..=.+|++.+||..+|
T Consensus 138 wh~~fe~~~p~~h~~~l~~~g~~~~~ 163 (172)
T cd04790 138 WHIEFEKMEPEAHQEFLQSLGIPEDE 163 (172)
T ss_pred HHHHHHHhCcHHHHHHHHHcCCCHHH
Confidence 33333443 44447899999998766
No 109
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=37.11 E-value=2.2e+02 Score=25.86 Aligned_cols=19 Identities=37% Similarity=0.588 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHhcCc
Q psy8735 73 NKRKRMCTEMLEAILENYP 91 (115)
Q Consensus 73 ~kRKRi~~di~d~I~E~~p 91 (115)
+.|+.+|.+.|..|...+.
T Consensus 1014 ~~~~~~f~~~f~~~~~~f~ 1032 (1164)
T TIGR02169 1014 KKKREVFMEAFEAINENFN 1032 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 5667777777777765544
No 110
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=36.98 E-value=68 Score=26.13 Aligned_cols=45 Identities=27% Similarity=0.272 Sum_probs=36.5
Q ss_pred HHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhh
Q psy8735 11 LKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGE 55 (115)
Q Consensus 11 L~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee 55 (115)
.+.-.+.-+..-+..++.+++.++...+.+|..++.....+++++
T Consensus 164 ~~~~~~~~~~~fl~~ql~~~~~~l~~ae~~l~~fr~~~~i~~~~~ 208 (444)
T TIGR03017 164 LKVEPAQKAALWFVQQIAALREDLARAQSKLSAYQQEKGIVSSDE 208 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcccCc
Confidence 334444456788899999999999999999999998777887765
No 111
>PF07757 AdoMet_MTase: Predicted AdoMet-dependent methyltransferase; InterPro: IPR011671 tRNA (uracil-O(2)-)-methyltransferase catalyses the formation of O(2)-methyl-uracil at position 44 (m2U44) in tRNA(Ser) [].; GO: 0008168 methyltransferase activity
Probab=36.98 E-value=66 Score=22.41 Aligned_cols=24 Identities=17% Similarity=0.260 Sum_probs=17.8
Q ss_pred HHHHHHHHhcCcCcHHHHHHHhccc
Q psy8735 80 TEMLEAILENYPKSKKILLEEVGIE 104 (115)
Q Consensus 80 ~di~d~I~E~~p~~kk~L~EelGiE 104 (115)
..+++.-.|+.+. .|..+|++||-
T Consensus 20 ~~lv~~W~E~TdP-~K~VfEDlaIA 43 (112)
T PF07757_consen 20 RWLVDNWPESTDP-QKHVFEDLAIA 43 (112)
T ss_pred HHHHHhCcccCCc-hhhHHHHHHHH
Confidence 4566666777766 78999999983
No 112
>PF13991 BssS: BssS protein family
Probab=36.97 E-value=57 Score=21.01 Aligned_cols=23 Identities=26% Similarity=0.260 Sum_probs=18.7
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHH
Q psy8735 17 QMTEAEAKQKKAELQAELEEKKS 39 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e~~~l~~ 39 (115)
.+|++.+++-|..|++.+..+++
T Consensus 41 ~lT~e~Ar~Li~~L~~~I~kiE~ 63 (73)
T PF13991_consen 41 WLTTEMARQLISILEAGIDKIES 63 (73)
T ss_pred EecHHHHHHHHHHHHHHHHHHHh
Confidence 47899999999988888877664
No 113
>PF07028 DUF1319: Protein of unknown function (DUF1319); InterPro: IPR010746 This entry is represented by Commelina yellow mottle virus, Orf1. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains a number of viral proteins of unknown function approximately 200 residues long. Family members seem to be restricted to badnaviruses.
Probab=36.92 E-value=84 Score=22.33 Aligned_cols=34 Identities=21% Similarity=0.402 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhh---cCCCCChhhhc
Q psy8735 24 KQKKAELQAELEEKKSKLEKLSR---NVNVISEGEVD 57 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~~---~~~~Vs~ee~~ 57 (115)
++.+..+..++..+...|..|+. ...+.|.+|..
T Consensus 59 r~~l~~l~~~l~~l~~eL~~Lr~~~l~rRPLtk~dVe 95 (126)
T PF07028_consen 59 RSELKELKQELDVLSKELQALRKEYLERRPLTKEDVE 95 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHH
Confidence 44567777777777777777764 56788888777
No 114
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=36.63 E-value=1e+02 Score=18.99 Aligned_cols=25 Identities=36% Similarity=0.597 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.+.......|..++..|+.+++.++
T Consensus 35 qeaE~rn~eL~~ei~~L~~e~ee~r 59 (61)
T PF08826_consen 35 QEAEKRNRELEQEIERLKKEMEELR 59 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3445555666666666666666655
No 115
>PF08654 DASH_Dad2: DASH complex subunit Dad2; InterPro: IPR013963 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ].
Probab=36.27 E-value=1.3e+02 Score=20.41 Aligned_cols=41 Identities=24% Similarity=0.388 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCC
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNV 50 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~ 50 (115)
|.+-++||..|. +++..-..|..+...+..+|..|..|+..
T Consensus 6 i~eKk~ELe~L~------~l~~lS~~L~~qle~L~~kl~~m~dg~e~ 46 (103)
T PF08654_consen 6 IAEKKAELEALK------QLRDLSADLASQLEALSEKLETMADGAEA 46 (103)
T ss_pred HHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHhccHH
Confidence 556667777664 56777788889999999999999887743
No 116
>PF08838 DUF1811: Protein of unknown function (DUF1811); InterPro: IPR014938 This entry consists uncharacterised bacterial proteins. Some of the proteins are annotated as being transcriptional regulators (see Q4MQL7 from SWISSPROT, Q65MA2 from SWISSPROT). The structure of one of the proteins has revealed a beta-barrel like structure with helix-turn-helix like motif. ; PDB: 2YXY_A 1SF9_A.
Probab=36.14 E-value=60 Score=22.25 Aligned_cols=23 Identities=26% Similarity=0.375 Sum_probs=16.3
Q ss_pred HccCCHHHHHHHHHHHHHHHHHH
Q psy8735 15 SQQMTEAEAKQKKAELQAELEEK 37 (115)
Q Consensus 15 ~s~lt~~el~~~i~~L~~e~~~l 37 (115)
=|.+|..||+.+|+.|+.+-.+.
T Consensus 4 ySeMs~~EL~~Ei~~L~ekarKA 26 (102)
T PF08838_consen 4 YSEMSEEELRQEIARLKEKARKA 26 (102)
T ss_dssp HHC--HHHHHHHHHHHHHHHHHH
T ss_pred hhhcCHHHHHHHHHHHHHHHHHH
Confidence 37889999999999988765443
No 117
>PRK11239 hypothetical protein; Provisional
Probab=36.10 E-value=69 Score=24.75 Aligned_cols=28 Identities=21% Similarity=0.394 Sum_probs=23.3
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
..+++...+..|+.++.+|+.+|+.|.+
T Consensus 184 ~~~~Le~rv~~Le~eva~L~~~l~~l~~ 211 (215)
T PRK11239 184 VDGDLQARVEALEIEVAELKQRLDSLLA 211 (215)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3466888999999999999999988864
No 118
>PF01923 Cob_adeno_trans: Cobalamin adenosyltransferase; InterPro: IPR002779 ATP:cob(I)alamin (or ATP:corrinoid) adenosyltransferases (2.5.1.17 from EC), catalyse the conversion of cobalamin (vitamin B12) into its coenzyme form, adenosylcobalamin (coenzyme B12) []. Adenosylcobalamin (AdoCbl) is required for the ativity of certain enzymes. AdoCbl contains an adenosyl moiety liganded to the cobalt ion of cobalamin via a covalent Co-C bond, and its synthesis is unique to certain prokaryotes. ATP:cob(I)alamin adenosyltransferases are classed into three groups: CobA-type [], EutT-type [] and PduO-type []. Each of the three enzyme types appears to be specialised for particular AdoCbl-dependent enzymes or for the de novo synthesis AdoCbl. PduO and EutT are distantly related, sharing short conserved motifs, while CobA is evolutionarily unrelated and is an example of convergent evolution. This entry represents EutT- and PduO-type ATP:cob(I)alamin adenosyltransferases. PduO functions to convert cobalamin to AdoCbl for 1,2-propanediol degradation [], while EutT produces AdoCbl for ethanolamine utilisation []. ; GO: 0005524 ATP binding, 0008817 cob(I)yrinic acid a,c-diamide adenosyltransferase activity, 0009236 cobalamin biosynthetic process; PDB: 2ZHY_A 2ZHZ_B 3KE5_C 3KE4_B 2AH6_C 1NOG_A 2IDX_C 3GAH_A 3CI1_A 3CI3_A ....
Probab=35.90 E-value=1.6e+02 Score=21.02 Aligned_cols=63 Identities=14% Similarity=0.231 Sum_probs=37.6
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh-cCCCCChhhhcHHhHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR-NVNVISEGEVDKKTITDTKEKM 68 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~-~~~~Vs~ee~~~~~v~~~~~k~ 68 (115)
+++|.|.+--..+.....++...+..++..+-.+..-|..-.. ....|++++.. .++....++
T Consensus 32 lDEl~a~igla~~~~~~~~~~~~L~~iq~~L~~l~~~la~~~~~~~~~i~~~~v~--~Le~~i~~~ 95 (163)
T PF01923_consen 32 LDELNAFIGLARSEIKEEELREILERIQNELFDLGAELATPEEDEEPEITEEDVQ--ELEEEIDEY 95 (163)
T ss_dssp HHHHHHHHHHHHTHCTTHHHHHHHHHHHHHHHHHHHHHHTTTTSSSCS--HHHHH--HHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcCchhHHHHHHHHHHHHHHHHHHHcCCCcccccccCHHHHH--HHHHHHHHH
Confidence 5667777777777776666777777776666666666655432 12556666666 555444443
No 119
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=35.62 E-value=2.1e+02 Score=27.36 Aligned_cols=44 Identities=23% Similarity=0.296 Sum_probs=35.8
Q ss_pred hHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 3 DKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
++..++.+|..+.+......+...|.....++..++.++..|..
T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~le~~~~~l~~ 522 (1311)
T TIGR00606 479 ELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQ 522 (1311)
T ss_pred HHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667777777778888888899998888888888888888863
No 120
>PF08340 DUF1732: Domain of unknown function (DUF1732); InterPro: IPR013551 This domain of unknown function is found at the C terminus of bacterial proteins, many of which are hypothetical and include proteins of the YicC family.
Probab=35.60 E-value=91 Score=20.70 Aligned_cols=40 Identities=13% Similarity=0.401 Sum_probs=33.6
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
+++--|..++.|.-...++...+-.++.++.++.+++..+
T Consensus 47 QEm~RE~NTigSKs~~~~i~~~vv~~K~~iEkiREQvqNi 86 (87)
T PF08340_consen 47 QEMNREINTIGSKSNDAEISNLVVEMKTEIEKIREQVQNI 86 (87)
T ss_pred hhhccHHHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 5566788899999999999999999999999998887653
No 121
>PF14265 DUF4355: Domain of unknown function (DUF4355)
Probab=35.14 E-value=1.1e+02 Score=20.63 Aligned_cols=21 Identities=38% Similarity=0.580 Sum_probs=11.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy8735 23 AKQKKAELQAELEEKKSKLEK 43 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~ 43 (115)
....+.++..++..++..+..
T Consensus 47 ~~~e~~~~~~el~~~~~e~~~ 67 (125)
T PF14265_consen 47 AQEELEELEKELEELEAELAR 67 (125)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 334555555555555555543
No 122
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=34.69 E-value=1.2e+02 Score=19.13 Aligned_cols=39 Identities=18% Similarity=0.326 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
|.+|..|-..|... .-.....|..|++.+..++..+..+
T Consensus 14 Ia~L~eEGekLSk~--el~~~~~IKKLr~~~~e~e~~~~~l 52 (74)
T PF12329_consen 14 IAQLMEEGEKLSKK--ELKLNNTIKKLRAKIKELEKQIKEL 52 (74)
T ss_pred HHHHHHHHHHHHHH--HHhhHHHHHHHHHHHHHHHHHHHHH
Confidence 55555555544432 1222334444444444444444444
No 123
>PF04867 DUF643: Protein of unknown function (DUF643); InterPro: IPR006951 These are proteins of unknown function found in Borrelia burgdorferi (Lyme disease spirochete)
Probab=34.61 E-value=1.2e+02 Score=20.96 Aligned_cols=40 Identities=15% Similarity=0.221 Sum_probs=34.7
Q ss_pred cCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 47 NVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILE 88 (115)
Q Consensus 47 ~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E 88 (115)
++..+|-+.+. .+...|.....--++--+-..+|++.|++
T Consensus 24 ~t~Q~TleqKr--qiYs~y~~~QeykrKtGKSi~eiind~l~ 63 (114)
T PF04867_consen 24 GTDQLTLEQKR--QIYSAYEAIQEYKRKTGKSINEIINDILD 63 (114)
T ss_pred chhhhhHHHHH--HHHHHHHHHHHHHHhhcccHHHHHHHHhh
Confidence 56788999999 99999988888888888888899998887
No 124
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=34.59 E-value=1.7e+02 Score=21.65 Aligned_cols=25 Identities=0% Similarity=0.234 Sum_probs=21.6
Q ss_pred CCCCChhhhcHHhHHHHHHHHHHHHHH
Q psy8735 48 VNVISEGEVDKKTITDTKEKMVKEYNK 74 (115)
Q Consensus 48 ~~~Vs~ee~~~~~v~~~~~k~~~~w~k 74 (115)
-.+.|||.+. ++...|.+...+|+.
T Consensus 121 y~lLTPEQKa--q~~~~~~~rm~~~~~ 145 (166)
T PRK10363 121 YRLLTPEQQA--VLNEKHQQRMEQLRD 145 (166)
T ss_pred HHhCCHHHHH--HHHHHHHHHHHHHHH
Confidence 3789999999 999999998888843
No 125
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=34.58 E-value=84 Score=24.84 Aligned_cols=39 Identities=36% Similarity=0.387 Sum_probs=20.0
Q ss_pred HHHHHHHHHccCCHHHHHHH-------HHHHHHHHHHHHHHHHHhh
Q psy8735 7 VKSKLKQLSQQMTEAEAKQK-------KAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 7 l~seL~~L~s~lt~~el~~~-------i~~L~~e~~~l~~rL~~l~ 45 (115)
.++-|++|+..|+-.+|+.. ..++.++..++..+|..|.
T Consensus 117 vRAGLktL~~v~~~~d~ke~~ee~kekl~E~~~EkeeL~~eleele 162 (290)
T COG4026 117 VRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELE 162 (290)
T ss_pred HHHHHHHHhccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35667777766665555443 3333334444444444443
No 126
>PF07739 TipAS: TipAS antibiotic-recognition domain; InterPro: IPR012925 TipAL is a bacterial transcriptional regulator of the MerR family. The tipA gene can be expressed as a long form, TipAL, and a short form, TipAS, which constitutes the C-terminal part of TipAL. TipAS forms the antibiotic-recognition domain []. This domain, which has an alpha-helical globin-like fold, is also found at the C terminus of other MerR family transcription factors, including Mta, a central regulator of multidrug resistance in Bacillus subtilis [], and SkgA from Caulobacter crescentus []. ; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 1NY9_A 3HH0_A 3QAO_A.
Probab=34.56 E-value=58 Score=21.23 Aligned_cols=70 Identities=17% Similarity=0.191 Sum_probs=42.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHh
Q psy8735 26 KKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLEEV 101 (115)
Q Consensus 26 ~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~Eel 101 (115)
...++..+...+-.+|..+...+...+..+.. .+...|..+...|-. .=.+++..+...|-.. ..|...+
T Consensus 29 ~~~~~~~~~~~l~~~l~~~~~~g~~p~s~evq--~l~~~~~~~~~~~~~---~~~~~~~~l~~~y~~~-~~~~~~~ 98 (118)
T PF07739_consen 29 EWQELQKEWDELFAELAALMEEGVDPDSPEVQ--ELAERWMELINQFTG---GDPELLRGLAQMYVED-PRFAAMY 98 (118)
T ss_dssp ----TTHHHHHHHHHHHHHHHHT--TT-HHHH--HHHHHHHHHHHHSS------HHHHHHHHHHTTST-HHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCcCCHHHH--HHHHHHHHHHHHHhC---CCHHHHHHHHHHHHcC-HHHHhhc
Confidence 34556677888889998887766677777888 899999888888766 2224555555555333 4444443
No 127
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=34.47 E-value=1.2e+02 Score=20.79 Aligned_cols=75 Identities=17% Similarity=0.281 Sum_probs=36.6
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEA 85 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~ 85 (115)
.+-.++..|+..+ .++..+-+.|+-|+..|..+|..+.......+. .. ..-..|.-..+.|..==.||...++.
T Consensus 19 ~l~~el~~LK~~~--~el~EEN~~L~iEN~~Lr~~l~~~~~~~~~~~~--~~--~~~~g~~NL~~LY~EGFHICn~~yG~ 92 (110)
T PRK13169 19 VLLKELGALKKQL--AELLEENTALRLENDKLRERLEELEAEEPAKEK--KK--KEGEGKDNLARLYQEGFHICNLHYGS 92 (110)
T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--cc--cccchHHHHHHHHHcCceecHHHhCC
Confidence 3334444444432 445555566666777777777776432111111 01 22334555555555555566655543
Q ss_pred H
Q psy8735 86 I 86 (115)
Q Consensus 86 I 86 (115)
.
T Consensus 93 ~ 93 (110)
T PRK13169 93 R 93 (110)
T ss_pred c
Confidence 3
No 128
>TIGR03190 benz_CoA_bzdN benzoyl-CoA reductase, bzd-type, N subunit. Members of this family are the N subunit of one of two related types of four-subunit ATP-dependent benzoyl-CoA reductase. This enzyme system catalyzes the dearomatization of benzoyl-CoA, a common intermediate in pathways for the degradation for a number of different aromatic compounds, such as phenol and toluene.
Probab=34.28 E-value=1.8e+02 Score=23.67 Aligned_cols=47 Identities=11% Similarity=0.105 Sum_probs=36.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHH
Q psy8735 24 KQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNK 74 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~k 74 (115)
...+.=...++..+...|+.+. +..+|.+.+. ++-+.+++.++.|++
T Consensus 130 ~~~~~y~~~el~~l~~~LE~~~--G~~i~~e~L~--~ai~~~n~~r~~~~~ 176 (377)
T TIGR03190 130 PHARKAHYAEVQRFRVFLQTLT--GKEITDDMLR--DALAVCDENRRLLRE 176 (377)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH--CCCCCHHHHH--HHHHHHHHHHHHHHH
Confidence 4445556778888888888764 4788999988 888888888888865
No 129
>PHA02562 46 endonuclease subunit; Provisional
Probab=33.98 E-value=1.5e+02 Score=24.74 Aligned_cols=7 Identities=14% Similarity=0.472 Sum_probs=2.5
Q ss_pred HHHHHHH
Q psy8735 7 VKSKLKQ 13 (115)
Q Consensus 7 l~seL~~ 13 (115)
++++++.
T Consensus 311 l~~~l~~ 317 (562)
T PHA02562 311 LQHSLEK 317 (562)
T ss_pred HHHHHHH
Confidence 3333333
No 130
>PRK14127 cell division protein GpsB; Provisional
Probab=33.94 E-value=89 Score=21.50 Aligned_cols=24 Identities=25% Similarity=0.360 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.+..++..|+.++..++.+|..++
T Consensus 41 ~l~~e~~~Lk~e~~~l~~~l~e~~ 64 (109)
T PRK14127 41 AFQKEIEELQQENARLKAQVDELT 64 (109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344555566666666666666654
No 131
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=33.89 E-value=2.4e+02 Score=22.34 Aligned_cols=33 Identities=21% Similarity=0.231 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhh--cCCCCChhhhc
Q psy8735 25 QKKAELQAELEEKKSKLEKLSR--NVNVISEGEVD 57 (115)
Q Consensus 25 ~~i~~L~~e~~~l~~rL~~l~~--~~~~Vs~ee~~ 57 (115)
.++..++.+....+..+...+. ....++..+..
T Consensus 165 ~~l~~~~~~l~~~~~~~~~~~~L~~~g~is~~~~~ 199 (423)
T TIGR01843 165 AQLQALRQQLEVISEELEARRKLKEKGLVSRLELL 199 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHH
Confidence 3334444444444444443321 33455555544
No 132
>PF07820 TraC: TraC-like protein; InterPro: IPR012930 The members of this family are sequences that are similar to TraC (Q84HT8 from SWISSPROT) from Rhizobium etli. The gene encoding this protein is one of a group of genes found on plasmid p42a of Rhizobium etli (strain CFN 42/ATCC 51251) that are thought to be involved in the process of plasmid self-transmission. Mobilisation of plasmid p42a is of importance as it is required for transfer of plasmid p42d, the symbiotic plasmid which carries most of the genes required for nodulation and nitrogen fixation by this symbiotic bacterium. The predicted protein products of p42a are similar to known transfer proteins of Agrobacterium tumefaciens plasmid pTiC58 []. ; GO: 0000746 conjugation
Probab=33.79 E-value=76 Score=21.35 Aligned_cols=24 Identities=17% Similarity=0.361 Sum_probs=18.4
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLE 42 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~ 42 (115)
+..+|..+|+.|+.+++.++.+..
T Consensus 3 ~~s~I~~eIekLqe~lk~~e~kea 26 (92)
T PF07820_consen 3 SSSKIREEIEKLQEQLKQAETKEA 26 (92)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHH
Confidence 567888888888888887776653
No 133
>PF07373 CAMP_factor: CAMP factor (Cfa); InterPro: IPR010860 This family consists of several bacterial CAMP factor (Cfa) proteins, which seem to be specific to Streptococcus species. The CAMP reaction is a synergistic lysis of erythrocytes by the interaction of an extracellular protein (CAMP factor) produced by some streptococcal species with the Staphylococcus aureus sphingomyelinase C (beta-toxin) [].
Probab=33.36 E-value=2.3e+02 Score=22.10 Aligned_cols=55 Identities=22% Similarity=0.158 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 27 KAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLE 84 (115)
Q Consensus 27 i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d 84 (115)
..+|..+++.+...++.+.. =+-.+|.+.. .++..-.-.+..|.-|.--.+++++
T Consensus 127 ~~ql~~~i~~l~~~~~kv~~-yPDL~~~D~A--TiYvK~~L~k~I~~~R~~Rd~~vl~ 181 (228)
T PF07373_consen 127 VDQLKDEIEALKALLEKVLQ-YPDLTPTDVA--TIYVKAKLDKAIWQARFDRDKCVLG 181 (228)
T ss_pred HHHHHHHHHHHHHHHHHHHh-CCCCCccchh--HHHHHHHHHHHHHHHHHhhhhHHhc
Confidence 34556666666666666653 3578999999 9999999999999999877777754
No 134
>PRK11637 AmiB activator; Provisional
Probab=33.27 E-value=2.7e+02 Score=22.84 Aligned_cols=32 Identities=9% Similarity=0.050 Sum_probs=15.1
Q ss_pred hHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhcCc
Q psy8735 60 TITDTKEKMVKEYNKRKRMCTE-----MLEAILENYP 91 (115)
Q Consensus 60 ~v~~~~~k~~~~w~kRKRi~~d-----i~d~I~E~~p 91 (115)
.+.............|-|-... .++.|+...+
T Consensus 114 ~~q~~l~~~~~~l~~rlra~Y~~g~~~~l~vLl~a~~ 150 (428)
T PRK11637 114 KLEQQQAAQERLLAAQLDAAFRQGEHTGLQLILSGEE 150 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcHHHHHhcCCC
Confidence 4445554554545544443332 3445555554
No 135
>PRK06342 transcription elongation factor regulatory protein; Validated
Probab=33.24 E-value=1.1e+02 Score=22.13 Aligned_cols=41 Identities=12% Similarity=0.108 Sum_probs=23.2
Q ss_pred HHHHHHHHHHHc-------cCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 5 EKVKSKLKQLSQ-------QMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 5 ~~l~seL~~L~s-------~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+.|+.||..|.. .=-..|.+.+...++..+..++.+|...+
T Consensus 37 ~~L~~El~~L~~~i~~Ar~~GDlsEak~~~~~~e~rI~~L~~~L~~A~ 84 (160)
T PRK06342 37 KALEDQLAQARAAYEAAQAIEDVNERRRQMARPLRDLRYLAARRRTAQ 84 (160)
T ss_pred HHHHHHHHHHHHHHHHHHHCCChhHHHHHHHHHHHHHHHHHHHHccCE
Confidence 345555555543 32233445566667777777777776653
No 136
>PRK10265 chaperone-modulator protein CbpM; Provisional
Probab=33.22 E-value=82 Score=20.91 Aligned_cols=24 Identities=17% Similarity=0.198 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
-+|-.+|.+|++++..++.||..+
T Consensus 74 l~LLd~i~~Lr~el~~L~~~l~~~ 97 (101)
T PRK10265 74 LTLLDEIAHLKQENRLLRQRLSRF 97 (101)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467788888888888888888654
No 137
>PF06717 DUF1202: Protein of unknown function (DUF1202); InterPro: IPR009592 This family consists of several hypothetical bacterial proteins of around 335 residues in length. Members of this family are found exclusively in Escherichia coli and Salmonella species and are often referred to as YggM proteins. The function of this family is unknown.
Probab=33.03 E-value=2.6e+02 Score=22.76 Aligned_cols=58 Identities=10% Similarity=0.184 Sum_probs=42.5
Q ss_pred chHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh---cCCCCChhhhcHHhH
Q psy8735 2 DDKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR---NVNVISEGEVDKKTI 61 (115)
Q Consensus 2 ~~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~---~~~~Vs~ee~~~~~v 61 (115)
+.+..+++++..-+++ ...|..++.+|.+++.....++...=+ .+...|-.+..++..
T Consensus 138 ~rf~~Ied~~~~kK~~--I~~L~~qisaLdkqi~ai~Kkid~yWgkda~gk~~tR~~~q~k~~ 198 (308)
T PF06717_consen 138 YRFNQIEDEYNRKKNK--IPGLNKQISALDKQIVAINKKIDRYWGKDANGKQLTRYEVQRKLL 198 (308)
T ss_pred HHHHHHHHHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCcccHHHHHHHHH
Confidence 3577888888777665 567889999999999999999988832 446777666553333
No 138
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=33.00 E-value=3.1e+02 Score=23.32 Aligned_cols=82 Identities=11% Similarity=0.223 Sum_probs=45.9
Q ss_pred HHHHHHHHHHHHccCC-----HHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMT-----EAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRM 78 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt-----~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi 78 (115)
+++...++..+++.++ ...|..+|..++.++..++..|...... .......+. .+.........+=+.|++.
T Consensus 40 l~q~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~i~~~~~ql~~s~~~-l~~~~~~I~--~~~~~l~~l~~q~r~qr~~ 116 (420)
T COG4942 40 LKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADD-LKKLRKQIA--DLNARLNALEVQEREQRRR 116 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH-HHHHHhhHH--HHHHHHHHHHHHHHHHHHH
Confidence 4556666666666553 2345555555555555555555443211 111222333 4555555555555899999
Q ss_pred HHHHHHHHHh
Q psy8735 79 CTEMLEAILE 88 (115)
Q Consensus 79 ~~di~d~I~E 88 (115)
...++++.-=
T Consensus 117 La~~L~A~~r 126 (420)
T COG4942 117 LAEQLAALQR 126 (420)
T ss_pred HHHHHHHHHh
Confidence 9999998843
No 139
>PF08898 DUF1843: Domain of unknown function (DUF1843); InterPro: IPR014994 This domain is found in functionally uncharacterised proteins. It can be found independently or at the C terminus of the protein.
Probab=32.92 E-value=95 Score=18.82 Aligned_cols=21 Identities=14% Similarity=0.302 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy8735 20 EAEAKQKKAELQAELEEKKSK 40 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~r 40 (115)
..++...+..|..||..++.|
T Consensus 33 ~~~i~~al~~Lk~EIaklE~R 53 (53)
T PF08898_consen 33 AGDIAAALEKLKAEIAKLEAR 53 (53)
T ss_pred chHHHHHHHHHHHHHHHHhcC
Confidence 467888888888888888754
No 140
>PF08663 HalX: HalX domain; InterPro: IPR013971 HalX is a protein of unknown function, previously mis-annotated as HoxA-like transcriptional regulator. It is C-terminal to a CheY-like superfamily domain and suggests a role as a response regulator.
Probab=32.86 E-value=1.3e+02 Score=19.00 Aligned_cols=42 Identities=26% Similarity=0.298 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHccCCHHHHHHH--HHHHHHHHHHHHHHHHHhh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQK--KAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~--i~~L~~e~~~l~~rL~~l~ 45 (115)
.=.|.+..+.|.+..+..++... -.+|+.++..+..+|....
T Consensus 13 ~~al~sK~A~Leaek~~~eL~~seeY~eL~~ri~~lr~~ld~~~ 56 (71)
T PF08663_consen 13 YFALASKRAVLEAEKSEAELEESEEYQELEDRIEELRAELDDTL 56 (71)
T ss_pred HHHHHHHHHHHHhcCCHHHHhccHHHHHHHHHHHHHHHHHHHHH
Confidence 34678889999999999988754 4666666666666666543
No 141
>KOG4674|consensus
Probab=32.45 E-value=1.2e+02 Score=30.41 Aligned_cols=48 Identities=19% Similarity=0.235 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHH
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKR 75 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kR 75 (115)
.+|...|..+..++.-++..|..|.+.-.. .. .--.-+......|++|
T Consensus 1246 qEl~~~i~kl~~el~plq~~l~el~~e~~~-----~~--ael~~l~~e~~~wK~R 1293 (1822)
T KOG4674|consen 1246 QELRDKIEKLNFELAPLQNELKELKAELQE-----KV--AELKKLEEENDRWKQR 1293 (1822)
T ss_pred HHHHHHHHHHHhhHhhHHHHHHHHHHHHHH-----HH--HHHHHHHHHHHHHHHH
Confidence 555555555555555555555555432111 11 2223455667778888
No 142
>PRK11637 AmiB activator; Provisional
Probab=32.41 E-value=2.8e+02 Score=22.75 Aligned_cols=27 Identities=15% Similarity=0.295 Sum_probs=10.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 60 TITDTKEKMVKEYNKRKRMCTEMLEAI 86 (115)
Q Consensus 60 ~v~~~~~k~~~~w~kRKRi~~di~d~I 86 (115)
.+...-......-..++..+...+...
T Consensus 107 ~l~~eI~~~q~~l~~~~~~l~~rlra~ 133 (428)
T PRK11637 107 ELNASIAKLEQQQAAQERLLAAQLDAA 133 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333444444444444444333333
No 143
>KOG4196|consensus
Probab=32.24 E-value=1.5e+02 Score=21.26 Aligned_cols=39 Identities=31% Similarity=0.430 Sum_probs=20.9
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
+|+++-..|... .+.|+.++..+..+...+..+.+.|.+
T Consensus 78 eLE~~k~~L~qq--v~~L~~e~s~~~~E~da~k~k~e~l~~ 116 (135)
T KOG4196|consen 78 ELEKEKAELQQQ--VEKLKEENSRLRRELDAYKSKYEALQN 116 (135)
T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 444444444443 244555555666666666666666643
No 144
>PF09958 DUF2192: Uncharacterized protein conserved in archaea (DUF2192); InterPro: IPR018693 This family of various hypothetical archaeal proteins has no known function.
Probab=32.02 E-value=1e+02 Score=24.12 Aligned_cols=42 Identities=31% Similarity=0.362 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHHHHHHHHhcCcCcHHHHHH-------HhccccccCCC
Q psy8735 69 VKEYNKRKRMCTEMLEAILENYPKSKKILLE-------EVGIETDEMVS 110 (115)
Q Consensus 69 ~~~w~kRKRi~~di~d~I~E~~p~~kk~L~E-------elGiEtDe~v~ 110 (115)
+..+|+|=..+.++|+.|+.+-.-+..+|.+ +-||+.=-|.+
T Consensus 3 k~l~r~RI~va~~l~~~il~~~~~~R~~lv~~L~~~Y~~~gIeP~RG~s 51 (231)
T PF09958_consen 3 KELYRKRIEVATDLWSRILRGEVLDREELVELLREVYEENGIEPFRGLS 51 (231)
T ss_pred hHHHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHcCCCcCCCCC
Confidence 4567899999999999999996666666654 44777544444
No 145
>TIGR01837 PHA_granule_1 poly(hydroxyalkanoate) granule-associated protein. This model describes a domain found in some proteins associated with polyhydroxyalkanoate (PHA) granules in a subset of species that have PHA inclusion granules. Included are two tandem proteins of Pseudomonas oleovorans, PhaI and PhaF, and their homologs in related species. PhaF proteins have a low-complexity C-terminal region with repeats similar to AAAKP.
Probab=31.78 E-value=78 Score=21.72 Aligned_cols=20 Identities=25% Similarity=0.391 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHHHhh
Q psy8735 26 KKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 26 ~i~~L~~e~~~l~~rL~~l~ 45 (115)
.|..|+..|..|+.+|..|.
T Consensus 97 ev~~L~~RI~~Le~~l~~l~ 116 (118)
T TIGR01837 97 EIEALSAKIEQLAVQVEELR 116 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHh
Confidence 45566777777777777664
No 146
>PRK05431 seryl-tRNA synthetase; Provisional
Probab=31.76 E-value=1.4e+02 Score=24.93 Aligned_cols=23 Identities=22% Similarity=0.372 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l 44 (115)
++..+..+|.+++..+++++..+
T Consensus 70 ~l~~~~~~l~~~~~~~~~~~~~~ 92 (425)
T PRK05431 70 ALIAEVKELKEEIKALEAELDEL 92 (425)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444455555555554444443
No 147
>PF12729 4HB_MCP_1: Four helix bundle sensory module for signal transduction; InterPro: IPR024478 This entry represents a four-helix bundle that operates as a ubiquitous sensory module in prokaryotic signal-transduction, which is known as four-helix bundles methyl-accepting chemotaxis protein (4HB_MCP) domain. The 4HB_MCP is always found between two predicted transmembrane helices indicating that it detects only extracellular signals. In many cases the domain is associated with a cytoplasmic HAMP domain suggesting that most proteins carrying the bundle might share the mechanism of transmembrane signalling which is well-characterised in E coli chemoreceptors [].
Probab=31.70 E-value=1.6e+02 Score=19.60 Aligned_cols=79 Identities=13% Similarity=0.236 Sum_probs=48.4
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEA 85 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~ 85 (115)
..+..+..+-..++.++......++......+...+..+.. ...+|+++. .|..+...|..=......+++.
T Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~------~~~~~~~~~~~y~~~~~~~~~~ 131 (181)
T PF12729_consen 60 RIRRALRRYLLATDPEERQEIEKEIDEARAEIDEALEEYEK--LILSPEEKQ------LLEEFKEAWKAYRKLRDQVIEL 131 (181)
T ss_pred HHHHHHHHhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--hcCCHHHHH------HHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555555567777777777777777777777777764 345666654 3344445555545566677776
Q ss_pred HHhcCcC
Q psy8735 86 ILENYPK 92 (115)
Q Consensus 86 I~E~~p~ 92 (115)
+..|-+.
T Consensus 132 ~~~~~~~ 138 (181)
T PF12729_consen 132 AKSGDND 138 (181)
T ss_pred HHCCCHH
Confidence 6555443
No 148
>PF12711 Kinesin-relat_1: Kinesin motor; InterPro: IPR024658 Kinesin [, , ] is a microtubule-associated force-producing protein that may play a role in organelle transport. The kinesin motor activity is directed toward the microtubule's plus end. Kinesin is an oligomeric complex composed of two heavy chains and two light chains. The maintenance of the quaternary structure does not require interchain disulphide bonds. The heavy chain is composed of three structural domains: a large globular N-terminal domain which is responsible for the motor activity of kinesin (it is known to hydrolyse ATP, to bind and move on microtubules), a central alpha-helical coiled coil domain that mediates the heavy chain dimerisation; and a small globular C-terminal domain which interacts with other proteins (such as the kinesin light chains), vesicles and membranous organelles. A number of proteins have been recently found that contain a domain similar to that of the kinesin 'motor' domain [, ]: Drosophila melanogaster claret segregational protein (ncd). Ncd is required for normal chromosomal segregation in meiosis, in females, and in early mitotic divisions of the embryo. The ncd motor activity is directed toward the microtubule's minus end. Homo sapiens CENP-E []. CENP-E is a protein that associates with kinetochores during chromosome congression, relocates to the spindle midzone at anaphase, and is quantitatively discarded at the end of the cell division. CENP-E is probably an important motor molecule in chromosome movement and/or spindle elongation. H. sapiens mitotic kinesin-like protein-1 (MKLP-1), a motor protein whose activity is directed toward the microtubule's plus end. Saccharomyces cerevisiae KAR3 protein, which is essential for nuclear fusion during mating. KAR3 may mediate microtubule sliding during nuclear fusion and possibly mitosis. S. cerevisiae CIN8 and KIP1 proteins which are required for the assembly of the mitotic spindle. Both proteins seem to interact with spindle microtubules to produce an outwardly directed force acting upon the poles. Emericella nidulans (Aspergillus nidulans) bimC, which plays an important role in nuclear division. A. nidulans klpA. Caenorhabditis elegans unc-104, which may be required for the transport of substances needed for neuronal cell differentiation. C. elegans osm-3. Xenopus laevis Eg5, which may be involved in mitosis. Arabidopsis thaliana KatA, KatB and katC. Chlamydomonas reinhardtii FLA10/KHP1 and KLP1. Both proteins seem to play a role in the rotation or twisting of the microtubules of the flagella. C. elegans hypothetical protein T09A5.2. Kinesin-like proteins KLP2 (or KIF15) also contain a kinesin 'motor' domain. They are involved in mitotic spindle assembly, playing a role in positioning spindle poles during mitosis, specifically at prometaphase []. This entry represents a domain of unknown function found in this type of kinesin-like proteins.
Probab=31.66 E-value=1.2e+02 Score=20.06 Aligned_cols=67 Identities=10% Similarity=0.218 Sum_probs=40.6
Q ss_pred chHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHH
Q psy8735 2 DDKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNK 74 (115)
Q Consensus 2 ~~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~k 74 (115)
|-|..|++-++. +.|+..-+......|..+|..|.++++. .+-||-=-+.-..+.....+++..+..
T Consensus 3 dkI~rLE~~~~g--~l~~~~~~~~e~~~L~eEI~~Lr~qve~----nPevtr~A~EN~rL~ee~rrl~~f~~~ 69 (86)
T PF12711_consen 3 DKIKRLEKLLDG--KLPSESYLEEENEALKEEIQLLREQVEH----NPEVTRFAMENIRLREELRRLQSFYVE 69 (86)
T ss_pred hHHHHHHHHhcC--CCCccchhHHHHHHHHHHHHHHHHHHHh----CHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 457777775543 4455555666668999999999998876 223332222211455666666666643
No 149
>cd01040 globin Globins are heme proteins, which bind and transport oxygen. This family summarizes a diverse set of homologous protein domains, including: (1) tetrameric vertebrate hemoglobins, which are the major protein component of erythrocytes and transport oxygen in the bloodstream, (2) microorganismal flavohemoglobins, which are linked to C-terminal FAD-dependend reductase domains, (3) homodimeric bacterial hemoglobins, such as from Vitreoscilla, (4) plant leghemoglobins (symbiotic hemoglobins, involved in nitrogen metabolism in plant rhizomes), (5) plant non-symbiotic hexacoordinate globins and hexacoordinate globins from bacteria and animals, such as neuroglobin, (6) invertebrate hemoglobins, which may occur in tandem-repeat arrangements, and (7) monomeric myoglobins found in animal muscle tissue.
Probab=31.62 E-value=84 Score=20.59 Aligned_cols=42 Identities=19% Similarity=0.347 Sum_probs=31.5
Q ss_pred ChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHH
Q psy8735 52 SEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLE 99 (115)
Q Consensus 52 s~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~E 99 (115)
|+.++. -+...|.... .....+..+++..+.+.+|.-+ .++-
T Consensus 1 s~~~~~--~l~~sw~~~~---~~~~~~g~~~f~~lf~~~P~~~-~~F~ 42 (140)
T cd01040 1 SAEEKK--LVKASWAKLK---ADREEIGLEFYERLFKAHPETR-ALFS 42 (140)
T ss_pred CHHHHH--HHHHHHHHHH---ccHHhHHHHHHHHHHHHChhHH-HHhH
Confidence 456667 7777777665 6678899999999999999744 3443
No 150
>KOG2391|consensus
Probab=31.23 E-value=1.5e+02 Score=24.63 Aligned_cols=47 Identities=26% Similarity=0.440 Sum_probs=32.4
Q ss_pred hHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC
Q psy8735 3 DKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVI 51 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~V 51 (115)
++..+.+++.+|+. |.+++..-+.+|+.+..+|+.++..|.......
T Consensus 226 eme~~~aeq~slkR--t~EeL~~G~~kL~~~~etLEqq~~~L~~niDIL 272 (365)
T KOG2391|consen 226 EMERLQAEQESLKR--TEEELNIGKQKLVAMKETLEQQLQSLQKNIDIL 272 (365)
T ss_pred HHHHHHHHHHHHHh--hHHHHHhhHHHHHHHHHHHHHHHHHHHhhhHHH
Confidence 34455566666653 577888888888888888888888886544333
No 151
>PF13412 HTH_24: Winged helix-turn-helix DNA-binding; PDB: 1I1G_B 2IA0_B 3I4P_A 2GQQ_A 2L4A_A 2CFX_B 2DBB_B 2EFO_A 2EFQ_A 2PN6_A ....
Probab=31.11 E-value=91 Score=17.19 Aligned_cols=26 Identities=31% Similarity=0.406 Sum_probs=19.7
Q ss_pred HHHHHHHHhcCcCcHHHHHHHhcccc
Q psy8735 80 TEMLEAILENYPKSKKILLEEVGIET 105 (115)
Q Consensus 80 ~di~d~I~E~~p~~kk~L~EelGiEt 105 (115)
..|+..|.++-+-+..++.+.+|+..
T Consensus 6 ~~Il~~l~~~~~~t~~ela~~~~is~ 31 (48)
T PF13412_consen 6 RKILNYLRENPRITQKELAEKLGISR 31 (48)
T ss_dssp HHHHHHHHHCTTS-HHHHHHHHTS-H
T ss_pred HHHHHHHHHcCCCCHHHHHHHhCCCH
Confidence 35677777888889999999999864
No 152
>PF02637 GatB_Yqey: GatB domain; InterPro: IPR018027 The GatB domain, the function of which is uncertain, is associated with aspartyl/glutamyl amidotransferase subunit B and glutamyl amidotransferase subunit E. These are involved in the formation of correctly charged Asn-tRNA(Asn) or Gln-tRNA(Gln) through the transamidation of misacylated Asp-tRNA(Asn) or Glu-tRNA(Gln) in organisms which lack either or both of asparaginyl-tRNA or glutaminyl-tRNA synthetases. The reaction takes place in the presence of glutamine and ATP through an activated phospho-Asp-tRNA(Asn) or phospho-Glu-tRNA(Gln). ; GO: 0016884 carbon-nitrogen ligase activity, with glutamine as amido-N-donor; PDB: 2D6F_D 3H0M_H 3H0R_K 3H0L_K 3KFU_F 3AL0_B 3IP4_B 2DF4_B 2G5I_B 2F2A_B ....
Probab=31.09 E-value=68 Score=22.36 Aligned_cols=50 Identities=28% Similarity=0.338 Sum_probs=32.8
Q ss_pred CCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhccc
Q psy8735 48 VNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVGIE 104 (115)
Q Consensus 48 ~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiE 104 (115)
...|+|+... .+-.-+..- +==++.+++++..+.++ ..++.++.++.|+-
T Consensus 37 ~~~i~~~~l~--~li~l~~~~----~Is~~~ak~ll~~~~~~-~~~~~~ii~~~~l~ 86 (148)
T PF02637_consen 37 DSPISPEHLA--ELINLLEDG----KISKKSAKELLRELLEN-GKSPEEIIEENGLW 86 (148)
T ss_dssp TSSSTHHHHH--HHHHHHHTT----SSGHHHHHHHHHHHHHH-TS-HHHHHHHTT--
T ss_pred hcCCCHHHHH--HHHHHHHcC----CCCHHHHHHHHHHHHHc-CCCHHHHHHHcCCC
Confidence 3578999888 776554221 11134567888888888 88899999998875
No 153
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=30.88 E-value=1.2e+02 Score=18.88 Aligned_cols=23 Identities=17% Similarity=0.332 Sum_probs=9.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l 44 (115)
++..++..+++++..++..-..|
T Consensus 28 ~~~~~~~~~~~~~~~l~~en~~L 50 (85)
T TIGR02209 28 QLNNELQKLQLEIDKLQKEWRDL 50 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444444444444444444
No 154
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=30.79 E-value=2.6e+02 Score=21.83 Aligned_cols=14 Identities=14% Similarity=0.041 Sum_probs=5.2
Q ss_pred hcCcCcHHHHHHHh
Q psy8735 88 ENYPKSKKILLEEV 101 (115)
Q Consensus 88 E~~p~~kk~L~Eel 101 (115)
+-+|..-=.++|.+
T Consensus 170 ~~l~~ell~~yeri 183 (239)
T COG1579 170 EKLDPELLSEYERI 183 (239)
T ss_pred HhcCHHHHHHHHHH
Confidence 33443333333433
No 155
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=30.58 E-value=2.8e+02 Score=24.99 Aligned_cols=25 Identities=28% Similarity=0.373 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
..++.++.+++.+++.|+.|+..|.
T Consensus 456 r~lk~eL~qlr~ene~Lq~Kl~~L~ 480 (697)
T PF09726_consen 456 RSLKSELSQLRQENEQLQNKLQNLV 480 (697)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4667778999999999999999994
No 156
>PF10239 DUF2465: Protein of unknown function (DUF2465); InterPro: IPR018797 FAM98A, B and C are glycine-rich proteins found from worms to humans whose function is unknown.
Probab=30.29 E-value=2.5e+02 Score=22.66 Aligned_cols=30 Identities=17% Similarity=0.418 Sum_probs=27.2
Q ss_pred CCCChhhhcHHhHHHHHHHHHHHHHHHHHHHH
Q psy8735 49 NVISEGEVDKKTITDTKEKMVKEYNKRKRMCT 80 (115)
Q Consensus 49 ~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~ 80 (115)
...|+++.. +++.-......+|..|++|..
T Consensus 183 ~~L~~~Qw~--~Le~i~~~L~~EY~~RR~mLl 212 (318)
T PF10239_consen 183 KSLTDEQWE--KLEKINQALSKEYECRRQMLL 212 (318)
T ss_pred CCCCHHHHH--HHHHHHHHHHHHHHHHHHHHH
Confidence 567899999 999999999999999999954
No 157
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=29.93 E-value=1.7e+02 Score=20.21 Aligned_cols=40 Identities=18% Similarity=0.380 Sum_probs=27.5
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
.++.+.|++|..+ .+..+..+.+|-++...|++.+..|..
T Consensus 12 ~el~n~La~Le~s--lE~~K~S~~eL~kqkd~L~~~l~~L~~ 51 (107)
T PF09304_consen 12 NELQNRLASLERS--LEDEKTSQGELAKQKDQLRNALQSLQA 51 (107)
T ss_dssp --HHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH--HHHHHhhHHHHHHhHHHHHHHHHHHHH
Confidence 3456677777665 466777777777777777777777764
No 158
>PF08657 DASH_Spc34: DASH complex subunit Spc34 ; InterPro: IPR013966 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=29.81 E-value=1.5e+02 Score=23.29 Aligned_cols=32 Identities=28% Similarity=0.299 Sum_probs=14.7
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEK 37 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l 37 (115)
|..|+..-..|.++ .+.+...|++.+.+...+
T Consensus 182 i~~Lr~~y~~l~~~--i~~lE~~VaeQ~~qL~~~ 213 (259)
T PF08657_consen 182 IAALRQRYNQLSNS--IAYLEAEVAEQEAQLERM 213 (259)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH
Confidence 34444444444443 345555555444444443
No 159
>PF03993 DUF349: Domain of Unknown Function (DUF349); InterPro: IPR007139 This motif is found singly or as up to five tandem repeats in a small set of bacterial proteins. There are two or three alpha-helices, and possibly a beta-strand.
Probab=29.76 E-value=1.3e+02 Score=18.11 Aligned_cols=21 Identities=10% Similarity=0.118 Sum_probs=9.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy8735 61 ITDTKEKMVKEYNKRKRMCTE 81 (115)
Q Consensus 61 v~~~~~k~~~~w~kRKRi~~d 81 (115)
+-..|..++.....|++-+..
T Consensus 3 Lw~~F~~a~~~~~~~~~~~~~ 23 (77)
T PF03993_consen 3 LWKRFRAACDAFFDRRKEFFE 23 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444443333
No 160
>cd00490 Met_repressor_MetJ Met Repressor, MetJ. MetJ is a bacterial regulatory protein that uses S-adenosylmethionine (SAM) as a corepressor to regulate the production of Methionine. MetJ binds arrays of two to five adjacent copies of an eight base-pair 'metbox' sequence. MetJ forms sufficiently strong interactions with the sugar-phosphate backbone to accomodate sequence variation in natural operators. However, it is very sensitive to particular base changes in the operator. MetJ exists as a homodimer.
Probab=29.69 E-value=29 Score=23.39 Aligned_cols=20 Identities=35% Similarity=0.704 Sum_probs=16.4
Q ss_pred hcCcCcHHHHHHHhcccccc
Q psy8735 88 ENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 88 E~~p~~kk~L~EelGiEtDe 107 (115)
+..|...+.+++++||+.|+
T Consensus 82 d~iP~~ak~~m~~~Gi~~~~ 101 (103)
T cd00490 82 DEIPEAAKELMREMGIEPET 101 (103)
T ss_pred ccccHHHHHHHHHhCCCccc
Confidence 56677788999999998764
No 161
>PRK00295 hypothetical protein; Provisional
Probab=29.59 E-value=1.2e+02 Score=18.86 Aligned_cols=25 Identities=16% Similarity=0.110 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
.++|...|.+...++..+..+|..|
T Consensus 21 ie~Ln~~v~~Qq~~I~~L~~ql~~L 45 (68)
T PRK00295 21 IQALNDVLVEQQRVIERLQLQMAAL 45 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555554444444444444
No 162
>PF05381 Peptidase_C21: Tymovirus endopeptidase; InterPro: IPR008043 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This entry is found in cysteine peptidases belong to the MEROPS peptidase family C21 (tymovirus endopeptidase family, clan CA). The type example is tymovirus endopeptidase (turnip yellow mosaic virus). The noncapsid protein expressed from ORF-206 of turnip yellow mosaic virus (TYMV) is autocatalytically processed by a papain-like protease, producing N-terminal 150kDa and C-terminal 70kDa proteins.; GO: 0003968 RNA-directed RNA polymerase activity, 0016032 viral reproduction
Probab=29.42 E-value=34 Score=23.52 Aligned_cols=27 Identities=22% Similarity=0.377 Sum_probs=21.1
Q ss_pred HHHHHHHhcCcCcH--HHHHHHhcccccc
Q psy8735 81 EMLEAILENYPKSK--KILLEEVGIETDE 107 (115)
Q Consensus 81 di~d~I~E~~p~~k--k~L~EelGiEtDe 107 (115)
++|..+.+.+|.+- ..-...+|+-||-
T Consensus 20 ~LW~~L~~~lPDSlL~n~ei~~~GLSTDh 48 (104)
T PF05381_consen 20 TLWATLCEILPDSLLDNPEIRTLGLSTDH 48 (104)
T ss_pred HHHHHHHHhCchhhcCchhhhhcCCcHHH
Confidence 68899999999754 4557888999884
No 163
>PF13680 DUF4152: Protein of unknown function (DUF4152)
Probab=29.10 E-value=35 Score=25.95 Aligned_cols=27 Identities=26% Similarity=0.320 Sum_probs=20.2
Q ss_pred HHHHHHHHhcCcCcHHHHHHHhccccc
Q psy8735 80 TEMLEAILENYPKSKKILLEEVGIETD 106 (115)
Q Consensus 80 ~di~d~I~E~~p~~kk~L~EelGiEtD 106 (115)
+++|.-++-.+..=-|.|||+-|||.=
T Consensus 106 KevW~eLskdlQPLAkkfWEetgiEIl 132 (227)
T PF13680_consen 106 KEVWKELSKDLQPLAKKFWEETGIEIL 132 (227)
T ss_pred HHHHHHHHHHhHHHHHHHHHhhCcEEE
Confidence 356766666666656899999999964
No 164
>TIGR00255 conserved hypothetical protein TIGR00255. The apparent ortholog from Aquifex aeolicus as reported is split into two consecutive reading frames.
Probab=29.10 E-value=1.5e+02 Score=23.66 Aligned_cols=41 Identities=12% Similarity=0.314 Sum_probs=36.6
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
++++--|..++.|.-...++...+-+++.++.++.++++.+
T Consensus 250 ~QEmnRE~NTigSKs~d~~is~~vVe~K~eiEkiREQVQNI 290 (291)
T TIGR00255 250 MQELNRESNTLASKAIDADITNLAVEMKVLIEKIKEQIQNI 290 (291)
T ss_pred HHHHhHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 46777899999999999999999999999999999998753
No 165
>PF06657 Cep57_MT_bd: Centrosome microtubule-binding domain of Cep57; InterPro: IPR010597 This entry is thought to represent a centrosomal protein of 57 kDa (Cep57-related protein). It is required for spindle microtubule attachment to both kinetochores and centrosomes and functions to tether minus-ends of spindle microtubules to centrosomes. It may act by forming ring-like structures around microtubules, or by serving as a cross-linker or scaffold at the attachment site [].
Probab=29.04 E-value=1.6e+02 Score=18.84 Aligned_cols=29 Identities=21% Similarity=0.179 Sum_probs=18.3
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 17 QMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+++-..|..-+..|+.+...|.-.+..|.
T Consensus 9 ~~p~~~Ls~vl~~LqDE~~hm~~e~~~L~ 37 (79)
T PF06657_consen 9 QSPGEALSEVLKALQDEFGHMKMEHQELQ 37 (79)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445555666677777777776666664
No 166
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=28.87 E-value=1.2e+02 Score=18.73 Aligned_cols=25 Identities=20% Similarity=0.232 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
.++|...+.....++..++.+|..|
T Consensus 20 ie~Ln~~v~~Qq~~I~~L~~~l~~L 44 (69)
T PF04102_consen 20 IEELNDVVTEQQRQIDRLQRQLRLL 44 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444444
No 167
>PTZ00007 (NAP-L) nucleosome assembly protein -L; Provisional
Probab=28.79 E-value=3.1e+02 Score=22.48 Aligned_cols=36 Identities=19% Similarity=0.248 Sum_probs=28.4
Q ss_pred hhcHHhHHHHHH-HHHHHHHHHHHHHH-------------HHHHHHHhcCcC
Q psy8735 55 EVDKKTITDTKE-KMVKEYNKRKRMCT-------------EMLEAILENYPK 92 (115)
Q Consensus 55 e~~~~~v~~~~~-k~~~~w~kRKRi~~-------------di~d~I~E~~p~ 92 (115)
++. .++..|. .+.-.|.+|+.|.. ++|-.++-+.|.
T Consensus 63 ev~--~LE~kY~~~~~Ply~kR~eII~G~~~~e~~~~gIP~FWl~vL~Nh~~ 112 (337)
T PTZ00007 63 ELR--KLRSKYEDLYNPIYDKRKEALVQNGGAEIGTPGLPQFWLTAMKNNNT 112 (337)
T ss_pred HHH--HHHHHHHHHhhHHHHHHHHHHcCCcccccccCCcccHHHHHHHcCcc
Confidence 456 8888995 78999999999976 478777777663
No 168
>PRK11820 hypothetical protein; Provisional
Probab=28.69 E-value=1.5e+02 Score=23.56 Aligned_cols=41 Identities=15% Similarity=0.323 Sum_probs=36.6
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
.+++--|..++.|.-...++...+-+++.++.++.++++.+
T Consensus 247 ~QEm~RE~NTigSKs~~~~is~~vVe~K~elEkiREQVQNI 287 (288)
T PRK11820 247 MQELNREANTLGSKSNDAEITNLVVELKVLIEQMREQVQNI 287 (288)
T ss_pred HHHHhHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 46777899999999999999999999999999999998753
No 169
>PF11616 EZH2_WD-Binding: WD repeat binding protein EZH2; InterPro: IPR021654 This family of proteins represents Enhancer of zest homologue 2, (EZH2) a 30 residue peptide which binds to a WD-repeat domain of EED by residues 39-68. EED is a component of PRC2 complex which is involved in gene expression []. This interaction is required for the HMTase activity of PCR2 []. ; GO: 0018024 histone-lysine N-methyltransferase activity; PDB: 2QXV_B.
Probab=28.68 E-value=87 Score=16.81 Aligned_cols=17 Identities=18% Similarity=0.419 Sum_probs=12.2
Q ss_pred hHHHHHHHHHHHHHHHH
Q psy8735 60 TITDTKEKMVKEYNKRK 76 (115)
Q Consensus 60 ~v~~~~~k~~~~w~kRK 76 (115)
++...-.-+-.+|++|+
T Consensus 10 Ki~e~t~iLN~eWk~lR 26 (30)
T PF11616_consen 10 KIQERTDILNEEWKKLR 26 (30)
T ss_dssp HHHHHHHHHHHHHHH--
T ss_pred HHHHHHHHHHHHHHHhc
Confidence 77777778888898875
No 170
>PF09602 PhaP_Bmeg: Polyhydroxyalkanoic acid inclusion protein (PhaP_Bmeg); InterPro: IPR011728 This entry describes a protein found in polyhydroxyalkanoic acid (PHA) gene regions and incorporated into PHA inclusions in Bacillus cereus and Bacillus megaterium. The role of the protein may include amino acid storage [].
Probab=28.57 E-value=2.5e+02 Score=20.85 Aligned_cols=72 Identities=19% Similarity=0.261 Sum_probs=36.7
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcC
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPK 92 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~ 92 (115)
+-+.+...|.+++.+..++..++..|--...-.+.+=.. .+...|+...+.+..-.+.-++-|-..+++|-.
T Consensus 79 ~~~~l~d~inE~t~k~~El~~~i~el~~~~~Ks~~~~l~--q~~~~~eEtv~~~ieqqk~~r~e~qk~~~~yv~ 150 (165)
T PF09602_consen 79 TGNSLNDSINEWTDKLNELSAKIQELLLSPSKSSFSLLS--QISKQYEETVKQLIEQQKLTREEWQKVLDAYVE 150 (165)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcchHHHHHHHHH--HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455556666666666666666665553211112223334 455555555555555555555555555555543
No 171
>PF03670 UPF0184: Uncharacterised protein family (UPF0184); InterPro: IPR022788 This family of proteins has no known function.
Probab=28.41 E-value=1.6e+02 Score=19.40 Aligned_cols=35 Identities=9% Similarity=0.181 Sum_probs=20.5
Q ss_pred HHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 9 SKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 9 seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.|.+.++++| .+|-..+..|++.+..+..+|..|=
T Consensus 26 ~E~~~ins~L--D~Lns~LD~LE~rnD~l~~~L~~LL 60 (83)
T PF03670_consen 26 EEYAAINSML--DQLNSCLDHLEQRNDHLHAQLQELL 60 (83)
T ss_pred HHHHHHHHHH--HHHHHHHHHHHHhhhHHHHHHHHHH
Confidence 3444455442 4455555667777777777776663
No 172
>PF03357 Snf7: Snf7; InterPro: IPR005024 This is a family of eukaryotic proteins which are variously described as either hypothetical protein, developmental protein or related to yeast SNF7. The family contains human CHMP1. CHMP1 (CHromatin Modifying Protein; CHarged Multivesicular body Protein), is encoded by an alternative open reading frame in the PRSM1 gene [] and is conserved in both complex and simple eukaryotes. CHMP1 contains a predicted bipartite nuclear localisation signal and distributes as distinct forms to the cytoplasm and the nuclear matrix in all cell lines tested. Human CHMP1 is strongly implicated in multivesicular body formation. A multivesicular body is a vesicle-filled endosome that targets proteins to the interior of lysosomes. Immunocytochemistry and biochemical fractionation localise CHMP1 to early endosomes and CHMP1 physically interacts with SKD1/VPS4, a highly conserved protein directly linked to multivesicular body sorting in yeast. Similar to the action of a mutant SKD1 protein, over expression of a fusion derivative of human CHMP1 dilates endosomal compartments and disrupts the normal distribution of several endosomal markers. Genetic studies in Saccharomyces cerevisiae (Baker's yeast) further support a conserved role of CHMP1 in vesicle trafficking. Deletion of CHM1, the budding yeast homologue of CHMP1, results in defective sorting of carboxypeptidases S and Y and produces abnormal, multi-lamellar prevacuolar compartments. This phenotype classifies CHM1 as a member of the class E vacuolar protein sorting genes []. ; GO: 0015031 protein transport; PDB: 2V6X_B 2W2U_D 2GD5_D 3FRT_B 3FRV_A 4ABM_D 3EAB_H 3HTU_D.
Probab=28.03 E-value=2.1e+02 Score=19.77 Aligned_cols=23 Identities=13% Similarity=0.254 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l 44 (115)
.|...+..|+.++..+..++..+
T Consensus 12 ~L~~~~~~le~~i~~~~~~~k~~ 34 (171)
T PF03357_consen 12 RLEKQIKRLEKKIKKLEKKAKKA 34 (171)
T ss_dssp HHHHHHHHHHHHHHHCHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444444
No 173
>PF03082 MAGSP: Male accessory gland secretory protein; InterPro: IPR004315 The accessory gland of male insects is a genital tissue that secretes many components of the ejaculatory fluid, some of which affect the female's receptivity to courtship and her rate of oviposition. The protein is expressed exclusively in the male accessory glands of adult Drosophila melanogaster. During copulation it is transferred to the female genital tract where it is rapidly altered [].; GO: 0007618 mating, 0005576 extracellular region
Probab=27.79 E-value=2.1e+02 Score=22.43 Aligned_cols=85 Identities=13% Similarity=0.222 Sum_probs=46.0
Q ss_pred CchHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC--------Ch---hhhcHHhHHHHH-HHH
Q psy8735 1 MDDKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVI--------SE---GEVDKKTITDTK-EKM 68 (115)
Q Consensus 1 ~~~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~V--------s~---ee~~~~~v~~~~-~k~ 68 (115)
|.||..-.+++ |+..-++=+|.-+.......+-+|.-+|+..|.+.++. +| .+....+|.+.| ++|
T Consensus 151 M~Eie~rK~eI--l~~Rq~NldLE~eLndanRkilElNlqlqdaRksvkpCkKr~~kd~a~p~~nQl~E~~i~k~YRn~y 228 (264)
T PF03082_consen 151 MKEIEARKTEI--LKARQSNLDLELELNDANRKILELNLQLQDARKSVKPCKKRNSKDLAPPATNQLQEAKIKKTYRNKY 228 (264)
T ss_pred HHHHHHHHHHH--HHHHhhCCceeeehhHHHHHHHHHHHHHHHhhcccCCCcCCCccccCCchhhHHHHHHHHHHHHHHH
Confidence 44555555554 44444444555556666667777777777776443322 11 122222444444 233
Q ss_pred HHHHH-HHHHHHHHHHHHHH
Q psy8735 69 VKEYN-KRKRMCTEMLEAIL 87 (115)
Q Consensus 69 ~~~w~-kRKRi~~di~d~I~ 87 (115)
.+..+ -+.||+.+||.-.+
T Consensus 229 ~~L~~~L~q~in~ei~kvat 248 (264)
T PF03082_consen 229 LTLLKELIQKINSEIAKVAT 248 (264)
T ss_pred HHHHHHHHHHHHHHHHHhhc
Confidence 33332 37889999997643
No 174
>PRK04863 mukB cell division protein MukB; Provisional
Probab=27.69 E-value=6.1e+02 Score=25.09 Aligned_cols=104 Identities=12% Similarity=0.175 Sum_probs=66.1
Q ss_pred hHHHHHHHHHHHHccC-CHHHHHHHHHHHHHHHHHHHHHHHHhhh------------------cCCCCCh---hhhcHHh
Q psy8735 3 DKEKVKSKLKQLSQQM-TEAEAKQKKAELQAELEEKKSKLEKLSR------------------NVNVISE---GEVDKKT 60 (115)
Q Consensus 3 ~~~~l~seL~~L~s~l-t~~el~~~i~~L~~e~~~l~~rL~~l~~------------------~~~~Vs~---ee~~~~~ 60 (115)
++..++..|..|++-| +-+.++.+..+.++....+..++-.|.. .+...++ ..+. .
T Consensus 919 ~L~qLE~~l~~L~~Dp~~~e~lr~e~~~~~~~~~~~~~~~~~l~~~~~~~~~~~y~~~~~~l~~~~~~~~~Le~~Le--~ 996 (1486)
T PRK04863 919 ALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTEVVQRRAHFSYEDAAEMLAKNSDLNEKLRQRLE--Q 996 (1486)
T ss_pred HHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHHHhHhhcchhhhHHHHHHHH--H
Confidence 5788999999999998 4556666677777777777766655521 1112222 3456 7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH---HHhcCcC------cHHHHHHHhccccccC
Q psy8735 61 ITDTKEKMVKEYNKRKRMCTEMLEA---ILENYPK------SKKILLEEVGIETDEM 108 (115)
Q Consensus 61 v~~~~~k~~~~w~kRKRi~~di~d~---I~E~~p~------~kk~L~EelGiEtDe~ 108 (115)
++..|......|+.-...+...-.. +...+.. ..+.=+.+||+-.|+|
T Consensus 997 iE~~~~~areql~qaq~q~~q~~q~l~slksslq~~~e~L~E~eqe~~~~g~~~~~~ 1053 (1486)
T PRK04863 997 AEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSG 1053 (1486)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCcc
Confidence 8888888888888877776654333 2222221 1223356789988874
No 175
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=27.60 E-value=2.3e+02 Score=20.08 Aligned_cols=18 Identities=39% Similarity=0.529 Sum_probs=6.9
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy8735 24 KQKKAELQAELEEKKSKL 41 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL 41 (115)
...|..|+.+......+|
T Consensus 79 ~rriq~LEeele~ae~~L 96 (143)
T PF12718_consen 79 NRRIQLLEEELEEAEKKL 96 (143)
T ss_pred HhhHHHHHHHHHHHHHHH
Confidence 333333333333333333
No 176
>PLN02678 seryl-tRNA synthetase
Probab=27.51 E-value=1.7e+02 Score=24.76 Aligned_cols=21 Identities=29% Similarity=0.474 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy8735 22 EAKQKKAELQAELEEKKSKLE 42 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~ 42 (115)
++.+++.+|.+++..++..+.
T Consensus 75 ~l~~~~~~Lk~ei~~le~~~~ 95 (448)
T PLN02678 75 ELIAETKELKKEITEKEAEVQ 95 (448)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333344444444444444443
No 177
>PF09032 Siah-Interact_N: Siah interacting protein, N terminal ; InterPro: IPR015120 The N-terminal domain of Siah interacting protein (SIP) adopts a helical hairpin structure with a hydrophobic core stabilised by a classic knobs-and-holes arrangement of side chains contributed by the two amphipathic helices. Little is known about this domain's function, except that it is crucial for interactions with Siah. It has also been hypothesised that SIP can dimerise through this N-terminal domain []. ; PDB: 1YSM_A 2A26_C 2A25_B 1X5M_A.
Probab=27.51 E-value=40 Score=21.96 Aligned_cols=36 Identities=19% Similarity=0.314 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHccCCHH-------HHHHHHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEA-------EAKQKKAELQAELEEKKS 39 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~-------el~~~i~~L~~e~~~l~~ 39 (115)
|.+|+.-|..|++-++.. =|..+|..++.++..+..
T Consensus 5 i~eL~~Dl~El~~Ll~~a~R~rVk~~L~~ei~klE~eI~~~~~ 47 (79)
T PF09032_consen 5 IEELQLDLEELKSLLEQAKRKRVKDLLTNEIRKLETEIKKLKE 47 (79)
T ss_dssp HHHHHHHHHHHHHHHHHTTTCCHHHHHHHHHHHHHHHHHHCHH
T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455555444444433332 334445555555555443
No 178
>PRK00736 hypothetical protein; Provisional
Probab=27.27 E-value=1.4e+02 Score=18.56 Aligned_cols=39 Identities=21% Similarity=0.250 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
|..|+..++=.. -|.++|-..|.+...++..|..+|..|
T Consensus 7 i~~LE~klafqe--~tie~Ln~~v~~Qq~~i~~L~~ql~~L 45 (68)
T PRK00736 7 LTELEIRVAEQE--KTIEELSDQLAEQWKTVEQMRKKLDAL 45 (68)
T ss_pred HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555443332 344555555555555555555555444
No 179
>PF11853 DUF3373: Protein of unknown function (DUF3373); InterPro: IPR021803 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Probab=27.21 E-value=70 Score=27.59 Aligned_cols=22 Identities=36% Similarity=0.422 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy8735 23 AKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l 44 (115)
+.++|++|++++++|++++..+
T Consensus 29 ~~qkie~L~kql~~Lk~q~~~l 50 (489)
T PF11853_consen 29 LLQKIEALKKQLEELKAQQDDL 50 (489)
T ss_pred HHHHHHHHHHHHHHHHHhhccc
Confidence 3346666666666666666644
No 180
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=27.20 E-value=2.3e+02 Score=20.04 Aligned_cols=14 Identities=43% Similarity=0.667 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHHcc
Q psy8735 4 KEKVKSKLKQLSQQ 17 (115)
Q Consensus 4 ~~~l~seL~~L~s~ 17 (115)
...+++.++.|...
T Consensus 16 ~e~~e~~~K~le~~ 29 (143)
T PF12718_consen 16 AEELEAKVKQLEQE 29 (143)
T ss_pred HHHHHHHHHHHHHH
Confidence 34555555555554
No 181
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=27.16 E-value=1.9e+02 Score=25.88 Aligned_cols=26 Identities=35% Similarity=0.546 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+.+|...+.+++.++..|+++|+.++
T Consensus 438 ~~~L~~~~ee~k~eie~L~~~l~~~~ 463 (652)
T COG2433 438 NSELKRELEELKREIEKLESELERFR 463 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45667777777778888888887775
No 182
>cd07596 BAR_SNX The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=26.99 E-value=2.3e+02 Score=20.06 Aligned_cols=40 Identities=33% Similarity=0.341 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
....++.+..|...+++. ...|..++.++..++..+..++
T Consensus 126 l~~k~~~~~kl~~~~~~~--~~ki~~l~~~i~~~e~~~~~~~ 165 (218)
T cd07596 126 LASKKAQLEKLKAAPGIK--PAKVEELEEELEEAESALEEAR 165 (218)
T ss_pred HHHHHHHHHHHhhcCCCc--hhHHHHHHHHHHHHHHHHHHHH
Confidence 344555666666655422 1345555555555555555544
No 183
>PF14915 CCDC144C: CCDC144C protein coiled-coil region
Probab=26.56 E-value=2.3e+02 Score=23.03 Aligned_cols=66 Identities=18% Similarity=0.273 Sum_probs=36.4
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHH
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRK 76 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRK 76 (115)
.|.+|...|+|.+-.+ +..-..|++++.-+..||...-..-. -|..-+. .++-.|.+.+.+|..-+
T Consensus 67 ~L~aENt~L~SkLe~E--Kq~kerLEtEiES~rsRLaaAi~d~d-qsq~skr--dlelafqr~rdEw~~lq 132 (305)
T PF14915_consen 67 VLKAENTMLNSKLEKE--KQNKERLETEIESYRSRLAAAIQDHD-QSQTSKR--DLELAFQRARDEWVRLQ 132 (305)
T ss_pred HHHHHHHHHhHHHHHh--HHHHHHHHHHHHHHHHHHHHHHhhHH-HHHhhHH--HHHHHHHHHhhHHHHHH
Confidence 4455555555554332 34445667777777777766432110 0111133 67788888888886544
No 184
>PF09602 PhaP_Bmeg: Polyhydroxyalkanoic acid inclusion protein (PhaP_Bmeg); InterPro: IPR011728 This entry describes a protein found in polyhydroxyalkanoic acid (PHA) gene regions and incorporated into PHA inclusions in Bacillus cereus and Bacillus megaterium. The role of the protein may include amino acid storage [].
Probab=26.54 E-value=2.7e+02 Score=20.64 Aligned_cols=76 Identities=20% Similarity=0.354 Sum_probs=41.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhc-CCCCChhhhcHHhHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHhcC
Q psy8735 23 AKQKKAELQAELEEKKSKLEKLSRN-VNVISEGEVDKKTITDTKEKMVKEYNK-----------RKRMCTEMLEAILENY 90 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l~~~-~~~Vs~ee~~~~~v~~~~~k~~~~w~k-----------RKRi~~di~d~I~E~~ 90 (115)
+.+.+++|+.........+..+-.. ...++..... .+...++.|...|+. +=+.+.+++.++.+++
T Consensus 46 itk~veeLe~~~~q~~~~~s~~~~~~vk~L~k~~~~--~l~d~inE~t~k~~El~~~i~el~~~~~Ks~~~~l~q~~~~~ 123 (165)
T PF09602_consen 46 ITKQVEELEKELKQFKREFSDLYEEYVKQLRKATGN--SLNDSINEWTDKLNELSAKIQELLLSPSKSSFSLLSQISKQY 123 (165)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHcchHHHHHHHHHHHHhhH
Confidence 5555566666665555555554322 2333334455 666666666666653 3344556666666666
Q ss_pred cCcHHHHHHH
Q psy8735 91 PKSKKILLEE 100 (115)
Q Consensus 91 p~~kk~L~Ee 100 (115)
...-+.|++.
T Consensus 124 eEtv~~~ieq 133 (165)
T PF09602_consen 124 EETVKQLIEQ 133 (165)
T ss_pred HHHHHHHHHH
Confidence 6555555543
No 185
>PF07262 DUF1436: Protein of unknown function (DUF1436); InterPro: IPR009888 This family consists of several hypothetical bacterial proteins of around 160 residues in length. The function of this family is unknown.; PDB: 2GKP_A.
Probab=26.49 E-value=37 Score=24.84 Aligned_cols=9 Identities=44% Similarity=0.431 Sum_probs=4.4
Q ss_pred cCCCCChhh
Q psy8735 47 NVNVISEGE 55 (115)
Q Consensus 47 ~~~~Vs~ee 55 (115)
.++.|++++
T Consensus 55 ~Sr~i~~ee 63 (158)
T PF07262_consen 55 NSRTISPEE 63 (158)
T ss_dssp T--B-TSHH
T ss_pred hCCcCChHH
Confidence 457888876
No 186
>PF14989 CCDC32: Coiled-coil domain containing 32
Probab=26.46 E-value=1.4e+02 Score=21.72 Aligned_cols=25 Identities=16% Similarity=0.206 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHhhhcCCCCChhhhc
Q psy8735 33 ELEEKKSKLEKLSRNVNVISEGEVD 57 (115)
Q Consensus 33 e~~~l~~rL~~l~~~~~~Vs~ee~~ 57 (115)
=++.|+.||..++.++..||+.++=
T Consensus 57 YLasLE~KL~rik~~~~~vtsKemL 81 (148)
T PF14989_consen 57 YLASLERKLKRIKGKNREVTSKEML 81 (148)
T ss_pred HHHHHHHHHHHHhCCCccCCHHHHH
Confidence 3567888999999888888887766
No 187
>PF00846 Hanta_nucleocap: Hantavirus nucleocapsid protein; InterPro: IPR002214 Hantaviruses are ssRNA negative-strand viruses. The nucleocapsid protein is an internal protein of the virus particle [, ].; GO: 0019013 viral nucleocapsid; PDB: 2IC9_A 2IC6_A 2K48_A 4FI5_A.
Probab=26.45 E-value=2.9e+02 Score=23.45 Aligned_cols=25 Identities=12% Similarity=0.288 Sum_probs=3.6
Q ss_pred HHHHHhcCcCcHHHHHHHhccccccCC
Q psy8735 83 LEAILENYPKSKKILLEEVGIETDEMV 109 (115)
Q Consensus 83 ~d~I~E~~p~~kk~L~EelGiEtDe~v 109 (115)
-+.+..+-. .+.-.+=-|+|.|+|.
T Consensus 66 Ad~v~~~k~--~~~~~dptG~epdDhl 90 (428)
T PF00846_consen 66 ADRVAAGKQ--SAKPVDPTGVEPDDHL 90 (428)
T ss_dssp HHHHHHHCH------------------
T ss_pred HHHHhcccc--ccCCCCCCCCCCchhh
Confidence 334444422 2334566789888754
No 188
>PRK09973 putative outer membrane lipoprotein; Provisional
Probab=26.43 E-value=1.6e+02 Score=19.45 Aligned_cols=36 Identities=11% Similarity=0.158 Sum_probs=22.6
Q ss_pred HHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 11 LKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 11 L~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
|..-.+++..++|..++..|..++.++......++.
T Consensus 17 LaGCAs~~kvdqLss~V~~L~~kvdql~~dv~~a~a 52 (85)
T PRK09973 17 LSGCVNEQKVNQLASNVQTLNAKIARLEQDMKALRP 52 (85)
T ss_pred HHHcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455556666666666666666666666666666554
No 189
>KOG4074|consensus
Probab=26.38 E-value=3.7e+02 Score=22.20 Aligned_cols=75 Identities=13% Similarity=0.212 Sum_probs=51.7
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEML 83 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~ 83 (115)
++.+||+.|-=.---++|+.+++.|..+.-+|--++..+-+ +-.+-.|+.+ .+--.-.-|+.-+-.-.=||.++-
T Consensus 158 ~vNsELK~LlVASvgddLQ~~ve~LtedK~qLa~~~~~~~~-nl~~~~Eq~e--rl~iqcdVWrsKFLAssVm~dELt 232 (383)
T KOG4074|consen 158 KVNSELKRLLVASVGDDLQGQVEALTEDKVQLAHRVDEYMG-NLMVEDEQSE--RLRIQCDVWRSKFLASSVMCDELT 232 (383)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhh-hhhhhHHHHH--HHhhHHHHHHHHHHHHHHHHHHHH
Confidence 45677777776666788888888888888888888877653 3344556666 665555666666666666666654
No 190
>PF00570 HRDC: HRDC domain Bloom syndrome. Werner syndrome.; InterPro: IPR002121 The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain associated with the human BLM gene result in Bloom Syndrome (BS), an autosomal recessive disorder characterised by proportionate pre- and postnatal growth deficiency; sun-sensitive, telangiectatic, hypo- and hyperpigmented skin; predisposition to malignancy; and chromosomal instability [].; GO: 0003676 nucleic acid binding, 0005622 intracellular; PDB: 3SAG_B 3SAH_B 2CPR_A 3SAF_B 3CYM_A 1WUD_A 2HBK_A 2HBJ_A 2HBM_A 2HBL_A ....
Probab=26.07 E-value=1.5e+02 Score=17.42 Aligned_cols=24 Identities=33% Similarity=0.393 Sum_probs=17.9
Q ss_pred HHHHHHHhcCcCcHHHHHHHhccc
Q psy8735 81 EMLEAILENYPKSKKILLEEVGIE 104 (115)
Q Consensus 81 di~d~I~E~~p~~kk~L~EelGiE 104 (115)
+.+..|+...|.+..+|..--|+-
T Consensus 30 ~~L~~ia~~~P~s~~~L~~i~g~~ 53 (68)
T PF00570_consen 30 EALLEIAKRLPTSIEELLQIPGMG 53 (68)
T ss_dssp HHHHHHHHH--SSHHHHHTSTTCG
T ss_pred HHHHHHHHhCCCCHHHHHHccCCC
Confidence 678889999999999998766654
No 191
>COG5185 HEC1 Protein involved in chromosome segregation, interacts with SMC proteins [Cell division and chromosome partitioning]
Probab=26.04 E-value=4.6e+02 Score=23.10 Aligned_cols=105 Identities=19% Similarity=0.267 Sum_probs=59.8
Q ss_pred chHHHHHHHHHHHHccCCHH----HH-----HHHHHHHHHHHHHHHHHHHHhh--hcCCCCChhhhcHHhHHHH-HHHHH
Q psy8735 2 DDKEKVKSKLKQLSQQMTEA----EA-----KQKKAELQAELEEKKSKLEKLS--RNVNVISEGEVDKKTITDT-KEKMV 69 (115)
Q Consensus 2 ~~~~~l~seL~~L~s~lt~~----el-----~~~i~~L~~e~~~l~~rL~~l~--~~~~~Vs~ee~~~~~v~~~-~~k~~ 69 (115)
++|.+|..+|..+.+.+|.+ ++ .........++..|+..|..|. +.+.....+++-. .+.-. =.-.+
T Consensus 494 ~~i~El~~~l~~~e~~L~~a~s~~~~~ke~~e~e~~a~~~E~eklE~el~~lnL~s~ts~l~~eq~vq-s~~i~ld~~~~ 572 (622)
T COG5185 494 HDINELTQILEKLELELSEANSKFELSKEENERELVAQRIEIEKLEKELNDLNLLSKTSILDAEQLVQ-STEIKLDELKV 572 (622)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhhhccchHhhHHHHHH-HHHhhHHHHHH
Confidence 45677777777666655432 23 3334455668889999998885 3444444444220 22111 13455
Q ss_pred HHHHHHHHHHHHHHHHHHhcCc------CcHHHHHHHhcccccc
Q psy8735 70 KEYNKRKRMCTEMLEAILENYP------KSKKILLEEVGIETDE 107 (115)
Q Consensus 70 ~~w~kRKRi~~di~d~I~E~~p------~~kk~L~EelGiEtDe 107 (115)
..-++|-.|.+.|+.-+-+.+. .+-.+|--++|=+.|+
T Consensus 573 ~~n~~r~~i~k~V~~v~~~~~~fk~~IQssledl~~~l~k~~~~ 616 (622)
T COG5185 573 DLNRKRYKIHKQVIHVIDITSKFKINIQSSLEDLENELGKVIEE 616 (622)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhHHhhHHHHHHHHHHHHHH
Confidence 6678888898888876644333 2334555555544443
No 192
>PF02697 DUF217: Uncharacterized ACR, COG1753; InterPro: IPR003847 This entry is represented by Natrialba phage PhiCh1, Orf96. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=26.03 E-value=97 Score=19.60 Aligned_cols=36 Identities=22% Similarity=0.447 Sum_probs=27.9
Q ss_pred HHHHHHHHHHHHHHHHhcCcCcHHHHHHHhccccccC
Q psy8735 72 YNKRKRMCTEMLEAILENYPKSKKILLEEVGIETDEM 108 (115)
Q Consensus 72 w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiEtDe~ 108 (115)
|+.=..=|.|+|..+++ .+++...|.+-+|+=.|+.
T Consensus 17 ~K~~~eSFSdvI~rli~-~~~~~~~l~~~~g~l~dee 52 (71)
T PF02697_consen 17 LKREDESFSDVIERLIE-KEKKRRDLMDYFGILSDEE 52 (71)
T ss_pred HhcCCCCHHHHHHHHHh-cccchhHHHHHhccCChhh
Confidence 44445678999999888 7777889999999877763
No 193
>COG5570 Uncharacterized small protein [Function unknown]
Probab=25.97 E-value=1.6e+02 Score=17.91 Aligned_cols=38 Identities=16% Similarity=0.227 Sum_probs=26.2
Q ss_pred HHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 7 VKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 7 l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
|+-|+..-.++|+.+++ .|.+|+...-.+.+.++.|++
T Consensus 17 le~ei~ea~n~Ps~dd~--~i~eLKRrKL~lKeeIEkLka 54 (57)
T COG5570 17 LEREIQEAMNSPSSDDL--AIRELKRRKLRLKEEIEKLKA 54 (57)
T ss_pred HHHHHHHHhcCCCcchH--HHHHHHHHHHHHHHHHHHHhc
Confidence 55666666777777754 466777777777777777764
No 194
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=25.86 E-value=1.6e+02 Score=17.92 Aligned_cols=18 Identities=22% Similarity=0.460 Sum_probs=7.1
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy8735 27 KAELQAELEEKKSKLEKL 44 (115)
Q Consensus 27 i~~L~~e~~~l~~rL~~l 44 (115)
|.+|..++..|..++..|
T Consensus 5 id~Ls~dVq~L~~kvdqL 22 (56)
T PF04728_consen 5 IDQLSSDVQTLNSKVDQL 22 (56)
T ss_dssp HHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333334444444444433
No 195
>PF14712 Snapin_Pallidin: Snapin/Pallidin
Probab=25.80 E-value=1.8e+02 Score=18.40 Aligned_cols=41 Identities=20% Similarity=0.326 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHccCC------HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMT------EAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt------~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
|+.+..+|+.++.... ...-...+..++.++..+..++..+
T Consensus 37 i~~~~~~L~~~~~~~~~~~~~~~~~y~~KL~~ikkrm~~l~~~l~~l 83 (92)
T PF14712_consen 37 IDRLNEKLKELNEVEQINEPFDLDPYVKKLVNIKKRMSNLHERLQKL 83 (92)
T ss_pred HHHHHHHHHHHHHhhhhhhHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555555111 1223333444444444444444444
No 196
>PF09325 Vps5: Vps5 C terminal like; InterPro: IPR015404 Vps5 is a sorting nexin that functions in membrane trafficking. This is the C-terminal dimerisation domain [].
Probab=25.75 E-value=2.7e+02 Score=20.35 Aligned_cols=39 Identities=23% Similarity=0.371 Sum_probs=19.7
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
..++.+..|..++... ...+..+..++.+++.+...++.
T Consensus 146 kkk~~~~kl~~~~~~~--~~k~~~~~~ei~~~~~~~~~~~~ 184 (236)
T PF09325_consen 146 KKKAQLEKLKASGKNR--QDKVEQAENEIEEAERRVEQAKD 184 (236)
T ss_pred HHHHHHhcccccchhh--hHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444443322 45555566666666666665543
No 197
>COG5457 Uncharacterized conserved small protein [Function unknown]
Probab=25.15 E-value=1.8e+02 Score=18.11 Aligned_cols=32 Identities=22% Similarity=0.532 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhcccccc
Q psy8735 68 MVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 68 ~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiEtDe 107 (115)
..+.|+.++|-+.++- .++ ++++.+|||.-.+
T Consensus 21 ~~~~W~~~rr~r~eL~-~ls-------d~~L~DiGisR~d 52 (63)
T COG5457 21 AIRRWRRYRRTRRELL-RLS-------DHLLSDIGISRAD 52 (63)
T ss_pred HHHHHHHHHHHHHHHH-HHh-------HHHHHHcCCCHHH
Confidence 3448999999888765 333 2888999997544
No 198
>PRK00846 hypothetical protein; Provisional
Probab=25.04 E-value=2e+02 Score=18.58 Aligned_cols=22 Identities=14% Similarity=0.078 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHhh
Q psy8735 24 KQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 24 ~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
...|..|+.++.-|..||..+.
T Consensus 40 q~~I~~L~~ql~~L~~rL~~~~ 61 (77)
T PRK00846 40 RLTGARNAELIRHLLEDLGKVR 61 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc
Confidence 3444444555555555555544
No 199
>PRK14549 50S ribosomal protein L29P; Provisional
Probab=24.95 E-value=1.8e+02 Score=18.09 Aligned_cols=54 Identities=22% Similarity=0.325 Sum_probs=32.8
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCC-CCChhhhcHHhHHHHHHHHHHHHHHHHH
Q psy8735 18 MTEAEAKQKKAELQAELEEKKSKLEKLSRNVN-VISEGEVDKKTITDTKEKMVKEYNKRKR 77 (115)
Q Consensus 18 lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~-~Vs~ee~~~~~v~~~~~k~~~~w~kRKR 77 (115)
+|.+||...+.+++.+.-.+.-+ ++.+. .-.|..+. .+-+.-.+....-..|+|
T Consensus 12 ls~~eL~~~l~elk~eLf~LR~q----~~~~~~l~n~~~ir--~~Rk~IARi~Tvl~ek~~ 66 (69)
T PRK14549 12 MSPEEREEKLEELKLELLKERAQ----AAMGGAPENPGRIR--EIRRTIARILTIQREKKR 66 (69)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHH----HHhCcCccccHHHH--HHHHHHHHHHHHHHHHHh
Confidence 56666666666666555444321 12333 67888888 777777776666666554
No 200
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=24.83 E-value=2.9e+02 Score=20.40 Aligned_cols=12 Identities=17% Similarity=0.349 Sum_probs=6.2
Q ss_pred hHHHHHHHHHHH
Q psy8735 60 TITDTKEKMVKE 71 (115)
Q Consensus 60 ~v~~~~~k~~~~ 71 (115)
.+...|......
T Consensus 136 ~~~eDY~~L~~I 147 (161)
T TIGR02894 136 TIEEDYQTLIDI 147 (161)
T ss_pred HHHHHHHHHHHH
Confidence 455556554443
No 201
>PF06102 DUF947: Domain of unknown function (DUF947); InterPro: IPR009292 This is a family of eukaryotic proteins with unknown function.
Probab=24.77 E-value=1.2e+02 Score=22.21 Aligned_cols=15 Identities=20% Similarity=0.397 Sum_probs=8.5
Q ss_pred hHHHHHHHHHHHHcc
Q psy8735 3 DKEKVKSKLKQLSQQ 17 (115)
Q Consensus 3 ~~~~l~seL~~L~s~ 17 (115)
|+..|..+|+.....
T Consensus 64 E~~~Lk~~lk~~k~~ 78 (168)
T PF06102_consen 64 EIKELKKQLKKTKDP 78 (168)
T ss_pred HHHHHHHHHHHcCCH
Confidence 555666666555533
No 202
>COG3783 CybC Soluble cytochrome b562 [Energy production and conversion]
Probab=24.44 E-value=1.9e+02 Score=19.73 Aligned_cols=51 Identities=10% Similarity=-0.003 Sum_probs=37.2
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhh
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEV 56 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~ 56 (115)
.|-+-++...++.++..+...+...+.-...++.-..+-..|..++||+=+
T Consensus 3 ~l~~~mk~v~~a~~~a~~~~~le~mr~aa~daqk~~p~kl~~k~~dspE~k 53 (100)
T COG3783 3 TLNDNMKVVEKADKAAVIKAALEKMRAAALDAQKATPPKLEGKYPDSPEMK 53 (100)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHcCCccccCCCCCChhHH
Confidence 456677888888888888888888777777777666665566677777654
No 203
>PRK02793 phi X174 lysis protein; Provisional
Probab=24.43 E-value=1.6e+02 Score=18.42 Aligned_cols=25 Identities=16% Similarity=0.171 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
.++|-..+.+...++..+..+|..|
T Consensus 24 Ie~Ln~~v~~Qq~~I~~L~~~l~~L 48 (72)
T PRK02793 24 IEELNVTVTAHEMEMAKLRDHLRLL 48 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444433
No 204
>COG3027 zapA Cell division protein ZapA (stimulator of FtsZ polymerization and Z-ring component) [Cell cycle control, cell division, chromosome partitioning]
Probab=24.42 E-value=2.2e+02 Score=19.33 Aligned_cols=15 Identities=27% Similarity=0.450 Sum_probs=7.1
Q ss_pred chHHHHHHHHHHHHc
Q psy8735 2 DDKEKVKSKLKQLSQ 16 (115)
Q Consensus 2 ~~~~~l~seL~~L~s 16 (115)
+|++.+...++.+.+
T Consensus 61 ~eL~~l~~k~~~~~~ 75 (105)
T COG3027 61 HELLKLKEKLRDIEA 75 (105)
T ss_pred HHHHHHHHHHHHHHH
Confidence 344444444444444
No 205
>PRK03987 translation initiation factor IF-2 subunit alpha; Validated
Probab=24.15 E-value=1.4e+02 Score=23.46 Aligned_cols=38 Identities=26% Similarity=0.387 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHhccccc
Q psy8735 68 MVKEYNKRKRMCTEMLEAILENYPKSKKILLEEVGIETD 106 (115)
Q Consensus 68 ~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~EelGiEtD 106 (115)
....|++-++ ...++..+++.++.+..++++.+|..-.
T Consensus 90 ~~~~~~~~~~-~~~il~~~a~~~~~~~e~~~~~~~~~l~ 127 (262)
T PRK03987 90 KIQEWKNEQK-ADKWLELAAEKLGKSLEEAWEEVGYKLE 127 (262)
T ss_pred HHHHHHHHhh-HhhHHHHHHHHcCCCHHHHHHHHHHHHH
Confidence 4456666555 5588999999999999999999987543
No 206
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=23.82 E-value=4.2e+02 Score=23.92 Aligned_cols=22 Identities=5% Similarity=0.178 Sum_probs=8.9
Q ss_pred HHHHHHHHhcCcCcHHHHHHHh
Q psy8735 80 TEMLEAILENYPKSKKILLEEV 101 (115)
Q Consensus 80 ~di~d~I~E~~p~~kk~L~Eel 101 (115)
++.|+.|-+++...=..||..+
T Consensus 1031 ~~~F~~v~~~f~~~F~~lf~~~ 1052 (1179)
T TIGR02168 1031 RERFKDTFDQVNENFQRVFPKL 1052 (1179)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333334455443
No 207
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=23.71 E-value=2.5e+02 Score=19.27 Aligned_cols=25 Identities=28% Similarity=0.251 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
++|+..+.+|-.++..|+-.-+.||
T Consensus 25 ~~LK~~~~el~EEN~~L~iEN~~Lr 49 (110)
T PRK13169 25 GALKKQLAELLEENTALRLENDKLR 49 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555555555555555554
No 208
>KOG0994|consensus
Probab=23.70 E-value=4.6e+02 Score=25.87 Aligned_cols=45 Identities=11% Similarity=0.302 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHHHHhhh--cCCCCChhhhcHHhHHHHHHHHHHHHHH
Q psy8735 28 AELQAELEEKKSKLEKLSR--NVNVISEGEVDKKTITDTKEKMVKEYNK 74 (115)
Q Consensus 28 ~~L~~e~~~l~~rL~~l~~--~~~~Vs~ee~~~~~v~~~~~k~~~~w~k 74 (115)
..+.+....|+++|...+. ++..||.++.+ ++...+...+++-..
T Consensus 1197 gay~s~f~~me~kl~~ir~il~~~svs~~~i~--~l~~~~~~lr~~l~~ 1243 (1758)
T KOG0994|consen 1197 GAYASRFLDMEEKLEEIRAILSAPSVSAEDIA--QLASATESLRRQLQA 1243 (1758)
T ss_pred hhhHhHHHHHHHHHHHHHHHhcCCCccHHHHH--HHHHHHHHHHHHHHH
Confidence 3566788888888888875 77899999999 888888777665543
No 209
>PRK01203 prefoldin subunit alpha; Provisional
Probab=23.58 E-value=1.5e+02 Score=20.97 Aligned_cols=38 Identities=18% Similarity=0.257 Sum_probs=21.0
Q ss_pred HHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 6 KVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 6 ~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
.+.+++..|.+. .+.|..++..|+....++.+-++.|.
T Consensus 4 ~~~~~~~~~~~q--~e~l~~ql~~L~~a~se~~~~ie~L~ 41 (130)
T PRK01203 4 DVEAQLNYIESL--ISSVDSQIDSLNKTLSEVQQTISFLS 41 (130)
T ss_pred hHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555554 34555666666666655555555553
No 210
>PF00831 Ribosomal_L29: Ribosomal L29 protein; InterPro: IPR001854 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L29 is one of the proteins from the large ribosomal subunit. L29 belongs to a family of ribosomal proteins of 63 to 138 amino-acid residues which, on the basis of sequence similarities [], groups: Red algal L29. Bacterial L29. Mammalian L35 Caenorhabditis elegans L35 (ZK652.4). Yeast L35. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1VSP_W 3MS1_Y 3MRZ_Y 3F1H_2 3PYT_Y 3PYO_Y 3D5D_2 3D5B_2 3PYR_Y 1VSA_W ....
Probab=23.57 E-value=1.7e+02 Score=17.38 Aligned_cols=48 Identities=15% Similarity=0.234 Sum_probs=27.6
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHH
Q psy8735 17 QMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVK 70 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~ 70 (115)
.+|.+||...+.++..+.-.+.-+.. .+....|..+. .+.+.-.+...
T Consensus 6 ~ls~~eL~~~l~elk~eL~~Lr~q~~----~~~l~n~~~ir--~~Rr~IARi~T 53 (58)
T PF00831_consen 6 ELSDEELQEKLEELKKELFNLRFQKA----TGQLENPHRIR--EIRRDIARILT 53 (58)
T ss_dssp HSHHHHHHHHHHHHHHHHHHHHHHHH----HSSSSCCHHHH--HHHHHHHHHHH
T ss_pred hCCHHHHHHHHHHHHHHHHHHHHHHH----hcccccccHHH--HHHHHHHHHHH
Confidence 35666666666666665554443332 34557888888 66655544443
No 211
>PF03938 OmpH: Outer membrane protein (OmpH-like); InterPro: IPR005632 This entry includes outer membrane proteins such as OmpH (Skp) among others. OmpH (outer membrane protein H) is a major structural protein of the outer membrane. In Pasteurella multocida it acts as a channel-forming transmembrane porin []. Porins act as molecular sieves to allow the diffusion of small hydrophilic solutes through the outer membrane and also acts as a receptor for bacteriophages and bacteriocins. Porins are highly immunogenic and are conserved in bacterial families, making them attractive vaccine candidates []. The 17kDa protein (Skp, OmpH) of Escherichia coli is a homotrimeric periplasmic chaperone for newly synthesised outer-membrane proteins, the X-ray structure of which has been reported at resolutions of 2.35 A and 2.30 A [, ]. Three hairpin-shaped alpha-helical extensions reach out by approximately 60 A from a trimerisation domain, which is composed of three intersubunit beta-sheets that wind around a central axis. The alpha-helical extensions approach each other at their distal turns, resulting in a fold that resembles a 'three-pronged grasping forcep'. The overall shape of Skp is reminiscent of the cytosolic chaperone prefoldin (IPR009053 from INTERPRO), although it is based on a radically different topology. The peculiar architecture, with apparent plasticity of the prongs and distinct electrostatic and hydrophobic surface properties, supports the recently proposed biochemical mechanism of this chaperone: formation of a Skp(3)-Omp complex protects the outer membrane protein from aggregation during passage through the bacterial periplasm. The ability of Skp to prevent the aggregation of model substrates in vitro is independent of ATP. Skp can interact directly with membrane lipids and lipopolysaccharide. These interactions are needed for efficient Skp-assisted folding of membrane proteins [].; GO: 0051082 unfolded protein binding; PDB: 1SG2_C 1U2M_C.
Probab=23.55 E-value=2.5e+02 Score=19.29 Aligned_cols=15 Identities=33% Similarity=0.627 Sum_probs=5.1
Q ss_pred cCCHHHHHHHHHHHH
Q psy8735 17 QMTEAEAKQKKAELQ 31 (115)
Q Consensus 17 ~lt~~el~~~i~~L~ 31 (115)
.+|.++......++.
T Consensus 72 ~ls~~~~~~~~~~l~ 86 (158)
T PF03938_consen 72 TLSEEERQKRQQELQ 86 (158)
T ss_dssp --SSHHHHHHHHHHH
T ss_pred ccchhHHHHHHHHHH
Confidence 344444443333333
No 212
>PF14335 DUF4391: Domain of unknown function (DUF4391)
Probab=23.52 E-value=62 Score=24.36 Aligned_cols=22 Identities=23% Similarity=0.324 Sum_probs=12.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy8735 23 AKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l 44 (115)
...++.+|++++..++.++.+-
T Consensus 180 ~~~~i~~L~kei~~L~~~~~kE 201 (221)
T PF14335_consen 180 RLEQIEKLEKEIAKLKKKIKKE 201 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHhc
Confidence 3445566666666666666543
No 213
>COG1644 RPB10 DNA-directed RNA polymerase, subunit N (RpoN/RPB10) [Transcription]
Probab=23.26 E-value=1.1e+02 Score=19.12 Aligned_cols=17 Identities=18% Similarity=0.501 Sum_probs=13.5
Q ss_pred HHhcCcCcHHHHHHHhccc
Q psy8735 86 ILENYPKSKKILLEEVGIE 104 (115)
Q Consensus 86 I~E~~p~~kk~L~EelGiE 104 (115)
+.+| +.+++.+++|||+
T Consensus 25 v~~g--e~p~~vLDdLGv~ 41 (63)
T COG1644 25 VEEG--EDPGEVLDDLGVK 41 (63)
T ss_pred HHcC--CCHHHHHHHhCcH
Confidence 4455 7789999999986
No 214
>PRK03918 chromosome segregation protein; Provisional
Probab=23.13 E-value=5.4e+02 Score=22.91 Aligned_cols=98 Identities=20% Similarity=0.287 Sum_probs=0.0
Q ss_pred chHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHH
Q psy8735 2 DDKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTE 81 (115)
Q Consensus 2 ~~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~d 81 (115)
+++..++.++..+...+ .++...+..++.++..+...+..+...-.........-.++...+..+...-..=......
T Consensus 659 ~~~~~l~~~~~~l~~~l--~~l~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~l~~~~~~l~~lr~~~~~~~~~ 736 (880)
T PRK03918 659 EEYEELREEYLELSREL--AGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELREKVKKYKAL 736 (880)
T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHhcCcCcHHHHHHHh
Q psy8735 82 MLEAILENYPKSKKILLEEV 101 (115)
Q Consensus 82 i~d~I~E~~p~~kk~L~Eel 101 (115)
+...+...+..--..+|..|
T Consensus 737 l~~~~~~~l~~~~~~if~~l 756 (880)
T PRK03918 737 LKERALSKVGEIASEIFEEL 756 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHH
No 215
>PF15463 ECM11: Extracellular mutant protein 11
Probab=23.13 E-value=1.9e+02 Score=20.23 Aligned_cols=28 Identities=25% Similarity=0.348 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCC
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLSRNVN 49 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~~~~~ 49 (115)
++...-..|+.+...+..+|..++.++.
T Consensus 109 eI~~R~eav~~~~~~l~~kL~~mk~~G~ 136 (139)
T PF15463_consen 109 EINRRAEAVRAQGEQLDRKLEKMKEGGK 136 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 4444456667777777777777776443
No 216
>PRK10884 SH3 domain-containing protein; Provisional
Probab=23.13 E-value=2.3e+02 Score=21.41 Aligned_cols=42 Identities=24% Similarity=0.302 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHHccC--CHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 4 KEKVKSKLKQLSQQM--TEAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 4 ~~~l~seL~~L~s~l--t~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+.+++++|+.+.+.. .+.++...+++....+.+++++...|+
T Consensus 102 l~~l~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~L~~~n~~L~ 145 (206)
T PRK10884 102 VKTLTDKLNNIDNTWNQRTAEMQQKVAQSDSVINGLKEENQKLK 145 (206)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444443331 223444445555555555555555554
No 217
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=23.00 E-value=2.7e+02 Score=19.34 Aligned_cols=22 Identities=23% Similarity=0.383 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy8735 23 AKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l 44 (115)
|.+.|..++.++..++.++..|
T Consensus 21 L~s~lr~~E~E~~~l~~el~~l 42 (120)
T PF12325_consen 21 LQSQLRRLEGELASLQEELARL 42 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444444443
No 218
>PHA01750 hypothetical protein
Probab=22.92 E-value=2.1e+02 Score=18.22 Aligned_cols=17 Identities=47% Similarity=0.559 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy8735 25 QKKAELQAELEEKKSKL 41 (115)
Q Consensus 25 ~~i~~L~~e~~~l~~rL 41 (115)
..+..|+.++.++..++
T Consensus 56 ikqDnl~~qv~eik~k~ 72 (75)
T PHA01750 56 IKQDELSRQVEEIKRKL 72 (75)
T ss_pred HhHHHHHHHHHHHHHhh
Confidence 33344444444444444
No 219
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=22.90 E-value=1.9e+02 Score=17.53 Aligned_cols=24 Identities=25% Similarity=0.483 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
|+..++..|+.++..++..+..+.
T Consensus 1 D~~~E~~rL~Kel~kl~~~i~~~~ 24 (66)
T PF10458_consen 1 DVEAEIERLEKELEKLEKEIERLE 24 (66)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345667777777777777777664
No 220
>PF12824 MRP-L20: Mitochondrial ribosomal protein subunit L20; InterPro: IPR024388 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This entry represents the essential mitochondrial ribosomal protein L20 family from fungi [].
Probab=22.87 E-value=3.1e+02 Score=20.03 Aligned_cols=62 Identities=18% Similarity=0.249 Sum_probs=39.6
Q ss_pred cCCHHHHHHHHHHHHHH------HHHHHHHHHHhh---hcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHH
Q psy8735 17 QMTEAEAKQKKAELQAE------LEEKKSKLEKLS---RNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTE 81 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e------~~~l~~rL~~l~---~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~d 81 (115)
.||.+++ ++|.+|..+ +..|..+..=-. +-....+++.++ .++..+......|-.||+++++
T Consensus 85 ~Lt~e~i-~Eir~LR~~DP~~wTr~~LAkkF~~S~~fV~~v~~~~~e~~~--~~~~~le~~k~rWg~~r~~ARe 155 (164)
T PF12824_consen 85 HLTPEDI-QEIRRLRAEDPEKWTRKKLAKKFNCSPLFVSMVAPAPKEKKK--EMEARLEAIKSRWGPRRRIARE 155 (164)
T ss_pred cCCHHHH-HHHHHHHHcCchHhhHHHHHHHhCCCHHHHHHhcCCCHHHHH--HHHHHHHHHHHhccHHHHHHHH
Confidence 3555444 456667765 334444332111 122344557777 8999999999999999999987
No 221
>PF08946 Osmo_CC: Osmosensory transporter coiled coil; InterPro: IPR015041 The osmosensory transporter coiled coil is a C-terminal domain found in various bacterial osmoprotective transporters, such as ProP, Proline/betaine transporter, Proline permease 2 and the citrate proton symporters. It adopts an antiparallel coiled-coil structure, and is essential for osmosensory and osmoprotectant transporter function []. ; PDB: 1R48_B.
Probab=22.72 E-value=1.8e+02 Score=17.14 Aligned_cols=22 Identities=23% Similarity=0.430 Sum_probs=14.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy8735 23 AKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l 44 (115)
+...|..+..++++|+.+-..|
T Consensus 17 IEqkiedid~qIaeLe~KR~~L 38 (46)
T PF08946_consen 17 IEQKIEDIDEQIAELEAKRQRL 38 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHhHHHHHHHHHHHHHHHHHH
Confidence 4566667777777777665555
No 222
>PF07407 Seadorna_VP6: Seadornavirus VP6 protein; InterPro: IPR009982 This family consists of several VP6 proteins from the Banna virus as well as a related protein VP5 from the Kadipiro virus. Members of this family are typically of around 420 residues in length. The function of this family is unknown.
Probab=22.71 E-value=2e+02 Score=24.02 Aligned_cols=26 Identities=19% Similarity=0.284 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
..|+.+-..|++|+.+|..+|+.|.+
T Consensus 35 ~aLr~EN~~LKkEN~~Lk~eVerLE~ 60 (420)
T PF07407_consen 35 FALRMENHSLKKENNDLKIEVERLEN 60 (420)
T ss_pred hhHHHHhHHHHHHHHHHHHHHHHHHH
Confidence 45777788888888888888888854
No 223
>PRK00306 50S ribosomal protein L29; Reviewed
Probab=22.66 E-value=1.9e+02 Score=17.60 Aligned_cols=53 Identities=13% Similarity=0.162 Sum_probs=34.0
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHH
Q psy8735 17 QMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKR 75 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kR 75 (115)
.+|.+||...+.+++.+.-.+.-+. +.+.+..|..+. .+.+.-.+....-..|
T Consensus 8 ~ls~~eL~~~l~~lkkeL~~lR~~~----~~~~~~n~~~i~--~~rk~IARi~Tvl~er 60 (66)
T PRK00306 8 ELSVEELNEKLLELKKELFNLRFQK----ATGQLENTHRLR--EVRRDIARIKTVLRER 60 (66)
T ss_pred hCCHHHHHHHHHHHHHHHHHHHHHH----HhCCCcCcHHHH--HHHHHHHHHHHHHHHH
Confidence 4678888888888888766555222 233477888888 6666655555444443
No 224
>PF02954 HTH_8: Bacterial regulatory protein, Fis family; InterPro: IPR002197 The Factor for Inversion Stimulation (FIS) protein is a regulator of bacterial functions, and binds specifically to weakly related DNA sequences [,]. It activates ribosomal RNA transcription, and is involved in upstream activation of rRNA promoters. The protein has been shown to play a role in the regulation of virulence factors in both Salmonella typhimurium and Escherichia coli []. Some of its functions include inhibition of the initiation of DNA replication from the OriC site, and promotion of Hin-mediated DNA inversion. In its C-terminal extremity, FIS encodes a helix-turn-helix (HTH) DNA- binding motif, which shares a high degree of similarity with other HTH motifs of more primitive bacterial transcriptional regulators, such as the nitrogen assimilation regulatory proteins (NtrC) from species like Azobacter, Rhodobacter and Rhizobium. This has led to speculation that both evolved from a single common ancestor []. The 3-dimensional structure of the E. coli FIS DNA-binding protein has been determined by means of X-ray diffraction to 2.0A resolution [,]. FIS is composed of four alpha-helices tightly intertwined to form a globular dimer with two protruding HTH motifs. The 24 N-terminal amino acids are poorly defined, indicating that they might act as `feelers' suitable for DNA or protein (invertase) recognition []. Other proteins belonging to this subfamily include: E. coli: atoC, hydG, ntrC, fhlA, tyrR, Rhizobium spp.: ntrC, nifA, dctD ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NTC_A 3JRH_A 3JRB_A 3IV5_A 3JRI_A 1ETQ_A 1ETW_B 1ETY_A 3JRF_A 3JRA_A ....
Probab=22.63 E-value=1.4e+02 Score=16.31 Aligned_cols=26 Identities=15% Similarity=0.211 Sum_probs=20.3
Q ss_pred HHHHHHHHhcCcCcHHHHHHHhcccc
Q psy8735 80 TEMLEAILENYPKSKKILLEEVGIET 105 (115)
Q Consensus 80 ~di~d~I~E~~p~~kk~L~EelGiEt 105 (115)
++++...++.+.++...-.+.|||..
T Consensus 7 ~~~i~~aL~~~~gn~~~aA~~Lgisr 32 (42)
T PF02954_consen 7 KQLIRQALERCGGNVSKAARLLGISR 32 (42)
T ss_dssp HHHHHHHHHHTTT-HHHHHHHHTS-H
T ss_pred HHHHHHHHHHhCCCHHHHHHHHCCCH
Confidence 46788888999999999999999863
No 225
>PF01340 MetJ: Met Apo-repressor, MetJ; InterPro: IPR002084 Binding of a specific DNA fragment and S-adenosyl methionine (SAM) co-repressor molecules to the Escherichia coli methionine repressor (MetJ) leads to a significant reduction in dynamic flexibility of the ternary complex, with considerable entropy-enthalpy compensation, not necessarily involving any overall conformational change []. MetJ is a regulatory protein which when combined with S-adenosylmethionine (SAM) represses the expression of the methionine regulon and of enzymes involved in SAM synthesis. It is also autoregulated. The crystal structure of the met repressor-operator complex shows two dimeric repressor molecules bound to adjacent sites 8 base pairs apart on an 18-base-pair DNA fragment. Sequence specificity is achieved by insertion of double-stranded antiparallel protein beta-ribbons into the major groove of B-form DNA, with direct hydrogen-bonding between amino-acid side chains and the base pairs. The repressor also recognises sequence-dependent distortion or flexibility of the operator phosphate backbone, conferring specificity even for inaccessible base pairs [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0006555 methionine metabolic process; PDB: 1MJO_D 1CMB_A 1MJQ_C 1CMC_B 1MJK_A 1MJ2_A 1MJP_A 1MJM_B 1CMA_B 1MJL_A ....
Probab=22.58 E-value=44 Score=22.57 Aligned_cols=20 Identities=35% Similarity=0.747 Sum_probs=13.3
Q ss_pred hcCcCcHHHHHHHhcccccc
Q psy8735 88 ENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 88 E~~p~~kk~L~EelGiEtDe 107 (115)
+.+|...+.+++++||+.++
T Consensus 82 d~ip~~~~~~m~~~gi~~~~ 101 (104)
T PF01340_consen 82 DEIPAEAKRIMEEMGIDPEE 101 (104)
T ss_dssp SSS-HHHHHHHHHTT--TTT
T ss_pred ccchHHHHHHHHHhCCChhh
Confidence 45666678999999999875
No 226
>KOG4191|consensus
Probab=22.57 E-value=1.5e+02 Score=25.65 Aligned_cols=73 Identities=23% Similarity=0.274 Sum_probs=37.9
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh-------cCCCCChhhhcHHhHHHHHHHHHHHHHHHHH
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR-------NVNVISEGEVDKKTITDTKEKMVKEYNKRKR 77 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~-------~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKR 77 (115)
++.-+||+.|+. .+..+.+.+.+...+|-+|-. -...+..-++ .|...|.+..+.-+|||.
T Consensus 404 DEvlaeLR~lqa---------eLk~vS~~N~k~k~~Ll~la~eE~a~qe~~q~lddlDk---qI~qaYvKr~r~~kkrKk 471 (516)
T KOG4191|consen 404 DEVLAELRKLQA---------ELKAVSAHNRKKKHDLLRLAPEEMARQEFQQVLDDLDK---QIEQAYVKRNRSRKKRKK 471 (516)
T ss_pred HHHHHHHHHHHH---------HHHHHHhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHhhcc
Confidence 444555555543 345566666777666665521 1122222222 577888776666666554
Q ss_pred -HHH---HHHHHHHhc
Q psy8735 78 -MCT---EMLEAILEN 89 (115)
Q Consensus 78 -i~~---di~d~I~E~ 89 (115)
+.. .+|.+|.+.
T Consensus 472 ht~~ek~~~~~~~~eq 487 (516)
T KOG4191|consen 472 HTVTEKIGSTSQISEQ 487 (516)
T ss_pred cchHhhhhhHHHHHHH
Confidence 333 345555443
No 227
>PF06428 Sec2p: GDP/GTP exchange factor Sec2p; InterPro: IPR009449 In Saccharomyces cerevisiae, Sec2p is a GDP/GTP exchange factor for Sec4p, which is required for vesicular transport at the post-Golgi stage of yeast secretion []. It catalyzes the dissociation of GDP from SEC4 and also potently promoting binding of GTP. Activation of SEC4 by SEC2 is needed for the directed transport of vesicles to sites of exocytosis. Binds the Rab GTPase YPT32, but does not have exhange activity on YPT32 [, , ].; PDB: 2EQB_C 2E7S_K 2OCY_A.
Probab=22.54 E-value=2.5e+02 Score=18.88 Aligned_cols=39 Identities=28% Similarity=0.378 Sum_probs=19.3
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
..+++||-.|+.+| .+++...|+.-+.+...++.++..|
T Consensus 18 ~~ie~ElEeLTasL-FeEAN~MVa~ar~e~~~~e~k~~~l 56 (100)
T PF06428_consen 18 EQIESELEELTASL-FEEANKMVADARRERAALEEKNEQL 56 (100)
T ss_dssp HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555555554 3444555544444444444444444
No 228
>smart00502 BBC B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains
Probab=22.41 E-value=2.2e+02 Score=18.21 Aligned_cols=28 Identities=29% Similarity=0.294 Sum_probs=18.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 60 TITDTKEKMVKEYNKRKRMCTEMLEAIL 87 (115)
Q Consensus 60 ~v~~~~~k~~~~w~kRKRi~~di~d~I~ 87 (115)
.|...|..++..-..|+.-...=++.+.
T Consensus 43 ~I~~~f~~l~~~L~~~e~~ll~~l~~~~ 70 (127)
T smart00502 43 QIKAAFDELRNALNKRKKQLLEDLEEQK 70 (127)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6777777777777777766655555543
No 229
>TIGR02905 spore_yutH spore coat protein YutH. Members of this family are homologs of the Bacillus subtilis spore coat protein CotS. Members of this family, designated YutH, are found only in the family Bacillaceae from among the endospore-forming members of the Firmicutes branch of the Bacteria.
Probab=22.40 E-value=2.1e+02 Score=22.41 Aligned_cols=55 Identities=9% Similarity=0.098 Sum_probs=40.6
Q ss_pred HHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcH
Q psy8735 36 EKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSK 94 (115)
Q Consensus 36 ~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~k 94 (115)
..-..|..|+..+...++.-.. ...+.+|...|.+|.+-+..+|..|....++++
T Consensus 97 ~~~~~LA~lH~as~~f~~~~~~----~~~~~~~~~~~~k~~~el~~~~~~~~~~~~~s~ 151 (313)
T TIGR02905 97 DLGRELATFHKRGRQYPRPIWK----KSRIGQWKELWAKRLDQLEDWWQSLLAQGPQTL 151 (313)
T ss_pred HHHHHHHHHHHhcCCCCCCccc----cccccccHHHHHHHHHHHHHHHHHHHhcCCCCh
Confidence 5566778887655554443222 245689999999999999999999988888766
No 230
>COG4079 Uncharacterized protein conserved in archaea [Function unknown]
Probab=22.38 E-value=4.1e+02 Score=21.31 Aligned_cols=33 Identities=27% Similarity=0.438 Sum_probs=24.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcCcH
Q psy8735 60 TITDTKEKMVKEYNKRKRMCTEMLEAILENYPKSK 94 (115)
Q Consensus 60 ~v~~~~~k~~~~w~kRKRi~~di~d~I~E~~p~~k 94 (115)
++.+ ..-....+-|.|+..+|+|.+-|+|.-++
T Consensus 187 rl~k--kDie~L~k~R~~L~~emld~~~e~~~~~k 219 (293)
T COG4079 187 RLVK--KDIETLRKHRRRLAEEMLDHIREGYDILK 219 (293)
T ss_pred HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444 44555667788999999999999987544
No 231
>PF10359 Fmp27_WPPW: RNA pol II promoter Fmp27 protein domain; InterPro: IPR019449 The function of the FMP27 protein is not known. FMP27 is the product of a nuclear encoded gene but it is detected in highly purified mitochondria in high-throughput studies []. This entry represents a domain within FMP27 that contains characteristic HQR and WPPW sequence motifs.
Probab=22.33 E-value=1.9e+02 Score=24.49 Aligned_cols=23 Identities=30% Similarity=0.401 Sum_probs=11.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l 44 (115)
++..++..+..++..+..++.-|
T Consensus 197 ~~~~~~~~l~~~~~~l~~~~~~l 219 (475)
T PF10359_consen 197 ELKSDIEELERHISSLKERIEFL 219 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555555555555444443
No 232
>PF02183 HALZ: Homeobox associated leucine zipper; InterPro: IPR003106 This region is a plant specific leucine zipper that is always found associated with a homeobox []. ; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=22.33 E-value=1.7e+02 Score=16.81 Aligned_cols=26 Identities=19% Similarity=0.300 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
+.|...-..|..++..+.+.+..|++
T Consensus 15 d~Lk~~~~~L~~E~~~L~aev~~L~~ 40 (45)
T PF02183_consen 15 DSLKAEYDSLKKENEKLRAEVQELKE 40 (45)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566666677777777777766654
No 233
>TIGR02976 phageshock_pspB phage shock protein B. This model describes the PspB protein of the psp (phage shock protein) operon, as found in Escherichia coli and many related species. Expression of a phage protein called secretin protein IV, and a number of other stresses including ethanol, heat shock, and defects in protein secretion trigger sigma-54-dependent expression of the phage shock regulon. PspB is both a regulator and an effector protein of the phage shock response.
Probab=22.29 E-value=1.7e+02 Score=18.79 Aligned_cols=26 Identities=19% Similarity=0.358 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
++-...+.+|-...+.|++|++.|..
T Consensus 38 ~~d~~~L~~L~~~a~rm~eRI~tLE~ 63 (75)
T TIGR02976 38 TDDQALLQELYAKADRLEERIDTLER 63 (75)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556778999999999999999864
No 234
>PRK12423 LexA repressor; Provisional
Probab=22.20 E-value=1.8e+02 Score=21.36 Aligned_cols=31 Identities=23% Similarity=0.273 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHHHh-cCcCcHHHHHHHhcc
Q psy8735 73 NKRKRMCTEMLEAILE-NYPKSKKILLEEVGI 103 (115)
Q Consensus 73 ~kRKRi~~di~d~I~E-~~p~~kk~L~EelGi 103 (115)
.+|+.++.-|-+.|.+ ++|.+-.+|.+.+|+
T Consensus 6 ~~q~~il~~l~~~i~~~g~~Ps~~eia~~~g~ 37 (202)
T PRK12423 6 PKRAAILAFIRERIAQAGQPPSLAEIAQAFGF 37 (202)
T ss_pred HHHHHHHHHHHHHHHHcCCCCCHHHHHHHhCC
Confidence 4678888888887744 678889999999996
No 235
>PRK13848 conjugal transfer protein TraC; Provisional
Probab=22.17 E-value=1.6e+02 Score=19.97 Aligned_cols=51 Identities=18% Similarity=0.399 Sum_probs=30.1
Q ss_pred CHHHHHHHHHHHHHHHHHHHHH----HHHh--hhcC--CCCChhhhcHHhHHHHHHHHHHHHHH
Q psy8735 19 TEAEAKQKKAELQAELEEKKSK----LEKL--SRNV--NVISEGEVDKKTITDTKEKMVKEYNK 74 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~r----L~~l--~~~~--~~Vs~ee~~~~~v~~~~~k~~~~w~k 74 (115)
+..++..+|+.|+.+...+..+ +..+ ++|- --|+.+++. ..|......+|+
T Consensus 4 ~~s~I~~eI~kLqe~lk~~e~keAERigRiAlKAGLgeieI~d~eL~-----~aFeeiAaRFR~ 62 (98)
T PRK13848 4 PSSKIREEIAKLQEQLKQAETREAERIGRIALKAGLGEIEIEEAELQ-----AAFEELAKRFRG 62 (98)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCccccccCHHHHH-----HHHHHHHHHHhc
Confidence 3567888888888888777654 4443 3433 444555544 455555444444
No 236
>PF01166 TSC22: TSC-22/dip/bun family; InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include: Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis. Caenorhabditis elegans hypothetical protein T18D3.7. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=22.16 E-value=2.1e+02 Score=17.73 Aligned_cols=25 Identities=20% Similarity=0.190 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKLSR 46 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l~~ 46 (115)
-|+..|..|...+..++..=.-|++
T Consensus 18 vLK~~I~eL~~~n~~Le~EN~~Lk~ 42 (59)
T PF01166_consen 18 VLKEQIAELEERNSQLEEENNLLKQ 42 (59)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3455555555555555554444443
No 237
>PF10360 DUF2433: Protein of unknown function (DUF2433); InterPro: IPR018829 This entry represents a conserved domain of 120 residues from a family fungal proteins. Their function is not known.
Probab=22.16 E-value=1.5e+02 Score=21.15 Aligned_cols=27 Identities=15% Similarity=0.126 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy8735 63 DTKEKMVKEYNKRKRMCTEMLEAILEN 89 (115)
Q Consensus 63 ~~~~k~~~~w~kRKRi~~di~d~I~E~ 89 (115)
-.++.|+.---..|-.|.+||+.|-..
T Consensus 12 ps~e~yR~Kl~~~k~~F~~vW~~VK~~ 38 (132)
T PF10360_consen 12 PSFEHYRSKLSASKASFGEVWETVKGQ 38 (132)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345678888888999999999999433
No 238
>PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional
Probab=22.12 E-value=5.6e+02 Score=22.75 Aligned_cols=42 Identities=12% Similarity=0.200 Sum_probs=33.8
Q ss_pred HHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChh
Q psy8735 13 QLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEG 54 (115)
Q Consensus 13 ~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~e 54 (115)
.-.+.-|.+-+..++..++.+....+.+|+.++.....+++.
T Consensus 262 ~~~a~~a~~fL~~qL~~l~~~L~~aE~~l~~fr~~~~~~d~~ 303 (726)
T PRK09841 262 AAQDSQSLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSVDLN 303 (726)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCC
Confidence 334455778899999999999999999999999866666653
No 239
>TIGR02912 sulfite_red_C sulfite reductase, subunit C. Members of this protein family include the C subunit, one of three subunits, of the anaerobic sulfite reductase of Salmonella, and close homologs from various Clostridum species, where the three-gene neighborhood is preserved. Two such gene clusters are found in Clostridium perfringens, but it may be that these sets of genes correspond to the distinct assimilatory and dissimilatory forms as seen in Clostridium pasteurianum. Note that any one of these enzymes may have secondary substates such as NH2OH, SeO3(2-), and SO3(2-). Heterologous expression of the anaerobic sulfite reductase of Salmonella confers on Escherichia coli the ability to produce hydrogen sulfide gas from sulfite.
Probab=22.01 E-value=1e+02 Score=24.29 Aligned_cols=55 Identities=18% Similarity=0.282 Sum_probs=33.6
Q ss_pred CCCChhhhcHHhHHHHHHHHHHHHHHH-----HHHHHHHHHHHHhcCcCcHHHHHHHhcccccc
Q psy8735 49 NVISEGEVDKKTITDTKEKMVKEYNKR-----KRMCTEMLEAILENYPKSKKILLEEVGIETDE 107 (115)
Q Consensus 49 ~~Vs~ee~~~~~v~~~~~k~~~~w~kR-----KRi~~di~d~I~E~~p~~kk~L~EelGiEtDe 107 (115)
..++.++.. .+....-.+-..|..| +.=|.++++.+ |...=++.+++++|+++|-
T Consensus 247 ~~~~~~~v~--~~~~~~~~~~~~~~~~~~~~~~eR~~~~i~r~--g~~~~~~~~~~~~~~~~~~ 306 (314)
T TIGR02912 247 KWVDEDSIV--KIILNTYDYITEYIDKNAPGGKEHIGYIVDRT--GYQEFKKWALKDVQLNPKA 306 (314)
T ss_pred hhcCcchhH--HHHHHHHHHHHHHHhccCCCccchHHHHHHHh--hHHHHHHHHhhhcccChhh
Confidence 345667776 6665554554555322 23566788777 4444456667888988875
No 240
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=22.00 E-value=3.3e+02 Score=20.08 Aligned_cols=22 Identities=36% Similarity=0.555 Sum_probs=8.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy8735 23 AKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l 44 (115)
+...+..|+.++..+..+++.+
T Consensus 132 L~~e~~~L~~~~~~l~~~~e~~ 153 (189)
T PF10211_consen 132 LEEEKEELEKQVQELKNKCEQL 153 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3333444444444444444433
No 241
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=21.79 E-value=2.1e+02 Score=19.79 Aligned_cols=21 Identities=19% Similarity=0.368 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy8735 21 AEAKQKKAELQAELEEKKSKL 41 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL 41 (115)
.++...|..|+.++..++.++
T Consensus 86 ~~l~~rvd~Lerqv~~Lenk~ 106 (108)
T COG3937 86 DELTERVDALERQVADLENKL 106 (108)
T ss_pred HHHHHHHHHHHHHHHHHHHHh
Confidence 344455555555555555443
No 242
>PF04508 Pox_A_type_inc: Viral A-type inclusion protein repeat ; InterPro: IPR007596 The repeat is found in the A-type inclusion protein of the Poxvirus family [].; GO: 0016032 viral reproduction
Probab=21.72 E-value=1.3e+02 Score=15.16 Aligned_cols=17 Identities=12% Similarity=0.311 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy8735 27 KAELQAELEEKKSKLEK 43 (115)
Q Consensus 27 i~~L~~e~~~l~~rL~~ 43 (115)
+..++..|..|+.+|..
T Consensus 3 ~~rlr~rI~dLer~L~~ 19 (23)
T PF04508_consen 3 MNRLRNRISDLERQLSE 19 (23)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44555555555555543
No 243
>KOG4571|consensus
Probab=21.65 E-value=2e+02 Score=23.32 Aligned_cols=34 Identities=26% Similarity=0.341 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCcCcHHHHHHHh
Q psy8735 68 MVKEYNKRKRMCTEMLEAILENYPKSKKILLEEV 101 (115)
Q Consensus 68 ~~~~w~kRKRi~~di~d~I~E~~p~~kk~L~Eel 101 (115)
+-..||.|||.-++-+.-.++++.+.-.+|.+.+
T Consensus 238 AAtRYRqKkRae~E~l~ge~~~Le~rN~~LK~qa 271 (294)
T KOG4571|consen 238 AATRYRQKKRAEKEALLGELEGLEKRNEELKDQA 271 (294)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4788999999999988888888887666666544
No 244
>COG4575 ElaB Uncharacterized conserved protein [Function unknown]
Probab=21.64 E-value=2.8e+02 Score=19.05 Aligned_cols=30 Identities=27% Similarity=0.338 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhhcCCCCChhh
Q psy8735 26 KKAELQAELEEKKSKLEKLSRNVNVISEGE 55 (115)
Q Consensus 26 ~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee 55 (115)
.|.+|..++..+-+-++.+=.++...+-++
T Consensus 9 ~~~~l~~el~~L~d~lEevL~ssg~~a~~e 38 (104)
T COG4575 9 AIDQLLAELQELLDTLEEVLKSSGSLAGDE 38 (104)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhcccchhhH
Confidence 346666777777777766643333333333
No 245
>PF01466 Skp1: Skp1 family, dimerisation domain; InterPro: IPR016072 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a dimerisation domain found at the C-terminal of SKP1 proteins [], as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. This domain is multi-helical in structure, and consists of an interlocked herterodimer in F-box proteins.; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 2P1O_A 3OGL_G 3OGM_A 3C6O_A 2P1N_A 2P1Q_A 3OGK_I 3C6N_A 3C6P_A 2P1P_A ....
Probab=21.64 E-value=1.7e+02 Score=18.19 Aligned_cols=17 Identities=35% Similarity=0.345 Sum_probs=11.5
Q ss_pred cCcHHHHHHHhcccccc
Q psy8735 91 PKSKKILLEEVGIETDE 107 (115)
Q Consensus 91 p~~kk~L~EelGiEtDe 107 (115)
++++.++-.-+||++|.
T Consensus 45 gks~eeir~~fgi~~d~ 61 (78)
T PF01466_consen 45 GKSPEEIRKYFGIENDL 61 (78)
T ss_dssp TS-HHHHHHHHT---TS
T ss_pred CCCHHHHHHHcCCCCCC
Confidence 48899999999999985
No 246
>PRK04863 mukB cell division protein MukB; Provisional
Probab=21.60 E-value=6.6e+02 Score=24.86 Aligned_cols=66 Identities=14% Similarity=0.160 Sum_probs=38.4
Q ss_pred HHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhh-----cCCCCChhhhcHHhHHHHHHHHHHHHH
Q psy8735 4 KEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSR-----NVNVISEGEVDKKTITDTKEKMVKEYN 73 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~-----~~~~Vs~ee~~~~~v~~~~~k~~~~w~ 73 (115)
+..++.++..|... ..++...+..++.+...++..+..+.+ |...+|.++.. .....|..-..+..
T Consensus 385 leelEeeLeeLqeq--Laelqqel~elQ~el~q~qq~i~~Le~~~~~~~~~~~SdEeLe--~~LenF~aklee~e 455 (1486)
T PRK04863 385 AEAAEEEVDELKSQ--LADYQQALDVQQTRAIQYQQAVQALERAKQLCGLPDLTADNAE--DWLEEFQAKEQEAT 455 (1486)
T ss_pred HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHH--HHHHHHHHHHHHHH
Confidence 34445555555444 234555666666666666666665532 45678888877 77666665554443
No 247
>PF11853 DUF3373: Protein of unknown function (DUF3373); InterPro: IPR021803 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Probab=21.59 E-value=2.3e+02 Score=24.52 Aligned_cols=33 Identities=27% Similarity=0.307 Sum_probs=23.9
Q ss_pred HHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Q psy8735 14 LSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRN 47 (115)
Q Consensus 14 L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~ 47 (115)
|..+++...+...+..++ ++++|+.+|+.|+..
T Consensus 14 l~~s~~~~a~~~~~~~~q-kie~L~kql~~Lk~q 46 (489)
T PF11853_consen 14 LFLSLPAAAMADDIDLLQ-KIEALKKQLEELKAQ 46 (489)
T ss_pred HHhccchhhhhhhhHHHH-HHHHHHHHHHHHHHh
Confidence 455566666677777677 888888888888753
No 248
>TIGR00634 recN DNA repair protein RecN. All proteins in this family for which functions are known are ATP binding proteins involved in the initiation of recombination and recombinational repair.
Probab=21.26 E-value=5.3e+02 Score=22.10 Aligned_cols=67 Identities=15% Similarity=0.145 Sum_probs=43.6
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCChhhhcHHhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8735 17 QMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISEGEVDKKTITDTKEKMVKEYNKRKRMCTEMLEAI 86 (115)
Q Consensus 17 ~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~ee~~~~~v~~~~~k~~~~w~kRKRi~~di~d~I 86 (115)
.+...++...+.+...++..+...|..+.++ -..+|+... .++.....+...-+|...-+.+++..+
T Consensus 265 d~~~~~~~~~l~~~~~~l~d~~~~l~~~~~~-l~~dp~~L~--ele~RL~~l~~LkrKyg~s~e~l~~~~ 331 (563)
T TIGR00634 265 DGSLRELAEQVGNALTEVEEATRELQNYLDE-LEFDPERLN--EIEERLAQIKRLKRKYGASVEEVLEYA 331 (563)
T ss_pred hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCCHHHHH--HHHHHHHHHHHHHHHhCCCHHHHHHHH
Confidence 3444566666666666666666666665543 377898888 888888877776666655555554433
No 249
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=21.22 E-value=4.2e+02 Score=22.81 Aligned_cols=93 Identities=23% Similarity=0.314 Sum_probs=0.0
Q ss_pred hHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCh--hhhcHHhHHHHHHHHHHHHHHHHHHHH
Q psy8735 3 DKEKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNVNVISE--GEVDKKTITDTKEKMVKEYNKRKRMCT 80 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~~~Vs~--ee~~~~~v~~~~~k~~~~w~kRKRi~~ 80 (115)
+++.+...+..+...+...... ...++.+...+.++|..+...-..+.. .... +.+.........|+.--+-++
T Consensus 356 eL~~Le~~~~~~~~~i~~~~~~--ysel~e~leel~e~leeie~eq~ei~e~l~~Lr--k~E~eAr~kL~~~~~~L~~ik 431 (569)
T PRK04778 356 QLESLEKQYDEITERIAEQEIA--YSELQEELEEILKQLEEIEKEQEKLSEMLQGLR--KDELEAREKLERYRNKLHEIK 431 (569)
T ss_pred HHHHHHHHHHHHHHHHHcCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHhcCcCcHHHHHHHh
Q psy8735 81 EMLEAILENYPKSKKILLEEV 101 (115)
Q Consensus 81 di~d~I~E~~p~~kk~L~Eel 101 (115)
..+..- ++|+-|..+.+.+
T Consensus 432 r~l~k~--~lpgip~~y~~~~ 450 (569)
T PRK04778 432 RYLEKS--NLPGLPEDYLEMF 450 (569)
T ss_pred HHHHHc--CCCCCcHHHHHHH
No 250
>PF15294 Leu_zip: Leucine zipper
Probab=21.03 E-value=1.7e+02 Score=23.45 Aligned_cols=25 Identities=28% Similarity=0.310 Sum_probs=15.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
+=+..+|..|+.|+..+.+||..+.
T Consensus 128 ~ll~kEi~rLq~EN~kLk~rl~~le 152 (278)
T PF15294_consen 128 ELLNKEIDRLQEENEKLKERLKSLE 152 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3356666666666666666666664
No 251
>PLN02320 seryl-tRNA synthetase
Probab=20.99 E-value=2.8e+02 Score=23.98 Aligned_cols=42 Identities=17% Similarity=0.298 Sum_probs=20.2
Q ss_pred hHHHHHHHHHHHHccCC-------HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 3 DKEKVKSKLKQLSQQMT-------EAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 3 ~~~~l~seL~~L~s~lt-------~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
+++.|+++.+.+.+.+. .+++..+...|..++..+++.+..+
T Consensus 108 ~~~~lr~ern~~sk~i~~~~~~~~~~~l~~~~k~lk~~i~~le~~~~~~ 156 (502)
T PLN02320 108 EVERLRAERNAVANKMKGKLEPSERQALVEEGKNLKEGLVTLEEDLVKL 156 (502)
T ss_pred HHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555555555544432 3345555555555555555444333
No 252
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=20.95 E-value=1.1e+02 Score=26.72 Aligned_cols=16 Identities=6% Similarity=0.412 Sum_probs=9.1
Q ss_pred hHHHHHHHHHHHHccC
Q psy8735 3 DKEKVKSKLKQLSQQM 18 (115)
Q Consensus 3 ~~~~l~seL~~L~s~l 18 (115)
+++.++.+|..++..+
T Consensus 399 ~~~~~e~el~~l~~~l 414 (650)
T TIGR03185 399 ELRELEEELAEVDKKI 414 (650)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4556666666555543
No 253
>PRK14160 heat shock protein GrpE; Provisional
Probab=20.91 E-value=3.9e+02 Score=20.43 Aligned_cols=23 Identities=30% Similarity=0.440 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 22 EAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 22 el~~~i~~L~~e~~~l~~rL~~l 44 (115)
+++..+..|+.++..++.++..+
T Consensus 58 ~l~~e~~~l~~~l~~l~~e~~el 80 (211)
T PRK14160 58 ELKDENNKLKEENKKLENELEAL 80 (211)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444444444
No 254
>COG3078 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.73 E-value=65 Score=23.76 Aligned_cols=13 Identities=31% Similarity=0.675 Sum_probs=11.5
Q ss_pred HHHHHHhcccccc
Q psy8735 95 KILLEEVGIETDE 107 (115)
Q Consensus 95 k~L~EelGiEtDe 107 (115)
.+|+|.|||+.|+
T Consensus 139 ~~LMe~LGl~~dd 151 (169)
T COG3078 139 DELMEKLGLSYDD 151 (169)
T ss_pred HHHHHHhCCccCC
Confidence 5799999999987
No 255
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=20.65 E-value=5e+02 Score=21.56 Aligned_cols=24 Identities=58% Similarity=0.696 Sum_probs=11.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
.++..++..|..++..+..++..+
T Consensus 330 ~~l~~~~~~l~~~~~~~~~~l~~l 353 (451)
T PF03961_consen 330 PELKEKLEELEEELEELKEELEKL 353 (451)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444555444444444444
No 256
>PF04899 MbeD_MobD: MbeD/MobD like ; InterPro: IPR006983 The MbeD and MobD proteins are plasmid encoded, and are involved in the plasmid mobilisation and transfer in the presence of conjugative plasmids [].
Probab=20.53 E-value=2.3e+02 Score=17.94 Aligned_cols=22 Identities=18% Similarity=0.374 Sum_probs=10.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHh
Q psy8735 23 AKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 23 l~~~i~~L~~e~~~l~~rL~~l 44 (115)
|..++..|..++..|...++.|
T Consensus 47 L~~qv~~Ls~qv~~Ls~ql~rL 68 (70)
T PF04899_consen 47 LSEQVNNLSQQVQRLSEQLERL 68 (70)
T ss_pred HHHHHHHHHHHHHHHHHHHHhh
Confidence 4444444444444444444433
No 257
>PF12795 MscS_porin: Mechanosensitive ion channel porin domain
Probab=20.47 E-value=3.8e+02 Score=20.16 Aligned_cols=57 Identities=21% Similarity=0.280 Sum_probs=36.9
Q ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHHHHhhhcC-CCCChhhhcHHhHHHHH
Q psy8735 5 EKVKSKLKQLSQQMTEAEAKQKKAELQAELEEKKSKLEKLSRNV-NVISEGEVDKKTITDTK 65 (115)
Q Consensus 5 ~~l~seL~~L~s~lt~~el~~~i~~L~~e~~~l~~rL~~l~~~~-~~Vs~ee~~~~~v~~~~ 65 (115)
+...+.|..+.+.| +.+...+........++..+|..+..++ ..+++..+. .+..+.
T Consensus 102 ~~~~~~l~~~~~~p--~~aq~~l~~~~~~l~ei~~~L~~~~~~~~~~l~~a~~~--~l~ae~ 159 (240)
T PF12795_consen 102 QQENSQLIEIQTRP--ERAQQQLSEARQRLQEIRNQLQNLPPNGESPLSEAQRW--LLQAEL 159 (240)
T ss_pred HHHHHHHHHHHccH--HHHHHHHHHHHHHHHHHHHHHhccCCCCcchhhHHHHH--HHHHHH
Confidence 44455555555554 4567777888888888888888776544 677777666 444443
No 258
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=20.47 E-value=1.9e+02 Score=18.75 Aligned_cols=26 Identities=19% Similarity=0.304 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy8735 20 EAEAKQKKAELQAELEEKKSKLEKLS 45 (115)
Q Consensus 20 ~~el~~~i~~L~~e~~~l~~rL~~l~ 45 (115)
..++..++..|+.++..+..++..++
T Consensus 72 ~~~l~~~l~~l~~~~~~~~~~~~~~~ 97 (104)
T PF13600_consen 72 LKELEEELEALEDELAALQDEIQALE 97 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555555555555443
No 259
>PF11855 DUF3375: Protein of unknown function (DUF3375); InterPro: IPR021804 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 479 to 499 amino acids in length.
Probab=20.42 E-value=2.2e+02 Score=24.10 Aligned_cols=37 Identities=22% Similarity=0.469 Sum_probs=29.6
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhhcC-CCCChhh
Q psy8735 19 TEAEAKQKKAELQAELEEKKSKLEKLSRNV-NVISEGE 55 (115)
Q Consensus 19 t~~el~~~i~~L~~e~~~l~~rL~~l~~~~-~~Vs~ee 55 (115)
|+.+-...|+.|+.++.+++..++.+++|. ..+++.+
T Consensus 138 ~~~Dp~~Ri~~Le~e~~~i~~EI~~l~aG~~~~ld~~~ 175 (478)
T PF11855_consen 138 TDPDPERRIAELEREIAEIDAEIDRLEAGDVPVLDDTQ 175 (478)
T ss_pred cCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH
Confidence 566778889999999999999999999875 3445433
No 260
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=20.20 E-value=3.3e+02 Score=20.01 Aligned_cols=24 Identities=50% Similarity=0.577 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 21 AEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 21 ~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
..|+..+..++..+.+++.+...|
T Consensus 115 ~~l~~~l~~l~~kl~e~k~k~~~l 138 (221)
T PF04012_consen 115 EKLKEQLEELEAKLEELKSKREEL 138 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555555555555555555544
No 261
>COG4879 Uncharacterized protein conserved in archaea [Function unknown]
Probab=20.13 E-value=1.6e+02 Score=22.84 Aligned_cols=33 Identities=18% Similarity=0.289 Sum_probs=26.3
Q ss_pred HHHHHHHHHHHHHHHHHHHhcCcC-cHHHHHHHh
Q psy8735 69 VKEYNKRKRMCTEMLEAILENYPK-SKKILLEEV 101 (115)
Q Consensus 69 ~~~w~kRKRi~~di~d~I~E~~p~-~kk~L~Eel 101 (115)
+..+|.|=.+.+|+|..++++-.. +...+.|.+
T Consensus 5 r~~~r~RI~~l~d~wer~l~~we~lsRe~~ieil 38 (243)
T COG4879 5 RQLHRDRIEALLDLWERVLNGWETLSRESLIEIL 38 (243)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHH
Confidence 455788888999999999998887 666666665
No 262
>COG1842 PspA Phage shock protein A (IM30), suppresses sigma54-dependent transcription [Transcription / Signal transduction mechanisms]
Probab=20.06 E-value=3.3e+02 Score=20.92 Aligned_cols=41 Identities=24% Similarity=0.362 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHccCC-----HHHHHHHHHHHHHHHHHHHHHHHHh
Q psy8735 4 KEKVKSKLKQLSQQMT-----EAEAKQKKAELQAELEEKKSKLEKL 44 (115)
Q Consensus 4 ~~~l~seL~~L~s~lt-----~~el~~~i~~L~~e~~~l~~rL~~l 44 (115)
+..++..+..+.+.++ .+.|+..+..|+..|.+++.+...+
T Consensus 94 ~~~le~~~~~~~~~~~~~~~~~~~l~~~~~~Le~Ki~e~~~~~~~l 139 (225)
T COG1842 94 KQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEAL 139 (225)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555555555555543 3556666677777777776666665
Done!