Query 029243
Match_columns 196
No_of_seqs 77 out of 79
Neff 2.6
Searched_HMMs 46136
Date Fri Mar 29 09:47:58 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029243.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029243hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF11460 DUF3007: Protein of u 100.0 1.8E-51 3.8E-56 318.8 7.4 97 87-184 1-104 (104)
2 PF14163 SieB: Superinfection 94.8 0.079 1.7E-06 41.7 5.5 74 109-185 19-94 (151)
3 PF04156 IncA: IncA protein; 93.9 0.12 2.7E-06 41.2 5.1 75 108-183 26-105 (191)
4 PF12576 DUF3754: Protein of u 89.2 0.36 7.9E-06 38.5 2.8 70 85-162 57-127 (141)
5 PF06305 DUF1049: Protein of u 86.5 0.62 1.3E-05 31.7 2.3 25 118-142 18-42 (68)
6 PF06518 DUF1104: Protein of u 86.3 1.5 3.3E-05 33.8 4.6 40 147-187 35-77 (93)
7 PF03250 Tropomodulin: Tropomo 85.9 0.63 1.4E-05 38.9 2.5 36 149-184 3-38 (147)
8 PF08621 RPAP1_N: RPAP1-like, 83.7 1.5 3.2E-05 30.4 3.1 22 163-184 8-29 (49)
9 PF04120 Iron_permease: Low af 82.5 7.9 0.00017 31.5 7.4 48 143-190 62-115 (132)
10 PF13618 Gluconate_2-dh3: Gluc 77.5 2.3 4.9E-05 32.1 2.6 28 160-187 61-88 (131)
11 PRK11864 2-ketoisovalerate fer 74.4 1.9 4E-05 39.0 1.7 46 140-185 233-288 (300)
12 PTZ00432 falcilysin; Provision 74.0 8.7 0.00019 39.8 6.5 43 145-187 571-615 (1119)
13 TIGR03348 VI_IcmF type VI secr 73.9 5.2 0.00011 41.4 4.9 84 96-185 3-93 (1169)
14 PF06295 DUF1043: Protein of u 73.5 2.7 5.8E-05 33.2 2.2 20 126-148 4-23 (128)
15 TIGR03142 cytochro_ccmI cytoch 73.5 4.5 9.7E-05 31.1 3.4 16 169-184 56-71 (117)
16 PRK11677 hypothetical protein; 73.1 5.1 0.00011 32.7 3.8 16 126-141 8-23 (134)
17 PRK03557 zinc transporter ZitB 70.7 7.1 0.00015 34.3 4.4 33 81-113 77-113 (312)
18 TIGR01297 CDF cation diffusion 70.6 14 0.00031 30.4 5.9 79 81-159 48-134 (268)
19 PRK11380 hypothetical protein; 69.4 6.7 0.00015 36.9 4.2 34 140-179 89-122 (353)
20 PF12666 PrgI: PrgI family pro 67.2 12 0.00026 27.1 4.3 34 80-114 10-44 (93)
21 PRK00888 ftsB cell division pr 66.5 8.6 0.00019 29.6 3.6 51 134-187 12-62 (105)
22 PRK03573 transcriptional regul 65.9 28 0.00061 26.4 6.3 100 83-184 25-132 (144)
23 PF11003 DUF2842: Protein of u 65.4 6.6 0.00014 28.2 2.6 45 94-138 3-49 (62)
24 PF07946 DUF1682: Protein of u 65.2 15 0.00032 32.8 5.3 16 165-180 297-312 (321)
25 PF06814 Lung_7-TM_R: Lung sev 64.5 9.8 0.00021 32.7 4.0 77 88-168 150-232 (295)
26 PF04977 DivIC: Septum formati 63.4 7.1 0.00015 26.7 2.4 36 146-184 14-49 (80)
27 PF11304 DUF3106: Protein of u 63.2 16 0.00036 28.1 4.6 22 163-184 34-55 (107)
28 PRK11865 pyruvate ferredoxin o 62.9 6 0.00013 35.7 2.5 45 140-186 237-292 (299)
29 PF05915 DUF872: Eukaryotic pr 61.9 21 0.00045 28.3 5.1 45 90-137 41-88 (115)
30 KOG2019 Metalloendoprotease HM 61.7 15 0.00033 38.3 5.3 39 146-184 493-531 (998)
31 PF12597 DUF3767: Protein of u 61.6 26 0.00055 27.8 5.5 51 114-177 62-112 (118)
32 PF06518 DUF1104: Protein of u 60.3 7.2 0.00016 30.1 2.2 31 147-178 58-91 (93)
33 PRK11091 aerobic respiration c 58.6 17 0.00036 34.6 4.7 11 132-142 64-74 (779)
34 PF07226 DUF1422: Protein of u 56.5 16 0.00035 29.8 3.6 45 93-139 65-109 (117)
35 TIGR00219 mreC rod shape-deter 56.1 6.5 0.00014 34.7 1.5 30 159-189 74-103 (283)
36 PRK11462 putative transporter; 56.1 33 0.00072 30.9 5.9 23 165-187 425-447 (460)
37 PF08451 A_deaminase_N: Adenos 55.1 14 0.00031 28.1 3.1 36 142-177 14-52 (96)
38 TIGR02698 CopY_TcrY copper tra 55.0 47 0.001 26.0 6.0 30 161-190 91-122 (130)
39 COG0099 RpsM Ribosomal protein 54.1 13 0.00028 30.4 2.7 22 163-184 41-62 (121)
40 PRK10236 hypothetical protein; 53.3 15 0.00033 32.9 3.3 26 160-185 119-144 (237)
41 PF15372 DUF4600: Domain of un 51.8 16 0.00034 30.1 2.9 26 165-190 46-71 (129)
42 COG1422 Predicted membrane pro 51.6 9.3 0.0002 33.5 1.7 53 126-182 49-107 (201)
43 PF14143 YrhC: YrhC-like prote 49.8 31 0.00066 25.8 4.0 43 94-137 12-56 (72)
44 PF01956 DUF106: Integral memb 49.6 9.6 0.00021 30.4 1.4 54 134-188 22-84 (168)
45 PF03457 HA: Helicase associat 49.4 24 0.00052 24.0 3.2 22 152-180 43-64 (68)
46 PLN03094 Substrate binding sub 48.4 26 0.00056 32.8 4.1 26 84-109 75-100 (370)
47 PRK11056 hypothetical protein; 47.7 30 0.00065 28.4 3.9 45 93-139 65-109 (120)
48 PF08172 CASP_C: CASP C termin 47.6 28 0.00061 30.7 4.1 35 152-187 85-121 (248)
49 PF12768 Rax2: Cortical protei 46.6 5.1 0.00011 35.6 -0.6 17 86-103 223-239 (281)
50 cd01390 HMGB-UBF_HMG-box HMGB- 46.3 80 0.0017 20.5 5.5 28 160-187 32-59 (66)
51 PF05961 Chordopox_A13L: Chord 46.2 21 0.00045 26.9 2.6 21 90-110 3-23 (68)
52 PRK09509 fieF ferrous iron eff 46.1 41 0.00088 29.1 4.8 14 81-94 69-82 (299)
53 PF10439 Bacteriocin_IIc: Bact 46.0 13 0.00027 26.3 1.4 14 165-178 3-16 (65)
54 cd03381 PAP2_glucose_6_phospha 45.6 24 0.00052 30.8 3.3 32 88-119 160-191 (235)
55 PF03965 Penicillinase_R: Peni 44.8 71 0.0015 23.9 5.4 46 139-184 59-115 (115)
56 PF07047 OPA3: Optic atrophy 3 44.7 23 0.00049 28.2 2.8 27 160-187 107-133 (134)
57 PF00505 HMG_box: HMG (high mo 44.5 80 0.0017 20.8 5.0 28 160-187 32-59 (69)
58 PF10550 Toxin_36: Conantokin- 44.5 27 0.00059 19.6 2.2 14 172-185 2-15 (15)
59 PF12072 DUF3552: Domain of un 44.3 13 0.00029 30.9 1.5 12 173-184 60-71 (201)
60 PF15361 RIC3: Resistance to i 44.3 8.8 0.00019 31.5 0.4 23 155-178 128-150 (152)
61 COG1026 Predicted Zn-dependent 42.7 64 0.0014 34.2 6.3 49 140-188 450-498 (978)
62 cd00084 HMG-box High Mobility 42.6 89 0.0019 19.9 5.5 29 160-188 32-60 (66)
63 PRK09706 transcriptional repre 42.4 47 0.001 25.4 4.2 25 160-184 89-113 (135)
64 PF07895 DUF1673: Protein of u 42.2 44 0.00095 28.4 4.3 53 90-160 57-109 (205)
65 KOG1962 B-cell receptor-associ 42.1 42 0.0009 29.7 4.2 45 144-188 112-173 (216)
66 PF04855 SNF5: SNF5 / SMARCB1 41.8 20 0.00044 31.4 2.3 26 160-185 217-242 (244)
67 PF13902 R3H-assoc: R3H-associ 40.4 50 0.0011 25.6 4.1 39 151-190 63-101 (131)
68 PRK00306 50S ribosomal protein 40.2 38 0.00082 24.0 3.1 22 165-186 4-25 (66)
69 PF09928 DUF2160: Predicted sm 40.0 60 0.0013 25.4 4.4 32 83-119 38-69 (88)
70 PF09991 DUF2232: Predicted me 39.9 1.6E+02 0.0034 24.3 7.1 17 169-185 137-153 (290)
71 smart00398 HMG high mobility g 39.6 1E+02 0.0023 19.9 5.6 28 160-187 33-60 (70)
72 PF09972 DUF2207: Predicted me 38.3 40 0.00087 29.7 3.6 16 92-107 400-415 (511)
73 KOG2224 Uncharacterized conser 38.2 34 0.00074 34.3 3.4 42 140-182 447-495 (781)
74 PTZ00246 proteasome subunit al 38.1 25 0.00055 29.7 2.3 23 167-189 226-248 (253)
75 PRK09848 glucuronide transport 38.1 1.6E+02 0.0036 25.6 7.3 19 169-187 428-446 (448)
76 PF01638 HxlR: HxlR-like helix 36.7 19 0.00041 26.0 1.1 41 136-185 6-46 (90)
77 PF11239 DUF3040: Protein of u 36.3 35 0.00076 24.8 2.5 16 169-185 2-17 (82)
78 PF00645 zf-PARP: Poly(ADP-rib 36.0 20 0.00043 25.4 1.1 18 166-183 65-82 (82)
79 PRK13453 F0F1 ATP synthase sub 35.9 49 0.0011 26.8 3.5 48 98-159 6-54 (173)
80 COG2863 Cytochrome c553 [Energ 35.4 59 0.0013 26.4 3.8 35 152-186 62-101 (121)
81 COG1896 Predicted hydrolases o 35.0 43 0.00094 28.2 3.2 40 143-184 75-123 (193)
82 PLN02220 delta-9 acyl-lipid de 34.7 65 0.0014 29.2 4.4 21 76-96 14-37 (299)
83 smart00347 HTH_MARR helix_turn 34.6 97 0.0021 20.9 4.3 88 83-175 4-101 (101)
84 PRK12585 putative monovalent c 34.3 63 0.0014 28.5 4.1 16 151-166 103-118 (197)
85 PRK14748 kdpF potassium-transp 34.2 47 0.001 21.4 2.5 25 87-111 1-25 (29)
86 PRK06474 hypothetical protein; 33.9 44 0.00095 27.5 3.0 15 170-184 136-150 (178)
87 KOG2568 Predicted membrane pro 33.4 49 0.0011 32.7 3.6 80 88-168 305-385 (518)
88 PRK13729 conjugal transfer pil 33.3 40 0.00086 33.0 3.0 28 144-171 57-89 (475)
89 PF06945 DUF1289: Protein of u 33.1 63 0.0014 22.0 3.2 23 165-187 28-50 (51)
90 PHA03049 IMV membrane protein; 33.1 44 0.00096 25.2 2.6 20 90-109 3-22 (68)
91 cd00427 Ribosomal_L29_HIP Ribo 33.1 48 0.001 22.8 2.6 20 167-186 3-22 (57)
92 KOG4171 Adenylate/guanylate ki 32.9 42 0.00092 34.1 3.2 28 160-188 385-412 (671)
93 PF13858 DUF4199: Protein of u 32.6 1.3E+02 0.0029 23.1 5.3 18 136-153 46-65 (163)
94 COG5478 Predicted small integr 32.5 1.1E+02 0.0023 25.9 5.0 19 168-186 98-116 (141)
95 TIGR00012 L29 ribosomal protei 32.5 51 0.0011 22.6 2.7 19 167-185 2-20 (55)
96 PF01099 Uteroglobin: Uteroglo 32.2 79 0.0017 22.2 3.7 34 150-183 18-59 (67)
97 PF15188 CCDC-167: Coiled-coil 32.1 74 0.0016 24.5 3.7 38 156-193 19-56 (85)
98 COG2155 Uncharacterized conser 31.9 29 0.00062 26.7 1.5 58 91-148 5-68 (79)
99 PF10661 EssA: WXG100 protein 31.5 50 0.0011 27.0 2.9 18 95-112 125-142 (145)
100 PRK05179 rpsM 30S ribosomal pr 30.7 36 0.00079 27.1 1.9 21 164-184 42-62 (122)
101 PF07256 DUF1435: Protein of u 30.5 57 0.0012 25.0 2.8 47 96-159 9-56 (78)
102 COG0255 RpmC Ribosomal protein 30.5 64 0.0014 23.8 3.0 21 165-185 6-26 (69)
103 PF10742 DUF2555: Protein of u 30.5 45 0.00098 24.3 2.2 23 164-186 3-25 (57)
104 PF13239 2TM: 2TM domain 29.8 1.6E+02 0.0035 21.1 5.0 71 88-169 10-83 (83)
105 PF03047 ComC: COMC family; I 29.7 18 0.00038 23.4 0.0 15 165-179 9-23 (32)
106 cd07321 Extradiol_Dioxygenase_ 29.4 85 0.0018 23.0 3.5 25 154-180 18-45 (77)
107 PHA02698 hypothetical protein; 29.3 77 0.0017 24.8 3.4 27 159-185 55-81 (89)
108 TIGR00439 ftsX putative protei 29.3 54 0.0012 29.2 3.0 16 126-141 282-297 (309)
109 CHL00137 rps13 ribosomal prote 29.1 39 0.00084 27.0 1.8 21 164-184 42-62 (122)
110 CHL00154 rpl29 ribosomal prote 29.1 70 0.0015 23.2 3.0 21 166-186 8-28 (67)
111 TIGR00787 dctP tripartite ATP- 29.0 63 0.0014 26.9 3.2 21 164-184 213-233 (257)
112 cd06171 Sigma70_r4 Sigma70, re 28.6 78 0.0017 18.5 2.8 18 162-179 3-20 (55)
113 PF01545 Cation_efflux: Cation 28.4 86 0.0019 25.9 3.8 31 81-111 57-88 (284)
114 PF02096 60KD_IMP: 60Kd inner 28.3 44 0.00096 26.9 2.1 44 141-184 13-59 (198)
115 PF02697 DUF217: Uncharacteriz 28.2 2E+02 0.0044 21.1 5.3 42 146-187 19-65 (71)
116 PF08285 DPM3: Dolichol-phosph 28.0 47 0.001 25.4 2.1 84 87-191 1-86 (91)
117 PF00831 Ribosomal_L29: Riboso 27.9 54 0.0012 22.7 2.2 20 166-185 3-22 (58)
118 PF13687 DUF4153: Domain of un 27.9 46 0.001 27.6 2.2 67 84-160 150-216 (217)
119 PF09682 Holin_LLH: Phage holi 27.9 73 0.0016 24.3 3.1 26 133-158 20-45 (108)
120 TIGR03631 bact_S13 30S ribosom 27.8 44 0.00096 26.2 1.9 20 165-184 41-60 (113)
121 PF05529 Bap31: B-cell recepto 27.8 54 0.0012 26.6 2.5 13 172-184 160-172 (192)
122 PF02899 Phage_int_SAM_1: Phag 27.7 75 0.0016 21.2 2.8 37 150-186 21-60 (84)
123 PLN02435 probable UDP-N-acetyl 27.7 76 0.0017 30.9 3.8 31 154-184 28-58 (493)
124 PF06738 DUF1212: Protein of u 27.7 1.6E+02 0.0036 23.4 5.2 77 110-187 23-99 (193)
125 PRK14549 50S ribosomal protein 27.6 80 0.0017 22.8 3.1 22 165-186 7-28 (69)
126 COG4858 Uncharacterized membra 27.6 49 0.0011 29.6 2.4 38 107-144 146-183 (226)
127 COG1230 CzcD Co/Zn/Cd efflux s 27.5 77 0.0017 29.0 3.6 74 83-157 82-165 (296)
128 PF10061 DUF2299: Uncharacteri 27.5 68 0.0015 26.0 3.0 26 161-186 58-83 (138)
129 TIGR02408 ectoine_ThpD ectoine 27.2 32 0.00069 29.6 1.1 35 152-186 19-53 (277)
130 PRK00461 rpmC 50S ribosomal pr 27.1 76 0.0017 24.3 3.0 22 165-186 3-24 (87)
131 PF09012 FeoC: FeoC like trans 26.6 36 0.00077 23.5 1.1 33 151-184 8-40 (69)
132 PF14261 DUF4351: Domain of un 26.6 97 0.0021 21.7 3.3 20 162-181 26-45 (59)
133 PF11326 DUF3128: Protein of u 26.6 1.6E+02 0.0034 21.5 4.5 45 151-195 36-80 (84)
134 PRK10878 hypothetical protein; 26.5 52 0.0011 24.3 2.0 27 161-187 10-36 (72)
135 KOG0182 20S proteasome, regula 26.4 1.6E+02 0.0035 26.8 5.4 50 134-185 62-124 (246)
136 TIGR02838 spore_V_AC stage V s 26.3 74 0.0016 26.6 3.1 51 80-130 11-62 (141)
137 PRK05457 heat shock protein Ht 26.3 3.3E+02 0.0073 24.0 7.3 35 151-187 211-245 (284)
138 smart00529 HTH_DTXR Helix-turn 26.0 2.3E+02 0.0051 19.7 5.9 72 112-184 8-81 (96)
139 PF09677 TrbI_Ftype: Type-F co 25.8 65 0.0014 25.3 2.5 14 168-181 52-65 (111)
140 PTZ00459 mucin-associated surf 25.8 33 0.00071 30.7 1.0 13 126-138 9-21 (291)
141 PTZ00134 40S ribosomal protein 25.8 38 0.00082 28.3 1.3 21 164-184 55-75 (154)
142 PF10953 DUF2754: Protein of u 25.7 45 0.00098 25.0 1.5 46 92-143 15-60 (70)
143 COG0053 MMT1 Predicted Co/Zn/C 25.5 1.1E+02 0.0023 27.3 4.1 67 81-147 71-143 (304)
144 PF02686 Glu-tRNAGln: Glu-tRNA 25.3 66 0.0014 21.9 2.2 15 170-184 1-15 (72)
145 TIGR03798 ocin_TIGR03798 bacte 25.3 1.1E+02 0.0023 21.2 3.3 27 153-179 2-34 (64)
146 COG2938 Uncharacterized conser 25.2 58 0.0013 25.4 2.1 29 160-188 34-62 (94)
147 PHA02592 52 DNA topisomerase I 25.2 76 0.0017 30.3 3.3 29 160-188 389-417 (439)
148 TIGR00782 ccoP cytochrome c ox 25.1 2.2E+02 0.0049 24.7 5.9 17 92-108 32-48 (285)
149 TIGR03766 conserved hypothetic 25.0 79 0.0017 29.6 3.4 32 150-181 330-361 (483)
150 KOG3248 Transcription factor T 25.0 1.2E+02 0.0026 29.4 4.5 38 149-186 200-249 (421)
151 PRK13922 rod shape-determining 25.0 1.7E+02 0.0038 24.9 5.2 25 160-184 78-104 (276)
152 PF02468 PsbN: Photosystem II 24.9 66 0.0014 22.2 2.1 25 90-114 4-28 (43)
153 PF11833 DUF3353: Protein of u 24.8 1.1E+02 0.0023 26.1 3.9 51 88-139 138-191 (194)
154 PRK04654 sec-independent trans 24.8 76 0.0016 28.2 3.0 32 151-182 35-66 (214)
155 PF10083 DUF2321: Uncharacteri 24.8 86 0.0019 26.8 3.2 23 163-185 91-113 (158)
156 cd01389 MATA_HMG-box MATA_HMG- 24.7 1.4E+02 0.0031 20.7 3.9 28 160-187 33-60 (77)
157 cd00633 Secretoglobin Secretog 24.4 1.7E+02 0.0038 20.3 4.3 35 149-183 17-59 (67)
158 PF04695 Pex14_N: Peroxisomal 24.3 78 0.0017 25.1 2.8 26 159-186 26-51 (136)
159 cd02000 TPP_E1_PDC_ADC_BCADC T 24.3 1E+02 0.0022 26.8 3.7 58 131-188 219-288 (293)
160 TIGR02209 ftsL_broad cell divi 24.0 2.6E+02 0.0056 19.5 5.4 25 159-184 32-56 (85)
161 cd01388 SOX-TCF_HMG-box SOX-TC 24.0 1.5E+02 0.0033 20.3 3.9 28 161-188 34-61 (72)
162 PRK05382 chorismate synthase; 23.8 1.5E+02 0.0032 28.0 4.8 51 136-188 144-194 (359)
163 PF09446 VMA21: VMA21-like dom 23.8 79 0.0017 22.6 2.4 52 91-143 7-64 (66)
164 smart00096 UTG Uteroglobin. 23.6 1.9E+02 0.0042 21.4 4.5 35 150-184 20-62 (69)
165 PF13807 GNVR: G-rich domain o 23.5 1.5E+02 0.0033 21.0 3.9 36 76-111 45-80 (82)
166 PRK02463 OxaA-like protein pre 23.4 5.5E+02 0.012 23.4 8.3 44 141-184 70-125 (307)
167 PHA01080 hypothetical protein 23.4 42 0.00091 26.0 1.0 31 110-140 39-69 (80)
168 PF03937 Sdh5: Flavinator of s 23.3 79 0.0017 22.7 2.4 28 160-187 22-49 (74)
169 PRK14471 F0F1 ATP synthase sub 23.3 1.2E+02 0.0025 24.1 3.6 35 121-159 10-44 (164)
170 PRK11512 DNA-binding transcrip 23.3 2.5E+02 0.0053 21.5 5.3 93 83-184 34-141 (144)
171 PRK10870 transcriptional repre 23.2 2.2E+02 0.0047 23.1 5.2 71 110-184 78-157 (176)
172 PRK00247 putative inner membra 23.2 3E+02 0.0064 26.6 6.8 63 112-184 23-93 (429)
173 PRK13183 psbN photosystem II r 23.1 55 0.0012 23.0 1.5 29 87-115 4-32 (46)
174 COG2177 FtsX Cell division pro 23.1 91 0.002 28.2 3.3 30 112-141 260-289 (297)
175 PF04277 OAD_gamma: Oxaloaceta 22.8 28 0.0006 24.5 -0.1 17 170-186 61-77 (79)
176 PF10666 Phage_Gp14: Phage pro 22.7 1.7E+02 0.0038 24.6 4.6 47 138-184 21-76 (140)
177 COG3771 Predicted membrane pro 22.7 25 0.00055 27.9 -0.3 40 121-160 42-83 (97)
178 PF12616 DUF3775: Protein of u 22.5 1.1E+02 0.0024 22.8 3.0 21 160-180 3-23 (75)
179 PF07037 DUF1323: Putative tra 22.5 60 0.0013 26.7 1.8 43 143-185 66-108 (122)
180 PF06374 NDUF_C2: NADH-ubiquin 22.5 2E+02 0.0043 23.3 4.8 23 115-140 47-70 (117)
181 PF13623 SurA_N_2: SurA N-term 22.4 3.1E+02 0.0067 22.2 5.9 17 143-159 43-62 (145)
182 PRK09814 beta-1,6-galactofuran 22.4 1.9E+02 0.0041 24.9 4.9 47 146-192 266-319 (333)
183 COG3923 PriC Primosomal replic 22.3 1.3E+02 0.0028 26.2 3.9 37 153-189 113-156 (175)
184 TIGR02530 flg_new flagellar op 22.3 1.2E+02 0.0026 23.9 3.4 23 164-186 22-48 (96)
185 PF07835 COX4_pro_2: Bacterial 22.3 62 0.0013 21.9 1.6 14 146-159 6-19 (44)
186 PRK10429 melibiose:sodium symp 22.2 2.9E+02 0.0063 24.7 6.2 9 109-117 390-398 (473)
187 COG4653 Predicted phage phi-C3 22.0 89 0.0019 30.4 3.1 26 160-186 30-55 (422)
188 cd04748 Commd COMM_Domain, a f 22.0 93 0.002 22.4 2.6 19 169-187 67-85 (87)
189 KOG3882 Tetraspanin family int 22.0 1.4E+02 0.003 24.6 3.9 23 87-109 51-73 (237)
190 PRK01381 Trp operon repressor; 21.9 1.3E+02 0.0028 23.8 3.5 25 159-183 21-46 (99)
191 PRK02944 OxaA-like protein pre 21.9 3.6E+02 0.0079 23.6 6.6 44 141-184 69-119 (255)
192 MTH00046 CYTB cytochrome b; Va 21.8 1.2E+02 0.0025 28.4 3.8 45 62-106 189-235 (355)
193 CHL00019 atpF ATP synthase CF0 21.8 1.3E+02 0.0028 24.5 3.6 41 113-160 20-61 (184)
194 PF10762 DUF2583: Protein of u 21.7 1.1E+02 0.0023 24.2 3.0 37 101-139 21-60 (89)
195 cd06660 Aldo_ket_red Aldo-keto 21.7 92 0.002 25.6 2.8 44 132-180 239-284 (285)
196 KOG1937 Uncharacterized conser 21.6 80 0.0017 31.3 2.7 22 146-167 307-328 (521)
197 PF03419 Peptidase_U4: Sporula 21.5 88 0.0019 27.2 2.8 50 90-139 86-140 (293)
198 PF13148 DUF3987: Protein of u 21.5 1.3E+02 0.0029 26.0 3.8 12 151-162 74-85 (378)
199 PF02038 ATP1G1_PLM_MAT8: ATP1 21.4 1.2E+02 0.0025 21.6 2.9 26 81-106 3-32 (50)
200 PF08920 SF3b1: Splicing facto 21.4 62 0.0013 26.8 1.7 27 153-179 69-96 (144)
201 PF08784 RPA_C: Replication pr 21.4 1.1E+02 0.0025 22.2 3.0 25 160-185 68-92 (102)
202 PRK04156 gltX glutamyl-tRNA sy 21.2 1.1E+02 0.0025 30.2 3.7 21 162-182 53-73 (567)
203 PF07271 Cytadhesin_P30: Cytad 21.1 1.4E+02 0.003 27.6 4.0 8 128-135 84-91 (279)
204 PHA00649 hypothetical protein 21.1 1.1E+02 0.0024 23.7 2.9 23 90-112 56-78 (83)
205 PF14164 YqzH: YqzH-like prote 21.0 1.1E+02 0.0024 22.6 2.8 23 168-190 24-46 (64)
206 PRK05732 2-octaprenyl-6-methox 20.9 40 0.00087 28.7 0.6 17 87-103 1-17 (395)
207 PRK08455 fliL flagellar basal 20.5 96 0.0021 26.0 2.7 23 91-113 20-42 (182)
208 PF13349 DUF4097: Domain of un 20.4 94 0.002 23.5 2.4 16 91-106 2-17 (166)
209 PRK10692 hypothetical protein; 20.3 99 0.0021 24.5 2.5 36 102-139 22-60 (92)
210 PRK06475 salicylate hydroxylas 20.3 45 0.00097 29.2 0.7 27 88-116 1-27 (400)
211 PRK03598 putative efflux pump 20.2 71 0.0015 27.6 1.9 26 87-112 1-27 (331)
212 PRK08317 hypothetical protein; 20.1 4.2E+02 0.009 20.5 7.4 71 116-188 153-224 (241)
No 1
>PF11460 DUF3007: Protein of unknown function (DUF3007); InterPro: IPR021562 This is a family of uncharacterised proteins found in bacteria and eukaryotes.
Probab=100.00 E-value=1.8e-51 Score=318.75 Aligned_cols=97 Identities=55% Similarity=0.856 Sum_probs=95.5
Q ss_pred ccchhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHHHH----
Q 029243 87 YTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKV---- 162 (196)
Q Consensus 87 MtR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~av---- 162 (196)
|||+|||+||+++|++|+++|++|+++|+|+++||||||++||+|+ |+|++||+|||+||+|||||||||||+++
T Consensus 1 MtR~dvl~Iglgv~~~Gg~~Y~~l~~~G~d~~~AGi~sq~~lv~gl-vgW~~sYlfRV~t~~MTy~~Q~k~Ye~a~~~~~ 79 (104)
T PF11460_consen 1 MTRIDVLLIGLGVFLLGGLLYGGLQAAGLDSLSAGIWSQALLVLGL-VGWVSSYLFRVVTGKMTYMQQRKDYEEAVDQLT 79 (104)
T ss_pred CcccceeeecHHHHHHHHHHHHHHHHcCCCchhhhHHHHHHHHHHH-HHHHhHHHhhhccCCCcHHHHHHHHHHHHHHHh
Confidence 8999999999999999999999999999999999999999999986 99999999999999999999999999999
Q ss_pred ---HHHHHhcCCHHHHHHHHHHHHH
Q 029243 163 ---MQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 163 ---LQKRleeLtpEEle~L~aEIE~ 184 (196)
||||||||||||+|+||+||||
T Consensus 80 ~~~lqkRle~l~~eE~~~L~~eiee 104 (104)
T PF11460_consen 80 NEELQKRLEELSPEELEALQAEIEE 104 (104)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHhcC
Confidence 9999999999999999999985
No 2
>PF14163 SieB: Superinfection exclusion protein B
Probab=94.75 E-value=0.079 Score=41.75 Aligned_cols=74 Identities=20% Similarity=0.257 Sum_probs=46.2
Q ss_pred hhHHhCCCcc--cchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 109 GLELFGVDPL--QAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 109 gLq~~GlD~~--~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~E 185 (196)
.++.+|+|.+ .-+.|.-.++++ +++.+.+.++...-+.-... +.+.-+.+.++|+++.|||+|++-|-+=+.+.
T Consensus 19 ~~~~l~l~~~~~~y~~~i~~~fl~--s~s~li~~~~~~~~~~~~~~-~~~k~~~~~~~~~l~~Lt~~EkavL~~~~~~~ 94 (151)
T PF14163_consen 19 LLEWLNLDKFEIKYQPWIGLIFLF--SVSYLIAQLLSFIYKEAKDR-YQRKRKKKKIEKKLNSLTPEEKAVLREFYIQG 94 (151)
T ss_pred HHHHhCcchHHHhcchHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhCCHHHHHHHHHHHHCC
Confidence 3678888874 455666666665 34555555544442222222 33333466788889999999999887766654
No 3
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=93.92 E-value=0.12 Score=41.22 Aligned_cols=75 Identities=20% Similarity=0.175 Sum_probs=34.7
Q ss_pred HhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHH-----HHHHHHHHHhcCCHHHHHHHHHHH
Q 029243 108 SGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDY-----EDKVMQKRLEGLTEAELEALIEQV 182 (196)
Q Consensus 108 ~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdY-----E~avLQKRleeLtpEEle~L~aEI 182 (196)
..+-..|+-+.-++...=+++.+|+++.=.+.|.+-.-...=.+..+.-+. +....++++.+++ +|++.+++.+
T Consensus 26 Lv~~~~~l~~~~s~~lg~~~lAlg~vL~~~g~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~l~~l~-~el~~l~~~~ 104 (191)
T PF04156_consen 26 LVLFISGLGALISFILGIALLALGVVLLSLGLLCLLSKRPVQSVRPQQIEEPRLQGELSELQQQLQQLQ-EELDQLQERI 104 (191)
T ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccchHHHHHhhhhhhhHHhHHHHHHHHH-HHHHHHHHHH
Confidence 344444555555555555555555533333333333322223333333333 4455555665554 4555554444
Q ss_pred H
Q 029243 183 E 183 (196)
Q Consensus 183 E 183 (196)
.
T Consensus 105 ~ 105 (191)
T PF04156_consen 105 Q 105 (191)
T ss_pred H
Confidence 3
No 4
>PF12576 DUF3754: Protein of unknown function (DUF3754); InterPro: IPR022227 This domain family is found in bacteria, archaea and eukaryotes, and is typically between 135 and 166 amino acids in length. There is a single completely conserved residue P that may be functionally important.
Probab=89.16 E-value=0.36 Score=38.50 Aligned_cols=70 Identities=17% Similarity=0.135 Sum_probs=41.1
Q ss_pred ccccchhhhhhhHHHHHHHHHHHHhhHHhCCCc-ccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHHHH
Q 029243 85 FGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDP-LQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKV 162 (196)
Q Consensus 85 fgMtR~DVI~IGlgvfllGg~ly~gLq~~GlD~-~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~av 162 (196)
=.||-.|-+.+++.+.+.|+...+-+.....+. .....+.-+.+ ++=+++|.+|+.+ +|.+++.+|...+
T Consensus 57 v~~~~~D~~~l~~~~vvg~v~~~~~~~~~~~~~~~~~~~~~~~~~-----l~~~~~~~~r~~~---~~~~~~~ry~~~l 127 (141)
T PF12576_consen 57 VRMRPFDRVKLGVSAVVGGVAVFVKLVGMSLLLLSDIFLILILSL-----LSALGGYAFRQYT---GYKNNRARYQLLL 127 (141)
T ss_pred CCcCHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence 478899999888877555544333322222111 11122222222 3334688999985 8999999998543
No 5
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=86.54 E-value=0.62 Score=31.74 Aligned_cols=25 Identities=28% Similarity=0.550 Sum_probs=15.8
Q ss_pred ccchhHHHHHHHhhhhhheeeeeee
Q 029243 118 LQAGNVVEVIVVLGLTLGWVSTYIF 142 (196)
Q Consensus 118 ~~AGiwsQ~lLVlGlvvgWv~SYlf 142 (196)
...+++.=+.+++|+++||+.+...
T Consensus 18 ~pl~l~il~~f~~G~llg~l~~~~~ 42 (68)
T PF06305_consen 18 LPLGLLILIAFLLGALLGWLLSLPS 42 (68)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555556677788888876643
No 6
>PF06518 DUF1104: Protein of unknown function (DUF1104); InterPro: IPR009488 This family consists of several hypothetical proteins of unknown function which appear to be found exclusively in Helicobacter pylori.; PDB: 2XRH_A.
Probab=86.27 E-value=1.5 Score=33.78 Aligned_cols=40 Identities=20% Similarity=0.324 Sum_probs=29.6
Q ss_pred CCccHHHHHHHHH---HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 147 KEMTYAQQLRDYE---DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 147 k~MTY~qQ~kdYE---~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
+.|++-++ ++|. .+.++|.++.||++|-.+.+++|-.+=+
T Consensus 35 ~~m~~~~~-k~f~~~~~~~~~kn~~~ms~~e~~k~~~ev~k~~~ 77 (93)
T PF06518_consen 35 KKMKEKEA-KDFKKQFKEAARKNLSKMSVEERKKRREEVRKALE 77 (93)
T ss_dssp HHS-HHHH-HHHHHHHHHHHHHHHTTS-HHHHHHHHHHHHHHHH
T ss_pred HhCCHHHH-HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH
Confidence 45776544 4665 7789999999999999999999977433
No 7
>PF03250 Tropomodulin: Tropomodulin; InterPro: IPR004934 Actin filaments have an intrinsic polarity, each with a fast-growing (barbed) end and a slow-growing (pointed) end. To regulate the dynamics at these ends, capping proteins have evolved that specifically bind to either the barbed or the pointed ends of the filament, where they block the association and dissociation of monomers. Pointed ends, for which actin monomers have significantly lower association and dissociation rate-constants than for barbed, are capped by either the Arp2/3 complex or tropomodulins []. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments []. Limited proteolysis shows this protein is composed of two domains. The unstructured tropomyosin-binding region at the N terminus has an actin pointed-end-capping activity that is dramatically up-regulated by tropomyosin coating of the actin filament[]. The second region is found near the C terminus. This tropomyosin-independent capping-domain caps pure actin. ; GO: 0005523 tropomyosin binding, 0005856 cytoskeleton
Probab=85.91 E-value=0.63 Score=38.95 Aligned_cols=36 Identities=36% Similarity=0.562 Sum_probs=31.7
Q ss_pred ccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 149 MTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 149 MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
|.|.+-+..|++--...=|..|||||++.|..|+++
T Consensus 3 ~~~~kel~~y~DiDeDelL~~LS~EEL~~L~~el~e 38 (147)
T PF03250_consen 3 LSYRKELEKYEDIDEDELLAKLSPEELEELENELEE 38 (147)
T ss_pred chhhhhhhhcccCCHHHHHHhCCHHHHHHHHHHHHh
Confidence 457788889998888888999999999999988876
No 8
>PF08621 RPAP1_N: RPAP1-like, N-terminal; InterPro: IPR013930 Inhibition of RNA polymerase II-associated protein 1 (RPAP1) synthesis in Saccharomyces cerevisiae (Baker's yeast) results in changes in global gene expression that are similar to those caused by the loss of the RNAPII subunit Rpb11 []. This entry represents the N-terminal region of RPAP-1 that is conserved from yeast to humans.
Probab=83.69 E-value=1.5 Score=30.37 Aligned_cols=22 Identities=27% Similarity=0.376 Sum_probs=19.4
Q ss_pred HHHHHhcCCHHHHHHHHHHHHH
Q 029243 163 MQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 163 LQKRleeLtpEEle~L~aEIE~ 184 (196)
=.+||..|||||.++-++||-+
T Consensus 8 N~~rL~~MS~eEI~~er~eL~~ 29 (49)
T PF08621_consen 8 NEARLASMSPEEIEEEREELLE 29 (49)
T ss_pred HHHHHHhCCHHHHHHHHHHHHH
Confidence 3689999999999999999865
No 9
>PF04120 Iron_permease: Low affinity iron permease ; InterPro: IPR007251 Although originally identified as a low-affinity iron(II) permease [, ], Fet4 has since been shown to import several other transition metal ions, including copper [, ] and zinc []. Copper, cobalt, and cadmium inhibit Fet4 [, ]. Fet4 is an integral protein of the plasma membrane [, ]. FET4 is not essential, not even in fet3 fet4 double mutants []. Over expression of FET4 improves growth under alkaline conditions []. Transcription of FET4 is induced by Aft1 in response to low levels of iron [, , ] or by Zap1 in response to low zinc [, ], but not in response to low copper []. When the high-affinity iron permease component Fet3 is deleted, FET4 is induced by the addition of copper, zinc, cobalt, or manganese []. It is also induced under anaerobic conditions [, , ] and repressed by Rox1 in aerobic conditions [, ]. Rox1 attenuates the activation of FET4 by Aft1 or Zap1 []. ; GO: 0055085 transmembrane transport
Probab=82.49 E-value=7.9 Score=31.55 Aligned_cols=48 Identities=23% Similarity=0.255 Sum_probs=30.9
Q ss_pred eeecCCccHHHHHHHHHHHHHH---H---HHhcCCHHHHHHHHHHHHHHHhccc
Q 029243 143 RVSNKEMTYAQQLRDYEDKVMQ---K---RLEGLTEAELEALIEQVEEEKRRLA 190 (196)
Q Consensus 143 RV~tk~MTY~qQ~kdYE~avLQ---K---RleeLtpEEle~L~aEIE~Ek~~~~ 190 (196)
..-+++..+++.+.|-=-..+. . -+|.|+|+|++++.++.|++..+..
T Consensus 62 n~q~Rd~~al~~KLdeLi~~~~~a~n~li~iE~l~~~el~~~~~~~~~~~~~~~ 115 (132)
T PF04120_consen 62 NTQNRDTKALQAKLDELIRAVKEARNELIDIEDLTEEELEEIRKRYERLAEQAR 115 (132)
T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHhCCcccCCHHHHHHHHHHHHHHHHHhh
Confidence 3445666666555554433332 1 1588999999999999998655443
No 10
>PF13618 Gluconate_2-dh3: Gluconate 2-dehydrogenase subunit 3
Probab=77.50 E-value=2.3 Score=32.07 Aligned_cols=28 Identities=29% Similarity=0.414 Sum_probs=23.2
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
...-.|.|.+|+++|+++|++++++.+.
T Consensus 61 ~~~~g~~F~~l~~~~~~~lL~~~~~~~~ 88 (131)
T PF13618_consen 61 QKRYGKSFAELSPAQREALLDALEKSEA 88 (131)
T ss_pred HHHhCCChhhCCHHHHHHHHHHHHhccc
Confidence 3344688999999999999999998654
No 11
>PRK11864 2-ketoisovalerate ferredoxin oxidoreductase subunit beta; Provisional
Probab=74.42 E-value=1.9 Score=39.00 Aligned_cols=46 Identities=26% Similarity=0.424 Sum_probs=33.7
Q ss_pred eeeeeecCCccHHH---------HHHHHHH-HHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 140 YIFRVSNKEMTYAQ---------QLRDYED-KVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 140 YlfRV~tk~MTY~q---------Q~kdYE~-avLQKRleeLtpEEle~L~aEIE~E 185 (196)
-|||.-.++.+... .++..+| -.+|.||..|.||++++||+++++.
T Consensus 233 plye~~~g~~~~~~~~~~~~~~~~~~pv~~~l~~q~Rf~~L~~~~~~~~q~~vd~~ 288 (300)
T PRK11864 233 PLFEYENGKFKLNSPSKTLLDKKKRKPVEEYLKLQGRFKHLTEEEIKGLQEEIDEM 288 (300)
T ss_pred eEEEEECCEEEEccCCccccccccCCCHHHHHhhccchhcCCHHHHHHHHHHHHHH
Confidence 36777776655543 2334443 3489999999999999999999874
No 12
>PTZ00432 falcilysin; Provisional
Probab=73.98 E-value=8.7 Score=39.85 Aligned_cols=43 Identities=16% Similarity=0.306 Sum_probs=37.5
Q ss_pred ecCCc--cHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 145 SNKEM--TYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 145 ~tk~M--TY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
+..+= .|.++++.=|++.|+++.++|++||+++|.++.++-++
T Consensus 571 ~~p~~s~~~~~~~~~~e~~~L~~~~~~Ls~ee~~~i~~~~~~l~~ 615 (1119)
T PTZ00432 571 LEAVESSKYEKEFNKLVKDELKERLSHLTKEQVDEMEKAYEKFKK 615 (1119)
T ss_pred EecCCcccHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 33555 89999999999999999999999999999999988443
No 13
>TIGR03348 VI_IcmF type VI secretion protein IcmF. Members of this protein family are IcmF homologs and tend to be associated with type VI secretion systems.
Probab=73.94 E-value=5.2 Score=41.43 Aligned_cols=84 Identities=15% Similarity=0.217 Sum_probs=46.7
Q ss_pred hHHHHHHHHHHHHhhHHhCCCcc--cchhHHHHHHHhhhhhheeeeeeeeeecCCccHH--HHHHHHHHHHHH-HHHhcC
Q 029243 96 GVGVTVIGYGLKSGLELFGVDPL--QAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYA--QQLRDYEDKVMQ-KRLEGL 170 (196)
Q Consensus 96 GlgvfllGg~ly~gLq~~GlD~~--~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~--qQ~kdYE~avLQ-KRleeL 170 (196)
.+++++++.++|.+-..+++++. -|+.|+=++++++++++|++..++|. +. .+.+.-+++.-+ ++-+..
T Consensus 3 ~l~~~~l~~~iW~~gp~~~~~~~~Pl~~~~~r~~~i~~~~~~~~~~~~~~~------~~~~r~~~~~~~~~~~~~~~~~~ 76 (1169)
T TIGR03348 3 LLGLILLCILIWWAGPLLAVGDYKPLESVWARLLIIAVLLLVWLLWLLIRQ------WRARRRNQQLLAALAAPTKLKAD 76 (1169)
T ss_pred HHHHHHHHHHHHHhhhhhhcCCCCccccHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHhhhcccccccCC
Confidence 45556666677777666666554 56788877777666677777666544 11 111122222211 111111
Q ss_pred C--HHHHHHHHHHHHHH
Q 029243 171 T--EAELEALIEQVEEE 185 (196)
Q Consensus 171 t--pEEle~L~aEIE~E 185 (196)
+ .+|++.|++.+.+-
T Consensus 77 ~~~~~~~~~l~~~~~~a 93 (1169)
T TIGR03348 77 AAAEAEIRELRARFNEA 93 (1169)
T ss_pred ccchHHHHHHHHHHHHH
Confidence 2 47888888877763
No 14
>PF06295 DUF1043: Protein of unknown function (DUF1043); InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=73.54 E-value=2.7 Score=33.16 Aligned_cols=20 Identities=25% Similarity=0.624 Sum_probs=13.7
Q ss_pred HHHHhhhhhheeeeeeeeeecCC
Q 029243 126 VIVVLGLTLGWVSTYIFRVSNKE 148 (196)
Q Consensus 126 ~lLVlGlvvgWv~SYlfRV~tk~ 148 (196)
+.||+|++|||+.. |..+++
T Consensus 4 i~lvvG~iiG~~~~---r~~~~~ 23 (128)
T PF06295_consen 4 IGLVVGLIIGFLIG---RLTSSN 23 (128)
T ss_pred HHHHHHHHHHHHHH---HHhccc
Confidence 45788888999864 445554
No 15
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=73.51 E-value=4.5 Score=31.09 Aligned_cols=16 Identities=38% Similarity=0.335 Sum_probs=13.1
Q ss_pred cCCHHHHHHHHHHHHH
Q 029243 169 GLTEAELEALIEQVEE 184 (196)
Q Consensus 169 eLtpEEle~L~aEIE~ 184 (196)
.++++|.+++.+|+++
T Consensus 56 ~l~~~e~~~~~~El~r 71 (117)
T TIGR03142 56 LLDEAEAEAARAELQR 71 (117)
T ss_pred CCCHHHHHHHHHHHHH
Confidence 3488899999999886
No 16
>PRK11677 hypothetical protein; Provisional
Probab=73.10 E-value=5.1 Score=32.68 Aligned_cols=16 Identities=25% Similarity=0.486 Sum_probs=11.5
Q ss_pred HHHHhhhhhheeeeee
Q 029243 126 VIVVLGLTLGWVSTYI 141 (196)
Q Consensus 126 ~lLVlGlvvgWv~SYl 141 (196)
+.||+|++|||+...+
T Consensus 8 i~livG~iiG~~~~R~ 23 (134)
T PRK11677 8 IGLVVGIIIGAVAMRF 23 (134)
T ss_pred HHHHHHHHHHHHHHhh
Confidence 5677778888887653
No 17
>PRK03557 zinc transporter ZitB; Provisional
Probab=70.70 E-value=7.1 Score=34.25 Aligned_cols=33 Identities=21% Similarity=0.374 Sum_probs=20.4
Q ss_pred CCCCccccchhh---hhhhHHHHHHHHH-HHHhhHHh
Q 029243 81 TETPFGYTRKDV---LLIGVGVTVIGYG-LKSGLELF 113 (196)
Q Consensus 81 ~~~pfgMtR~DV---I~IGlgvfllGg~-ly~gLq~~ 113 (196)
++-|||+.|... +..|+.++++|+. +|-+.+.+
T Consensus 77 ~~hpyG~~r~E~l~al~~~~~l~~~~~~i~~eai~~l 113 (312)
T PRK03557 77 IRHTFGWLRLTTLAAFVNAIALVVITILIVWEAIERF 113 (312)
T ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 468999999985 4555555555533 33445543
No 18
>TIGR01297 CDF cation diffusion facilitator family transporter. This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
Probab=70.56 E-value=14 Score=30.43 Aligned_cols=79 Identities=22% Similarity=0.401 Sum_probs=35.7
Q ss_pred CCCCccccchhhh---hhhHHHHHHHH-HHHHhhHHh-CCCcccchhHHHHHHHhhhhhhee-eeeeeee--ecCCccHH
Q 029243 81 TETPFGYTRKDVL---LIGVGVTVIGY-GLKSGLELF-GVDPLQAGNVVEVIVVLGLTLGWV-STYIFRV--SNKEMTYA 152 (196)
Q Consensus 81 ~~~pfgMtR~DVI---~IGlgvfllGg-~ly~gLq~~-GlD~~~AGiwsQ~lLVlGlvvgWv-~SYlfRV--~tk~MTY~ 152 (196)
++-|||+.|.+.+ ..|+.++++|+ ++|-+.+.+ .=++..-..+.-.+.++++++-++ .-|..|+ -.+++...
T Consensus 48 ~~~pyG~~r~E~l~~l~~~~~l~~~~~~~~~~si~~l~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~s~~l~ 127 (268)
T TIGR01297 48 ERHPFGHGRAEILAALLNGLFLVVVALFILYEAIERLINPEPEIDGGTMLIVAIVGLIVNLILALYLHRVGHRLGSLALR 127 (268)
T ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccchhHHHHHHHHHHHHHHHHHHHHHhCccCCCHHHH
Confidence 4599999998744 34443333332 223334433 111122223333344444443333 3344443 23345555
Q ss_pred HHHHHHH
Q 029243 153 QQLRDYE 159 (196)
Q Consensus 153 qQ~kdYE 159 (196)
....++.
T Consensus 128 a~~~~~~ 134 (268)
T TIGR01297 128 AAALHVL 134 (268)
T ss_pred HHHHHHH
Confidence 5555554
No 19
>PRK11380 hypothetical protein; Provisional
Probab=69.39 E-value=6.7 Score=36.91 Aligned_cols=34 Identities=32% Similarity=0.349 Sum_probs=25.0
Q ss_pred eeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHH
Q 029243 140 YIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALI 179 (196)
Q Consensus 140 YlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~ 179 (196)
-+||++-+ .=-+|++-|+ |+|++.||+|++++|+
T Consensus 89 ~~~~~~~~--~~~eq~~yy~----~~~~~~LteEq~r~L~ 122 (353)
T PRK11380 89 IVRAGFKK--AKKEQLRYYQ----AKGIEPLSEEKRQALQ 122 (353)
T ss_pred HHHHHHcc--chHHHHHHHH----HcCCCCCCHHHHHHHH
Confidence 45666642 2246777665 7899999999999986
No 20
>PF12666 PrgI: PrgI family protein; InterPro: IPR024414 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 116 and 146 amino acids in length. PrgI is found encoded on plasmids of Enterococcus faecalis, its function is not known.
Probab=67.25 E-value=12 Score=27.13 Aligned_cols=34 Identities=24% Similarity=0.384 Sum_probs=25.7
Q ss_pred CCCCCccccchhhhhhhHHHHHHHHHHHHhh-HHhC
Q 029243 80 QTETPFGYTRKDVLLIGVGVTVIGYGLKSGL-ELFG 114 (196)
Q Consensus 80 ~~~~pfgMtR~DVI~IGlgvfllGg~ly~gL-q~~G 114 (196)
++|.=||+|-...+.+++++.+. +++|..+ ..+|
T Consensus 10 e~ki~~GlT~RQl~~l~~~~~~~-~~~~~~~~~~l~ 44 (93)
T PF12666_consen 10 EEKIFFGLTLRQLICLAIGALVG-VGVYLLLWFFLG 44 (93)
T ss_pred cchhccCCCHHHHHHHHHHHHHH-HHHHHHHHHhcc
Confidence 67788899999999999988654 4566654 3565
No 21
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=66.46 E-value=8.6 Score=29.62 Aligned_cols=51 Identities=18% Similarity=0.246 Sum_probs=27.2
Q ss_pred hheeeeeeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 134 LGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 134 vgWv~SYlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
++|+.-+++---+|-..|.++.+++++ +++++++| .+|-++|.+||+.-++
T Consensus 12 l~~l~y~l~~g~~G~~~~~~l~~q~~~--~~~e~~~l-~~~n~~L~~eI~~L~~ 62 (105)
T PRK00888 12 LVWLQYSLWFGKNGILDYWRVNDQVAA--QQQTNAKL-KARNDQLFAEIDDLKG 62 (105)
T ss_pred HHHHHHHHhccCCcHHHHHHHHHHHHH--HHHHHHHH-HHHHHHHHHHHHHhhC
Confidence 344433333223444578888887763 34444444 4555666666665443
No 22
>PRK03573 transcriptional regulator SlyA; Provisional
Probab=65.86 E-value=28 Score=26.40 Aligned_cols=100 Identities=13% Similarity=0.178 Sum_probs=62.7
Q ss_pred CCccccchhhhhhhHHHHHHH-HH-HHHhhHHhCCCcccchhHHHHHHHhhhhhheeee------eeeeeecCCccHHHH
Q 029243 83 TPFGYTRKDVLLIGVGVTVIG-YG-LKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVST------YIFRVSNKEMTYAQQ 154 (196)
Q Consensus 83 ~pfgMtR~DVI~IGlgvfllG-g~-ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~S------YlfRV~tk~MTY~qQ 154 (196)
.|+|+|..+..++....- .+ +. .--+-..+|+|.....-.+.-+-=-|+ |-..-. +....--+-....++
T Consensus 25 ~~~glt~~q~~vL~~l~~-~~~~~t~~eLa~~l~~~~~tvt~~v~~Le~~Gl-V~r~~~~~DrR~~~l~LT~~G~~~~~~ 102 (144)
T PRK03573 25 KPLELTQTHWVTLHNIHQ-LPPEQSQIQLAKAIGIEQPSLVRTLDQLEEKGL-ISRQTCASDRRAKRIKLTEKAEPLISE 102 (144)
T ss_pred HhcCCCHHHHHHHHHHHH-cCCCCCHHHHHHHhCCChhhHHHHHHHHHHCCC-EeeecCCCCcCeeeeEEChHHHHHHHH
Confidence 588999888665443321 22 11 223355888887765555555544443 222211 333344444557777
Q ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 155 LRDYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 155 ~kdYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
.....++..++-++.++|||++.+.+-++.
T Consensus 103 ~~~~~~~~~~~~~~~l~~ee~~~l~~~l~~ 132 (144)
T PRK03573 103 VEAVINKTRAEILHGISAEEIEQLITLIAK 132 (144)
T ss_pred HHHHHHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 788888889999999999999998877765
No 23
>PF11003 DUF2842: Protein of unknown function (DUF2842); InterPro: IPR021265 This bacterial family of proteins have no known function.
Probab=65.44 E-value=6.6 Score=28.23 Aligned_cols=45 Identities=29% Similarity=0.293 Sum_probs=32.4
Q ss_pred hhhHHHHHHHHHHHHhhHHhCCCcc--cchhHHHHHHHhhhhhheee
Q 029243 94 LIGVGVTVIGYGLKSGLELFGVDPL--QAGNVVEVIVVLGLTLGWVS 138 (196)
Q Consensus 94 ~IGlgvfllGg~ly~gLq~~GlD~~--~AGiwsQ~lLVlGlvvgWv~ 138 (196)
.||+.++++|...|..+-..=.|.. ....|.|+++-+.+-++|+.
T Consensus 3 ~ig~v~ll~~l~vY~~~a~~l~~~~~~~~~~~~~l~~Yvv~G~~Wvl 49 (62)
T PF11003_consen 3 LIGLVLLLVGLPVYAVLAVTLADWLLARWPWWVQLIYYVVLGLLWVL 49 (62)
T ss_pred eehhHHHHHHHHHHHHHHHHHHHHhccCchHHHHHHHHHHHHHHHHH
Confidence 3678888888777777553212222 88999999988877889974
No 24
>PF07946 DUF1682: Protein of unknown function (DUF1682); InterPro: IPR012879 The members of this family are all hypothetical eukaryotic proteins of unknown function. One member (Q920S6 from SWISSPROT) is described as being an adipocyte-specific protein, but no evidence of this was found.
Probab=65.23 E-value=15 Score=32.76 Aligned_cols=16 Identities=25% Similarity=0.343 Sum_probs=13.1
Q ss_pred HHHhcCCHHHHHHHHH
Q 029243 165 KRLEGLTEAELEALIE 180 (196)
Q Consensus 165 KRleeLtpEEle~L~a 180 (196)
+++++||||||.++++
T Consensus 297 ~~~~~lspeeQrK~ee 312 (321)
T PF07946_consen 297 RKLSKLSPEEQRKYEE 312 (321)
T ss_pred HHHhcCCHHHHHHHHH
Confidence 3489999999999873
No 25
>PF06814 Lung_7-TM_R: Lung seven transmembrane receptor; InterPro: IPR009637 This family represents a conserved region with eukaryotic lung seven transmembrane receptors and related proteins.; GO: 0016021 integral to membrane
Probab=64.54 E-value=9.8 Score=32.72 Aligned_cols=77 Identities=17% Similarity=0.119 Sum_probs=49.8
Q ss_pred cchhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhh------hhheeeeeeeeeecCCccHHHHHHHHHHH
Q 029243 88 TRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGL------TLGWVSTYIFRVSNKEMTYAQQLRDYEDK 161 (196)
Q Consensus 88 tR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGl------vvgWv~SYlfRV~tk~MTY~qQ~kdYE~a 161 (196)
+.+.++.+++..+++..+.+..-...+......+.|..+++++=+ .+.|+ ++..++.|+.-+++|++++.
T Consensus 150 ~~~~v~~l~i~~~v~~~i~~i~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~wi----~~sL~~t~~~lk~~~q~~KL 225 (295)
T PF06814_consen 150 REKKVLMLVILYFVFSNIAYIIREESSPSDSSYASWNFIFFLLPLCILDLFFIVWI----FRSLSKTIRDLKARRQTAKL 225 (295)
T ss_pred ceeehhHHHHHHHHHHHHHHHHHHhcCCccchhhhHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH
Confidence 445688888888888866654443333333333445555544422 24565 55567788888999999988
Q ss_pred HHHHHHh
Q 029243 162 VMQKRLE 168 (196)
Q Consensus 162 vLQKRle 168 (196)
.|=+||-
T Consensus 226 ~lyr~f~ 232 (295)
T PF06814_consen 226 SLYRRFY 232 (295)
T ss_pred HHHHHHH
Confidence 8888764
No 26
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=63.43 E-value=7.1 Score=26.67 Aligned_cols=36 Identities=31% Similarity=0.350 Sum_probs=15.4
Q ss_pred cCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 146 NKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 146 tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
.+-++|.+..+.+. .++++++++. +|.+.|.+|++.
T Consensus 14 ~~~~~~~~~~~ei~--~l~~~i~~l~-~e~~~L~~ei~~ 49 (80)
T PF04977_consen 14 SGYSRYYQLNQEIA--ELQKEIEELK-KENEELKEEIER 49 (80)
T ss_pred chHHHHHHHHHHHH--HHHHHHHHHH-HHHHHHHHHHHH
Confidence 34445544444332 3444444442 234444444443
No 27
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=63.20 E-value=16 Score=28.10 Aligned_cols=22 Identities=14% Similarity=0.327 Sum_probs=16.0
Q ss_pred HHHHHhcCCHHHHHHHHHHHHH
Q 029243 163 MQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 163 LQKRleeLtpEEle~L~aEIE~ 184 (196)
+-.|+..|||||++.+++.+..
T Consensus 34 ~a~r~~~mspeqq~r~~~rm~~ 55 (107)
T PF11304_consen 34 IAERWPSMSPEQQQRLRERMRR 55 (107)
T ss_pred HHHHHhcCCHHHHHHHHHHHHH
Confidence 5667777777777777776664
No 28
>PRK11865 pyruvate ferredoxin oxidoreductase subunit beta; Provisional
Probab=62.88 E-value=6 Score=35.73 Aligned_cols=45 Identities=24% Similarity=0.451 Sum_probs=33.2
Q ss_pred eeeeeecCCccHHH-----------HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHH
Q 029243 140 YIFRVSNKEMTYAQ-----------QLRDYEDKVMQKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 140 YlfRV~tk~MTY~q-----------Q~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek 186 (196)
-|||+-.++.+-.. ..+|| -.+|.||..|+|||.++||+++++.=
T Consensus 237 plye~~~g~~~~~~~~~~ld~~~~~pv~~~--l~~q~Rf~~L~~~~~~~~q~~v~~~~ 292 (299)
T PRK11865 237 PLFEIENGKFKITYEPLHLDRRTRKPIEEY--LKVQGRFKHLTEEDIEILQKYIDEKW 292 (299)
T ss_pred eEEEEECCeeccCCCcccccccCCCCHHHH--HhhCcchhcCCHHHHHHHHHHHHHHH
Confidence 46788776654322 34444 35699999999999999999998843
No 29
>PF05915 DUF872: Eukaryotic protein of unknown function (DUF872); InterPro: IPR008590 This entry represents several uncharacterised eukaryotic transmembrane proteins. The function of this currently unknown.
Probab=61.90 E-value=21 Score=28.27 Aligned_cols=45 Identities=22% Similarity=0.316 Sum_probs=27.5
Q ss_pred hhhhhhhHHHHHHHHHHHHh--hHHhCC-CcccchhHHHHHHHhhhhhhee
Q 029243 90 KDVLLIGVGVTVIGYGLKSG--LELFGV-DPLQAGNVVEVIVVLGLTLGWV 137 (196)
Q Consensus 90 ~DVI~IGlgvfllGg~ly~g--Lq~~Gl-D~~~AGiwsQ~lLVlGlvvgWv 137 (196)
.+.|.+++++|++|.++... +-++|- |.-..-.| +++|+|+ +.++
T Consensus 41 wK~I~la~~Lli~G~~li~~g~l~~~~~i~~~~~~~~--~llilG~-L~fI 88 (115)
T PF05915_consen 41 WKSIALAVFLLIFGTVLIIIGLLLFFGHIDGDRDRGW--ALLILGI-LCFI 88 (115)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCcccc--hHHHHHH-HHHh
Confidence 67888888888888665443 445553 43333333 6778876 4443
No 30
>KOG2019 consensus Metalloendoprotease HMP1 (insulinase superfamily) [General function prediction only; Posttranslational modification, protein turnover, chaperones]
Probab=61.73 E-value=15 Score=38.25 Aligned_cols=39 Identities=23% Similarity=0.393 Sum_probs=33.6
Q ss_pred cCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 146 NKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 146 tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
-.+=+|++++.+=|...||+|-++||||+++.+-+-=++
T Consensus 493 qpd~e~~~~~~~eE~tkL~ek~~alteeD~~ei~k~~~e 531 (998)
T KOG2019|consen 493 QPDPEFAEKLEQEEATKLEEKKAALTEEDLAEIAKAGEE 531 (998)
T ss_pred cCCchhhHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHH
Confidence 346689999999999999999999999999888765554
No 31
>PF12597 DUF3767: Protein of unknown function (DUF3767); InterPro: IPR022533 This group of proteins includes mitochodrial cytochrome c oxidase proteins [], and some transmembrane domain-containing proteins of unknown function known as FAM36A. Proteins in this family are typically between 112 and 199 amino acids in length.
Probab=61.56 E-value=26 Score=27.84 Aligned_cols=51 Identities=24% Similarity=0.330 Sum_probs=36.6
Q ss_pred CCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHH
Q 029243 114 GVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEA 177 (196)
Q Consensus 114 GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~ 177 (196)
+=.+..|-+|+=.-++++-++.|. |-+..++-|...+++=.|.|.+.+..+
T Consensus 62 ~~~~~~A~nwavgsF~l~s~~~we-------------~Cr~~r~~~~~~~~~~~e~~~~k~~~~ 112 (118)
T PF12597_consen 62 TSNPRKAANWAVGSFFLGSLGSWE-------------YCRYNRRKERQQMKRAVEAMQEKKRKK 112 (118)
T ss_pred cCCCccchhhhhHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 338889999998888888778884 566667777777776666665554443
No 32
>PF06518 DUF1104: Protein of unknown function (DUF1104); InterPro: IPR009488 This family consists of several hypothetical proteins of unknown function which appear to be found exclusively in Helicobacter pylori.; PDB: 2XRH_A.
Probab=60.25 E-value=7.2 Score=30.08 Aligned_cols=31 Identities=16% Similarity=0.346 Sum_probs=13.1
Q ss_pred CCccHHHHHHHHH---HHHHHHHHhcCCHHHHHHH
Q 029243 147 KEMTYAQQLRDYE---DKVMQKRLEGLTEAELEAL 178 (196)
Q Consensus 147 k~MTY~qQ~kdYE---~avLQKRleeLtpEEle~L 178 (196)
.+|| .++++.|+ ...|++|.++||++|+..+
T Consensus 58 ~~ms-~~e~~k~~~ev~k~~~~~~~~mS~kE~~~~ 91 (93)
T PF06518_consen 58 SKMS-VEERKKRREEVRKALEKRIKKMSVKEAKES 91 (93)
T ss_dssp TTS--HHHHHHHHHHHHHHHHHT----S-------
T ss_pred HHCC-HHHHHHHHHHHHHHHHHHHHhccccccccc
Confidence 4555 35666666 5678999999999998764
No 33
>PRK11091 aerobic respiration control sensor protein ArcB; Provisional
Probab=58.59 E-value=17 Score=34.57 Aligned_cols=11 Identities=18% Similarity=0.516 Sum_probs=5.4
Q ss_pred hhhheeeeeee
Q 029243 132 LTLGWVSTYIF 142 (196)
Q Consensus 132 lvvgWv~SYlf 142 (196)
++..|+..++.
T Consensus 64 ~~~~~~~~~~~ 74 (779)
T PRK11091 64 LITPWAVYFLS 74 (779)
T ss_pred HHHHHHHHHHH
Confidence 33455555544
No 34
>PF07226 DUF1422: Protein of unknown function (DUF1422); InterPro: IPR009867 This family consists of several hypothetical bacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=56.54 E-value=16 Score=29.81 Aligned_cols=45 Identities=18% Similarity=0.157 Sum_probs=34.2
Q ss_pred hhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeee
Q 029243 93 LLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVST 139 (196)
Q Consensus 93 I~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~S 139 (196)
..++++.|++|..+|..+..+=. |---.|...+++++.| +.|++.
T Consensus 65 P~~a~acFflG~f~ySA~vraqy-PeiGSNFfp~il~l~L-~~Wi~~ 109 (117)
T PF07226_consen 65 PKLALACFFLGLFGYSAFVRAQY-PEIGSNFFPSILCLIL-VFWIGY 109 (117)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhc-hhhhhhHHHHHHHHHH-HHHHHH
Confidence 46899999999999999775544 3334577778888855 899863
No 35
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=56.14 E-value=6.5 Score=34.70 Aligned_cols=30 Identities=27% Similarity=0.412 Sum_probs=18.5
Q ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHHhcc
Q 029243 159 EDKVMQKRLEGLTEAELEALIEQVEEEKRRL 189 (196)
Q Consensus 159 E~avLQKRleeLtpEEle~L~aEIE~Ek~~~ 189 (196)
|++.|++++.+| .++++.+.+|+++|.++.
T Consensus 74 EN~~Lr~e~~~l-~~~~~~~~~~l~~EN~rL 103 (283)
T TIGR00219 74 ENYKLRQELLKK-NQQLEILTQNLKQENVRL 103 (283)
T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH
Confidence 588899988888 344444444455554443
No 36
>PRK11462 putative transporter; Provisional
Probab=56.11 E-value=33 Score=30.86 Aligned_cols=23 Identities=39% Similarity=0.567 Sum_probs=16.1
Q ss_pred HHHhcCCHHHHHHHHHHHHHHHh
Q 029243 165 KRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~Ek~ 187 (196)
.|+-.|++++.+++++|+++.|+
T Consensus 425 ~~~y~l~~~~~~~i~~~l~~~~~ 447 (460)
T PRK11462 425 KRYYSLTTHNLKTVMEQLAQGKR 447 (460)
T ss_pred HHhccCCHHHHHHHHHHHHHHHh
Confidence 34556777888888888876554
No 37
>PF08451 A_deaminase_N: Adenosine/AMP deaminase N-terminal; InterPro: IPR013659 This domain is found to the N terminus of the Adenosine/AMP deaminase domain (IPR001365 from INTERPRO) in metazoan proteins such as the Cat eye syndrome critical region protein 1 and its homologues. ; GO: 0005615 extracellular space; PDB: 3LGG_B 3LGD_B.
Probab=55.14 E-value=14 Score=28.14 Aligned_cols=36 Identities=22% Similarity=0.132 Sum_probs=20.1
Q ss_pred eeeecCCccHHHHHHHHHHHHHHHHHh---cCCHHHHHH
Q 029243 142 FRVSNKEMTYAQQLRDYEDKVMQKRLE---GLTEAELEA 177 (196)
Q Consensus 142 fRV~tk~MTY~qQ~kdYE~avLQKRle---eLtpEEle~ 177 (196)
..-.+...+|.+||..+-+.+-..++. .||+.|+++
T Consensus 14 ~~~~~~~~~y~~~R~~l~~~E~~~~lG~~l~Lt~~E~~a 52 (96)
T PF08451_consen 14 SLSRPTPEDYKRQRNALLQEEESRRLGGDLVLTPKEQKA 52 (96)
T ss_dssp ---------HHHHHHHHHHHHHHHSTTTT----HHHHHH
T ss_pred cccCCCccHHHHHHHHHHHHHHHhcCCCCCccCHHHHHH
Confidence 334566789999999998887777776 488888764
No 38
>TIGR02698 CopY_TcrY copper transport repressor, CopY/TcrY family. This family includes metal-fist type transcriptional repressors of copper transport systems such as copYZAB of Enterococcus hirae and tcrYAZB (transferble copper resistance) of an Enterocuccus faecium plasmid. High levels of copper can displace zinc and prevent binding by the repressor, activating efflux by copper resistance transporters. The most closely related proteins excluded by this model are antibiotic resistance regulators including the methicillin resistance regulatory protein MecI.
Probab=54.98 E-value=47 Score=25.97 Aligned_cols=30 Identities=23% Similarity=0.382 Sum_probs=22.5
Q ss_pred HHHHHHHh--cCCHHHHHHHHHHHHHHHhccc
Q 029243 161 KVMQKRLE--GLTEAELEALIEQVEEEKRRLA 190 (196)
Q Consensus 161 avLQKRle--eLtpEEle~L~aEIE~Ek~~~~ 190 (196)
+.+..=++ .+|+||+++|++-|++.+..++
T Consensus 91 ~ll~~l~~~~~ls~eele~L~~li~~~~~~~~ 122 (130)
T TIGR02698 91 AVIADLIEESPLSQTDIEKLEKLLSEKKSTAV 122 (130)
T ss_pred HHHHHHHhcCCCCHHHHHHHHHHHHhcccCCC
Confidence 33444444 7999999999999998776654
No 39
>COG0099 RpsM Ribosomal protein S13 [Translation, ribosomal structure and biogenesis]
Probab=54.11 E-value=13 Score=30.45 Aligned_cols=22 Identities=36% Similarity=0.545 Sum_probs=19.7
Q ss_pred HHHHHhcCCHHHHHHHHHHHHH
Q 029243 163 MQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 163 LQKRleeLtpEEle~L~aEIE~ 184 (196)
..+|..+||+||+++|-++|+.
T Consensus 41 ~~~r~~eLteeei~~ir~~i~~ 62 (121)
T COG0099 41 PDKRVGELTEEEIERLRDAIQN 62 (121)
T ss_pred HhHhhccCCHHHHHHHHHHHHh
Confidence 4689999999999999999983
No 40
>PRK10236 hypothetical protein; Provisional
Probab=53.34 E-value=15 Score=32.86 Aligned_cols=26 Identities=8% Similarity=0.341 Sum_probs=23.5
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~E 185 (196)
..++++=++.|||||++.|.++++..
T Consensus 119 ~kll~~a~~kms~eE~~~L~~~l~~~ 144 (237)
T PRK10236 119 EQFLRNTWKKMDEEHKQEFLHAVDAR 144 (237)
T ss_pred HHHHHHHHHHCCHHHHHHHHHHHhhh
Confidence 66788899999999999999999875
No 41
>PF15372 DUF4600: Domain of unknown function (DUF4600)
Probab=51.81 E-value=16 Score=30.11 Aligned_cols=26 Identities=38% Similarity=0.640 Sum_probs=22.5
Q ss_pred HHHhcCCHHHHHHHHHHHHHHHhccc
Q 029243 165 KRLEGLTEAELEALIEQVEEEKRRLA 190 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~Ek~~~~ 190 (196)
++++.||.+++..|+.++|+||....
T Consensus 46 R~ye~Ms~~~l~~llkqLEkeK~~Le 71 (129)
T PF15372_consen 46 RRYEQMSVESLNQLLKQLEKEKRSLE 71 (129)
T ss_pred HHHhhccHHHHHHHHHHHHHHHHHHH
Confidence 35699999999999999999987653
No 42
>COG1422 Predicted membrane protein [Function unknown]
Probab=51.57 E-value=9.3 Score=33.47 Aligned_cols=53 Identities=23% Similarity=0.324 Sum_probs=31.7
Q ss_pred HHHHhhhhhheeeeeeeeeecCCccHHHHHHHHH--HHHHHHHHhcCCH----HHHHHHHHHH
Q 029243 126 VIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYE--DKVMQKRLEGLTE----AELEALIEQV 182 (196)
Q Consensus 126 ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE--~avLQKRleeLtp----EEle~L~aEI 182 (196)
++||.|+++|=.++++-..+.. .+-.++|+ -+++||+++|..+ +-+++|+++-
T Consensus 49 vilV~avi~gl~~~i~~~~liD----~ekm~~~qk~m~efq~e~~eA~~~~d~~~lkkLq~~q 107 (201)
T COG1422 49 VILVAAVITGLYITILQKLLID----QEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQ 107 (201)
T ss_pred HHHHHHHHHHHHHHHHHHHhcc----HHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 4666777777777776544432 34455555 4677888877765 4444444443
No 43
>PF14143 YrhC: YrhC-like protein
Probab=49.83 E-value=31 Score=25.84 Aligned_cols=43 Identities=12% Similarity=0.074 Sum_probs=26.3
Q ss_pred hhhHHHHHHHHHHHHhh--HHhCCCcccchhHHHHHHHhhhhhhee
Q 029243 94 LIGVGVTVIGYGLKSGL--ELFGVDPLQAGNVVEVIVVLGLTLGWV 137 (196)
Q Consensus 94 ~IGlgvfllGg~ly~gL--q~~GlD~~~AGiwsQ~lLVlGlvvgWv 137 (196)
--|+.++.++..+|.|+ +.-|.+..+..+..-++.+. ++..|.
T Consensus 12 rf~~vLLAvs~FlYiG~viP~~~~~~~~~~~m~~~~~~~-l~~a~~ 56 (72)
T PF14143_consen 12 RFAFVLLAVSTFLYIGTVIPIGAKETAQKYIMMGAICIF-LAGAFL 56 (72)
T ss_pred HHHHHHHHHHHHHHHHhhCCccchhHHHHHHHHHHHHHH-HHHHHH
Confidence 45677788888899884 45577766665554444333 334443
No 44
>PF01956 DUF106: Integral membrane protein DUF106; InterPro: IPR002809 This entry represents a group of eukaryotic and archaeal proteins that have no known function. Members are predicted to be integral membrane proteins.; GO: 0016020 membrane
Probab=49.59 E-value=9.6 Score=30.37 Aligned_cols=54 Identities=19% Similarity=0.180 Sum_probs=31.9
Q ss_pred hheeeeeeeeeecCCccHHHHHHHHH--HHHHHHH-------HhcCCHHHHHHHHHHHHHHHhc
Q 029243 134 LGWVSTYIFRVSNKEMTYAQQLRDYE--DKVMQKR-------LEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 134 vgWv~SYlfRV~tk~MTY~qQ~kdYE--~avLQKR-------leeLtpEEle~L~aEIE~Ek~~ 188 (196)
++=+.+|+--.+++..+|.+ .++|+ ++.++|| =+...++.+++.++|+-+++.+
T Consensus 22 ~~~~~~~~s~l~~~~~i~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~~~~~~~~ 84 (168)
T PF01956_consen 22 IAILRGLISELLQKFLIDRK-MDKYQKRMKEFQKRYRELRKNGDFKKPKKLEKRQMELMEKQQE 84 (168)
T ss_pred HHHHHHHHHHHHhccccccc-cHHHHHHHHHHHHHHHHHHHcCCccCHHHHHHHHHHHHHHHHH
Confidence 44455555555566666665 33443 3444444 4456788888888888776543
No 45
>PF03457 HA: Helicase associated domain; InterPro: IPR005114 This short domain is found in multiple copies in bacterial helicase proteins. The domain is predicted to contain 3 alpha helices. The function of this domain may be to bind nucleic acid.; PDB: 2KTA_A.
Probab=49.41 E-value=24 Score=24.03 Aligned_cols=22 Identities=32% Similarity=0.385 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Q 029243 152 AQQLRDYEDKVMQKRLEGLTEAELEALIE 180 (196)
Q Consensus 152 ~qQ~kdYE~avLQKRleeLtpEEle~L~a 180 (196)
+.||+.|+. ..||||..++|.+
T Consensus 43 ~~qR~~~r~-------g~L~~er~~~L~~ 64 (68)
T PF03457_consen 43 NNQRRKYRK-------GKLTPERIERLDA 64 (68)
T ss_dssp HHHHHHHHH-------T---HHHHHHHHH
T ss_pred HHHHHHHHc-------CCCCHHHHHHHHc
Confidence 579999987 4488888777753
No 46
>PLN03094 Substrate binding subunit of ER-derived-lipid transporter; Provisional
Probab=48.40 E-value=26 Score=32.77 Aligned_cols=26 Identities=23% Similarity=0.399 Sum_probs=20.6
Q ss_pred CccccchhhhhhhHHHHHHHHHHHHh
Q 029243 84 PFGYTRKDVLLIGVGVTVIGYGLKSG 109 (196)
Q Consensus 84 pfgMtR~DVI~IGlgvfllGg~ly~g 109 (196)
-|||.|.-|--..+|+|++.+++..+
T Consensus 75 ~~~~~rrsvrEg~VGlfvL~gi~ll~ 100 (370)
T PLN03094 75 DFGFGKRSVWEGGVGLFLLSGAALLA 100 (370)
T ss_pred hcCCcchhHHHHHHHHHHHHHHHHHH
Confidence 39999999988888888877665443
No 47
>PRK11056 hypothetical protein; Provisional
Probab=47.75 E-value=30 Score=28.36 Aligned_cols=45 Identities=13% Similarity=0.112 Sum_probs=33.7
Q ss_pred hhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeee
Q 029243 93 LLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVST 139 (196)
Q Consensus 93 I~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~S 139 (196)
..++.+.|++|..+|..+..+=-. ---.|..-+++++++ +.|++.
T Consensus 65 P~~a~acFflG~f~ySA~vraeyP-eiGSNF~p~il~~~L-~~Wi~~ 109 (120)
T PRK11056 65 PGLAAACFFLGVFLYSAFVRAEYP-EIGSNFFPAVLSVIL-VFWIGR 109 (120)
T ss_pred hHHHHHHHHHHHHHHHHHHHhcCc-ccccchHHHHHHHHH-HHHHHH
Confidence 468899999999999997755432 233577778888855 899864
No 48
>PF08172 CASP_C: CASP C terminal; InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=47.61 E-value=28 Score=30.66 Aligned_cols=35 Identities=14% Similarity=0.289 Sum_probs=26.7
Q ss_pred HHHHHHHH--HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 152 AQQLRDYE--DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 152 ~qQ~kdYE--~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
..||.+|+ +.+||+.+. ....++.+|+.||+.-+.
T Consensus 85 tsQRDRFR~Rn~ELE~elr-~~~~~~~~L~~Ev~~L~~ 121 (248)
T PF08172_consen 85 TSQRDRFRQRNAELEEELR-KQQQTISSLRREVESLRA 121 (248)
T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH
Confidence 48999999 778888773 456788888888886443
No 49
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=46.64 E-value=5.1 Score=35.62 Aligned_cols=17 Identities=29% Similarity=0.550 Sum_probs=14.5
Q ss_pred cccchhhhhhhHHHHHHH
Q 029243 86 GYTRKDVLLIGVGVTVIG 103 (196)
Q Consensus 86 gMtR~DVI~IGlgvfllG 103 (196)
-|.|+-|++|+|++ .+|
T Consensus 223 ~l~~G~VVlIslAi-ALG 239 (281)
T PF12768_consen 223 KLSRGFVVLISLAI-ALG 239 (281)
T ss_pred cccceEEEEEehHH-HHH
Confidence 47899999999998 666
No 50
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=46.35 E-value=80 Score=20.48 Aligned_cols=28 Identities=25% Similarity=0.438 Sum_probs=22.6
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
...+-++..+||++|.+.-.++-++++.
T Consensus 32 ~~~~~~~W~~ls~~eK~~y~~~a~~~~~ 59 (66)
T cd01390 32 TKILGEKWKELSEEEKKKYEEKAEKDKE 59 (66)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 4567889999999999998887776654
No 51
>PF05961 Chordopox_A13L: Chordopoxvirus A13L protein; InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=46.19 E-value=21 Score=26.85 Aligned_cols=21 Identities=33% Similarity=0.352 Sum_probs=15.0
Q ss_pred hhhhhhhHHHHHHHHHHHHhh
Q 029243 90 KDVLLIGVGVTVIGYGLKSGL 110 (196)
Q Consensus 90 ~DVI~IGlgvfllGg~ly~gL 110 (196)
+|.|+|++.|.+.|+++|...
T Consensus 3 ~d~iLi~ICVaii~lIlY~iY 23 (68)
T PF05961_consen 3 GDFILIIICVAIIGLILYGIY 23 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 467778888877777776553
No 52
>PRK09509 fieF ferrous iron efflux protein F; Reviewed
Probab=46.12 E-value=41 Score=29.11 Aligned_cols=14 Identities=29% Similarity=0.489 Sum_probs=10.9
Q ss_pred CCCCccccchhhhh
Q 029243 81 TETPFGYTRKDVLL 94 (196)
Q Consensus 81 ~~~pfgMtR~DVI~ 94 (196)
++-|||+.|...+.
T Consensus 69 ~~~pyG~~r~E~l~ 82 (299)
T PRK09509 69 DEHTFGHGKAESLA 82 (299)
T ss_pred CCCCCccHHHHHHH
Confidence 45899999988653
No 53
>PF10439 Bacteriocin_IIc: Bacteriocin class II with double-glycine leader peptide; InterPro: IPR019493 Bacteriocins are proteinaceous toxins produced by bacteria to inhibit the growth of similar or closely related strains. The producer bacteria are protected from the effects of their own bacteriocins by production of a specific immunity protein which is co-transcribed with the genes encoding the bacteriocins, e.g. IPR015046 from INTERPRO. The bacteriocins are structurally more specific than their immunity-protein counterparts. Typically, production of the bacteriocin gene is from within an operon carrying up to 6 genes including a typical two-component regulatory system (R and H), a small peptide pheromone (C), and a dedicated ABC transporter (A and -B) as well as an immunity protein []. The ABC transporter is thought to recognise the N termini of both the pheromone and the bacteriocins and to transport these peptides across the cytoplasmic membrane, concurrent with cleavage at the conserved double-glycine motif. Cleaved extracellular C can then bind to the sensor kinase, H, resulting in activation of R and up-regulation of the entire gene cluster via binding to consensus sequences within each promoter []. It seems likely that the whole regulon is carried on a transmissible plasmid which is passed between closely related Firmicute species since many clinical isolates from different Firmicutes can produce at least two bacteriocins, and the same bacteriocins can be produced by different species. The proteins in this entry include amylovorin-L, lactacin-F and salivaricin CRL 1328, all of them class IIb two-peptide bacteriocins.
Probab=45.97 E-value=13 Score=26.29 Aligned_cols=14 Identities=36% Similarity=0.551 Sum_probs=12.2
Q ss_pred HHHhcCCHHHHHHH
Q 029243 165 KRLEGLTEAELEAL 178 (196)
Q Consensus 165 KRleeLtpEEle~L 178 (196)
++||+|++|||+..
T Consensus 3 ~~f~~L~~eeL~~I 16 (65)
T PF10439_consen 3 ENFEVLDEEELSSI 16 (65)
T ss_pred HHHHhcCHHHHHHh
Confidence 57999999999865
No 54
>cd03381 PAP2_glucose_6_phosphatase PAP2_like proteins, glucose-6-phosphatase subfamily. Glucose-6-phosphatase converts glucose-6-phosphate into free glucose and is active in the lumen of the endoplasmic reticulum, where it is bound to the membrane. The generation of free glucose is an important control point in metabolism, and stands at the end of gluconeogenesis and the release of glucose from glycogen. Deficiency of glucose-6-phosphatase leads to von Gierke's disease.
Probab=45.60 E-value=24 Score=30.77 Aligned_cols=32 Identities=28% Similarity=0.238 Sum_probs=27.1
Q ss_pred cchhhhhhhHHHHHHHHHHHHhhHHhCCCccc
Q 029243 88 TRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQ 119 (196)
Q Consensus 88 tR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~ 119 (196)
+++..+.+.+.++.+-.+.|.++..+|+|+..
T Consensus 160 ~~~~~~~~~~~l~~~a~~~y~~~~~~gvd~~w 191 (235)
T cd03381 160 SLKRYVLITFFLFGFALGFYLLLKWLGVDLLW 191 (235)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHhCCCchh
Confidence 56677888888888888889999999999974
No 55
>PF03965 Penicillinase_R: Penicillinase repressor; InterPro: IPR005650 Proteins in this entry are transcriptional regulators found in a variety of bacteria and a small number of archaea. Many are BlaI/MecI proteins which regulate resistance to penicillins (beta-lactams), though at least one protein (Q47839 from SWISSPROT) appears to be involved in the regulation of copper homeostasis []. BlaI regulators repress the expression of penicillin-degrading enzymes (penicillinases) until the cell encounters the antiobiotic, at which point repression ceases and penicillinase expression occurs, allowing cell growth []. MecI regulators repress the expression of MecA, a cell-wall biosynthetic enzyme not inhibited by penicillins at clinically achievable concentrations, until the presence of the antibiotic is detected []. At this point repression ends and MecA expression occurs which, together with the switching off of the penicillin-sensitive enzymes, allows the cell to grow despite the presence of antibiotic.; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 2G9W_A 2K4B_A 1XSD_A 1SD4_A 1SD7_A 1SD6_A 2P7C_B 1P6R_A 1OKR_B 2D45_B ....
Probab=44.76 E-value=71 Score=23.86 Aligned_cols=46 Identities=24% Similarity=0.400 Sum_probs=27.0
Q ss_pred eeeeeeecCCccHHHHH-HHHH--------HHHHHHHHh--cCCHHHHHHHHHHHHH
Q 029243 139 TYIFRVSNKEMTYAQQL-RDYE--------DKVMQKRLE--GLTEAELEALIEQVEE 184 (196)
Q Consensus 139 SYlfRV~tk~MTY~qQ~-kdYE--------~avLQKRle--eLtpEEle~L~aEIE~ 184 (196)
.|+|+..-.+..|..+. ++.= .+.+..=++ ++||+|++.|.+-|+|
T Consensus 59 ~~~Y~p~is~~e~~~~~~~~~l~~~~~gs~~~l~~~l~~~~~ls~~el~~L~~li~e 115 (115)
T PF03965_consen 59 AYVYSPLISREEYLAQELRQFLDRLFDGSIPQLVAALVESEELSPEELEELRKLIDE 115 (115)
T ss_dssp CEEEEESSSHHHHHHHHHHHHHHHHSTTHHHHHHHHHHHCT-S-HHHHHHHHHHHH-
T ss_pred ceEEEeCCcHHHHHHHHHHHHHHHHhCCCHHHHHHHHHhcCCCCHHHHHHHHHHHcC
Confidence 57788766666665533 3222 233333333 5999999999998875
No 56
>PF07047 OPA3: Optic atrophy 3 protein (OPA3); InterPro: IPR010754 OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity []. This family consists of several optic atrophy 3 (OPA3) proteins and related proteins from other eukaryotic species, the function is unknown.
Probab=44.67 E-value=23 Score=28.16 Aligned_cols=27 Identities=44% Similarity=0.667 Sum_probs=18.3
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
++.++.++++| +++++.|..++++.++
T Consensus 107 e~~~~~~l~~L-~~~i~~L~~~~~~~~~ 133 (134)
T PF07047_consen 107 EEELQERLEEL-EERIEELEEQVEKQQE 133 (134)
T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHhc
Confidence 66677777777 5666777777766543
No 57
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=44.50 E-value=80 Score=20.75 Aligned_cols=28 Identities=32% Similarity=0.462 Sum_probs=23.2
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
...+.+...+||++|.+.-.++-++++.
T Consensus 32 ~~~~~~~W~~l~~~eK~~y~~~a~~~~~ 59 (69)
T PF00505_consen 32 SKILAQMWKNLSEEEKAPYKEEAEEEKE 59 (69)
T ss_dssp HHHHHHHHHCSHHHHHHHHHHHHHHHHH
T ss_pred hhhHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence 4567888999999999999888877654
No 58
>PF10550 Toxin_36: Conantokin-G mollusc-toxin; InterPro: IPR005918 The conantokins are a family of neuroactive peptides found in the venoms of fish-hunting cone snails. They possess a relatively high number of residues (4-5) of the non-standard amino acid gamma-carboxyglutamic acid (Gla), which is generated by the post-translational modification of glutamate (Glu) residues. Conantokins are the only naturally produced peptides known to be N-methyl-D-aspartate (NMDA) receptor antagonists and show therapeutic promise in treating conditions associated with NMDA receptor dysfunction. In animal models they have exhibited anticonvulsant and anti-Parkinsonian properties and have provided neuroprotection within therapeutically acceptable times following transient focal brain ischemia [, , , ]. Upon binding of Ca2+ to Gla, conantokin undergoes a conformational transition from a distorted curvilinear 3(10) helix to a linear alpha-helix. The binding of Ca2+ to conantokin leads to the exposure of a hydrophobic region on the opposite face of the helix []. Conantokins share relatively few sequence elements, which include include sequence identity at the first four residues, homologous positioning of the two most C-terminal Gla residues, and an Arg preceding the most C-terminal Gla []. The conantokin family is currently known to include: Conotoxin G from Conus geographus (Geography cone) (Nubecula geographus). Conantokin-L from Conus lynceus (Lynceus cone). Conantokin-R from Conus radiatus (Rayed cone). Conantokin-T from Conus tulipa (Fish-hunting cone snail) (Tulip cone). ; PDB: 1ONT_A.
Probab=44.50 E-value=27 Score=19.56 Aligned_cols=14 Identities=21% Similarity=0.432 Sum_probs=10.5
Q ss_pred HHHHHHHHHHHHHH
Q 029243 172 EAELEALIEQVEEE 185 (196)
Q Consensus 172 pEEle~L~aEIE~E 185 (196)
+||.+++.+++..|
T Consensus 2 eee~~km~~~lare 15 (15)
T PF10550_consen 2 EEEVAKMAAELARE 15 (15)
T ss_dssp HHHHHHHHHHH-HH
T ss_pred hHHHHHHHHHHhcC
Confidence 68999999988654
No 59
>PF12072 DUF3552: Domain of unknown function (DUF3552); InterPro: IPR022711 This presumed domain is functionally uncharacterised. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with PF00013 from PFAM, PF01966 from PFAM. This domain has a single completely conserved residue A that may be functionally important. ; GO: 0008663 2',3'-cyclic-nucleotide 2'-phosphodiesterase activity
Probab=44.34 E-value=13 Score=30.95 Aligned_cols=12 Identities=25% Similarity=0.404 Sum_probs=5.9
Q ss_pred HHHHHHHHHHHH
Q 029243 173 AELEALIEQVEE 184 (196)
Q Consensus 173 EEle~L~aEIE~ 184 (196)
+|..++..++|+
T Consensus 60 ee~~~~r~~~E~ 71 (201)
T PF12072_consen 60 EEAQKLRQELER 71 (201)
T ss_pred HHHHHHHHHHHH
Confidence 444455555544
No 60
>PF15361 RIC3: Resistance to inhibitors of cholinesterase homologue 3
Probab=44.27 E-value=8.8 Score=31.52 Aligned_cols=23 Identities=39% Similarity=0.602 Sum_probs=11.6
Q ss_pred HHHHHHHHHHHHHhcCCHHHHHHH
Q 029243 155 LRDYEDKVMQKRLEGLTEAELEAL 178 (196)
Q Consensus 155 ~kdYE~avLQKRleeLtpEEle~L 178 (196)
.-+||-+.||+||++ ||+.+++|
T Consensus 128 ~~~~eL~qLq~rL~q-TE~~m~ki 150 (152)
T PF15361_consen 128 ITDYELAQLQERLAQ-TERAMEKI 150 (152)
T ss_pred chHHHHHHHHHHHHH-HHHHHHHH
Confidence 445555555555554 44444443
No 61
>COG1026 Predicted Zn-dependent peptidases, insulinase-like [General function prediction only]
Probab=42.68 E-value=64 Score=34.19 Aligned_cols=49 Identities=27% Similarity=0.411 Sum_probs=42.2
Q ss_pred eeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 140 YIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 140 YlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
|+.=.+.-+-++.+|+++=|...|++|=.+|++|++++|.++..+-+..
T Consensus 450 ~~~v~~~Ps~~~~~~~ekee~e~L~~~~~~l~de~~~ki~~~~~~lke~ 498 (978)
T COG1026 450 YVTVIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKER 498 (978)
T ss_pred cEEEEEecChHHHHHHHHHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHh
Confidence 5555677788999999999999999999999999999999999874433
No 62
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=42.64 E-value=89 Score=19.94 Aligned_cols=29 Identities=28% Similarity=0.446 Sum_probs=24.2
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
...+.++...||++|.+.-.++-++++..
T Consensus 32 ~~~~~~~W~~l~~~~k~~y~~~a~~~~~~ 60 (66)
T cd00084 32 SKILGEMWKSLSEEEKKKYEEKAEKDKER 60 (66)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHHHH
Confidence 46788899999999999999888876543
No 63
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=42.42 E-value=47 Score=25.43 Aligned_cols=25 Identities=20% Similarity=0.383 Sum_probs=22.3
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
+..|-..|++|||++++++++.++.
T Consensus 89 ~~~ll~~~~~L~~~~~~~~l~~l~~ 113 (135)
T PRK09706 89 QKELLELFDALPESEQDAQLSEMRA 113 (135)
T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHH
Confidence 5678899999999999999998875
No 64
>PF07895 DUF1673: Protein of unknown function (DUF1673); InterPro: IPR012874 This family contains hypothetical proteins of unknown function found in Methanosarcina acetivorans and Methanosarcina mazei.
Probab=42.16 E-value=44 Score=28.40 Aligned_cols=53 Identities=26% Similarity=0.428 Sum_probs=27.4
Q ss_pred hhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHH
Q 029243 90 KDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYED 160 (196)
Q Consensus 90 ~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~ 160 (196)
...+++...++++= ++...+..+... ..|++|+.+. +..|++. +.+|.++|++
T Consensus 57 ~~~Ll~~~f~T~~~-~L~i~~~gin~~---------~~ll~g~~~~-L~~~i~~-------wk~~~~~~d~ 109 (205)
T PF07895_consen 57 NRILLINIFFTLVY-FLVISQLGINLH---------LFLLAGLILS-LYLYIFS-------WKKQMIRYDD 109 (205)
T ss_pred ceeeHHHHHHHHHH-HHHHHHHHHHHH---------HHHHHHHHHH-HHHHHHH-------HHHHHHHHHH
Confidence 34555555554433 233333333332 3344444444 5555555 8899999984
No 65
>KOG1962 consensus B-cell receptor-associated protein and related proteins [Defense mechanisms]
Probab=42.14 E-value=42 Score=29.68 Aligned_cols=45 Identities=24% Similarity=0.444 Sum_probs=29.0
Q ss_pred eecCCccHHHHHHHHH--------HHHHHHHHhcCC---------HHHHHHHHHHHHHHHhc
Q 029243 144 VSNKEMTYAQQLRDYE--------DKVMQKRLEGLT---------EAELEALIEQVEEEKRR 188 (196)
Q Consensus 144 V~tk~MTY~qQ~kdYE--------~avLQKRleeLt---------pEEle~L~aEIE~Ek~~ 188 (196)
|+..=|||..++..++ ++.+++.++.-+ .||+++|.+|+++.+.+
T Consensus 112 vI~R~~~ll~~l~~l~~~~~~~~~~~~lk~~~~~~~~~~~~~~~~~~~~~kL~~el~~~~~~ 173 (216)
T KOG1962|consen 112 VIRRLHTLLRELATLRANEKAMKENEALKKQLENSSKLEEENDKLKADLEKLETELEKKQKK 173 (216)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccchhhhHHHHHhhHHHHHHHHHHHHHH
Confidence 4556678888888777 355666666644 35667777777664443
No 66
>PF04855 SNF5: SNF5 / SMARCB1 / INI1; InterPro: IPR006939 SNF5 is a component of the yeast SWI/SNF complex, which is an ATP-dependent nucleosome-remodelling complex that regulates the transcription of a subset of yeast genes. SNF5 is a key component of all SWI/SNF-class complexes characterised so far []. This family consists of the conserved region of SNF5, including a direct repeat motif. SNF5 is essential for the assembly promoter targeting and chromatin remodelling activity of the SWI-SNF complex []. SNF5 is also known as SMARCB1, for SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily b, member 1, and also INI1 for integrase interactor 1. Loss-of function mutations in SNF5 are thought to contribute to oncogenesis in malignant rhabdoid tumours (MRTs) [].; GO: 0006338 chromatin remodeling, 0000228 nuclear chromosome
Probab=41.79 E-value=20 Score=31.39 Aligned_cols=26 Identities=31% Similarity=0.337 Sum_probs=20.7
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~E 185 (196)
.+.-.=+++.||+||+|++..|-|.+
T Consensus 217 ~~~w~P~le~Ls~eEier~e~ereR~ 242 (244)
T PF04855_consen 217 ADEWTPRLEELSPEEIERREKERERE 242 (244)
T ss_pred hhhcCCchhhCCHHHHHHHHHHhhhh
Confidence 44456789999999999998877654
No 67
>PF13902 R3H-assoc: R3H-associated N-terminal domain
Probab=40.38 E-value=50 Score=25.65 Aligned_cols=39 Identities=21% Similarity=0.481 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhccc
Q 029243 151 YAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKRRLA 190 (196)
Q Consensus 151 Y~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~~~~ 190 (196)
| ++.-||..+.+=..|=+.++||++++++..+++.....
T Consensus 63 F-~~~~~~~~~~~Wd~f~~~~~eeQ~~~l~~~~~~~~~~~ 101 (131)
T PF13902_consen 63 F-ELCSDYQNAQLWDDFRNIDEEEQERLLEKLEKKEKKRK 101 (131)
T ss_pred H-HHhccHHHHHHHHHHhhcCHHHHHHHHHHhhhcccccc
Confidence 5 66667777777778888999999999999888655444
No 68
>PRK00306 50S ribosomal protein L29; Reviewed
Probab=40.23 E-value=38 Score=23.95 Aligned_cols=22 Identities=27% Similarity=0.374 Sum_probs=17.7
Q ss_pred HHHhcCCHHHHHHHHHHHHHHH
Q 029243 165 KRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~Ek 186 (196)
+-+.+||++|+++...++.+|=
T Consensus 4 ~elr~ls~~eL~~~l~~lkkeL 25 (66)
T PRK00306 4 KELRELSVEELNEKLLELKKEL 25 (66)
T ss_pred HHHhhCCHHHHHHHHHHHHHHH
Confidence 3577899999999988887753
No 69
>PF09928 DUF2160: Predicted small integral membrane protein (DUF2160); InterPro: IPR018678 The members of this family of hypothetical prokaryotic proteins have no known function. It is thought that they are transmembrane proteins, but their function has not been inferred yet.
Probab=40.03 E-value=60 Score=25.40 Aligned_cols=32 Identities=25% Similarity=0.186 Sum_probs=22.8
Q ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhHHhCCCccc
Q 029243 83 TPFGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQ 119 (196)
Q Consensus 83 ~pfgMtR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~ 119 (196)
.|+--||+|=+-||+.. ..|.-|-.+|+-+.+
T Consensus 38 Lpi~TTRGDRLFIsLL~-----sA~I~L~Wlgl~~~~ 69 (88)
T PF09928_consen 38 LPIETTRGDRLFISLLG-----SAFIHLAWLGLVGPS 69 (88)
T ss_pred ecccCCCcchhHHHHHH-----HHHHHHHHHHhcCcc
Confidence 88889999999999765 344445566665533
No 70
>PF09991 DUF2232: Predicted membrane protein (DUF2232); InterPro: IPR018710 This family of bacterial and eukaryotic proteins has no known fucntion; however this signature belongs to a Pfam Gx transporter clan.
Probab=39.87 E-value=1.6e+02 Score=24.28 Aligned_cols=17 Identities=18% Similarity=0.483 Sum_probs=12.6
Q ss_pred cCCHHHHHHHHHHHHHH
Q 029243 169 GLTEAELEALIEQVEEE 185 (196)
Q Consensus 169 eLtpEEle~L~aEIE~E 185 (196)
++|+|+.+++++.+++.
T Consensus 137 g~~~~~~~~~~~~~~~~ 153 (290)
T PF09991_consen 137 GLPQDQVDQLQKNLQQI 153 (290)
T ss_pred CCCchhHHHHHHHHHHH
Confidence 56888887777777765
No 71
>smart00398 HMG high mobility group.
Probab=39.55 E-value=1e+02 Score=19.85 Aligned_cols=28 Identities=25% Similarity=0.334 Sum_probs=22.9
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
...+.++..+||++|.+...+.-++++.
T Consensus 33 ~~~~~~~W~~l~~~ek~~y~~~a~~~~~ 60 (70)
T smart00398 33 SKKLGERWKLLSEEEKAPYEEKAKKDKE 60 (70)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 4567889999999999999888777654
No 72
>PF09972 DUF2207: Predicted membrane protein (DUF2207); InterPro: IPR018702 This domain has no known function.
Probab=38.29 E-value=40 Score=29.68 Aligned_cols=16 Identities=19% Similarity=0.644 Sum_probs=8.0
Q ss_pred hhhhhHHHHHHHHHHH
Q 029243 92 VLLIGVGVTVIGYGLK 107 (196)
Q Consensus 92 VI~IGlgvfllGg~ly 107 (196)
.+.++++++++++++.
T Consensus 400 ~~~~~~i~~i~~~~~~ 415 (511)
T PF09972_consen 400 LIILGIILLILGFILL 415 (511)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4455555555554444
No 73
>KOG2224 consensus Uncharacterized conserved protein, contains TBC domain [Signal transduction mechanisms; General function prediction only]
Probab=38.17 E-value=34 Score=34.33 Aligned_cols=42 Identities=26% Similarity=0.478 Sum_probs=29.8
Q ss_pred eeeeeecCCccHH-------HHHHHHHHHHHHHHHhcCCHHHHHHHHHHH
Q 029243 140 YIFRVSNKEMTYA-------QQLRDYEDKVMQKRLEGLTEAELEALIEQV 182 (196)
Q Consensus 140 YlfRV~tk~MTY~-------qQ~kdYE~avLQKRleeLtpEEle~L~aEI 182 (196)
+|.|--+-.-||- +-||.||+ .-||||-+|||||..+.-+-|
T Consensus 447 fllk~ys~est~edr~al~~~krkey~e-iqqkrlysmspeeh~~fwknv 495 (781)
T KOG2224|consen 447 FLLKCYSFESTFEDRAALMDIKRKEYEE-IQQKRLYSMSPEEHIAFWKNV 495 (781)
T ss_pred HHHHHhccccchhHHHHHHHHHHHHHHH-HHHHHhhcCCHHHHHHHHhhe
Confidence 4555555566653 45677775 459999999999998876654
No 74
>PTZ00246 proteasome subunit alpha; Provisional
Probab=38.09 E-value=25 Score=29.69 Aligned_cols=23 Identities=26% Similarity=0.385 Sum_probs=19.5
Q ss_pred HhcCCHHHHHHHHHHHHHHHhcc
Q 029243 167 LEGLTEAELEALIEQVEEEKRRL 189 (196)
Q Consensus 167 leeLtpEEle~L~aEIE~Ek~~~ 189 (196)
|..|+|||++++++++++++...
T Consensus 226 ~~~l~~~ei~~~l~~~~~~~~~~ 248 (253)
T PTZ00246 226 QKMLSEKEIAELLKKVTQEYAKE 248 (253)
T ss_pred eEECCHHHHHHHHHHHhhhhhhh
Confidence 67899999999999999887443
No 75
>PRK09848 glucuronide transporter; Provisional
Probab=38.06 E-value=1.6e+02 Score=25.64 Aligned_cols=19 Identities=16% Similarity=0.532 Sum_probs=14.5
Q ss_pred cCCHHHHHHHHHHHHHHHh
Q 029243 169 GLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 169 eLtpEEle~L~aEIE~Ek~ 187 (196)
.|++|+.+++++|+++.+.
T Consensus 428 ~l~~~~~~~~~~~l~~~~~ 446 (448)
T PRK09848 428 PLTDKKFKEIVVEIDNRKK 446 (448)
T ss_pred eCCHHHHHHHHHHHHHhhh
Confidence 5777788888888887553
No 76
>PF01638 HxlR: HxlR-like helix-turn-helix; InterPro: IPR002577 The hxlR-type HTH domain is a domain of ~90-100 amino acids present in putative transcription regulators with a winged helix-turn-helix (wHTH) structure. The domain is named after Bacillus subtilis hxlR, a transcription activator of the hxlAB operon involved in the detoxification of formaldehyde []. The hxlR-type domain forms the core of putative transcription regulators and of hypothetical proteins occurring in eubacteria as well as in archaea. The sequence and structure of hxlR-type proteins show similarities with the marR-type wHTH []. The crystal structure of ytfH resembles the DNA-binding domains of winged helix proteins, containing a three helix (H) bundle and a three-stranded antiparallel beta-sheet (B) in the topology: H1-H2-B1-H3-H4-B2-B3-H5-H6. This topology corresponds with that of the marR-type DNA-binding domain, wherein helices 3 and 4 comprise the helix-turn-helix motif and the beta-sheet is called the wing. ; PDB: 2F2E_B 3DF8_A 1Z7U_B 1YYV_A 4A5M_D 4A5N_B 2FSW_A 2HZT_D.
Probab=36.68 E-value=19 Score=26.02 Aligned_cols=41 Identities=22% Similarity=0.490 Sum_probs=31.3
Q ss_pred eeeeeeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 136 WVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 136 Wv~SYlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~E 185 (196)
|...-|..+..+.|+|++ |+++++.+|+.-|.+-++++++.
T Consensus 6 W~~~IL~~l~~g~~rf~e---------l~~~l~~is~~~L~~~L~~L~~~ 46 (90)
T PF01638_consen 6 WTLLILRALFQGPMRFSE---------LQRRLPGISPKVLSQRLKELEEA 46 (90)
T ss_dssp THHHHHHHHTTSSEEHHH---------HHHHSTTS-HHHHHHHHHHHHHT
T ss_pred CHHHHHHHHHhCCCcHHH---------HHHhcchhHHHHHHHHHHHHHHc
Confidence 555555666778999875 78889999999998888888873
No 77
>PF11239 DUF3040: Protein of unknown function (DUF3040); InterPro: IPR021401 Some members in this family of proteins with unknown function are annotated as membrane proteins however this cannot be confirmed.
Probab=36.34 E-value=35 Score=24.76 Aligned_cols=16 Identities=31% Similarity=0.497 Sum_probs=12.0
Q ss_pred cCCHHHHHHHHHHHHHH
Q 029243 169 GLTEAELEALIEQVEEE 185 (196)
Q Consensus 169 eLtpEEle~L~aEIE~E 185 (196)
.||++|+. .++|||++
T Consensus 2 ~LSe~E~r-~L~eiEr~ 17 (82)
T PF11239_consen 2 PLSEHEQR-RLEEIERQ 17 (82)
T ss_pred CCCHHHHH-HHHHHHHH
Confidence 37889977 66788873
No 78
>PF00645 zf-PARP: Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region; InterPro: IPR001510 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents PARP (Poly(ADP) polymerase) type zinc finger domains. NAD(+) ADP-ribosyltransferase(2.4.2.30 from EC) [, ] is a eukaryotic enzyme that catalyses the covalent attachment of ADP-ribose units from NAD(+) to various nuclear acceptor proteins. This post-translational modification of nuclear proteins is dependent on DNA. It appears to be involved in the regulation of various important cellular processes such as differentiation, proliferation and tumour transformation as well as in the regulation of the molecular events involved in the recovery of the cell from DNA damage. Structurally, NAD(+) ADP-ribosyltransferase consists of three distinct domains: an N-terminal zinc-dependent DNA-binding domain, a central automodification domain and a C-terminal NAD-binding domain. The DNA-binding region contains a pair of PARP-type zinc finger domains which have been shown to bind DNA in a zinc-dependent manner. The PARP-type zinc finger domains seem to bind specifically to single-stranded DNA and to act as a DNA nick sensor. DNA ligase III [] contains, in its N-terminal section, a single copy of a zinc finger highly similar to those of PARP. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding, 0008270 zinc ion binding; PDB: 1UW0_A 3OD8_D 3ODA_A 4AV1_A 2DMJ_A 4DQY_D 2L30_A 2CS2_A 2L31_A 3ODE_B ....
Probab=36.05 E-value=20 Score=25.37 Aligned_cols=18 Identities=17% Similarity=0.326 Sum_probs=13.4
Q ss_pred HHhcCCHHHHHHHHHHHH
Q 029243 166 RLEGLTEAELEALIEQVE 183 (196)
Q Consensus 166 RleeLtpEEle~L~aEIE 183 (196)
-|++|++|+++++++.||
T Consensus 65 G~~~L~~~Dq~~i~~~i~ 82 (82)
T PF00645_consen 65 GFDELKPEDQEKIRKLIE 82 (82)
T ss_dssp TCCCS-HHHHHHHHHHHS
T ss_pred ChHHCCHHHHHHHHHHhC
Confidence 367888999988888765
No 79
>PRK13453 F0F1 ATP synthase subunit B; Provisional
Probab=35.93 E-value=49 Score=26.82 Aligned_cols=48 Identities=15% Similarity=0.035 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeee-eeeecCCccHHHHHHHHH
Q 029243 98 GVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYI-FRVSNKEMTYAQQLRDYE 159 (196)
Q Consensus 98 gvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYl-fRV~tk~MTY~qQ~kdYE 159 (196)
-+|++|. +.|+|.. -++.|++-++ ++.|+..|+ |+-+ +-++++|+++=
T Consensus 6 ~~~~~~~-------~~~~~~~--t~~~~iInFl--iL~~lL~~~l~~pi---~~~l~~R~~~I 54 (173)
T PRK13453 6 NLFVLGA-------AGGVEWG--TVIVTVLTFI--VLLALLKKFAWGPL---KDVMDKRERDI 54 (173)
T ss_pred HHHHHHh-------hcCCCHH--HHHHHHHHHH--HHHHHHHHHHHHHH---HHHHHHHHHHH
Confidence 3566663 4577743 2445666666 366776554 3433 34677777765
No 80
>COG2863 Cytochrome c553 [Energy production and conversion]
Probab=35.44 E-value=59 Score=26.37 Aligned_cols=35 Identities=26% Similarity=0.468 Sum_probs=30.1
Q ss_pred HHHHHHHH-----HHHHHHHHhcCCHHHHHHHHHHHHHHH
Q 029243 152 AQQLRDYE-----DKVMQKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 152 ~qQ~kdYE-----~avLQKRleeLtpEEle~L~aEIE~Ek 186 (196)
-+|+++|+ .-+|......|++|+++.|-+=++..+
T Consensus 62 ~~~L~a~k~g~r~~~vM~~~a~~LsD~Di~~lAa~~a~~~ 101 (121)
T COG2863 62 EKQLKAYKDGKRPGPVMNAIASGLSDEDIADLAAYYAAQK 101 (121)
T ss_pred HHHHHHHHcCCCCcchHHHHHHhCCHHHHHHHHHHHHhCC
Confidence 36999999 568999999999999999988887643
No 81
>COG1896 Predicted hydrolases of HD superfamily [General function prediction only]
Probab=35.05 E-value=43 Score=28.18 Aligned_cols=40 Identities=13% Similarity=0.164 Sum_probs=30.7
Q ss_pred eeecCCcc---------HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 143 RVSNKEMT---------YAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 143 RV~tk~MT---------Y~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
=+++|+|+ +.+..+..+++..+.++.. |+|+..+-.|.++
T Consensus 75 E~~~GDi~tp~k~~~~~~~~~~~e~e~~~~~~~~~~--p~e~~~~~~~~~~ 123 (193)
T COG1896 75 EALTGDIPTPVKYARAGLYKEEEEAEEAAIHLLFGL--PEELLELFREYEK 123 (193)
T ss_pred HHHhCCCCCchhhhcchHHHHHHHHHHHHHHcccCC--cHHHHHHHHHHHc
Confidence 34678888 3455555667888888877 9999999999987
No 82
>PLN02220 delta-9 acyl-lipid desaturase
Probab=34.69 E-value=65 Score=29.24 Aligned_cols=21 Identities=24% Similarity=0.355 Sum_probs=11.6
Q ss_pred CCCCCCCCCccc---cchhhhhhh
Q 029243 76 DSSDQTETPFGY---TRKDVLLIG 96 (196)
Q Consensus 76 ~~~~~~~~pfgM---tR~DVI~IG 96 (196)
.+...+|.+|.- ++.|++-+.
T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~ 37 (299)
T PLN02220 14 KAVRKEKRAFFFRKWTRLDVVRAS 37 (299)
T ss_pred cccccccchhhhhccceehhHHHH
Confidence 344456666665 556655543
No 83
>smart00347 HTH_MARR helix_turn_helix multiple antibiotic resistance protein.
Probab=34.57 E-value=97 Score=20.93 Aligned_cols=88 Identities=19% Similarity=0.283 Sum_probs=52.5
Q ss_pred CCccccchhhhhhhHHHHHHHHHHHH-hhHHhCCCcccchhHHHHHHHhhhhhheeeee---------eeeeecCCccHH
Q 029243 83 TPFGYTRKDVLLIGVGVTVIGYGLKS-GLELFGVDPLQAGNVVEVIVVLGLTLGWVSTY---------IFRVSNKEMTYA 152 (196)
Q Consensus 83 ~pfgMtR~DVI~IGlgvfllGg~ly~-gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SY---------lfRV~tk~MTY~ 152 (196)
..||++..+...+-+..- .|..-.. .-+.++++..+.--...-+.= -+|+-.+ .+++-.+-..+.
T Consensus 4 ~~~~l~~~~~~il~~l~~-~~~~~~~~la~~~~~s~~~i~~~l~~L~~----~g~v~~~~~~~~~r~~~~~lT~~g~~~~ 78 (101)
T smart00347 4 KPLGLTPTQFLVLRILYE-EGPLSVSELAKRLGVSPSTVTRVLDRLEK----KGLIRRLPSPEDRRSVLVSLTEEGRELI 78 (101)
T ss_pred cccCCCHHHHHHHHHHHH-cCCcCHHHHHHHHCCCchhHHHHHHHHHH----CCCeEecCCCCCCCeEEEEECHhHHHHH
Confidence 467888777665554442 2322222 235667665444333333333 4555433 455555667788
Q ss_pred HHHHHHHHHHHHHHHhcCCHHHH
Q 029243 153 QQLRDYEDKVMQKRLEGLTEAEL 175 (196)
Q Consensus 153 qQ~kdYE~avLQKRleeLtpEEl 175 (196)
++.++--....+..++.++++|+
T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~e~ 101 (101)
T smart00347 79 EELLEARHETLAELLAGLTAEEQ 101 (101)
T ss_pred HHHHHHHHHHHHHHHhcCCcccC
Confidence 88888888888888899988873
No 84
>PRK12585 putative monovalent cation/H+ antiporter subunit G; Reviewed
Probab=34.34 E-value=63 Score=28.47 Aligned_cols=16 Identities=25% Similarity=0.409 Sum_probs=9.8
Q ss_pred HHHHHHHHHHHHHHHH
Q 029243 151 YAQQLRDYEDKVMQKR 166 (196)
Q Consensus 151 Y~qQ~kdYE~avLQKR 166 (196)
-..|++.|++..+.+|
T Consensus 103 ~~d~~~~~~k~~~~~~ 118 (197)
T PRK12585 103 IRDQLRSVKKDDIKKK 118 (197)
T ss_pred hHHHHHHHHhhhhhhc
Confidence 3478888885544433
No 85
>PRK14748 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=34.24 E-value=47 Score=21.45 Aligned_cols=25 Identities=8% Similarity=0.066 Sum_probs=17.6
Q ss_pred ccchhhhhhhHHHHHHHHHHHHhhH
Q 029243 87 YTRKDVLLIGVGVTVIGYGLKSGLE 111 (196)
Q Consensus 87 MtR~DVI~IGlgvfllGg~ly~gLq 111 (196)
|+-.-+.+|-+.+.++|++.|..+.
T Consensus 1 Ms~~vi~G~ilv~lLlgYLvyALi~ 25 (29)
T PRK14748 1 MSAGVITGVLLVFLLLGYLVYALIN 25 (29)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHhC
Confidence 4445566777778888888887654
No 86
>PRK06474 hypothetical protein; Provisional
Probab=33.88 E-value=44 Score=27.50 Aligned_cols=15 Identities=33% Similarity=0.558 Sum_probs=14.1
Q ss_pred CCHHHHHHHHHHHHH
Q 029243 170 LTEAELEALIEQVEE 184 (196)
Q Consensus 170 LtpEEle~L~aEIE~ 184 (196)
|||||++.|++|+.+
T Consensus 136 Lt~ee~~el~~el~~ 150 (178)
T PRK06474 136 LDEEEFEEFQSELNE 150 (178)
T ss_pred cCHHHHHHHHHHHHH
Confidence 899999999999987
No 87
>KOG2568 consensus Predicted membrane protein [Function unknown]
Probab=33.41 E-value=49 Score=32.66 Aligned_cols=80 Identities=14% Similarity=0.079 Sum_probs=58.0
Q ss_pred cchhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHH-hhhhhheeeeeeeeeecCCccHHHHHHHHHHHHHHHH
Q 029243 88 TRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVV-LGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKR 166 (196)
Q Consensus 88 tR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLV-lGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~avLQKR 166 (196)
+=-.|++||++.|++- ......+..|-.+-.+|.|-+...+ +.++-.-..-.+|+-+++.|.+-+=+|++=+..|=|+
T Consensus 305 ~l~rv~~ig~~~~i~s-~i~~l~~~~g~~se~~~~~~lf~~ip~ai~d~~f~~wIF~SL~~Tlk~Lr~rRn~vKl~lYr~ 383 (518)
T KOG2568|consen 305 TLLRVCQIGVIYFIAS-EILGLARVIGNISELSSLLILFAALPLAILDAAFIYWIFISLAKTLKKLRLRRNIVKLSLYRK 383 (518)
T ss_pred HHHHHHHHhHHHHHHH-HHHHHHHHhcCcccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456899999998877 4555688999999999986654322 2222222233468899999999999999988888888
Q ss_pred Hh
Q 029243 167 LE 168 (196)
Q Consensus 167 le 168 (196)
|-
T Consensus 384 F~ 385 (518)
T KOG2568|consen 384 FT 385 (518)
T ss_pred HH
Confidence 74
No 88
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=33.27 E-value=40 Score=32.96 Aligned_cols=28 Identities=18% Similarity=0.176 Sum_probs=16.8
Q ss_pred eecCCccHHHHHHHHH-----HHHHHHHHhcCC
Q 029243 144 VSNKEMTYAQQLRDYE-----DKVMQKRLEGLT 171 (196)
Q Consensus 144 V~tk~MTY~qQ~kdYE-----~avLQKRleeLt 171 (196)
|+|..++=+-+....+ .++|||+|++|.
T Consensus 57 vV~~~FddkVnqSALteqQ~kasELEKqLaaLr 89 (475)
T PRK13729 57 VVDTTFDDKVRQHATTEMQVTAAQMQKQYEEIR 89 (475)
T ss_pred eecchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566665555544444 456778887774
No 89
>PF06945 DUF1289: Protein of unknown function (DUF1289); InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=33.14 E-value=63 Score=21.96 Aligned_cols=23 Identities=9% Similarity=0.378 Sum_probs=19.3
Q ss_pred HHHhcCCHHHHHHHHHHHHHHHh
Q 029243 165 KRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~Ek~ 187 (196)
+.+..|+++|..++++++.+..+
T Consensus 28 ~~W~~~s~~er~~i~~~l~~R~~ 50 (51)
T PF06945_consen 28 RDWKSMSDDERRAILARLRARRA 50 (51)
T ss_pred HHHhhCCHHHHHHHHHHHHHHhc
Confidence 46778999999999999987654
No 90
>PHA03049 IMV membrane protein; Provisional
Probab=33.13 E-value=44 Score=25.16 Aligned_cols=20 Identities=25% Similarity=0.358 Sum_probs=12.0
Q ss_pred hhhhhhhHHHHHHHHHHHHh
Q 029243 90 KDVLLIGVGVTVIGYGLKSG 109 (196)
Q Consensus 90 ~DVI~IGlgvfllGg~ly~g 109 (196)
.|++++++.|-+.|.++|..
T Consensus 3 ~d~~l~iICVaIi~lIvYgi 22 (68)
T PHA03049 3 GDIILVIICVVIIGLIVYGI 22 (68)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 35666666666666555544
No 91
>cd00427 Ribosomal_L29_HIP Ribosomal L29 protein/HIP. L29 is a protein of the large ribosomal Subunit. A homolog, called heparin/heparan sulfate interacting protein (HIP), has also been identified in mammals. L29 is located on the surface of the large ribosomal subunit, where it participates in forming a protein ring that surrounds the polypeptide exit channel, providing structural support for the ribosome. L29 is involved in forming the translocon binding site, along with L19, L22, L23, L24, and L31e. In addition, L29 and L23 form the interaction site for trigger factor (TF) on the ribosomal surface, adjacent to the exit tunnel. L29 forms numerous interactions with L23 and with the 23S rRNA. In some eukaryotes, L29 is referred to as L35, which is distinct from L35 found in bacteria and some eukaryotes (primarily plastids and mitochondria). The mammalian homolog, HIP, is found on the surface of many tissues and cell lines. It is believed to play a role in cell adhesion and modulat
Probab=33.13 E-value=48 Score=22.76 Aligned_cols=20 Identities=20% Similarity=0.431 Sum_probs=16.0
Q ss_pred HhcCCHHHHHHHHHHHHHHH
Q 029243 167 LEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 167 leeLtpEEle~L~aEIE~Ek 186 (196)
+.+||++|+++.+.++.+|-
T Consensus 3 ir~ls~~eL~~~l~~l~~el 22 (57)
T cd00427 3 LREKSDEELQEKLDELKKEL 22 (57)
T ss_pred HHHCCHHHHHHHHHHHHHHH
Confidence 56889999988888887653
No 92
>KOG4171 consensus Adenylate/guanylate kinase [Nucleotide transport and metabolism]
Probab=32.93 E-value=42 Score=34.11 Aligned_cols=28 Identities=32% Similarity=0.528 Sum_probs=25.1
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
+..|-+++|++ .|+||+..+++|+||++
T Consensus 385 q~el~~~lE~~-~~~Le~~~~~Le~EKkk 412 (671)
T KOG4171|consen 385 QLELNLELEKL-KEKLEKMTRELEEEKKK 412 (671)
T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHhh
Confidence 56788999999 99999999999999875
No 93
>PF13858 DUF4199: Protein of unknown function (DUF4199)
Probab=32.60 E-value=1.3e+02 Score=23.14 Aligned_cols=18 Identities=11% Similarity=0.331 Sum_probs=8.8
Q ss_pred eeeeeeee--eecCCccHHH
Q 029243 136 WVSTYIFR--VSNKEMTYAQ 153 (196)
Q Consensus 136 Wv~SYlfR--V~tk~MTY~q 153 (196)
+.+..-+| ..++.|||.|
T Consensus 46 ~~~i~~~R~~~~~g~isf~~ 65 (163)
T PF13858_consen 46 YFAIRRYRKKYNGGFISFGQ 65 (163)
T ss_pred HHHHHHHHHHccCCCeeHHH
Confidence 33444444 3455666655
No 94
>COG5478 Predicted small integral membrane protein [Function unknown]
Probab=32.55 E-value=1.1e+02 Score=25.87 Aligned_cols=19 Identities=32% Similarity=0.557 Sum_probs=15.0
Q ss_pred hcCCHHHHHHHHHHHHHHH
Q 029243 168 EGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 168 eeLtpEEle~L~aEIE~Ek 186 (196)
|.++|+|.+++.+-+|++-
T Consensus 98 Eh~~~~~i~~~~~~~e~~a 116 (141)
T COG5478 98 EHLKPEEIEEIRDRLEDEA 116 (141)
T ss_pred ccCCHHHHHHHHHHHHHHh
Confidence 3578999999988888754
No 95
>TIGR00012 L29 ribosomal protein L29. called L29 in prokaryotic (50S) large subunits and L35 in eukaryotic (60S) large subunits.
Probab=32.49 E-value=51 Score=22.61 Aligned_cols=19 Identities=21% Similarity=0.427 Sum_probs=15.6
Q ss_pred HhcCCHHHHHHHHHHHHHH
Q 029243 167 LEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 167 leeLtpEEle~L~aEIE~E 185 (196)
+.+||.+|+++...|+.+|
T Consensus 2 lr~~s~~EL~~~l~~lr~e 20 (55)
T TIGR00012 2 LREKSKEELAKKLDELKKE 20 (55)
T ss_pred HhhCCHHHHHHHHHHHHHH
Confidence 4678999999988888775
No 96
>PF01099 Uteroglobin: Uteroglobin family; InterPro: IPR006038 Uteroglobin (or blastokinin) is a mammalian steroid-inducible secreted protein originally isolated from the uterus of rabbits during early pregnancy. The mucosal epithelia of several organs that communicate with the external environment express uteroglobin. Its tissue-specific expression is regulated by steroid hormones, and is augmented in the uterus by non-steroidal prolactin. Uteroglobin may be a multi-functional protein with anti-inflammatory/immunomodulatory properties, acting to inhibit phospholipase A2 activity, and binding to (and possibly sequestering) several hydrophobic ligands such as progesterone, retinols, polychlorinated biphenyls, phospholipids and prostaglandins. In addition, uteroglobin has anti-chemotactic, anti-allergic, anti-tumourigenic and embryo growth-stimulatory properties. Uteroglobin may have a homeostatic role against oxidative damage, inflammation, autoimmunity and cancer [, , , ]. Uteroglobin consists of a disulphide-linked dimer of two identical polypeptides, each polypeptide being composed of four helices. It is a member of the secretoglobin superfamily. This entry represents uteroglobin proteins from several mammalian species, as well as other members of the secretoglobin superfamily, such as lipophilin B [], prostatic steroid-binding protein [], mammaglobin [], and the related allergen Fel d 1 (Felis domesticus allergen 1) [].; GO: 0005488 binding, 0005576 extracellular region; PDB: 1UTR_B 1CCD_A 1UTG_A 2UTG_A 1ZKR_B 1PUO_B 2EJN_B.
Probab=32.22 E-value=79 Score=22.18 Aligned_cols=34 Identities=18% Similarity=0.446 Sum_probs=25.0
Q ss_pred cHHHHHHHHH--------HHHHHHHHhcCCHHHHHHHHHHHH
Q 029243 150 TYAQQLRDYE--------DKVMQKRLEGLTEAELEALIEQVE 183 (196)
Q Consensus 150 TY~qQ~kdYE--------~avLQKRleeLtpEEle~L~aEIE 183 (196)
.|..+++.|. ...+.+-++.||+|.++....=++
T Consensus 18 ~Y~~~l~~y~~~~~~~~A~~~lK~C~d~ls~e~~~~i~~~l~ 59 (67)
T PF01099_consen 18 EYKESLQKYNPPPEAVEAKLELKQCVDKLSNETRENILKLLE 59 (67)
T ss_dssp HHHHHHHCC---HHHHHHHHHHHHHHTTS-HHHHHHHHHHHH
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 5888999998 446778899999999887665444
No 97
>PF15188 CCDC-167: Coiled-coil domain-containing protein 167
Probab=32.14 E-value=74 Score=24.46 Aligned_cols=38 Identities=24% Similarity=0.277 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhccccCC
Q 029243 156 RDYEDKVMQKRLEGLTEAELEALIEQVEEEKRRLASGE 193 (196)
Q Consensus 156 kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~~~~~~~ 193 (196)
+|-|...-+-|-++||||..+.|.+|.+.-+.....+|
T Consensus 19 ~~le~ve~rL~~~eLs~e~R~~lE~E~~~l~~~l~~~E 56 (85)
T PF15188_consen 19 RRLEAVESRLRRRELSPEARRSLEKELNELKEKLENNE 56 (85)
T ss_pred HHHHHHHHHHcccCCChHHHHHHHHHHHHHHHHhhccH
Confidence 33334444555679999999999988887655554443
No 98
>COG2155 Uncharacterized conserved protein [Function unknown]
Probab=31.92 E-value=29 Score=26.65 Aligned_cols=58 Identities=19% Similarity=0.260 Sum_probs=42.0
Q ss_pred hhhhhhHHHHHHHHHHHHhhHHhCCCcc-----cchhHHHHHHH-hhhhhheeeeeeeeeecCC
Q 029243 91 DVLLIGVGVTVIGYGLKSGLELFGVDPL-----QAGNVVEVIVV-LGLTLGWVSTYIFRVSNKE 148 (196)
Q Consensus 91 DVI~IGlgvfllGg~ly~gLq~~GlD~~-----~AGiwsQ~lLV-lGlvvgWv~SYlfRV~tk~ 148 (196)
-.-.+.+.+.++|++=|...-.+++|=. +...|++++.+ +|+.=.|...++|++..++
T Consensus 5 ~i~~~sllLvIiGalNWGLvG~f~fdLvaalfG~gs~lsri~YiivGlagv~~il~l~k~~~~~ 68 (79)
T COG2155 5 IIRGLSLLLVILGALNWGLVGLFGFDLVAALFGVGSMLTRIIYIIVGLAGVYQILPLFKVCEKK 68 (79)
T ss_pred HHHHHHHHHHHHhhhhhceeeeehhhHHHHHHcCchHHHHHHHHHHHHHHHHHHHHhcchhhcc
Confidence 3446677777888776666667777743 35678888754 5777889999999987764
No 99
>PF10661 EssA: WXG100 protein secretion system (Wss), protein EssA; InterPro: IPR018920 The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria []. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions.
Probab=31.50 E-value=50 Score=27.01 Aligned_cols=18 Identities=28% Similarity=0.126 Sum_probs=13.3
Q ss_pred hhHHHHHHHHHHHHhhHH
Q 029243 95 IGVGVTVIGYGLKSGLEL 112 (196)
Q Consensus 95 IGlgvfllGg~ly~gLq~ 112 (196)
|+..|.++++++|+.++-
T Consensus 125 i~g~ll~i~~giy~~~r~ 142 (145)
T PF10661_consen 125 IGGILLAICGGIYVVLRK 142 (145)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 555666777899998874
No 100
>PRK05179 rpsM 30S ribosomal protein S13; Validated
Probab=30.72 E-value=36 Score=27.08 Aligned_cols=21 Identities=29% Similarity=0.645 Sum_probs=16.8
Q ss_pred HHHHhcCCHHHHHHHHHHHHH
Q 029243 164 QKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 164 QKRleeLtpEEle~L~aEIE~ 184 (196)
.+++.+|+++|+++|.+.|++
T Consensus 42 ~~~~~~L~~~qi~~l~~~i~~ 62 (122)
T PRK05179 42 DTRVKDLTDEELDKIREEIDK 62 (122)
T ss_pred ccccccCCHHHHHHHHHHHHh
Confidence 357788888888888888875
No 101
>PF07256 DUF1435: Protein of unknown function (DUF1435); InterPro: IPR009885 This family consists of several hypothetical Enterobacterial proteins of around 80 residues in length. The function of this family is unknown.
Probab=30.49 E-value=57 Score=24.99 Aligned_cols=47 Identities=26% Similarity=0.378 Sum_probs=28.5
Q ss_pred hHHHHHHHHHHH-HhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHH
Q 029243 96 GVGVTVIGYGLK-SGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYE 159 (196)
Q Consensus 96 GlgvfllGg~ly-~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE 159 (196)
|=|+++-|.+.- +++--.|+|..+.-+++-.++.. -|=||+|+|.|=
T Consensus 9 gWGvllp~~l~~~l~~~~ls~~~~r~~iv~amL~T~-----------------~MLfh~rlRH~l 56 (78)
T PF07256_consen 9 GWGVLLPGALIPLLALANLSFDQWRIVIVVAMLLTL-----------------AMLFHRRLRHFL 56 (78)
T ss_pred chHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH-----------------HHHHhHhhhHHH
Confidence 445555553222 22446677777666655555554 378999999994
No 102
>COG0255 RpmC Ribosomal protein L29 [Translation, ribosomal structure and biogenesis]
Probab=30.49 E-value=64 Score=23.81 Aligned_cols=21 Identities=29% Similarity=0.411 Sum_probs=17.5
Q ss_pred HHHhcCCHHHHHHHHHHHHHH
Q 029243 165 KRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~E 185 (196)
+-|.+||+||+++.+.|+..|
T Consensus 6 ~elR~~s~eeL~~~l~eLK~E 26 (69)
T COG0255 6 KELREKSVEELEEELRELKKE 26 (69)
T ss_pred HHHHhCCHHHHHHHHHHHHHH
Confidence 457789999999999888775
No 103
>PF10742 DUF2555: Protein of unknown function (DUF2555); InterPro: IPR019678 This entry represents conserved proteins found in Cyanobacteria. The function is not known.
Probab=30.46 E-value=45 Score=24.28 Aligned_cols=23 Identities=26% Similarity=0.540 Sum_probs=20.5
Q ss_pred HHHHhcCCHHHHHHHHHHHHHHH
Q 029243 164 QKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 164 QKRleeLtpEEle~L~aEIE~Ek 186 (196)
++.+..+++|+.++|-+-+|++.
T Consensus 3 ~~~~~~~t~~~va~LA~RLE~Dd 25 (57)
T PF10742_consen 3 KEDLAAFTEEDVAKLAKRLEEDD 25 (57)
T ss_pred hhhHhhcCHHHHHHHHHhhhhcc
Confidence 57899999999999999999875
No 104
>PF13239 2TM: 2TM domain
Probab=29.80 E-value=1.6e+02 Score=21.07 Aligned_cols=71 Identities=20% Similarity=0.236 Sum_probs=34.5
Q ss_pred cchhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeee---eeecCCccHHHHHHHHHHHHHH
Q 029243 88 TRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIF---RVSNKEMTYAQQLRDYEDKVMQ 164 (196)
Q Consensus 88 tR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlf---RV~tk~MTY~qQ~kdYE~avLQ 164 (196)
.|+.-..+=+.+++++-++..++..+.-.... |+-- .+++|....++ +|+-.++.+.+ +.|+..+|
T Consensus 10 ~~~k~fy~Hl~~y~ivn~~l~~in~~~~~~~~---W~~~-----~~~~Wgi~L~~h~~~vf~~~~~~~~---~We~rki~ 78 (83)
T PF13239_consen 10 KRRKGFYRHLAVYLIVNSFLIFINLFTGPGYF---WPLW-----PILGWGIGLAIHALKVFGSKYFFGK---DWEERKIQ 78 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCc---hHHH-----HHHHHHHHHHHHHHHHHcccccccc---hHHHHHHH
Confidence 34455566677777776666665553311111 3321 11223222222 23323344433 77788888
Q ss_pred HHHhc
Q 029243 165 KRLEG 169 (196)
Q Consensus 165 KRlee 169 (196)
|.+++
T Consensus 79 k~m~k 83 (83)
T PF13239_consen 79 KLMEK 83 (83)
T ss_pred HHhcC
Confidence 87763
No 105
>PF03047 ComC: COMC family; InterPro: IPR004288 Competence is the ability of a cell to take up exogenous DNA from its environment, resulting in transformation. It is widespread among bacteria and is probably an important mechanism for the horizontal transfer of genes. Cells that take up DNA inevitably acquire the nucleotides the DNA consists of, and, because nucleotides are needed for DNA and RNA synthesis and are expensive to synthesise, these may make a significant contribution to the cell's energy budget []. The lateral gene transfer caused by competence also contributes to the genetic diversity that makes evolution possible. DNA usually becomes available by the death and lysis of other cells. Competent bacteria use components of extracellular filaments called type 4 pili to create pores in their membranes and pull DNA through the pores into the cytoplasm. This process, including the development of competence and the expression of the uptake machinery, is regulated in response to cell-cell signalling and/or nutritional conditions []. This family consists of streptococcal competence stimulating peptide precursors, which are generally up to 50 amino acid residues long. In all the members of this family, the leader sequence is cleaved after two conserved glycine residues; thus the leader sequence is of the double- glycine type []. Competence stimulating peptides (CSP) are small (less than 25 amino acid residues) cationic peptides. The N-terminal amino acid residue is negatively charged, either glutamate or aspartate. The C-terminal end is positively charged. The third residue is also positively charged: a highly conserved arginine []. Some COMC proteins and their precursors (not included in this family) do not fully follow the above description. Functionally, CSP act as pheromones, stimulating competence for genetic transformation in streptococci. In streptococci, the (CSP mediated) competence response requires exponential cell growth at a critical density, a relatively simple requirement when compared to the stationary-phase requirement of Haemophilus, or the late-logarithmic- phase of Bacillus []. All bacteria induced to competence by a particular CSP are said to belong to the same pherotype, because each CSP is recognised by a specific receptor (the signalling domain of a histidine kinase ComD). Pherotypes are not necessarily species-specific. In addition, an organism may change pherotype. There are two possible mechanisms for pherotype switching: horizontal gene transfer, and accumulation of point mutations. The biological significance of pherotypes and pherotype switching is not definitively determined. Pherotype switching occurs frequently enough in naturally competent streptococci to suggest that it may be an important contributor to genetic exchange between different bacterial species []. This entry also includes proteins that form bacteriocin-like propetides with a glycine-glycine cleavage site. The bacteriocin is initially formed as a pre-propeptide and upon cleavage at the glycine-glycine cleavage site, a leader peptide and the propeptide would be formed. The propeptide then undergoes posttranslational modification before becoming functional [].; GO: 0005186 pheromone activity; PDB: 2I2J_A 2I2H_A 2A1C_A.
Probab=29.71 E-value=18 Score=23.35 Aligned_cols=15 Identities=40% Similarity=0.618 Sum_probs=0.0
Q ss_pred HHHhcCCHHHHHHHH
Q 029243 165 KRLEGLTEAELEALI 179 (196)
Q Consensus 165 KRleeLtpEEle~L~ 179 (196)
+.|++|+++||+...
T Consensus 9 ~~F~~lt~~eL~~I~ 23 (32)
T PF03047_consen 9 EQFEELTEEELQEIQ 23 (32)
T ss_dssp ---------------
T ss_pred HHHhcCCHHHHhhcc
Confidence 579999999998753
No 106
>cd07321 Extradiol_Dioxygenase_3A_like Subunit A of Class III extradiol dioxygenases. Extradiol dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. There are two major groups of dioxygenases according to the cleavage site of the aromatic ring. Intradiol enzymes cleave the aromatic ring between two hydroxyl groups, whereas extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon. Extradiol dioxygenases can be divided into three classes. Class I and II enzymes are evolutionary related and show sequence similarity, with the two domain class II enzymes evolving from the class I enzyme through gene duplication. Class III enzymes are different in sequence and structure and usually have two subunits, designated A and B, which form a tetramer composed of two copies of each subunit. This model represents subunit A of c
Probab=29.40 E-value=85 Score=22.96 Aligned_cols=25 Identities=28% Similarity=0.452 Sum_probs=18.3
Q ss_pred HHHHHH---HHHHHHHHhcCCHHHHHHHHH
Q 029243 154 QLRDYE---DKVMQKRLEGLTEAELEALIE 180 (196)
Q Consensus 154 Q~kdYE---~avLQKRleeLtpEEle~L~a 180 (196)
-|++|. ++++++- .|||||.++|.+
T Consensus 18 ~re~f~~dp~a~~~~~--~Lt~eE~~al~~ 45 (77)
T cd07321 18 VKERFKADPEAVLAEY--GLTPEEKAALLA 45 (77)
T ss_pred HHHHHHhCHHHHHHHc--CCCHHHHHHHHc
Confidence 355555 6777765 899999999864
No 107
>PHA02698 hypothetical protein; Provisional
Probab=29.32 E-value=77 Score=24.80 Aligned_cols=27 Identities=26% Similarity=0.446 Sum_probs=23.4
Q ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 159 EDKVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 159 E~avLQKRleeLtpEEle~L~aEIE~E 185 (196)
|+-.++.-+.-|+-||.+.|+.|+|+-
T Consensus 55 ediq~ksElqLLsqEEMdELl~Eledl 81 (89)
T PHA02698 55 EDIQYKSELQLLSQEEMDELLVELEDL 81 (89)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 466778888899999999999999874
No 108
>TIGR00439 ftsX putative protein insertion permease FtsX. FtsX is an integral membrane protein encoded in the same operon as signal recognition particle docking protein FtsY and FtsE. It belongs to a family of predicted permeases and may play a role in the insertion of proteins required for potassium transport, cell division, and other activities. FtsE is a hydrophilic nucleotide-binding protein that associates with the inner membrane by means of association with FtsX.
Probab=29.27 E-value=54 Score=29.17 Aligned_cols=16 Identities=19% Similarity=0.538 Sum_probs=12.5
Q ss_pred HHHHhhhhhheeeeee
Q 029243 126 VIVVLGLTLGWVSTYI 141 (196)
Q Consensus 126 ~lLVlGlvvgWv~SYl 141 (196)
+++.+|+++||++|++
T Consensus 282 ~l~~~g~~lg~lgs~~ 297 (309)
T TIGR00439 282 LLLGFCIALGVVGAWL 297 (309)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4556678899999985
No 109
>CHL00137 rps13 ribosomal protein S13; Validated
Probab=29.15 E-value=39 Score=26.98 Aligned_cols=21 Identities=38% Similarity=0.531 Sum_probs=16.3
Q ss_pred HHHHhcCCHHHHHHHHHHHHH
Q 029243 164 QKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 164 QKRleeLtpEEle~L~aEIE~ 184 (196)
.+|+.+|+++|+++|.+.|++
T Consensus 42 ~~~~~~Lt~~qi~~l~~~i~~ 62 (122)
T CHL00137 42 DIRTKDLTDEQISALREIIEE 62 (122)
T ss_pred CcCcccCCHHHHHHHHHHHHH
Confidence 356778888888888888864
No 110
>CHL00154 rpl29 ribosomal protein L29; Validated
Probab=29.14 E-value=70 Score=23.18 Aligned_cols=21 Identities=24% Similarity=0.330 Sum_probs=16.8
Q ss_pred HHhcCCHHHHHHHHHHHHHHH
Q 029243 166 RLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 166 RleeLtpEEle~L~aEIE~Ek 186 (196)
-+.+||++|+++-+.|+.+|-
T Consensus 8 elr~ls~~eL~~~l~elk~el 28 (67)
T CHL00154 8 DIIDLTDSEISEEIIKTKKEL 28 (67)
T ss_pred HHHhCCHHHHHHHHHHHHHHH
Confidence 467899999998888887754
No 111
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=29.03 E-value=63 Score=26.87 Aligned_cols=21 Identities=14% Similarity=0.094 Sum_probs=17.8
Q ss_pred HHHHhcCCHHHHHHHHHHHHH
Q 029243 164 QKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 164 QKRleeLtpEEle~L~aEIE~ 184 (196)
++++++||||.++++++..++
T Consensus 213 ~~~~~~L~~e~q~~i~~a~~~ 233 (257)
T TIGR00787 213 KAFWKSLPPDLQAVVKEAAKE 233 (257)
T ss_pred HHHHhcCCHHHHHHHHHHHHH
Confidence 678999999999999877655
No 112
>cd06171 Sigma70_r4 Sigma70, region (SR) 4 refers to the most C-terminal of four conserved domains found in Escherichia coli (Ec) sigma70, the main housekeeping sigma, and related sigma-factors (SFs). A SF is a dissociable subunit of RNA polymerase, it directs bacterial or plastid core RNA polymerase to specific promoter elements located upstream of transcription initiation points. The SR4 of Ec sigma70 and other essential primary SFs contact promoter sequences located 35 base-pairs upstream of the initiation point, recognizing a 6-base-pair -35 consensus TTGACA. Sigma70 related SFs also include SFs which are dispensable for bacterial cell growth for example Ec sigmaS, SFs which activate regulons in response to a specific signal for example heat-shock Ec sigmaH, and a group of SFs which includes the extracytoplasmic function (ECF) SFs and is typified by Ec sigmaE which contains SR2 and -4 only. ECF SFs direct the transcription of genes that regulate various responses including periplas
Probab=28.63 E-value=78 Score=18.54 Aligned_cols=18 Identities=28% Similarity=0.492 Sum_probs=11.1
Q ss_pred HHHHHHhcCCHHHHHHHH
Q 029243 162 VMQKRLEGLTEAELEALI 179 (196)
Q Consensus 162 vLQKRleeLtpEEle~L~ 179 (196)
.+++.++.|||++.+.+.
T Consensus 3 ~~~~~~~~l~~~~~~~~~ 20 (55)
T cd06171 3 RLEEALDKLPEREREVIL 20 (55)
T ss_pred HHHHHHHhCCHHHHHHHH
Confidence 456666667666665553
No 113
>PF01545 Cation_efflux: Cation efflux family; InterPro: IPR002524 Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are considered to be efflux pumps that remove these ions from cells [, ], however others are implicated in ion uptake []. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3BYP_B 3BYR_A 3H90_B 2QFI_B 2ZZT_A.
Probab=28.42 E-value=86 Score=25.94 Aligned_cols=31 Identities=35% Similarity=0.429 Sum_probs=18.3
Q ss_pred CCCCccccchhhhhh-hHHHHHHHHHHHHhhH
Q 029243 81 TETPFGYTRKDVLLI-GVGVTVIGYGLKSGLE 111 (196)
Q Consensus 81 ~~~pfgMtR~DVI~I-GlgvfllGg~ly~gLq 111 (196)
++-|||+.|...+.. ..++++++.+++....
T Consensus 57 ~~~pfG~~r~e~l~~~~~~~~l~~~~~~~~~~ 88 (284)
T PF01545_consen 57 KRYPFGYGRLEPLAALIVSILLIFLGLFLIVE 88 (284)
T ss_dssp SSSSSSSTTHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cccccchhhhhhHhhhhhhhhHhhhHHHHHHH
Confidence 568999999877653 3333444444444443
No 114
>PF02096 60KD_IMP: 60Kd inner membrane protein; InterPro: IPR001708 This family of proteins is required for the insertion of integral membrane proteins into cellular membranes. Many of these integral membrane proteins are associated with respiratory chain complexes, for example a large number of members of this family play an essential role in the activity and assembly of cytochrome c oxidase. Stage III sporulation protein J (SP3J) is a probable lipoprotein, rich in basic and hydrophobic amino acids. Mutations in the protein abolish the transcription of prespore-specific genes transcribed by the sigma G form of RNA polymerase []. SP3J could be involved in a signal transduction pathway coupling gene expression in the prespore to events in the mother cell, or it may be necessary for essential metabolic interactions between the two cells []. The protein shows a high degree of similarity to Bacillus subtilis YQJG, to yeast OXA1 and also to bacterial 60 kDa inner-membrane proteins [, , , ]. ; GO: 0051205 protein insertion into membrane, 0016021 integral to membrane
Probab=28.30 E-value=44 Score=26.92 Aligned_cols=44 Identities=14% Similarity=0.241 Sum_probs=27.7
Q ss_pred eeeeecCCccHHHHHHHHHHHHHHHHHhcCCH---HHHHHHHHHHHH
Q 029243 141 IFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTE---AELEALIEQVEE 184 (196)
Q Consensus 141 lfRV~tk~MTY~qQ~kdYE~avLQKRleeLtp---EEle~L~aEIE~ 184 (196)
+.|++.=-.++.+|+..-+.+.++..++++.+ ++.+++++|+.+
T Consensus 13 ~vR~~~~Pl~i~~~~~~~k~~~~~P~l~~i~~k~~~~~~~~~~~~~~ 59 (198)
T PF02096_consen 13 LVRLILLPLSIKQQRSSAKMQELQPELKEIQEKYKEDQQKMQQEMQK 59 (198)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 46777777888888888777777766665433 334444444433
No 115
>PF02697 DUF217: Uncharacterized ACR, COG1753; InterPro: IPR003847 This entry is represented by Natrialba phage PhiCh1, Orf96. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=28.15 E-value=2e+02 Score=21.06 Aligned_cols=42 Identities=10% Similarity=0.321 Sum_probs=34.8
Q ss_pred cCCccHHHHHHHH-----HHHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 146 NKEMTYAQQLRDY-----EDKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 146 tk~MTY~qQ~kdY-----E~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
..+++|..=..++ ....+.+-|..|+++|-+.+.++++++..
T Consensus 19 ~~~eSFSdvI~rli~~~~~~~~l~~~~g~l~deea~~~~~~i~e~r~ 65 (71)
T PF02697_consen 19 REDESFSDVIERLIEKEKKRRDLMDYFGILSDEEADEMEKDIKEERE 65 (71)
T ss_pred cCCCCHHHHHHHHHhcccchhHHHHHhccCChhhHHHHHHHHHHHHH
Confidence 4678888888777 35889999999999999999999987643
No 116
>PF08285 DPM3: Dolichol-phosphate mannosyltransferase subunit 3 (DPM3); InterPro: IPR013174 This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex [].
Probab=28.02 E-value=47 Score=25.40 Aligned_cols=84 Identities=23% Similarity=0.292 Sum_probs=46.5
Q ss_pred ccchhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhh-hhheeeee-eeeeecCCccHHHHHHHHHHHHHH
Q 029243 87 YTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGL-TLGWVSTY-IFRVSNKEMTYAQQLRDYEDKVMQ 164 (196)
Q Consensus 87 MtR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGl-vvgWv~SY-lfRV~tk~MTY~qQ~kdYE~avLQ 164 (196)
|||.-=.+..+.+ ++ ++|..+- .|.-+.....+.+++..+=+ .+.=.++| +++|.-+=|||+.
T Consensus 1 Mtr~~~~l~~~~~--l~-~~w~~l~-~~~~~~~~~~~~~ii~~lP~~~Lv~fG~Ysl~~lgy~v~tFnD----------- 65 (91)
T PF08285_consen 1 MTRLQQWLSALLL--LS-ALWLALL-LGLLPLPPEPQQEIIPYLPFYALVSFGCYSLFTLGYGVATFND----------- 65 (91)
T ss_pred CchHHHHHHHHHH--HH-HHHHHHH-HccCCCCchhHHHHHHHhhHHHHHHHHHHHHHHHHHhhhccCC-----------
Confidence 6666544444333 22 4454433 33333555566666654321 00111233 4566667777764
Q ss_pred HHHhcCCHHHHHHHHHHHHHHHhcccc
Q 029243 165 KRLEGLTEAELEALIEQVEEEKRRLAS 191 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~Ek~~~~~ 191 (196)
-||+-+.||+||+|-|.+.+.
T Consensus 66 ------cpeA~~eL~~eI~eAK~dLr~ 86 (91)
T PF08285_consen 66 ------CPEAAKELQKEIKEAKADLRK 86 (91)
T ss_pred ------CHHHHHHHHHHHHHHHHHHHH
Confidence 577788899999998877653
No 117
>PF00831 Ribosomal_L29: Ribosomal L29 protein; InterPro: IPR001854 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L29 is one of the proteins from the large ribosomal subunit. L29 belongs to a family of ribosomal proteins of 63 to 138 amino-acid residues which, on the basis of sequence similarities [], groups: Red algal L29. Bacterial L29. Mammalian L35 Caenorhabditis elegans L35 (ZK652.4). Yeast L35. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1VSP_W 3MS1_Y 3MRZ_Y 3F1H_2 3PYT_Y 3PYO_Y 3D5D_2 3D5B_2 3PYR_Y 1VSA_W ....
Probab=27.91 E-value=54 Score=22.66 Aligned_cols=20 Identities=30% Similarity=0.589 Sum_probs=15.8
Q ss_pred HHhcCCHHHHHHHHHHHHHH
Q 029243 166 RLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 166 RleeLtpEEle~L~aEIE~E 185 (196)
-+.+||.+|+++.+.|+..|
T Consensus 3 elr~ls~~eL~~~l~elk~e 22 (58)
T PF00831_consen 3 ELRELSDEELQEKLEELKKE 22 (58)
T ss_dssp HHCHSHHHHHHHHHHHHHHH
T ss_pred HHHhCCHHHHHHHHHHHHHH
Confidence 46788999988888888765
No 118
>PF13687 DUF4153: Domain of unknown function (DUF4153)
Probab=27.88 E-value=46 Score=27.63 Aligned_cols=67 Identities=15% Similarity=0.133 Sum_probs=36.8
Q ss_pred CccccchhhhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHH
Q 029243 84 PFGYTRKDVLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYED 160 (196)
Q Consensus 84 pfgMtR~DVI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~ 160 (196)
.+|-|-.-+..+.+++++++..+++.+.. ...| -+.+++ ..+..++++-= +--.+.+.+.|.+|+|+
T Consensus 150 qYGlT~~R~~~~~~~~~~~~~~l~~~~r~-------~~~~-~i~~~~-~~~~l~~~~~P-~~~~~is~~sQ~~Rl~~ 216 (217)
T PF13687_consen 150 QYGLTPNRYYALLLAIFLLIYALYYIFRK-------RQRW-YIPLSL-AILLLLSSFGP-LDPFRISVNSQLARLEK 216 (217)
T ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHHHHc-------cccc-HHHHHH-HHHHHHHHhCC-CCHHHHhHHHHHHHHhc
Confidence 46666666777777777777777777721 1233 111111 11222222222 55677788888887763
No 119
>PF09682 Holin_LLH: Phage holin protein (Holin_LLH); InterPro: IPR010026 This entry represents the Bacteriophage LL-H, Orf107, holin protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This protein family represent one of a large number of mutually dissimilar families of phage holins. It is thought that the temporal precision of holin-mediated lysis may occur through the build-up of a holin oligomer which causes the lysis [].
Probab=27.88 E-value=73 Score=24.29 Aligned_cols=26 Identities=12% Similarity=0.215 Sum_probs=12.8
Q ss_pred hhheeeeeeeeeecCCccHHHHHHHH
Q 029243 133 TLGWVSTYIFRVSNKEMTYAQQLRDY 158 (196)
Q Consensus 133 vvgWv~SYlfRV~tk~MTY~qQ~kdY 158 (196)
+++|++.|+.-...++++=.++.+-+
T Consensus 20 i~~~~~~~~~~~l~~k~~~e~~~~~~ 45 (108)
T PF09682_consen 20 IAGIIVKYVIKYLKKKAGGEKLVKIL 45 (108)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHH
Confidence 45555555555555555533333333
No 120
>TIGR03631 bact_S13 30S ribosomal protein S13. This model describes bacterial ribosomal protein S13, to the exclusion of the homologous archaeal S13P and eukaryotic ribosomal protein S18. This model identifies some (but not all) instances of chloroplast and mitochondrial S13, which is of bacterial type.
Probab=27.77 E-value=44 Score=26.22 Aligned_cols=20 Identities=50% Similarity=0.740 Sum_probs=15.4
Q ss_pred HHHhcCCHHHHHHHHHHHHH
Q 029243 165 KRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~ 184 (196)
+|+.+|+++|+++|.+.|+.
T Consensus 41 ~~~~~L~~~qi~~l~~~l~~ 60 (113)
T TIGR03631 41 KRVKDLTEEELNAIREEIEA 60 (113)
T ss_pred cccccCCHHHHHHHHHHHHh
Confidence 56778888888888888853
No 121
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=27.76 E-value=54 Score=26.65 Aligned_cols=13 Identities=31% Similarity=0.544 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHH
Q 029243 172 EAELEALIEQVEE 184 (196)
Q Consensus 172 pEEle~L~aEIE~ 184 (196)
.+|+++|.+|+++
T Consensus 160 ~~ei~~lk~el~~ 172 (192)
T PF05529_consen 160 SEEIEKLKKELEK 172 (192)
T ss_pred HHHHHHHHHHHHH
Confidence 4666666666665
No 122
>PF02899 Phage_int_SAM_1: Phage integrase, N-terminal SAM-like domain; InterPro: IPR004107 Proteins containing this domain cleave DNA substrates by a series of staggered cuts, during which the protein becomes covalently linked to the DNA through a catalytic tyrosine residue at the carboxy end of the alignment [, ]. The phage integrase N-terminal SAM-like domain is almost always found with the signature that defines the phage integrase family (see IPR002104 from INTERPRO).; GO: 0003677 DNA binding, 0015074 DNA integration; PDB: 1Z1G_B 1Z19_A 1Z1B_A 2OXO_A 1P7D_B 3NRW_A 1A0P_A.
Probab=27.74 E-value=75 Score=21.21 Aligned_cols=37 Identities=22% Similarity=0.415 Sum_probs=28.0
Q ss_pred cHHHHHHHHHHHHHH---HHHhcCCHHHHHHHHHHHHHHH
Q 029243 150 TYAQQLRDYEDKVMQ---KRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 150 TY~qQ~kdYE~avLQ---KRleeLtpEEle~L~aEIE~Ek 186 (196)
+|..++++|.+-.-+ .-++++++++++.-.+++.++.
T Consensus 21 ~Y~~~l~~f~~~~~~~~~~~~~~i~~~~v~~f~~~~~~~~ 60 (84)
T PF02899_consen 21 SYRRDLRRFIRWLEEHGIIDWEDITEEDVRDFLEYLAKEG 60 (84)
T ss_dssp HHHHHHHHHHHHHHHTTS-CGGG--HHHHHHHHHHHHCTT
T ss_pred HHHHHHHHHHHhhhhhhhhhhhhhhhHHHHHHHHHHHccC
Confidence 588889999876666 5789999999999999887753
No 123
>PLN02435 probable UDP-N-acetylglucosamine pyrophosphorylase
Probab=27.68 E-value=76 Score=30.88 Aligned_cols=31 Identities=23% Similarity=0.406 Sum_probs=25.7
Q ss_pred HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 154 QLRDYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 154 Q~kdYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
.+..+...-|=+-+++|+++|+++|.++|++
T Consensus 28 ~l~~~gQ~HLl~~w~~ls~~e~~~L~~qL~~ 58 (493)
T PLN02435 28 RLKDYGQEDAFALWDELSPEERDLLVRDIES 58 (493)
T ss_pred HHHHcChHHHHHhhhhCCHHHHHHHHHHHHh
Confidence 3444557788889999999999999999986
No 124
>PF06738 DUF1212: Protein of unknown function (DUF1212); InterPro: IPR010619 This entry represents a predicted domain found within a number of hypothetical proteins of unknown function found in eukaryotes, bacteria and archaea. Some of these sequences are predicted to be membrane proteins.
Probab=27.65 E-value=1.6e+02 Score=23.40 Aligned_cols=77 Identities=13% Similarity=0.067 Sum_probs=45.9
Q ss_pred hHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 110 LELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 110 Lq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
-++.|+|..+.-..-..+++----=....|.+.|+-.+..- +.+..+..+-.-+=.-+.+++||..+.++||+.++.
T Consensus 23 a~~~G~~~~~~~v~~t~i~is~~~~~~~~t~~~~v~~~~~n-l~~l~~v~~l~~~~~~~~~~~~ea~~~L~~I~~~~~ 99 (193)
T PF06738_consen 23 ARALGLEDVEVFVTPTGIFISFDDDGEPRTRVRRVPPRGVN-LDKLAAVNRLSRRIVAGQLSLEEAIERLDEIDREPP 99 (193)
T ss_pred HHHcCCCeeEEEEecCEEEEEEecCCcceEEEEEeCCCCcC-HHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhCCC
Confidence 44667764444333333222200126677889999555444 345555444333333399999999999999988763
No 125
>PRK14549 50S ribosomal protein L29P; Provisional
Probab=27.58 E-value=80 Score=22.82 Aligned_cols=22 Identities=18% Similarity=0.296 Sum_probs=17.5
Q ss_pred HHHhcCCHHHHHHHHHHHHHHH
Q 029243 165 KRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~Ek 186 (196)
+-+.+||++||.+.+.|+.+|-
T Consensus 7 ~elr~ls~~eL~~~l~elk~eL 28 (69)
T PRK14549 7 SEIREMSPEEREEKLEELKLEL 28 (69)
T ss_pred HHHHhCCHHHHHHHHHHHHHHH
Confidence 3477899999998888887754
No 126
>COG4858 Uncharacterized membrane-bound protein conserved in bacteria [Function unknown]
Probab=27.56 E-value=49 Score=29.61 Aligned_cols=38 Identities=16% Similarity=0.199 Sum_probs=27.5
Q ss_pred HHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeee
Q 029243 107 KSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRV 144 (196)
Q Consensus 107 y~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV 144 (196)
||..+.-|=.++.-|.|--++..++.+..|+.-|++-+
T Consensus 146 ~y~yr~~ad~sqr~~~~K~~lv~~~sm~lWi~v~i~t~ 183 (226)
T COG4858 146 YYAYRMRADNSQRPGTWKYLLVAVLSMLLWIAVMIATV 183 (226)
T ss_pred HHHHHhhcccccCCchHHHHHHHHHHHHHHHHHHHHHh
Confidence 34466778556777888777777777899999886654
No 127
>COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism]
Probab=27.53 E-value=77 Score=28.96 Aligned_cols=74 Identities=24% Similarity=0.429 Sum_probs=42.2
Q ss_pred CCccccchhhhhhhH-HHHHHHHHHH---HhhH-HhCCCcccchhHHHHHHHhhhhhheeeeeeeeee-----cCCccHH
Q 029243 83 TPFGYTRKDVLLIGV-GVTVIGYGLK---SGLE-LFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVS-----NKEMTYA 152 (196)
Q Consensus 83 ~pfgMtR~DVI~IGl-gvfllGg~ly---~gLq-~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~-----tk~MTY~ 152 (196)
.+|||.|..++.--+ +++++++.+| -..+ +.-=.+.+. ...=.+-++|+++-+++.++++-- |.+=.|.
T Consensus 82 ~TfGy~R~eiLaa~~nav~Li~~s~~I~~EAi~R~~~P~~i~~-~~ml~va~~GL~vN~~~a~ll~~~~~~~lN~r~a~L 160 (296)
T COG1230 82 FTFGYKRLEILAAFLNALLLIVVSLLILWEAIQRLLAPPPIHY-SGMLVVAIIGLVVNLVSALLLHKGHEENLNMRGAYL 160 (296)
T ss_pred CCccHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc-cchHHHHHHHHHHHHHHHHHhhCCCcccchHHHHHH
Confidence 789999999886422 3333333333 3344 332333333 334445566778888888888874 3333455
Q ss_pred HHHHH
Q 029243 153 QQLRD 157 (196)
Q Consensus 153 qQ~kd 157 (196)
.=+.|
T Consensus 161 Hvl~D 165 (296)
T COG1230 161 HVLGD 165 (296)
T ss_pred HHHHH
Confidence 55444
No 128
>PF10061 DUF2299: Uncharacterized conserved protein (DUF2299); InterPro: IPR018747 Members of this family of hypothetical bacterial proteins have no known function. ; PDB: 3CXJ_D.
Probab=27.46 E-value=68 Score=25.98 Aligned_cols=26 Identities=23% Similarity=0.294 Sum_probs=20.7
Q ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHH
Q 029243 161 KVMQKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 161 avLQKRleeLtpEEle~L~aEIE~Ek 186 (196)
..=|++|.+|+++|.++++.++-.+=
T Consensus 58 ~~H~~~l~~l~~~eR~~fl~~i~~~l 83 (138)
T PF10061_consen 58 PEHQRALRSLKPEEREEFLWEIKREL 83 (138)
T ss_dssp HHHHHHHHHS-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 34578999999999999999997653
No 129
>TIGR02408 ectoine_ThpD ectoine hydroxylase. Both ectoine and hydroxyectoine are compatible solvents that serve as protectants against osmotic and thermal stresses. A number of genomes synthesize ectoine. This enzyme allows conversion of ectoine to hydroxyectoine, which may be more effective for some purposes, and is found in a subset of ectoine-producing organisms.
Probab=27.15 E-value=32 Score=29.58 Aligned_cols=35 Identities=29% Similarity=0.363 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHH
Q 029243 152 AQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 152 ~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek 186 (196)
-+|++.|++.=--.=-.-+++||+++|.+++++--
T Consensus 19 ~eqi~~f~~dGyvvl~~vls~eev~~lr~~i~~~~ 53 (277)
T TIGR02408 19 AKQLQSYERDGFLLLENLFSDDEVAALLAEVERMT 53 (277)
T ss_pred HHHHHHHHHCCEEECcccCCHHHHHHHHHHHHHHH
Confidence 47888888543222234689999999999998853
No 130
>PRK00461 rpmC 50S ribosomal protein L29; Reviewed
Probab=27.05 E-value=76 Score=24.27 Aligned_cols=22 Identities=32% Similarity=0.392 Sum_probs=17.5
Q ss_pred HHHhcCCHHHHHHHHHHHHHHH
Q 029243 165 KRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 165 KRleeLtpEEle~L~aEIE~Ek 186 (196)
+-+.+||++|+++.+.|+.+|-
T Consensus 3 ~elR~lS~eEL~e~L~elkkEL 24 (87)
T PRK00461 3 KELRKKSVEELEKLVIELKAEL 24 (87)
T ss_pred HHHHhCCHHHHHHHHHHHHHHH
Confidence 3578899999998888887753
No 131
>PF09012 FeoC: FeoC like transcriptional regulator; InterPro: IPR015102 This entry contains several transcriptional regulators, including FeoC, which contain a HTH motif. FeoC acts as a [Fe-S] dependent transcriptional repressor []. ; PDB: 1XN7_A 2K02_A.
Probab=26.61 E-value=36 Score=23.53 Aligned_cols=33 Identities=15% Similarity=0.294 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 151 YAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 151 Y~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
|.+++..+--+.|.++|+ ++|+.++.+++.+++
T Consensus 8 ~l~~~~~~S~~eLa~~~~-~s~~~ve~mL~~l~~ 40 (69)
T PF09012_consen 8 YLRERGRVSLAELAREFG-ISPEAVEAMLEQLIR 40 (69)
T ss_dssp HHHHS-SEEHHHHHHHTT---HHHHHHHHHHHHC
T ss_pred HHHHcCCcCHHHHHHHHC-cCHHHHHHHHHHHHH
Confidence 444444444788999985 899999999999876
No 132
>PF14261 DUF4351: Domain of unknown function (DUF4351)
Probab=26.56 E-value=97 Score=21.73 Aligned_cols=20 Identities=40% Similarity=0.562 Sum_probs=9.6
Q ss_pred HHHHHHhcCCHHHHHHHHHH
Q 029243 162 VMQKRLEGLTEAELEALIEQ 181 (196)
Q Consensus 162 vLQKRleeLtpEEle~L~aE 181 (196)
.++.|++.|+.|.++.|-+.
T Consensus 26 ~~~~~I~~l~~eqLE~l~e~ 45 (59)
T PF14261_consen 26 EIQERIQQLSLEQLEALAEA 45 (59)
T ss_pred HHHHHHHcCCHHHHHHHHHH
Confidence 33444555555555555443
No 133
>PF11326 DUF3128: Protein of unknown function (DUF3128); InterPro: IPR021475 This eukaryotic family of proteins has no known function.
Probab=26.56 E-value=1.6e+02 Score=21.55 Aligned_cols=45 Identities=22% Similarity=0.294 Sum_probs=31.8
Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhccccCCCC
Q 029243 151 YAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKRRLASGEQV 195 (196)
Q Consensus 151 Y~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~~~~~~~~~ 195 (196)
=.+|++||.+-.--|...+..+++.+....+-+.++.....-+.|
T Consensus 36 C~~~~~df~~C~~~k~~~~~~~~~~~e~~~~~~~~~~~~~~sedV 80 (84)
T PF11326_consen 36 CSQWWEDFKFCLRWKSKPEKRPEEVQEILKKREKERRKQGSSEDV 80 (84)
T ss_pred HHHHHHHHHHHHHhccCCccCHHHHHHHHHHhHHHHhcCCCcCcc
Confidence 568899999877666654448999988888877776444444443
No 134
>PRK10878 hypothetical protein; Provisional
Probab=26.52 E-value=52 Score=24.28 Aligned_cols=27 Identities=15% Similarity=0.315 Sum_probs=21.8
Q ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 161 KVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 161 avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
..+++++++||++|++...+=++.+.+
T Consensus 10 ~F~~~~~~~l~~~e~~~Fe~LL~~~D~ 36 (72)
T PRK10878 10 PFFEHEYDSLSDDEKRIFIRLLECDDP 36 (72)
T ss_pred HHHHHHHhhCCHHHHHHHHHHHcCCCH
Confidence 457889999999999988877776544
No 135
>KOG0182 consensus 20S proteasome, regulatory subunit alpha type PSMA6/SCL1 [Posttranslational modification, protein turnover, chaperones]
Probab=26.38 E-value=1.6e+02 Score=26.78 Aligned_cols=50 Identities=16% Similarity=0.146 Sum_probs=38.7
Q ss_pred hheeeeeeeeee----------cCCccHHHHHHHHHHHHHHHHHh---cCCHHHHHHHHHHHHHH
Q 029243 134 LGWVSTYIFRVS----------NKEMTYAQQLRDYEDKVMQKRLE---GLTEAELEALIEQVEEE 185 (196)
Q Consensus 134 vgWv~SYlfRV~----------tk~MTY~qQ~kdYE~avLQKRle---eLtpEEle~L~aEIE~E 185 (196)
..=+.|+|||+. ..+--|..||.+||.+.. |+. |||++=|.+-++++.|-
T Consensus 62 d~~tvt~~f~itk~ig~v~tG~~aDar~~v~rar~eAa~~--~yk~Gyemp~DiL~k~~Ad~~Qv 124 (246)
T KOG0182|consen 62 DSSTVTHLFRITKKIGCVITGMIADARSQVQRARYEAAEF--RYKYGYEMPCDILAKRMADKSQV 124 (246)
T ss_pred ccccceeEEEeeccceEEEecCCcchHHHHHHHHHHHHhh--hhhcCCCCCHHHHHHHHhhHHHH
Confidence 344568888863 235569999999998754 454 99999999999999873
No 136
>TIGR02838 spore_V_AC stage V sporulation protein AC. This model describes stage V sporulation protein AC, a paralog of stage V sporulation protein AE. Both are proteins found to present in a species if and only if that species is one of the Firmicutes capable of endospore formation, as of the time of the publication of the genome of Carboxydothermus hydrogenoformans. Mutants in spoVAC have a stage V sproulation defect.
Probab=26.28 E-value=74 Score=26.57 Aligned_cols=51 Identities=14% Similarity=0.170 Sum_probs=44.4
Q ss_pred CCCCCccccchhhhhhhHHHHHHHHHHHHhhHHhC-CCcccchhHHHHHHHh
Q 029243 80 QTETPFGYTRKDVLLIGVGVTVIGYGLKSGLELFG-VDPLQAGNVVEVIVVL 130 (196)
Q Consensus 80 ~~~~pfgMtR~DVI~IGlgvfllGg~ly~gLq~~G-lD~~~AGiwsQ~lLVl 130 (196)
+.|.|..|.=..+-++|-.+-++|=++.-.+...| ++..+|+.|.-++||+
T Consensus 11 ~Pk~~~~~n~l~AFlvGG~IC~iGQ~l~d~~~~~~~lt~~~a~~~~~~~lV~ 62 (141)
T TIGR02838 11 EPKPPYLKNCVMAFLVGGLICLIGQLISDFYLRYFQFSEKTAGSPTSATLIF 62 (141)
T ss_pred CCCCcHHHHHHHHHHhCcHHHHHHHHHHHHHHHhccCChhhcccchhhHHHH
Confidence 35688889999999999999999988888888788 9999999999888876
No 137
>PRK05457 heat shock protein HtpX; Provisional
Probab=26.26 E-value=3.3e+02 Score=24.00 Aligned_cols=35 Identities=11% Similarity=0.321 Sum_probs=26.1
Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 151 YAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 151 Y~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
+.-+.++|+.+..-.++-. |+.+...++.+++...
T Consensus 211 ~~SR~rEy~AD~~Aa~ltg--p~~L~~AL~KL~~~~~ 245 (284)
T PRK05457 211 WFSRHREFRADAGGAKLAG--REKMIAALQRLKTSYE 245 (284)
T ss_pred HHHhHHHHHHHHHHHHhhC--HHHHHHHHHHHHhhCc
Confidence 3356777887777777765 9999999999987655
No 138
>smart00529 HTH_DTXR Helix-turn-helix diphteria tox regulatory element. iron dependent repressor
Probab=25.95 E-value=2.3e+02 Score=19.71 Aligned_cols=72 Identities=17% Similarity=0.194 Sum_probs=34.3
Q ss_pred HhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHHHHHHHHH--hcCCHHHHHHHHHHHHH
Q 029243 112 LFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRL--EGLTEAELEALIEQVEE 184 (196)
Q Consensus 112 ~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~avLQKRl--eeLtpEEle~L~aEIE~ 184 (196)
.+|++.....-..+-+-=-|+ |-..-.-.+++--+-....++.....+.....-. ..+++||.+++...++.
T Consensus 8 ~l~is~stvs~~l~~L~~~gl-I~r~~~~~~~lT~~g~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~l~~~l~~ 81 (96)
T smart00529 8 RLNVSPPTVTQMLKKLEKDGL-VEYEPYRGITLTEKGRRLARRLLRKHRLLERFLVDVLGVDEEEVHEEAERLEH 81 (96)
T ss_pred HhCCChHHHHHHHHHHHHCCC-EEEcCCCceEechhHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHc
Confidence 455555444444443333332 1111111233333334444455444322222222 27999999999988874
No 139
>PF09677 TrbI_Ftype: Type-F conjugative transfer system protein (TrbI_Ftype); InterPro: IPR014115 This entry represents TrbI, an essential component of the F-type conjugative transfer system for plasmid DNA transfer that has been shown to be localized to the periplasm [, ].
Probab=25.81 E-value=65 Score=25.32 Aligned_cols=14 Identities=36% Similarity=0.541 Sum_probs=10.1
Q ss_pred hcCCHHHHHHHHHH
Q 029243 168 EGLTEAELEALIEQ 181 (196)
Q Consensus 168 eeLtpEEle~L~aE 181 (196)
.+|||||.+++.+.
T Consensus 52 ~~lt~~q~~a~t~~ 65 (111)
T PF09677_consen 52 SSLTPEQVEALTQR 65 (111)
T ss_pred cCCCHHHHHHHHHH
Confidence 68899888766543
No 140
>PTZ00459 mucin-associated surface protein (MASP); Provisional
Probab=25.77 E-value=33 Score=30.71 Aligned_cols=13 Identities=31% Similarity=0.664 Sum_probs=11.7
Q ss_pred HHHHhhhhhheee
Q 029243 126 VIVVLGLTLGWVS 138 (196)
Q Consensus 126 ~lLVlGlvvgWv~ 138 (196)
+|||..|||.|-+
T Consensus 9 VLLVCALCVLWCg 21 (291)
T PTZ00459 9 VLLVCALCVLWCG 21 (291)
T ss_pred HHHHHHHHHHhcC
Confidence 7899999999976
No 141
>PTZ00134 40S ribosomal protein S18; Provisional
Probab=25.76 E-value=38 Score=28.29 Aligned_cols=21 Identities=33% Similarity=0.521 Sum_probs=18.4
Q ss_pred HHHHhcCCHHHHHHHHHHHHH
Q 029243 164 QKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 164 QKRleeLtpEEle~L~aEIE~ 184 (196)
.+|+.+||++|+++|.+.|++
T Consensus 55 ~~~~~~Lt~~qi~~l~~~i~~ 75 (154)
T PTZ00134 55 TKRAGELTAEEIEKIVEIIAN 75 (154)
T ss_pred CCCcccCCHHHHHHHHHHHhc
Confidence 467889999999999999976
No 142
>PF10953 DUF2754: Protein of unknown function (DUF2754); InterPro: IPR020490 This entry contains membrane proteins with no known function.
Probab=25.66 E-value=45 Score=25.03 Aligned_cols=46 Identities=24% Similarity=0.298 Sum_probs=31.7
Q ss_pred hhhhhHHHHHHHHHHHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeee
Q 029243 92 VLLIGVGVTVIGYGLKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFR 143 (196)
Q Consensus 92 VI~IGlgvfllGg~ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfR 143 (196)
+..||+...+-|..+..||.+-|..+-..|.+.= ++--|+..++.|
T Consensus 15 afa~glifilngvvgllgfeakgwqtyavglvtw------visfwlag~iir 60 (70)
T PF10953_consen 15 AFAIGLIFILNGVVGLLGFEAKGWQTYAVGLVTW------VISFWLAGFIIR 60 (70)
T ss_pred HHHHHHHHHhhchhhhceecccCceeeeehhHHH------HHHHHHhhheee
Confidence 5678888888888888888888887766654321 224466666665
No 143
>COG0053 MMT1 Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]
Probab=25.51 E-value=1.1e+02 Score=27.34 Aligned_cols=67 Identities=15% Similarity=0.238 Sum_probs=38.7
Q ss_pred CCCCccccchhhh---hhhHHHHHHHHH-HHHhhHHhC--CCcccchhHHHHHHHhhhhhheeeeeeeeeecC
Q 029243 81 TETPFGYTRKDVL---LIGVGVTVIGYG-LKSGLELFG--VDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNK 147 (196)
Q Consensus 81 ~~~pfgMtR~DVI---~IGlgvfllGg~-ly~gLq~~G--lD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk 147 (196)
.+-|||.-|..-+ .+++.+++.|+. +|..+..+- -+.-...+..-+.++..++-.|+..|..|+.-+
T Consensus 71 ~~HpyGh~k~E~l~sl~~~~~i~~~g~~i~~~a~~~~~~~~~~~~~~~~~~v~l~s~~~~~~l~~~~~~~~kk 143 (304)
T COG0053 71 RDHPYGHGKAETLASLIVSILIFAAGFEILLEAIKRLISPQPVEPPLLALGVALISIVIKEALYRYLRRVGKK 143 (304)
T ss_pred CCCCCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4589999998754 455555555543 334444332 333333333334444444678888898887643
No 144
>PF02686 Glu-tRNAGln: Glu-tRNAGln amidotransferase C subunit; InterPro: IPR003837 Glu-tRNAGln amidotransferase is a heterotrimeric enzyme that is required for correct decoding of glutamine codons during translation. The Glu-tRNA Gln amidotransferase enzyme is an important translational fidelity mechanism replacing incorrectly charged Glu-tRNAGln with the correct Gln-tRANGln via transmidation of the misacylated Glu-tRNAGln []. This activity supplements the lack of glutaminyl-tRNA synthetase activity in Gram-positive eubacteria, cyanobacteria, archaea, and organelles [].; GO: 0006450 regulation of translational fidelity; PDB: 2DQN_C 3IP4_C 2G5I_C 2F2A_C 2G5H_C 2DF4_C 3AL0_C 3KFU_G 3H0R_I 3H0L_U ....
Probab=25.31 E-value=66 Score=21.93 Aligned_cols=15 Identities=53% Similarity=0.747 Sum_probs=11.7
Q ss_pred CCHHHHHHHHHHHHH
Q 029243 170 LTEAELEALIEQVEE 184 (196)
Q Consensus 170 LtpEEle~L~aEIE~ 184 (196)
|++||++++.++++.
T Consensus 1 l~eeE~~~~~~~l~~ 15 (72)
T PF02686_consen 1 LTEEELEKLTKQLND 15 (72)
T ss_dssp --HHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHH
Confidence 689999999999875
No 145
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=25.28 E-value=1.1e+02 Score=21.19 Aligned_cols=27 Identities=19% Similarity=0.346 Sum_probs=14.0
Q ss_pred HHHHHHH-----HHHHHHHHhc-CCHHHHHHHH
Q 029243 153 QQLRDYE-----DKVMQKRLEG-LTEAELEALI 179 (196)
Q Consensus 153 qQ~kdYE-----~avLQKRlee-LtpEEle~L~ 179 (196)
+|++++- +..|+++|.+ .+++|.-+|-
T Consensus 2 e~l~~Fl~~~~~d~~L~~~l~~~~~~e~~~~lA 34 (64)
T TIGR03798 2 EQLKAFLEKVKTDPDLREKLKAAEDPEDRVAIA 34 (64)
T ss_pred HHHHHHHHHHHcCHHHHHHHHHcCCHHHHHHHH
Confidence 4555554 5556666655 3455544443
No 146
>COG2938 Uncharacterized conserved protein [Function unknown]
Probab=25.24 E-value=58 Score=25.40 Aligned_cols=29 Identities=21% Similarity=0.374 Sum_probs=23.5
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
.-..++.+++||++|+..+..=+|++.++
T Consensus 34 ~~Fae~~~~~lsd~el~~f~~LLe~~D~d 62 (94)
T COG2938 34 GPFAEKEFDSLSDEELDEFERLLECEDND 62 (94)
T ss_pred HHHHHHHHhhCCHHHHHHHHHHHcCCcHH
Confidence 45678999999999999988877765543
No 147
>PHA02592 52 DNA topisomerase II medium subunit; Provisional
Probab=25.15 E-value=76 Score=30.31 Aligned_cols=29 Identities=28% Similarity=0.393 Sum_probs=24.5
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
++.|.-||-.||.+|.++|++|+++-+++
T Consensus 389 ~~lL~m~L~~LT~~e~~kL~~e~~~l~~e 417 (439)
T PHA02592 389 DKLVAMNIYSMTSDEREKLQKEAEELEKE 417 (439)
T ss_pred HHHHHhHHHHhhHHHHHHHHHHHHHHHHH
Confidence 77888899999999999999999875443
No 148
>TIGR00782 ccoP cytochrome c oxidase, cbb3-type, subunit III. This model describes a di-heme subunit of approximately 26 kDa of the cbb3 type copper and heme-containing cytochrome oxidase.
Probab=25.09 E-value=2.2e+02 Score=24.73 Aligned_cols=17 Identities=18% Similarity=0.012 Sum_probs=9.6
Q ss_pred hhhhhHHHHHHHHHHHH
Q 029243 92 VLLIGVGVTVIGYGLKS 108 (196)
Q Consensus 92 VI~IGlgvfllGg~ly~ 108 (196)
.+.++.+||.+++.++|
T Consensus 32 ~~f~~~i~~~~~y~~~y 48 (285)
T TIGR00782 32 WTFYATIVWGFGYLVAY 48 (285)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 34555666666665544
No 149
>TIGR03766 conserved hypothetical integral membrane protein. Models TIGR03110, TIGR03111, and TIGR03112 describe a three-gene system found in several Gram-positive bacteria, where TIGR03110 is distantly related to a putative transpeptidase, exosortase (TIGR02602). This model describes a small clade that correlates by both gene clustering and phyletic pattern, although imperfectly, to the three gene system. Both this narrow clade, and the larger set of full-length homologous integral membrane proteins, have an especially well-conserved region near the C-terminus with an invariant tyrosine. The function is unknown.
Probab=25.04 E-value=79 Score=29.58 Aligned_cols=32 Identities=19% Similarity=0.163 Sum_probs=26.5
Q ss_pred cHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHH
Q 029243 150 TYAQQLRDYEDKVMQKRLEGLTEAELEALIEQ 181 (196)
Q Consensus 150 TY~qQ~kdYE~avLQKRleeLtpEEle~L~aE 181 (196)
...+++++|..+++++|++++.|...-++..+
T Consensus 330 ~~~~~k~~~~~k~I~~Rl~~~g~~g~~k~~~~ 361 (483)
T TIGR03766 330 PTKAEKNKYSIKEIKRRLKEKGFAGYIKFLIQ 361 (483)
T ss_pred CCHHHHHHhhHHHHHHHHHHcCchHHHHHHHH
Confidence 33456888889999999999999999888755
No 150
>KOG3248 consensus Transcription factor TCF-4 [Transcription]
Probab=25.03 E-value=1.2e+02 Score=29.35 Aligned_cols=38 Identities=21% Similarity=0.435 Sum_probs=27.0
Q ss_pred ccHHHHHHHHH------------HHHHHHHHhcCCHHHHHHHHHHHHHHH
Q 029243 149 MTYAQQLRDYE------------DKVMQKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 149 MTY~qQ~kdYE------------~avLQKRleeLtpEEle~L~aEIE~Ek 186 (196)
|-||+-.|.=- |+.|-+|+.+|+.||++|--+=-.+|+
T Consensus 200 mlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKer 249 (421)
T KOG3248|consen 200 MLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKER 249 (421)
T ss_pred HHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHH
Confidence 77777654321 778889999999999998754443343
No 151
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=24.99 E-value=1.7e+02 Score=24.91 Aligned_cols=25 Identities=20% Similarity=0.182 Sum_probs=14.5
Q ss_pred HHHHHHHHhcCCHHHH--HHHHHHHHH
Q 029243 160 DKVMQKRLEGLTEAEL--EALIEQVEE 184 (196)
Q Consensus 160 ~avLQKRleeLtpEEl--e~L~aEIE~ 184 (196)
++.|+++.++|..+.. +++++|.++
T Consensus 78 n~~L~~e~~~l~~~~~~~~~l~~en~~ 104 (276)
T PRK13922 78 NEELKKELLELESRLQELEQLEAENAR 104 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667777777655433 445555544
No 152
>PF02468 PsbN: Photosystem II reaction centre N protein (psbN); InterPro: IPR003398 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbN found in PSII. PsbN may have a role in PSII stability, however its actual function unknown. PsbN does not appear to be essential for photoautotrophic growth or normal PSII function.; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane
Probab=24.92 E-value=66 Score=22.22 Aligned_cols=25 Identities=20% Similarity=0.171 Sum_probs=20.9
Q ss_pred hhhhhhhHHHHHHHHHHHHhhHHhC
Q 029243 90 KDVLLIGVGVTVIGYGLKSGLELFG 114 (196)
Q Consensus 90 ~DVI~IGlgvfllGg~ly~gLq~~G 114 (196)
.-++.|.++++++|.-+|....++|
T Consensus 4 a~~~~i~i~~~lv~~Tgy~iYtaFG 28 (43)
T PF02468_consen 4 ATVLAIFISCLLVSITGYAIYTAFG 28 (43)
T ss_pred eeeHHHHHHHHHHHHHhhhhhheeC
Confidence 4567888999999988888888888
No 153
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length.
Probab=24.80 E-value=1.1e+02 Score=26.12 Aligned_cols=51 Identities=24% Similarity=0.319 Sum_probs=27.0
Q ss_pred cchhhhhhhHHHHHHHHHHHHhhH-HhCCCcccchhH--HHHHHHhhhhhheeee
Q 029243 88 TRKDVLLIGVGVTVIGYGLKSGLE-LFGVDPLQAGNV--VEVIVVLGLTLGWVST 139 (196)
Q Consensus 88 tR~DVI~IGlgvfllGg~ly~gLq-~~GlD~~~AGiw--sQ~lLVlGlvvgWv~S 139 (196)
+-+.+++++++.+++|.++=.++. .+-.+... |.| .|+.-+..+++-|+++
T Consensus 138 ~~~rA~~~~~~~L~~G~~lGs~l~~~l~~~~~p-~~~s~~~~~sl~~~i~lwl~s 191 (194)
T PF11833_consen 138 KLGRAFLWTLGGLVVGLILGSLLASWLPVDIVP-GPWSPEQLVSLFTYILLWLVS 191 (194)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-CCCCHHHHHHHHHHHHHHHHH
Confidence 445788888888888855444443 44333222 222 2222222335678765
No 154
>PRK04654 sec-independent translocase; Provisional
Probab=24.76 E-value=76 Score=28.24 Aligned_cols=32 Identities=25% Similarity=0.270 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH
Q 029243 151 YAQQLRDYEDKVMQKRLEGLTEAELEALIEQV 182 (196)
Q Consensus 151 Y~qQ~kdYE~avLQKRleeLtpEEle~L~aEI 182 (196)
+..+.|++=+.+.+.=-+|+..+|+.+.++++
T Consensus 35 ~irk~R~~~~~vk~El~~El~~~ELrk~l~~~ 66 (214)
T PRK04654 35 WVRRARMQWDSVKQELERELEAEELKRSLQDV 66 (214)
T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH
Confidence 44455555544444444555555555544433
No 155
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=24.75 E-value=86 Score=26.79 Aligned_cols=23 Identities=22% Similarity=0.231 Sum_probs=19.6
Q ss_pred HHHHHhcCCHHHHHHHHHHHHHH
Q 029243 163 MQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 163 LQKRleeLtpEEle~L~aEIE~E 185 (196)
|-.-+|||+|+|.|++.+.+++-
T Consensus 91 l~ee~eeLs~deke~~~~sl~dL 113 (158)
T PF10083_consen 91 LIEEDEELSPDEKEQFKESLPDL 113 (158)
T ss_pred HHHHhhcCCHHHHHHHHhhhHHH
Confidence 44479999999999999999873
No 156
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=24.73 E-value=1.4e+02 Score=20.69 Aligned_cols=28 Identities=18% Similarity=0.184 Sum_probs=21.6
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
...+-.+..+||+||.+.-.++-++++.
T Consensus 33 sk~~g~~Wk~ls~eeK~~y~~~A~~~k~ 60 (77)
T cd01389 33 SRIIGRMWRSESPEVKAYYKELAEEEKE 60 (77)
T ss_pred HHHHHHHHhhCCHHHHHHHHHHHHHHHH
Confidence 3456678999999999998777766654
No 157
>cd00633 Secretoglobin Secretoglobins are relatively small, secreted, disulphide-bridged dimeric proteins with encoding genes sharing substantial sequence similarity. Their family subunits may be grouped into five subfamilies, A-E. Uteroglobin (subfamily A), which is identical to Clara cell protein (CC10), forms a globular shaped homodimer with a large hydrophobic pocket located between the two dimers. The uteroglobin monomer structure is composed of four alpha helices that do not form a canonical four helix-bundle motif but rather a boomerang-shaped structure in which helices H1, H3, and H4 are able to bind a homodimeric partner. The hydrophobic pocket binds steroids, particularly progesterone, with high specificity. However, the true biological function of uteroglobin is poorly understood. In mammals, uteroglobin has immunosuppressive and anti-inflammatory properties through the inhibition of phospholipase A2. The other four main subfamilies of secretoglobins are found in heterodimeri
Probab=24.43 E-value=1.7e+02 Score=20.27 Aligned_cols=35 Identities=11% Similarity=0.289 Sum_probs=27.1
Q ss_pred ccHHHHHHHHH--------HHHHHHHHhcCCHHHHHHHHHHHH
Q 029243 149 MTYAQQLRDYE--------DKVMQKRLEGLTEAELEALIEQVE 183 (196)
Q Consensus 149 MTY~qQ~kdYE--------~avLQKRleeLtpEEle~L~aEIE 183 (196)
=-|+..++.|. ...+++-+++++++.+.....=++
T Consensus 17 ~~y~~~L~~f~~~~~~~~A~~~lK~C~d~~~~e~k~~~~~~m~ 59 (67)
T cd00633 17 EEYKAELEKFNATPEAVEAKEKLKQCVDEQSLETKENIAKLLE 59 (67)
T ss_pred HHHHHHHHhcCCCHHHHHHHHHHHHHHhcCCHhHHHHHHHHHH
Confidence 35889999997 567899999999988776554443
No 158
>PF04695 Pex14_N: Peroxisomal membrane anchor protein (Pex14p) conserved region; InterPro: IPR006785 This conserved region defines a group of peroxisomal membrane anchor proteins which bind the PTS1 (peroxisomal targeting signal) receptor and are required for the import of PTS1-containing proteins into peroxisomes. Loss of functional Pex14p results in defects in both the PTS1 and PTS2-dependent import pathways. Deletion analysis of this conserved region implicates it in selective peroxisome degradation. In the majority of members this region is situated at the N terminus of the protein [, ].; GO: 0005777 peroxisome, 0016020 membrane; PDB: 2W85_A 2W84_A 3FF5_B.
Probab=24.33 E-value=78 Score=25.08 Aligned_cols=26 Identities=19% Similarity=0.397 Sum_probs=18.5
Q ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHH
Q 029243 159 EDKVMQKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 159 E~avLQKRleeLtpEEle~L~aEIE~Ek 186 (196)
+-+.|++| .||+||++.++++.+...
T Consensus 26 k~~FL~sK--GLt~~EI~~al~~a~~~~ 51 (136)
T PF04695_consen 26 KIAFLESK--GLTEEEIDEALGRAGSPP 51 (136)
T ss_dssp HHHHHHHC--T--HHHHHHHHHHHT--S
T ss_pred HHHHHHcC--CCCHHHHHHHHHhcCCcc
Confidence 45677777 999999999999988765
No 159
>cd02000 TPP_E1_PDC_ADC_BCADC Thiamine pyrophosphate (TPP) family, E1 of PDC_ADC_BCADC subfamily, TPP-binding module; composed of proteins similar to the E1 components of the human pyruvate dehydrogenase complex (PDC), the acetoin dehydrogenase complex (ADC) and the branched chain alpha-keto acid dehydrogenase/2-oxoisovalerate dehydrogenase complex (BCADC). PDC catalyzes the irreversible oxidative decarboxylation of pyruvate to produce acetyl-CoA in the bridging step between glycolysis and the citric acid cycle. ADC participates in the breakdown of acetoin while BCADC participates in the breakdown of branched chain amino acids. BCADC catalyzes the oxidative decarboxylation of 4-methyl-2-oxopentanoate, 3-methyl-2-oxopentanoate and 3-methyl-2-oxobutanoate (branched chain 2-oxo acids derived from the transamination of leucine, valine and isoleucine).
Probab=24.30 E-value=1e+02 Score=26.77 Aligned_cols=58 Identities=28% Similarity=0.219 Sum_probs=34.2
Q ss_pred hhhhheeeeeeeeeec-C-CccHHHHHHHHH-------HHHHHHHHhc---CCHHHHHHHHHHHHHHHhc
Q 029243 131 GLTLGWVSTYIFRVSN-K-EMTYAQQLRDYE-------DKVMQKRLEG---LTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 131 GlvvgWv~SYlfRV~t-k-~MTY~qQ~kdYE-------~avLQKRlee---LtpEEle~L~aEIE~Ek~~ 188 (196)
|-++.-+.+|-++=-+ . +-+.-.+..+++ -..++++|.+ +|++|++++.+|+.++=++
T Consensus 219 ~P~lIev~~~r~~gHs~~dd~~~Yr~~~e~~~~~~~dpi~~~~~~L~~~g~~~~~~~~~~~~~~~~~v~~ 288 (293)
T cd02000 219 GPTLIEAVTYRLGGHSTSDDPSRYRTKEEVEEWKKRDPILRLRKYLIEAGILTEEELAAIEAEVKAEVEE 288 (293)
T ss_pred CCEEEEEEEeccCCCCCCCCcccCCCHHHHHHHhcCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH
Confidence 3455566665555443 1 112212333333 2456777765 7999999999999886543
No 160
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=24.03 E-value=2.6e+02 Score=19.53 Aligned_cols=25 Identities=28% Similarity=0.320 Sum_probs=16.0
Q ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 159 EDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 159 E~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
+-+.++++++.+ .+|..+|+.|++.
T Consensus 32 ~~~~~~~~~~~l-~~en~~L~~ei~~ 56 (85)
T TIGR02209 32 ELQKLQLEIDKL-QKEWRDLQLEVAE 56 (85)
T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHH
Confidence 345566666665 5667777777765
No 161
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=23.98 E-value=1.5e+02 Score=20.30 Aligned_cols=28 Identities=21% Similarity=0.357 Sum_probs=22.3
Q ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 161 KVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 161 avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
..|-+...+||++|.+..+++-++++++
T Consensus 34 k~l~~~Wk~ls~~eK~~y~~~a~~~k~~ 61 (72)
T cd01388 34 KILGDRWKALSNEEKQPYYEEAKKLKEL 61 (72)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHHHHH
Confidence 4567788999999999998887776653
No 162
>PRK05382 chorismate synthase; Validated
Probab=23.78 E-value=1.5e+02 Score=28.04 Aligned_cols=51 Identities=14% Similarity=0.181 Sum_probs=30.4
Q ss_pred eeeeeeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 136 WVSTYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 136 Wv~SYlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
.+.+|+.++-+-+.+ ..+.++++..-++-+.-+.|+--+++.++|++-+++
T Consensus 144 ~i~s~v~~IG~i~~~--~~~~~~~~~~~~~~~~~~d~~~~~~m~~~I~~ak~~ 194 (359)
T PRK05382 144 EVRGHVVQIGGIEAD--LDWEEVEERADANPVRCPDPEAEEEMEELIDEAKKE 194 (359)
T ss_pred EEEEEEEEEcCCcCC--CCHHHHHhHhhcCCcCCCCHHHHHHHHHHHHHHHHc
Confidence 788999999877665 112222222223445555666667788888775543
No 163
>PF09446 VMA21: VMA21-like domain; InterPro: IPR019013 The vacuolar ATPase assembly integral membrane protein VMA21 is required for the assembly of the integral membrane sector (V0 component) of the vacuolar ATPase (V-ATPase) in the endoplasmic reticulum []. This entry represents a putative short domain found in VMA21-like proteins, and which appears to contain two potential transmembrane helices.
Probab=23.77 E-value=79 Score=22.55 Aligned_cols=52 Identities=19% Similarity=0.328 Sum_probs=34.3
Q ss_pred hhhhhhHHHHHHHHHHHHhhHHh------CCCcccchhHHHHHHHhhhhhheeeeeeee
Q 029243 91 DVLLIGVGVTVIGYGLKSGLELF------GVDPLQAGNVVEVIVVLGLTLGWVSTYIFR 143 (196)
Q Consensus 91 DVI~IGlgvfllGg~ly~gLq~~------GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfR 143 (196)
+.+.-.+.++.+..+.|++++.. ++|+.++.+++-.+-|+++ -.=+..|++|
T Consensus 7 kLl~fs~~M~~lPl~~ff~~~~~~~~~~~~~~~~~~~~~sgi~Avi~v-nvVl~~Yi~~ 64 (66)
T PF09446_consen 7 KLLFFSVLMFTLPLGTFFGFKYFLFDGFFGLSPSDNTIYSGIAAVIVV-NVVLASYIYV 64 (66)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccccCCcchhhHHHHHHHHHHH-HHHHHHHHHH
Confidence 34566778889999999987744 4555677777776666643 3344556554
No 164
>smart00096 UTG Uteroglobin.
Probab=23.64 E-value=1.9e+02 Score=21.37 Aligned_cols=35 Identities=17% Similarity=0.477 Sum_probs=27.7
Q ss_pred cHHHHHHHHH--------HHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 150 TYAQQLRDYE--------DKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 150 TY~qQ~kdYE--------~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
.|.++++.|. ...+.+-.|.||++..+.+.+=+|+
T Consensus 20 ~Y~~~l~~y~~~~~~~ea~~~lK~cvD~L~~~~k~~i~~ll~k 62 (69)
T smart00096 20 SYEASLKQFKPDPDMLEAGRQLKKLVDTLPQETRENILKLTEK 62 (69)
T ss_pred HHHHHHHhcCCCHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Confidence 4999999998 4567788899999988776665554
No 165
>PF13807 GNVR: G-rich domain on putative tyrosine kinase
Probab=23.49 E-value=1.5e+02 Score=21.05 Aligned_cols=36 Identities=19% Similarity=0.081 Sum_probs=26.0
Q ss_pred CCCCCCCCCccccchhhhhhhHHHHHHHHHHHHhhH
Q 029243 76 DSSDQTETPFGYTRKDVLLIGVGVTVIGYGLKSGLE 111 (196)
Q Consensus 76 ~~~~~~~~pfgMtR~DVI~IGlgvfllGg~ly~gLq 111 (196)
+++.....|-+-+|.=.+++|+++-++.++++..++
T Consensus 45 d~A~~P~~P~~P~~~lil~l~~~~Gl~lgi~~~~~r 80 (82)
T PF13807_consen 45 DPAIVPDKPVSPKRALILALGLFLGLILGIGLAFLR 80 (82)
T ss_pred cccccCCCCCCCcHHHHHHHHHHHHHHHHHHHHHHh
Confidence 555555578888999888888877666666666554
No 166
>PRK02463 OxaA-like protein precursor; Provisional
Probab=23.42 E-value=5.5e+02 Score=23.36 Aligned_cols=44 Identities=18% Similarity=0.127 Sum_probs=26.4
Q ss_pred eeeeecCCccHHHHHHHH-----------HHHHHHHHHh-cCCHHHHHHHHHHHHH
Q 029243 141 IFRVSNKEMTYAQQLRDY-----------EDKVMQKRLE-GLTEAELEALIEQVEE 184 (196)
Q Consensus 141 lfRV~tk~MTY~qQ~kdY-----------E~avLQKRle-eLtpEEle~L~aEIE~ 184 (196)
+.|++.=-++..||++.. |-+.+|+|.. .-++||.++.++|+-+
T Consensus 70 ivRlillPL~i~q~~ka~~~~~KM~~lqPe~~~Iq~Kyk~~~~~~~~~~~q~em~~ 125 (307)
T PRK02463 70 IVRLIILPLGLYQSWKATYQSEKMAYLKPVFEPINERLKNATTQEEKMAAQTELMA 125 (307)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhChhHHHHHHHHhcCCChHHHHHHHHHHHH
Confidence 457666555555665533 2455777874 4467777777766643
No 167
>PHA01080 hypothetical protein
Probab=23.35 E-value=42 Score=25.95 Aligned_cols=31 Identities=26% Similarity=0.478 Sum_probs=20.2
Q ss_pred hHHhCCCcccchhHHHHHHHhhhhhheeeee
Q 029243 110 LELFGVDPLQAGNVVEVIVVLGLTLGWVSTY 140 (196)
Q Consensus 110 Lq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SY 140 (196)
++..+++|.++|.+-.+.+..-+++|++++|
T Consensus 39 ~~~~~itpt~~~qaFglgF~~V~~lgyls~Y 69 (80)
T PHA01080 39 MSYTRITPTDAGTAFSFGFMAVFALGYLSTY 69 (80)
T ss_pred HHHhccCHHHHHHHHhhhHHHHHHhhhhHHH
Confidence 5677777777777666666555555666655
No 168
>PF03937 Sdh5: Flavinator of succinate dehydrogenase; InterPro: IPR005631 This entry represents a group of uncharacterised small proteins found in both eukaryotes and prokaryotes, including NMA1147 from Neisseria meningitidis [] and YgfY from Escherichia coli []. YgfY may be involved in transcriptional regulation. The structure of these proteins consists of a complex bundle of five alpha-helices, which is composed of an up-down 3-helix bundle plus an orthogonal 2-helix bundle. ; PDB: 2LM4_A 1PUZ_A 2JR5_A 1X6I_A 1X6J_A.
Probab=23.27 E-value=79 Score=22.72 Aligned_cols=28 Identities=25% Similarity=0.420 Sum_probs=20.8
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHh
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek~ 187 (196)
...+++.++.|+++|++++..=++.+..
T Consensus 22 ~~f~~~~~~~l~~~el~~fe~lL~~~D~ 49 (74)
T PF03937_consen 22 GRFADAHLDSLSEEELDAFERLLDLEDP 49 (74)
T ss_dssp HHHHHHHHHHS-HHHHHHHHHHHTS-HH
T ss_pred HHHHHHHHhhCCHHHHHHHHHHHcCCCH
Confidence 4567899999999999998876665543
No 169
>PRK14471 F0F1 ATP synthase subunit B; Provisional
Probab=23.26 E-value=1.2e+02 Score=24.13 Aligned_cols=35 Identities=11% Similarity=-0.073 Sum_probs=19.6
Q ss_pred hhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHH
Q 029243 121 GNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYE 159 (196)
Q Consensus 121 GiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE 159 (196)
-++.|++-++ ++.|+..+++ ...=+.++++|+++=
T Consensus 10 ~~~~~~i~Fl--il~~ll~~~l--~~pi~~~l~~R~~~I 44 (164)
T PRK14471 10 LFFWQTILFL--ILLLLLAKFA--WKPILGAVKEREDSI 44 (164)
T ss_pred HHHHHHHHHH--HHHHHHHHHh--HHHHHHHHHHHHHHH
Confidence 3445555555 3666664433 233345778888775
No 170
>PRK11512 DNA-binding transcriptional repressor MarR; Provisional
Probab=23.26 E-value=2.5e+02 Score=21.47 Aligned_cols=93 Identities=15% Similarity=0.131 Sum_probs=51.7
Q ss_pred CCccccchhhhhhhHHHHHHHHH-HHHhhHHhCCCcccchhHHHHHHHhhhhhheeeeeeeeeec-------------CC
Q 029243 83 TPFGYTRKDVLLIGVGVTVIGYG-LKSGLELFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSN-------------KE 148 (196)
Q Consensus 83 ~pfgMtR~DVI~IGlgvfllGg~-ly~gLq~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~t-------------k~ 148 (196)
.|+|+|.....++...- --|+. .--.-..+|+|.....-...-+-=-|+ +-|+.. +-
T Consensus 34 ~~~glt~~q~~vL~~l~-~~~~~t~~eLa~~l~i~~~tvsr~l~~Le~~Gl--------I~R~~~~~DrR~~~l~LT~~G 104 (144)
T PRK11512 34 SPLDITAAQFKVLCSIR-CAACITPVELKKVLSVDLGALTRMLDRLVCKGW--------VERLPNPNDKRGVLVKLTTSG 104 (144)
T ss_pred cccCCCHHHHHHHHHHH-HcCCCCHHHHHHHHCCCHHHHHHHHHHHHHCCC--------EEeccCcccCCeeEeEEChhH
Confidence 57787776654433221 11222 222345788887776666666555543 344432 22
Q ss_pred ccHHHHHH-HHHHHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 149 MTYAQQLR-DYEDKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 149 MTY~qQ~k-dYE~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
....++.. .+.++..++=++.|+|||++.|.+-+++
T Consensus 105 ~~~~~~~~~~~~~~~~~~l~~~ls~ee~~~l~~~L~k 141 (144)
T PRK11512 105 AAICEQCHQLVGQDLHQELTKNLTADEVATLEHLLKK 141 (144)
T ss_pred HHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHHH
Confidence 22233322 2334555667899999999999876654
No 171
>PRK10870 transcriptional repressor MprA; Provisional
Probab=23.23 E-value=2.2e+02 Score=23.10 Aligned_cols=71 Identities=11% Similarity=0.259 Sum_probs=47.8
Q ss_pred hHHhCCCcccchhHHHHHHHhhhhhheee---------eeeeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Q 029243 110 LELFGVDPLQAGNVVEVIVVLGLTLGWVS---------TYIFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIE 180 (196)
Q Consensus 110 Lq~~GlD~~~AGiwsQ~lLVlGlvvgWv~---------SYlfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~a 180 (196)
-..+|+|..++.-...-+-= -|||- ++..+.--+--...++.....++..+.=++.|+++|++.|..
T Consensus 78 a~~l~l~~~tvsr~v~rLe~----kGlV~R~~~~~DrR~~~v~LT~~G~~~~~~i~~~~~~~~~~~~~~ls~~e~~~l~~ 153 (176)
T PRK10870 78 SCALGSSRTNATRIADELEK----RGWIERRESDNDRRCLHLQLTEKGHEFLREVLPPQHNCLHQLWSALSTTEKDQLEQ 153 (176)
T ss_pred HHHHCCCHHHHHHHHHHHHH----CCCEEecCCCCCCCeeEEEECHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Confidence 44778777554444444333 45554 234444444455777888888888888899999999999887
Q ss_pred HHHH
Q 029243 181 QVEE 184 (196)
Q Consensus 181 EIE~ 184 (196)
-+++
T Consensus 154 ~L~k 157 (176)
T PRK10870 154 ITRK 157 (176)
T ss_pred HHHH
Confidence 7765
No 172
>PRK00247 putative inner membrane protein translocase component YidC; Validated
Probab=23.22 E-value=3e+02 Score=26.57 Aligned_cols=63 Identities=14% Similarity=0.227 Sum_probs=33.8
Q ss_pred HhCCCcccchhHHHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHH-------HHHHHHHH-hcCCHHHHHHHHHHHH
Q 029243 112 LFGVDPLQAGNVVEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYE-------DKVMQKRL-EGLTEAELEALIEQVE 183 (196)
Q Consensus 112 ~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE-------~avLQKRl-eeLtpEEle~L~aEIE 183 (196)
.+|+|+. .+.|..++++. + +.|++-=-.++.|++..-. =+.||++. +..++|+++++++|.-
T Consensus 23 ~~G~~~~-l~W~isIi~lt-i--------iVRliLlPL~~~q~ks~~km~~lqPel~~iq~kyk~~~d~e~~~~~qqe~~ 92 (429)
T PRK00247 23 VLGLDDS-LAWFASLFGLV-I--------TVRAIIAPFTWQQYKSGRTAAHIRPKRKALREEYKGKTDEASIRELQQKQK 92 (429)
T ss_pred cccCcCC-CcHHHHHHHHH-H--------HHHHHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHhcCCCHHHHHHHHHHHH
Confidence 6788773 44454444432 2 2344444445555443322 34456666 4567777777777665
Q ss_pred H
Q 029243 184 E 184 (196)
Q Consensus 184 ~ 184 (196)
+
T Consensus 93 ~ 93 (429)
T PRK00247 93 D 93 (429)
T ss_pred H
Confidence 4
No 173
>PRK13183 psbN photosystem II reaction center protein N; Provisional
Probab=23.09 E-value=55 Score=22.98 Aligned_cols=29 Identities=14% Similarity=0.021 Sum_probs=23.2
Q ss_pred ccchhhhhhhHHHHHHHHHHHHhhHHhCC
Q 029243 87 YTRKDVLLIGVGVTVIGYGLKSGLELFGV 115 (196)
Q Consensus 87 MtR~DVI~IGlgvfllGg~ly~gLq~~Gl 115 (196)
|.-.-++.|.+..+++|.-+|....++|=
T Consensus 4 me~A~~~~i~i~~lL~~~TgyaiYtaFGp 32 (46)
T PRK13183 4 MSPALSLAITILAILLALTGFGIYTAFGP 32 (46)
T ss_pred cchhHHHHHHHHHHHHHHhhheeeeccCC
Confidence 44456788999999999888988888883
No 174
>COG2177 FtsX Cell division protein [Cell division and chromosome partitioning]
Probab=23.08 E-value=91 Score=28.16 Aligned_cols=30 Identities=13% Similarity=0.493 Sum_probs=21.6
Q ss_pred HhCCCcccchhHHHHHHHhhhhhheeeeee
Q 029243 112 LFGVDPLQAGNVVEVIVVLGLTLGWVSTYI 141 (196)
Q Consensus 112 ~~GlD~~~AGiwsQ~lLVlGlvvgWv~SYl 141 (196)
.+|++....--.+=.++++|.+++|+++|.
T Consensus 260 ~~~~~~~~~l~l~~~~~~iG~~~~~l~s~~ 289 (297)
T COG2177 260 LLGLGLDEVLLLLGILLLIGVLIAWLASLR 289 (297)
T ss_pred hhcCCchHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356666666666677777888889988874
No 175
>PF04277 OAD_gamma: Oxaloacetate decarboxylase, gamma chain ; InterPro: IPR005899 This family comprises distantly related, low complexity, hydrophobic small subunits of several related sodium ion-pumping decarboxylases. These include oxaloacetate decarboxylase gamma subunit and methylmalonyl-CoA decarboxylase delta subunit [].; GO: 0008948 oxaloacetate decarboxylase activity, 0015081 sodium ion transmembrane transporter activity, 0071436 sodium ion export, 0016020 membrane
Probab=22.77 E-value=28 Score=24.48 Aligned_cols=17 Identities=6% Similarity=0.038 Sum_probs=12.0
Q ss_pred CCHHHHHHHHHHHHHHH
Q 029243 170 LTEAELEALIEQVEEEK 186 (196)
Q Consensus 170 LtpEEle~L~aEIE~Ek 186 (196)
.+||..+++-+=|.+.+
T Consensus 61 ~~~~~vAaI~AAi~~~~ 77 (79)
T PF04277_consen 61 DDPELVAAIAAAIAAYR 77 (79)
T ss_pred CChHHHHHHHHHHHHHH
Confidence 77788888777766543
No 176
>PF10666 Phage_Gp14: Phage protein Gp14; InterPro: IPR018923 This Listeria phage Gp14 protein family is of unknown function but is expressed from within a cluster of tail- and base plate genes [].
Probab=22.73 E-value=1.7e+02 Score=24.62 Aligned_cols=47 Identities=23% Similarity=0.322 Sum_probs=30.2
Q ss_pred eeeeeeeecCCc---cHHHHHHHHHH--HHHHHH----HhcCCHHHHHHHHHHHHH
Q 029243 138 STYIFRVSNKEM---TYAQQLRDYED--KVMQKR----LEGLTEAELEALIEQVEE 184 (196)
Q Consensus 138 ~SYlfRV~tk~M---TY~qQ~kdYE~--avLQKR----leeLtpEEle~L~aEIE~ 184 (196)
++=||||.-++= -|-+|-..|++ ..++|. +++|||||.-+|..++.+
T Consensus 21 g~elfKV~~~DE~~rk~~e~~~kykkk~e~l~k~~id~~~EmS~e~~~aLe~~~re 76 (140)
T PF10666_consen 21 GGELFKVSLDDEMRRKWIEADEKYKKKAEKLNKYNIDFDREMSSEEYRALEEQQRE 76 (140)
T ss_pred cceeeeeecChHHHHHHHHHHHHHHHHHHHhhhhcccchhhcCHHHHHHHHHHHHH
Confidence 456888877652 24455555552 223332 378999999999888754
No 177
>COG3771 Predicted membrane protein [Function unknown]
Probab=22.71 E-value=25 Score=27.93 Aligned_cols=40 Identities=20% Similarity=0.502 Sum_probs=29.2
Q ss_pred hhHHHHHHHhhhhhheeeeee--eeeecCCccHHHHHHHHHH
Q 029243 121 GNVVEVIVVLGLTLGWVSTYI--FRVSNKEMTYAQQLRDYED 160 (196)
Q Consensus 121 GiwsQ~lLVlGlvvgWv~SYl--fRV~tk~MTY~qQ~kdYE~ 160 (196)
....-.++.+|+++||+.+=+ +||--+.|.-..|.|+-|.
T Consensus 42 STLla~lF~~G~~lgwli~g~fy~k~~l~~~~l~rqiKr~~~ 83 (97)
T COG3771 42 STLLATLFAAGFALGWLICGLFYLKVRLSLMRLERQIKRLEN 83 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344557888999999997755 4666777777777777654
No 178
>PF12616 DUF3775: Protein of unknown function (DUF3775); InterPro: IPR022254 This domain family is found in bacteria, and is approximately 80 amino acids in length. There is a single completely conserved residue G that may be functionally important.
Probab=22.50 E-value=1.1e+02 Score=22.80 Aligned_cols=21 Identities=24% Similarity=0.423 Sum_probs=17.4
Q ss_pred HHHHHHHHhcCCHHHHHHHHH
Q 029243 160 DKVMQKRLEGLTEAELEALIE 180 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~a 180 (196)
...|.+=++.|+|+|+..|.+
T Consensus 3 ~~el~~~I~~l~~deqaeLvA 23 (75)
T PF12616_consen 3 EEELRSFIEDLNEDEQAELVA 23 (75)
T ss_pred HHHHHHHHHhCCHHHHHHHHH
Confidence 356788899999999998876
No 179
>PF07037 DUF1323: Putative transcription regulator (DUF1323); InterPro: IPR010749 This family consists of several hypothetical Enterobacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=22.49 E-value=60 Score=26.73 Aligned_cols=43 Identities=26% Similarity=0.297 Sum_probs=26.7
Q ss_pred eeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 143 RVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 143 RV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~E 185 (196)
|+.-....|.....+==...|-.=+++|||.|+++|-.=+-.|
T Consensus 66 ~~~E~~~~y~~~~~~~l~~~ii~~~~~Mt~~EQ~~L~~lL~RE 108 (122)
T PF07037_consen 66 RLAEPPAEYAVTWDSPLERQIIDTLEEMTPAEQEQLTSLLLRE 108 (122)
T ss_pred hccCCccccCCCCcchHHHHHHHHHHHCCHHHHHHHHHHHHHh
Confidence 4444444442222222245566778999999999998776655
No 180
>PF06374 NDUF_C2: NADH-ubiquinone oxidoreductase subunit b14.5b (NDUFC2); InterPro: IPR009423 NADH:ubiquinone oxidoreductase (complex I) (1.6.5.3 from EC) is a respiratory-chain enzyme that catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane (NADH + ubiquinone = NAD+ + ubiquinol) []. Complex I is a major source of reactive oxygen species (ROS) that are predominantly formed by electron transfer from FMNH(2). Complex I is found in bacteria, cyanobacteria (as a NADH-plastoquinone oxidoreductase), archaea [], mitochondira, and in the hydrogenosome, a mitochondria-derived organelle. In general, the bacterial complex consists of 14 different subunits, while the mitochondrial complex contains homologues to these subunits in addition to approximately 31 additional proteins []. Mitochondrial complex I, which is located in the inner mitochondrial membrane, is the largest multimeric respiratory enzyme in the mitochondria, consisting of more than 40 subunits, one FMN co-factor and eight FeS clusters []. The assembly of mitochondrial complex I is an intricate process that requires the cooperation of the nuclear and mitochondrial genomes [, ]. Mitochondrial complex I can cycle between active and deactive forms that can be distinguished by the reactivity towards divalent cations and thiol-reactive agents. All redox prosthetic groups reside in the peripheral arm of the L-shaped structure. The NADH oxidation domain harbouring the FMN cofactor is connected via a chain of iron-sulphur clusters to the ubiquinone reduction site that is located in a large pocket formed by the PSST and 49kDa subunits of complex I []. This family consists of several NADH-ubiquinone oxidoreductase subunit b14.5b proteins.; GO: 0008137 NADH dehydrogenase (ubiquinone) activity, 0006120 mitochondrial electron transport, NADH to ubiquinone, 0005743 mitochondrial inner membrane
Probab=22.48 E-value=2e+02 Score=23.28 Aligned_cols=23 Identities=22% Similarity=0.549 Sum_probs=17.2
Q ss_pred CCccc-chhHHHHHHHhhhhhheeeee
Q 029243 115 VDPLQ-AGNVVEVIVVLGLTLGWVSTY 140 (196)
Q Consensus 115 lD~~~-AGiwsQ~lLVlGlvvgWv~SY 140 (196)
=-+.- ||+--|++++. +||...|
T Consensus 47 rRP~~~sGihr~ll~~t---~g~~~Gy 70 (117)
T PF06374_consen 47 RRPPLKSGIHRQLLLAT---IGWFIGY 70 (117)
T ss_pred cCCchhhccHHHHHHHH---HHHHHHH
Confidence 34555 99999998876 5777776
No 181
>PF13623 SurA_N_2: SurA N-terminal domain
Probab=22.45 E-value=3.1e+02 Score=22.20 Aligned_cols=17 Identities=12% Similarity=0.421 Sum_probs=8.3
Q ss_pred eeecCCcc---HHHHHHHHH
Q 029243 143 RVSNKEMT---YAQQLRDYE 159 (196)
Q Consensus 143 RV~tk~MT---Y~qQ~kdYE 159 (196)
-|-.++-| |.++.+.|+
T Consensus 43 ~VnGe~Is~~ef~~~v~~~~ 62 (145)
T PF13623_consen 43 EVNGEKISYQEFQQRVEQAT 62 (145)
T ss_pred eECCEEcCHHHHHHHHHHHH
Confidence 45555666 344444444
No 182
>PRK09814 beta-1,6-galactofuranosyltransferase; Provisional
Probab=22.41 E-value=1.9e+02 Score=24.91 Aligned_cols=47 Identities=17% Similarity=0.289 Sum_probs=34.3
Q ss_pred cCCccHHHHHHHHH-------HHHHHHHHhcCCHHHHHHHHHHHHHHHhccccC
Q 029243 146 NKEMTYAQQLRDYE-------DKVMQKRLEGLTEAELEALIEQVEEEKRRLASG 192 (196)
Q Consensus 146 tk~MTY~qQ~kdYE-------~avLQKRleeLtpEEle~L~aEIE~Ek~~~~~~ 192 (196)
+...-..+..+++. .+.+.++++++++||.+++.+...+--+..+.|
T Consensus 266 ~~~~~~~~~V~~~~~G~~v~~~~el~~~l~~~~~~~~~~m~~n~~~~~~~~~~g 319 (333)
T PRK09814 266 WSKAAIADFIVENGLGFVVDSLEELPEIIDNITEEEYQEMVENVKKISKLLRNG 319 (333)
T ss_pred CCCccHHHHHHhCCceEEeCCHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHhcc
Confidence 34444556666665 367999999999999999988887766655555
No 183
>COG3923 PriC Primosomal replication protein N'' [DNA replication, recombination, and repair]
Probab=22.34 E-value=1.3e+02 Score=26.21 Aligned_cols=37 Identities=27% Similarity=0.328 Sum_probs=24.9
Q ss_pred HHHHHHH------HHHHHHHHhcCC-HHHHHHHHHHHHHHHhcc
Q 029243 153 QQLRDYE------DKVMQKRLEGLT-EAELEALIEQVEEEKRRL 189 (196)
Q Consensus 153 qQ~kdYE------~avLQKRleeLt-pEEle~L~aEIE~Ek~~~ 189 (196)
-|-+||| .+.+.-+++--| -.|+++|++||+.-+++.
T Consensus 113 ~qhqd~Errl~~m~~~r~l~l~q~s~~vEqq~lqqel~~~e~Rl 156 (175)
T COG3923 113 AQHQDYERRLLAMVQDRRLQLAQQSDLVEQQKLQQELEAYEQRL 156 (175)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 3667888 333444555555 679999999999866554
No 184
>TIGR02530 flg_new flagellar operon protein. Members of this family are found in a subset of bacterial flagellar operons, generally between genes designated flgD and flgE, in species as diverse as Bacillus halodurans and various other Firmicutes, Geobacter sulfurreducens, and Bdellovibrio bacteriovorus. The specific molecular function is unknown.
Probab=22.33 E-value=1.2e+02 Score=23.93 Aligned_cols=23 Identities=26% Similarity=0.494 Sum_probs=18.2
Q ss_pred HHHHhc----CCHHHHHHHHHHHHHHH
Q 029243 164 QKRLEG----LTEAELEALIEQVEEEK 186 (196)
Q Consensus 164 QKRlee----LtpEEle~L~aEIE~Ek 186 (196)
++||.+ |+++++++|.+-+++-+
T Consensus 22 ~~RL~~R~I~l~~~~~~~i~~av~~A~ 48 (96)
T TIGR02530 22 LERMRERNISINPDDWKKLLEAVEEAE 48 (96)
T ss_pred HHHHHHcCCCCCHHHHHHHHHHHHHHH
Confidence 455555 99999999999988744
No 185
>PF07835 COX4_pro_2: Bacterial aa3 type cytochrome c oxidase subunit IV; InterPro: IPR012422 Bacterial cytochrome c oxidase is found bound to the to the cell membrane, where it is involved in the generation of the transmembrane proton electrochemical gradient. It is composed of four subunits. Subunit IV consists of one transmembrane helix that does not interact directly with the other subunits, but maintains its position by indirect contacts via phospholipid molecules found in the structure. The function of subunit IV is as yet unknown []. ; PDB: 1QLE_D 1M57_J 1M56_J.
Probab=22.31 E-value=62 Score=21.89 Aligned_cols=14 Identities=29% Similarity=0.513 Sum_probs=9.9
Q ss_pred cCCccHHHHHHHHH
Q 029243 146 NKEMTYAQQLRDYE 159 (196)
Q Consensus 146 tk~MTY~qQ~kdYE 159 (196)
-+.|.|.+|.+.|+
T Consensus 6 ~g~MD~~~he~Ty~ 19 (44)
T PF07835_consen 6 AGEMDIAEHEKTYD 19 (44)
T ss_dssp TT-S--HHHHHHHH
T ss_pred CCCCCHHHHHHHHH
Confidence 47899999999998
No 186
>PRK10429 melibiose:sodium symporter; Provisional
Probab=22.22 E-value=2.9e+02 Score=24.70 Aligned_cols=9 Identities=44% Similarity=0.560 Sum_probs=6.4
Q ss_pred hhHHhCCCc
Q 029243 109 GLELFGVDP 117 (196)
Q Consensus 109 gLq~~GlD~ 117 (196)
.+.+.|+|+
T Consensus 390 ~l~~~Gy~~ 398 (473)
T PRK10429 390 VLGLIGYVP 398 (473)
T ss_pred HHHHcCCCC
Confidence 367888875
No 187
>COG4653 Predicted phage phi-C31 gp36 major capsid-like protein [General function prediction only]
Probab=22.04 E-value=89 Score=30.35 Aligned_cols=26 Identities=23% Similarity=0.236 Sum_probs=19.9
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHH
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEEK 186 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~Ek 186 (196)
.+++++|||++ +||..++++++|.++
T Consensus 30 ~aEqr~r~e~~-eeen~~~l~~~er~~ 55 (422)
T COG4653 30 RAEQRRRGEEA-EEENRRLLADIERVG 55 (422)
T ss_pred HHHHHHHHHHH-HHHHHHHhhhhhhhc
Confidence 67788888887 567888888888744
No 188
>cd04748 Commd COMM_Domain, a family of domains found at the C-terminus of HCarG, the copper metabolism gene MURR1 product, and related proteins. Presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB. Murr1/Commd1 is a protein involved in copper homeostasis, which has also been identified as a regulator of the human delta epithelial sodium channel. HCaRG, a nuclear protein that might be involved in cell proliferation, is negatively regulated by extracellular calcium concentration, and its basal mRNA levels are higher in hypertensive animals.
Probab=22.03 E-value=93 Score=22.39 Aligned_cols=19 Identities=21% Similarity=0.457 Sum_probs=16.4
Q ss_pred cCCHHHHHHHHHHHHHHHh
Q 029243 169 GLTEAELEALIEQVEEEKR 187 (196)
Q Consensus 169 eLtpEEle~L~aEIE~Ek~ 187 (196)
||+++|++.+..++++-++
T Consensus 67 E~~~~~l~~l~~~L~~a~~ 85 (87)
T cd04748 67 EMTKEELQNFLNKLKKIKA 85 (87)
T ss_pred EeCHHHHHHHHHHHHHHHh
Confidence 8999999999999987543
No 189
>KOG3882 consensus Tetraspanin family integral membrane protein [General function prediction only]
Probab=21.95 E-value=1.4e+02 Score=24.58 Aligned_cols=23 Identities=17% Similarity=0.174 Sum_probs=16.8
Q ss_pred ccchhhhhhhHHHHHHHHHHHHh
Q 029243 87 YTRKDVLLIGVGVTVIGYGLKSG 109 (196)
Q Consensus 87 MtR~DVI~IGlgvfllGg~ly~g 109 (196)
.--.-.|.+|..++++|.++..|
T Consensus 51 ~~~~ili~~G~v~~~v~flGc~G 73 (237)
T KOG3882|consen 51 VPAYILIAVGGVVFLVGFLGCCG 73 (237)
T ss_pred cchhhhhhhhHHHHHHHHhhhhh
Confidence 34456778888888888887665
No 190
>PRK01381 Trp operon repressor; Provisional
Probab=21.90 E-value=1.3e+02 Score=23.78 Aligned_cols=25 Identities=36% Similarity=0.271 Sum_probs=20.3
Q ss_pred HHHHHHHHHhc-CCHHHHHHHHHHHH
Q 029243 159 EDKVMQKRLEG-LTEAELEALIEQVE 183 (196)
Q Consensus 159 E~avLQKRlee-LtpEEle~L~aEIE 183 (196)
+...++.-|+. |||+|+++|..-+.
T Consensus 21 ~~~~~~~~l~~llTp~Er~al~~R~~ 46 (99)
T PRK01381 21 EEDLHLPLLTLLLTPDEREALGTRVR 46 (99)
T ss_pred cHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 36677788888 99999999987665
No 191
>PRK02944 OxaA-like protein precursor; Validated
Probab=21.86 E-value=3.6e+02 Score=23.57 Aligned_cols=44 Identities=9% Similarity=0.125 Sum_probs=28.5
Q ss_pred eeeeecCCccHHHHHHHHH-------HHHHHHHHhcCCHHHHHHHHHHHHH
Q 029243 141 IFRVSNKEMTYAQQLRDYE-------DKVMQKRLEGLTEAELEALIEQVEE 184 (196)
Q Consensus 141 lfRV~tk~MTY~qQ~kdYE-------~avLQKRleeLtpEEle~L~aEIE~ 184 (196)
+.|++.=-.++.+|+..-+ =+.+|+|..+=++++.+++++|.-+
T Consensus 69 ivR~illPl~~~q~~~~~km~~iqPe~~~iq~kyk~~~~~~~~k~~~e~~~ 119 (255)
T PRK02944 69 LIRLLILPLMIKQTKSTKAMQALQPEMQKLKEKYSSKDQATQQKLQQEMMQ 119 (255)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHccHHHHHHHHHHcCCCHHHHHHHHHHHHH
Confidence 4576666666777666655 3555666655567778888877654
No 192
>MTH00046 CYTB cytochrome b; Validated
Probab=21.82 E-value=1.2e+02 Score=28.39 Aligned_cols=45 Identities=29% Similarity=0.273 Sum_probs=30.7
Q ss_pred ccccccCCCCCCCCCCCCCCCCCcc--ccchhhhhhhHHHHHHHHHH
Q 029243 62 IIPRSSGSSNSNGSDSSDQTETPFG--YTRKDVLLIGVGVTVIGYGL 106 (196)
Q Consensus 62 ~~~r~~~~~~~~~~~~~~~~~~pfg--MtR~DVI~IGlgvfllGg~l 106 (196)
...|..+|+|..+.+++..+|.||- ++-+|.+.+.+.++++....
T Consensus 189 ~~lh~~GssNPlg~~~~~~dki~F~Py~~~KDl~~~~~~l~~~~~~~ 235 (355)
T MTH00046 189 FYLHKSGSSNPLFSSNGYSDVVYFHSYFTVKDLFLFMVVCSLLVFWL 235 (355)
T ss_pred hheeecccCCcccccCCcccccCCCchHHHHHHHHHHHHHHHHHHHH
Confidence 3457788888877765545678885 48899877766665555443
No 193
>CHL00019 atpF ATP synthase CF0 B subunit
Probab=21.80 E-value=1.3e+02 Score=24.53 Aligned_cols=41 Identities=12% Similarity=0.062 Sum_probs=25.1
Q ss_pred hCCCcccchhH-HHHHHHhhhhhheeeeeeeeeecCCccHHHHHHHHHH
Q 029243 113 FGVDPLQAGNV-VEVIVVLGLTLGWVSTYIFRVSNKEMTYAQQLRDYED 160 (196)
Q Consensus 113 ~GlD~~~AGiw-sQ~lLVlGlvvgWv~SYlfRV~tk~MTY~qQ~kdYE~ 160 (196)
+|++. .++ .|++-++ ++.|++-|++. .-=+.++++|+++=.
T Consensus 20 f~~n~---~~~~~~~Infl--ill~lL~~fl~--kPI~~~l~~R~~~I~ 61 (184)
T CHL00019 20 FGFNT---DILETNLINLS--VVLGVLIYFGK--GVLSDLLDNRKQTIL 61 (184)
T ss_pred cCCcc---hHHHHHHHHHH--HHHHHHHHHhH--hHHHHHHHHHHHHHH
Confidence 46654 454 5666665 47888877543 223457788887753
No 194
>PF10762 DUF2583: Protein of unknown function (DUF2583) ; InterPro: IPR019698 Some members in this entry are annotated as YchH however currently no function is known.
Probab=21.69 E-value=1.1e+02 Score=24.19 Aligned_cols=37 Identities=22% Similarity=0.197 Sum_probs=27.6
Q ss_pred HHHHHHHHhhH---HhCCCcccchhHHHHHHHhhhhhheeee
Q 029243 101 VIGYGLKSGLE---LFGVDPLQAGNVVEVIVVLGLTLGWVST 139 (196)
Q Consensus 101 llGg~ly~gLq---~~GlD~~~AGiwsQ~lLVlGlvvgWv~S 139 (196)
.+|+++|..+. .+|+..+-|....=.++|.. +.|+..
T Consensus 21 Mv~gigysi~~~~~~L~Lp~~~~~gal~~IFiGA--llWL~G 60 (89)
T PF10762_consen 21 MVGGIGYSILSQIPQLGLPQFLAHGALFSIFIGA--LLWLVG 60 (89)
T ss_pred HHHhHHHHHHHhcccCCCcHHHHhhHHHHHHHHH--HHHHhc
Confidence 34557888844 78999999988888888873 678754
No 195
>cd06660 Aldo_ket_red Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
Probab=21.69 E-value=92 Score=25.61 Aligned_cols=44 Identities=18% Similarity=0.252 Sum_probs=32.3
Q ss_pred hhhheeeee--eeeeecCCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Q 029243 132 LTLGWVSTY--IFRVSNKEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIE 180 (196)
Q Consensus 132 lvvgWv~SY--lfRV~tk~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~a 180 (196)
+++.|+.+. +.-|+.+ |+-.+|+++.=++. .-.||+||++.|.+
T Consensus 239 ~al~~~l~~p~~~~~i~g-~~~~~~l~~n~~~~----~~~L~~~~~~~l~~ 284 (285)
T cd06660 239 VALRWLLQQPGVTSVIPG-ASSPERLEENLAAL----DFELSDEDLAALDA 284 (285)
T ss_pred HHHHHHhcCCCCeEEEeC-CCCHHHHHHHHhhc----cCCCCHHHHHHHhh
Confidence 346777777 6666665 77788888766554 46899999998754
No 196
>KOG1937 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.55 E-value=80 Score=31.35 Aligned_cols=22 Identities=27% Similarity=0.374 Sum_probs=17.4
Q ss_pred cCCccHHHHHHHHHHHHHHHHH
Q 029243 146 NKEMTYAQQLRDYEDKVMQKRL 167 (196)
Q Consensus 146 tk~MTY~qQ~kdYE~avLQKRl 167 (196)
.+-|+-.|||.||+..-++|++
T Consensus 307 ~~~~~ltqqwed~R~pll~kkl 328 (521)
T KOG1937|consen 307 KQMEELTQQWEDTRQPLLQKKL 328 (521)
T ss_pred HHHHHHHHHHHHHhhhHHHHHH
Confidence 3457999999999977777664
No 197
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=21.55 E-value=88 Score=27.18 Aligned_cols=50 Identities=12% Similarity=0.179 Sum_probs=27.2
Q ss_pred hhhhhhhHHHHHHHHHHHHhhHHhCCCcc-cchhH----HHHHHHhhhhhheeee
Q 029243 90 KDVLLIGVGVTVIGYGLKSGLELFGVDPL-QAGNV----VEVIVVLGLTLGWVST 139 (196)
Q Consensus 90 ~DVI~IGlgvfllGg~ly~gLq~~GlD~~-~AGiw----sQ~lLVlGlvvgWv~S 139 (196)
+.++..=+.-|++||+.+......+-+.. ..|+. +-..+++|+.++|...
T Consensus 86 k~~~~fy~~sf~~gG~~~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 140 (293)
T PF03419_consen 86 KALLIFYLVSFLLGGIMFALYYFLRSGGFIQNGIFYFNISWLFLLIGFIIAYLLL 140 (293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCcchhccceeeccchHHHHHHHHHHHHHHH
Confidence 34444455566788788777666666555 33332 2344444445565433
No 198
>PF13148 DUF3987: Protein of unknown function (DUF3987)
Probab=21.45 E-value=1.3e+02 Score=25.96 Aligned_cols=12 Identities=33% Similarity=0.900 Sum_probs=6.6
Q ss_pred HHHHHHHHHHHH
Q 029243 151 YAQQLRDYEDKV 162 (196)
Q Consensus 151 Y~qQ~kdYE~av 162 (196)
|.+|+++|+.+.
T Consensus 74 ~~~~~~~y~~~~ 85 (378)
T PF13148_consen 74 YEEELEEYEAEL 85 (378)
T ss_pred HHHHHHHHHHHH
Confidence 555666666444
No 199
>PF02038 ATP1G1_PLM_MAT8: ATP1G1/PLM/MAT8 family; InterPro: IPR000272 The FXYD protein family contains at least seven members in mammals []. Two other family members that are not obvious orthologs of any identified mammalian FXYD protein exist in zebrafish. All these proteins share a signature sequence of six conserved amino acids comprising the FXYD motif in the NH2-terminus, and two glycines and one serine residue in the transmembrane domain. FXYD proteins are widely distributed in mammalian tissues with prominent expression in tissues that perform fluid and solute transport or that are electrically excitable. Initial functional characterisation suggested that FXYD proteins act as channels or as modulators of ion channels however studies have revealed that most FXYD proteins have another specific function and act as tissue-specific regulatory subunits of the Na,K-ATPase. Each of these auxiliary subunits produces a distinct functional effect on the transport characteristics of the Na,K-ATPase that is adjusted to the specific functional demands of the tissue in which the FXYD protein is expressed. FXYD proteins appear to preferentially associate with Na,K-ATPase alpha1-beta isozymes, and affect their function in a way that render them operationally complementary or supplementary to coexisting isozymes.; GO: 0005216 ion channel activity, 0006811 ion transport, 0016020 membrane; PDB: 2JO1_A 2JP3_A 2ZXE_G 3A3Y_G 3N23_E 3B8E_H 3KDP_G 3N2F_E.
Probab=21.38 E-value=1.2e+02 Score=21.59 Aligned_cols=26 Identities=23% Similarity=0.410 Sum_probs=16.4
Q ss_pred CCCCcccc----chhhhhhhHHHHHHHHHH
Q 029243 81 TETPFGYT----RKDVLLIGVGVTVIGYGL 106 (196)
Q Consensus 81 ~~~pfgMt----R~DVI~IGlgvfllGg~l 106 (196)
|+-||-|. |+.=+.++..+|++|.++
T Consensus 3 e~~pF~YDy~tLrigGLi~A~vlfi~Gi~i 32 (50)
T PF02038_consen 3 EDDPFYYDYETLRIGGLIFAGVLFILGILI 32 (50)
T ss_dssp CCSGGGGCHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cCCCCccchhHhhccchHHHHHHHHHHHHH
Confidence 45788884 555566666666666443
No 200
>PF08920 SF3b1: Splicing factor 3B subunit 1; InterPro: IPR015016 This group of proteins consists of several eukaryotic splicing factor 3B subunit 1 proteins, which associate with p14 through a C terminus beta-strand that interacts with beta-3 of the p14 RNA recognition motif (RRM) beta-sheet, which is in turn connected to an alpha-helix by a loop that makes extensive contacts with both the shorter C-terminal helix and RRM of p14. This subunit is required for 'A' splicing complex assembly (formed by the stable binding of U2 snRNP to the branchpoint sequence in pre-mRNA) and 'E' splicing complex assembly []. ; PDB: 2FHO_A 3LQV_P 2PEH_D 2F9J_P 2F9D_Q.
Probab=21.38 E-value=62 Score=26.75 Aligned_cols=27 Identities=30% Similarity=0.524 Sum_probs=20.3
Q ss_pred HHHHHHH-HHHHHHHHhcCCHHHHHHHH
Q 029243 153 QQLRDYE-DKVMQKRLEGLTEAELEALI 179 (196)
Q Consensus 153 qQ~kdYE-~avLQKRleeLtpEEle~L~ 179 (196)
.|...|+ +.+|.+|-.=||+|||.+|+
T Consensus 69 ~~~~~~~~~~eid~RNrpLTDEELD~mL 96 (144)
T PF08920_consen 69 EQYQAMRWEKEIDERNRPLTDEELDAML 96 (144)
T ss_dssp HHHHHHHHHHHHHHCTS-S-HHHHHHTS
T ss_pred hhhhhhccccchhhccCcCCHHHHHHhC
Confidence 4455555 77899999999999999998
No 201
>PF08784 RPA_C: Replication protein A C terminal; InterPro: IPR014892 This protein corresponds to the C-terminal of the single stranded DNA binding protein RPA (replication protein A). RPA is involved in many DNA metabolic pathways including DNA replication, DNA repair, recombination, cell cycle and DNA damage checkpoints. ; PDB: 1QUQ_C 2PQA_C 3KDF_B 2Z6K_B 2PI2_B 1L1O_E 1DPU_A 1Z1D_A.
Probab=21.38 E-value=1.1e+02 Score=22.21 Aligned_cols=25 Identities=20% Similarity=0.438 Sum_probs=20.0
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHH
Q 029243 160 DKVMQKRLEGLTEAELEALIEQVEEE 185 (196)
Q Consensus 160 ~avLQKRleeLtpEEle~L~aEIE~E 185 (196)
-..|.++| .|+++|+.+-.+++..|
T Consensus 68 v~~I~~~l-~~~~~~v~~al~~L~~e 92 (102)
T PF08784_consen 68 VDEIAQQL-GMSENEVRKALDFLSNE 92 (102)
T ss_dssp HHHHHHHS-TS-HHHHHHHHHHHHHT
T ss_pred HHHHHHHh-CcCHHHHHHHHHHHHhC
Confidence 46778888 99999999999988765
No 202
>PRK04156 gltX glutamyl-tRNA synthetase; Provisional
Probab=21.15 E-value=1.1e+02 Score=30.18 Aligned_cols=21 Identities=14% Similarity=0.251 Sum_probs=17.4
Q ss_pred HHHHHHhcCCHHHHHHHHHHH
Q 029243 162 VMQKRLEGLTEAELEALIEQV 182 (196)
Q Consensus 162 vLQKRleeLtpEEle~L~aEI 182 (196)
.+=++.++||+||++++++|+
T Consensus 53 ~~v~~vn~ms~ee~~~~l~~~ 73 (567)
T PRK04156 53 EVVEEVNSLSLEEQRERLEEL 73 (567)
T ss_pred HHHHHHhcCCHHHHHHHHHHh
Confidence 344789999999999999883
No 203
>PF07271 Cytadhesin_P30: Cytadhesin P30/P32; InterPro: IPR009896 This family consists of several Mycoplasma species specific Cytadhesin P32 and P30 proteins. P30 has been found to be membrane associated and localised on the tip organelle. It is thought that it is important in cytadherence and virulence [].; GO: 0007157 heterophilic cell-cell adhesion, 0009405 pathogenesis, 0016021 integral to membrane
Probab=21.15 E-value=1.4e+02 Score=27.64 Aligned_cols=8 Identities=50% Similarity=1.227 Sum_probs=3.8
Q ss_pred HHhhhhhh
Q 029243 128 VVLGLTLG 135 (196)
Q Consensus 128 LVlGlvvg 135 (196)
|+||+.||
T Consensus 84 liLgl~ig 91 (279)
T PF07271_consen 84 LILGLAIG 91 (279)
T ss_pred HHHHHhhc
Confidence 44445444
No 204
>PHA00649 hypothetical protein
Probab=21.06 E-value=1.1e+02 Score=23.67 Aligned_cols=23 Identities=17% Similarity=0.316 Sum_probs=19.1
Q ss_pred hhhhhhhHHHHHHHHHHHHhhHH
Q 029243 90 KDVLLIGVGVTVIGYGLKSGLEL 112 (196)
Q Consensus 90 ~DVI~IGlgvfllGg~ly~gLq~ 112 (196)
+=.-++||.+..+|..+|.|+++
T Consensus 56 G~L~~VGL~~~~iGAA~W~G~~s 78 (83)
T PHA00649 56 GFLALVGLVAVALGAAVWAGITS 78 (83)
T ss_pred chHHHHHHHHHHHHHHHHHHHHH
Confidence 33567899999999999999883
No 205
>PF14164 YqzH: YqzH-like protein
Probab=21.00 E-value=1.1e+02 Score=22.64 Aligned_cols=23 Identities=26% Similarity=0.387 Sum_probs=19.2
Q ss_pred hcCCHHHHHHHHHHHHHHHhccc
Q 029243 168 EGLTEAELEALIEQVEEEKRRLA 190 (196)
Q Consensus 168 eeLtpEEle~L~aEIE~Ek~~~~ 190 (196)
..||++|++.|-+.|.+.+.+.+
T Consensus 24 ~pls~~E~~~L~~~i~~~~~~~~ 46 (64)
T PF14164_consen 24 MPLSDEEWEELCKHIQERKNEEP 46 (64)
T ss_pred CCCCHHHHHHHHHHHHHHHhcCC
Confidence 46899999999999988776654
No 206
>PRK05732 2-octaprenyl-6-methoxyphenyl hydroxylase; Validated
Probab=20.92 E-value=40 Score=28.74 Aligned_cols=17 Identities=29% Similarity=0.556 Sum_probs=13.9
Q ss_pred ccchhhhhhhHHHHHHH
Q 029243 87 YTRKDVLLIGVGVTVIG 103 (196)
Q Consensus 87 MtR~DVI~IGlgvfllG 103 (196)
|.+.||+.||.|+-.+-
T Consensus 1 m~~~dv~IvGaG~aGl~ 17 (395)
T PRK05732 1 MSRMDVIIVGGGMAGAT 17 (395)
T ss_pred CCcCCEEEECcCHHHHH
Confidence 77889999999985544
No 207
>PRK08455 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=20.51 E-value=96 Score=26.02 Aligned_cols=23 Identities=22% Similarity=0.174 Sum_probs=12.3
Q ss_pred hhhhhhHHHHHHHHHHHHhhHHh
Q 029243 91 DVLLIGVGVTVIGYGLKSGLELF 113 (196)
Q Consensus 91 DVI~IGlgvfllGg~ly~gLq~~ 113 (196)
-.|+||+.++++++++.+++-++
T Consensus 20 ~~iIi~~~llll~~~G~~~~~~~ 42 (182)
T PRK08455 20 LIIIIGVVVLLLLIVGVIAMLLM 42 (182)
T ss_pred EEehHHHHHHHHHHHHHHHHHHh
Confidence 33566666666655554443333
No 208
>PF13349 DUF4097: Domain of unknown function (DUF4097)
Probab=20.43 E-value=94 Score=23.48 Aligned_cols=16 Identities=44% Similarity=0.646 Sum_probs=11.4
Q ss_pred hhhhhhHHHHHHHHHH
Q 029243 91 DVLLIGVGVTVIGYGL 106 (196)
Q Consensus 91 DVI~IGlgvfllGg~l 106 (196)
+.+.||+.++++|+++
T Consensus 2 k~~~i~~~l~viG~il 17 (166)
T PF13349_consen 2 KLLIIGLILIVIGIIL 17 (166)
T ss_pred eEehHHHHHHHHHHHH
Confidence 4567888888888654
No 209
>PRK10692 hypothetical protein; Provisional
Probab=20.29 E-value=99 Score=24.50 Aligned_cols=36 Identities=19% Similarity=0.080 Sum_probs=26.3
Q ss_pred HHHHHHHhhH---HhCCCcccchhHHHHHHHhhhhhheeee
Q 029243 102 IGYGLKSGLE---LFGVDPLQAGNVVEVIVVLGLTLGWVST 139 (196)
Q Consensus 102 lGg~ly~gLq---~~GlD~~~AGiwsQ~lLVlGlvvgWv~S 139 (196)
+|+++|..+. .+|+..+-|....=.+++.. +.|+..
T Consensus 22 v~gigysi~~~i~~L~Lp~~~~~gal~~IFiGA--llWL~G 60 (92)
T PRK10692 22 VVGVGYSILNQLPQLNLPQFFAHGALLSIFVGA--LLWLAG 60 (92)
T ss_pred HHHHHHHHHHhcccCCchHHHHhhHHHHHHHHH--HHHHhc
Confidence 4457787744 78888888888887777773 678754
No 210
>PRK06475 salicylate hydroxylase; Provisional
Probab=20.29 E-value=45 Score=29.21 Aligned_cols=27 Identities=33% Similarity=0.402 Sum_probs=18.9
Q ss_pred cchhhhhhhHHHHHHHHHHHHhhHHhCCC
Q 029243 88 TRKDVLLIGVGVTVIGYGLKSGLELFGVD 116 (196)
Q Consensus 88 tR~DVI~IGlgvfllGg~ly~gLq~~GlD 116 (196)
+|++|+.||-|+ +|..+...|...|++
T Consensus 1 ~~~~V~IvGgGi--aGl~~A~~L~~~G~~ 27 (400)
T PRK06475 1 TRGSPLIAGAGV--AGLSAALELAARGWA 27 (400)
T ss_pred CCCcEEEECCCH--HHHHHHHHHHhCCCc
Confidence 468999999997 454555556666654
No 211
>PRK03598 putative efflux pump membrane fusion protein; Provisional
Probab=20.20 E-value=71 Score=27.60 Aligned_cols=26 Identities=19% Similarity=-0.061 Sum_probs=12.4
Q ss_pred ccchhhhhhhHHHH-HHHHHHHHhhHH
Q 029243 87 YTRKDVLLIGVGVT-VIGYGLKSGLEL 112 (196)
Q Consensus 87 MtR~DVI~IGlgvf-llGg~ly~gLq~ 112 (196)
|.+.+++.+|..+| ++=+++|..|.-
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 27 (331)
T PRK03598 1 MKKKVVIGLAVVVLAAAVAGGWWWYQS 27 (331)
T ss_pred CCceEEEEhHHHHHHHHHHHheeEeee
Confidence 45566665554443 333344444443
No 212
>PRK08317 hypothetical protein; Provisional
Probab=20.10 E-value=4.2e+02 Score=20.47 Aligned_cols=71 Identities=20% Similarity=0.230 Sum_probs=40.4
Q ss_pred CcccchhHHHHHHHhhhhhheeeeeeeeeec-CCccHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhc
Q 029243 116 DPLQAGNVVEVIVVLGLTLGWVSTYIFRVSN-KEMTYAQQLRDYEDKVMQKRLEGLTEAELEALIEQVEEEKRR 188 (196)
Q Consensus 116 D~~~AGiwsQ~lLVlGlvvgWv~SYlfRV~t-k~MTY~qQ~kdYE~avLQKRleeLtpEEle~L~aEIE~Ek~~ 188 (196)
++....-|.+.+--.|+...++-.+-+-... ....+...+.++-....+ -..++++|++++.++++++...
T Consensus 153 ~~~~~~~~~~~l~~aGf~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~ 224 (241)
T PRK08317 153 DPWLGRRLPGLFREAGLTDIEVEPYTLIETDLKEADKGFGLIRAARRAVE--AGGISADEADAWLADLAQLARA 224 (241)
T ss_pred CCcHHHHHHHHHHHcCCCceeEEEEEEeccCcchhhHHHHHHHHHHHHHH--cCCCCHHHHHHHHHHHHHHHhc
Confidence 3333445666666667666666555332211 111233333333333332 4678999999999999986643
Done!