Query psy7548
Match_columns 134
No_of_seqs 194 out of 2201
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 21:11:33 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7548.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7548hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0484 DnaJ DnaJ-class molecu 100.0 6.4E-31 1.4E-35 207.6 8.5 127 1-127 4-150 (371)
2 KOG0712|consensus 99.9 6E-28 1.3E-32 188.6 9.4 124 2-127 5-135 (337)
3 KOG0713|consensus 99.9 1.4E-25 3.1E-30 174.0 7.9 69 1-69 16-87 (336)
4 PTZ00037 DnaJ_C chaperone prot 99.9 1.6E-25 3.5E-30 181.1 7.5 123 1-126 28-157 (421)
5 PRK14296 chaperone protein Dna 99.9 4.9E-25 1.1E-29 176.1 8.0 67 1-67 4-72 (372)
6 PRK14288 chaperone protein Dna 99.9 9.6E-25 2.1E-29 174.3 7.6 67 1-67 3-72 (369)
7 PRK14286 chaperone protein Dna 99.9 3.2E-24 7E-29 171.4 7.7 67 1-67 4-73 (372)
8 PRK14287 chaperone protein Dna 99.9 3.3E-24 7.1E-29 171.3 7.4 126 1-126 4-145 (371)
9 PRK14298 chaperone protein Dna 99.9 3.1E-24 6.7E-29 171.8 7.2 126 1-126 5-148 (377)
10 PRK14276 chaperone protein Dna 99.9 9.9E-24 2.1E-28 169.0 6.6 67 1-67 4-72 (380)
11 PRK14285 chaperone protein Dna 99.9 1.4E-23 3E-28 167.4 6.9 67 1-67 3-72 (365)
12 PRK14280 chaperone protein Dna 99.9 2.4E-23 5.2E-28 166.6 7.6 126 1-126 4-150 (376)
13 PRK14278 chaperone protein Dna 99.9 2.3E-23 5E-28 166.8 7.1 126 1-126 3-146 (378)
14 PRK14282 chaperone protein Dna 99.9 3.7E-23 8E-28 165.2 7.7 67 1-67 4-74 (369)
15 PRK14284 chaperone protein Dna 99.9 7.5E-23 1.6E-27 164.5 8.6 67 1-67 1-70 (391)
16 PRK14277 chaperone protein Dna 99.9 7.5E-23 1.6E-27 164.3 7.6 67 1-67 5-74 (386)
17 PRK14297 chaperone protein Dna 99.9 4.4E-23 9.5E-28 165.3 6.2 67 1-67 4-73 (380)
18 PRK14283 chaperone protein Dna 99.9 1.4E-22 3E-27 162.4 8.8 67 1-67 5-73 (378)
19 PRK10767 chaperone protein Dna 99.9 1E-22 2.3E-27 162.7 8.0 67 1-67 4-73 (371)
20 PRK14301 chaperone protein Dna 99.9 5.2E-23 1.1E-27 164.6 6.3 67 1-67 4-73 (373)
21 PRK14294 chaperone protein Dna 99.9 9.5E-23 2.1E-27 162.7 7.7 67 1-67 4-73 (366)
22 PRK14295 chaperone protein Dna 99.9 1.7E-22 3.7E-27 162.3 8.3 62 1-62 9-73 (389)
23 PRK14291 chaperone protein Dna 99.9 2.2E-22 4.8E-27 161.4 6.4 67 1-67 3-71 (382)
24 TIGR02349 DnaJ_bact chaperone 99.9 3.2E-22 6.9E-27 159.0 7.0 66 2-67 1-68 (354)
25 PRK14279 chaperone protein Dna 99.9 5.6E-22 1.2E-26 159.5 7.1 65 1-65 9-76 (392)
26 PRK14292 chaperone protein Dna 99.9 7.9E-22 1.7E-26 157.6 6.3 126 1-126 2-146 (371)
27 PRK14300 chaperone protein Dna 99.8 1.2E-21 2.5E-26 156.7 6.3 67 1-67 3-71 (372)
28 PRK14289 chaperone protein Dna 99.8 2.9E-21 6.3E-26 155.1 8.4 67 1-67 5-74 (386)
29 PRK14290 chaperone protein Dna 99.8 1.7E-21 3.7E-26 155.4 6.5 67 1-67 3-73 (365)
30 PRK14293 chaperone protein Dna 99.8 3.1E-21 6.6E-26 154.4 7.4 67 1-67 3-71 (374)
31 KOG0716|consensus 99.8 1.9E-21 4.2E-26 147.1 5.4 70 1-70 31-103 (279)
32 KOG0717|consensus 99.8 1.8E-21 4E-26 155.9 4.8 71 1-71 8-82 (508)
33 PRK14299 chaperone protein Dna 99.8 5.5E-21 1.2E-25 148.5 6.9 66 1-66 4-71 (291)
34 KOG0715|consensus 99.8 1.3E-20 2.7E-25 146.1 8.5 66 2-67 44-111 (288)
35 PF00226 DnaJ: DnaJ domain; I 99.8 1.3E-20 2.9E-25 115.3 6.2 60 2-61 1-64 (64)
36 PRK14281 chaperone protein Dna 99.8 2.3E-20 5.1E-25 150.3 6.8 67 1-67 3-72 (397)
37 KOG0718|consensus 99.8 1.1E-19 2.4E-24 146.0 6.2 69 1-69 9-83 (546)
38 PRK10266 curved DNA-binding pr 99.8 1.6E-19 3.5E-24 141.1 6.9 65 1-65 4-70 (306)
39 KOG0691|consensus 99.8 3.2E-19 6.9E-24 138.1 7.2 71 1-71 5-78 (296)
40 PTZ00341 Ring-infected erythro 99.8 4.5E-19 9.7E-24 152.6 6.9 68 1-68 573-642 (1136)
41 KOG0719|consensus 99.8 1.1E-18 2.4E-23 129.8 5.7 67 1-67 14-85 (264)
42 smart00271 DnaJ DnaJ molecular 99.7 2.9E-18 6.3E-23 103.4 5.4 55 1-55 1-59 (60)
43 cd06257 DnaJ DnaJ domain or J- 99.7 7.9E-18 1.7E-22 99.7 6.1 52 2-53 1-55 (55)
44 TIGR03835 termin_org_DnaJ term 99.7 1.1E-17 2.4E-22 141.4 6.4 68 1-68 2-71 (871)
45 KOG0721|consensus 99.7 1.7E-17 3.7E-22 122.3 6.2 68 2-69 100-170 (230)
46 KOG0720|consensus 99.7 3.9E-18 8.4E-23 136.9 2.4 98 1-100 235-335 (490)
47 PRK05014 hscB co-chaperone Hsc 99.7 1.9E-17 4.2E-22 119.8 5.0 67 1-67 1-77 (171)
48 PHA03102 Small T antigen; Revi 99.7 1.1E-17 2.3E-22 118.7 3.4 67 2-69 6-74 (153)
49 COG2214 CbpA DnaJ-class molecu 99.7 5.2E-17 1.1E-21 118.6 6.7 64 1-64 6-73 (237)
50 KOG2923|consensus 99.7 2.6E-17 5.6E-22 98.8 3.8 56 74-133 3-58 (67)
51 PF05207 zf-CSL: CSL zinc fing 99.7 4.2E-17 9.2E-22 97.2 3.2 53 77-133 2-54 (55)
52 PRK01356 hscB co-chaperone Hsc 99.7 1.9E-16 4.2E-21 114.0 7.1 66 1-66 2-75 (166)
53 PRK00294 hscB co-chaperone Hsc 99.7 1.6E-16 3.5E-21 115.1 5.8 86 1-86 4-101 (173)
54 KOG0624|consensus 99.6 1.5E-16 3.2E-21 125.2 5.0 67 1-67 394-466 (504)
55 PRK03578 hscB co-chaperone Hsc 99.6 1.8E-16 3.9E-21 115.1 5.0 68 1-68 6-83 (176)
56 KOG0550|consensus 99.6 2.3E-15 5.1E-20 120.1 5.5 83 1-85 373-459 (486)
57 COG5216 Uncharacterized conser 99.5 9.4E-15 2E-19 86.6 4.8 57 73-133 2-58 (67)
58 KOG0722|consensus 99.5 6.9E-15 1.5E-19 111.1 3.5 68 1-68 33-102 (329)
59 PTZ00100 DnaJ chaperone protei 99.5 3.6E-14 7.8E-19 96.2 4.0 50 2-52 66-115 (116)
60 KOG0714|consensus 99.4 4.6E-14 1E-18 107.4 3.3 65 1-65 3-71 (306)
61 PRK09430 djlA Dna-J like membr 99.4 9.2E-14 2E-18 106.9 4.7 52 2-53 201-262 (267)
62 PHA02624 large T antigen; Prov 99.4 1.3E-13 2.7E-18 115.4 4.5 58 2-60 12-71 (647)
63 PRK01773 hscB co-chaperone Hsc 99.4 5.9E-13 1.3E-17 96.5 6.9 65 1-65 2-76 (173)
64 COG5407 SEC63 Preprotein trans 99.4 2.2E-13 4.7E-18 109.8 4.8 70 2-71 99-176 (610)
65 TIGR00714 hscB Fe-S protein as 99.3 7.3E-12 1.6E-16 89.5 5.1 55 13-67 3-65 (157)
66 COG5269 ZUO1 Ribosome-associat 99.1 4.5E-11 9.8E-16 91.4 3.3 80 2-81 44-131 (379)
67 KOG1150|consensus 99.1 2.8E-10 6E-15 83.5 6.3 60 2-61 54-117 (250)
68 KOG0568|consensus 98.7 1.3E-08 2.9E-13 76.5 3.4 54 1-54 47-103 (342)
69 KOG1789|consensus 98.6 9.3E-08 2E-12 84.3 5.3 50 3-52 1283-1336(2235)
70 KOG0723|consensus 98.5 3.6E-07 7.7E-12 60.8 5.1 47 5-54 60-108 (112)
71 KOG3192|consensus 98.1 5.6E-06 1.2E-10 58.7 4.6 68 1-68 8-85 (168)
72 COG1076 DjlA DnaJ-domain-conta 97.5 6.2E-05 1.4E-09 54.5 2.8 50 2-51 114-173 (174)
73 KOG0431|consensus 97.5 0.00013 2.8E-09 60.3 4.1 45 7-51 394-448 (453)
74 COG1076 DjlA DnaJ-domain-conta 97.2 0.00027 5.8E-09 51.2 2.9 65 1-65 1-75 (174)
75 COG1996 RPC10 DNA-directed RNA 96.5 0.0053 1.1E-07 35.4 3.7 36 90-134 3-39 (49)
76 TIGR02098 MJ0042_CXXC MJ0042 f 95.8 0.017 3.7E-07 31.1 3.5 34 94-131 3-37 (38)
77 PF03656 Pam16: Pam16; InterP 95.8 0.019 4.1E-07 39.7 4.4 50 5-55 62-111 (127)
78 PF13719 zinc_ribbon_5: zinc-r 94.0 0.065 1.4E-06 29.0 2.6 34 94-131 3-37 (37)
79 PF13717 zinc_ribbon_4: zinc-r 92.6 0.16 3.4E-06 27.3 2.6 32 94-129 3-35 (36)
80 PF13446 RPT: A repeated domai 92.4 0.3 6.5E-06 29.1 4.0 24 4-27 8-31 (62)
81 smart00659 RPOLCX RNA polymera 92.3 0.38 8.2E-06 27.1 4.0 31 93-133 2-33 (44)
82 KOG0724|consensus 92.0 0.22 4.9E-06 39.3 4.0 53 12-64 3-62 (335)
83 PF11833 DUF3353: Protein of u 91.1 0.71 1.5E-05 34.1 5.6 38 10-52 1-38 (194)
84 PF05180 zf-DNL: DNL zinc fing 89.9 0.3 6.5E-06 29.9 2.2 35 93-131 4-41 (66)
85 PF14687 DUF4460: Domain of un 89.6 0.83 1.8E-05 30.8 4.5 23 11-33 4-26 (112)
86 PRK00398 rpoP DNA-directed RNA 87.8 1.4 3E-05 24.6 3.9 30 93-131 3-33 (46)
87 PF09723 Zn-ribbon_8: Zinc rib 87.2 0.82 1.8E-05 25.2 2.7 32 90-127 2-34 (42)
88 TIGR01206 lysW lysine biosynth 86.0 1.7 3.7E-05 25.5 3.7 33 94-133 3-36 (54)
89 smart00834 CxxC_CXXC_SSSS Puta 85.4 1.4 3.1E-05 23.6 3.1 33 90-128 2-35 (41)
90 PF05280 FlhC: Flagellar trans 85.2 0.69 1.5E-05 33.7 2.2 30 92-128 133-163 (175)
91 COG5552 Uncharacterized conser 81.6 5.3 0.00012 25.1 4.7 33 1-33 3-39 (88)
92 TIGR02605 CxxC_CxxC_SSSS putat 81.1 1.9 4.2E-05 24.5 2.6 32 90-127 2-34 (52)
93 PF03604 DNA_RNApol_7kD: DNA d 80.7 1.4 3E-05 23.1 1.6 31 94-134 1-32 (32)
94 PRK12860 transcriptional activ 80.0 1.3 2.8E-05 32.7 1.9 28 93-127 134-162 (189)
95 KOG3277|consensus 79.9 3 6.6E-05 29.8 3.7 38 90-131 76-116 (165)
96 PRK12722 transcriptional activ 79.3 1.4 3E-05 32.4 1.9 28 93-127 134-162 (187)
97 PF08271 TF_Zn_Ribbon: TFIIB z 78.5 2.3 5E-05 23.4 2.3 29 94-130 1-30 (43)
98 PF02216 B: B domain; InterPr 68.8 11 0.00024 22.0 3.5 30 3-35 13-42 (54)
99 KOG3442|consensus 68.0 7.2 0.00016 26.9 3.0 29 5-33 63-91 (132)
100 KOG1779|consensus 66.8 15 0.00032 23.3 4.0 37 91-132 30-66 (84)
101 PF07709 SRR: Seven Residue Re 64.3 6.4 0.00014 16.4 1.5 13 40-52 2-14 (14)
102 PF12434 Malate_DH: Malate deh 61.9 13 0.00027 18.7 2.5 18 14-31 9-26 (28)
103 PF10041 DUF2277: Uncharacteri 59.3 42 0.00092 21.1 5.2 32 2-33 4-35 (78)
104 PF09297 zf-NADH-PPase: NADH p 59.1 10 0.00022 19.3 2.0 26 95-129 5-31 (32)
105 PF04216 FdhE: Protein involve 58.8 6.2 0.00014 30.6 1.6 17 117-133 236-252 (290)
106 cd01388 SOX-TCF_HMG-box SOX-TC 54.7 45 0.00097 20.0 5.5 43 20-64 14-56 (72)
107 COG3195 Uncharacterized protei 54.1 46 0.001 24.2 5.3 40 10-49 41-117 (176)
108 smart00531 TFIIE Transcription 52.6 13 0.00029 25.9 2.4 37 92-131 98-135 (147)
109 KOG2324|consensus 52.5 41 0.00088 27.7 5.3 29 92-127 226-255 (457)
110 PRK06266 transcription initiat 50.8 8.4 0.00018 28.0 1.1 30 92-129 116-146 (178)
111 COG1579 Zn-ribbon protein, pos 49.6 5.9 0.00013 30.3 0.2 32 97-128 199-230 (239)
112 cd00084 HMG-box High Mobility 49.4 48 0.001 18.8 5.3 45 19-65 12-56 (66)
113 TIGR00373 conserved hypothetic 47.9 13 0.00027 26.5 1.6 29 93-129 109-138 (158)
114 PF13831 PHD_2: PHD-finger; PD 47.7 16 0.00035 19.4 1.7 13 118-130 3-15 (36)
115 PF11331 DUF3133: Protein of u 46.8 28 0.00061 19.7 2.6 17 117-133 29-45 (46)
116 PF00505 HMG_box: HMG (high mo 46.3 58 0.0013 18.8 5.1 43 20-64 13-55 (69)
117 PTZ00043 cytochrome c oxidase 46.0 1.4E+02 0.0029 23.0 7.3 37 16-52 96-132 (268)
118 cd01390 HMGB-UBF_HMG-box HMGB- 45.9 57 0.0012 18.6 5.3 41 22-64 15-55 (66)
119 TIGR01562 FdhE formate dehydro 44.6 19 0.00042 28.5 2.3 17 117-133 250-266 (305)
120 PF08274 PhnA_Zn_Ribbon: PhnA 43.2 16 0.00034 18.8 1.1 25 95-129 4-29 (30)
121 COG0089 RplW Ribosomal protein 42.8 24 0.00052 23.1 2.2 21 6-26 25-45 (94)
122 KOG0527|consensus 41.4 48 0.001 26.6 4.2 43 20-64 75-117 (331)
123 PTZ00199 high mobility group p 41.4 94 0.002 19.8 5.5 41 22-62 37-77 (94)
124 PF07295 DUF1451: Protein of u 41.3 28 0.0006 24.6 2.5 28 92-128 111-139 (146)
125 PRK00762 hypA hydrogenase nick 41.3 16 0.00036 24.8 1.3 35 94-132 71-106 (124)
126 cd01389 MATA_HMG-box MATA_HMG- 40.9 82 0.0018 19.0 5.7 43 19-63 13-55 (77)
127 PF14354 Lar_restr_allev: Rest 39.3 41 0.0009 19.4 2.8 33 94-127 4-37 (61)
128 PF07739 TipAS: TipAS antibiot 38.3 96 0.0021 20.0 4.7 48 8-61 51-99 (118)
129 TIGR00577 fpg formamidopyrimid 38.0 1.1E+02 0.0024 23.5 5.7 26 94-126 246-272 (272)
130 PF09855 DUF2082: Nucleic-acid 37.1 48 0.001 20.0 2.8 11 117-127 34-44 (64)
131 CHL00030 rpl23 ribosomal prote 36.3 36 0.00077 22.1 2.3 27 6-33 23-49 (93)
132 COG0266 Nei Formamidopyrimidin 35.8 1.2E+02 0.0026 23.7 5.5 26 93-126 245-272 (273)
133 COG2331 Uncharacterized protei 35.8 7.5 0.00016 24.5 -0.9 33 89-127 8-41 (82)
134 PRK14811 formamidopyrimidine-D 35.6 1.3E+02 0.0029 23.1 5.7 28 93-127 235-263 (269)
135 PRK13945 formamidopyrimidine-D 35.3 1.3E+02 0.0028 23.3 5.6 27 93-126 254-281 (282)
136 PRK14810 formamidopyrimidine-D 34.9 1.5E+02 0.0032 22.9 5.9 27 93-126 244-271 (272)
137 PF14803 Nudix_N_2: Nudix N-te 34.4 55 0.0012 17.1 2.4 30 95-129 2-32 (34)
138 KOG3960|consensus 34.1 28 0.00061 27.0 1.7 16 38-53 127-142 (284)
139 smart00661 RPOL9 RNA polymeras 33.9 46 0.00099 18.4 2.3 11 120-130 21-31 (52)
140 PF12662 cEGF: Complement Clr- 33.9 15 0.00033 17.8 0.2 15 93-107 1-15 (24)
141 cd01780 PLC_epsilon_RA Ubiquit 33.5 52 0.0011 21.5 2.6 32 2-33 12-43 (93)
142 PF06107 DUF951: Bacterial pro 33.4 63 0.0014 19.1 2.8 29 97-132 15-44 (57)
143 PRK03824 hypA hydrogenase nick 33.2 52 0.0011 22.7 2.8 13 93-105 70-83 (135)
144 PF05876 Terminase_GpA: Phage 32.9 39 0.00084 28.9 2.6 49 81-129 188-239 (557)
145 TIGR03636 L23_arch archaeal ri 32.6 46 0.001 20.8 2.3 21 6-26 18-38 (77)
146 COG2879 Uncharacterized small 32.4 80 0.0017 19.1 3.1 21 21-41 27-47 (65)
147 PRK01103 formamidopyrimidine/5 32.4 1.5E+02 0.0032 22.8 5.5 26 94-126 246-272 (274)
148 PRK03564 formate dehydrogenase 31.8 55 0.0012 26.0 3.1 17 117-133 250-266 (309)
149 PF07227 DUF1423: Protein of u 31.0 35 0.00077 28.5 1.9 15 117-131 144-158 (446)
150 COG5415 Predicted integral mem 30.5 1.9E+02 0.0041 22.0 5.5 22 100-129 203-224 (251)
151 PF01155 HypA: Hydrogenase exp 29.6 35 0.00077 22.7 1.5 29 93-132 70-100 (113)
152 COG1439 Predicted nucleic acid 29.4 52 0.0011 24.1 2.4 30 90-132 136-166 (177)
153 PF14951 DUF4503: Domain of un 29.1 33 0.00071 28.0 1.4 19 90-108 271-291 (389)
154 PRK02935 hypothetical protein; 28.7 29 0.00062 23.3 0.9 18 117-134 68-85 (110)
155 PRK11032 hypothetical protein; 28.6 2.1E+02 0.0045 20.5 5.3 27 93-128 124-151 (160)
156 PF14706 Tnp_DNA_bind: Transpo 28.6 1E+02 0.0022 18.2 3.2 39 17-55 16-57 (58)
157 PRK05738 rplW 50S ribosomal pr 28.2 59 0.0013 20.9 2.3 20 6-25 24-43 (92)
158 PF15178 TOM_sub5: Mitochondri 28.2 1.2E+02 0.0026 17.3 3.2 22 4-25 2-23 (51)
159 PF11023 DUF2614: Protein of u 28.0 23 0.00049 24.0 0.3 17 117-133 67-83 (114)
160 PRK00810 nifW nitrogenase stab 27.6 83 0.0018 21.3 3.0 28 3-30 21-53 (113)
161 PF13451 zf-trcl: Probable zin 27.5 11 0.00024 21.7 -1.1 33 94-126 5-40 (49)
162 PRK13798 putative OHCU decarbo 26.6 2.4E+02 0.0053 20.1 5.8 23 11-33 41-74 (166)
163 PF04502 DUF572: Family of unk 26.2 90 0.0019 24.8 3.4 15 119-133 77-91 (324)
164 PF14606 Lipase_GDSL_3: GDSL-l 26.2 72 0.0016 23.2 2.7 28 6-33 64-94 (178)
165 smart00440 ZnF_C2C2 C2C2 Zinc 26.2 1.2E+02 0.0025 16.3 3.6 11 118-128 27-37 (40)
166 PF00076 RRM_1: RNA recognitio 26.0 53 0.0012 18.5 1.7 20 6-25 3-22 (70)
167 PRK14548 50S ribosomal protein 25.9 70 0.0015 20.4 2.3 21 6-26 25-45 (84)
168 TIGR03180 UraD_2 OHCU decarbox 25.8 2.5E+02 0.0053 19.9 5.8 23 11-33 31-64 (158)
169 PF04967 HTH_10: HTH DNA bindi 25.7 27 0.00058 20.3 0.3 21 6-26 32-52 (53)
170 PF03820 Mtc: Tricarboxylate c 25.6 1.5E+02 0.0033 23.5 4.6 24 10-33 45-68 (308)
171 PF05191 ADK_lid: Adenylate ki 25.1 37 0.0008 18.0 0.8 26 95-127 3-29 (36)
172 PF14353 CpXC: CpXC protein 25.1 94 0.002 20.8 3.0 13 119-131 38-50 (128)
173 TIGR01385 TFSII transcription 25.1 3.5E+02 0.0075 21.4 6.6 38 91-129 256-296 (299)
174 COG5525 Bacteriophage tail ass 25.0 54 0.0012 28.5 2.1 43 88-130 222-270 (611)
175 PF08996 zf-DNA_Pol: DNA Polym 25.0 47 0.001 24.2 1.5 41 90-130 15-56 (188)
176 PF11126 Phage_DsbA: Transcrip 23.6 1.3E+02 0.0028 18.5 3.0 32 20-51 35-66 (69)
177 PF05781 MRVI1: MRVI1 protein; 23.5 3.4E+02 0.0074 23.4 6.4 25 9-33 184-208 (538)
178 PF03206 NifW: Nitrogen fixati 23.3 92 0.002 20.7 2.5 30 2-31 16-50 (105)
179 PF01096 TFIIS_C: Transcriptio 22.9 1.4E+02 0.0029 15.9 3.1 10 118-127 27-36 (39)
180 smart00398 HMG high mobility g 22.7 1.6E+02 0.0035 16.7 5.6 42 21-64 15-56 (70)
181 PF00628 PHD: PHD-finger; Int 22.4 1E+02 0.0022 16.8 2.4 15 117-131 12-26 (51)
182 COG2816 NPY1 NTP pyrophosphohy 21.9 50 0.0011 25.9 1.2 28 93-129 111-139 (279)
183 PF09963 DUF2197: Uncharacteri 21.6 31 0.00066 20.4 -0.0 35 97-131 7-43 (56)
184 PF08447 PAS_3: PAS fold; Int 21.4 35 0.00077 20.4 0.3 28 2-33 7-35 (91)
185 TIGR03164 UHCUDC OHCU decarbox 21.3 3.1E+02 0.0066 19.4 5.8 23 11-33 29-62 (157)
186 PHA02150 hypothetical protein 21.0 49 0.0011 20.2 0.8 15 119-133 38-52 (77)
187 COG4888 Uncharacterized Zn rib 20.4 1.8E+02 0.004 19.3 3.4 35 94-131 23-58 (104)
188 smart00427 H2B Histone H2B. 20.1 1.1E+02 0.0023 19.9 2.3 11 23-33 8-18 (89)
189 PF00276 Ribosomal_L23: Riboso 20.0 1E+02 0.0023 19.6 2.3 20 6-25 24-43 (91)
No 1
>COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]
Probab=99.97 E-value=6.4e-31 Score=207.65 Aligned_cols=127 Identities=26% Similarity=0.357 Sum_probs=95.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhccc--c--
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHM--N-- 73 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~--~-- 73 (134)
+|||+||||+++||.+|||+|||+||++||||+ ++.|+++|++|++||+|||||++|+.||+++......++ .
T Consensus 4 ~dyYeiLGV~k~As~~EIKkAYRkLA~kyHPD~n~g~~~AeeKFKEI~eAYEVLsD~eKRa~YD~fG~~~~~~gg~gg~g 83 (371)
T COG0484 4 RDYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRAAYDQFGHAGFKAGGFGGFG 83 (371)
T ss_pred cchhhhcCCCCCCCHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhCCHHHHHHhhccCccccccCCcCCCC
Confidence 699999999999999999999999999999999 468999999999999999999999999999998865321 1
Q ss_pred ---cc-cccccccccccCCce--eeeeeCCCCC----eEEEecccccCCCCCCce---eEeCCCCcc
Q psy7548 74 ---IY-KSVSLSDMEHKGESQ--MFTYPCRCGA----EFCIEEQDTEGDSSDDNI---LIACDTCSL 127 (134)
Q Consensus 74 ---~~-~~~~~~~~~~~~~~~--~~~~~crcG~----~~~v~~~~~~~g~~~~~~---~v~C~~CSl 127 (134)
+. ...++++.+|+++.. .-....+-|. ...|++++++.|....+. .+.|++|+-
T Consensus 84 ~~~fgg~~~DIF~~~FgGg~~~~~~~~~~~rG~Dl~~~l~isleEa~~G~~~~i~~~~~~~C~~C~G 150 (371)
T COG0484 84 FGGFGGDFGDIFEDFFGGGGGGRRRPNRPRRGADLRYNLEITLEEAVFGVKKEIRVTRSVTCSTCHG 150 (371)
T ss_pred cCCCCCCHHHHHHHhhcCCCcccCCCCCcccCCceEEEEEeEhhhhccCceeeEecceeeECCcCCC
Confidence 11 123333444432211 1111223443 677899999999766543 777887763
No 2
>KOG0712|consensus
Probab=99.95 E-value=6e-28 Score=188.60 Aligned_cols=124 Identities=26% Similarity=0.329 Sum_probs=96.6
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcccccccccccc
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHMNIYKSVSLS 81 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~~~~~~~~~~ 81 (134)
.+|+||||+++||.+|||+|||+|+++|||||++.+.++|++|.+||+|||||++|+.||+++++....++.......+.
T Consensus 5 ~~y~il~v~~~As~~eikkayrkla~k~HpDkn~~~~ekfkei~~AyevLsd~ekr~~yD~~g~~~~~~g~~~~g~~~f~ 84 (337)
T KOG0712|consen 5 KLYDILGVSPDASEEEIKKAYRKLALKYHPDKNPDAGEKFKEISQAYEVLSDPEKREIYDQYGEEGLQGGGGGGGFGGFS 84 (337)
T ss_pred ccceeeccCCCcCHHHHHHHHHHHHHHhCCCCCccHHHHHHHHHHHHHHhcCHHHHHHHHhhhhhhhcccCCCCCCccHH
Confidence 68999999999999999999999999999999999999999999999999999999999999998886553333333344
Q ss_pred cccccCCceeeeeeCCCC----CeEEEecccccCCCCCCce---eEeCCCCcc
Q psy7548 82 DMEHKGESQMFTYPCRCG----AEFCIEEQDTEGDSSDDNI---LIACDTCSL 127 (134)
Q Consensus 82 ~~~~~~~~~~~~~~crcG----~~~~v~~~~~~~g~~~~~~---~v~C~~CSl 127 (134)
+|+ +.+......+=| | ....|++++++.|.+.+.. .+.|+.|+-
T Consensus 85 ~~F-~~g~~~~~~~~r-g~~~~~~~~~~Le~~y~G~s~kl~l~~~~iCs~C~G 135 (337)
T KOG0712|consen 85 QFF-GFGGNGGRGRQR-GKDVVHQLKVTLEELYMGKSKKLFLSRNFICSKCSG 135 (337)
T ss_pred Hhc-cCCCcCcccccc-CCCceEEEEEEHHHhhcCCccceecccCccCCcCCC
Confidence 443 322111111113 4 4889999999999666543 777777763
No 3
>KOG0713|consensus
Probab=99.92 E-value=1.4e-25 Score=173.98 Aligned_cols=69 Identities=36% Similarity=0.622 Sum_probs=66.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQ 69 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~ 69 (134)
||||+||||+++||..|||+|||+||+++|||| +|.|.+.|++|+.||+|||||++|+.||..++....
T Consensus 16 rDfYelLgV~k~Asd~eIKkAYRKLALk~HPDkNpddp~A~e~F~~in~AYEVLsDpekRk~YD~~GEegL~ 87 (336)
T KOG0713|consen 16 RDFYELLGVPKNASDQEIKKAYRKLALKYHPDKNPDDPNANEKFKEINAAYEVLSDPEKRKHYDTYGEEGLK 87 (336)
T ss_pred CCHHHHhCCCCCCCHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhcCHHHHHHHHhhhHhhhc
Confidence 799999999999999999999999999999999 789999999999999999999999999999987765
No 4
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=99.92 E-value=1.6e-25 Score=181.10 Aligned_cols=123 Identities=24% Similarity=0.372 Sum_probs=87.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhccccccccccc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHMNIYKSVSL 80 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~~~~~~~~~ 80 (134)
+|||+||||+++||.+|||+|||+||++|||||++. .++|++|++||++|+||.+|+.||+++...........+..++
T Consensus 28 ~d~Y~vLGV~~~As~~eIKkAYrkla~k~HPDk~~~-~e~F~~i~~AYevLsD~~kR~~YD~~G~~~~~~~~~~~d~~d~ 106 (421)
T PTZ00037 28 EKLYEVLNLSKDCTTSEIKKAYRKLAIKHHPDKGGD-PEKFKEISRAYEVLSDPEKRKIYDEYGEEGLEGGEQPADASDL 106 (421)
T ss_pred hhHHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCch-HHHHHHHHHHHHHhccHHHHHHHhhhcchhcccCCCCcchhhh
Confidence 489999999999999999999999999999999653 5899999999999999999999999887643211111122233
Q ss_pred ccccccCCceeeeeeCCCCC----eEEEecccccCCCCCCce---eEeCCCCc
Q psy7548 81 SDMEHKGESQMFTYPCRCGA----EFCIEEQDTEGDSSDDNI---LIACDTCS 126 (134)
Q Consensus 81 ~~~~~~~~~~~~~~~crcG~----~~~v~~~~~~~g~~~~~~---~v~C~~CS 126 (134)
++.+|+++... ..-+-|. ...|++++++.|....+. .+.|+.|+
T Consensus 107 f~~~Fggg~~~--~~~~rg~di~~~l~vtLee~~~G~~~~i~~~r~~~C~~C~ 157 (421)
T PTZ00037 107 FDLIFGGGRKP--GGKKRGEDIVSHLKVTLEQIYNGAMRKLAINKDVICANCE 157 (421)
T ss_pred HHHhhcccccc--ccccCCCCEEEEeeeeHHHHhCCCceEEEeeccccccccC
Confidence 33344321100 1112232 567889999999766543 45666554
No 5
>PRK14296 chaperone protein DnaJ; Provisional
Probab=99.92 E-value=4.9e-25 Score=176.11 Aligned_cols=67 Identities=28% Similarity=0.544 Sum_probs=62.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+||++||||+ .+.+.++|++|++||+||+||++|+.||+++...
T Consensus 4 ~dyY~~Lgv~~~a~~~eik~ayrkla~~~HPD~n~~~~a~~~F~~i~~AyevLsD~~KR~~YD~~G~~~ 72 (372)
T PRK14296 4 KDYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYDQFGHAA 72 (372)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHhcCHHHhhhhhhccchh
Confidence 489999999999999999999999999999999 4668899999999999999999999999988753
No 6
>PRK14288 chaperone protein DnaJ; Provisional
Probab=99.91 E-value=9.6e-25 Score=174.30 Aligned_cols=67 Identities=45% Similarity=0.718 Sum_probs=62.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCC---HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN---SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~---~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+||++||||++ +.++++|++|++||+||+||++|+.||+++...
T Consensus 3 ~dyY~vLgv~~~As~~eIkkayrkla~k~HPD~~~~~~~a~~~f~~i~~AYevLsd~~kR~~YD~~G~~~ 72 (369)
T PRK14288 3 LSYYEILEVEKHSNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKKRALYDRYGKKG 72 (369)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHHhccHHHHHHHHHhcccc
Confidence 4999999999999999999999999999999993 568899999999999999999999999998753
No 7
>PRK14286 chaperone protein DnaJ; Provisional
Probab=99.90 E-value=3.2e-24 Score=171.43 Aligned_cols=67 Identities=39% Similarity=0.639 Sum_probs=62.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|+++||||+ .+.+.++|++|++||+||+||.+|+.||+++...
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~g 73 (372)
T PRK14286 4 RSYYDILGVSKSANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAYDQFGKAG 73 (372)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHHHhCchh
Confidence 489999999999999999999999999999999 3568899999999999999999999999988754
No 8
>PRK14287 chaperone protein DnaJ; Provisional
Probab=99.90 E-value=3.3e-24 Score=171.34 Aligned_cols=126 Identities=22% Similarity=0.364 Sum_probs=87.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCC--HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcc--c-c--
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH--M-N-- 73 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~--~-~-- 73 (134)
+|||+||||+++||.+|||+|||+|+++||||++ +.++++|++|++||++|+||.+|+.||+++....... + .
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HpD~~~~~~~~~~f~~i~~Ay~~L~d~~kR~~YD~~G~~~~~~~~~~~~~~ 83 (371)
T PRK14287 4 RDYYEVLGVDRNASVDEVKKAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAHYDQFGHTDPNQGFGGGGAG 83 (371)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhCcHhHHHHHHhhCCcccccccCCCCCc
Confidence 4899999999999999999999999999999993 6688999999999999999999999999987543211 0 0
Q ss_pred -cccccccccccccCCceeee-eeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7548 74 -IYKSVSLSDMEHKGESQMFT-YPCRCG----AEFCIEEQDTEGDSSDDNI---LIACDTCS 126 (134)
Q Consensus 74 -~~~~~~~~~~~~~~~~~~~~-~~crcG----~~~~v~~~~~~~g~~~~~~---~v~C~~CS 126 (134)
+....++++.+|++++.... ...+-| -...|++++++.|....+. .+.|+.|.
T Consensus 84 ~f~~~~d~f~~~fgg~~~~~~~~~~~~g~d~~~~l~vslee~~~G~~~~i~~~r~~~C~~C~ 145 (371)
T PRK14287 84 DFGGFSDIFDMFFGGGGGRRNPNAPRQGADLQYTMTLEFKEAVFGKETEIEIPREETCGTCH 145 (371)
T ss_pred cccchHHHHHhhhccccCCCCCCCCCCCCCEEEEEEEEHHHhcCCeEEEEEEeeeccCCCCC
Confidence 01111233333432111000 011223 2677899999999655443 45666664
No 9
>PRK14298 chaperone protein DnaJ; Provisional
Probab=99.90 E-value=3.1e-24 Score=171.80 Aligned_cols=126 Identities=26% Similarity=0.407 Sum_probs=87.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhccc------
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHM------ 72 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~------ 72 (134)
+|||+||||+++||.+|||+|||+||++||||+ .+.+.++|++|++||++|+||.+|+.||+++........
T Consensus 5 ~d~y~iLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~g~~~~~~~~~~~ 84 (377)
T PRK14298 5 RDYYEILGLSKDASVEDIKKAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAEKRAQYDRFGHAGIDNQYSAEDIF 84 (377)
T ss_pred CCHHHhhCCCCCCCHHHHHHHHHHHHHHhCccccCChhHHHHHHHHHHHHHHhcchHhhhhhhhcCccccccccCccccc
Confidence 489999999999999999999999999999999 366889999999999999999999999998865322110
Q ss_pred ---ccccccccccccccCCceeeeeeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7548 73 ---NIYKSVSLSDMEHKGESQMFTYPCRCG----AEFCIEEQDTEGDSSDDNI---LIACDTCS 126 (134)
Q Consensus 73 ---~~~~~~~~~~~~~~~~~~~~~~~crcG----~~~~v~~~~~~~g~~~~~~---~v~C~~CS 126 (134)
.+....++++.+|++++.......+-| -...|++++++.|....+. .+.|+.|.
T Consensus 85 ~~~~~~~~~d~f~~~Fgg~~~~~~~~~~~g~di~~~l~vslee~~~G~~~~i~~~r~~~C~~C~ 148 (377)
T PRK14298 85 RGADFGGFGDIFEMFFGGGGRRGRMGPRRGSDLRYDLYITLEEAAFGVRKDIDVPRAERCSTCS 148 (377)
T ss_pred ccCCcCcchhhhHhhhcCCCccCCCCCCCCCCEEEEEEEEHHHhhCCeEEEEEEEeeccCCCCC
Confidence 000111223333332111000112223 2678899999999666543 45555553
No 10
>PRK14276 chaperone protein DnaJ; Provisional
Probab=99.89 E-value=9.9e-24 Score=169.04 Aligned_cols=67 Identities=27% Similarity=0.509 Sum_probs=62.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|+++||||+ .+.+.++|++|++||++|+||.+|+.||+++...
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~~ 72 (380)
T PRK14276 4 TEYYDRLGVSKDASQDEIKKAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAAYDQYGAAG 72 (380)
T ss_pred CCHHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcCHHHHHHHHHHHHHHhcCHhhhhhHhhcCCcc
Confidence 489999999999999999999999999999999 4568899999999999999999999999988753
No 11
>PRK14285 chaperone protein DnaJ; Provisional
Probab=99.89 E-value=1.4e-23 Score=167.41 Aligned_cols=67 Identities=33% Similarity=0.617 Sum_probs=62.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|+++||||+ .+.+.++|++|++||++|+||.+|..||+++...
T Consensus 3 ~d~y~iLgv~~~a~~~eIk~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yd~~g~~~ 72 (365)
T PRK14285 3 RDYYEILGLSKGASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRAQYDRFGHTA 72 (365)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHcCcchhHHHHhcCcch
Confidence 489999999999999999999999999999999 3567899999999999999999999999988753
No 12
>PRK14280 chaperone protein DnaJ; Provisional
Probab=99.89 E-value=2.4e-23 Score=166.63 Aligned_cols=126 Identities=22% Similarity=0.321 Sum_probs=88.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcc-c--c--
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH-M--N-- 73 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~-~--~-- 73 (134)
+|||+||||+++|+.++||+|||+|+++||||+ .+.+.++|++|++||++|+||.+|+.||+++....... + .
T Consensus 4 ~~~y~iLgv~~~a~~~eik~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~~G~~~~~~~~~~~~~~ 83 (376)
T PRK14280 4 RDYYEVLGVSKSASKDEIKKAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQYDQFGHAGPNQGFGGGGFG 83 (376)
T ss_pred CChHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhccHhHHHHHHhcCccccccCcCCCCCC
Confidence 489999999999999999999999999999999 46688999999999999999999999999887543211 0 0
Q ss_pred ---cc---cccccccccccCCcee-eeeeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7548 74 ---IY---KSVSLSDMEHKGESQM-FTYPCRCG----AEFCIEEQDTEGDSSDDNI---LIACDTCS 126 (134)
Q Consensus 74 ---~~---~~~~~~~~~~~~~~~~-~~~~crcG----~~~~v~~~~~~~g~~~~~~---~v~C~~CS 126 (134)
+. ...++++.+|++++.. .....+-| -...|++++++.|....+. .+.|+.|+
T Consensus 84 ~~~~~~~~~~~d~f~~~fgg~~~~~~~~~~~kg~di~~~l~vtLee~~~G~~~~i~~~r~~~C~~C~ 150 (376)
T PRK14280 84 GGDFGGGFGFEDIFSSFFGGGGRRRDPNAPRQGADLQYTMTLTFEEAVFGKEKEIEIPKEETCDTCH 150 (376)
T ss_pred CCCccccccchhhHHHHhCCccccCcccccccccCEEEEEEEEHHHHhCCceeEEEEeeeccCCCCC
Confidence 00 0012222233321110 00111223 2678899999999766543 56666664
No 13
>PRK14278 chaperone protein DnaJ; Provisional
Probab=99.89 E-value=2.3e-23 Score=166.81 Aligned_cols=126 Identities=20% Similarity=0.297 Sum_probs=85.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcc---cccc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH---MNIY 75 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~---~~~~ 75 (134)
+|||+||||+++||.++||+|||+|+++||||+ .+.+.++|++|++||++|+||.+|+.||+++....... +.+.
T Consensus 3 ~d~y~iLgv~~~a~~~eik~ayr~la~~~hpD~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~YD~~G~~~~~~~~~~~g~~ 82 (378)
T PRK14278 3 RDYYGLLGVSRNASDAEIKRAYRKLARELHPDVNPDEEAQEKFKEISVAYEVLSDPEKRRIVDLGGDPLESAGGGGGGFG 82 (378)
T ss_pred CCcceecCCCCCCCHHHHHHHHHHHHHHHCCCCCCcHHHHHHHHHHHHHHHHhchhhhhhhhhccCCccccccCCCCCCC
Confidence 489999999999999999999999999999999 45578899999999999999999999999886422110 0111
Q ss_pred ----cccccccccccCCcee--eeeeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7548 76 ----KSVSLSDMEHKGESQM--FTYPCRCG----AEFCIEEQDTEGDSSDDNI---LIACDTCS 126 (134)
Q Consensus 76 ----~~~~~~~~~~~~~~~~--~~~~crcG----~~~~v~~~~~~~g~~~~~~---~v~C~~CS 126 (134)
...++++.+|++++.. .....+-| -...|++++++.|....+. .+.|+.|+
T Consensus 83 ~~f~~~~d~f~~ffgg~g~~~~~~~~~~~g~d~~~~l~vtLee~~~G~~~~i~~~~~~~C~~C~ 146 (378)
T PRK14278 83 GGFGGLGDVFEAFFGGGAASRGPRGRVRPGSDSLLRMRLDLEECATGVTKQVTVDTAVLCDRCH 146 (378)
T ss_pred cCcCchhHHHHHHhCCCCCCCCCccCCCCCCCeEEEEEEEHHHhcCCeEEEEEEEeeccCCCCc
Confidence 1112222223321100 00011223 2578899999999666543 45555553
No 14
>PRK14282 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=3.7e-23 Score=165.20 Aligned_cols=67 Identities=36% Similarity=0.634 Sum_probs=62.2
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCC----HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN----SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|+++||||++ +.+.++|++|++||++|+||.+|+.||+++...
T Consensus 4 ~d~y~~lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~g~~~ 74 (369)
T PRK14282 4 KDYYEILGVSRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKRAMYDRFGYVG 74 (369)
T ss_pred CChHHhcCCCCCCCHHHHHHHHHHHHHHHCCCCCccchhHHHHHHHHHHHHHHHhcChhhHHHHhhcCccc
Confidence 4999999999999999999999999999999993 457889999999999999999999999988654
No 15
>PRK14284 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=7.5e-23 Score=164.50 Aligned_cols=67 Identities=34% Similarity=0.645 Sum_probs=62.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|+++||||+ .+.+.++|++|++||++|+||.+|+.||+++...
T Consensus 1 ~d~y~iLgv~~~a~~~eikkayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~g 70 (391)
T PRK14284 1 MDYYTILGVSKTASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKRESYDRYGKDG 70 (391)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhcCHHHHHHHHhccccc
Confidence 699999999999999999999999999999999 3568899999999999999999999999988753
No 16
>PRK14277 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=7.5e-23 Score=164.26 Aligned_cols=67 Identities=40% Similarity=0.631 Sum_probs=62.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|+++||||+ .+.++++|++|++||++|+||.+|+.||+++...
T Consensus 5 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~~G~~~ 74 (386)
T PRK14277 5 KDYYEILGVDRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRAQYDQFGHAA 74 (386)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhCCHHHHHHHHhhcccc
Confidence 489999999999999999999999999999999 3567889999999999999999999999988653
No 17
>PRK14297 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=4.4e-23 Score=165.33 Aligned_cols=67 Identities=34% Similarity=0.631 Sum_probs=62.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|+++||||+ .+.++++|++|++||++|+||.+|+.||+++...
T Consensus 4 ~d~y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~~G~~~ 73 (380)
T PRK14297 4 KDYYEVLGLEKGASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKAQYDQFGTAD 73 (380)
T ss_pred CChHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhcCHhhhCchhhcCccc
Confidence 489999999999999999999999999999999 3568899999999999999999999999988754
No 18
>PRK14283 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=1.4e-22 Score=162.39 Aligned_cols=67 Identities=33% Similarity=0.581 Sum_probs=62.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCC--HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|+++||||++ +.+.++|++|++||++|+||.+|+.||+++...
T Consensus 5 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~~Lsd~~kR~~YD~~G~~g 73 (378)
T PRK14283 5 RDYYEVLGVDRNADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYDQFGHAG 73 (378)
T ss_pred CChHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhchhHHHHHHhhhcccc
Confidence 4899999999999999999999999999999994 678899999999999999999999999988653
No 19
>PRK10767 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=1e-22 Score=162.70 Aligned_cols=67 Identities=34% Similarity=0.656 Sum_probs=62.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||+++...
T Consensus 4 ~d~y~iLgv~~~as~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~~L~d~~~r~~yd~~g~~~ 73 (371)
T PRK10767 4 RDYYEVLGVSRNASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRAAYDQYGHAA 73 (371)
T ss_pred CChHHhcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCcHHHHHHHHHHHHHHHHhcchhhhhHhhhccccc
Confidence 489999999999999999999999999999999 3567899999999999999999999999988654
No 20
>PRK14301 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=5.2e-23 Score=164.55 Aligned_cols=67 Identities=31% Similarity=0.614 Sum_probs=62.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.++++|++|++||+||+||.+|+.||+++...
T Consensus 4 ~~~y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~~g~~g 73 (373)
T PRK14301 4 RDYYEVLGVSRDASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRARYDRFGHAG 73 (373)
T ss_pred CChHHhcCCCCCCCHHHHHHHHHHHHHHhCCCcCCCChHHHHHHHHHHHHHHHhcchhhhhhhhhccccc
Confidence 489999999999999999999999999999999 3567889999999999999999999999988754
No 21
>PRK14294 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=9.5e-23 Score=162.69 Aligned_cols=67 Identities=36% Similarity=0.668 Sum_probs=62.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|+++||||+ .+.+.++|++|++||+||+||.+|+.||+++...
T Consensus 4 ~d~y~~lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~~~~f~~~~~Ay~vL~d~~~r~~yD~~G~~g 73 (366)
T PRK14294 4 RDYYEILGVTRDASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRGIYDQYGHEG 73 (366)
T ss_pred CChHHHhCCCCCCCHHHHHHHHHHHHHHHCCCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHHhhcccc
Confidence 589999999999999999999999999999999 3567899999999999999999999999998754
No 22
>PRK14295 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=1.7e-22 Score=162.33 Aligned_cols=62 Identities=37% Similarity=0.695 Sum_probs=59.1
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHH
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYES 62 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~ 62 (134)
+|||+||||+++||.++||+|||+|+++||||+ .+.++++|++|++||++|+||.+|+.||+
T Consensus 9 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~ 73 (389)
T PRK14295 9 KDYYKVLGVPKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRKEYDE 73 (389)
T ss_pred cCHHHhcCCCCCCCHHHHHHHHHHHHHHHCCCcCCCchhHHHHHHHHHHHHHHHCchhhHHHHHH
Confidence 489999999999999999999999999999999 35688999999999999999999999998
No 23
>PRK14291 chaperone protein DnaJ; Provisional
Probab=99.86 E-value=2.2e-22 Score=161.37 Aligned_cols=67 Identities=45% Similarity=0.682 Sum_probs=62.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCC--HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|+++||||++ +.+.++|++|++||++|+||.+|+.||+++...
T Consensus 3 ~d~Y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vLsd~~kR~~YD~~g~~~ 71 (382)
T PRK14291 3 KDYYEILGVSRNATQEEIKKAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKLYDQFGHAA 71 (382)
T ss_pred CCHHHhhCCCCCCCHHHHHHHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhcCHHHHHHHhhhcccc
Confidence 5899999999999999999999999999999994 568899999999999999999999999988754
No 24
>TIGR02349 DnaJ_bact chaperone protein DnaJ. This model represents bacterial forms of DnaJ, part of the DnaK-DnaJ-GrpE chaperone system. The three components typically are encoded by consecutive genes. DnaJ homologs occur in many genomes, typically not near DnaK and GrpE-like genes; most such genes are not included by this family. Eukaryotic (mitochondrial and chloroplast) forms are not included in the scope of this family.
Probab=99.86 E-value=3.2e-22 Score=159.00 Aligned_cols=66 Identities=39% Similarity=0.663 Sum_probs=62.0
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCCC--HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
|||+||||+++|+.++||+|||+|++++|||++ +.+.++|++|++||++|+||.+|..||.++...
T Consensus 1 d~y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~~R~~yd~~g~~~ 68 (354)
T TIGR02349 1 DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQYDQFGHAG 68 (354)
T ss_pred ChHHhCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhhChHHHHhhhhccccc
Confidence 799999999999999999999999999999994 567899999999999999999999999988754
No 25
>PRK14279 chaperone protein DnaJ; Provisional
Probab=99.86 E-value=5.6e-22 Score=159.51 Aligned_cols=65 Identities=29% Similarity=0.529 Sum_probs=61.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+|||+||||+++|+.+|||+|||+|+++||||+ .+.+.++|++|++||+||+||++|+.||+++.
T Consensus 9 ~Dyy~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vLsD~~KR~~YD~~G~ 76 (392)
T PRK14279 9 KDFYKELGVSSDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYDETRR 76 (392)
T ss_pred cCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCChHHHHHHHHHHHHHHHhcchhhhhHHHHhhh
Confidence 599999999999999999999999999999999 34688999999999999999999999999875
No 26
>PRK14292 chaperone protein DnaJ; Provisional
Probab=99.85 E-value=7.9e-22 Score=157.62 Aligned_cols=126 Identities=25% Similarity=0.376 Sum_probs=87.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhc--c-cccc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT--H-MNIY 75 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~--~-~~~~ 75 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|+.|++||++|+||.+|+.||+++...... . +++.
T Consensus 2 ~d~y~~Lgv~~~a~~~~ik~ayr~l~~~~hpD~~~~~~a~~~~~~i~~Ay~vL~d~~~r~~yd~~G~~~~~~~~~~~~~~ 81 (371)
T PRK14292 2 MDYYELLGVSRTASADEIKSAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRAHYDRFGTAPGAGMPGGDPFG 81 (371)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCCChhHHHHHHHHHHHHHHhcchhhhhhHhhcCCcccccccCCcccC
Confidence 699999999999999999999999999999999 4568899999999999999999999999988653211 0 0100
Q ss_pred c-cc---ccccccccCCc--eeee-eeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7548 76 K-SV---SLSDMEHKGES--QMFT-YPCRCG----AEFCIEEQDTEGDSSDDNI---LIACDTCS 126 (134)
Q Consensus 76 ~-~~---~~~~~~~~~~~--~~~~-~~crcG----~~~~v~~~~~~~g~~~~~~---~v~C~~CS 126 (134)
. .+ ++++.+|+... .... ...+-| -...+++++++.|....+. .+.|+.|.
T Consensus 82 ~~~~d~~d~f~~~fg~~~~~~~~~~~~~~~g~d~~~~l~~sLee~~~G~~~~v~~~r~~~C~~C~ 146 (371)
T PRK14292 82 GMGFDPMDIFEQLFGGAGFGGGRGRRGPARGDDLETEARITLEQARAGEEVEVEVDRLTECEHCH 146 (371)
T ss_pred ccCCChHHHHHHhhCCCCcCCCCCcccccCCCCeEEEEeccHHHHcCCeEEEEEEEeeecCCCCc
Confidence 0 01 22222232210 0000 111223 2567789999999666543 57787774
No 27
>PRK14300 chaperone protein DnaJ; Provisional
Probab=99.85 E-value=1.2e-21 Score=156.75 Aligned_cols=67 Identities=28% Similarity=0.553 Sum_probs=62.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|+++||||+ .+.++++|++|++||++|+|+.+|+.||+++...
T Consensus 3 ~~~y~iLgv~~~as~~eik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~~r~~yD~~G~~~ 71 (372)
T PRK14300 3 QDYYQILGVSKTASQADLKKAYLKLAKQYHPDTTDAKDAEKKFKEINAAYDVLKDEQKRAAYDRFGHDA 71 (372)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcCHHHHHHHHHHHHHHhhhHhHhhHHHhccccc
Confidence 589999999999999999999999999999999 3557889999999999999999999999987653
No 28
>PRK14289 chaperone protein DnaJ; Provisional
Probab=99.85 E-value=2.9e-21 Score=155.09 Aligned_cols=67 Identities=31% Similarity=0.594 Sum_probs=62.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||+++...
T Consensus 5 ~~~y~~Lgv~~~a~~~eik~ayr~la~~~HpD~~~~~~~a~~~f~~i~~Ay~~L~d~~~R~~yD~~G~~~ 74 (386)
T PRK14289 5 RDYYEVLGVSKTATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRSRYDQFGHAG 74 (386)
T ss_pred CCHHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCChHHHHHHHHHHHHHHHhcCHHHHHHHHHhcccc
Confidence 489999999999999999999999999999999 3568899999999999999999999999988753
No 29
>PRK14290 chaperone protein DnaJ; Provisional
Probab=99.84 E-value=1.7e-21 Score=155.45 Aligned_cols=67 Identities=31% Similarity=0.603 Sum_probs=62.1
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCC----HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN----SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|++++|||++ ..+.++|++|++||++|+||.+|+.||.++...
T Consensus 3 ~d~y~vLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~~L~d~~~r~~yd~~G~~~ 73 (365)
T PRK14290 3 KDYYKILGVDRNASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKRRQYDQTGTVD 73 (365)
T ss_pred CChhhhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchhHHHHHHHHHHHHHHHhcChhhhhhhcccCCcc
Confidence 4899999999999999999999999999999993 257899999999999999999999999988653
No 30
>PRK14293 chaperone protein DnaJ; Provisional
Probab=99.84 E-value=3.1e-21 Score=154.41 Aligned_cols=67 Identities=30% Similarity=0.536 Sum_probs=62.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|+.|++||++|+||.+|+.||.++...
T Consensus 3 ~d~y~vLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~vL~~~~~R~~yd~~g~~g 71 (374)
T PRK14293 3 ADYYEILGVSRDADKDELKRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRARYDQFGEAG 71 (374)
T ss_pred CChhhhcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCcCHHHHHHHHHHHHHHHhchHHHHHHhhccccc
Confidence 489999999999999999999999999999999 4568899999999999999999999999988653
No 31
>KOG0716|consensus
Probab=99.84 E-value=1.9e-21 Score=147.09 Aligned_cols=70 Identities=36% Similarity=0.608 Sum_probs=65.2
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT 70 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~ 70 (134)
+|+|+|||++++|+.++||++||+|++++|||+ +|++..+|++||+||+||+||.+|..||.++......
T Consensus 31 ~~LYdVLgl~k~at~d~IKKaYR~L~~k~HPD~~gd~P~~~dkf~eIN~Ay~ILsD~~kR~~YD~~g~~~l~l 103 (279)
T KOG0716|consen 31 LDLYDVLGLPKTATKDEIKKAYRKLALKYHPDKNGDNPEATDKFKEINTAYAILSDPTKRNVYDEYGELGLKL 103 (279)
T ss_pred hHHHHHhCCCcccchHHHHHHHHHHHHHhCCCcCCCCchhHHHHHHHHHHHHHhcChhhhhhHHHhhhHHHHH
Confidence 478999999999999999999999999999999 5779999999999999999999999999998876543
No 32
>KOG0717|consensus
Probab=99.84 E-value=1.8e-21 Score=155.86 Aligned_cols=71 Identities=34% Similarity=0.558 Sum_probs=65.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC----CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK----NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
|.||+||||..+|+..+||++||+|||+||||| -+.+.++|+.|+.||+|||||..|++||.+..+.....
T Consensus 8 ~c~YE~L~v~~~a~d~eik~~YRklALq~HPDknpd~ieeat~~F~~i~aAYeVLSdp~eR~wyd~hreqil~~~ 82 (508)
T KOG0717|consen 8 RCYYEVLGVERDADDDEIKKNYRKLALQYHPDKNPDRIEEATQQFQLIQAAYEVLSDPQERAWYDSHREQILRGK 82 (508)
T ss_pred hHHHHHhcccccCCHHHHHHHHHHHHHhhCCCCCCccHHHHHHHHHHHHHHHHHhcChHhhhhHHHHHHHHhcCC
Confidence 579999999999999999999999999999999 45789999999999999999999999999999877544
No 33
>PRK14299 chaperone protein DnaJ; Provisional
Probab=99.83 E-value=5.5e-21 Score=148.45 Aligned_cols=66 Identities=35% Similarity=0.626 Sum_probs=62.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLS 66 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~ 66 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||+++..
T Consensus 4 ~d~y~vLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~kr~~yD~~g~~ 71 (291)
T PRK14299 4 KDYYAILGVPKNASQDEIKKAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYDTYGTT 71 (291)
T ss_pred CCHHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhcCHHHHHHHHhcCCc
Confidence 489999999999999999999999999999999 467889999999999999999999999998875
No 34
>KOG0715|consensus
Probab=99.83 E-value=1.3e-20 Score=146.15 Aligned_cols=66 Identities=39% Similarity=0.663 Sum_probs=63.7
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
|||+||||+++|+..|||.||++|++++|||. .+.+.++|++|.+||++|+|+++|..||..+...
T Consensus 44 d~Y~vLgv~~~At~~EIK~Af~~LaKkyHPD~n~~~~a~~kF~eI~~AYEiLsd~eKR~~YD~~~~~~ 111 (288)
T KOG0715|consen 44 DYYKVLGVSRNATLSEIKSAFRKLAKKYHPDVNKDKEASKKFKEISEAYEILSDEEKRQEYDVYGLEQ 111 (288)
T ss_pred chhhhhCcCCCCCHHHHHHHHHHHHHhhCCCCCCCcchhhHHHHHHHHHHHhcCHHHHHHHHHhhhhc
Confidence 79999999999999999999999999999999 6689999999999999999999999999999875
No 35
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=99.83 E-value=1.3e-20 Score=115.29 Aligned_cols=60 Identities=45% Similarity=0.763 Sum_probs=56.5
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CH--HHHHHHHHHHHHHHHccChHHHHHHH
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NS--EHQEMFLKINEAWNILKDEKERKLYE 61 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~--~~~~~f~~i~~Ay~~L~d~~~R~~Yd 61 (134)
|||+||||+++++.++||++|+++++++|||+ .. .+...|..|++||++|++|.+|+.||
T Consensus 1 ~~y~iLgl~~~~~~~eik~~y~~l~~~~HPD~~~~~~~~~~~~~~~i~~Ay~~L~~~~~R~~YD 64 (64)
T PF00226_consen 1 NPYEILGLPPDASDEEIKKAYRRLSKQYHPDKNSGDEAEAEEKFARINEAYEILSDPERRRRYD 64 (64)
T ss_dssp HHHHHCTSTTTSSHHHHHHHHHHHHHHTSTTTGTSTHHHHHHHHHHHHHHHHHHHSHHHHHHHH
T ss_pred ChHHHCCCCCCCCHHHHHHHHHhhhhccccccchhhhhhhhHHHHHHHHHHHHhCCHHHHHhcC
Confidence 68999999999999999999999999999999 32 37799999999999999999999998
No 36
>PRK14281 chaperone protein DnaJ; Provisional
Probab=99.81 E-value=2.3e-20 Score=150.34 Aligned_cols=67 Identities=36% Similarity=0.665 Sum_probs=62.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++|+.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||+++...
T Consensus 3 ~d~y~iLgv~~~a~~~eikkayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~~g~~~ 72 (397)
T PRK14281 3 RDYYEVLGVSRSADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRRRYDQFGHAG 72 (397)
T ss_pred CChhhhcCCCCCCCHHHHHHHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhhhhhhhhhhhhccchh
Confidence 489999999999999999999999999999999 3567899999999999999999999999988754
No 37
>KOG0718|consensus
Probab=99.79 E-value=1.1e-19 Score=145.96 Aligned_cols=69 Identities=30% Similarity=0.544 Sum_probs=65.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC------CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQ 69 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~ 69 (134)
.+||.+|||+++||.+|||+|||++++.+|||| +..|++.|+.|.+||+||+||.+|..||.++++...
T Consensus 9 ~e~Ya~LNlpkdAt~eeI~~AYrr~~~lfHPDkh~dpd~K~~AE~~F~~i~~AyEVLsDp~kRaIYD~~G~qGL~ 83 (546)
T KOG0718|consen 9 IELYALLNLPKDATDEEIKKAYRRLSRLFHPDKHTDPDQKKAAEEKFQRIQRAYEVLSDPQKRAIYDNYGEQGLK 83 (546)
T ss_pred hhHHHHhCCCcccCHHHHHHHHHHHHHhcCCcccCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHhhhcccc
Confidence 379999999999999999999999999999999 567899999999999999999999999999998775
No 38
>PRK10266 curved DNA-binding protein CbpA; Provisional
Probab=99.79 E-value=1.6e-19 Score=141.12 Aligned_cols=65 Identities=31% Similarity=0.619 Sum_probs=61.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++.
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~k~HPD~~~~~~~~~~f~~i~~Ay~~L~~~~kr~~yD~~g~ 70 (306)
T PRK10266 4 KDYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQLWQ 70 (306)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHhhc
Confidence 589999999999999999999999999999999 45688999999999999999999999999864
No 39
>KOG0691|consensus
Probab=99.78 E-value=3.2e-19 Score=138.14 Aligned_cols=71 Identities=31% Similarity=0.579 Sum_probs=66.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
.|||+||||+.+||..+|++||++.++++|||| +|.+.++|+.|.+||+||+|+.+|..||..+.......
T Consensus 5 ~dyY~lLgi~~~at~~eIkKaYr~kaL~~HPDKNp~dP~A~ekFq~L~eAy~VL~D~~~R~~YDk~~k~~~~~~ 78 (296)
T KOG0691|consen 5 TDYYDLLGISEDATDAEIKKAYRKKALQYHPDKNPGDPQAAEKFQELSEAYEVLSDEESRAAYDKLRKSGSSAQ 78 (296)
T ss_pred chHHHHhCCCCCCCHHHHHHHHHHHHHhcCCCCCCCChHHHHHHHHHHHHHHHhcCHHHHHHHHHHhhhcccch
Confidence 489999999999999999999999999999999 78899999999999999999999999999999776543
No 40
>PTZ00341 Ring-infected erythrocyte surface antigen; Provisional
Probab=99.77 E-value=4.5e-19 Score=152.60 Aligned_cols=68 Identities=35% Similarity=0.509 Sum_probs=63.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
++||+||||+++||..+||+|||+||+++|||+ .+.+..+|+.|++||+||+||.+|+.||.+|..+.
T Consensus 573 ~dYYdILGVs~dAS~~EIKKAYRKLAlkyHPDKN~~~~A~ekFq~I~EAYeVLSDp~kRk~YD~~G~~Gl 642 (1136)
T PTZ00341 573 TLFYDILGVGVNADMKEISERYFKLAENYYPPKRSGNEGFHKFKKINEAYQILGDIDKKKMYNKFGYDGI 642 (1136)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHhCCCCCCCchHHHHHHHHHHHHHHhCCHHHHHHHhhcccccc
Confidence 489999999999999999999999999999999 34578899999999999999999999999988653
No 41
>KOG0719|consensus
Probab=99.75 E-value=1.1e-18 Score=129.75 Aligned_cols=67 Identities=33% Similarity=0.597 Sum_probs=62.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC-----CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK-----NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk-----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|+|+||||.++|+..+||+||++|++++|||+ ...+..+|+.|+.||.||+|.++|+.||..+.-.
T Consensus 14 ~d~YevLGVer~a~~~eIrkAY~klal~~HPDk~~eed~~ea~~kFq~l~k~y~iLsDeekR~~YDetG~id 85 (264)
T KOG0719|consen 14 KDLYEVLGVERDATDKEIRKAYHKLALRLHPDKNHEEDKVEATEKFQQLQKAYQILSDEEKRAVYDETGSID 85 (264)
T ss_pred cCHHHHhhhcccCCHHHHHHHHHHHHHHhCCCcchhhhHHHHHHHHHHHHHHHHHhhHHHHHHHHhccCCCC
Confidence 589999999999999999999999999999999 4568899999999999999999999999888755
No 42
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=99.74 E-value=2.9e-18 Score=103.38 Aligned_cols=55 Identities=47% Similarity=0.820 Sum_probs=51.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCCC----HHHHHHHHHHHHHHHHccChH
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN----SEHQEMFLKINEAWNILKDEK 55 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~----~~~~~~f~~i~~Ay~~L~d~~ 55 (134)
+|||+||||+++++.++||++|+++++++|||+. +.+.+.|..|++||++|+||.
T Consensus 1 ~~~y~vLgl~~~~~~~~ik~ay~~l~~~~HPD~~~~~~~~~~~~~~~l~~Ay~~L~~~~ 59 (60)
T smart00271 1 TDYYEILGVPRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDPE 59 (60)
T ss_pred CCHHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHHHcCCC
Confidence 5899999999999999999999999999999993 358889999999999999985
No 43
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=99.73 E-value=7.9e-18 Score=99.69 Aligned_cols=52 Identities=48% Similarity=0.818 Sum_probs=49.2
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCCC---HHHHHHHHHHHHHHHHccC
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN---SEHQEMFLKINEAWNILKD 53 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~---~~~~~~f~~i~~Ay~~L~d 53 (134)
|||+||||+++++.++||++|+++++++|||+. +.+.+.|..|++||++|+|
T Consensus 1 ~~y~vLgl~~~~~~~~ik~~y~~l~~~~HPD~~~~~~~~~~~~~~l~~Ay~~L~d 55 (55)
T cd06257 1 DYYDILGVPPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55 (55)
T ss_pred ChHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhcC
Confidence 799999999999999999999999999999993 5689999999999999986
No 44
>TIGR03835 termin_org_DnaJ terminal organelle assembly protein TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ homolog and probable assembly protein of the Mycoplasma terminal organelle. The terminal organelle is involved in both cytadherence and gliding motility.
Probab=99.71 E-value=1.1e-17 Score=141.38 Aligned_cols=68 Identities=29% Similarity=0.528 Sum_probs=63.1
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
+|||+||||+++|+.++||+|||+|++++|||+ .+.+.++|+.|++||++|+||.+|+.||.++....
T Consensus 2 ~DYYeVLGVs~dAS~eEIKKAYRKLAKKyHPDKn~~~eAeekFqeINEAYEVLSDP~KRa~YD~fG~aG~ 71 (871)
T TIGR03835 2 RDYYEVLGIDRDADEQEIKKAFRKLAKKYHPDRNKAPDAASIFAEINEANDVLSNPKKRANYDKYGHDGV 71 (871)
T ss_pred CChhHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhCCHHHHHHHhhhccccc
Confidence 589999999999999999999999999999999 55677899999999999999999999999887543
No 45
>KOG0721|consensus
Probab=99.71 E-value=1.7e-17 Score=122.25 Aligned_cols=68 Identities=34% Similarity=0.567 Sum_probs=63.5
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCCC---HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhh
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN---SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQ 69 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~---~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~ 69 (134)
|+|+||||+++||.+|||+|||+|++++||||. .+.++.|..|.+||+.|+|+..|..|..++.....
T Consensus 100 DPyEILGl~pgas~~eIKkaYR~LSik~HPDK~~~~~~~e~~~~~I~KAY~aLTD~~sreN~ekYG~PDGp 170 (230)
T KOG0721|consen 100 DPYEILGLDPGASEKEIKKAYRRLSIKYHPDKQPPEEGDEEFFEAIAKAYQALTDKKSRENWEKYGNPDGP 170 (230)
T ss_pred CcHHhhCCCCCCCHHHHHHHHHHhhhhhCCCcCCCcchhHHHHHHHHHHHHHhcchhhHHHHHHhCCCCCc
Confidence 899999999999999999999999999999992 66788899999999999999999999999987654
No 46
>KOG0720|consensus
Probab=99.70 E-value=3.9e-18 Score=136.86 Aligned_cols=98 Identities=27% Similarity=0.508 Sum_probs=77.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhccccccccc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHMNIYKSV 78 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~~~~~~~ 78 (134)
+|+|.+|||+++++.++||+.||++|...|||| .+.|++.|+.|+.||++|+|+++|..||..+..+......+-+
T Consensus 235 ~daYsvlGl~~d~sd~~lKk~Yrk~A~LVhPDKn~~~~A~Eafk~Lq~Afevig~~~kR~eYd~e~~kene~~~~~~~-- 312 (490)
T KOG0720|consen 235 LDAYSALGLPSDCSDADLKKNYRKKAMLVHPDKNMIPRAEEAFKKLQVAFEVIGDSVKRKEYDLELKKENELHRQVIS-- 312 (490)
T ss_pred CCchhhcCCCCCCCHHHHHHHHHhhceEeCCCccCChhHHHHHHHHHHHHHHhcchhhhhHHHHHHHHHHHHHHHHHH--
Confidence 589999999999999999999999999999999 8899999999999999999999999999988877654321111
Q ss_pred ccccccccCCceeeeeeC-CCCC
Q psy7548 79 SLSDMEHKGESQMFTYPC-RCGA 100 (134)
Q Consensus 79 ~~~~~~~~~~~~~~~~~c-rcG~ 100 (134)
+..+|.-...+......| +||+
T Consensus 313 ~~~~~~~~~eEA~ntI~CskC~n 335 (490)
T KOG0720|consen 313 SLNDLQKAVEEARNTIFCSKCGN 335 (490)
T ss_pred HHHHHHHHHHHHHhheehhhhcC
Confidence 111332222223445778 6884
No 47
>PRK05014 hscB co-chaperone HscB; Provisional
Probab=99.69 E-value=1.9e-17 Score=119.76 Aligned_cols=67 Identities=24% Similarity=0.352 Sum_probs=58.7
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHcCCCC---C-H----HHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK---N-S----EHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~--as~~~Ik~ay~~l~~~~HPDk---~-~----~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||++|||++. ++..+|+++|+++++++|||+ . + .+.+.|..|++||++|+||.+|..|+..+...
T Consensus 1 ~~yf~llgl~~~~~~d~~~L~~~yr~l~~~~HPDk~~~~~~~~~~~a~~~s~~iN~AY~~L~dp~~Ra~Yll~l~g~ 77 (171)
T PRK05014 1 MDYFTLFGLPARYDIDTQLLASRYQELQRQFHPDKFANASERERLLAVQQAATINDAYQTLKHPLKRAEYLLSLHGF 77 (171)
T ss_pred CCHHHHCCCCCCCCCCHHHHHHHHHHHHHHhCcCCCCCCcHHHHHHHHHHHHHHHHHHHHHCChhHHHHHHHHhcCC
Confidence 699999999996 678999999999999999999 1 1 24668999999999999999999999776643
No 48
>PHA03102 Small T antigen; Reviewed
Probab=99.69 E-value=1.1e-17 Score=118.71 Aligned_cols=67 Identities=15% Similarity=0.277 Sum_probs=60.5
Q ss_pred CccccccCCCCC--CHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhh
Q psy7548 2 NHFEILNCNVNS--TLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQ 69 (134)
Q Consensus 2 ~~Y~iLgv~~~a--s~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~ 69 (134)
..|+||||+++| |.++||+|||++++++||||++ +.++|++|++||++|+|+.+|..||.++.....
T Consensus 6 ~l~~vLGl~~~A~~s~~eIKkAYr~la~~~HPDkgg-~~e~~k~in~Ay~~L~d~~~r~~yd~~g~~~~~ 74 (153)
T PHA03102 6 ELMDLLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGG-DEEKMKELNTLYKKFRESVKSLRDLDGEEDSSS 74 (153)
T ss_pred HHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCc-hhHHHHHHHHHHHHHhhHHHhccccccCCcccc
Confidence 368999999999 9999999999999999999953 467999999999999999999999998876543
No 49
>COG2214 CbpA DnaJ-class molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=99.69 E-value=5.2e-17 Score=118.61 Aligned_cols=64 Identities=44% Similarity=0.735 Sum_probs=59.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---CH-HHHHHHHHHHHHHHHccChHHHHHHHHHH
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NS-EHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~~-~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
++||+||||+++|+..+|+++||++++++|||+ .+ .+.+.|+.|++||++|+|+.+|..||..+
T Consensus 6 ~~~y~iLgv~~~as~~eik~ayrkla~~~HPD~~~~~~~~a~~~f~~i~~Ay~vLsd~~~r~~yd~~~ 73 (237)
T COG2214 6 LDYYEILGVPPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDPERRAEYDKIG 73 (237)
T ss_pred hhHHHHhCCCCCCCHHHHHHHHHHHHHHhCCCCCCCchhHHHHHHHHHHHHHHHhhCHHHHHHhhhhc
Confidence 589999999999999999999999999999999 33 27899999999999999999999999874
No 50
>KOG2923|consensus
Probab=99.68 E-value=2.6e-17 Score=98.75 Aligned_cols=56 Identities=34% Similarity=0.764 Sum_probs=53.1
Q ss_pred cccccccccccccCCceeeeeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEee
Q psy7548 74 IYKSVSLSDMEHKGESQMFTYPCRCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~crcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~~ 133 (134)
+++.+.+++|+|+++.++|.+||+||+.|.|++++|..| ..++.|++|||.|.|+.
T Consensus 3 ~ydeVeiedfe~~~e~~~y~yPCpCGDrf~It~edL~~g----e~Va~CpsCSL~I~ViY 58 (67)
T KOG2923|consen 3 FYDEVEIEDFEFDEENQTYYYPCPCGDRFQITLEDLENG----EDVARCPSCSLIIRVIY 58 (67)
T ss_pred ccceEEeecceeccCCCeEEcCCCCCCeeeecHHHHhCC----CeeecCCCceEEEEEEe
Confidence 567889999999999999999999999999999999999 99999999999999974
No 51
>PF05207 zf-CSL: CSL zinc finger; InterPro: IPR007872 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a probable zinc binding motif that contains four cysteines and may chelate zinc, known as the DPH-type after the diphthamide (DPH) biosynthesis protein in which it was first characterised, including the proteins DPH3 and DPH4. This domain is also found associated with N-terminal domain of heat shock protein DnaJ IPR001623 from INTERPRO domain. Diphthamide is a unique post-translationally modified histidine residue found only in translation elongation factor 2 (eEF-2). It is conserved from archaea to humans and serves as the target for diphteria toxin and Pseudomonas exotoxin A. These two toxins catalyse the transfer of ADP-ribose to diphtamide on eEF-2, thus inactivating eEF-2, halting cellular protein synthesis, and causing cell death []. The biosynthesis of diphtamide is dependent on at least five proteins, DPH1 to -5, and a still unidentified amidating enzyme. DPH3 and DPH4 share a conserved region, which encode a putative zinc finger, the DPH-type or CSL-type (after the conserved motif of the final cysteine) zinc finger [, ]. The function of this motif is unknown. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 2L6L_A 1WGE_A 2JR7_A 1YOP_A 1YWS_A.
Probab=99.67 E-value=4.2e-17 Score=97.20 Aligned_cols=53 Identities=45% Similarity=0.981 Sum_probs=47.5
Q ss_pred ccccccccccCCceeeeeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEee
Q psy7548 77 SVSLSDMEHKGESQMFTYPCRCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 77 ~~~~~~~~~~~~~~~~~~~crcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~~ 133 (134)
.+++++|.+++...+|.++||||+.|.|+.+++..+ ..+++|++|||||+|..
T Consensus 2 ev~l~d~~~~~~~~~~~y~CRCG~~f~i~e~~l~~~----~~iv~C~sCSL~I~V~~ 54 (55)
T PF05207_consen 2 EVSLDDMEFDEEEGVYSYPCRCGGEFEISEEDLEEG----EVIVQCDSCSLWIRVNY 54 (55)
T ss_dssp EEETTTSEEETTTTEEEEEETTSSEEEEEHHHHHCT------EEEETTTTEEEEEE-
T ss_pred eEEhhhceecCCCCEEEEcCCCCCEEEEcchhccCc----CEEEECCCCccEEEEEe
Confidence 468899999998899999999999999999999999 89999999999999975
No 52
>PRK01356 hscB co-chaperone HscB; Provisional
Probab=99.66 E-value=1.9e-16 Score=114.01 Aligned_cols=66 Identities=29% Similarity=0.460 Sum_probs=57.4
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHcCCCCC------HHHHHHHHHHHHHHHHccChHHHHHHHHHHhh
Q psy7548 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDKN------SEHQEMFLKINEAWNILKDEKERKLYESQLLS 66 (134)
Q Consensus 1 ~~~Y~iLgv~~~--as~~~Ik~ay~~l~~~~HPDk~------~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~ 66 (134)
+|||++|||++. ++..+|+++|+++++++|||+. ..+...+..|++||+||+||.+|+.|+..+..
T Consensus 2 ~~yf~llgl~~~f~id~~~L~~aYr~lq~~~HPDk~~~~~~k~~~~~~s~~in~AY~~L~dp~~Ra~YlL~l~g 75 (166)
T PRK01356 2 QNYFQLLGLPQEYNIDLKILEKQYFAMQVKYHPDKAKTLQEKEQNLIIASELNNAYSTLKDALKRAEYMLLLQN 75 (166)
T ss_pred CCHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCCCHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHccC
Confidence 489999999997 7899999999999999999992 12234578999999999999999999887754
No 53
>PRK00294 hscB co-chaperone HscB; Provisional
Probab=99.65 E-value=1.6e-16 Score=115.06 Aligned_cols=86 Identities=22% Similarity=0.332 Sum_probs=66.2
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHcCCCC---C-----HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhc
Q psy7548 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK---N-----SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT 70 (134)
Q Consensus 1 ~~~Y~iLgv~~~--as~~~Ik~ay~~l~~~~HPDk---~-----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~ 70 (134)
+|||++|||++. .+..+|+++|+++++++|||+ . ..+...+..|++||++|+||.+|..|+..+......
T Consensus 4 ~~~F~l~~l~~~f~id~~~L~~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~IN~AY~~L~~p~~Ra~YlL~l~g~~~~ 83 (173)
T PRK00294 4 PCHFALFDLQPSFRLDLDQLATRYRELAREVHPDRFADAPEREQRLALERSASLNEAYQTLKSPPRRARYLLALSGHEVP 83 (173)
T ss_pred CChhhhcCcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCCcHHHHHHHHHHHHHHHHHHHHhCChhhhHHHHHHhcCCCCC
Confidence 589999999998 567999999999999999999 1 225677999999999999999999999887754322
Q ss_pred c-cc-ccccccccccccc
Q psy7548 71 H-MN-IYKSVSLSDMEHK 86 (134)
Q Consensus 71 ~-~~-~~~~~~~~~~~~~ 86 (134)
. .. ....+.+..|++.
T Consensus 84 ~~~~~~d~~fLme~me~r 101 (173)
T PRK00294 84 LEVTVHDPEFLLQQMQLR 101 (173)
T ss_pred cccCCCCHHHHHHHHHHH
Confidence 1 11 1233444555554
No 54
>KOG0624|consensus
Probab=99.64 E-value=1.5e-16 Score=125.16 Aligned_cols=67 Identities=25% Similarity=0.503 Sum_probs=61.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC------CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
||||+||||.++|+..||.+|||+++.+||||- +..++++|..|..|-+||+||++|+.+|..-..-
T Consensus 394 RDYYKILGVkRnAsKqEI~KAYRKlAqkWHPDNFqdEeEKKkAEKKFIDIAAAKEVLsd~EkRrqFDnGeDPL 466 (504)
T KOG0624|consen 394 RDYYKILGVKRNASKQEITKAYRKLAQKWHPDNFQDEEEKKKAEKKFIDIAAAKEVLSDPEKRRQFDNGEDPL 466 (504)
T ss_pred chHHHHhhhcccccHHHHHHHHHHHHHhcCCccccCHHHHHHHHHhhhhHHHHHHhhcCHHHHhhccCCCCCC
Confidence 699999999999999999999999999999998 4568999999999999999999999999765543
No 55
>PRK03578 hscB co-chaperone HscB; Provisional
Probab=99.64 E-value=1.8e-16 Score=115.11 Aligned_cols=68 Identities=21% Similarity=0.382 Sum_probs=58.0
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHcCCCC---CHH-----HHHHHHHHHHHHHHccChHHHHHHHHHHhhhh
Q psy7548 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK---NSE-----HQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~iLgv~~~--as~~~Ik~ay~~l~~~~HPDk---~~~-----~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
+|||+||||++. ++..+|+++|+++++++|||+ .+. +.+.+..||+||++|+||.+|..|...+....
T Consensus 6 ~dyf~llglp~~f~~d~~~L~~~yr~lq~~~HPD~~~~~~~~e~~~a~~~s~~iN~AY~tL~~p~~Ra~Yll~l~G~~ 83 (176)
T PRK03578 6 DDHFSLFGLPARFALDEAALDAAYRTVQAQVHPDRFAAAGDAEKRVAMQWATRANEAYQTLRDPLKRARYLLHLRGVD 83 (176)
T ss_pred CCHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCCCCCHHHHHHHHHHHHHHHHHHHHhCChhhHHHHHHHhcCCC
Confidence 589999999996 578999999999999999999 222 33446899999999999999999998776443
No 56
>KOG0550|consensus
Probab=99.57 E-value=2.3e-15 Score=120.13 Aligned_cols=83 Identities=27% Similarity=0.393 Sum_probs=70.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC---C-HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhccccccc
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---N-SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHMNIYK 76 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk---~-~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~~~~~ 76 (134)
++||.||||..+|+..+||+|||++++.+|||+ + .+++.+|++|.+||.+|+||.+|..||.........+++ .
T Consensus 373 kd~ykilGi~~~as~~eikkayrk~AL~~Hpd~~agsq~eaE~kFkevgeAy~il~d~~kr~r~dsg~dle~~~~~~--a 450 (486)
T KOG0550|consen 373 KDWYKILGISRNASDDEIKKAYRKLALVHHPDKNAGSQKEAEAKFKEVGEAYTILSDPMKRVRFDSGQDLEEVGSGG--A 450 (486)
T ss_pred hhHHHHhhhhhhcccchhhhHHHHHHHHhCCCcCcchhHHHHHHHHHHHHHHHHhcCHHHHhhcccccchhhhcCCC--c
Confidence 589999999999999999999999999999999 3 678999999999999999999999999877665544333 4
Q ss_pred ccccccccc
Q psy7548 77 SVSLSDMEH 85 (134)
Q Consensus 77 ~~~~~~~~~ 85 (134)
.+++++++-
T Consensus 451 ~~dp~~~~~ 459 (486)
T KOG0550|consen 451 GFDPFNIFR 459 (486)
T ss_pred CcChhhhhh
Confidence 555555543
No 57
>COG5216 Uncharacterized conserved protein [Function unknown]
Probab=99.54 E-value=9.4e-15 Score=86.58 Aligned_cols=57 Identities=30% Similarity=0.752 Sum_probs=53.2
Q ss_pred ccccccccccccccCCceeeeeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEee
Q psy7548 73 NIYKSVSLSDMEHKGESQMFTYPCRCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~crcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~~ 133 (134)
.+++.+.+.+|.+..+..+|.++|.||+.|.|+++++..| +.+..|++|||.|.|+.
T Consensus 2 sfYDEieiedftf~~e~~~ftyPCPCGDRFeIsLeDl~~G----E~VArCPSCSLiv~vvy 58 (67)
T COG5216 2 SFYDEIEIEDFTFSREEKTFTYPCPCGDRFEISLEDLRNG----EVVARCPSCSLIVCVVY 58 (67)
T ss_pred CccceeEeeeeEEcCCCceEEecCCCCCEeEEEHHHhhCC----ceEEEcCCceEEEEEEe
Confidence 3567889999999999999999999999999999999999 99999999999999864
No 58
>KOG0722|consensus
Probab=99.52 E-value=6.9e-15 Score=111.07 Aligned_cols=68 Identities=31% Similarity=0.588 Sum_probs=62.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
.|.|+||||.++|+..+|.+|||+|++++|||+ ++++.+.|+.|..||++|.|.+.|..||-.+....
T Consensus 33 enCYdVLgV~Rea~KseIakAYRqLARrhHPDr~r~~e~k~~F~~iAtayeilkd~e~rt~ydyaldhpd 102 (329)
T KOG0722|consen 33 ENCYDVLGVAREANKSEIAKAYRQLARRHHPDRNRDPESKKLFVKIATAYEILKDNETRTQYDYALDHPD 102 (329)
T ss_pred hhHHHHhhhhhhccHHHHHHHHHHHHHHhCCcccCCchhhhhhhhhhcccccccchhhHHhHHHHhcCch
Confidence 378999999999999999999999999999999 66677899999999999999999999998777544
No 59
>PTZ00100 DnaJ chaperone protein; Provisional
Probab=99.47 E-value=3.6e-14 Score=96.21 Aligned_cols=50 Identities=34% Similarity=0.518 Sum_probs=45.7
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHcc
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILK 52 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~ 52 (134)
++|+||||++++|.++||++||+|++++|||+. .+...|.+|++||++|.
T Consensus 66 eAy~ILGv~~~As~~eIkkaYRrLa~~~HPDkg-Gs~~~~~kIneAyevL~ 115 (116)
T PTZ00100 66 EAYKILNISPTASKERIREAHKQLMLRNHPDNG-GSTYIASKVNEAKDLLL 115 (116)
T ss_pred HHHHHcCCCCCCCHHHHHHHHHHHHHHhCCCCC-CCHHHHHHHHHHHHHHh
Confidence 579999999999999999999999999999984 45678999999999985
No 60
>KOG0714|consensus
Probab=99.44 E-value=4.6e-14 Score=107.39 Aligned_cols=65 Identities=37% Similarity=0.633 Sum_probs=60.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CH--HHHHHHHHHHHHHHHccChHHHHHHHHHHh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NS--EHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~--~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+++|.||||.+.|+.++|++||+++++++|||+ .+ .+..+|++|.+||++|+|+.+|..||.++.
T Consensus 3 ~d~~~~l~i~~~as~~~i~ka~~~~a~~~hpdk~~~~~~~~~~~~~~~~ea~~~ls~~~kr~~~d~~~~ 71 (306)
T KOG0714|consen 3 KDYYKILGIARSASEEDIKKAYRKLALKYHPDKNPSPKEVAEAKFKEIAEAYEVLSDPKKRKIYDQYGE 71 (306)
T ss_pred ccHHHHhCccccccHHHHHHHHHHHHHhhCCCCCCCchhhHHHHHhhhhccccccCCHHHhhhccccCc
Confidence 489999999999999999999999999999999 22 455689999999999999999999999998
No 61
>PRK09430 djlA Dna-J like membrane chaperone protein; Provisional
Probab=99.44 E-value=9.2e-14 Score=106.92 Aligned_cols=52 Identities=27% Similarity=0.491 Sum_probs=48.1
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCCC----------HHHHHHHHHHHHHHHHccC
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN----------SEHQEMFLKINEAWNILKD 53 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~----------~~~~~~f~~i~~Ay~~L~d 53 (134)
++|+||||++++|.++||++||+|++++|||+. +.+.++|++|++||++|+.
T Consensus 201 ~ay~vLgv~~~as~~eIk~aYr~L~~~~HPDk~~~~g~~~~~~~~a~ek~~~I~~AYe~L~~ 262 (267)
T PRK09430 201 DAYKVLGVSESDDDQEIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKEKAQEIQAAYELIKK 262 (267)
T ss_pred hHHHHcCCCCCCCHHHHHHHHHHHHHHhCcCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Confidence 689999999999999999999999999999991 3578899999999999985
No 62
>PHA02624 large T antigen; Provisional
Probab=99.42 E-value=1.3e-13 Score=115.41 Aligned_cols=58 Identities=19% Similarity=0.377 Sum_probs=54.5
Q ss_pred CccccccCCCCC--CHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHH
Q psy7548 2 NHFEILNCNVNS--TLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLY 60 (134)
Q Consensus 2 ~~Y~iLgv~~~a--s~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Y 60 (134)
++|++|||+++| +.++||+|||++++++|||+. .+.++|++|++||++|+|+.+|..|
T Consensus 12 elyelLGL~~~A~gs~~eIKkAYRkLAkkyHPDKg-Gdeekfk~Ln~AYevL~d~~k~~r~ 71 (647)
T PHA02624 12 ELMDLLGLPMAAWGNLPLMRKAYLRKCKEYHPDKG-GDEEKMKRLNSLYKKLQEGVKSARQ 71 (647)
T ss_pred HHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCC-CcHHHHHHHHHHHHHHhcHHHhhhc
Confidence 589999999999 999999999999999999994 3467899999999999999999998
No 63
>PRK01773 hscB co-chaperone HscB; Provisional
Probab=99.41 E-value=5.9e-13 Score=96.46 Aligned_cols=65 Identities=26% Similarity=0.280 Sum_probs=58.0
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHcCCCC---C-----HHHHHHHHHHHHHHHHccChHHHHHHHHHHh
Q psy7548 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK---N-----SEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~iLgv~~~--as~~~Ik~ay~~l~~~~HPDk---~-----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+|||++||+++. .+...++++|++|.+++|||+ . ..+.+....||+||.+|+||.+|+.|-..+.
T Consensus 2 ~nyF~lf~lp~~F~iD~~~L~~~y~~Lq~~~HPD~f~~~~~~eq~~a~~~ss~iN~AY~tLkdPl~RA~YLL~L~ 76 (173)
T PRK01773 2 NNPFALFDLPVDFQLDNALLSERYLALQKSLHPDNFANSSAQEQRLAMQKSAEVNDALQILKDPILRAEAIIALN 76 (173)
T ss_pred CChHHhcCCCCCCCCCHHHHHHHHHHHHHHhCcCcccCCCHHHHHHHHHHHHHHHHHHHHHCChHHHHHHHHHhc
Confidence 589999999997 789999999999999999999 1 2345678899999999999999999988776
No 64
>COG5407 SEC63 Preprotein translocase subunit Sec63 [Intracellular trafficking and secretion]
Probab=99.40 E-value=2.2e-13 Score=109.84 Aligned_cols=70 Identities=33% Similarity=0.593 Sum_probs=64.1
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCC--------CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcc
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK--------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
|+|||||+..+++..+||++||+|+.++|||| ...-++.++.|++||+.|+|...|+.|-.+|.....++
T Consensus 99 DPyEILGI~~~ts~rdik~~yr~Ls~KfhpdK~~~mvn~~rse~Ee~y~~ItkAY~~lTd~k~renyl~yGtPd~pQh 176 (610)
T COG5407 99 DPYEILGIDQDTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTDKKRRENYLNYGTPDSPQH 176 (610)
T ss_pred ChHHhhcccCCCcHHHHHHHHHhheeecChhhcCCCChhHHHHHHHHHHHHHHHHHhhhhHHHHHHHHhcCCCCCCcc
Confidence 79999999999999999999999999999999 34678899999999999999999999999988766543
No 65
>TIGR00714 hscB Fe-S protein assembly co-chaperone HscB. This model describes the small subunit, Hsc20 (20K heat shock cognate protein) of a pair of proteins Hsc66-Hsc20, related to the DnaK-DnaJ heat shock proteins, which also serve as molecular chaperones. Hsc20, unlike DnaJ, appears not to have chaperone activity on its own, but to act solely as a regulatory subunit for Hsc66 (i.e., to be a co-chaperone). The gene for Hsc20 in E. coli, hscB, is not induced by heat shock.
Probab=99.26 E-value=7.3e-12 Score=89.51 Aligned_cols=55 Identities=27% Similarity=0.369 Sum_probs=48.0
Q ss_pred CCHHHHHHHHHHHHHHcCCCC--C------HHHHHHHHHHHHHHHHccChHHHHHHHHHHhhh
Q psy7548 13 STLEEIKENYKSLILKYHPDK--N------SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 13 as~~~Ik~ay~~l~~~~HPDk--~------~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
.+..+|+++|+++++++|||+ . ..+...+..|++||++|+||.+|+.|+..+...
T Consensus 3 iD~~~L~~~yr~lq~~~HPD~~~~~~~~~~~~a~~~s~~iN~AY~~L~~p~~Ra~ylL~l~g~ 65 (157)
T TIGR00714 3 LDTQALSLRYQDLQRQYHPDKFASGSAQEQLAAVQQSTTLNQAYQTLKDPLMRAEYMLSLHGI 65 (157)
T ss_pred CCHHHHHHHHHHHHHHHCcCCCCCCChhhhHHHHHHHHHHHHHHHHhCChhhhHHHHHHhcCC
Confidence 467899999999999999998 1 225678999999999999999999999888743
No 66
>COG5269 ZUO1 Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones]
Probab=99.11 E-value=4.5e-11 Score=91.36 Aligned_cols=80 Identities=28% Similarity=0.399 Sum_probs=64.8
Q ss_pred CccccccCCC---CCCHHHHHHHHHHHHHHcCCCC-----CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhhhcccc
Q psy7548 2 NHFEILNCNV---NSTLEEIKENYKSLILKYHPDK-----NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHMN 73 (134)
Q Consensus 2 ~~Y~iLgv~~---~as~~~Ik~ay~~l~~~~HPDk-----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~~ 73 (134)
|+|.+||++. -|++.+|.++.++.+.+||||+ +-.....|+.|+.||+||+|+.+|..||.--.......+.
T Consensus 44 DlYa~lgLskyR~ka~~~qi~kah~kkv~kyHPDk~aa~g~~~~d~fFk~iqkA~evL~D~~~R~qyDS~df~advppp~ 123 (379)
T COG5269 44 DLYALLGLSKYRTKAIPPQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAREVLGDRKLRLQYDSNDFDADVPPPR 123 (379)
T ss_pred hHHHHhchHhhhcCCCcHHHHHHHHHHHHHhCccchhccCCCCcHHHHHHHHHHHHHhccHHHHhhccccccccCCCCcc
Confidence 6899999986 5789999999999999999999 3556788999999999999999999999776655544433
Q ss_pred cccccccc
Q psy7548 74 IYKSVSLS 81 (134)
Q Consensus 74 ~~~~~~~~ 81 (134)
.+....++
T Consensus 124 ~~t~~~Ff 131 (379)
T COG5269 124 IYTPDEFF 131 (379)
T ss_pred CCCchhHH
Confidence 44333333
No 67
>KOG1150|consensus
Probab=99.08 E-value=2.8e-10 Score=83.54 Aligned_cols=60 Identities=35% Similarity=0.553 Sum_probs=54.6
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCC----CHHHHHHHHHHHHHHHHccChHHHHHHH
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK----NSEHQEMFLKINEAWNILKDEKERKLYE 61 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd 61 (134)
|+|+||.|.|..+.++||+.||+|++..|||| .+.|...|-.|.+||..|-|+..|..-+
T Consensus 54 NpfeVLqIdpev~~edikkryRklSilVHPDKN~Dd~~rAqkAFdivkKA~k~l~n~~~rkr~~ 117 (250)
T KOG1150|consen 54 NPFEVLQIDPEVTDEDIKKRYRKLSILVHPDKNPDDAERAQKAFDIVKKAYKLLENDKIRKRCL 117 (250)
T ss_pred ChHHHHhcCCCCCHHHHHHHHHhhheeecCCCCcccHHHHHHHHHHHHHHHHHHhCHHHHHHHH
Confidence 79999999999999999999999999999999 3567888999999999999998666544
No 68
>KOG0568|consensus
Probab=98.69 E-value=1.3e-08 Score=76.48 Aligned_cols=54 Identities=28% Similarity=0.618 Sum_probs=48.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHH-HccCh
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWN-ILKDE 54 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~-~L~d~ 54 (134)
|.+|.||||...|+.++++.+|..|++++|||. ...+.+.|.+|.+||. ||+..
T Consensus 47 ~e~fril~v~e~~~adevr~af~~lakq~hpdsgs~~adaa~f~qideafrkvlq~~ 103 (342)
T KOG0568|consen 47 MECFRILGVEEGADADEVREAFHDLAKQVHPDSGSEEADAARFIQIDEAFRKVLQEK 103 (342)
T ss_pred HHHHHHhcccccCchhHHHHHHHHHHHHcCCCCCCccccHHHHHHHHHHHHHHHHHH
Confidence 457999999999999999999999999999999 5667889999999998 77643
No 69
>KOG1789|consensus
Probab=98.56 E-value=9.3e-08 Score=84.27 Aligned_cols=50 Identities=40% Similarity=0.705 Sum_probs=44.4
Q ss_pred ccccccCCCC----CCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHcc
Q psy7548 3 HFEILNCNVN----STLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILK 52 (134)
Q Consensus 3 ~Y~iLgv~~~----as~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~ 52 (134)
-|+||.|+-+ ...+.||++|++|+.+|||||+|+..++|..+++||+.|.
T Consensus 1283 A~eiL~i~l~n~~hD~~~KirrqY~kLA~kYHPDKNPEGRemFe~VnKAYE~L~ 1336 (2235)
T KOG1789|consen 1283 AREILSVDLTNEEHDKPAKIRRQYYKLAAKYHPDKNPEGREMFERVNKAYELLS 1336 (2235)
T ss_pred HHHHhccccCCCCcccHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHH
Confidence 3788888753 3458899999999999999999999999999999999998
No 70
>KOG0723|consensus
Probab=98.46 E-value=3.6e-07 Score=60.77 Aligned_cols=47 Identities=32% Similarity=0.428 Sum_probs=40.6
Q ss_pred ccccCCCCCCHHHHHHHHHHHHHHcCCCC--CHHHHHHHHHHHHHHHHccCh
Q psy7548 5 EILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDE 54 (134)
Q Consensus 5 ~iLgv~~~as~~~Ik~ay~~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~ 54 (134)
.||||++.++.+.||+|+|++++..|||+ .|-- -.+|+||+++|...
T Consensus 60 lIL~v~~s~~k~KikeaHrriM~~NHPD~GGSPYl---AsKINEAKdlLe~~ 108 (112)
T KOG0723|consen 60 LILGVTPSLDKDKIKEAHRRIMLANHPDRGGSPYL---ASKINEAKDLLEGT 108 (112)
T ss_pred HHhCCCccccHHHHHHHHHHHHHcCCCcCCCCHHH---HHHHHHHHHHHhcc
Confidence 48999999999999999999999999999 4443 34899999998753
No 71
>KOG3192|consensus
Probab=98.09 E-value=5.6e-06 Score=58.69 Aligned_cols=68 Identities=28% Similarity=0.489 Sum_probs=55.4
Q ss_pred CCccccccCCC--CCCHHHHHHHHHHHHHHcCCCC--------CHHHHHHHHHHHHHHHHccChHHHHHHHHHHhhhh
Q psy7548 1 MNHFEILNCNV--NSTLEEIKENYKSLILKYHPDK--------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~iLgv~~--~as~~~Ik~ay~~l~~~~HPDk--------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
++||.++|... ...++.++..|.-...++|||+ ...+.+....|++||.+|.||-+|+.|-..+....
T Consensus 8 ~~ff~~Fg~e~~~~~~p~~l~~~~~~~skkL~~d~~~~~~~~~~d~a~eqSa~lnkAY~TLk~pL~RA~Yilkl~g~e 85 (168)
T KOG3192|consen 8 SRFFDIFGMELSFKIDPDKLKEKYTDISKKLHPDRPGLSFAGDTDQASEQSAELNKAYDTLKDPLARARYLLKLKGQE 85 (168)
T ss_pred HHHHHHhccccCCCCCcchhhHHHHHHHHhhCcccccccccccchhHHHHHHHHHHHHHHHHhHHHHHHHHHHHhCCC
Confidence 46899997654 4466777778999999999999 33467778999999999999999999987776644
No 72
>COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
Probab=97.54 E-value=6.2e-05 Score=54.54 Aligned_cols=50 Identities=26% Similarity=0.535 Sum_probs=43.8
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCC----------CHHHHHHHHHHHHHHHHc
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK----------NSEHQEMFLKINEAWNIL 51 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk----------~~~~~~~f~~i~~Ay~~L 51 (134)
+.|.+||++..+...+|+++|+++....|||+ -..+.+++++|++||+.+
T Consensus 114 ~~l~~l~~~~~~~~~~i~~~~r~l~~e~~~d~a~~~~~~~e~~~~~~~~~~~i~~a~~~~ 173 (174)
T COG1076 114 DALKVLGVEIKADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYEDI 173 (174)
T ss_pred hHHHHhcCchhhhHHHHHHHHHHHHHhcCHHHHHHhcCCHHHHHHHHHHHHHHHHHHHhc
Confidence 56899999999999999999999999999999 245678888899888753
No 73
>KOG0431|consensus
Probab=97.48 E-value=0.00013 Score=60.27 Aligned_cols=45 Identities=29% Similarity=0.505 Sum_probs=33.9
Q ss_pred ccCCCCCCHHHHHHHHHHHHHHcCCCC----------CHHHHHHHHHHHHHHHHc
Q psy7548 7 LNCNVNSTLEEIKENYKSLILKYHPDK----------NSEHQEMFLKINEAWNIL 51 (134)
Q Consensus 7 Lgv~~~as~~~Ik~ay~~l~~~~HPDk----------~~~~~~~f~~i~~Ay~~L 51 (134)
+++..-.+.++||++|||..|..|||| +-.+++.|-.+++||...
T Consensus 394 VsltDLVtp~~VKKaYrKA~L~VHPDKlqq~gas~~qK~Iaekvfd~l~eawn~f 448 (453)
T KOG0431|consen 394 VSLTDLVTPAQVKKAYRKAVLCVHPDKLQQKGASLEQKYIAEKVFDALSEAWNKF 448 (453)
T ss_pred CchhhccCHHHHHHHHHhhhheeCcccccCCcccHHHHHHHHHHHHHHHHHHHhh
Confidence 345556799999999999999999999 123556667777777643
No 74
>COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
Probab=97.22 E-value=0.00027 Score=51.22 Aligned_cols=65 Identities=26% Similarity=0.383 Sum_probs=52.5
Q ss_pred CCccccccCCCCC--CHHHHHHHHHHHHHHcCCCC---C-----HHHHHHHHHHHHHHHHccChHHHHHHHHHHh
Q psy7548 1 MNHFEILNCNVNS--TLEEIKENYKSLILKYHPDK---N-----SEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~iLgv~~~a--s~~~Ik~ay~~l~~~~HPDk---~-----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
++++..+|.++.+ ..+.++..|+.+.+.+|||+ . ..+.+.+..++.||.+|.+|.+|..|-..+.
T Consensus 1 ~~~~~~~~~~~~f~~~~~~l~~~~~~~~~~~~~dr~~~~~~~~~~~~l~~~~~~~~a~~tLk~~l~ra~~~lal~ 75 (174)
T COG1076 1 SDGFVLFGLPRAFQIDLDALKLQYRELQRAYHPDRFGKASEAEQRKALQQSAEVNPAYQTLKDPLLRAEYLLALA 75 (174)
T ss_pred CCcccccccHHHHHHHHhHhhhhHHHHHHhhCcccccccchHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHhc
Confidence 3566777777765 34569999999999999999 1 1244678999999999999999999987666
No 75
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=96.49 E-value=0.0053 Score=35.44 Aligned_cols=36 Identities=33% Similarity=0.736 Sum_probs=27.4
Q ss_pred eeeeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEeeC
Q psy7548 90 QMFTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEITAR 134 (134)
Q Consensus 90 ~~~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~~~ 134 (134)
.+..|.| +||..+ ++.....+ +.|+.|+..|.+..|
T Consensus 3 ~~~~Y~C~~Cg~~~--~~~~~~~~-------irCp~Cg~rIl~K~R 39 (49)
T COG1996 3 AMMEYKCARCGREV--ELDQETRG-------IRCPYCGSRILVKER 39 (49)
T ss_pred ceEEEEhhhcCCee--ehhhccCc-------eeCCCCCcEEEEecc
Confidence 3467899 999988 33444555 999999999998653
No 76
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=95.81 E-value=0.017 Score=31.15 Aligned_cols=34 Identities=15% Similarity=0.473 Sum_probs=28.4
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v 131 (134)
..| +||..|.|..+.+... ...+.|+.|...+.+
T Consensus 3 ~~CP~C~~~~~v~~~~~~~~----~~~v~C~~C~~~~~~ 37 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLGAN----GGKVRCGKCGHVWYA 37 (38)
T ss_pred EECCCCCCEEEeCHHHcCCC----CCEEECCCCCCEEEe
Confidence 467 9999999999988754 347999999998876
No 77
>PF03656 Pam16: Pam16; InterPro: IPR005341 The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM) []. In Saccharomyces cerevisiae (Baker's yeast), Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane [].; PDB: 2GUZ_J.
Probab=95.80 E-value=0.019 Score=39.71 Aligned_cols=50 Identities=30% Similarity=0.329 Sum_probs=35.9
Q ss_pred ccccCCCCCCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChH
Q psy7548 5 EILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEK 55 (134)
Q Consensus 5 ~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~ 55 (134)
.||||++..+.++|.+.|.+|....+|++ .++.-.-.+|..|.+.|....
T Consensus 62 ~ILnv~~~~~~eeI~k~y~~Lf~~Nd~~k-GGSfYLQSKV~rAKErl~~El 111 (127)
T PF03656_consen 62 QILNVKEELSREEIQKRYKHLFKANDPSK-GGSFYLQSKVFRAKERLEQEL 111 (127)
T ss_dssp HHHT--G--SHHHHHHHHHHHHHHT-CCC-TS-HHHHHHHHHHHHHHHHHH
T ss_pred HHcCCCCccCHHHHHHHHHHHHhccCCCc-CCCHHHHHHHHHHHHHHHHHH
Confidence 69999999999999999999999999997 234444567888888877444
No 78
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=94.04 E-value=0.065 Score=28.95 Aligned_cols=34 Identities=15% Similarity=0.585 Sum_probs=28.3
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v 131 (134)
..| .|+..|.|+.+.+-.+ -..+.|+.|.....|
T Consensus 3 i~CP~C~~~f~v~~~~l~~~----~~~vrC~~C~~~f~v 37 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDDKLPAG----GRKVRCPKCGHVFRV 37 (37)
T ss_pred EECCCCCceEEcCHHHcccC----CcEEECCCCCcEeeC
Confidence 357 8999999999988776 568999999987654
No 79
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=92.56 E-value=0.16 Score=27.28 Aligned_cols=32 Identities=19% Similarity=0.637 Sum_probs=26.7
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
..| +|+..|.|..+.+-.. -..+.|+.|+...
T Consensus 3 i~Cp~C~~~y~i~d~~ip~~----g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIPPK----GRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCCCC----CcEEECCCCCCEe
Confidence 357 8999999999988765 5689999999764
No 80
>PF13446 RPT: A repeated domain in UCH-protein
Probab=92.42 E-value=0.3 Score=29.11 Aligned_cols=24 Identities=21% Similarity=0.326 Sum_probs=22.2
Q ss_pred cccccCCCCCCHHHHHHHHHHHHH
Q psy7548 4 FEILNCNVNSTLEEIKENYKSLIL 27 (134)
Q Consensus 4 Y~iLgv~~~as~~~Ik~ay~~l~~ 27 (134)
|++|||+++.+.+.|-.+|+....
T Consensus 8 y~~Lgi~~~~~Dd~Ii~~f~~~~~ 31 (62)
T PF13446_consen 8 YEILGIDEDTDDDFIISAFQSKVN 31 (62)
T ss_pred HHHhCcCCCCCHHHHHHHHHHHHH
Confidence 889999999999999999998766
No 81
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=92.29 E-value=0.38 Score=27.05 Aligned_cols=31 Identities=26% Similarity=0.624 Sum_probs=23.6
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEee
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~~ 133 (134)
.|.| .||..+.+.. ...+.|+.|...|....
T Consensus 2 ~Y~C~~Cg~~~~~~~----------~~~irC~~CG~rIlyK~ 33 (44)
T smart00659 2 IYICGECGRENEIKS----------KDVVRCRECGYRILYKK 33 (44)
T ss_pred EEECCCCCCEeecCC----------CCceECCCCCceEEEEe
Confidence 3778 8999888761 13389999999998754
No 82
>KOG0724|consensus
Probab=92.02 E-value=0.22 Score=39.30 Aligned_cols=53 Identities=34% Similarity=0.492 Sum_probs=42.7
Q ss_pred CCCHHHHHHHHHHHHHHcCCCCCH-------HHHHHHHHHHHHHHHccChHHHHHHHHHH
Q psy7548 12 NSTLEEIKENYKSLILKYHPDKNS-------EHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 12 ~as~~~Ik~ay~~l~~~~HPDk~~-------~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
.++..+|+.+|+..++..||++.. ...+.++.|.+||.+|.+...|...|...
T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~i~ka~~i~~~~~~~~t~~~~~ 62 (335)
T KOG0724|consen 3 LASEDELRLAYREMALKSHPEKKSFYEKLSLWTEEEFKKIEKALAILDDDEPRRTPDSWD 62 (335)
T ss_pred cccHHHHHHHHHHHhhhcCcHHHHHHHHhhhhHHHHHHHHHHHHHHHhccccccchhhhh
Confidence 467889999999999999999832 45677999999999999876655555444
No 83
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length.
Probab=91.13 E-value=0.71 Score=34.13 Aligned_cols=38 Identities=21% Similarity=0.372 Sum_probs=29.9
Q ss_pred CCCCCHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHcc
Q psy7548 10 NVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILK 52 (134)
Q Consensus 10 ~~~as~~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~ 52 (134)
+++||.+||.+|+.++..+|--|. +.-..|..||+.|.
T Consensus 1 S~~ASfeEIq~Arn~ll~~y~gd~-----~~~~~IEaAYD~IL 38 (194)
T PF11833_consen 1 SEDASFEEIQAARNRLLAQYAGDE-----KSREAIEAAYDAIL 38 (194)
T ss_pred CCCCCHHHHHHHHHHHHHHhcCCH-----HHHHHHHHHHHHHH
Confidence 578999999999999999996653 33347888987654
No 84
>PF05180 zf-DNL: DNL zinc finger; InterPro: IPR007853 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The DNL-type zinc finger is found in Tim15, a zinc finger protein essential for protein import into mitochondria. Mitochondrial functions rely on the correct transport of resident proteins synthesized in the cytosol to mitochondria. Protein import into mitochondria is mediated by membrane protein complexes, protein translocators, in the outer and inner mitochondrial membranes, in cooperation with their assistant proteins in the cytosol, intermembrane space and matrix. Proteins destined to the mitochondrial matrix cross the outer membrane with the aid of the outer membrane translocator, the tOM40 complex, and then the inner membrane with the aid of the inner membrane translocator, the TIM23 complex, and mitochondrial motor and chaperone (MMC) proteins including mitochondrial heat- shock protein 70 (mtHsp70), and translocase in the inner mitochondrial membrane (Tim)15. Tim15 is also known as zinc finger motif (Zim)17 or mtHsp70 escort protein (Hep)1. Tim15 contains a zinc-finger motif (CXXC and CXXC) of ~100 residues, which has been named DNL after a short C-terminal motif of D(N/H)L [, , ]. The DNL-type zinc finger is an L-shaped molecule. The two CXXC motifs are located at the end of the L, and are sandwiched by two- stranded antiparallel beta-sheets. Two short alpha-helices constitute another leg of the L. The outer (convex) face of the L has a large acidic groove, which is lined with five acidic residues, whereas the inner (concave) face of the L has two positively charged residues, next to the CXXC motifs []. This entry represents the DNL-type zinc finger.; GO: 0008270 zinc ion binding; PDB: 2E2Z_A.
Probab=89.94 E-value=0.3 Score=29.95 Aligned_cols=35 Identities=23% Similarity=0.583 Sum_probs=25.9
Q ss_pred eeeC-CCC--CeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7548 93 TYPC-RCG--AEFCIEEQDTEGDSSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 93 ~~~c-rcG--~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v 131 (134)
.+.| .|+ ..-.++......| .+++.|++|..+..|
T Consensus 4 ~FTC~~C~~Rs~~~~sk~aY~~G----vViv~C~gC~~~HlI 41 (66)
T PF05180_consen 4 TFTCNKCGTRSAKMFSKQAYHKG----VVIVQCPGCKNRHLI 41 (66)
T ss_dssp EEEETTTTEEEEEEEEHHHHHTS----EEEEE-TTS--EEES
T ss_pred EEEcCCCCCccceeeCHHHHhCC----eEEEECCCCcceeee
Confidence 4667 787 5777888888888 899999999988765
No 85
>PF14687 DUF4460: Domain of unknown function (DUF4460)
Probab=89.64 E-value=0.83 Score=30.84 Aligned_cols=23 Identities=17% Similarity=0.404 Sum_probs=20.3
Q ss_pred CCCCHHHHHHHHHHHHHHcCCCC
Q psy7548 11 VNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 11 ~~as~~~Ik~ay~~l~~~~HPDk 33 (134)
+..+..+++.|.|...++.|||.
T Consensus 4 r~~~~~~l~~aLr~Fy~~VHPDl 26 (112)
T PF14687_consen 4 RNLSSPDLRSALRPFYFAVHPDL 26 (112)
T ss_pred hhhhhHHHHHHHHHHHHHhCCcc
Confidence 34577889999999999999997
No 86
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=87.83 E-value=1.4 Score=24.65 Aligned_cols=30 Identities=33% Similarity=0.769 Sum_probs=22.4
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v 131 (134)
.+.| .||..+.+..... .+.|+.|...+..
T Consensus 3 ~y~C~~CG~~~~~~~~~~---------~~~Cp~CG~~~~~ 33 (46)
T PRK00398 3 EYKCARCGREVELDEYGT---------GVRCPYCGYRILF 33 (46)
T ss_pred EEECCCCCCEEEECCCCC---------ceECCCCCCeEEE
Confidence 4778 8999988875532 3799999887654
No 87
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=87.17 E-value=0.82 Score=25.21 Aligned_cols=32 Identities=28% Similarity=0.776 Sum_probs=23.3
Q ss_pred eeeeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 90 QMFTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 90 ~~~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
..|.|.| .||..|++...--+ ...+.|+.|.-
T Consensus 2 P~Yey~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~ 34 (42)
T PF09723_consen 2 PIYEYRCEECGHEFEVLQSISE------DDPVPCPECGS 34 (42)
T ss_pred CCEEEEeCCCCCEEEEEEEcCC------CCCCcCCCCCC
Confidence 4588999 79999988654322 23479999976
No 88
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=85.98 E-value=1.7 Score=25.54 Aligned_cols=33 Identities=36% Similarity=0.867 Sum_probs=24.8
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEee
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~~ 133 (134)
..| .||..+.+..... | .++.|+.|..-+.|..
T Consensus 3 ~~CP~CG~~iev~~~~~--G-----eiV~Cp~CGaeleVv~ 36 (54)
T TIGR01206 3 FECPDCGAEIELENPEL--G-----ELVICDECGAELEVVS 36 (54)
T ss_pred cCCCCCCCEEecCCCcc--C-----CEEeCCCCCCEEEEEe
Confidence 357 8999888865443 5 3689999999988853
No 89
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=85.41 E-value=1.4 Score=23.57 Aligned_cols=33 Identities=24% Similarity=0.688 Sum_probs=23.3
Q ss_pred eeeeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7548 90 QMFTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLL 128 (134)
Q Consensus 90 ~~~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~ 128 (134)
..|.+.| .||..|.+...... ...+.|+.|..-
T Consensus 2 p~Y~y~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~~ 35 (41)
T smart00834 2 PIYEYRCEDCGHTFEVLQKISD------DPLATCPECGGD 35 (41)
T ss_pred CCEEEEcCCCCCEEEEEEecCC------CCCCCCCCCCCc
Confidence 3578899 89998987765321 134789999873
No 90
>PF05280 FlhC: Flagellar transcriptional activator (FlhC); InterPro: IPR007944 This family consists of several bacterial flagellar transcriptional activator (FlhC) proteins. FlhC combines with FlhD to form a regulatory complex in Escherichia coli, this complex has been shown to be a global regulator involved in many cellular processes as well as a flagellar transcriptional activator [].; GO: 0003677 DNA binding, 0030092 regulation of flagellum assembly, 0045893 positive regulation of transcription, DNA-dependent; PDB: 2AVU_E.
Probab=85.22 E-value=0.69 Score=33.67 Aligned_cols=30 Identities=30% Similarity=0.731 Sum_probs=13.9
Q ss_pred eeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7548 92 FTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLL 128 (134)
Q Consensus 92 ~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~ 128 (134)
--.+| +||+.|.+...++... +.|+.|++.
T Consensus 133 ~l~~C~~C~~~fv~~~~~~~~~-------~~Cp~C~~p 163 (175)
T PF05280_consen 133 QLAPCRRCGGHFVTHAHDPRHS-------FVCPFCQPP 163 (175)
T ss_dssp EEEE-TTT--EEEEESS--SS-----------TT----
T ss_pred cccCCCCCCCCeECcCCCCCcC-------cCCCCCCCc
Confidence 35788 7999999998876666 799999864
No 91
>COG5552 Uncharacterized conserved protein [Function unknown]
Probab=81.61 E-value=5.3 Score=25.10 Aligned_cols=33 Identities=24% Similarity=0.349 Sum_probs=26.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHc----CCCC
Q psy7548 1 MNHFEILNCNVNSTLEEIKENYKSLILKY----HPDK 33 (134)
Q Consensus 1 ~~~Y~iLgv~~~as~~~Ik~ay~~l~~~~----HPDk 33 (134)
+|.-+++|+.|.|+..||+.|-++..+++ ||.+
T Consensus 3 RNIk~LfnfdPPAT~~EvrdAAlQfVRKlSGtT~PS~ 39 (88)
T COG5552 3 RNIKELFNFDPPATPVEVRDAALQFVRKLSGTTHPSA 39 (88)
T ss_pred cchHHHhCCCCCCCcHHHHHHHHHHHHHhcCCCCcch
Confidence 34557889999999999999988877776 6655
No 92
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=81.07 E-value=1.9 Score=24.47 Aligned_cols=32 Identities=25% Similarity=0.761 Sum_probs=22.2
Q ss_pred eeeeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 90 QMFTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 90 ~~~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
..|.|.| .||..|++....-+ ...+.|+.|..
T Consensus 2 P~Yey~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~ 34 (52)
T TIGR02605 2 PIYEYRCTACGHRFEVLQKMSD------DPLATCPECGG 34 (52)
T ss_pred CCEEEEeCCCCCEeEEEEecCC------CCCCCCCCCCC
Confidence 3578999 79998988643211 23467999986
No 93
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=80.66 E-value=1.4 Score=23.07 Aligned_cols=31 Identities=32% Similarity=0.649 Sum_probs=18.5
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEeeC
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEITAR 134 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~~~ 134 (134)
|-| +||..+.+... ..+.|+.|...|....|
T Consensus 1 Y~C~~Cg~~~~~~~~----------~~irC~~CG~RIlyK~R 32 (32)
T PF03604_consen 1 YICGECGAEVELKPG----------DPIRCPECGHRILYKKR 32 (32)
T ss_dssp EBESSSSSSE-BSTS----------STSSBSSSS-SEEBE--
T ss_pred CCCCcCCCeeEcCCC----------CcEECCcCCCeEEEecC
Confidence 346 78877774332 12789999999976543
No 94
>PRK12860 transcriptional activator FlhC; Provisional
Probab=79.99 E-value=1.3 Score=32.66 Aligned_cols=28 Identities=29% Similarity=0.774 Sum_probs=23.5
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
-.+| +||+.|.+...++... ..|+-|..
T Consensus 134 l~~C~~Cgg~fv~~~~e~~~~-------f~CplC~~ 162 (189)
T PRK12860 134 LARCCRCGGKFVTHAHDLRHN-------FVCGLCQP 162 (189)
T ss_pred eccCCCCCCCeeccccccCCC-------CcCCCCCC
Confidence 4788 6999999998887776 79999974
No 95
>KOG3277|consensus
Probab=79.94 E-value=3 Score=29.80 Aligned_cols=38 Identities=24% Similarity=0.509 Sum_probs=30.6
Q ss_pred eeeeeeC-CCC--CeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7548 90 QMFTYPC-RCG--AEFCIEEQDTEGDSSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 90 ~~~~~~c-rcG--~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v 131 (134)
....|.| -|+ ..-.|+....+.| .++|.|++|.....|
T Consensus 76 m~l~yTCkvCntRs~ktisk~AY~~G----vVivqC~gC~~~Hli 116 (165)
T KOG3277|consen 76 MQLAYTCKVCNTRSTKTISKQAYEKG----VVIVQCPGCKNHHLI 116 (165)
T ss_pred eEEEEEeeccCCccccccChhhhhCc----eEEEECCCCccceee
Confidence 3457888 787 5678888888888 999999999977655
No 96
>PRK12722 transcriptional activator FlhC; Provisional
Probab=79.28 E-value=1.4 Score=32.44 Aligned_cols=28 Identities=29% Similarity=0.765 Sum_probs=23.5
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
-.+| +||+.|.+...++... ..|+-|..
T Consensus 134 l~~C~~Cgg~fv~~~~e~~~~-------f~CplC~~ 162 (187)
T PRK12722 134 LSSCNCCGGHFVTHAHDPVGS-------FVCGLCQP 162 (187)
T ss_pred eccCCCCCCCeeccccccCCC-------CcCCCCCC
Confidence 4678 6999999999888666 79999976
No 97
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=78.49 E-value=2.3 Score=23.36 Aligned_cols=29 Identities=31% Similarity=0.779 Sum_probs=17.6
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEE
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLE 130 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~ 130 (134)
+.| .||+...+.. -..| .+.|..|-++|.
T Consensus 1 m~Cp~Cg~~~~~~D--~~~g------~~vC~~CG~Vl~ 30 (43)
T PF08271_consen 1 MKCPNCGSKEIVFD--PERG------ELVCPNCGLVLE 30 (43)
T ss_dssp ESBTTTSSSEEEEE--TTTT------EEEETTT-BBEE
T ss_pred CCCcCCcCCceEEc--CCCC------eEECCCCCCEee
Confidence 356 7888773333 2222 369999988875
No 98
>PF02216 B: B domain; InterPro: IPR003132 This entry represents the immunoglobulin-binding domain found in the Staphylococcus aureus virulence factor protein A (SpA). Protein A contains five highly homologous Ig-binding domains in tandem (designated domains E, D, A, B and C), which share a common structure consisting of three helices in a closed left-handed twist. Protein A can exist in both secreted and membrane-bound forms, and has two distinct Ig-binding activities: each domain can bind Fc-gamma (the constant region of IgG involved in effector functions) and Fab (the Ig fragment responsible for antigen recognition) [].; GO: 0019865 immunoglobulin binding, 0009405 pathogenesis; PDB: 1EDL_A 1EDI_A 1EDJ_A 1EDK_A 2B88_A 2B87_A 2B89_A 1FC2_C 1DEE_H 1ZXG_A ....
Probab=68.84 E-value=11 Score=22.02 Aligned_cols=30 Identities=27% Similarity=0.450 Sum_probs=20.3
Q ss_pred ccccccCCCCCCHHHHHHHHHHHHHHcCCCCCH
Q psy7548 3 HFEILNCNVNSTLEEIKENYKSLILKYHPDKNS 35 (134)
Q Consensus 3 ~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk~~ 35 (134)
||+||+++. -.+|=|..|-+ .++-||+...
T Consensus 13 FY~vl~~~n--LteeQrn~yI~-~lKddPs~sq 42 (54)
T PF02216_consen 13 FYEVLHMPN--LTEEQRNGYIQ-SLKDDPSRSQ 42 (54)
T ss_dssp HHHHHCSTT--S-HHHHHHHHH-HHHH-GCCHH
T ss_pred HHHHHcCCC--cCHHHHHhHHH-HHhhChHHHH
Confidence 789998876 34556777776 4778999733
No 99
>KOG3442|consensus
Probab=67.95 E-value=7.2 Score=26.90 Aligned_cols=29 Identities=31% Similarity=0.381 Sum_probs=27.1
Q ss_pred ccccCCCCCCHHHHHHHHHHHHHHcCCCC
Q psy7548 5 EILNCNVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 5 ~iLgv~~~as~~~Ik~ay~~l~~~~HPDk 33 (134)
+||+|++..+.++|.+.|..|.....+.|
T Consensus 63 qILnV~~~ln~eei~k~yehLFevNdksk 91 (132)
T KOG3442|consen 63 QILNVKEPLNREEIEKRYEHLFEVNDKSK 91 (132)
T ss_pred hHhCCCCCCCHHHHHHHHHHHHhccCccc
Confidence 68999999999999999999999988887
No 100
>KOG1779|consensus
Probab=66.84 E-value=15 Score=23.31 Aligned_cols=37 Identities=19% Similarity=0.419 Sum_probs=25.0
Q ss_pred eeeeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEe
Q psy7548 91 MFTYPCRCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEIT 132 (134)
Q Consensus 91 ~~~~~crcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~ 132 (134)
.|+..-+|+|-|.++-- ++ ....+|.|.+|+..+-..
T Consensus 30 syFm~VkC~gc~~iT~v--fS---HaqtvVvc~~c~~il~~~ 66 (84)
T KOG1779|consen 30 SYFMDVKCPGCFKITTV--FS---HAQTVVVCEGCSTILCQP 66 (84)
T ss_pred ceEEEEEcCCceEEEEE--ee---cCceEEEcCCCceEEEEe
Confidence 34555588888877653 11 124589999999987654
No 101
>PF07709 SRR: Seven Residue Repeat; InterPro: IPR011714 This repeat is found in some Plasmodium and Theileria proteins.
Probab=64.26 E-value=6.4 Score=16.43 Aligned_cols=13 Identities=31% Similarity=0.524 Sum_probs=8.9
Q ss_pred HHHHHHHHHHHcc
Q psy7548 40 MFLKINEAWNILK 52 (134)
Q Consensus 40 ~f~~i~~Ay~~L~ 52 (134)
.|..+..||+.|+
T Consensus 2 ~~~~V~~aY~~l~ 14 (14)
T PF07709_consen 2 KFEKVKNAYEQLS 14 (14)
T ss_pred cHHHHHHHHHhcC
Confidence 4667777877664
No 102
>PF12434 Malate_DH: Malate dehydrogenase enzyme
Probab=61.86 E-value=13 Score=18.72 Aligned_cols=18 Identities=22% Similarity=0.471 Sum_probs=15.1
Q ss_pred CHHHHHHHHHHHHHHcCC
Q psy7548 14 TLEEIKENYKSLILKYHP 31 (134)
Q Consensus 14 s~~~Ik~ay~~l~~~~HP 31 (134)
..++.|.+-|+.++.||-
T Consensus 9 ~~~~~r~~lR~AALeYHe 26 (28)
T PF12434_consen 9 NKEDKRAQLRQAALEYHE 26 (28)
T ss_pred chHHHHHHHHHHHHHhcc
Confidence 457889999999999983
No 103
>PF10041 DUF2277: Uncharacterized conserved protein (DUF2277); InterPro: IPR018735 Members of this family of hypothetical bacterial proteins have no known function.
Probab=59.27 E-value=42 Score=21.11 Aligned_cols=32 Identities=22% Similarity=0.091 Sum_probs=25.2
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCC
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk 33 (134)
|--.+.|+.|.||.+||+.|-.+..+|..--.
T Consensus 4 nI~~L~~fePpaT~~EI~aAAlQyVRKvSG~~ 35 (78)
T PF10041_consen 4 NIKTLRNFEPPATDEEIRAAALQYVRKVSGFR 35 (78)
T ss_pred chhhhcCCCCCCCHHHHHHHHHHHHHHHccCC
Confidence 33456688999999999999998888875443
No 104
>PF09297 zf-NADH-PPase: NADH pyrophosphatase zinc ribbon domain; InterPro: IPR015376 This domain has a zinc ribbon structure and is often found between two NUDIX domains.; GO: 0016787 hydrolase activity, 0046872 metal ion binding; PDB: 1VK6_A 2GB5_A.
Probab=59.07 E-value=10 Score=19.34 Aligned_cols=26 Identities=27% Similarity=0.443 Sum_probs=14.1
Q ss_pred eC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 95 PC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 95 ~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
.| +||+.-.....+.. ..|+.|.+..
T Consensus 5 fC~~CG~~t~~~~~g~~---------r~C~~Cg~~~ 31 (32)
T PF09297_consen 5 FCGRCGAPTKPAPGGWA---------RRCPSCGHEH 31 (32)
T ss_dssp B-TTT--BEEE-SSSS----------EEESSSS-EE
T ss_pred ccCcCCccccCCCCcCE---------eECCCCcCEe
Confidence 46 89988877776433 4788887653
No 105
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=58.79 E-value=6.2 Score=30.63 Aligned_cols=17 Identities=18% Similarity=0.495 Sum_probs=14.8
Q ss_pred ceeEeCCCCcceEEEee
Q psy7548 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v~~ 133 (134)
.-+-.|..|..+++++.
T Consensus 236 ~rve~C~~C~~YlK~vd 252 (290)
T PF04216_consen 236 YRVEVCESCGSYLKTVD 252 (290)
T ss_dssp EEEEEETTTTEEEEEEE
T ss_pred EEEEECCcccchHHHHh
Confidence 55899999999999874
No 106
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=54.67 E-value=45 Score=19.96 Aligned_cols=43 Identities=21% Similarity=0.420 Sum_probs=30.6
Q ss_pred HHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHH
Q psy7548 20 ENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 20 ~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
+..+.....-||+. ...+..+.|.+.|..|++.++...++...
T Consensus 14 ~~~r~~~~~~~p~~--~~~eisk~l~~~Wk~ls~~eK~~y~~~a~ 56 (72)
T cd01388 14 KRHRRKVLQEYPLK--ENRAISKILGDRWKALSNEEKQPYYEEAK 56 (72)
T ss_pred HHHHHHHHHHCCCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 44556667789986 23456788999999999888766555443
No 107
>COG3195 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.12 E-value=46 Score=24.18 Aligned_cols=40 Identities=20% Similarity=0.330 Sum_probs=28.8
Q ss_pred CCCCCHHHHHHHHHHH-----------HHHcCCCC--------------------------CHHHHHHHHHHHHHHH
Q psy7548 10 NVNSTLEEIKENYKSL-----------ILKYHPDK--------------------------NSEHQEMFLKINEAWN 49 (134)
Q Consensus 10 ~~~as~~~Ik~ay~~l-----------~~~~HPDk--------------------------~~~~~~~f~~i~~Ay~ 49 (134)
.|.+|.+.+..+.... .+.-|||- .+...++|..+|+||.
T Consensus 41 ~pf~s~~~l~~am~~~v~~A~~~~rl~liraHPdLAgk~a~a~elta~S~~EQasAGLd~Ls~~E~a~f~~LN~aY~ 117 (176)
T COG3195 41 RPFASAEGLHAAMCRAVRAASEEERLALIRAHPDLAGKAAIAGELTAESTSEQASAGLDRLSPEEFARFTELNAAYV 117 (176)
T ss_pred CCcCCHHHHHHHHHHHHHcCCHHHHHHHHHhChhhHHHHHHHHHhhhhhHHHHHhcCcccCCHHHHHHHHHHHHHHH
Confidence 5667888877777543 45679997 2445678899999885
No 108
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=52.61 E-value=13 Score=25.89 Aligned_cols=37 Identities=24% Similarity=0.454 Sum_probs=24.2
Q ss_pred eeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7548 92 FTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 92 ~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v 131 (134)
..|.| +||..|.+..+..... ......|+.|.-.+..
T Consensus 98 ~~Y~Cp~C~~~y~~~ea~~~~d---~~~~f~Cp~Cg~~l~~ 135 (147)
T smart00531 98 AYYKCPNCQSKYTFLEANQLLD---MDGTFTCPRCGEELEE 135 (147)
T ss_pred cEEECcCCCCEeeHHHHHHhcC---CCCcEECCCCCCEEEE
Confidence 35788 9999998754432211 1234899999887753
No 109
>KOG2324|consensus
Probab=52.49 E-value=41 Score=27.70 Aligned_cols=29 Identities=21% Similarity=0.645 Sum_probs=16.0
Q ss_pred eeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 92 FTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 92 ~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
+-+.| +|| |.-..+.+.- ..+..|+.|.-
T Consensus 226 ~l~~C~~C~--~s~n~e~~~~-----sk~~~Cp~C~~ 255 (457)
T KOG2324|consen 226 TLMSCPSCG--YSKNSEDLDL-----SKIASCPKCNE 255 (457)
T ss_pred ceeecCcCC--ccCchhhhcC-----CccccCCcccC
Confidence 44678 887 2222222221 23478999976
No 110
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=50.84 E-value=8.4 Score=28.00 Aligned_cols=30 Identities=27% Similarity=0.662 Sum_probs=21.9
Q ss_pred eeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 92 FTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 92 ~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
-+|.| +|+..|++..+. ..+ ..|+.|.-.+
T Consensus 116 ~~Y~Cp~C~~rytf~eA~-~~~-------F~Cp~Cg~~L 146 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEAM-EYG-------FRCPQCGEML 146 (178)
T ss_pred CEEECCCCCcEEeHHHHh-hcC-------CcCCCCCCCC
Confidence 34778 899999888763 334 6999997543
No 111
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=49.61 E-value=5.9 Score=30.31 Aligned_cols=32 Identities=16% Similarity=0.347 Sum_probs=21.8
Q ss_pred CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7548 97 RCGAEFCIEEQDTEGDSSDDNILIACDTCSLL 128 (134)
Q Consensus 97 rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~ 128 (134)
+|||=+.+-..........+..+|.||.|+=.
T Consensus 199 ~C~GC~m~l~~~~~~~V~~~d~iv~CP~CgRI 230 (239)
T COG1579 199 VCGGCHMKLPSQTLSKVRKKDEIVFCPYCGRI 230 (239)
T ss_pred cccCCeeeecHHHHHHHhcCCCCccCCccchH
Confidence 67775555555555554455789999999854
No 112
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=49.36 E-value=48 Score=18.78 Aligned_cols=45 Identities=18% Similarity=0.276 Sum_probs=31.3
Q ss_pred HHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHHh
Q psy7548 19 KENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 19 k~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
.+.++...+.-||+.. ..+..+.+.+.|..|++.++....+....
T Consensus 12 ~~~~~~~~~~~~~~~~--~~~i~~~~~~~W~~l~~~~k~~y~~~a~~ 56 (66)
T cd00084 12 SQEHRAEVKAENPGLS--VGEISKILGEMWKSLSEEEKKKYEEKAEK 56 (66)
T ss_pred HHHHHHHHHHHCcCCC--HHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 4456666777888852 44567799999999998766655554433
No 113
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=47.95 E-value=13 Score=26.47 Aligned_cols=29 Identities=21% Similarity=0.528 Sum_probs=21.2
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
.|.| +|+..|++..+-- .+ ..|+.|.-.+
T Consensus 109 ~Y~Cp~c~~r~tf~eA~~-~~-------F~Cp~Cg~~L 138 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAME-LN-------FTCPRCGAML 138 (158)
T ss_pred eEECCCCCcEeeHHHHHH-cC-------CcCCCCCCEe
Confidence 4778 8998888877743 33 5899997654
No 114
>PF13831 PHD_2: PHD-finger; PDB: 2L43_A 2KU3_A.
Probab=47.73 E-value=16 Score=19.37 Aligned_cols=13 Identities=31% Similarity=0.866 Sum_probs=8.4
Q ss_pred eeEeCCCCcceEE
Q psy7548 118 ILIACDTCSLLLE 130 (134)
Q Consensus 118 ~~v~C~~CSl~i~ 130 (134)
.++.|++|++.|+
T Consensus 3 ~ll~C~~C~v~VH 15 (36)
T PF13831_consen 3 PLLFCDNCNVAVH 15 (36)
T ss_dssp EEEE-SSS--EEE
T ss_pred ceEEeCCCCCcCC
Confidence 4789999999886
No 115
>PF11331 DUF3133: Protein of unknown function (DUF3133); InterPro: IPR021480 This eukaryotic family of proteins has no known function.
Probab=46.81 E-value=28 Score=19.70 Aligned_cols=17 Identities=24% Similarity=0.575 Sum_probs=13.9
Q ss_pred ceeEeCCCCcceEEEee
Q psy7548 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v~~ 133 (134)
...+.|.+||.+|.+.-
T Consensus 29 ~~klrCGaCs~vl~~s~ 45 (46)
T PF11331_consen 29 QQKLRCGACSEVLSFSL 45 (46)
T ss_pred eeEEeCCCCceeEEEec
Confidence 55789999999998753
No 116
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=46.30 E-value=58 Score=18.83 Aligned_cols=43 Identities=16% Similarity=0.319 Sum_probs=29.9
Q ss_pred HHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHH
Q psy7548 20 ENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 20 ~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
...+...+.-||+.. ..+..+.|.+.|.-|++.++....+...
T Consensus 13 ~~~~~~~k~~~p~~~--~~~i~~~~~~~W~~l~~~eK~~y~~~a~ 55 (69)
T PF00505_consen 13 KEKRAKLKEENPDLS--NKEISKILAQMWKNLSEEEKAPYKEEAE 55 (69)
T ss_dssp HHHHHHHHHHSTTST--HHHHHHHHHHHHHCSHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhcccc--cccchhhHHHHHhcCCHHHHHHHHHHHH
Confidence 445566667799974 4566789999999998666655544443
No 117
>PTZ00043 cytochrome c oxidase subunit; Provisional
Probab=45.96 E-value=1.4e+02 Score=23.00 Aligned_cols=37 Identities=27% Similarity=0.265 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHcc
Q psy7548 16 EEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILK 52 (134)
Q Consensus 16 ~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~ 52 (134)
.-++++|++||+-.--....+..+-...+-+-|++|.
T Consensus 96 ~l~~ksykqla~d~gmqi~~~~~~hm~~~le~y~~Lk 132 (268)
T PTZ00043 96 TLPKKSYKQLARDMGMQIVNEPSEHMLGLLELYEYLK 132 (268)
T ss_pred hcchHHHHHHHHHhCceecCCchHHHHHHHHHHHhcC
Confidence 3378999999876544442222344566677777775
No 118
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=45.85 E-value=57 Score=18.63 Aligned_cols=41 Identities=24% Similarity=0.375 Sum_probs=28.7
Q ss_pred HHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHH
Q psy7548 22 YKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 22 y~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
.+...+.-||+. ...+..+.|.+.|..|++.++....+...
T Consensus 15 ~r~~~~~~~p~~--~~~~i~~~~~~~W~~ls~~eK~~y~~~a~ 55 (66)
T cd01390 15 QRPKLKKENPDA--SVTEVTKILGEKWKELSEEEKKKYEEKAE 55 (66)
T ss_pred HHHHHHHHCcCC--CHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 445556678885 34566789999999999877666555443
No 119
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=44.61 E-value=19 Score=28.50 Aligned_cols=17 Identities=29% Similarity=0.438 Sum_probs=14.2
Q ss_pred ceeEeCCCCcceEEEee
Q psy7548 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v~~ 133 (134)
.-.-.|..|.-+++++.
T Consensus 250 ~r~e~C~~C~~YlK~~~ 266 (305)
T TIGR01562 250 LKAETCDSCQGYLKILY 266 (305)
T ss_pred eEEeeccccccchhhhc
Confidence 45779999999999864
No 120
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=43.22 E-value=16 Score=18.76 Aligned_cols=25 Identities=32% Similarity=0.846 Sum_probs=11.4
Q ss_pred eC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 95 PC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 95 ~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
.| .||..|.+.. ..+..|+.|..-+
T Consensus 4 ~Cp~C~se~~y~D----------~~~~vCp~C~~ew 29 (30)
T PF08274_consen 4 KCPLCGSEYTYED----------GELLVCPECGHEW 29 (30)
T ss_dssp --TTT-----EE-----------SSSEEETTTTEEE
T ss_pred CCCCCCCcceecc----------CCEEeCCcccccC
Confidence 35 6888777722 2457899998643
No 121
>COG0089 RplW Ribosomal protein L23 [Translation, ribosomal structure and biogenesis]
Probab=42.84 E-value=24 Score=23.09 Aligned_cols=21 Identities=24% Similarity=0.346 Sum_probs=17.9
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7548 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l~ 26 (134)
++-|.++|+..+||+|...+-
T Consensus 25 vF~V~~~AtK~~IK~AvE~lF 45 (94)
T COG0089 25 VFIVDPDATKPEIKAAVEELF 45 (94)
T ss_pred EEEECCCCCHHHHHHHHHHHh
Confidence 467899999999999998873
No 122
>KOG0527|consensus
Probab=41.43 E-value=48 Score=26.61 Aligned_cols=43 Identities=16% Similarity=0.443 Sum_probs=33.9
Q ss_pred HHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHH
Q psy7548 20 ENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 20 ~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
+..|+.+.+--||.+ ..+..|+|.+-|..|++.+||-.+|.--
T Consensus 75 q~~RRkma~qnP~mH--NSEISK~LG~~WK~Lse~EKrPFi~EAe 117 (331)
T KOG0527|consen 75 QGQRRKLAKQNPKMH--NSEISKRLGAEWKLLSEEEKRPFVDEAE 117 (331)
T ss_pred HHHHHHHHHhCcchh--hHHHHHHHHHHHhhcCHhhhccHHHHHH
Confidence 456777777778872 3467889999999999999999998443
No 123
>PTZ00199 high mobility group protein; Provisional
Probab=41.39 E-value=94 Score=19.84 Aligned_cols=41 Identities=20% Similarity=0.333 Sum_probs=27.8
Q ss_pred HHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHH
Q psy7548 22 YKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYES 62 (134)
Q Consensus 22 y~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~ 62 (134)
.|..+..-||+......+..+.|.+.|..|++.++...++.
T Consensus 37 ~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~ 77 (94)
T PTZ00199 37 KRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKK 77 (94)
T ss_pred HHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 34455667898643344567889999999997766555443
No 124
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=41.34 E-value=28 Score=24.59 Aligned_cols=28 Identities=29% Similarity=0.593 Sum_probs=20.7
Q ss_pred eeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7548 92 FTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLL 128 (134)
Q Consensus 92 ~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~ 128 (134)
-.+.| .||....++..+ .+-+|+.|...
T Consensus 111 G~l~C~~Cg~~~~~~~~~---------~l~~Cp~C~~~ 139 (146)
T PF07295_consen 111 GTLVCENCGHEVELTHPE---------RLPPCPKCGHT 139 (146)
T ss_pred ceEecccCCCEEEecCCC---------cCCCCCCCCCC
Confidence 35789 999888887542 34699999764
No 125
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=41.28 E-value=16 Score=24.84 Aligned_cols=35 Identities=26% Similarity=0.581 Sum_probs=18.8
Q ss_pred eeCCCCCeEEEecccccCCCCCCceeEeCCCCc-ceEEEe
Q psy7548 94 YPCRCGAEFCIEEQDTEGDSSDDNILIACDTCS-LLLEIT 132 (134)
Q Consensus 94 ~~crcG~~~~v~~~~~~~g~~~~~~~v~C~~CS-l~i~v~ 132 (134)
..|.||..|.+....+..- .....||.|. .-+.|+
T Consensus 71 ~~C~Cg~~~~~~~~~~~~~----~~~~~CP~Cgs~~~~i~ 106 (124)
T PRK00762 71 IECECGYEGVVDEDEIDHY----AAVIECPVCGNKRAHIL 106 (124)
T ss_pred EEeeCcCcccccccchhcc----ccCCcCcCCCCCCCEEe
Confidence 4567887776654322110 1124799995 555554
No 126
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=40.87 E-value=82 Score=19.01 Aligned_cols=43 Identities=14% Similarity=0.187 Sum_probs=31.0
Q ss_pred HHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHH
Q psy7548 19 KENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQ 63 (134)
Q Consensus 19 k~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~ 63 (134)
.+.++..++.-+|+.. ..+..+.|.+.|..|++.++....+..
T Consensus 13 ~~~~r~~~~~~~p~~~--~~eisk~~g~~Wk~ls~eeK~~y~~~A 55 (77)
T cd01389 13 RQDKHAQLKTENPGLT--NNEISRIIGRMWRSESPEVKAYYKELA 55 (77)
T ss_pred HHHHHHHHHHHCCCCC--HHHHHHHHHHHHhhCCHHHHHHHHHHH
Confidence 4566777788899873 346678899999999977765544433
No 127
>PF14354 Lar_restr_allev: Restriction alleviation protein Lar
Probab=39.32 E-value=41 Score=19.38 Aligned_cols=33 Identities=27% Similarity=0.540 Sum_probs=17.0
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
.+| .||..-.........+ ......|.|..|..
T Consensus 4 kPCPFCG~~~~~~~~~~~~~-~~~~~~V~C~~Cga 37 (61)
T PF14354_consen 4 KPCPFCGSADVLIRQDEGFD-YGMYYYVECTDCGA 37 (61)
T ss_pred cCCCCCCCcceEeecccCCC-CCCEEEEEcCCCCC
Confidence 467 6875444433322221 00116788999865
No 128
>PF07739 TipAS: TipAS antibiotic-recognition domain; InterPro: IPR012925 TipAL is a bacterial transcriptional regulator of the MerR family. The tipA gene can be expressed as a long form, TipAL, and a short form, TipAS, which constitutes the C-terminal part of TipAL. TipAS forms the antibiotic-recognition domain []. This domain, which has an alpha-helical globin-like fold, is also found at the C terminus of other MerR family transcription factors, including Mta, a central regulator of multidrug resistance in Bacillus subtilis [], and SkgA from Caulobacter crescentus []. ; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 1NY9_A 3HH0_A 3QAO_A.
Probab=38.28 E-value=96 Score=19.96 Aligned_cols=48 Identities=17% Similarity=0.404 Sum_probs=26.7
Q ss_pred cCCCCCCH-HHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHH
Q psy7548 8 NCNVNSTL-EEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYE 61 (134)
Q Consensus 8 gv~~~as~-~~Ik~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd 61 (134)
|++|++.. .+|-+.+..++..+++. ..+.+..|.+.| +.||.-+..|+
T Consensus 51 g~~p~s~evq~l~~~~~~~~~~~~~~----~~~~~~~l~~~y--~~~~~~~~~~~ 99 (118)
T PF07739_consen 51 GVDPDSPEVQELAERWMELINQFTGG----DPELLRGLAQMY--VEDPRFAAMYD 99 (118)
T ss_dssp T--TT-HHHHHHHHHHHHHHHHSS-------HHHHHHHHHHT--TSTHHHHHHHG
T ss_pred CCCcCCHHHHHHHHHHHHHHHHHhCC----CHHHHHHHHHHH--HcCHHHHhhcc
Confidence 44554433 33556666666666662 245666777777 67777777776
No 129
>TIGR00577 fpg formamidopyrimidine-DNA glycosylase (fpg). All proteins in the FPG family with known functions are FAPY-DNA glycosylases that function in base excision repair. Homologous to endonuclease VIII (nei). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=37.96 E-value=1.1e+02 Score=23.50 Aligned_cols=26 Identities=27% Similarity=0.565 Sum_probs=15.6
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CS 126 (134)
.+| +||.......- . | =...-|+.|.
T Consensus 246 ~pC~~Cg~~I~~~~~--~-g----R~t~~CP~CQ 272 (272)
T TIGR00577 246 EPCRRCGTPIEKIKV--G-G----RGTHFCPQCQ 272 (272)
T ss_pred CCCCCCCCeeEEEEE--C-C----CCCEECCCCC
Confidence 488 89976543221 1 2 3457899883
No 130
>PF09855 DUF2082: Nucleic-acid-binding protein containing Zn-ribbon domain (DUF2082); InterPro: IPR018652 This family of proteins contains various hypothetical prokaryotic proteins as well as some Zn-ribbon nucleic-acid-binding proteins.
Probab=37.06 E-value=48 Score=20.02 Aligned_cols=11 Identities=18% Similarity=0.700 Sum_probs=7.1
Q ss_pred ceeEeCCCCcc
Q psy7548 117 NILIACDTCSL 127 (134)
Q Consensus 117 ~~~v~C~~CSl 127 (134)
+..+.|+.|..
T Consensus 34 f~~v~C~~CGY 44 (64)
T PF09855_consen 34 FTTVSCTNCGY 44 (64)
T ss_pred EEEEECCCCCC
Confidence 34677777764
No 131
>CHL00030 rpl23 ribosomal protein L23
Probab=36.34 E-value=36 Score=22.14 Aligned_cols=27 Identities=19% Similarity=0.115 Sum_probs=20.1
Q ss_pred cccCCCCCCHHHHHHHHHHHHHHcCCCC
Q psy7548 6 ILNCNVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l~~~~HPDk 33 (134)
++-|+++|+..|||+|..++- ...+.+
T Consensus 23 ~F~V~~~anK~eIK~avE~lf-~VkV~~ 49 (93)
T CHL00030 23 TFDVDSGSTKTEIKHWIELFF-GVKVIA 49 (93)
T ss_pred EEEECCCCCHHHHHHHHHHHh-CCeEEE
Confidence 456889999999999998763 344444
No 132
>COG0266 Nei Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]
Probab=35.81 E-value=1.2e+02 Score=23.71 Aligned_cols=26 Identities=27% Similarity=0.643 Sum_probs=14.9
Q ss_pred eeeC-CCCCeEEE-ecccccCCCCCCceeEeCCCCc
Q psy7548 93 TYPC-RCGAEFCI-EEQDTEGDSSDDNILIACDTCS 126 (134)
Q Consensus 93 ~~~c-rcG~~~~v-~~~~~~~g~~~~~~~v~C~~CS 126 (134)
..+| +||+...= .... =...-|+.|-
T Consensus 245 GepC~~CGt~I~k~~~~g--------R~t~~CP~CQ 272 (273)
T COG0266 245 GEPCRRCGTPIEKIKLGG--------RSTFYCPVCQ 272 (273)
T ss_pred CCCCCccCCEeEEEEEcC--------CcCEeCCCCC
Confidence 4688 89875432 2221 2346888884
No 133
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=35.76 E-value=7.5 Score=24.53 Aligned_cols=33 Identities=27% Similarity=0.797 Sum_probs=22.1
Q ss_pred ceeeeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 89 SQMFTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 89 ~~~~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
..+|.|.| .||..|.|-..--+ .....|+.|+-
T Consensus 8 MPtY~Y~c~~cg~~~dvvq~~~d------dplt~ce~c~a 41 (82)
T COG2331 8 MPTYSYECTECGNRFDVVQAMTD------DPLTTCEECGA 41 (82)
T ss_pred ccceEEeecccchHHHHHHhccc------CccccChhhCh
Confidence 46789999 99988776443111 23567888874
No 134
>PRK14811 formamidopyrimidine-DNA glycosylase; Provisional
Probab=35.59 E-value=1.3e+02 Score=23.11 Aligned_cols=28 Identities=29% Similarity=0.547 Sum_probs=17.2
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
..+| +||....-..- .| =...-|+.|..
T Consensus 235 g~pC~~Cg~~I~~~~~---~g----R~ty~Cp~CQ~ 263 (269)
T PRK14811 235 GQPCPRCGTPIEKIVV---GG----RGTHFCPQCQP 263 (269)
T ss_pred cCCCCcCCCeeEEEEE---CC----CCcEECCCCcC
Confidence 3589 89976543221 12 34578999964
No 135
>PRK13945 formamidopyrimidine-DNA glycosylase; Provisional
Probab=35.27 E-value=1.3e+02 Score=23.31 Aligned_cols=27 Identities=22% Similarity=0.551 Sum_probs=16.4
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCS 126 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CS 126 (134)
..+| +||....... + .| =...-|+.|-
T Consensus 254 g~pC~~Cg~~I~~~~--~-~g----R~t~~CP~CQ 281 (282)
T PRK13945 254 GKPCRKCGTPIERIK--L-AG----RSTHWCPNCQ 281 (282)
T ss_pred cCCCCcCCCeeEEEE--E-CC----CccEECCCCc
Confidence 3589 9997654321 1 22 3467899984
No 136
>PRK14810 formamidopyrimidine-DNA glycosylase; Provisional
Probab=34.93 E-value=1.5e+02 Score=22.87 Aligned_cols=27 Identities=19% Similarity=0.396 Sum_probs=16.0
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCS 126 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CS 126 (134)
..+| +||+...-..- .| =...-|+.|.
T Consensus 244 g~pCprCG~~I~~~~~---~g----R~t~~CP~CQ 271 (272)
T PRK14810 244 GEPCLNCKTPIRRVVV---AG----RSSHYCPHCQ 271 (272)
T ss_pred CCcCCCCCCeeEEEEE---CC----CccEECcCCc
Confidence 4689 89975532111 12 3467899984
No 137
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=34.38 E-value=55 Score=17.14 Aligned_cols=30 Identities=17% Similarity=0.559 Sum_probs=12.9
Q ss_pred eC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 95 PC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 95 ~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
.| .||+..+...-+-+. ..-..|+.|..+.
T Consensus 2 fC~~CG~~l~~~ip~gd~-----r~R~vC~~Cg~Ih 32 (34)
T PF14803_consen 2 FCPQCGGPLERRIPEGDD-----RERLVCPACGFIH 32 (34)
T ss_dssp B-TTT--B-EEE--TT-S-----S-EEEETTTTEEE
T ss_pred ccccccChhhhhcCCCCC-----ccceECCCCCCEE
Confidence 35 788765443221111 2347899998653
No 138
>KOG3960|consensus
Probab=34.07 E-value=28 Score=26.97 Aligned_cols=16 Identities=38% Similarity=0.563 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHHccC
Q psy7548 38 QEMFLKINEAWNILKD 53 (134)
Q Consensus 38 ~~~f~~i~~Ay~~L~d 53 (134)
..+.++||||+++|+.
T Consensus 127 RRRLkKVNEAFE~LKR 142 (284)
T KOG3960|consen 127 RRRLKKVNEAFETLKR 142 (284)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4679999999998754
No 139
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=33.95 E-value=46 Score=18.43 Aligned_cols=11 Identities=18% Similarity=0.552 Sum_probs=5.6
Q ss_pred EeCCCCcceEE
Q psy7548 120 IACDTCSLLLE 130 (134)
Q Consensus 120 v~C~~CSl~i~ 130 (134)
..|+.|.....
T Consensus 21 ~vC~~Cg~~~~ 31 (52)
T smart00661 21 FVCRKCGYEEP 31 (52)
T ss_pred EECCcCCCeEE
Confidence 45666554433
No 140
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=33.87 E-value=15 Score=17.78 Aligned_cols=15 Identities=20% Similarity=0.804 Sum_probs=11.2
Q ss_pred eeeCCCCCeEEEecc
Q psy7548 93 TYPCRCGAEFCIEEQ 107 (134)
Q Consensus 93 ~~~crcG~~~~v~~~ 107 (134)
+|.|.|...|.+..+
T Consensus 1 sy~C~C~~Gy~l~~d 15 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPD 15 (24)
T ss_pred CEEeeCCCCCcCCCC
Confidence 367889888887654
No 141
>cd01780 PLC_epsilon_RA Ubiquitin-like domain of Phosphatidylinositide-specific phospholipase. PLC_epsilon_RA Phosphatidylinositide-specific phospholipase C (PLC) is a signaling enzyme that hydrolyzes membrane phospholipids to generate inositol triphosphate. PLC-epsilon represents a novel forth class of PLC that has a PLC catalytic core domain, a CDC25 guanine nucleotide exchange factor domain and two RA (Ras-association) domains of which the second is critical for Ras activation of the enzyme.
Probab=33.54 E-value=52 Score=21.48 Aligned_cols=32 Identities=16% Similarity=0.230 Sum_probs=24.9
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHcCCCC
Q psy7548 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~Ik~ay~~l~~~~HPDk 33 (134)
.+|.||-++..+|..+|-+.-...|++-+||-
T Consensus 12 qP~~il~a~~~STa~Dvi~Qal~KA~rs~~~~ 43 (93)
T cd01780 12 QPYAILRAPRVSTAQDVIQQTLCKARRSNPNP 43 (93)
T ss_pred CCeeEEEccccccHHHHHHHHHHHhccCCCCc
Confidence 57999999999998886665555567778886
No 142
>PF06107 DUF951: Bacterial protein of unknown function (DUF951); InterPro: IPR009296 This family consists of several short hypothetical bacterial proteins of unknown function.
Probab=33.38 E-value=63 Score=19.15 Aligned_cols=29 Identities=24% Similarity=0.485 Sum_probs=19.7
Q ss_pred CCC-CeEEEecccccCCCCCCceeEeCCCCcceEEEe
Q psy7548 97 RCG-AEFCIEEQDTEGDSSDDNILIACDTCSLLLEIT 132 (134)
Q Consensus 97 rcG-~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~ 132 (134)
.|| ..++|..--++-. +.|.+|...|.+.
T Consensus 15 PCG~~~Wei~R~GaDik-------ikC~gCg~~imlp 44 (57)
T PF06107_consen 15 PCGSNEWEIIRIGADIK-------IKCLGCGRQIMLP 44 (57)
T ss_pred CCCCCEEEEEEccCcEE-------EEECCCCCEEEEe
Confidence 455 4666665555444 7899999888764
No 143
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=33.24 E-value=52 Score=22.69 Aligned_cols=13 Identities=31% Similarity=1.032 Sum_probs=10.1
Q ss_pred eeeC-CCCCeEEEe
Q psy7548 93 TYPC-RCGAEFCIE 105 (134)
Q Consensus 93 ~~~c-rcG~~~~v~ 105 (134)
...| .||..|.+.
T Consensus 70 ~~~C~~CG~~~~~~ 83 (135)
T PRK03824 70 VLKCRNCGNEWSLK 83 (135)
T ss_pred EEECCCCCCEEecc
Confidence 3678 799888876
No 144
>PF05876 Terminase_GpA: Phage terminase large subunit (GpA); InterPro: IPR008866 This entry is represented by Bacteriophage lambda, GpA. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry consists of several phage terminase large subunit proteins as well as related sequences from several bacterial species. The DNA packaging enzyme of bacteriophage lambda, terminase, is a heteromultimer composed of a small subunit, gpNu1, and a large subunit, gpA, products of the Nu1 and A genes, respectively. Terminase is involved in the site-specific binding and cutting of the DNA in the initial stages of packaging. It is now known that gpA is actively involved in late stages of packaging, including DNA translocation, and that this enzyme contains separate functional domains for its early and late packaging activities [].
Probab=32.93 E-value=39 Score=28.91 Aligned_cols=49 Identities=18% Similarity=0.500 Sum_probs=32.3
Q ss_pred ccccccCCceeeeeeC-CCCCeEEEecccccCC--CCCCceeEeCCCCcceE
Q psy7548 81 SDMEHKGESQMFTYPC-RCGAEFCIEEQDTEGD--SSDDNILIACDTCSLLL 129 (134)
Q Consensus 81 ~~~~~~~~~~~~~~~c-rcG~~~~v~~~~~~~g--~~~~~~~v~C~~CSl~i 129 (134)
.......+...|.-+| .||..+.+..+.+.-. .........|+.|-..|
T Consensus 188 ~~~~~~sdqr~~~vpCPhCg~~~~l~~~~l~w~~~~~~~~a~y~C~~Cg~~i 239 (557)
T PF05876_consen 188 ERLYEESDQRRYYVPCPHCGEEQVLEWENLKWDKGEAPETARYVCPHCGCEI 239 (557)
T ss_pred HHHHHhCCceEEEccCCCCCCCccccccceeecCCCCccceEEECCCCcCCC
Confidence 3333344556788999 8998877776655322 12345799999997665
No 145
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=32.58 E-value=46 Score=20.81 Aligned_cols=21 Identities=19% Similarity=0.248 Sum_probs=17.8
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7548 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l~ 26 (134)
++-|+++|+..|||+|..++.
T Consensus 18 ~F~V~~~anK~eIK~avE~lf 38 (77)
T TIGR03636 18 TFIVDRKATKGDIKRAVEKLF 38 (77)
T ss_pred EEEECCCCCHHHHHHHHHHHh
Confidence 456899999999999998873
No 146
>COG2879 Uncharacterized small protein [Function unknown]
Probab=32.41 E-value=80 Score=19.14 Aligned_cols=21 Identities=43% Similarity=0.559 Sum_probs=13.7
Q ss_pred HHHHHHHHcCCCCCHHHHHHH
Q psy7548 21 NYKSLILKYHPDKNSEHQEMF 41 (134)
Q Consensus 21 ay~~l~~~~HPDk~~~~~~~f 41 (134)
.|..-+++.|||+.+-....|
T Consensus 27 nYVehmr~~hPd~p~mT~~EF 47 (65)
T COG2879 27 NYVEHMRKKHPDKPPMTYEEF 47 (65)
T ss_pred HHHHHHHHhCcCCCcccHHHH
Confidence 456668899999954443333
No 147
>PRK01103 formamidopyrimidine/5-formyluracil/ 5-hydroxymethyluracil DNA glycosylase; Validated
Probab=32.38 E-value=1.5e+02 Score=22.82 Aligned_cols=26 Identities=31% Similarity=0.751 Sum_probs=16.0
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CS 126 (134)
.+| +||....-.. + .| =...-|+.|.
T Consensus 246 ~pC~~Cg~~I~~~~--~-~g----R~t~~CP~CQ 272 (274)
T PRK01103 246 EPCRRCGTPIEKIK--Q-GG----RSTFFCPRCQ 272 (274)
T ss_pred CCCCCCCCeeEEEE--E-CC----CCcEECcCCC
Confidence 579 8997654211 1 12 3467899995
No 148
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=31.76 E-value=55 Score=26.02 Aligned_cols=17 Identities=24% Similarity=0.409 Sum_probs=14.3
Q ss_pred ceeEeCCCCcceEEEee
Q psy7548 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v~~ 133 (134)
...-.|..|.-+++++.
T Consensus 250 ~r~e~C~~C~~YlK~~~ 266 (309)
T PRK03564 250 VKAESCGDCGTYLKILY 266 (309)
T ss_pred eEeeecccccccceecc
Confidence 45789999999999864
No 149
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=31.01 E-value=35 Score=28.47 Aligned_cols=15 Identities=27% Similarity=0.609 Sum_probs=13.4
Q ss_pred ceeEeCCCCcceEEE
Q psy7548 117 NILIACDTCSLLLEI 131 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v 131 (134)
..++.|+.|++|.++
T Consensus 144 ~~Wi~Cd~CgH~cH~ 158 (446)
T PF07227_consen 144 CSWIGCDVCGHWCHL 158 (446)
T ss_pred eeEEeccCCCceehh
Confidence 679999999999875
No 150
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=30.49 E-value=1.9e+02 Score=21.95 Aligned_cols=22 Identities=14% Similarity=0.301 Sum_probs=15.2
Q ss_pred CeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 100 AEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 100 ~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
|-+.....++.. ..|+.|..|.
T Consensus 203 gl~~~~ek~~~e--------fiC~~Cn~~n 224 (251)
T COG5415 203 GLYRLAEKPIIE--------FICPHCNHKN 224 (251)
T ss_pred cccccccccchh--------eecccchhhc
Confidence 555555555553 5999999887
No 151
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=29.56 E-value=35 Score=22.70 Aligned_cols=29 Identities=38% Similarity=0.777 Sum_probs=18.4
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce-EEEe
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLL-LEIT 132 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~-i~v~ 132 (134)
...| .||..|.+.... ..||.|... +.|+
T Consensus 70 ~~~C~~Cg~~~~~~~~~-----------~~CP~Cgs~~~~i~ 100 (113)
T PF01155_consen 70 RARCRDCGHEFEPDEFD-----------FSCPRCGSPDVEII 100 (113)
T ss_dssp EEEETTTS-EEECHHCC-----------HH-SSSSSS-EEEE
T ss_pred cEECCCCCCEEecCCCC-----------CCCcCCcCCCcEEc
Confidence 4678 888888887664 258888765 4444
No 152
>COG1439 Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module [General function prediction only]
Probab=29.39 E-value=52 Score=24.05 Aligned_cols=30 Identities=23% Similarity=0.505 Sum_probs=21.4
Q ss_pred eeeeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEe
Q psy7548 90 QMFTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEIT 132 (134)
Q Consensus 90 ~~~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v~ 132 (134)
..|.+.| .|+..|. ...+ .|+.|-..+...
T Consensus 136 ~~w~~rC~GC~~~f~-----~~~~--------~Cp~CG~~~~~~ 166 (177)
T COG1439 136 RKWRLRCHGCKRIFP-----EPKD--------FCPICGSPLKRK 166 (177)
T ss_pred eeeeEEEecCceecC-----CCCC--------cCCCCCCceEEe
Confidence 4577889 8888887 2333 899998876653
No 153
>PF14951 DUF4503: Domain of unknown function (DUF4503)
Probab=29.15 E-value=33 Score=27.97 Aligned_cols=19 Identities=26% Similarity=0.503 Sum_probs=11.8
Q ss_pred eeeeeeC-CCC-CeEEEeccc
Q psy7548 90 QMFTYPC-RCG-AEFCIEEQD 108 (134)
Q Consensus 90 ~~~~~~c-rcG-~~~~v~~~~ 108 (134)
..-|-.| ||| +..+...++
T Consensus 271 AfSWPvCd~CGn~rLe~~pe~ 291 (389)
T PF14951_consen 271 AFSWPVCDRCGNGRLEQSPED 291 (389)
T ss_pred cccCccccccCCccceeCccC
Confidence 3446678 999 555555554
No 154
>PRK02935 hypothetical protein; Provisional
Probab=28.73 E-value=29 Score=23.28 Aligned_cols=18 Identities=17% Similarity=0.545 Sum_probs=13.2
Q ss_pred ceeEeCCCCcceEEEeeC
Q psy7548 117 NILIACDTCSLLLEITAR 134 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v~~~ 134 (134)
-+.|.|++|.-..++++|
T Consensus 68 avqV~CP~C~K~TKmLGr 85 (110)
T PRK02935 68 AVQVICPSCEKPTKMLGR 85 (110)
T ss_pred ceeeECCCCCchhhhccc
Confidence 457899999877766553
No 155
>PRK11032 hypothetical protein; Provisional
Probab=28.61 E-value=2.1e+02 Score=20.54 Aligned_cols=27 Identities=19% Similarity=0.546 Sum_probs=19.4
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLL 128 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~ 128 (134)
...| .||....+... ..+-+|+.|..-
T Consensus 124 ~LvC~~Cg~~~~~~~p---------~~i~pCp~C~~~ 151 (160)
T PRK11032 124 NLVCEKCHHHLAFYTP---------EVLPLCPKCGHD 151 (160)
T ss_pred eEEecCCCCEEEecCC---------CcCCCCCCCCCC
Confidence 4678 89988877654 345589999753
No 156
>PF14706 Tnp_DNA_bind: Transposase DNA-binding; PDB: 3ECP_A 4DM0_A 1MUS_A 1MUH_A 1MM8_A.
Probab=28.60 E-value=1e+02 Score=18.19 Aligned_cols=39 Identities=8% Similarity=0.199 Sum_probs=26.6
Q ss_pred HHHHHHHHHHH--HcCCCC-CHHHHHHHHHHHHHHHHccChH
Q psy7548 17 EIKENYKSLIL--KYHPDK-NSEHQEMFLKINEAWNILKDEK 55 (134)
Q Consensus 17 ~Ik~ay~~l~~--~~HPDk-~~~~~~~f~~i~~Ay~~L~d~~ 55 (134)
-+.+...+++. --||.. -|.+......+..||..|++|.
T Consensus 16 Rl~~Rl~~l~~~la~~p~~Sip~a~~~wa~tkaAYRF~~N~~ 57 (58)
T PF14706_consen 16 RLTRRLVKLAESLAEKPGASIPQACQDWAETKAAYRFFRNPR 57 (58)
T ss_dssp HHHHHHHHHHHHHHHTTTS-HHHHTT-HHHHHHHHHHHT-TT
T ss_pred hHHHHHHHHHHHHHHCCCCccchhccCHHHHHHHHHhhcCCC
Confidence 35566666644 558888 5666667788999999998763
No 157
>PRK05738 rplW 50S ribosomal protein L23; Reviewed
Probab=28.22 E-value=59 Score=20.94 Aligned_cols=20 Identities=25% Similarity=0.365 Sum_probs=16.8
Q ss_pred cccCCCCCCHHHHHHHHHHH
Q psy7548 6 ILNCNVNSTLEEIKENYKSL 25 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l 25 (134)
++-|++.|+..|||+|..++
T Consensus 24 ~F~V~~~a~K~eIK~aie~l 43 (92)
T PRK05738 24 VFEVAPDATKPEIKAAVEKL 43 (92)
T ss_pred EEEECCCCCHHHHHHHHHHH
Confidence 35688999999999998886
No 158
>PF15178 TOM_sub5: Mitochondrial import receptor subunit TOM5 homolog
Probab=28.17 E-value=1.2e+02 Score=17.26 Aligned_cols=22 Identities=23% Similarity=0.244 Sum_probs=18.2
Q ss_pred cccccCCCCCCHHHHHHHHHHH
Q psy7548 4 FEILNCNVNSTLEEIKENYKSL 25 (134)
Q Consensus 4 Y~iLgv~~~as~~~Ik~ay~~l 25 (134)
|.+=|+.|..+++|.|+..|+-
T Consensus 2 ~~~egl~pk~DPeE~k~kmR~d 23 (51)
T PF15178_consen 2 FRIEGLGPKMDPEEMKRKMRED 23 (51)
T ss_pred cccccCCCCCCHHHHHHHHHHH
Confidence 4567899999999999888864
No 159
>PF11023 DUF2614: Protein of unknown function (DUF2614); InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=27.96 E-value=23 Score=24.00 Aligned_cols=17 Identities=12% Similarity=0.499 Sum_probs=11.9
Q ss_pred ceeEeCCCCcceEEEee
Q psy7548 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v~~ 133 (134)
...|.|++|.-.-++++
T Consensus 67 av~V~CP~C~K~TKmLG 83 (114)
T PF11023_consen 67 AVQVECPNCGKQTKMLG 83 (114)
T ss_pred ceeeECCCCCChHhhhc
Confidence 45788888887666544
No 160
>PRK00810 nifW nitrogenase stabilizing/protective protein; Provisional
Probab=27.63 E-value=83 Score=21.28 Aligned_cols=28 Identities=11% Similarity=0.186 Sum_probs=15.1
Q ss_pred ccccccCCCCC-----CHHHHHHHHHHHHHHcC
Q psy7548 3 HFEILNCNVNS-----TLEEIKENYKSLILKYH 30 (134)
Q Consensus 3 ~Y~iLgv~~~a-----s~~~Ik~ay~~l~~~~H 30 (134)
|++.|||+-+. +.-.|=+.|......-.
T Consensus 21 ff~ff~V~YDp~vvnV~RLHILKrF~~yL~~~~ 53 (113)
T PRK00810 21 FFQLLGVPYDPKVVNVARLHILKRMGQYLAQED 53 (113)
T ss_pred HHHHhCCCCCHHHHHHhHHHHHHHHHHHHHhcc
Confidence 56667776643 33345556655544443
No 161
>PF13451 zf-trcl: Probable zinc-binding domain
Probab=27.51 E-value=11 Score=21.66 Aligned_cols=33 Identities=24% Similarity=0.535 Sum_probs=19.9
Q ss_pred eeC-CCCCeEEEecccccCCCCCCc--eeEeCCCCc
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDN--ILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~--~~v~C~~CS 126 (134)
..| .||..|+++..+-.--..+++ ....|.+|.
T Consensus 5 l~C~dCg~~FvfTa~EQ~fy~eKgf~n~p~RC~~CR 40 (49)
T PF13451_consen 5 LTCKDCGAEFVFTAGEQKFYAEKGFDNEPKRCPSCR 40 (49)
T ss_pred EEcccCCCeEEEehhHHHHHHhcCCcCCCccCHHHH
Confidence 467 799888887664322212223 367787774
No 162
>PRK13798 putative OHCU decarboxylase; Provisional
Probab=26.57 E-value=2.4e+02 Score=20.12 Aligned_cols=23 Identities=13% Similarity=0.070 Sum_probs=14.8
Q ss_pred CCCCHHHHHHHHHHH-----------HHHcCCCC
Q psy7548 11 VNSTLEEIKENYKSL-----------ILKYHPDK 33 (134)
Q Consensus 11 ~~as~~~Ik~ay~~l-----------~~~~HPDk 33 (134)
|-+|.+++..+.... ++..|||-
T Consensus 41 Pf~s~~~L~~a~~~~~~~~~~~~~~~~l~~HP~l 74 (166)
T PRK13798 41 PFADHDALLAAADEALAGLSEADIDEALAGHPRI 74 (166)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHhCCcc
Confidence 445666666665543 66779998
No 163
>PF04502 DUF572: Family of unknown function (DUF572) ; InterPro: IPR007590 This entry represents eukaryotic proteins with undetermined function belonging to the CWC16 family.
Probab=26.23 E-value=90 Score=24.78 Aligned_cols=15 Identities=33% Similarity=0.671 Sum_probs=13.0
Q ss_pred eEeCCCCcceEEEee
Q psy7548 119 LIACDTCSLLLEITA 133 (134)
Q Consensus 119 ~v~C~~CSl~i~v~~ 133 (134)
.+.|+.|+.+|.+.+
T Consensus 77 ~~kC~~C~~~i~~kT 91 (324)
T PF04502_consen 77 YIKCPRCSNEIEFKT 91 (324)
T ss_pred EEEcCCCCCEEeeec
Confidence 889999999998754
No 164
>PF14606 Lipase_GDSL_3: GDSL-like Lipase/Acylhydrolase family; PDB: 3SKV_B.
Probab=26.21 E-value=72 Score=23.25 Aligned_cols=28 Identities=36% Similarity=0.647 Sum_probs=18.8
Q ss_pred cccCCCCCCHHHHHHHHH---HHHHHcCCCC
Q psy7548 6 ILNCNVNSTLEEIKENYK---SLILKYHPDK 33 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~---~l~~~~HPDk 33 (134)
+|.+.+|.+.++++++.. +..+.-||++
T Consensus 64 ~ld~~~N~~~~~~~~~~~~fv~~iR~~hP~t 94 (178)
T PF14606_consen 64 VLDCGPNMSPEEFRERLDGFVKTIREAHPDT 94 (178)
T ss_dssp EEEESHHCCTTTHHHHHHHHHHHHHTT-SSS
T ss_pred EEEeecCCCHHHHHHHHHHHHHHHHHhCCCC
Confidence 455566677777777654 5677789999
No 165
>smart00440 ZnF_C2C2 C2C2 Zinc finger. Nucleic-acid-binding motif in transcriptional elongation factor TFIIS and RNA polymerases.
Probab=26.18 E-value=1.2e+02 Score=16.29 Aligned_cols=11 Identities=18% Similarity=0.724 Sum_probs=7.3
Q ss_pred eeEeCCCCcce
Q psy7548 118 ILIACDTCSLL 128 (134)
Q Consensus 118 ~~v~C~~CSl~ 128 (134)
....|..|...
T Consensus 27 ~fy~C~~C~~~ 37 (40)
T smart00440 27 VFYVCTKCGHR 37 (40)
T ss_pred EEEEeCCCCCE
Confidence 46777777653
No 166
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=25.96 E-value=53 Score=18.49 Aligned_cols=20 Identities=20% Similarity=0.316 Sum_probs=16.5
Q ss_pred cccCCCCCCHHHHHHHHHHH
Q psy7548 6 ILNCNVNSTLEEIKENYKSL 25 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l 25 (134)
|=||+++.+.++|+..+.+.
T Consensus 3 v~nlp~~~t~~~l~~~f~~~ 22 (70)
T PF00076_consen 3 VGNLPPDVTEEELRDFFSQF 22 (70)
T ss_dssp EESETTTSSHHHHHHHHHTT
T ss_pred EcCCCCcCCHHHHHHHHHHh
Confidence 34789999999999888774
No 167
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=25.86 E-value=70 Score=20.37 Aligned_cols=21 Identities=19% Similarity=0.246 Sum_probs=17.6
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7548 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l~ 26 (134)
.+-|++.|+..+||+|..++.
T Consensus 25 ~F~V~~~anK~eIK~AvE~lf 45 (84)
T PRK14548 25 TFIVDRRATKPDIKRAVEELF 45 (84)
T ss_pred EEEECCCCCHHHHHHHHHHHh
Confidence 356889999999999998873
No 168
>TIGR03180 UraD_2 OHCU decarboxylase. Previously thought to only proceed spontaneously, the decarboxylation of 2-oxo-4-hydroxy-4-carboxy--5-ureidoimidazoline (OHCU) has been recently been shown to be catalyzed by this enzyme in Mus musculus. Homologs of this enzyme are found adjacent to and fused with uricase in a number of prokaryotes and are represented by this model. This model is a separate (but related) clade from that represented by TIGR3164. This model places a second homolog in streptomyces species which (are not in the vicinity of other urate catabolism associated genes) below the trusted cutoff.
Probab=25.75 E-value=2.5e+02 Score=19.90 Aligned_cols=23 Identities=17% Similarity=0.115 Sum_probs=14.7
Q ss_pred CCCCHHHHHHHHHHH-----------HHHcCCCC
Q psy7548 11 VNSTLEEIKENYKSL-----------ILKYHPDK 33 (134)
Q Consensus 11 ~~as~~~Ik~ay~~l-----------~~~~HPDk 33 (134)
|-+|.+++..+.... ++..|||-
T Consensus 31 Pf~s~~~L~~a~~~~~~~~~~~~~~~~l~~HP~l 64 (158)
T TIGR03180 31 PFASAEALLAAADQAWQNLSEQDLFEALAGHPRI 64 (158)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHhCCcc
Confidence 445666666555443 66779998
No 169
>PF04967 HTH_10: HTH DNA binding domain; InterPro: IPR007050 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. This entry represents the HTH DNA binding domain found in Halobacterium salinarium (Halobacterium halobium) and described as a putative bacterio-opsin activator.
Probab=25.72 E-value=27 Score=20.26 Aligned_cols=21 Identities=19% Similarity=0.284 Sum_probs=12.9
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7548 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l~ 26 (134)
.|||++.+-.+.|+++-++++
T Consensus 32 ~lgis~st~~~~LRrae~kli 52 (53)
T PF04967_consen 32 ELGISKSTVSEHLRRAERKLI 52 (53)
T ss_pred HhCCCHHHHHHHHHHHHHHHh
Confidence 456666655566676666653
No 170
>PF03820 Mtc: Tricarboxylate carrier; InterPro: IPR004686 The MTC family consists of a limited number of homologues, all from eukaryotes. One member of the family has been functionally characterised as a tricarboxylate carrier from rat liver mitochondria. The rat liver mitochondrial tricarboxylate carrier has been reported to transport citrate, cis-aconitate, threo-D-isocitrate, D- and L-tartrate, malate, succinate and phosphoenolpyruvate. It presumably functions by a proton symport mechanism. The rest of the characterised proteins appear to be sideroflexins involved in iron transport.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016020 membrane
Probab=25.58 E-value=1.5e+02 Score=23.51 Aligned_cols=24 Identities=25% Similarity=0.235 Sum_probs=20.7
Q ss_pred CCCCCHHHHHHHHHHHHHHcCCCC
Q psy7548 10 NVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 10 ~~~as~~~Ik~ay~~l~~~~HPDk 33 (134)
+++.+.+++-+|.+-.--.+|||.
T Consensus 45 ~~~~~~~~lw~Ak~l~~Sa~HPDT 68 (308)
T PF03820_consen 45 PPGLTDDELWKAKKLYDSAFHPDT 68 (308)
T ss_pred CCCCCHHHHHHHHHHhhcccCCCC
Confidence 344589999999999999999997
No 171
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=25.14 E-value=37 Score=17.99 Aligned_cols=26 Identities=31% Similarity=0.698 Sum_probs=15.6
Q ss_pred eC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7548 95 PC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSL 127 (134)
Q Consensus 95 ~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl 127 (134)
.| .||..|.+....-... ..|+.|.-
T Consensus 3 ~C~~Cg~~Yh~~~~pP~~~-------~~Cd~cg~ 29 (36)
T PF05191_consen 3 ICPKCGRIYHIEFNPPKVE-------GVCDNCGG 29 (36)
T ss_dssp EETTTTEEEETTTB--SST-------TBCTTTTE
T ss_pred CcCCCCCccccccCCCCCC-------CccCCCCC
Confidence 35 6888888766532222 37887765
No 172
>PF14353 CpXC: CpXC protein
Probab=25.08 E-value=94 Score=20.80 Aligned_cols=13 Identities=15% Similarity=0.639 Sum_probs=6.4
Q ss_pred eEeCCCCcceEEE
Q psy7548 119 LIACDTCSLLLEI 131 (134)
Q Consensus 119 ~v~C~~CSl~i~v 131 (134)
.+.|+.|-....|
T Consensus 38 ~~~CP~Cg~~~~~ 50 (128)
T PF14353_consen 38 SFTCPSCGHKFRL 50 (128)
T ss_pred EEECCCCCCceec
Confidence 4555555544433
No 173
>TIGR01385 TFSII transcription elongation factor S-II. This model represents eukaryotic transcription elongation factor S-II. This protein allows stalled RNA transcription complexes to perform a cleavage of the nascent RNA and restart at the newly generated 3-prime end.
Probab=25.07 E-value=3.5e+02 Score=21.38 Aligned_cols=38 Identities=18% Similarity=0.511 Sum_probs=19.2
Q ss_pred eeeeeC-CCCCeEEEecccccCCCCC--CceeEeCCCCcceE
Q psy7548 91 MFTYPC-RCGAEFCIEEQDTEGDSSD--DNILIACDTCSLLL 129 (134)
Q Consensus 91 ~~~~~c-rcG~~~~v~~~~~~~g~~~--~~~~v~C~~CSl~i 129 (134)
+..+.| +||..=.+. -++.....+ -...+.|..|.+.-
T Consensus 256 t~~~~C~~C~~~~~~~-~q~QtrsaDEpmT~f~~C~~Cg~~w 296 (299)
T TIGR01385 256 TDLFTCGKCKQKKCTY-YQLQTRSADEPMTTFVTCEECGNRW 296 (299)
T ss_pred cccccCCCCCCccceE-EEecccCCCCCCeEEEEcCCCCCee
Confidence 344788 888421111 112222111 13478999998754
No 174
>COG5525 Bacteriophage tail assembly protein [General function prediction only]
Probab=25.04 E-value=54 Score=28.46 Aligned_cols=43 Identities=23% Similarity=0.651 Sum_probs=29.0
Q ss_pred CceeeeeeC-CCCCeEEEecccccCCCCC-----CceeEeCCCCcceEE
Q psy7548 88 ESQMFTYPC-RCGAEFCIEEQDTEGDSSD-----DNILIACDTCSLLLE 130 (134)
Q Consensus 88 ~~~~~~~~c-rcG~~~~v~~~~~~~g~~~-----~~~~v~C~~CSl~i~ 130 (134)
+...|.-+| .||..+.+..++...-... .-..+.|+.|...|+
T Consensus 222 d~rr~yvpCPHCGe~q~l~~~e~~~~~g~~~~~~~~~~~~c~h~~~~i~ 270 (611)
T COG5525 222 DQRRFYVPCPHCGEEQQLKFGEKSGPRGLKDTPAEAAFIQCEHCGCVIR 270 (611)
T ss_pred cceeEEeeCCCCCchhhccccccCCCcCcccchhhhhhhhccccCceee
Confidence 346678899 9998888777554432111 123679999999886
No 175
>PF08996 zf-DNA_Pol: DNA Polymerase alpha zinc finger; InterPro: IPR015088 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The DNA Polymerase alpha zinc finger domain adopts an alpha-helix-like structure, followed by three turns, all of which involve proline. The resulting motif is a helix-turn-helix motif, in contrast to other zinc finger domains, which show anti-parallel sheet and helix conformation. Zinc binding occurs due to the presence of four cysteine residues positioned to bind the metal centre in a tetrahedral coordination geometry. The function of this domain is uncertain: it has been proposed that the zinc finger motif may be an essential part of the DNA binding domain []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0001882 nucleoside binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication; PDB: 3FLO_D 1N5G_A 1K0P_A 1K18_A.
Probab=24.97 E-value=47 Score=24.16 Aligned_cols=41 Identities=24% Similarity=0.458 Sum_probs=19.1
Q ss_pred eeeeeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEE
Q psy7548 90 QMFTYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLE 130 (134)
Q Consensus 90 ~~~~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~ 130 (134)
..+...| .||..+.+..-.-..+.........|+.|+..+.
T Consensus 15 ~~l~~~C~~C~~~~~f~g~~~~~~~~~~~~~~~C~~C~~~~~ 56 (188)
T PF08996_consen 15 EPLKLTCPSCGTEFEFPGVFEEDGDDVSPSGLQCPNCSTPLS 56 (188)
T ss_dssp --EEEE-TTT--EEEE-SSS--SSEEEETTEEEETTT--B--
T ss_pred CceEeECCCCCCCccccccccCCccccccCcCcCCCCCCcCC
Confidence 4467888 8998888866433222222234799999987543
No 176
>PF11126 Phage_DsbA: Transcriptional regulator DsbA; InterPro: IPR020313 DsbA is a double stranded binding protein found in bacteriophage T4 which is involved in transcriptional regulation. DsbA, along with other viral proteins, interacts with the host RNA polymerase core enzyme enabling initiation of transcription. DsbA acts as an enhancer protein of late genes in vitro. The protein consists of mainly alpha helices [].
Probab=23.64 E-value=1.3e+02 Score=18.51 Aligned_cols=32 Identities=16% Similarity=0.340 Sum_probs=21.1
Q ss_pred HHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHc
Q psy7548 20 ENYKSLILKYHPDKNSEHQEMFLKINEAWNIL 51 (134)
Q Consensus 20 ~ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L 51 (134)
+-|.++...||-+-...-++.-.++.+.|+.+
T Consensus 35 k~Fnkl~~lyHk~~Re~fE~e~ee~~elYD~~ 66 (69)
T PF11126_consen 35 KMFNKLLKLYHKQEREEFEAENEEVVELYDAV 66 (69)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 45778888888887555555555666666654
No 177
>PF05781 MRVI1: MRVI1 protein; InterPro: IPR008677 This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene []. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP []. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins [].
Probab=23.53 E-value=3.4e+02 Score=23.43 Aligned_cols=25 Identities=16% Similarity=0.307 Sum_probs=22.1
Q ss_pred CCCCCCHHHHHHHHHHHHHHcCCCC
Q psy7548 9 CNVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 9 v~~~as~~~Ik~ay~~l~~~~HPDk 33 (134)
.+..++..||..+|.+|++-++-|.
T Consensus 184 ~~~s~~EkEvE~~F~~lsL~f~~D~ 208 (538)
T PF05781_consen 184 LPGSASEKEVEAEFLRLSLGFKCDR 208 (538)
T ss_pred CCCCCcHHHHHHHHHHHHHHhhhhh
Confidence 3445799999999999999999998
No 178
>PF03206 NifW: Nitrogen fixation protein NifW; InterPro: IPR004893 Nitrogenase is a complex metalloenzyme composed of two proteins designated the Fe-protein and the MoFe-protein. Apart from these two proteins, a number of accessory proteins are essential for the maturation and assembly of nitrogenase. Even though experimental evidence suggests that these accessory proteins are required for nitrogenase activity, the exact roles played by many of these proteins in the functions of nitrogenase are unclear []. Using yeast two-hybrid screening it has been shown that NifW can interact with itself as well as NifZ. ; GO: 0009399 nitrogen fixation
Probab=23.33 E-value=92 Score=20.74 Aligned_cols=30 Identities=7% Similarity=0.156 Sum_probs=20.9
Q ss_pred CccccccCCCCC-----CHHHHHHHHHHHHHHcCC
Q psy7548 2 NHFEILNCNVNS-----TLEEIKENYKSLILKYHP 31 (134)
Q Consensus 2 ~~Y~iLgv~~~a-----s~~~Ik~ay~~l~~~~HP 31 (134)
+|++.|||+-+. +.=.|=+.|.......++
T Consensus 16 dFf~fF~V~YDp~vv~V~RLHILkrF~~yL~~~~~ 50 (105)
T PF03206_consen 16 DFFDFFGVPYDPKVVNVNRLHILKRFGQYLRAADF 50 (105)
T ss_pred HHHHHhCCCcchhHHHHhhHHHHHHHHHHHHhccC
Confidence 467778888754 344577788887777766
No 179
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=22.92 E-value=1.4e+02 Score=15.90 Aligned_cols=10 Identities=20% Similarity=0.766 Sum_probs=6.2
Q ss_pred eeEeCCCCcc
Q psy7548 118 ILIACDTCSL 127 (134)
Q Consensus 118 ~~v~C~~CSl 127 (134)
....|..|..
T Consensus 27 ~fy~C~~C~~ 36 (39)
T PF01096_consen 27 LFYVCCNCGH 36 (39)
T ss_dssp EEEEESSSTE
T ss_pred EEEEeCCCCC
Confidence 3666767765
No 180
>smart00398 HMG high mobility group.
Probab=22.68 E-value=1.6e+02 Score=16.67 Aligned_cols=42 Identities=24% Similarity=0.414 Sum_probs=28.1
Q ss_pred HHHHHHHHcCCCCCHHHHHHHHHHHHHHHHccChHHHHHHHHHH
Q psy7548 21 NYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 21 ay~~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
..+.-...-||+.. ..+..+.|.+.|..|++.++....+...
T Consensus 15 ~~r~~~~~~~~~~~--~~~i~~~~~~~W~~l~~~ek~~y~~~a~ 56 (70)
T smart00398 15 ENRAKIKAENPDLS--NAEISKKLGERWKLLSEEEKAPYEEKAK 56 (70)
T ss_pred HHHHHHHHHCcCCC--HHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 34555566688863 3456678999999999766655554433
No 181
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=22.40 E-value=1e+02 Score=16.77 Aligned_cols=15 Identities=27% Similarity=0.643 Sum_probs=12.6
Q ss_pred ceeEeCCCCcceEEE
Q psy7548 117 NILIACDTCSLLLEI 131 (134)
Q Consensus 117 ~~~v~C~~CSl~i~v 131 (134)
...+.|++|..|.+.
T Consensus 12 ~~~i~C~~C~~~~H~ 26 (51)
T PF00628_consen 12 GDMIQCDSCNRWYHQ 26 (51)
T ss_dssp SSEEEBSTTSCEEET
T ss_pred CCeEEcCCCChhhCc
Confidence 468999999999874
No 182
>COG2816 NPY1 NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding [DNA replication, recombination, and repair]
Probab=21.93 E-value=50 Score=25.90 Aligned_cols=28 Identities=25% Similarity=0.444 Sum_probs=18.6
Q ss_pred eeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7548 93 TYPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLL 129 (134)
Q Consensus 93 ~~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i 129 (134)
...| +||+.......+. ...|+.|.+.+
T Consensus 111 ~RFCg~CG~~~~~~~~g~---------~~~C~~cg~~~ 139 (279)
T COG2816 111 HRFCGRCGTKTYPREGGW---------ARVCPKCGHEH 139 (279)
T ss_pred CcCCCCCCCcCccccCce---------eeeCCCCCCcc
Confidence 3567 8998766555533 35799998753
No 183
>PF09963 DUF2197: Uncharacterized protein conserved in bacteria (DUF2197); InterPro: IPR019241 This family represents various hypothetical bacterial proteins with no known function.
Probab=21.60 E-value=31 Score=20.41 Aligned_cols=35 Identities=20% Similarity=0.329 Sum_probs=25.0
Q ss_pred CCCCeEEEecccccCC--CCCCceeEeCCCCcceEEE
Q psy7548 97 RCGAEFCIEEQDTEGD--SSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 97 rcG~~~~v~~~~~~~g--~~~~~~~v~C~~CSl~i~v 131 (134)
=|+..+.|..+.+..- ..+.+....|+.|+..|..
T Consensus 7 lCdk~~~Id~~~~~aKrLrnrPi~tYmC~eC~~RI~~ 43 (56)
T PF09963_consen 7 LCDKKEEIDEDTPEAKRLRNRPIHTYMCDECKERIRE 43 (56)
T ss_pred ecCCEEEeccCCHHHHHhhcCCCcceeChhHHHHHhH
Confidence 5788888877765543 2235668999999988764
No 184
>PF08447 PAS_3: PAS fold; InterPro: IPR013655 The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs []. The PAS fold appears in archaea, eubacteria and eukarya. The PAS domain contains a sensory box, or S-box domain that occupies the central portion of the PAS domain but is more widely distributed. It is often tandemly repeated. Known prosthetic groups bound in the S-box domain include haem in the oxygen sensor FixL [], FAD in the redox potential sensor NifL [], and a 4-hydroxycinnamyl chromophore in photoactive yellow protein []. Proteins containing the domain often contain other regulatory domains such as response regulator or sensor histidine kinase domains. Other S-box proteins include phytochromes and the aryl hydrocarbon receptor nuclear translocator. This domain has been found in the gene product of the madA gene of the filamentous zygomycete fungus Phycomyces blakesleeanus. It has been shown that MadA encodes a blue-light photoreceptor for phototropism and other light responses. The gene is involved in the phototropic responses associated with sporangiophore growth; they exhibit phototropism by bending toward near-UV and blue wavelengths and away from far-UV wavelengths in a manner that is physiologically similar to plant phototropic responses [].; GO: 0005515 protein binding; PDB: 3NJA_D 3H9W_A 3GDI_B 3ICY_A 3EEH_A 3MR0_B.
Probab=21.43 E-value=35 Score=20.43 Aligned_cols=28 Identities=36% Similarity=0.716 Sum_probs=18.0
Q ss_pred CccccccCCCCCCHHHH-HHHHHHHHHHcCCCC
Q psy7548 2 NHFEILNCNVNSTLEEI-KENYKSLILKYHPDK 33 (134)
Q Consensus 2 ~~Y~iLgv~~~as~~~I-k~ay~~l~~~~HPDk 33 (134)
+++++||.++ +++ ...........|||-
T Consensus 7 ~~~~i~G~~~----~~~~~~~~~~~~~~ihpdD 35 (91)
T PF08447_consen 7 NFYEIFGYSP----EEIGKPDFEEWLERIHPDD 35 (91)
T ss_dssp HHHHHHTS-H----HHHTCBEHHHHHHHB-TTT
T ss_pred HHHHHhCCCH----HHhccCCHHHHHhhcCHHH
Confidence 4577887765 455 555666777899995
No 185
>TIGR03164 UHCUDC OHCU decarboxylase. Previously thought to only proceed spontaneously, the decarboxylation of 2-oxo-4-hydroxy-4-carboxy--5-ureidoimidazoline (OHCU) has been recently been shown to be catalyzed by this enzyme in Mus musculus. Homologs of this enzyme are found adjacent to and fused with uricase in a number of prokaryotes and are represented by this model.
Probab=21.32 E-value=3.1e+02 Score=19.38 Aligned_cols=23 Identities=17% Similarity=0.420 Sum_probs=14.0
Q ss_pred CCCCHHHHHHHHHHH-----------HHHcCCCC
Q psy7548 11 VNSTLEEIKENYKSL-----------ILKYHPDK 33 (134)
Q Consensus 11 ~~as~~~Ik~ay~~l-----------~~~~HPDk 33 (134)
|-+|.+.+..+.... ++..|||-
T Consensus 29 Pf~s~~~L~~a~~~~~~~~~~~~~~~ll~~HP~L 62 (157)
T TIGR03164 29 PFDSIEDLHAAMVGAVRAASPEQQLALIRAHPDL 62 (157)
T ss_pred CCCCHHHHHHHHHHHHHHCCHHHHHHHHHhCCcc
Confidence 445566666555443 46678887
No 186
>PHA02150 hypothetical protein
Probab=21.02 E-value=49 Score=20.24 Aligned_cols=15 Identities=20% Similarity=0.364 Sum_probs=11.5
Q ss_pred eEeCCCCcceEEEee
Q psy7548 119 LIACDTCSLLLEITA 133 (134)
Q Consensus 119 ~v~C~~CSl~i~v~~ 133 (134)
-+.|..||-|+.|.+
T Consensus 38 d~~~~~~sd~~ei~g 52 (77)
T PHA02150 38 DLSCEGMTDWAEING 52 (77)
T ss_pred hcccccccceeEECc
Confidence 368888888888754
No 187
>COG4888 Uncharacterized Zn ribbon-containing protein [General function prediction only]
Probab=20.36 E-value=1.8e+02 Score=19.29 Aligned_cols=35 Identities=26% Similarity=0.524 Sum_probs=20.8
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7548 94 YPC-RCGAEFCIEEQDTEGDSSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 94 ~~c-rcG~~~~v~~~~~~~g~~~~~~~v~C~~CSl~i~v 131 (134)
+.| +||..-.++-. +... .....+.|..|-+....
T Consensus 23 FtCp~Cghe~vs~ct-vkk~--~~~g~~~Cg~CGls~e~ 58 (104)
T COG4888 23 FTCPRCGHEKVSSCT-VKKT--VNIGTAVCGNCGLSFEC 58 (104)
T ss_pred EecCccCCeeeeEEE-EEec--CceeEEEcccCcceEEE
Confidence 567 89865444211 2222 22457899999887654
No 188
>smart00427 H2B Histone H2B.
Probab=20.12 E-value=1.1e+02 Score=19.85 Aligned_cols=11 Identities=27% Similarity=0.585 Sum_probs=9.2
Q ss_pred HHHHHHcCCCC
Q psy7548 23 KSLILKYHPDK 33 (134)
Q Consensus 23 ~~l~~~~HPDk 33 (134)
.+..++.|||.
T Consensus 8 ~kvLKqVhpd~ 18 (89)
T smart00427 8 YKVLKQVHPDT 18 (89)
T ss_pred HHHHHHhCCCc
Confidence 46678999998
No 189
>PF00276 Ribosomal_L23: Ribosomal protein L23; InterPro: IPR013025 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This domain is found in both eukaryotic L25 and prokaryotic and eukaryotic L23 proteins.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3DF4_T 3DF2_T 4A1E_R 4A1A_R 4A1C_R 4A17_R 2HGU_W 2HGQ_W 1N88_A 2HGJ_W ....
Probab=20.03 E-value=1e+02 Score=19.61 Aligned_cols=20 Identities=25% Similarity=0.366 Sum_probs=16.2
Q ss_pred cccCCCCCCHHHHHHHHHHH
Q psy7548 6 ILNCNVNSTLEEIKENYKSL 25 (134)
Q Consensus 6 iLgv~~~as~~~Ik~ay~~l 25 (134)
.+-|+++|+..+||++..++
T Consensus 24 tF~V~~~atK~~Ik~aie~i 43 (91)
T PF00276_consen 24 TFEVDPRATKTEIKEAIEKI 43 (91)
T ss_dssp EEEETTTSTHHHHHHHHHHH
T ss_pred EEEEeCCCCHHHHHHHHHhh
Confidence 45688899999999888776
Done!