Query psy7114
Match_columns 134
No_of_seqs 178 out of 2226
Neff 8.0
Searched_HMMs 46136
Date Fri Aug 16 18:30:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7114.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7114hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0484 DnaJ DnaJ-class molecu 100.0 2.4E-30 5.2E-35 203.4 8.2 126 1-126 4-149 (371)
2 KOG0712|consensus 99.9 5.9E-27 1.3E-31 182.1 9.3 123 2-126 5-134 (337)
3 PTZ00037 DnaJ_C chaperone prot 99.9 4E-25 8.7E-30 178.0 7.7 123 1-126 28-157 (421)
4 KOG0713|consensus 99.9 9.2E-25 2E-29 168.6 7.5 69 1-69 16-87 (336)
5 PRK14296 chaperone protein Dna 99.9 1.7E-24 3.7E-29 172.2 7.8 67 1-67 4-72 (372)
6 PRK14288 chaperone protein Dna 99.9 1.4E-24 3.1E-29 172.5 7.3 67 1-67 3-72 (369)
7 PRK14286 chaperone protein Dna 99.9 1.1E-23 2.3E-28 167.7 8.2 126 1-126 4-157 (372)
8 PRK14298 chaperone protein Dna 99.9 8.5E-24 1.8E-28 168.5 7.3 126 1-126 5-148 (377)
9 PRK14287 chaperone protein Dna 99.9 9.8E-24 2.1E-28 167.8 7.5 126 1-126 4-145 (371)
10 PRK14276 chaperone protein Dna 99.9 2.7E-23 6E-28 165.8 6.8 126 1-126 4-153 (380)
11 PRK14280 chaperone protein Dna 99.9 4.7E-23 1E-27 164.2 7.8 126 1-126 4-150 (376)
12 PRK14285 chaperone protein Dna 99.9 3.9E-23 8.4E-28 164.2 7.0 126 1-126 3-153 (365)
13 PRK14278 chaperone protein Dna 99.9 6.2E-23 1.3E-27 163.6 7.1 126 1-126 3-146 (378)
14 PRK14282 chaperone protein Dna 99.9 9.9E-23 2.1E-27 162.0 8.0 67 1-67 4-74 (369)
15 PRK14297 chaperone protein Dna 99.9 1.1E-22 2.4E-27 162.2 6.7 126 1-126 4-155 (380)
16 PRK14284 chaperone protein Dna 99.9 2.6E-22 5.7E-27 160.7 8.8 67 1-67 1-70 (391)
17 PRK10767 chaperone protein Dna 99.9 2.2E-22 4.7E-27 160.1 8.2 126 1-126 4-149 (371)
18 PRK14283 chaperone protein Dna 99.9 3.7E-22 8.1E-27 159.2 8.9 67 1-67 5-73 (378)
19 PRK14277 chaperone protein Dna 99.9 2.1E-22 4.6E-27 160.9 7.4 67 1-67 5-74 (386)
20 PRK14294 chaperone protein Dna 99.9 2.7E-22 5.8E-27 159.4 7.9 126 1-126 4-151 (366)
21 PRK14301 chaperone protein Dna 99.9 1.4E-22 3.1E-27 161.3 6.3 126 1-126 4-151 (373)
22 PRK14295 chaperone protein Dna 99.9 4.2E-22 9.1E-27 159.4 8.7 66 1-66 9-81 (389)
23 PRK14291 chaperone protein Dna 99.9 4E-22 8.7E-27 159.2 6.6 67 1-67 3-71 (382)
24 TIGR02349 DnaJ_bact chaperone 99.9 8.5E-22 1.8E-26 155.9 7.2 125 2-126 1-150 (354)
25 PRK14279 chaperone protein Dna 99.8 1.7E-21 3.6E-26 156.0 7.1 65 1-65 9-76 (392)
26 PRK14292 chaperone protein Dna 99.8 2E-21 4.4E-26 154.6 6.3 126 1-126 2-146 (371)
27 PRK14289 chaperone protein Dna 99.8 7E-21 1.5E-25 152.2 8.5 67 1-67 5-74 (386)
28 PRK14300 chaperone protein Dna 99.8 3.6E-21 7.7E-26 153.3 6.6 67 1-67 3-71 (372)
29 PRK14290 chaperone protein Dna 99.8 4.9E-21 1.1E-25 152.1 6.1 66 1-66 3-72 (365)
30 PRK14293 chaperone protein Dna 99.8 9.8E-21 2.1E-25 150.8 7.4 126 1-126 3-150 (374)
31 KOG0716|consensus 99.8 5E-21 1.1E-25 144.1 5.2 70 1-70 31-103 (279)
32 PRK14299 chaperone protein Dna 99.8 1.8E-20 4E-25 144.9 7.1 66 1-66 4-71 (291)
33 PF00226 DnaJ: DnaJ domain; I 99.8 4.5E-20 9.8E-25 112.4 6.0 60 2-61 1-64 (64)
34 KOG0717|consensus 99.8 1.6E-20 3.6E-25 149.7 4.6 71 1-71 8-82 (508)
35 KOG0715|consensus 99.8 1.1E-19 2.4E-24 140.2 8.4 66 2-67 44-111 (288)
36 PRK14281 chaperone protein Dna 99.8 6.6E-20 1.4E-24 147.1 6.9 67 1-67 3-72 (397)
37 KOG0718|consensus 99.8 1E-19 2.3E-24 145.4 6.2 69 1-69 9-83 (546)
38 PRK10266 curved DNA-binding pr 99.8 4E-19 8.6E-24 138.3 6.9 65 1-65 4-70 (306)
39 KOG0691|consensus 99.8 9.2E-19 2E-23 134.9 7.2 71 1-71 5-78 (296)
40 PTZ00341 Ring-infected erythro 99.8 1.1E-18 2.3E-23 149.6 7.0 68 1-68 573-642 (1136)
41 KOG0719|consensus 99.7 1.7E-18 3.8E-23 128.0 5.8 67 1-67 14-85 (264)
42 KOG0721|consensus 99.7 5.1E-18 1.1E-22 124.3 6.4 70 2-71 100-172 (230)
43 smart00271 DnaJ DnaJ molecular 99.7 7.6E-18 1.6E-22 101.1 5.3 55 1-55 1-59 (60)
44 cd06257 DnaJ DnaJ domain or J- 99.7 2.2E-17 4.8E-22 97.3 6.1 52 2-53 1-55 (55)
45 PHA03102 Small T antigen; Revi 99.7 1E-17 2.3E-22 118.2 3.4 68 2-70 6-75 (153)
46 TIGR03835 termin_org_DnaJ term 99.7 2.8E-17 6.2E-22 138.2 6.3 68 1-68 2-71 (871)
47 KOG2923|consensus 99.7 2.6E-17 5.6E-22 98.2 3.8 56 74-133 3-58 (67)
48 KOG0720|consensus 99.7 8.1E-18 1.8E-22 134.3 2.0 69 1-69 235-305 (490)
49 PF05207 zf-CSL: CSL zinc fing 99.7 3.6E-17 7.9E-22 97.0 3.3 53 77-133 2-54 (55)
50 PRK05014 hscB co-chaperone Hsc 99.7 5.5E-17 1.2E-21 116.8 4.8 67 1-67 1-77 (171)
51 COG2214 CbpA DnaJ-class molecu 99.7 1.4E-16 3.1E-21 116.0 6.8 64 1-64 6-73 (237)
52 PRK01356 hscB co-chaperone Hsc 99.6 7.1E-16 1.5E-20 110.6 6.9 67 1-67 2-76 (166)
53 PRK00294 hscB co-chaperone Hsc 99.6 4.3E-16 9.3E-21 112.3 5.5 68 1-68 4-81 (173)
54 PRK03578 hscB co-chaperone Hsc 99.6 5.7E-16 1.2E-20 112.0 4.7 68 1-68 6-83 (176)
55 KOG0624|consensus 99.5 4.8E-15 1E-19 116.1 4.9 66 1-66 394-465 (504)
56 COG5216 Uncharacterized conser 99.5 1.6E-14 3.4E-19 85.1 4.7 56 74-133 3-58 (67)
57 KOG0550|consensus 99.5 1.3E-14 2.8E-19 115.3 5.4 71 1-71 373-447 (486)
58 KOG0722|consensus 99.5 1.6E-14 3.6E-19 108.5 3.7 68 1-68 33-102 (329)
59 COG5407 SEC63 Preprotein trans 99.5 5.2E-14 1.1E-18 112.8 4.9 70 2-71 99-176 (610)
60 PTZ00100 DnaJ chaperone protei 99.5 4.7E-14 1E-18 95.1 3.9 50 2-52 66-115 (116)
61 PRK09430 djlA Dna-J like membr 99.4 2.2E-13 4.7E-18 104.4 4.7 52 2-53 201-262 (267)
62 KOG0714|consensus 99.4 2E-13 4.3E-18 103.6 3.6 65 1-65 3-71 (306)
63 PHA02624 large T antigen; Prov 99.4 2.5E-13 5.4E-18 113.1 4.5 58 2-60 12-71 (647)
64 PRK01773 hscB co-chaperone Hsc 99.4 1.8E-12 3.9E-17 93.5 6.8 65 1-65 2-76 (173)
65 TIGR00714 hscB Fe-S protein as 99.2 1.5E-11 3.2E-16 87.5 4.9 55 13-67 3-65 (157)
66 KOG1150|consensus 99.1 1.4E-10 3E-15 84.6 6.1 60 2-61 54-117 (250)
67 COG5269 ZUO1 Ribosome-associat 99.0 1.5E-10 3.3E-15 88.0 3.3 69 2-70 44-120 (379)
68 KOG0568|consensus 98.6 2.4E-08 5.2E-13 74.7 3.4 53 1-53 47-102 (342)
69 KOG1789|consensus 98.5 1.1E-07 2.5E-12 83.3 5.3 51 3-53 1283-1337(2235)
70 KOG0723|consensus 98.5 2.6E-07 5.5E-12 61.1 5.1 47 5-54 60-108 (112)
71 KOG3192|consensus 98.0 1.3E-05 2.7E-10 56.6 4.7 70 1-70 8-87 (168)
72 COG1076 DjlA DnaJ-domain-conta 97.5 9.1E-05 2E-09 53.4 2.9 50 2-51 114-173 (174)
73 KOG0431|consensus 97.3 0.00027 5.8E-09 58.2 4.0 44 7-50 394-447 (453)
74 COG1076 DjlA DnaJ-domain-conta 97.0 0.00062 1.3E-08 49.1 2.8 64 2-65 2-75 (174)
75 PF03656 Pam16: Pam16; InterP 96.2 0.0091 2E-07 41.1 4.5 50 5-55 62-111 (127)
76 COG1996 RPC10 DNA-directed RNA 96.1 0.013 2.8E-07 33.6 3.8 35 91-134 4-39 (49)
77 TIGR02098 MJ0042_CXXC MJ0042 f 95.4 0.029 6.4E-07 30.1 3.5 34 94-131 3-37 (38)
78 PF13719 zinc_ribbon_5: zinc-r 94.0 0.068 1.5E-06 28.7 2.6 34 94-131 3-37 (37)
79 PF11833 DUF3353: Protein of u 93.1 0.29 6.4E-06 36.0 5.5 42 10-56 1-42 (194)
80 PF13446 RPT: A repeated domai 92.3 0.31 6.7E-06 28.9 4.0 29 3-32 7-35 (62)
81 PF13717 zinc_ribbon_4: zinc-r 91.9 0.21 4.5E-06 26.7 2.6 32 94-129 3-35 (36)
82 PF05180 zf-DNL: DNL zinc fing 91.7 0.16 3.4E-06 31.0 2.2 35 93-131 4-41 (66)
83 smart00659 RPOLCX RNA polymera 90.4 0.72 1.6E-05 25.8 3.9 31 93-133 2-33 (44)
84 KOG0724|consensus 90.4 0.39 8.5E-06 37.8 3.9 53 12-64 3-62 (335)
85 PF05280 FlhC: Flagellar trans 88.4 0.37 8E-06 34.9 2.2 30 93-129 134-164 (175)
86 PF14687 DUF4460: Domain of un 87.9 1.2 2.7E-05 29.8 4.4 23 11-33 4-26 (112)
87 PRK00398 rpoP DNA-directed RNA 87.7 1.6 3.5E-05 24.2 4.1 30 93-131 3-33 (46)
88 PF09723 Zn-ribbon_8: Zinc rib 87.2 0.79 1.7E-05 25.2 2.6 32 90-127 2-34 (42)
89 TIGR01206 lysW lysine biosynth 85.4 1.7 3.7E-05 25.4 3.5 32 94-132 3-35 (54)
90 smart00834 CxxC_CXXC_SSSS Puta 85.4 1.2 2.7E-05 23.7 2.8 32 91-128 3-35 (41)
91 PRK12860 transcriptional activ 83.7 0.79 1.7E-05 33.6 1.9 28 93-127 134-162 (189)
92 KOG3277|consensus 83.5 2 4.3E-05 30.5 3.8 37 91-131 77-116 (165)
93 PRK12722 transcriptional activ 83.1 0.87 1.9E-05 33.4 1.9 28 93-127 134-162 (187)
94 TIGR02605 CxxC_CxxC_SSSS putat 81.8 1.8 4E-05 24.4 2.6 31 91-127 3-34 (52)
95 PF03604 DNA_RNApol_7kD: DNA d 74.6 3.4 7.5E-05 21.4 2.1 30 94-133 1-31 (32)
96 PF08271 TF_Zn_Ribbon: TFIIB z 73.3 5 0.00011 21.9 2.7 29 94-130 1-30 (43)
97 COG5552 Uncharacterized conser 71.4 17 0.00037 22.7 5.0 33 1-33 3-39 (88)
98 KOG3442|consensus 69.6 9.4 0.0002 26.2 3.8 47 5-52 63-109 (132)
99 PF02216 B: B domain; InterPr 68.1 13 0.00027 21.7 3.6 31 3-36 13-43 (54)
100 KOG1779|consensus 67.8 13 0.00029 23.4 4.0 38 90-132 29-66 (84)
101 KOG2324|consensus 67.0 17 0.00036 29.7 5.3 29 92-127 226-255 (457)
102 PF12434 Malate_DH: Malate deh 67.0 9.3 0.0002 19.1 2.5 18 14-31 9-26 (28)
103 PF07709 SRR: Seven Residue Re 65.8 5.7 0.00012 16.5 1.5 13 40-52 2-14 (14)
104 PF09297 zf-NADH-PPase: NADH p 61.7 8.7 0.00019 19.5 2.0 26 95-129 5-31 (32)
105 TIGR00577 fpg formamidopyrimid 60.8 24 0.00052 27.1 5.1 26 94-126 246-272 (272)
106 COG3195 Uncharacterized protei 59.7 34 0.00074 24.7 5.3 40 10-49 41-117 (176)
107 COG1579 Zn-ribbon protein, pos 58.2 3.5 7.7E-05 31.3 0.2 33 97-129 199-231 (239)
108 PF13831 PHD_2: PHD-finger; PD 57.5 9.1 0.0002 20.2 1.7 14 118-131 3-16 (36)
109 PRK13945 formamidopyrimidine-D 57.4 35 0.00075 26.4 5.5 26 94-126 255-281 (282)
110 PRK03824 hypA hydrogenase nick 55.8 16 0.00034 25.2 3.1 13 93-105 70-83 (135)
111 PRK14811 formamidopyrimidine-D 55.2 43 0.00094 25.7 5.7 27 94-127 236-263 (269)
112 PRK01103 formamidopyrimidine/5 55.0 43 0.00094 25.7 5.7 26 94-126 246-272 (274)
113 PF10041 DUF2277: Uncharacteri 53.0 55 0.0012 20.5 5.6 32 2-33 4-35 (78)
114 PRK14810 formamidopyrimidine-D 51.6 55 0.0012 25.2 5.8 26 94-126 245-271 (272)
115 COG0266 Nei Formamidopyrimidin 50.8 58 0.0013 25.3 5.7 26 94-126 246-272 (273)
116 COG5415 Predicted integral mem 50.0 57 0.0012 24.6 5.3 29 93-129 192-224 (251)
117 PF07295 DUF1451: Protein of u 49.2 18 0.00038 25.5 2.5 27 93-128 112-139 (146)
118 PF04216 FdhE: Protein involve 49.0 11 0.00024 29.2 1.6 17 117-133 236-252 (290)
119 PRK06266 transcription initiat 46.5 11 0.00023 27.3 1.1 29 92-128 116-145 (178)
120 smart00531 TFIIE Transcription 44.3 16 0.00035 25.4 1.7 37 92-131 98-135 (147)
121 PF08274 PhnA_Zn_Ribbon: PhnA 44.0 14 0.00031 18.8 1.0 23 97-129 7-29 (30)
122 PRK00762 hypA hydrogenase nick 43.6 17 0.00036 24.7 1.6 35 94-132 71-106 (124)
123 cd01388 SOX-TCF_HMG-box SOX-TC 43.2 71 0.0015 19.0 5.3 43 20-64 14-56 (72)
124 TIGR00143 hypF [NiFe] hydrogen 42.4 16 0.00034 32.2 1.7 26 87-112 81-110 (711)
125 PF04502 DUF572: Family of unk 41.5 39 0.00085 26.7 3.7 15 119-133 77-91 (324)
126 TIGR00373 conserved hypothetic 41.2 14 0.00029 26.2 1.0 29 93-129 109-138 (158)
127 PF11331 DUF3133: Protein of u 40.8 41 0.00088 18.9 2.7 17 117-133 29-45 (46)
128 PF14354 Lar_restr_allev: Rest 40.7 42 0.00091 19.3 2.9 33 94-127 4-37 (61)
129 PF12662 cEGF: Complement Clr- 38.9 11 0.00024 18.2 0.1 14 94-107 2-15 (24)
130 PF07739 TipAS: TipAS antibiot 38.3 96 0.0021 19.9 4.7 49 8-62 51-100 (118)
131 COG0089 RplW Ribosomal protein 37.5 33 0.00072 22.3 2.2 21 6-26 25-45 (94)
132 COG0068 HypF Hydrogenase matur 37.3 24 0.00053 31.1 2.0 28 85-112 112-143 (750)
133 PRK11032 hypothetical protein; 36.9 1.1E+02 0.0024 21.8 5.1 26 93-127 124-150 (160)
134 TIGR01562 FdhE formate dehydro 36.4 31 0.00068 27.2 2.4 17 117-133 250-266 (305)
135 PF01155 HypA: Hydrogenase exp 35.9 23 0.0005 23.5 1.4 29 93-132 70-100 (113)
136 PF09855 DUF2082: Nucleic-acid 34.8 49 0.0011 19.9 2.5 10 118-127 35-44 (64)
137 PF00505 HMG_box: HMG (high mo 34.0 95 0.0021 17.8 5.3 42 20-63 13-54 (69)
138 PRK13798 putative OHCU decarbo 33.9 1.7E+02 0.0038 20.8 5.9 24 10-33 40-74 (166)
139 PF14706 Tnp_DNA_bind: Transpo 33.7 71 0.0015 18.8 3.1 39 17-55 16-57 (58)
140 TIGR03180 UraD_2 OHCU decarbox 33.3 1.7E+02 0.0038 20.6 5.9 24 10-33 30-64 (158)
141 PF06107 DUF951: Bacterial pro 33.1 69 0.0015 18.9 2.9 30 97-133 15-45 (57)
142 cd01780 PLC_epsilon_RA Ubiquit 32.6 55 0.0012 21.3 2.6 32 2-33 12-43 (93)
143 cd01390 HMGB-UBF_HMG-box HMGB- 32.0 1E+02 0.0022 17.4 5.3 41 22-64 15-55 (66)
144 PTZ00043 cytochrome c oxidase 31.8 2.4E+02 0.0051 21.6 7.2 37 16-52 96-132 (268)
145 cd00084 HMG-box High Mobility 31.3 1E+02 0.0022 17.3 5.3 43 19-63 12-54 (66)
146 PF05876 Terminase_GpA: Phage 31.2 41 0.0009 28.6 2.5 43 87-129 194-239 (557)
147 COG2331 Uncharacterized protei 31.2 9.1 0.0002 24.0 -1.1 33 89-127 8-41 (82)
148 COG2879 Uncharacterized small 30.6 94 0.002 18.7 3.2 21 21-41 27-47 (65)
149 PF06827 zf-FPG_IleRS: Zinc fi 30.1 32 0.00069 17.0 1.0 9 118-126 20-28 (30)
150 KOG0527|consensus 30.0 1.1E+02 0.0023 24.6 4.4 42 20-63 75-116 (331)
151 PF14353 CpXC: CpXC protein 29.8 61 0.0013 21.6 2.7 26 92-117 37-63 (128)
152 CHL00030 rpl23 ribosomal prote 29.8 54 0.0012 21.2 2.3 21 6-26 23-43 (93)
153 KOG3960|consensus 29.8 35 0.00076 26.3 1.6 16 38-53 127-142 (284)
154 smart00661 RPOL9 RNA polymeras 29.3 62 0.0013 17.8 2.3 9 120-128 21-29 (52)
155 PF14951 DUF4503: Domain of un 28.5 32 0.0007 27.9 1.3 29 93-129 274-304 (389)
156 PF15178 TOM_sub5: Mitochondri 28.3 1.2E+02 0.0026 17.2 3.2 24 4-27 2-25 (51)
157 TIGR03164 UHCUDC OHCU decarbox 28.2 2.2E+02 0.0047 20.1 5.9 24 10-33 28-62 (157)
158 PF08996 zf-DNA_Pol: DNA Polym 28.2 38 0.00083 24.5 1.5 40 91-130 16-56 (188)
159 PTZ00199 high mobility group p 28.0 1.6E+02 0.0035 18.6 5.6 40 23-62 38-77 (94)
160 COG5134 Uncharacterized conser 28.0 32 0.00069 26.0 1.1 13 120-132 80-92 (272)
161 TIGR03636 L23_arch archaeal ri 27.6 64 0.0014 20.1 2.3 21 6-26 18-38 (77)
162 PF00628 PHD: PHD-finger; Int 27.4 74 0.0016 17.3 2.4 15 117-131 12-26 (51)
163 PRK10445 endonuclease VIII; Pr 27.3 62 0.0013 24.7 2.6 26 94-126 236-262 (263)
164 PF07227 DUF1423: Protein of u 27.1 46 0.00099 27.7 1.9 15 117-131 144-158 (446)
165 COG5525 Bacteriophage tail ass 27.1 61 0.0013 28.0 2.7 42 89-130 223-270 (611)
166 cd03031 GRX_GRX_like Glutaredo 25.5 99 0.0021 21.6 3.2 31 92-126 109-140 (147)
167 PF14803 Nudix_N_2: Nudix N-te 25.0 1E+02 0.0023 16.0 2.5 28 96-128 3-31 (34)
168 PRK03564 formate dehydrogenase 24.8 89 0.0019 24.8 3.1 17 117-133 250-266 (309)
169 TIGR00100 hypA hydrogenase nic 24.6 53 0.0011 21.9 1.6 23 94-127 71-94 (115)
170 PRK05738 rplW 50S ribosomal pr 24.0 79 0.0017 20.2 2.3 21 6-26 24-44 (92)
171 PF04967 HTH_10: HTH DNA bindi 23.7 32 0.00069 19.8 0.4 21 6-26 32-52 (53)
172 PF14616 DUF4451: Domain of un 23.5 54 0.0012 22.2 1.5 16 118-133 88-103 (124)
173 PF03820 Mtc: Tricarboxylate c 23.4 1.7E+02 0.0037 23.2 4.4 23 11-33 46-68 (308)
174 PRK02935 hypothetical protein; 23.1 43 0.00093 22.3 0.9 17 117-133 68-84 (110)
175 KOG2899|consensus 22.7 61 0.0013 25.2 1.8 35 100-134 150-184 (288)
176 PF10475 DUF2450: Protein of u 22.5 1.3E+02 0.0028 23.2 3.6 33 16-55 182-214 (291)
177 PF11023 DUF2614: Protein of u 22.3 34 0.00075 23.0 0.3 15 118-132 68-82 (114)
178 PF08447 PAS_3: PAS fold; Int 21.8 45 0.00097 19.9 0.8 28 2-33 7-35 (91)
179 PF05191 ADK_lid: Adenylate ki 21.7 47 0.001 17.5 0.7 26 95-127 3-29 (36)
180 PF08209 Sgf11: Sgf11 (transcr 21.6 1.1E+02 0.0023 15.9 2.1 14 118-131 3-16 (33)
181 smart00778 Prim_Zn_Ribbon Zinc 21.5 85 0.0018 16.7 1.7 12 95-106 5-19 (37)
182 PF11126 Phage_DsbA: Transcrip 21.5 1.6E+02 0.0034 18.1 3.0 31 21-51 36-66 (69)
183 cd01389 MATA_HMG-box MATA_HMG- 21.4 2E+02 0.0042 17.2 5.7 43 19-63 13-55 (77)
184 PF14606 Lipase_GDSL_3: GDSL-l 21.2 91 0.002 22.6 2.4 28 6-33 64-94 (178)
185 PF01096 TFIIS_C: Transcriptio 21.1 1.5E+02 0.0032 15.7 3.0 10 118-127 27-36 (39)
186 PF00076 RRM_1: RNA recognitio 21.0 76 0.0016 17.7 1.7 20 6-25 3-22 (70)
187 PRK14548 50S ribosomal protein 21.0 1E+02 0.0022 19.5 2.3 21 6-26 25-45 (84)
188 KOG2907|consensus 20.8 70 0.0015 21.6 1.6 40 89-128 70-111 (116)
189 COG3755 Uncharacterized protei 20.4 1.9E+02 0.0042 19.8 3.7 38 15-56 50-88 (127)
No 1
>COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]
Probab=99.96 E-value=2.4e-30 Score=203.38 Aligned_cols=126 Identities=25% Similarity=0.353 Sum_probs=95.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhccc--c--
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHM--N-- 73 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~--~-- 73 (134)
+|||+||||+++||.+|||+|||+||++||||+ .+.|+++|++|++||+||+||++|+.||+++......++ .
T Consensus 4 ~dyYeiLGV~k~As~~EIKkAYRkLA~kyHPD~n~g~~~AeeKFKEI~eAYEVLsD~eKRa~YD~fG~~~~~~gg~gg~g 83 (371)
T COG0484 4 RDYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRAAYDQFGHAGFKAGGFGGFG 83 (371)
T ss_pred cchhhhcCCCCCCCHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhCCHHHHHHhhccCccccccCCcCCCC
Confidence 699999999999999999999999999999999 468999999999999999999999999999998765321 1
Q ss_pred ---cc-cccCcccccccCCce--eEEeeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7114 74 ---IY-KSVSLSDMEHKGESQ--MFTYPCRCG----AEFCIEEQDTEGDGSDDNI---LIACDTCS 126 (134)
Q Consensus 74 ---~~-~~~~~~~~~~~~~~~--~~~~~crcg----~~~~v~~~~l~~g~~~~~~---~v~C~~Cs 126 (134)
.. ...++++.+|++++. .-....+-| -...|+++|++.|.+..+. .+.|++|+
T Consensus 84 ~~~fgg~~~DIF~~~FgGg~~~~~~~~~~~rG~Dl~~~l~isleEa~~G~~~~i~~~~~~~C~~C~ 149 (371)
T COG0484 84 FGGFGGDFGDIFEDFFGGGGGGRRRPNRPRRGADLRYNLEITLEEAVFGVKKEIRVTRSVTCSTCH 149 (371)
T ss_pred cCCCCCCHHHHHHHhhcCCCcccCCCCCcccCCceEEEEEeEhhhhccCceeeEecceeeECCcCC
Confidence 00 123333334542221 111123445 3788899999999876653 48888886
No 2
>KOG0712|consensus
Probab=99.94 E-value=5.9e-27 Score=182.15 Aligned_cols=123 Identities=26% Similarity=0.340 Sum_probs=96.5
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcccccccccCcc
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHMNIYKSVSLS 81 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~~~~~~~~~~ 81 (134)
.+|+||||+++||.+|||+|||+|+++|||||++.+.++|++|.+||+||+||++|..||+++++....++.......|+
T Consensus 5 ~~y~il~v~~~As~~eikkayrkla~k~HpDkn~~~~ekfkei~~AyevLsd~ekr~~yD~~g~~~~~~g~~~~g~~~f~ 84 (337)
T KOG0712|consen 5 KLYDILGVSPDASEEEIKKAYRKLALKYHPDKNPDAGEKFKEISQAYEVLSDPEKREIYDQYGEEGLQGGGGGGGFGGFS 84 (337)
T ss_pred ccceeeccCCCcCHHHHHHHHHHHHHHhCCCCCccHHHHHHHHHHHHHHhcCHHHHHHHHhhhhhhhcccCCCCCCccHH
Confidence 68999999999999999999999999999999999999999999999999999999999999998886654333333355
Q ss_pred cccccCCceeEEeeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7114 82 DMEHKGESQMFTYPCRCG----AEFCIEEQDTEGDGSDDNI---LIACDTCS 126 (134)
Q Consensus 82 ~~~~~~~~~~~~~~crcg----~~~~v~~~~l~~g~~~~~~---~v~C~~Cs 126 (134)
+||- .+......+=| | ..+.|++++++.|.+.+.. -+.|+.|+
T Consensus 85 ~~F~-~g~~~~~~~~r-g~~~~~~~~~~Le~~y~G~s~kl~l~~~~iCs~C~ 134 (337)
T KOG0712|consen 85 QFFG-FGGNGGRGRQR-GKDVVHQLKVTLEELYMGKSKKLFLSRNFICSKCS 134 (337)
T ss_pred Hhcc-CCCcCcccccc-CCCceEEEEEEHHHhhcCCccceecccCccCCcCC
Confidence 5543 22111111113 4 5899999999999777654 26677665
No 3
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=99.92 E-value=4e-25 Score=177.98 Aligned_cols=123 Identities=24% Similarity=0.391 Sum_probs=88.2
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcccccccccCc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHMNIYKSVSL 80 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~~~~~~~~~ 80 (134)
+|||+||||+++||.++||+|||+||+++|||+++. .++|++|++||++|+||.+|+.||.++...........+..++
T Consensus 28 ~d~Y~vLGV~~~As~~eIKkAYrkla~k~HPDk~~~-~e~F~~i~~AYevLsD~~kR~~YD~~G~~~~~~~~~~~d~~d~ 106 (421)
T PTZ00037 28 EKLYEVLNLSKDCTTSEIKKAYRKLAIKHHPDKGGD-PEKFKEISRAYEVLSDPEKRKIYDEYGEEGLEGGEQPADASDL 106 (421)
T ss_pred hhHHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCch-HHHHHHHHHHHHHhccHHHHHHHhhhcchhcccCCCCcchhhh
Confidence 489999999999999999999999999999999653 6899999999999999999999999987543211111111233
Q ss_pred ccccccCCceeEEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 81 SDMEHKGESQMFTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 81 ~~~~~~~~~~~~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
++.+|+++... ...+-| -...|++++++.|....+.+ +.|+.|+
T Consensus 107 f~~~Fggg~~~--~~~~rg~di~~~l~vtLee~~~G~~~~i~~~r~~~C~~C~ 157 (421)
T PTZ00037 107 FDLIFGGGRKP--GGKKRGEDIVSHLKVTLEQIYNGAMRKLAINKDVICANCE 157 (421)
T ss_pred HHHhhcccccc--ccccCCCCEEEEeeeeHHHHhCCCceEEEeeccccccccC
Confidence 33344432110 111234 25778999999997766543 5666653
No 4
>KOG0713|consensus
Probab=99.91 E-value=9.2e-25 Score=168.61 Aligned_cols=69 Identities=36% Similarity=0.622 Sum_probs=66.2
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQ 69 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~ 69 (134)
||||+||||+++||..+||+||||||+++|||| +|.|.+.|+.|+.||+||+||.+|+.||.+|++..-
T Consensus 16 rDfYelLgV~k~Asd~eIKkAYRKLALk~HPDkNpddp~A~e~F~~in~AYEVLsDpekRk~YD~~GEegL~ 87 (336)
T KOG0713|consen 16 RDFYELLGVPKNASDQEIKKAYRKLALKYHPDKNPDDPNANEKFKEINAAYEVLSDPEKRKHYDTYGEEGLK 87 (336)
T ss_pred CCHHHHhCCCCCCCHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhcCHHHHHHHHhhhHhhhc
Confidence 799999999999999999999999999999999 789999999999999999999999999999987654
No 5
>PRK14296 chaperone protein DnaJ; Provisional
Probab=99.91 E-value=1.7e-24 Score=172.21 Aligned_cols=67 Identities=28% Similarity=0.544 Sum_probs=62.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+||+++|||+ .+.+.++|++|++||+||+||.+|+.||+++...
T Consensus 4 ~dyY~~Lgv~~~a~~~eik~ayrkla~~~HPD~n~~~~a~~~F~~i~~AyevLsD~~KR~~YD~~G~~~ 72 (372)
T PRK14296 4 KDYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYDQFGHAA 72 (372)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHhcCHHHhhhhhhccchh
Confidence 489999999999999999999999999999999 3668899999999999999999999999998753
No 6
>PRK14288 chaperone protein DnaJ; Provisional
Probab=99.91 E-value=1.4e-24 Score=172.52 Aligned_cols=67 Identities=45% Similarity=0.718 Sum_probs=62.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC---HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN---SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~---~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+||+++|||++ +.+.++|++|++||+||+||.+|+.||+++...
T Consensus 3 ~dyY~vLgv~~~As~~eIkkayrkla~k~HPD~~~~~~~a~~~f~~i~~AYevLsd~~kR~~YD~~G~~~ 72 (369)
T PRK14288 3 LSYYEILEVEKHSNQETIKKSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKKRALYDRYGKKG 72 (369)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHHhccHHHHHHHHHhcccc
Confidence 4899999999999999999999999999999993 567899999999999999999999999998753
No 7
>PRK14286 chaperone protein DnaJ; Provisional
Probab=99.90 E-value=1.1e-23 Score=167.73 Aligned_cols=126 Identities=25% Similarity=0.315 Sum_probs=88.1
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc--c---
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH--M--- 72 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~--~--- 72 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||+||+||.+|+.||+++....... +
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~g~~~~~~~~~~ 83 (372)
T PRK14286 4 RSYYDILGVSKSANDEEIKSAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAYDQFGKAGVNAGAGGFGQ 83 (372)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHHHhCchhhccccCCCCC
Confidence 489999999999999999999999999999999 35688999999999999999999999999987542211 0
Q ss_pred -ccc-------cccCcccccccCCcee-----EEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 73 -NIY-------KSVSLSDMEHKGESQM-----FTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 73 -~~~-------~~~~~~~~~~~~~~~~-----~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
... ...++++.+|++.++. .....+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 84 ~~~~~~~~~~~~~~d~f~~ffgg~~~~~~~~~~~~~~~~g~di~~~l~vtLee~~~G~~k~i~~~r~~~C~~C~ 157 (372)
T PRK14286 84 GAYTDFSDIFGDFGDIFGDFFGGGRGGGSGGGRRSGPQRGSDLRYNLEVSLEDAALGREYKIEIPRLESCVDCN 157 (372)
T ss_pred CCcccccccccchhhHHHHhhCCCccCCCcccccCCCCCCCCeeEEEEEEHHHHhCCeeEEEEeeccccCCCCc
Confidence 000 0001222234321100 00111223 36788999999997766543 6666664
No 8
>PRK14298 chaperone protein DnaJ; Provisional
Probab=99.89 E-value=8.5e-24 Score=168.51 Aligned_cols=126 Identities=27% Similarity=0.407 Sum_probs=88.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc-cc---c
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH-MN---I 74 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~-~~---~ 74 (134)
+|||+||||+++|+.++||+|||+|++++|||++ +.+.++|++|++||++|+||.+|+.||+++....... +. +
T Consensus 5 ~d~y~iLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~g~~~~~~~~~~~ 84 (377)
T PRK14298 5 RDYYEILGLSKDASVEDIKKAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAEKRAQYDRFGHAGIDNQYSAEDIF 84 (377)
T ss_pred CCHHHhhCCCCCCCHHHHHHHHHHHHHHhCccccCChhHHHHHHHHHHHHHHhcchHhhhhhhhcCccccccccCccccc
Confidence 4899999999999999999999999999999993 6678999999999999999999999999986532211 00 0
Q ss_pred -----ccccCcccccccCCceeEEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 75 -----YKSVSLSDMEHKGESQMFTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 75 -----~~~~~~~~~~~~~~~~~~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
....++++.+|++.+..-....+-| -...|+++|++.|.+..+.+ +.|+.|.
T Consensus 85 ~~~~~~~~~d~f~~~Fgg~~~~~~~~~~~g~di~~~l~vslee~~~G~~~~i~~~r~~~C~~C~ 148 (377)
T PRK14298 85 RGADFGGFGDIFEMFFGGGGRRGRMGPRRGSDLRYDLYITLEEAAFGVRKDIDVPRAERCSTCS 148 (377)
T ss_pred ccCCcCcchhhhHhhhcCCCccCCCCCCCCCCEEEEEEEEHHHhhCCeEEEEEEEeeccCCCCC
Confidence 0011222323432111000112233 37788999999997665543 6666664
No 9
>PRK14287 chaperone protein DnaJ; Provisional
Probab=99.89 E-value=9.8e-24 Score=167.85 Aligned_cols=126 Identities=22% Similarity=0.375 Sum_probs=89.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc--c-c--
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH--M-N-- 73 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~--~-~-- 73 (134)
+|||+||||+++||.++||+|||+|++++|||++ +.+.++|++|++||++|+||.+|+.||+++....... + .
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HpD~~~~~~~~~~f~~i~~Ay~~L~d~~kR~~YD~~G~~~~~~~~~~~~~~ 83 (371)
T PRK14287 4 RDYYEVLGVDRNASVDEVKKAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAHYDQFGHTDPNQGFGGGGAG 83 (371)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhCcHhHHHHHHhhCCcccccccCCCCCc
Confidence 4899999999999999999999999999999994 6678999999999999999999999999987532211 0 0
Q ss_pred -cccccCcccccccCCceeE-EeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 74 -IYKSVSLSDMEHKGESQMF-TYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 74 -~~~~~~~~~~~~~~~~~~~-~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
+....++++.+|++.+... ....+-| -...|+++|++.|....+.+ +.|+.|+
T Consensus 84 ~f~~~~d~f~~~fgg~~~~~~~~~~~~g~d~~~~l~vslee~~~G~~~~i~~~r~~~C~~C~ 145 (371)
T PRK14287 84 DFGGFSDIFDMFFGGGGGRRNPNAPRQGADLQYTMTLEFKEAVFGKETEIEIPREETCGTCH 145 (371)
T ss_pred cccchHHHHHhhhccccCCCCCCCCCCCCCEEEEEEEEHHHhcCCeEEEEEEeeeccCCCCC
Confidence 0011122333344221110 0112233 36778999999997665533 6777775
No 10
>PRK14276 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=2.7e-23 Score=165.75 Aligned_cols=126 Identities=20% Similarity=0.306 Sum_probs=87.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc--c---c
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH--M---N 73 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~--~---~ 73 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++....... + .
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~~~~~~~~~~~~~ 83 (380)
T PRK14276 4 TEYYDRLGVSKDASQDEIKKAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAAYDQYGAAGANGGFGGGAGG 83 (380)
T ss_pred CCHHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcCHHHHHHHHHHHHHHhcCHhhhhhHhhcCCccccCCCCCCCCC
Confidence 489999999999999999999999999999999 45678999999999999999999999999987532110 0 0
Q ss_pred ---cc--c-ccCccc---ccccCCcee-EEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 74 ---IY--K-SVSLSD---MEHKGESQM-FTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 74 ---~~--~-~~~~~~---~~~~~~~~~-~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
+. . ..++.+ .+|++.+.. .....+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 84 ~~~~~~~~~~~~~~d~f~~~fgg~~~~~~~~~~~~g~di~~~l~vtLee~~~G~~~~i~~~~~~~C~~C~ 153 (380)
T PRK14276 84 FGGFDGSGGFGGFEDIFSSFFGGGGARRNPNAPRQGDDLQYRVNLDFEEAIFGKEKEVSYNREATCHTCN 153 (380)
T ss_pred CCCccccccccchhhHHHHHhCccccccCcCCCCCCCCEEEEEEEEHHHhcCCeEEEEEeeccccCCCCc
Confidence 00 0 011222 223321110 00112233 26788999999997665543 6676663
No 11
>PRK14280 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=4.7e-23 Score=164.23 Aligned_cols=126 Identities=23% Similarity=0.362 Sum_probs=88.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc-c--c--
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH-M--N-- 73 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~-~--~-- 73 (134)
+|||+||||+++|+.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++......+ + .
T Consensus 4 ~~~y~iLgv~~~a~~~eik~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~~G~~~~~~~~~~~~~~ 83 (376)
T PRK14280 4 RDYYEVLGVSKSASKDEIKKAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQYDQFGHAGPNQGFGGGGFG 83 (376)
T ss_pred CChHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhccHhHHHHHHhcCccccccCcCCCCCC
Confidence 489999999999999999999999999999999 45688999999999999999999999999987542211 0 0
Q ss_pred ---cccccCccc---ccccCCcee-EEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 74 ---IYKSVSLSD---MEHKGESQM-FTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 74 ---~~~~~~~~~---~~~~~~~~~-~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
+....++.+ .+|++.+.. .....+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 84 ~~~~~~~~~~~d~f~~~fgg~~~~~~~~~~~kg~di~~~l~vtLee~~~G~~~~i~~~r~~~C~~C~ 150 (376)
T PRK14280 84 GGDFGGGFGFEDIFSSFFGGGGRRRDPNAPRQGADLQYTMTLTFEEAVFGKEKEIEIPKEETCDTCH 150 (376)
T ss_pred CCCccccccchhhHHHHhCCccccCcccccccccCEEEEEEEEHHHHhCCceeEEEEeeeccCCCCC
Confidence 000001222 233321110 00112233 37788999999997766543 6677664
No 12
>PRK14285 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=3.9e-23 Score=164.15 Aligned_cols=126 Identities=23% Similarity=0.372 Sum_probs=87.2
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc-c--cc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH-M--NI 74 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~-~--~~ 74 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|..||.++....... + ..
T Consensus 3 ~d~y~iLgv~~~a~~~eIk~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yd~~g~~~~~~~~~~~~~ 82 (365)
T PRK14285 3 RDYYEILGLSKGASKDEIKKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRAQYDRFGHTAFEGGGGFEGF 82 (365)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHcCcchhHHHHhcCcchhccCCCcccc
Confidence 489999999999999999999999999999999 35678999999999999999999999999987532111 0 00
Q ss_pred -c--cc-----cCccc---ccccCCceeE-EeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 75 -Y--KS-----VSLSD---MEHKGESQMF-TYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 75 -~--~~-----~~~~~---~~~~~~~~~~-~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
. .. .++.+ .+|++..+.. ....+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 83 ~~g~~~~~~~~~~~~d~f~~~fgg~~~~~~~~~~~~g~di~~~l~vtlee~~~G~~~~i~~~r~~~C~~C~ 153 (365)
T PRK14285 83 SGGFSGFSDIFEDFGDIFDSFFTGNRGQDKNRKHEKGQDLTYQIEISLEDAYLGYKNNINITRNMLCESCL 153 (365)
T ss_pred CCCccccccccccHHHHHHHhhcCCcCCCCCcCCCCCCCEEEEEEEEHHHhhCCeEEEEEeeecccCCCCC
Confidence 0 00 01112 1233211100 0112223 36788999999997665543 6676664
No 13
>PRK14278 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=6.2e-23 Score=163.63 Aligned_cols=126 Identities=20% Similarity=0.299 Sum_probs=86.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc---cccc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH---MNIY 75 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~---~~~~ 75 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++....... +...
T Consensus 3 ~d~y~iLgv~~~a~~~eik~ayr~la~~~hpD~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~YD~~G~~~~~~~~~~~g~~ 82 (378)
T PRK14278 3 RDYYGLLGVSRNASDAEIKRAYRKLARELHPDVNPDEEAQEKFKEISVAYEVLSDPEKRRIVDLGGDPLESAGGGGGGFG 82 (378)
T ss_pred CCcceecCCCCCCCHHHHHHHHHHHHHHHCCCCCCcHHHHHHHHHHHHHHHHhchhhhhhhhhccCCccccccCCCCCCC
Confidence 489999999999999999999999999999999 45578899999999999999999999999886422110 0000
Q ss_pred ----cccCcccccccCCcee--EEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 76 ----KSVSLSDMEHKGESQM--FTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 76 ----~~~~~~~~~~~~~~~~--~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
...++++.+|++.+.. .....+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 83 ~~f~~~~d~f~~ffgg~g~~~~~~~~~~~g~d~~~~l~vtLee~~~G~~~~i~~~~~~~C~~C~ 146 (378)
T PRK14278 83 GGFGGLGDVFEAFFGGGAASRGPRGRVRPGSDSLLRMRLDLEECATGVTKQVTVDTAVLCDRCH 146 (378)
T ss_pred cCcCchhHHHHHHhCCCCCCCCCccCCCCCCCeEEEEEEEHHHhcCCeEEEEEEEeeccCCCCc
Confidence 0011222233321100 00011223 36788999999997665533 5666664
No 14
>PRK14282 chaperone protein DnaJ; Provisional
Probab=99.88 E-value=9.9e-23 Score=162.03 Aligned_cols=67 Identities=36% Similarity=0.634 Sum_probs=62.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC----HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN----SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|++++|||++ +.+.++|++|++||++|+||.+|+.||.++...
T Consensus 4 ~d~y~~lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~g~~~ 74 (369)
T PRK14282 4 KDYYEILGVSRNATQEEIKRAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKRAMYDRFGYVG 74 (369)
T ss_pred CChHHhcCCCCCCCHHHHHHHHHHHHHHHCCCCCccchhHHHHHHHHHHHHHHHhcChhhHHHHhhcCccc
Confidence 4899999999999999999999999999999993 457899999999999999999999999988754
No 15
>PRK14297 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=1.1e-22 Score=162.24 Aligned_cols=126 Identities=24% Similarity=0.344 Sum_probs=87.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC---HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc-c----
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN---SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH-M---- 72 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~---~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~-~---- 72 (134)
+|||+||||+++|+.++||+|||+|++++|||++ +.+.++|++|++||++|+||.+|+.||+++....... +
T Consensus 4 ~d~y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~~G~~~~~~~~~~~~~ 83 (380)
T PRK14297 4 KDYYEVLGLEKGASDDEIKKAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKAQYDQFGTADFNGAGGFGSG 83 (380)
T ss_pred CChHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhcCHhhhCchhhcCcccccccCCCCCC
Confidence 4899999999999999999999999999999993 5688999999999999999999999999987542110 0
Q ss_pred ccc----c----ccCcccccccCC--ce-eEEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 73 NIY----K----SVSLSDMEHKGE--SQ-MFTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 73 ~~~----~----~~~~~~~~~~~~--~~-~~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
... . ..++++.+|++. +. .-....+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 84 ~~~~~~~~~~~~~~d~f~~~fgg~~g~~~~~~~~~~kg~di~~~l~vsLee~~~G~~~~i~~~r~~~C~~C~ 155 (380)
T PRK14297 84 GFGGFDFSDMGGFGDIFDSFFGGGFGSSSRRRNGPQRGADIEYTINLTFEEAVFGVEKEISVTRNENCETCN 155 (380)
T ss_pred CCCCcCcccccchhHHHHHHhccCccccccccCCCCCCCCEEEEEEEEHHHhcCCeEEEEEeeeeccCCCcc
Confidence 000 0 001122233321 00 000112223 36888999999997666543 5666653
No 16
>PRK14284 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=2.6e-22 Score=160.66 Aligned_cols=67 Identities=34% Similarity=0.645 Sum_probs=62.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||+++...
T Consensus 1 ~d~y~iLgv~~~a~~~eikkayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~~G~~g 70 (391)
T PRK14284 1 MDYYTILGVSKTASPEEIKKAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKRESYDRYGKDG 70 (391)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhcCHHHHHHHHhccccc
Confidence 689999999999999999999999999999999 3568899999999999999999999999998753
No 17
>PRK10767 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=2.2e-22 Score=160.15 Aligned_cols=126 Identities=25% Similarity=0.428 Sum_probs=86.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhccc---cc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTHM---NI 74 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~~---~~ 74 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++........ ..
T Consensus 4 ~d~y~iLgv~~~as~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~~L~d~~~r~~yd~~g~~~~~~~~~~~~~ 83 (371)
T PRK10767 4 RDYYEVLGVSRNASEDEIKKAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRAAYDQYGHAAFEQGGGGGGF 83 (371)
T ss_pred CChHHhcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCcHHHHHHHHHHHHHHHHhcchhhhhHhhhccccccccCCCCCCC
Confidence 489999999999999999999999999999999 356789999999999999999999999998875332110 00
Q ss_pred cc--c-cC-cccc---cccCCceeEEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 75 YK--S-VS-LSDM---EHKGESQMFTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 75 ~~--~-~~-~~~~---~~~~~~~~~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
.. . .+ +.++ +|++.++.-....+-| -...|+++|++.|....+.+ +.|+.|.
T Consensus 84 ~~~~~~~~~f~~~f~~~fgg~~~~~~~~~~~g~di~~~l~vsLee~~~G~~~~v~~~r~~~C~~C~ 149 (371)
T PRK10767 84 GGGGGFGDIFGDIFGDIFGGGRGGGRQRARRGADLRYNMEITLEEAVRGVTKEIRIPTLVTCDTCH 149 (371)
T ss_pred CCccccccchhhhhhhhccCCccccCCCCCCCCCeEEEEEeehHHhhCCeeEEEeeeecccCCCCC
Confidence 00 0 01 1122 1221100000112223 35778999999997665543 6676663
No 18
>PRK14283 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=3.7e-22 Score=159.16 Aligned_cols=67 Identities=33% Similarity=0.581 Sum_probs=62.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.+|||+|||+|++++|||++ +.+.++|++|++||++|+||.+|+.||+++...
T Consensus 5 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~~Lsd~~kR~~YD~~G~~g 73 (378)
T PRK14283 5 RDYYEVLGVDRNADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYDQFGHAG 73 (378)
T ss_pred CChHHhhCCCCCCCHHHHHHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhchhHHHHHHhhhcccc
Confidence 4899999999999999999999999999999994 678999999999999999999999999988653
No 19
>PRK14277 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=2.1e-22 Score=160.94 Aligned_cols=67 Identities=40% Similarity=0.631 Sum_probs=62.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++...
T Consensus 5 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~~G~~~ 74 (386)
T PRK14277 5 KDYYEILGVDRNATEEEIKKAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRAQYDQFGHAA 74 (386)
T ss_pred CCHHHhcCCCCCCCHHHHHHHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhCCHHHHHHHHhhcccc
Confidence 489999999999999999999999999999999 3567889999999999999999999999988653
No 20
>PRK14294 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=2.7e-22 Score=159.40 Aligned_cols=126 Identities=25% Similarity=0.430 Sum_probs=86.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc---c--
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH---M-- 72 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~---~-- 72 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++....... +
T Consensus 4 ~d~y~~lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~~~~f~~~~~Ay~vL~d~~~r~~yD~~G~~g~~~~~~~~~~ 83 (366)
T PRK14294 4 RDYYEILGVTRDASEEEIKKSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRGIYDQYGHEGLSGTGFSGFS 83 (366)
T ss_pred CChHHHhCCCCCCCHHHHHHHHHHHHHHHCCCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHHhhccccccCCCCCCcC
Confidence 589999999999999999999999999999999 35678999999999999999999999999987542211 0
Q ss_pred ccccc-cCcccc---ccc-CCc-eeE-EeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 73 NIYKS-VSLSDM---EHK-GES-QMF-TYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 73 ~~~~~-~~~~~~---~~~-~~~-~~~-~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
.+... .++.++ +|+ ++. +.. ....+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 84 ~~~~~~~~~~d~f~~~fg~g~~~~~~~~~~~~~g~d~~~~l~lslee~~~G~~~~i~~~r~~~C~~C~ 151 (366)
T PRK14294 84 GFDDIFSSFGDIFEDFFGFGGGRRGRSRTAVRAGADLRYDLTLPFLEAAFGTEKEIRIQKLETCEECH 151 (366)
T ss_pred ccccchhhhhhhHHHhhccCCCcCCcccCCCCCCCCceEEEEeeHHHhcCCeEEEEEeeecccCCCCC
Confidence 00000 112222 233 111 000 0112223 35788999999997665533 5666553
No 21
>PRK14301 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=1.4e-22 Score=161.29 Aligned_cols=126 Identities=21% Similarity=0.383 Sum_probs=86.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc---ccc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH---MNI 74 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~---~~~ 74 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||+||+||.+|+.||.++....... +.+
T Consensus 4 ~~~y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~~g~~g~~~~~~~~g~ 83 (373)
T PRK14301 4 RDYYEVLGVSRDASEDEIKKAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRARYDRFGHAGVNGNGGFGGF 83 (373)
T ss_pred CChHHhcCCCCCCCHHHHHHHHHHHHHHhCCCcCCCChHHHHHHHHHHHHHHHhcchhhhhhhhhccccccccCCCCCCc
Confidence 489999999999999999999999999999999 35678899999999999999999999999987532211 111
Q ss_pred ccc----cCcccc---cccCCce--eEEeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 75 YKS----VSLSDM---EHKGESQ--MFTYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 75 ~~~----~~~~~~---~~~~~~~--~~~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
... ..+.++ +|+.++. ......+-| -...|+++|++.|.+..+.+ +.|+.|+
T Consensus 84 ~~~~~~~~~f~d~f~~~fg~g~~~~~~~~~~~~g~di~~~l~vtLee~~~G~~k~i~~~r~~~C~~C~ 151 (373)
T PRK14301 84 SSAEDIFSHFSDIFGDLFGFSGGGSRRGPRPQAGSDLRYNLTVSFRQAAKGDEVTLRIPKNVTCDDCG 151 (373)
T ss_pred ccccccccchHHHHHHHhhccCcccccCCCCCCCCCEEEEEeccHHHHhCCceEEEEeeecccCCCCC
Confidence 100 011121 1221110 000112223 25778999999997665543 6666664
No 22
>PRK14295 chaperone protein DnaJ; Provisional
Probab=99.87 E-value=4.2e-22 Score=159.36 Aligned_cols=66 Identities=35% Similarity=0.605 Sum_probs=61.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHH----Hhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYES----QLLS 66 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~----~~~~ 66 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||+ ++..
T Consensus 9 ~d~y~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~~~~~~G~~ 81 (389)
T PRK14295 9 KDYYKVLGVPKDATEAEIKKAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRKEYDEARSLFGNG 81 (389)
T ss_pred cCHHHhcCCCCCCCHHHHHHHHHHHHHHHCCCcCCCchhHHHHHHHHHHHHHHHCchhhHHHHHHHHhhhccc
Confidence 489999999999999999999999999999999 34678999999999999999999999998 6654
No 23
>PRK14291 chaperone protein DnaJ; Provisional
Probab=99.86 E-value=4e-22 Score=159.16 Aligned_cols=67 Identities=45% Similarity=0.682 Sum_probs=62.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++|+.++||+|||+|++++|||++ +.+.++|++|++||++|+||.+|+.||.++...
T Consensus 3 ~d~Y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vLsd~~kR~~YD~~g~~~ 71 (382)
T PRK14291 3 KDYYEILGVSRNATQEEIKKAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKLYDQFGHAA 71 (382)
T ss_pred CCHHHhhCCCCCCCHHHHHHHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhcCHHHHHHHhhhcccc
Confidence 5899999999999999999999999999999994 567899999999999999999999999998754
No 24
>TIGR02349 DnaJ_bact chaperone protein DnaJ. This model represents bacterial forms of DnaJ, part of the DnaK-DnaJ-GrpE chaperone system. The three components typically are encoded by consecutive genes. DnaJ homologs occur in many genomes, typically not near DnaK and GrpE-like genes; most such genes are not included by this family. Eukaryotic (mitochondrial and chloroplast) forms are not included in the scope of this family.
Probab=99.85 E-value=8.5e-22 Score=155.89 Aligned_cols=125 Identities=26% Similarity=0.351 Sum_probs=87.1
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc-c----cc
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN--SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH-M----NI 74 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~--~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~-~----~~ 74 (134)
|||+||||+++|+.++||+|||+|++++|||+. +.+.++|++|++||++|+|+.+|..||.++....... + ..
T Consensus 1 d~y~~Lgv~~~a~~~~ik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~~R~~yd~~g~~~~~~~~~~~~~~~ 80 (354)
T TIGR02349 1 DYYEILGVSKDASEEEIKKAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQYDQFGHAGFNGGGGGGGGGF 80 (354)
T ss_pred ChHHhCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhhChHHHHhhhhcccccccccCcCCCCCc
Confidence 799999999999999999999999999999994 5678999999999999999999999999887543211 0 00
Q ss_pred c--------cccCcccccccCCce-eE--EeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 75 Y--------KSVSLSDMEHKGESQ-MF--TYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 75 ~--------~~~~~~~~~~~~~~~-~~--~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
. ...++.+.+|++..+ .. ....+-| -.+.|+++|++.|.+..+.+ +.|+.|+
T Consensus 81 ~~~~~~~~~~~~~~f~~~fg~~~g~~~~~~~~~~~~~d~~~~l~vsLee~~~G~~~~i~~~r~~~C~~C~ 150 (354)
T TIGR02349 81 NGFDIGFFGDFGDIFGDFFGGGGGSGRRRRSGPRRGEDLRYDLELTFEEAVFGVEKEIEIPRKESCETCH 150 (354)
T ss_pred CCccccCcCchhhhHHHHhccCcccCccccCCCCCCCCeEEEEEEEHHHHhCCeeEEEEeecCCcCCCCC
Confidence 0 001122223332110 00 0112223 36788999999997665543 6666664
No 25
>PRK14279 chaperone protein DnaJ; Provisional
Probab=99.85 E-value=1.7e-21 Score=156.04 Aligned_cols=65 Identities=29% Similarity=0.529 Sum_probs=61.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+|||+||||+++|+.++||+|||+|++++|||+ .+.+.++|++|++||++|+||++|+.||+++.
T Consensus 9 ~Dyy~~Lgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vLsD~~KR~~YD~~G~ 76 (392)
T PRK14279 9 KDFYKELGVSSDASAEEIKKAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYDETRR 76 (392)
T ss_pred cCHHHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCChHHHHHHHHHHHHHHHhcchhhhhHHHHhhh
Confidence 599999999999999999999999999999999 34688999999999999999999999999975
No 26
>PRK14292 chaperone protein DnaJ; Provisional
Probab=99.84 E-value=2e-21 Score=154.61 Aligned_cols=126 Identities=25% Similarity=0.381 Sum_probs=88.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhc--c-cccc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT--H-MNIY 75 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~--~-~~~~ 75 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|+.|++||++|+||.+|+.||.++...... . ++..
T Consensus 2 ~d~y~~Lgv~~~a~~~~ik~ayr~l~~~~hpD~~~~~~a~~~~~~i~~Ay~vL~d~~~r~~yd~~G~~~~~~~~~~~~~~ 81 (371)
T PRK14292 2 MDYYELLGVSRTASADEIKSAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRAHYDRFGTAPGAGMPGGDPFG 81 (371)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCCChhHHHHHHHHHHHHHHhcchhhhhhHhhcCCcccccccCCcccC
Confidence 699999999999999999999999999999999 4568899999999999999999999999998753211 0 0100
Q ss_pred ----cccCcccccccCCc--eeEE-eeCCCC----CeEEEecccccCCCCCCce---eEeCCCCc
Q psy7114 76 ----KSVSLSDMEHKGES--QMFT-YPCRCG----AEFCIEEQDTEGDGSDDNI---LIACDTCS 126 (134)
Q Consensus 76 ----~~~~~~~~~~~~~~--~~~~-~~crcg----~~~~v~~~~l~~g~~~~~~---~v~C~~Cs 126 (134)
+..++++.+|++.. .... ...+-| -...++++|++.|.+.... .+.|+.|.
T Consensus 82 ~~~~d~~d~f~~~fg~~~~~~~~~~~~~~~g~d~~~~l~~sLee~~~G~~~~v~~~r~~~C~~C~ 146 (371)
T PRK14292 82 GMGFDPMDIFEQLFGGAGFGGGRGRRGPARGDDLETEARITLEQARAGEEVEVEVDRLTECEHCH 146 (371)
T ss_pred ccCCChHHHHHHhhCCCCcCCCCCcccccCCCCeEEEEeccHHHHcCCeEEEEEEEeeecCCCCc
Confidence 00112222233211 0000 111223 3667789999999766554 37788874
No 27
>PRK14289 chaperone protein DnaJ; Provisional
Probab=99.84 E-value=7e-21 Score=152.20 Aligned_cols=67 Identities=31% Similarity=0.594 Sum_probs=62.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++|+.+|||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++...
T Consensus 5 ~~~y~~Lgv~~~a~~~eik~ayr~la~~~HpD~~~~~~~a~~~f~~i~~Ay~~L~d~~~R~~yD~~G~~~ 74 (386)
T PRK14289 5 RDYYEVLGVSKTATVDEIKKAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRSRYDQFGHAG 74 (386)
T ss_pred CCHHHHcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCChHHHHHHHHHHHHHHHhcCHHHHHHHHHhcccc
Confidence 489999999999999999999999999999999 3568899999999999999999999999998753
No 28
>PRK14300 chaperone protein DnaJ; Provisional
Probab=99.84 E-value=3.6e-21 Score=153.27 Aligned_cols=67 Identities=28% Similarity=0.553 Sum_probs=62.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++||.++||+|||++++++|||+ .+.+.++|++|++||++|+|+.+|..||.++...
T Consensus 3 ~~~y~iLgv~~~as~~eik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~~r~~yD~~G~~~ 71 (372)
T PRK14300 3 QDYYQILGVSKTASQADLKKAYLKLAKQYHPDTTDAKDAEKKFKEINAAYDVLKDEQKRAAYDRFGHDA 71 (372)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcCHHHHHHHHHHHHHHhhhHhHhhHHHhccccc
Confidence 589999999999999999999999999999999 3457889999999999999999999999988653
No 29
>PRK14290 chaperone protein DnaJ; Provisional
Probab=99.83 E-value=4.9e-21 Score=152.14 Aligned_cols=66 Identities=32% Similarity=0.608 Sum_probs=61.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCC----HHHHHHHHHHHHHHHhccChHHHHHHHHHhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKN----SEHQEMFLKINEAWNILKDEKERKLYESQLLS 66 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~ 66 (134)
+|||+||||+++||.++||+|||+|++++|||++ ..+.++|++|++||++|+||.+|..||.++..
T Consensus 3 ~d~y~vLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~~L~d~~~r~~yd~~G~~ 72 (365)
T PRK14290 3 KDYYKILGVDRNASQEDIKKAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKRRQYDQTGTV 72 (365)
T ss_pred CChhhhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchhHHHHHHHHHHHHHHHhcChhhhhhhcccCCc
Confidence 4899999999999999999999999999999993 25789999999999999999999999998875
No 30
>PRK14293 chaperone protein DnaJ; Provisional
Probab=99.83 E-value=9.8e-21 Score=150.84 Aligned_cols=126 Identities=21% Similarity=0.338 Sum_probs=86.7
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc-c--ccc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH-M--NIY 75 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~-~--~~~ 75 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|+.|++||++|+||.+|+.||.++......+ + ...
T Consensus 3 ~d~y~vLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~vL~~~~~R~~yd~~g~~g~~~~~~~~~~~ 82 (374)
T PRK14293 3 ADYYEILGVSRDADKDELKRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRARYDQFGEAGVSGAAGFPDMG 82 (374)
T ss_pred CChhhhcCCCCCCCHHHHHHHHHHHHHHHCCCCCCCcCHHHHHHHHHHHHHHHhchHHHHHHhhccccccccCCCcCCcc
Confidence 489999999999999999999999999999999 45688999999999999999999999999887532111 0 000
Q ss_pred ccc---CcccccccC--C-ce---eE-EeeCCCC----CeEEEecccccCCCCCCcee---EeCCCCc
Q psy7114 76 KSV---SLSDMEHKG--E-SQ---MF-TYPCRCG----AEFCIEEQDTEGDGSDDNIL---IACDTCS 126 (134)
Q Consensus 76 ~~~---~~~~~~~~~--~-~~---~~-~~~crcg----~~~~v~~~~l~~g~~~~~~~---v~C~~Cs 126 (134)
... ++.+.+|++ + +. .. ....+-| -...|+++|++.|......+ +.|+.|.
T Consensus 83 ~~~~~~d~f~~~fg~~~~~~~~~~~~~~~~~~kg~di~~~l~vsLee~~~G~~k~i~~~r~~~C~~C~ 150 (374)
T PRK14293 83 DMGGFADIFETFFSGFGGAGGQGGRRRRRGPQRGDDLRYDLKLDFREAIFGGEKEIRIPHLETCETCR 150 (374)
T ss_pred cccchHHHHHHHhcccCCCCCCCccccccCccCCCCeEEEEEeeHHHHhCCceEEEEeeccccCCCCC
Confidence 000 111112321 0 00 00 0011223 26778999999997665543 6777773
No 31
>KOG0716|consensus
Probab=99.83 E-value=5e-21 Score=144.08 Aligned_cols=70 Identities=36% Similarity=0.608 Sum_probs=65.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT 70 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~ 70 (134)
+++|+|||++++|+.++||++||+|++++|||+ ++++.++|++||+||+||+||.+|..||.++......
T Consensus 31 ~~LYdVLgl~k~at~d~IKKaYR~L~~k~HPD~~gd~P~~~dkf~eIN~Ay~ILsD~~kR~~YD~~g~~~l~l 103 (279)
T KOG0716|consen 31 LDLYDVLGLPKTATKDEIKKAYRKLALKYHPDKNGDNPEATDKFKEINTAYAILSDPTKRNVYDEYGELGLKL 103 (279)
T ss_pred hHHHHHhCCCcccchHHHHHHHHHHHHHhCCCcCCCCchhHHHHHHHHHHHHHhcChhhhhhHHHhhhHHHHH
Confidence 478999999999999999999999999999999 5678999999999999999999999999998876543
No 32
>PRK14299 chaperone protein DnaJ; Provisional
Probab=99.82 E-value=1.8e-20 Score=144.91 Aligned_cols=66 Identities=35% Similarity=0.626 Sum_probs=62.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLS 66 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~ 66 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++..
T Consensus 4 ~d~y~vLgv~~~a~~~eik~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~kr~~yD~~g~~ 71 (291)
T PRK14299 4 KDYYAILGVPKNASQDEIKKAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYDTYGTT 71 (291)
T ss_pred CCHHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhcCHHHHHHHHhcCCc
Confidence 489999999999999999999999999999999 367889999999999999999999999998875
No 33
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=99.81 E-value=4.5e-20 Score=112.42 Aligned_cols=60 Identities=45% Similarity=0.763 Sum_probs=56.4
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CH--HHHHHHHHHHHHHHhccChHHHHHHH
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NS--EHQEMFLKINEAWNILKDEKERKLYE 61 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~--~~~~~f~~i~~Ay~~L~d~~~R~~Yd 61 (134)
|||+||||+++++.++||++|+++++++|||+ .. .+.+.|..|++||++|++|.+|+.||
T Consensus 1 ~~y~iLgl~~~~~~~eik~~y~~l~~~~HPD~~~~~~~~~~~~~~~i~~Ay~~L~~~~~R~~YD 64 (64)
T PF00226_consen 1 NPYEILGLPPDASDEEIKKAYRRLSKQYHPDKNSGDEAEAEEKFARINEAYEILSDPERRRRYD 64 (64)
T ss_dssp HHHHHCTSTTTSSHHHHHHHHHHHHHHTSTTTGTSTHHHHHHHHHHHHHHHHHHHSHHHHHHHH
T ss_pred ChHHHCCCCCCCCHHHHHHHHHhhhhccccccchhhhhhhhHHHHHHHHHHHHhCCHHHHHhcC
Confidence 68999999999999999999999999999999 22 46799999999999999999999998
No 34
>KOG0717|consensus
Probab=99.81 E-value=1.6e-20 Score=149.67 Aligned_cols=71 Identities=34% Similarity=0.558 Sum_probs=65.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC----CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK----NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
|.||+||||..+|+..+||++||+|||++|||| .+.+.+.|+.|+.||+|||||+.|.+||.+..+.....
T Consensus 8 ~c~YE~L~v~~~a~d~eik~~YRklALq~HPDknpd~ieeat~~F~~i~aAYeVLSdp~eR~wyd~hreqil~~~ 82 (508)
T KOG0717|consen 8 RCYYEVLGVERDADDDEIKKNYRKLALQYHPDKNPDRIEEATQQFQLIQAAYEVLSDPQERAWYDSHREQILRGK 82 (508)
T ss_pred hHHHHHhcccccCCHHHHHHHHHHHHHhhCCCCCCccHHHHHHHHHHHHHHHHHhcChHhhhhHHHHHHHHhcCC
Confidence 469999999999999999999999999999999 45789999999999999999999999999998776544
No 35
>KOG0715|consensus
Probab=99.80 E-value=1.1e-19 Score=140.20 Aligned_cols=66 Identities=39% Similarity=0.663 Sum_probs=63.5
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
|||+||||+++|+..|||+||++|++++|||. .+.+.++|++|.+||++|+|+.+|..||..+...
T Consensus 44 d~Y~vLgv~~~At~~EIK~Af~~LaKkyHPD~n~~~~a~~kF~eI~~AYEiLsd~eKR~~YD~~~~~~ 111 (288)
T KOG0715|consen 44 DYYKVLGVSRNATLSEIKSAFRKLAKKYHPDVNKDKEASKKFKEISEAYEILSDEEKRQEYDVYGLEQ 111 (288)
T ss_pred chhhhhCcCCCCCHHHHHHHHHHHHHhhCCCCCCCcchhhHHHHHHHHHHHhcCHHHHHHHHHhhhhc
Confidence 79999999999999999999999999999999 6689999999999999999999999999998875
No 36
>PRK14281 chaperone protein DnaJ; Provisional
Probab=99.80 E-value=6.6e-20 Score=147.08 Aligned_cols=67 Identities=36% Similarity=0.665 Sum_probs=62.5
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||+++|+.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++...
T Consensus 3 ~d~y~iLgv~~~a~~~eikkayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~~g~~~ 72 (397)
T PRK14281 3 RDYYEVLGVSRSADKDEIKKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRRRYDQFGHAG 72 (397)
T ss_pred CChhhhcCCCCCCCHHHHHHHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhhhhhhhhhhhhccchh
Confidence 489999999999999999999999999999999 3567899999999999999999999999998754
No 37
>KOG0718|consensus
Probab=99.79 E-value=1e-19 Score=145.36 Aligned_cols=69 Identities=30% Similarity=0.544 Sum_probs=64.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC------CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQ 69 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~ 69 (134)
.++|.+|||+++||.+|||+|||++++.+|||| ...|++.|+.|.+||+||+||++|..||.+|.++..
T Consensus 9 ~e~Ya~LNlpkdAt~eeI~~AYrr~~~lfHPDkh~dpd~K~~AE~~F~~i~~AyEVLsDp~kRaIYD~~G~qGL~ 83 (546)
T KOG0718|consen 9 IELYALLNLPKDATDEEIKKAYRRLSRLFHPDKHTDPDQKKAAEEKFQRIQRAYEVLSDPQKRAIYDNYGEQGLK 83 (546)
T ss_pred hhHHHHhCCCcccCHHHHHHHHHHHHHhcCCcccCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHhhhcccc
Confidence 379999999999999999999999999999999 567899999999999999999999999999997664
No 38
>PRK10266 curved DNA-binding protein CbpA; Provisional
Probab=99.78 E-value=4e-19 Score=138.29 Aligned_cols=65 Identities=31% Similarity=0.619 Sum_probs=61.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+|||+||||+++||.++||+|||+|++++|||+ .+.+.++|++|++||++|+||.+|+.||.++.
T Consensus 4 ~d~y~~Lgv~~~a~~~eik~ayr~la~k~HPD~~~~~~~~~~f~~i~~Ay~~L~~~~kr~~yD~~g~ 70 (306)
T PRK10266 4 KDYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQLWQ 70 (306)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHhhc
Confidence 589999999999999999999999999999999 35688999999999999999999999999874
No 39
>KOG0691|consensus
Probab=99.77 E-value=9.2e-19 Score=134.93 Aligned_cols=71 Identities=31% Similarity=0.579 Sum_probs=66.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
.|||+||||+++|+..+|++||++.++++|||| +|.+.++|+.+.+||+||+|+..|..||..+.....+.
T Consensus 5 ~dyY~lLgi~~~at~~eIkKaYr~kaL~~HPDKNp~dP~A~ekFq~L~eAy~VL~D~~~R~~YDk~~k~~~~~~ 78 (296)
T KOG0691|consen 5 TDYYDLLGISEDATDAEIKKAYRKKALQYHPDKNPGDPQAAEKFQELSEAYEVLSDEESRAAYDKLRKSGSSAQ 78 (296)
T ss_pred chHHHHhCCCCCCCHHHHHHHHHHHHHhcCCCCCCCChHHHHHHHHHHHHHHHhcCHHHHHHHHHHhhhcccch
Confidence 489999999999999999999999999999999 77899999999999999999999999999999776543
No 40
>PTZ00341 Ring-infected erythrocyte surface antigen; Provisional
Probab=99.76 E-value=1.1e-18 Score=149.58 Aligned_cols=68 Identities=35% Similarity=0.509 Sum_probs=63.1
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
++||+||||+++||..+||+|||+||+++|||+ ...+.++|+.|++||++|+||.+|+.||.+|..+.
T Consensus 573 ~dYYdILGVs~dAS~~EIKKAYRKLAlkyHPDKN~~~~A~ekFq~I~EAYeVLSDp~kRk~YD~~G~~Gl 642 (1136)
T PTZ00341 573 TLFYDILGVGVNADMKEISERYFKLAENYYPPKRSGNEGFHKFKKINEAYQILGDIDKKKMYNKFGYDGI 642 (1136)
T ss_pred CChHHHcCCCCCCCHHHHHHHHHHHHHHhCCCCCCCchHHHHHHHHHHHHHHhCCHHHHHHHhhcccccc
Confidence 489999999999999999999999999999999 34578899999999999999999999999988753
No 41
>KOG0719|consensus
Probab=99.75 E-value=1.7e-18 Score=128.04 Aligned_cols=67 Identities=33% Similarity=0.597 Sum_probs=62.9
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC-----CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK-----NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk-----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|+|+||||.++|+..+||+||+++++++|||+ ...+..+|+.|+.||+||+|.++|+.||..|.-.
T Consensus 14 ~d~YevLGVer~a~~~eIrkAY~klal~~HPDk~~eed~~ea~~kFq~l~k~y~iLsDeekR~~YDetG~id 85 (264)
T KOG0719|consen 14 KDLYEVLGVERDATDKEIRKAYHKLALRLHPDKNHEEDKVEATEKFQQLQKAYQILSDEEKRAVYDETGSID 85 (264)
T ss_pred cCHHHHhhhcccCCHHHHHHHHHHHHHHhCCCcchhhhHHHHHHHHHHHHHHHHHhhHHHHHHHHhccCCCC
Confidence 589999999999999999999999999999999 4578899999999999999999999999988754
No 42
>KOG0721|consensus
Probab=99.73 E-value=5.1e-18 Score=124.34 Aligned_cols=70 Identities=33% Similarity=0.547 Sum_probs=65.0
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCCC---HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN---SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~---~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
|||+||||+|+||..|||+|||+|++++||||. .+.++.|..|.+||+.|+|+..|.+|..++.....++
T Consensus 100 DPyEILGl~pgas~~eIKkaYR~LSik~HPDK~~~~~~~e~~~~~I~KAY~aLTD~~sreN~ekYG~PDGpq~ 172 (230)
T KOG0721|consen 100 DPYEILGLDPGASEKEIKKAYRRLSIKYHPDKQPPEEGDEEFFEAIAKAYQALTDKKSRENWEKYGNPDGPQA 172 (230)
T ss_pred CcHHhhCCCCCCCHHHHHHHHHHhhhhhCCCcCCCcchhHHHHHHHHHHHHHhcchhhHHHHHHhCCCCCccc
Confidence 899999999999999999999999999999993 6788899999999999999999999999999876543
No 43
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=99.72 E-value=7.6e-18 Score=101.10 Aligned_cols=55 Identities=47% Similarity=0.821 Sum_probs=51.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCCCH----HHHHHHHHHHHHHHhccChH
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDKNS----EHQEMFLKINEAWNILKDEK 55 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~~----~~~~~f~~i~~Ay~~L~d~~ 55 (134)
+|||+||||+++++.++||++|+++++.+|||+.. .+.+.|..|++||++|+||.
T Consensus 1 ~~~y~vLgl~~~~~~~~ik~ay~~l~~~~HPD~~~~~~~~~~~~~~~l~~Ay~~L~~~~ 59 (60)
T smart00271 1 TDYYEILGVPRDASLDEIKKAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDPE 59 (60)
T ss_pred CCHHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHHHcCCC
Confidence 58999999999999999999999999999999933 58899999999999999984
No 44
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=99.71 E-value=2.2e-17 Score=97.33 Aligned_cols=52 Identities=48% Similarity=0.818 Sum_probs=49.2
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCCC---HHHHHHHHHHHHHHHhccC
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN---SEHQEMFLKINEAWNILKD 53 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~---~~~~~~f~~i~~Ay~~L~d 53 (134)
|||+||||+++++.++||++|+++++++|||+. ..+.+.|..|++||++|+|
T Consensus 1 ~~y~vLgl~~~~~~~~ik~~y~~l~~~~HPD~~~~~~~~~~~~~~l~~Ay~~L~d 55 (55)
T cd06257 1 DYYDILGVPPDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLSD 55 (55)
T ss_pred ChHHHcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhcC
Confidence 799999999999999999999999999999993 5689999999999999986
No 45
>PHA03102 Small T antigen; Reviewed
Probab=99.69 E-value=1e-17 Score=118.16 Aligned_cols=68 Identities=15% Similarity=0.273 Sum_probs=61.2
Q ss_pred CccccccCCCCC--CHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhc
Q psy7114 2 NHFEILNCNVNS--TLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT 70 (134)
Q Consensus 2 ~~Y~vLgv~~~a--s~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~ 70 (134)
..|+||||+++| |.++||+|||++++++||||.+ +.++|++|++||++|+|+.+|..||.++......
T Consensus 6 ~l~~vLGl~~~A~~s~~eIKkAYr~la~~~HPDkgg-~~e~~k~in~Ay~~L~d~~~r~~yd~~g~~~~~~ 75 (153)
T PHA03102 6 ELMDLLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGG-DEEKMKELNTLYKKFRESVKSLRDLDGEEDSSSE 75 (153)
T ss_pred HHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCc-hhHHHHHHHHHHHHHhhHHHhccccccCCccccc
Confidence 368999999999 9999999999999999999953 4679999999999999999999999998765543
No 46
>TIGR03835 termin_org_DnaJ terminal organelle assembly protein TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ homolog and probable assembly protein of the Mycoplasma terminal organelle. The terminal organelle is involved in both cytadherence and gliding motility.
Probab=99.69 E-value=2.8e-17 Score=138.22 Aligned_cols=68 Identities=29% Similarity=0.528 Sum_probs=63.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
+|||+||||+++|+.++||++||+|++++|||+ .+.+.++|+.|++||++|+||.+|..||.++....
T Consensus 2 ~DYYeVLGVs~dAS~eEIKKAYRKLAKKyHPDKn~~~eAeekFqeINEAYEVLSDP~KRa~YD~fG~aG~ 71 (871)
T TIGR03835 2 RDYYEVLGIDRDADEQEIKKAFRKLAKKYHPDRNKAPDAASIFAEINEANDVLSNPKKRANYDKYGHDGV 71 (871)
T ss_pred CChhHhcCCCCCCCHHHHHHHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhCCHHHHHHHhhhccccc
Confidence 589999999999999999999999999999999 45677899999999999999999999999887543
No 47
>KOG2923|consensus
Probab=99.68 E-value=2.6e-17 Score=98.17 Aligned_cols=56 Identities=34% Similarity=0.764 Sum_probs=52.9
Q ss_pred cccccCcccccccCCceeEEeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEec
Q psy7114 74 IYKSVSLSDMEHKGESQMFTYPCRCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~crcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~ 133 (134)
.++.+.+++|.|+.+.+.|.+||+||+.|.|+.++|..| ..++.|++|||.|.|++
T Consensus 3 ~ydeVeiedfe~~~e~~~y~yPCpCGDrf~It~edL~~g----e~Va~CpsCSL~I~ViY 58 (67)
T KOG2923|consen 3 FYDEVEIEDFEFDEENQTYYYPCPCGDRFQITLEDLENG----EDVARCPSCSLIIRVIY 58 (67)
T ss_pred ccceEEeecceeccCCCeEEcCCCCCCeeeecHHHHhCC----CeeecCCCceEEEEEEe
Confidence 467889999999999999999999999999999999999 99999999999999975
No 48
>KOG0720|consensus
Probab=99.68 E-value=8.1e-18 Score=134.34 Aligned_cols=69 Identities=25% Similarity=0.488 Sum_probs=65.4
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQ 69 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~ 69 (134)
+|+|.+|||+++++.++||+.||++|...|||| .+.|.+.|+.|+.||++|+|+.+|+.||..+..+..
T Consensus 235 ~daYsvlGl~~d~sd~~lKk~Yrk~A~LVhPDKn~~~~A~Eafk~Lq~Afevig~~~kR~eYd~e~~kene 305 (490)
T KOG0720|consen 235 LDAYSALGLPSDCSDADLKKNYRKKAMLVHPDKNMIPRAEEAFKKLQVAFEVIGDSVKRKEYDLELKKENE 305 (490)
T ss_pred CCchhhcCCCCCCCHHHHHHHHHhhceEeCCCccCChhHHHHHHHHHHHHHHhcchhhhhHHHHHHHHHHH
Confidence 589999999999999999999999999999999 889999999999999999999999999988886654
No 49
>PF05207 zf-CSL: CSL zinc finger; InterPro: IPR007872 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a probable zinc binding motif that contains four cysteines and may chelate zinc, known as the DPH-type after the diphthamide (DPH) biosynthesis protein in which it was first characterised, including the proteins DPH3 and DPH4. This domain is also found associated with N-terminal domain of heat shock protein DnaJ IPR001623 from INTERPRO domain. Diphthamide is a unique post-translationally modified histidine residue found only in translation elongation factor 2 (eEF-2). It is conserved from archaea to humans and serves as the target for diphteria toxin and Pseudomonas exotoxin A. These two toxins catalyse the transfer of ADP-ribose to diphtamide on eEF-2, thus inactivating eEF-2, halting cellular protein synthesis, and causing cell death []. The biosynthesis of diphtamide is dependent on at least five proteins, DPH1 to -5, and a still unidentified amidating enzyme. DPH3 and DPH4 share a conserved region, which encode a putative zinc finger, the DPH-type or CSL-type (after the conserved motif of the final cysteine) zinc finger [, ]. The function of this motif is unknown. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 2L6L_A 1WGE_A 2JR7_A 1YOP_A 1YWS_A.
Probab=99.67 E-value=3.6e-17 Score=96.97 Aligned_cols=53 Identities=45% Similarity=0.981 Sum_probs=47.0
Q ss_pred ccCcccccccCCceeEEeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEec
Q psy7114 77 SVSLSDMEHKGESQMFTYPCRCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 77 ~~~~~~~~~~~~~~~~~~~crcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~ 133 (134)
.+++++|.+++....|.++||||+.|.|+.+++..+ ..+|+|++|||||+|..
T Consensus 2 ev~l~d~~~~~~~~~~~y~CRCG~~f~i~e~~l~~~----~~iv~C~sCSL~I~V~~ 54 (55)
T PF05207_consen 2 EVSLDDMEFDEEEGVYSYPCRCGGEFEISEEDLEEG----EVIVQCDSCSLWIRVNY 54 (55)
T ss_dssp EEETTTSEEETTTTEEEEEETTSSEEEEEHHHHHCT------EEEETTTTEEEEEE-
T ss_pred eEEhhhceecCCCCEEEEcCCCCCEEEEcchhccCc----CEEEECCCCccEEEEEe
Confidence 467899999988889999999999999999999998 89999999999999975
No 50
>PRK05014 hscB co-chaperone HscB; Provisional
Probab=99.67 E-value=5.5e-17 Score=116.82 Aligned_cols=67 Identities=24% Similarity=0.358 Sum_probs=58.8
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHhCCCC----CH----HHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK----NS----EHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~--as~~~Ik~ayr~l~~~~HPDk----~~----~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||+||||++. ++..+|+++|+++++++|||+ .+ .+.+.|..|++||++|+||.+|..|+..+.+.
T Consensus 1 ~~yf~llgl~~~~~~d~~~L~~~yr~l~~~~HPDk~~~~~~~~~~~a~~~s~~iN~AY~~L~dp~~Ra~Yll~l~g~ 77 (171)
T PRK05014 1 MDYFTLFGLPARYDIDTQLLASRYQELQRQFHPDKFANASERERLLAVQQAATINDAYQTLKHPLKRAEYLLSLHGF 77 (171)
T ss_pred CCHHHHCCCCCCCCCCHHHHHHHHHHHHHHhCcCCCCCCcHHHHHHHHHHHHHHHHHHHHHCChhHHHHHHHHhcCC
Confidence 699999999996 678999999999999999999 11 24668999999999999999999999776643
No 51
>COG2214 CbpA DnaJ-class molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=99.67 E-value=1.4e-16 Score=115.97 Aligned_cols=64 Identities=44% Similarity=0.735 Sum_probs=59.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC---CH-HHHHHHHHHHHHHHhccChHHHHHHHHHh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK---NS-EHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk---~~-~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
++||+||||+++|+..+||++||++++++|||+ .+ .+.+.|+.|++||++|+|+..|..||..+
T Consensus 6 ~~~y~iLgv~~~as~~eik~ayrkla~~~HPD~~~~~~~~a~~~f~~i~~Ay~vLsd~~~r~~yd~~~ 73 (237)
T COG2214 6 LDYYEILGVPPNASLEEIKKAYRKLALKYHPDRNPGDPKVAEEKFKEINEAYEILSDPERRAEYDKIG 73 (237)
T ss_pred hhHHHHhCCCCCCCHHHHHHHHHHHHHHhCCCCCCCchhHHHHHHHHHHHHHHHhhCHHHHHHhhhhc
Confidence 589999999999999999999999999999999 33 27899999999999999999999999874
No 52
>PRK01356 hscB co-chaperone HscB; Provisional
Probab=99.63 E-value=7.1e-16 Score=110.56 Aligned_cols=67 Identities=28% Similarity=0.443 Sum_probs=57.8
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHhCCCCCH------HHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDKNS------EHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 1 ~~~Y~vLgv~~~--as~~~Ik~ayr~l~~~~HPDk~~------~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
+|||++|||++. ++..+|+++|+++++++|||+.. .+...+..|++||++|+||.+|..|+..+...
T Consensus 2 ~~yf~llgl~~~f~id~~~L~~aYr~lq~~~HPDk~~~~~~k~~~~~~s~~in~AY~~L~dp~~Ra~YlL~l~g~ 76 (166)
T PRK01356 2 QNYFQLLGLPQEYNIDLKILEKQYFAMQVKYHPDKAKTLQEKEQNLIIASELNNAYSTLKDALKRAEYMLLLQNI 76 (166)
T ss_pred CCHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCCCHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHccCC
Confidence 489999999996 78999999999999999999921 22345789999999999999999998887643
No 53
>PRK00294 hscB co-chaperone HscB; Provisional
Probab=99.63 E-value=4.3e-16 Score=112.28 Aligned_cols=68 Identities=25% Similarity=0.403 Sum_probs=59.5
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHhCCCC---C-----HHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhh
Q psy7114 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK---N-----SEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~vLgv~~~--as~~~Ik~ayr~l~~~~HPDk---~-----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
+|||++|||++. .+..+|+++|+++++++|||+ . ..+.+.+..||+||++|+||.+|..|+..+.+..
T Consensus 4 ~~~F~l~~l~~~f~id~~~L~~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~IN~AY~~L~~p~~Ra~YlL~l~g~~ 81 (173)
T PRK00294 4 PCHFALFDLQPSFRLDLDQLATRYRELAREVHPDRFADAPEREQRLALERSASLNEAYQTLKSPPRRARYLLALSGHE 81 (173)
T ss_pred CChhhhcCcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCCcHHHHHHHHHHHHHHHHHHHHhCChhhhHHHHHHhcCCC
Confidence 589999999997 567999999999999999999 1 1256789999999999999999999998877543
No 54
>PRK03578 hscB co-chaperone HscB; Provisional
Probab=99.61 E-value=5.7e-16 Score=111.96 Aligned_cols=68 Identities=21% Similarity=0.382 Sum_probs=57.9
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHhCCCC---CHH-----HHHHHHHHHHHHHhccChHHHHHHHHHhhhhh
Q psy7114 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK---NSE-----HQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~vLgv~~~--as~~~Ik~ayr~l~~~~HPDk---~~~-----~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
+|||+||||++. ++..+|+++|+++++++|||+ .+. +.+.+..||+||++|+||.+|..|...+.+..
T Consensus 6 ~dyf~llglp~~f~~d~~~L~~~yr~lq~~~HPD~~~~~~~~e~~~a~~~s~~iN~AY~tL~~p~~Ra~Yll~l~G~~ 83 (176)
T PRK03578 6 DDHFSLFGLPARFALDEAALDAAYRTVQAQVHPDRFAAAGDAEKRVAMQWATRANEAYQTLRDPLKRARYLLHLRGVD 83 (176)
T ss_pred CCHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCCCCCHHHHHHHHHHHHHHHHHHHHhCChhhHHHHHHHhcCCC
Confidence 589999999995 678999999999999999999 222 34446899999999999999999998766443
No 55
>KOG0624|consensus
Probab=99.55 E-value=4.8e-15 Score=116.08 Aligned_cols=66 Identities=26% Similarity=0.519 Sum_probs=61.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC------CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK------NSEHQEMFLKINEAWNILKDEKERKLYESQLLS 66 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~ 66 (134)
||||.||||.++|+..||.+|||+++.++|||- ...++++|.-|..|-+||+||++|+.+|..-+.
T Consensus 394 RDYYKILGVkRnAsKqEI~KAYRKlAqkWHPDNFqdEeEKKkAEKKFIDIAAAKEVLsd~EkRrqFDnGeDP 465 (504)
T KOG0624|consen 394 RDYYKILGVKRNASKQEITKAYRKLAQKWHPDNFQDEEEKKKAEKKFIDIAAAKEVLSDPEKRRQFDNGEDP 465 (504)
T ss_pred chHHHHhhhcccccHHHHHHHHHHHHHhcCCccccCHHHHHHHHHhhhhHHHHHHhhcCHHHHhhccCCCCC
Confidence 699999999999999999999999999999998 456889999999999999999999999976553
No 56
>COG5216 Uncharacterized conserved protein [Function unknown]
Probab=99.53 E-value=1.6e-14 Score=85.10 Aligned_cols=56 Identities=30% Similarity=0.760 Sum_probs=52.3
Q ss_pred cccccCcccccccCCceeEEeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEec
Q psy7114 74 IYKSVSLSDMEHKGESQMFTYPCRCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~crcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~ 133 (134)
.++.+.+.+|.|..+...|.+||.||+.|.|+++|+..| ..+..|++|||.|.|..
T Consensus 3 fYDEieiedftf~~e~~~ftyPCPCGDRFeIsLeDl~~G----E~VArCPSCSLiv~vvy 58 (67)
T COG5216 3 FYDEIEIEDFTFSREEKTFTYPCPCGDRFEISLEDLRNG----EVVARCPSCSLIVCVVY 58 (67)
T ss_pred ccceeEeeeeEEcCCCceEEecCCCCCEeEEEHHHhhCC----ceEEEcCCceEEEEEEe
Confidence 467788999999999999999999999999999999999 99999999999999864
No 57
>KOG0550|consensus
Probab=99.52 E-value=1.3e-14 Score=115.34 Aligned_cols=71 Identities=31% Similarity=0.487 Sum_probs=64.6
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC----CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK----NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
++||.|||+..+|+..+||++||++++.+|||+ ..+++.+|++|.+||.+|+||.+|..||....-....+
T Consensus 373 kd~ykilGi~~~as~~eikkayrk~AL~~Hpd~~agsq~eaE~kFkevgeAy~il~d~~kr~r~dsg~dle~~~~ 447 (486)
T KOG0550|consen 373 KDWYKILGISRNASDDEIKKAYRKLALVHHPDKNAGSQKEAEAKFKEVGEAYTILSDPMKRVRFDSGQDLEEVGS 447 (486)
T ss_pred hhHHHHhhhhhhcccchhhhHHHHHHHHhCCCcCcchhHHHHHHHHHHHHHHHHhcCHHHHhhcccccchhhhcC
Confidence 589999999999999999999999999999999 36789999999999999999999999998777655444
No 58
>KOG0722|consensus
Probab=99.49 E-value=1.6e-14 Score=108.45 Aligned_cols=68 Identities=31% Similarity=0.588 Sum_probs=62.3
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQ 68 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~ 68 (134)
.|.|+||||+++|+..+|.+|||+|++++|||+ ++.+.+.|+.|..||++|.|...|..||-.++...
T Consensus 33 enCYdVLgV~Rea~KseIakAYRqLARrhHPDr~r~~e~k~~F~~iAtayeilkd~e~rt~ydyaldhpd 102 (329)
T KOG0722|consen 33 ENCYDVLGVAREANKSEIAKAYRQLARRHHPDRNRDPESKKLFVKIATAYEILKDNETRTQYDYALDHPD 102 (329)
T ss_pred hhHHHHhhhhhhccHHHHHHHHHHHHHHhCCcccCCchhhhhhhhhhcccccccchhhHHhHHHHhcCch
Confidence 378999999999999999999999999999999 66667899999999999999999999998877543
No 59
>COG5407 SEC63 Preprotein translocase subunit Sec63 [Intracellular trafficking and secretion]
Probab=99.46 E-value=5.2e-14 Score=112.77 Aligned_cols=70 Identities=33% Similarity=0.593 Sum_probs=64.8
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCC--------CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhcc
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK--------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQTH 71 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~~ 71 (134)
|+|||||+..+++..+||++||+|+.++|||| ...-++.+++|++||..|+|...|+.|-.+|....++.
T Consensus 99 DPyEILGI~~~ts~rdik~~yr~Ls~KfhpdK~~~mvn~~rse~Ee~y~~ItkAY~~lTd~k~renyl~yGtPd~pQh 176 (610)
T COG5407 99 DPYEILGIDQDTSERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTDKKRRENYLNYGTPDSPQH 176 (610)
T ss_pred ChHHhhcccCCCcHHHHHHHHHhheeecChhhcCCCChhHHHHHHHHHHHHHHHHHhhhhHHHHHHHHhcCCCCCCcc
Confidence 79999999999999999999999999999999 34668899999999999999999999999998877654
No 60
>PTZ00100 DnaJ chaperone protein; Provisional
Probab=99.46 E-value=4.7e-14 Score=95.13 Aligned_cols=50 Identities=34% Similarity=0.518 Sum_probs=45.9
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhcc
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILK 52 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~ 52 (134)
++|+||||+++||.++||++||++++++|||+. ++...|.+|++||++|.
T Consensus 66 eAy~ILGv~~~As~~eIkkaYRrLa~~~HPDkg-Gs~~~~~kIneAyevL~ 115 (116)
T PTZ00100 66 EAYKILNISPTASKERIREAHKQLMLRNHPDNG-GSTYIASKVNEAKDLLL 115 (116)
T ss_pred HHHHHcCCCCCCCHHHHHHHHHHHHHHhCCCCC-CCHHHHHHHHHHHHHHh
Confidence 579999999999999999999999999999984 45678999999999985
No 61
>PRK09430 djlA Dna-J like membrane chaperone protein; Provisional
Probab=99.41 E-value=2.2e-13 Score=104.40 Aligned_cols=52 Identities=27% Similarity=0.491 Sum_probs=48.1
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCCC----------HHHHHHHHHHHHHHHhccC
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDKN----------SEHQEMFLKINEAWNILKD 53 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~----------~~~~~~f~~i~~Ay~~L~d 53 (134)
++|+||||++++|.++||++||+|++++|||+. +.+.++|++|++||++|+.
T Consensus 201 ~ay~vLgv~~~as~~eIk~aYr~L~~~~HPDk~~~~g~~~~~~~~a~ek~~~I~~AYe~L~~ 262 (267)
T PRK09430 201 DAYKVLGVSESDDDQEIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKEKAQEIQAAYELIKK 262 (267)
T ss_pred hHHHHcCCCCCCCHHHHHHHHHHHHHHhCcCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Confidence 689999999999999999999999999999991 3578999999999999985
No 62
>KOG0714|consensus
Probab=99.39 E-value=2e-13 Score=103.60 Aligned_cols=65 Identities=37% Similarity=0.633 Sum_probs=59.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CH--HHHHHHHHHHHHHHhccChHHHHHHHHHhh
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NS--EHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~--~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
.++|.||||.+.|+.++|++||+++++++|||+ .+ .+..+|++|.+||++|+|+.+|..||.++.
T Consensus 3 ~d~~~~l~i~~~as~~~i~ka~~~~a~~~hpdk~~~~~~~~~~~~~~~~ea~~~ls~~~kr~~~d~~~~ 71 (306)
T KOG0714|consen 3 KDYYKILGIARSASEEDIKKAYRKLALKYHPDKNPSPKEVAEAKFKEIAEAYEVLSDPKKRKIYDQYGE 71 (306)
T ss_pred ccHHHHhCccccccHHHHHHHHHHHHHhhCCCCCCCchhhHHHHHhhhhccccccCCHHHhhhccccCc
Confidence 489999999999999999999999999999999 22 455689999999999999999999999998
No 63
>PHA02624 large T antigen; Provisional
Probab=99.39 E-value=2.5e-13 Score=113.11 Aligned_cols=58 Identities=19% Similarity=0.377 Sum_probs=54.5
Q ss_pred CccccccCCCCC--CHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHH
Q psy7114 2 NHFEILNCNVNS--TLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLY 60 (134)
Q Consensus 2 ~~Y~vLgv~~~a--s~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Y 60 (134)
++|++|||+++| +.++||+|||++++++|||+. ++.++|++|++||++|+|+.+|..|
T Consensus 12 elyelLGL~~~A~gs~~eIKkAYRkLAkkyHPDKg-Gdeekfk~Ln~AYevL~d~~k~~r~ 71 (647)
T PHA02624 12 ELMDLLGLPMAAWGNLPLMRKAYLRKCKEYHPDKG-GDEEKMKRLNSLYKKLQEGVKSARQ 71 (647)
T ss_pred HHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCC-CcHHHHHHHHHHHHHHhcHHHhhhc
Confidence 589999999999 999999999999999999984 4568999999999999999999998
No 64
>PRK01773 hscB co-chaperone HscB; Provisional
Probab=99.36 E-value=1.8e-12 Score=93.51 Aligned_cols=65 Identities=26% Similarity=0.280 Sum_probs=57.8
Q ss_pred CCccccccCCCC--CCHHHHHHHHHHHHHHhCCCC----C----HHHHHHHHHHHHHHHhccChHHHHHHHHHhh
Q psy7114 1 MNHFEILNCNVN--STLEEIKENYKSLILKYHPDK----N----SEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 1 ~~~Y~vLgv~~~--as~~~Ik~ayr~l~~~~HPDk----~----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+|||++|||++. .+...++++|+++.+++|||+ . ..+.+....||+||.+|+||.+|..|-..+.
T Consensus 2 ~nyF~lf~lp~~F~iD~~~L~~~y~~Lq~~~HPD~f~~~~~~eq~~a~~~ss~iN~AY~tLkdPl~RA~YLL~L~ 76 (173)
T PRK01773 2 NNPFALFDLPVDFQLDNALLSERYLALQKSLHPDNFANSSAQEQRLAMQKSAEVNDALQILKDPILRAEAIIALN 76 (173)
T ss_pred CChHHhcCCCCCCCCCHHHHHHHHHHHHHHhCcCcccCCCHHHHHHHHHHHHHHHHHHHHHCChHHHHHHHHHhc
Confidence 589999999996 789999999999999999999 1 2345567899999999999999999988776
No 65
>TIGR00714 hscB Fe-S protein assembly co-chaperone HscB. This model describes the small subunit, Hsc20 (20K heat shock cognate protein) of a pair of proteins Hsc66-Hsc20, related to the DnaK-DnaJ heat shock proteins, which also serve as molecular chaperones. Hsc20, unlike DnaJ, appears not to have chaperone activity on its own, but to act solely as a regulatory subunit for Hsc66 (i.e., to be a co-chaperone). The gene for Hsc20 in E. coli, hscB, is not induced by heat shock.
Probab=99.22 E-value=1.5e-11 Score=87.51 Aligned_cols=55 Identities=27% Similarity=0.367 Sum_probs=48.1
Q ss_pred CCHHHHHHHHHHHHHHhCCCC--------CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhh
Q psy7114 13 STLEEIKENYKSLILKYHPDK--------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQ 67 (134)
Q Consensus 13 as~~~Ik~ayr~l~~~~HPDk--------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~ 67 (134)
.+..+|+++|+++++++|||+ ...+...+..||+||++|+||.+|..|+..+.+.
T Consensus 3 iD~~~L~~~yr~lq~~~HPD~~~~~~~~~~~~a~~~s~~iN~AY~~L~~p~~Ra~ylL~l~g~ 65 (157)
T TIGR00714 3 LDTQALSLRYQDLQRQYHPDKFASGSAQEQLAAVQQSTTLNQAYQTLKDPLMRAEYMLSLHGI 65 (157)
T ss_pred CCHHHHHHHHHHHHHHHCcCCCCCCChhhhHHHHHHHHHHHHHHHHhCChhhhHHHHHHhcCC
Confidence 467899999999999999998 1225678999999999999999999999888743
No 66
>KOG1150|consensus
Probab=99.12 E-value=1.4e-10 Score=84.63 Aligned_cols=60 Identities=35% Similarity=0.553 Sum_probs=55.1
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCC----CHHHHHHHHHHHHHHHhccChHHHHHHH
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK----NSEHQEMFLKINEAWNILKDEKERKLYE 61 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd 61 (134)
|+|+||.|.|..+.++||+.||+|++..|||| .+.+...|-.|.+||..|-|+..|..-+
T Consensus 54 NpfeVLqIdpev~~edikkryRklSilVHPDKN~Dd~~rAqkAFdivkKA~k~l~n~~~rkr~~ 117 (250)
T KOG1150|consen 54 NPFEVLQIDPEVTDEDIKKRYRKLSILVHPDKNPDDAERAQKAFDIVKKAYKLLENDKIRKRCL 117 (250)
T ss_pred ChHHHHhcCCCCCHHHHHHHHHhhheeecCCCCcccHHHHHHHHHHHHHHHHHHhCHHHHHHHH
Confidence 79999999999999999999999999999999 4678899999999999999998766544
No 67
>COG5269 ZUO1 Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones]
Probab=99.03 E-value=1.5e-10 Score=87.99 Aligned_cols=69 Identities=29% Similarity=0.435 Sum_probs=60.3
Q ss_pred CccccccCCC---CCCHHHHHHHHHHHHHHhCCCC-----CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhc
Q psy7114 2 NHFEILNCNV---NSTLEEIKENYKSLILKYHPDK-----NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT 70 (134)
Q Consensus 2 ~~Y~vLgv~~---~as~~~Ik~ayr~l~~~~HPDk-----~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~ 70 (134)
|+|.+|||+. .|++.+|.++.++.+.+||||+ +.+....|+.|+.||++|+|+.+|..||.-......+
T Consensus 44 DlYa~lgLskyR~ka~~~qi~kah~kkv~kyHPDk~aa~g~~~~d~fFk~iqkA~evL~D~~~R~qyDS~df~advp 120 (379)
T COG5269 44 DLYALLGLSKYRTKAIPPQILKAHKKKVYKYHPDKTAAGGNKGCDEFFKLIQKAREVLGDRKLRLQYDSNDFDADVP 120 (379)
T ss_pred hHHHHhchHhhhcCCCcHHHHHHHHHHHHHhCccchhccCCCCcHHHHHHHHHHHHHhccHHHHhhccccccccCCC
Confidence 6899999988 5788999999999999999999 3566788999999999999999999999776554443
No 68
>KOG0568|consensus
Probab=98.63 E-value=2.4e-08 Score=74.70 Aligned_cols=53 Identities=28% Similarity=0.622 Sum_probs=47.8
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHH-hccC
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWN-ILKD 53 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~-~L~d 53 (134)
|.+|.+|||..+|+.++++.+|..|++++|||. .+.+.++|..|.+||. +|+.
T Consensus 47 ~e~fril~v~e~~~adevr~af~~lakq~hpdsgs~~adaa~f~qideafrkvlq~ 102 (342)
T KOG0568|consen 47 MECFRILGVEEGADADEVREAFHDLAKQVHPDSGSEEADAARFIQIDEAFRKVLQE 102 (342)
T ss_pred HHHHHHhcccccCchhHHHHHHHHHHHHcCCCCCCccccHHHHHHHHHHHHHHHHH
Confidence 458999999999999999999999999999999 5567889999999998 6664
No 69
>KOG1789|consensus
Probab=98.53 E-value=1.1e-07 Score=83.33 Aligned_cols=51 Identities=39% Similarity=0.694 Sum_probs=44.7
Q ss_pred ccccccCCCC----CCHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccC
Q psy7114 3 HFEILNCNVN----STLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKD 53 (134)
Q Consensus 3 ~Y~vLgv~~~----as~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d 53 (134)
-|+||.|+-+ ...+.||++|++|+.+|||||+++..++|..|++||+.|..
T Consensus 1283 A~eiL~i~l~n~~hD~~~KirrqY~kLA~kYHPDKNPEGRemFe~VnKAYE~L~~ 1337 (2235)
T KOG1789|consen 1283 AREILSVDLTNEEHDKPAKIRRQYYKLAAKYHPDKNPEGREMFERVNKAYELLSS 1337 (2235)
T ss_pred HHHHhccccCCCCcccHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHHHHH
Confidence 3788887663 34588999999999999999999999999999999999983
No 70
>KOG0723|consensus
Probab=98.50 E-value=2.6e-07 Score=61.10 Aligned_cols=47 Identities=32% Similarity=0.443 Sum_probs=40.5
Q ss_pred ccccCCCCCCHHHHHHHHHHHHHHhCCCC--CHHHHHHHHHHHHHHHhccCh
Q psy7114 5 EILNCNVNSTLEEIKENYKSLILKYHPDK--NSEHQEMFLKINEAWNILKDE 54 (134)
Q Consensus 5 ~vLgv~~~as~~~Ik~ayr~l~~~~HPDk--~~~~~~~f~~i~~Ay~~L~d~ 54 (134)
.||||++.++.+.||+++|++.+.+|||+ .+... .+||||+++|...
T Consensus 60 lIL~v~~s~~k~KikeaHrriM~~NHPD~GGSPYlA---sKINEAKdlLe~~ 108 (112)
T KOG0723|consen 60 LILGVTPSLDKDKIKEAHRRIMLANHPDRGGSPYLA---SKINEAKDLLEGT 108 (112)
T ss_pred HHhCCCccccHHHHHHHHHHHHHcCCCcCCCCHHHH---HHHHHHHHHHhcc
Confidence 48999999999999999999999999998 44433 3899999998753
No 71
>KOG3192|consensus
Probab=97.98 E-value=1.3e-05 Score=56.62 Aligned_cols=70 Identities=30% Similarity=0.499 Sum_probs=56.3
Q ss_pred CCccccccCCC--CCCHHHHHHHHHHHHHHhCCCC--------CHHHHHHHHHHHHHHHhccChHHHHHHHHHhhhhhhc
Q psy7114 1 MNHFEILNCNV--NSTLEEIKENYKSLILKYHPDK--------NSEHQEMFLKINEAWNILKDEKERKLYESQLLSQQQT 70 (134)
Q Consensus 1 ~~~Y~vLgv~~--~as~~~Ik~ayr~l~~~~HPDk--------~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~~~~~~ 70 (134)
++||.++|... -..++.++..|.-...++|||+ ...+.+...+|++||.+|.||-.|+.|-..+....+.
T Consensus 8 ~~ff~~Fg~e~~~~~~p~~l~~~~~~~skkL~~d~~~~~~~~~~d~a~eqSa~lnkAY~TLk~pL~RA~Yilkl~g~e~~ 87 (168)
T KOG3192|consen 8 SRFFDIFGMELSFKIDPDKLKEKYTDISKKLHPDRPGLSFAGDTDQASEQSAELNKAYDTLKDPLARARYLLKLKGQEQT 87 (168)
T ss_pred HHHHHHhccccCCCCCcchhhHHHHHHHHhhCcccccccccccchhHHHHHHHHHHHHHHHHhHHHHHHHHHHHhCCCCc
Confidence 46899997655 4466777778999999999999 3356677889999999999999999998877765443
No 72
>COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
Probab=97.47 E-value=9.1e-05 Score=53.43 Aligned_cols=50 Identities=26% Similarity=0.535 Sum_probs=43.4
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCC----------CHHHHHHHHHHHHHHHhc
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK----------NSEHQEMFLKINEAWNIL 51 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk----------~~~~~~~f~~i~~Ay~~L 51 (134)
+.|.+||+...+...+|+++|+++....|||+ -....++++.|++||+.+
T Consensus 114 ~~l~~l~~~~~~~~~~i~~~~r~l~~e~~~d~a~~~~~~~e~~~~~~~~~~~i~~a~~~~ 173 (174)
T COG1076 114 DALKVLGVEIKADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYEDI 173 (174)
T ss_pred hHHHHhcCchhhhHHHHHHHHHHHHHhcCHHHHHHhcCCHHHHHHHHHHHHHHHHHHHhc
Confidence 56899999999999999999999999999999 345677788888888753
No 73
>KOG0431|consensus
Probab=97.30 E-value=0.00027 Score=58.15 Aligned_cols=44 Identities=30% Similarity=0.502 Sum_probs=32.6
Q ss_pred ccCCCCCCHHHHHHHHHHHHHHhCCCC----------CHHHHHHHHHHHHHHHh
Q psy7114 7 LNCNVNSTLEEIKENYKSLILKYHPDK----------NSEHQEMFLKINEAWNI 50 (134)
Q Consensus 7 Lgv~~~as~~~Ik~ayr~l~~~~HPDk----------~~~~~~~f~~i~~Ay~~ 50 (134)
+++..-.+.++||++|||..|..|||| .-.+++.|-.+++||..
T Consensus 394 VsltDLVtp~~VKKaYrKA~L~VHPDKlqq~gas~~qK~Iaekvfd~l~eawn~ 447 (453)
T KOG0431|consen 394 VSLTDLVTPAQVKKAYRKAVLCVHPDKLQQKGASLEQKYIAEKVFDALSEAWNK 447 (453)
T ss_pred CchhhccCHHHHHHHHHhhhheeCcccccCCcccHHHHHHHHHHHHHHHHHHHh
Confidence 345556789999999999999999999 12345556666666653
No 74
>COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
Probab=96.97 E-value=0.00062 Score=49.09 Aligned_cols=64 Identities=27% Similarity=0.403 Sum_probs=51.3
Q ss_pred CccccccCCCCCC--HHHHHHHHHHHHHHhCCCC----C----HHHHHHHHHHHHHHHhccChHHHHHHHHHhh
Q psy7114 2 NHFEILNCNVNST--LEEIKENYKSLILKYHPDK----N----SEHQEMFLKINEAWNILKDEKERKLYESQLL 65 (134)
Q Consensus 2 ~~Y~vLgv~~~as--~~~Ik~ayr~l~~~~HPDk----~----~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~~ 65 (134)
+++..+|..+.+. .+.++..|+.+.+.+|||+ . ..+...+..++.||.+|.+|-+|..|-....
T Consensus 2 ~~~~~~~~~~~f~~~~~~l~~~~~~~~~~~~~dr~~~~~~~~~~~~l~~~~~~~~a~~tLk~~l~ra~~~lal~ 75 (174)
T COG1076 2 DGFVLFGLPRAFQIDLDALKLQYRELQRAYHPDRFGKASEAEQRKALQQSAEVNPAYQTLKDPLLRAEYLLALA 75 (174)
T ss_pred CcccccccHHHHHHHHhHhhhhHHHHHHhhCcccccccchHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHhc
Confidence 5666677766553 4568999999999999999 1 1245678999999999999999999977666
No 75
>PF03656 Pam16: Pam16; InterPro: IPR005341 The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM) []. In Saccharomyces cerevisiae (Baker's yeast), Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane [].; PDB: 2GUZ_J.
Probab=96.24 E-value=0.0091 Score=41.07 Aligned_cols=50 Identities=30% Similarity=0.329 Sum_probs=36.8
Q ss_pred ccccCCCCCCHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChH
Q psy7114 5 EILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEK 55 (134)
Q Consensus 5 ~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~ 55 (134)
.||||++..+.++|.+.|.+|...++|++ .++.-.-.+|..|.+.|..+.
T Consensus 62 ~ILnv~~~~~~eeI~k~y~~Lf~~Nd~~k-GGSfYLQSKV~rAKErl~~El 111 (127)
T PF03656_consen 62 QILNVKEELSREEIQKRYKHLFKANDPSK-GGSFYLQSKVFRAKERLEQEL 111 (127)
T ss_dssp HHHT--G--SHHHHHHHHHHHHHHT-CCC-TS-HHHHHHHHHHHHHHHHHH
T ss_pred HHcCCCCccCHHHHHHHHHHHHhccCCCc-CCCHHHHHHHHHHHHHHHHHH
Confidence 68999999999999999999999999996 344555668889998887554
No 76
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=96.05 E-value=0.013 Score=33.62 Aligned_cols=35 Identities=34% Similarity=0.759 Sum_probs=26.6
Q ss_pred eEEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEecC
Q psy7114 91 MFTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEITAR 134 (134)
Q Consensus 91 ~~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~~ 134 (134)
...|.| +||..+ ++.....+ +.|+.|+..|.+..+
T Consensus 4 ~~~Y~C~~Cg~~~--~~~~~~~~-------irCp~Cg~rIl~K~R 39 (49)
T COG1996 4 MMEYKCARCGREV--ELDQETRG-------IRCPYCGSRILVKER 39 (49)
T ss_pred eEEEEhhhcCCee--ehhhccCc-------eeCCCCCcEEEEecc
Confidence 356899 999988 33344555 899999999998653
No 77
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=95.43 E-value=0.029 Score=30.05 Aligned_cols=34 Identities=15% Similarity=0.473 Sum_probs=28.0
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v 131 (134)
..| +||..|.|..+.+... ...+.|+.|...+.+
T Consensus 3 ~~CP~C~~~~~v~~~~~~~~----~~~v~C~~C~~~~~~ 37 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLGAN----GGKVRCGKCGHVWYA 37 (38)
T ss_pred EECCCCCCEEEeCHHHcCCC----CCEEECCCCCCEEEe
Confidence 457 9999999999988754 346899999998876
No 78
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=93.97 E-value=0.068 Score=28.71 Aligned_cols=34 Identities=15% Similarity=0.585 Sum_probs=28.0
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v 131 (134)
..| .|+..|.|..+.+..+ ...|.|+.|.....|
T Consensus 3 i~CP~C~~~f~v~~~~l~~~----~~~vrC~~C~~~f~v 37 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDDKLPAG----GRKVRCPKCGHVFRV 37 (37)
T ss_pred EECCCCCceEEcCHHHcccC----CcEEECCCCCcEeeC
Confidence 457 8999999999988776 567999999987653
No 79
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length.
Probab=93.10 E-value=0.29 Score=35.99 Aligned_cols=42 Identities=21% Similarity=0.345 Sum_probs=32.5
Q ss_pred CCCCCHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHH
Q psy7114 10 NVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKE 56 (134)
Q Consensus 10 ~~~as~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~ 56 (134)
+++||.+||.+|+.++..++--| .+.-..|..||+.+.=...
T Consensus 1 S~~ASfeEIq~Arn~ll~~y~gd-----~~~~~~IEaAYD~ILM~rL 42 (194)
T PF11833_consen 1 SEDASFEEIQAARNRLLAQYAGD-----EKSREAIEAAYDAILMERL 42 (194)
T ss_pred CCCCCHHHHHHHHHHHHHHhcCC-----HHHHHHHHHHHHHHHHHHH
Confidence 57899999999999999999666 3344578899977654433
No 80
>PF13446 RPT: A repeated domain in UCH-protein
Probab=92.34 E-value=0.31 Score=28.90 Aligned_cols=29 Identities=21% Similarity=0.327 Sum_probs=24.2
Q ss_pred ccccccCCCCCCHHHHHHHHHHHHHHhCCC
Q psy7114 3 HFEILNCNVNSTLEEIKENYKSLILKYHPD 32 (134)
Q Consensus 3 ~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPD 32 (134)
-|++|||+++.+.+.|-.+|+.... ..|+
T Consensus 7 Ay~~Lgi~~~~~Dd~Ii~~f~~~~~-~~P~ 35 (62)
T PF13446_consen 7 AYEILGIDEDTDDDFIISAFQSKVN-DDPS 35 (62)
T ss_pred HHHHhCcCCCCCHHHHHHHHHHHHH-cChH
Confidence 3889999999999999999998776 3443
No 81
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=91.94 E-value=0.21 Score=26.66 Aligned_cols=32 Identities=19% Similarity=0.637 Sum_probs=26.5
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
..| +|+..|.|.++.+-.. ...+.|+.|+...
T Consensus 3 i~Cp~C~~~y~i~d~~ip~~----g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCPNCQAKYEIDDEKIPPK----GRKVRCSKCGHVF 35 (36)
T ss_pred EECCCCCCEEeCCHHHCCCC----CcEEECCCCCCEe
Confidence 457 8999999999988765 5679999998754
No 82
>PF05180 zf-DNL: DNL zinc finger; InterPro: IPR007853 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The DNL-type zinc finger is found in Tim15, a zinc finger protein essential for protein import into mitochondria. Mitochondrial functions rely on the correct transport of resident proteins synthesized in the cytosol to mitochondria. Protein import into mitochondria is mediated by membrane protein complexes, protein translocators, in the outer and inner mitochondrial membranes, in cooperation with their assistant proteins in the cytosol, intermembrane space and matrix. Proteins destined to the mitochondrial matrix cross the outer membrane with the aid of the outer membrane translocator, the tOM40 complex, and then the inner membrane with the aid of the inner membrane translocator, the TIM23 complex, and mitochondrial motor and chaperone (MMC) proteins including mitochondrial heat- shock protein 70 (mtHsp70), and translocase in the inner mitochondrial membrane (Tim)15. Tim15 is also known as zinc finger motif (Zim)17 or mtHsp70 escort protein (Hep)1. Tim15 contains a zinc-finger motif (CXXC and CXXC) of ~100 residues, which has been named DNL after a short C-terminal motif of D(N/H)L [, , ]. The DNL-type zinc finger is an L-shaped molecule. The two CXXC motifs are located at the end of the L, and are sandwiched by two- stranded antiparallel beta-sheets. Two short alpha-helices constitute another leg of the L. The outer (convex) face of the L has a large acidic groove, which is lined with five acidic residues, whereas the inner (concave) face of the L has two positively charged residues, next to the CXXC motifs []. This entry represents the DNL-type zinc finger.; GO: 0008270 zinc ion binding; PDB: 2E2Z_A.
Probab=91.71 E-value=0.16 Score=30.99 Aligned_cols=35 Identities=23% Similarity=0.583 Sum_probs=25.9
Q ss_pred EeeC-CCC--CeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7114 93 TYPC-RCG--AEFCIEEQDTEGDGSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 93 ~~~c-rcg--~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v 131 (134)
.+.| .|+ ..-.++......| .++|.|++|.-+..|
T Consensus 4 ~FTC~~C~~Rs~~~~sk~aY~~G----vViv~C~gC~~~HlI 41 (66)
T PF05180_consen 4 TFTCNKCGTRSAKMFSKQAYHKG----VVIVQCPGCKNRHLI 41 (66)
T ss_dssp EEEETTTTEEEEEEEEHHHHHTS----EEEEE-TTS--EEES
T ss_pred EEEcCCCCCccceeeCHHHHhCC----eEEEECCCCcceeee
Confidence 4667 787 6778888888888 899999999987665
No 83
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=90.43 E-value=0.72 Score=25.76 Aligned_cols=31 Identities=26% Similarity=0.628 Sum_probs=23.2
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEec
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~ 133 (134)
.|.| .||..+.+.. . ..+.|+.|..-|...-
T Consensus 2 ~Y~C~~Cg~~~~~~~---~-------~~irC~~CG~rIlyK~ 33 (44)
T smart00659 2 IYICGECGRENEIKS---K-------DVVRCRECGYRILYKK 33 (44)
T ss_pred EEECCCCCCEeecCC---C-------CceECCCCCceEEEEe
Confidence 4788 9999888761 1 2389999999887653
No 84
>KOG0724|consensus
Probab=90.40 E-value=0.39 Score=37.79 Aligned_cols=53 Identities=34% Similarity=0.492 Sum_probs=41.8
Q ss_pred CCCHHHHHHHHHHHHHHhCCCCCH-------HHHHHHHHHHHHHHhccChHHHHHHHHHh
Q psy7114 12 NSTLEEIKENYKSLILKYHPDKNS-------EHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 12 ~as~~~Ik~ayr~l~~~~HPDk~~-------~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
.++..+|+.+|+..++..||++.. ...+.|+.|.+||.+|.+...|...|...
T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~i~ka~~i~~~~~~~~t~~~~~ 62 (335)
T KOG0724|consen 3 LASEDELRLAYREMALKSHPEKKSFYEKLSLWTEEEFKKIEKALAILDDDEPRRTPDSWD 62 (335)
T ss_pred cccHHHHHHHHHHHhhhcCcHHHHHHHHhhhhHHHHHHHHHHHHHHHhccccccchhhhh
Confidence 467789999999999999999842 45677999999999999865544544443
No 85
>PF05280 FlhC: Flagellar transcriptional activator (FlhC); InterPro: IPR007944 This family consists of several bacterial flagellar transcriptional activator (FlhC) proteins. FlhC combines with FlhD to form a regulatory complex in Escherichia coli, this complex has been shown to be a global regulator involved in many cellular processes as well as a flagellar transcriptional activator [].; GO: 0003677 DNA binding, 0030092 regulation of flagellum assembly, 0045893 positive regulation of transcription, DNA-dependent; PDB: 2AVU_E.
Probab=88.38 E-value=0.37 Score=34.93 Aligned_cols=30 Identities=30% Similarity=0.737 Sum_probs=13.7
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
-.+| +||+.|.+...++... +.|+.|++.-
T Consensus 134 l~~C~~C~~~fv~~~~~~~~~-------~~Cp~C~~ps 164 (175)
T PF05280_consen 134 LAPCRRCGGHFVTHAHDPRHS-------FVCPFCQPPS 164 (175)
T ss_dssp EEE-TTT--EEEEESS--SS-----------TT-----
T ss_pred ccCCCCCCCCeECcCCCCCcC-------cCCCCCCCcc
Confidence 4688 7999999998887666 7999999743
No 86
>PF14687 DUF4460: Domain of unknown function (DUF4460)
Probab=87.92 E-value=1.2 Score=29.84 Aligned_cols=23 Identities=17% Similarity=0.404 Sum_probs=20.1
Q ss_pred CCCCHHHHHHHHHHHHHHhCCCC
Q psy7114 11 VNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 11 ~~as~~~Ik~ayr~l~~~~HPDk 33 (134)
...+..+++.+.|..-++.|||.
T Consensus 4 r~~~~~~l~~aLr~Fy~~VHPDl 26 (112)
T PF14687_consen 4 RNLSSPDLRSALRPFYFAVHPDL 26 (112)
T ss_pred hhhhhHHHHHHHHHHHHHhCCcc
Confidence 34567889999999999999997
No 87
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=87.66 E-value=1.6 Score=24.24 Aligned_cols=30 Identities=33% Similarity=0.819 Sum_probs=22.3
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v 131 (134)
.+.| +||..+.+..... . +.|+.|...+..
T Consensus 3 ~y~C~~CG~~~~~~~~~~--~-------~~Cp~CG~~~~~ 33 (46)
T PRK00398 3 EYKCARCGREVELDEYGT--G-------VRCPYCGYRILF 33 (46)
T ss_pred EEECCCCCCEEEECCCCC--c-------eECCCCCCeEEE
Confidence 4778 8999988875543 2 799999876654
No 88
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=87.17 E-value=0.79 Score=25.16 Aligned_cols=32 Identities=28% Similarity=0.776 Sum_probs=22.7
Q ss_pred eeEEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 90 QMFTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 90 ~~~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
..|.|.| .||..|.+...--. ...+.|+.|.-
T Consensus 2 P~Yey~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~ 34 (42)
T PF09723_consen 2 PIYEYRCEECGHEFEVLQSISE------DDPVPCPECGS 34 (42)
T ss_pred CCEEEEeCCCCCEEEEEEEcCC------CCCCcCCCCCC
Confidence 3578999 89988887654222 23479999976
No 89
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=85.44 E-value=1.7 Score=25.39 Aligned_cols=32 Identities=38% Similarity=0.883 Sum_probs=24.3
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEe
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEIT 132 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~ 132 (134)
..| .||..+.+....+ | .++.|+.|-.-+.|.
T Consensus 3 ~~CP~CG~~iev~~~~~--G-----eiV~Cp~CGaeleVv 35 (54)
T TIGR01206 3 FECPDCGAEIELENPEL--G-----ELVICDECGAELEVV 35 (54)
T ss_pred cCCCCCCCEEecCCCcc--C-----CEEeCCCCCCEEEEE
Confidence 357 8999888865443 4 378999999988885
No 90
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=85.38 E-value=1.2 Score=23.71 Aligned_cols=32 Identities=25% Similarity=0.712 Sum_probs=22.4
Q ss_pred eEEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7114 91 MFTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLL 128 (134)
Q Consensus 91 ~~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~ 128 (134)
.|.+.| .||..|.+...... ...+.|+.|.--
T Consensus 3 ~Y~y~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~~ 35 (41)
T smart00834 3 IYEYRCEDCGHTFEVLQKISD------DPLATCPECGGD 35 (41)
T ss_pred CEEEEcCCCCCEEEEEEecCC------CCCCCCCCCCCc
Confidence 477899 89998887754321 133689999863
No 91
>PRK12860 transcriptional activator FlhC; Provisional
Probab=83.71 E-value=0.79 Score=33.63 Aligned_cols=28 Identities=29% Similarity=0.774 Sum_probs=23.8
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
-.+| .||+.|.+...|+... ..|+-|..
T Consensus 134 l~~C~~Cgg~fv~~~~e~~~~-------f~CplC~~ 162 (189)
T PRK12860 134 LARCCRCGGKFVTHAHDLRHN-------FVCGLCQP 162 (189)
T ss_pred eccCCCCCCCeeccccccCCC-------CcCCCCCC
Confidence 4678 6999999999988777 79999984
No 92
>KOG3277|consensus
Probab=83.47 E-value=2 Score=30.54 Aligned_cols=37 Identities=24% Similarity=0.531 Sum_probs=29.7
Q ss_pred eEEeeC-CCC--CeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7114 91 MFTYPC-RCG--AEFCIEEQDTEGDGSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 91 ~~~~~c-rcg--~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v 131 (134)
...|.| -|+ ..-.|+....+.| .+||.|+.|+....|
T Consensus 77 ~l~yTCkvCntRs~ktisk~AY~~G----vVivqC~gC~~~Hli 116 (165)
T KOG3277|consen 77 QLAYTCKVCNTRSTKTISKQAYEKG----VVIVQCPGCKNHHLI 116 (165)
T ss_pred EEEEEeeccCCccccccChhhhhCc----eEEEECCCCccceee
Confidence 357888 787 5788888888888 899999999876554
No 93
>PRK12722 transcriptional activator FlhC; Provisional
Probab=83.09 E-value=0.87 Score=33.35 Aligned_cols=28 Identities=29% Similarity=0.765 Sum_probs=23.8
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
-.+| .||+.|.+...|+... ..|+-|..
T Consensus 134 l~~C~~Cgg~fv~~~~e~~~~-------f~CplC~~ 162 (187)
T PRK12722 134 LSSCNCCGGHFVTHAHDPVGS-------FVCGLCQP 162 (187)
T ss_pred eccCCCCCCCeeccccccCCC-------CcCCCCCC
Confidence 4578 6999999999988777 79999986
No 94
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=81.77 E-value=1.8 Score=24.45 Aligned_cols=31 Identities=26% Similarity=0.788 Sum_probs=21.3
Q ss_pred eEEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 91 MFTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 91 ~~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
.|.|.| .||..|.+....-. ...+.|+.|..
T Consensus 3 ~Yey~C~~Cg~~fe~~~~~~~------~~~~~CP~Cg~ 34 (52)
T TIGR02605 3 IYEYRCTACGHRFEVLQKMSD------DPLATCPECGG 34 (52)
T ss_pred CEEEEeCCCCCEeEEEEecCC------CCCCCCCCCCC
Confidence 578999 79998988643111 12367999986
No 95
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=74.65 E-value=3.4 Score=21.42 Aligned_cols=30 Identities=33% Similarity=0.712 Sum_probs=18.3
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEEec
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~ 133 (134)
|.| +||..+.+...+ .+.|+.|...|..+.
T Consensus 1 Y~C~~Cg~~~~~~~~~----------~irC~~CG~RIlyK~ 31 (32)
T PF03604_consen 1 YICGECGAEVELKPGD----------PIRCPECGHRILYKK 31 (32)
T ss_dssp EBESSSSSSE-BSTSS----------TSSBSSSS-SEEBE-
T ss_pred CCCCcCCCeeEcCCCC----------cEECCcCCCeEEEec
Confidence 346 788777744322 268999999887654
No 96
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=73.30 E-value=5 Score=21.86 Aligned_cols=29 Identities=31% Similarity=0.776 Sum_probs=17.2
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEE
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLE 130 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~ 130 (134)
+.| .||+...+... ..| -+.|+.|=++|.
T Consensus 1 m~Cp~Cg~~~~~~D~--~~g------~~vC~~CG~Vl~ 30 (43)
T PF08271_consen 1 MKCPNCGSKEIVFDP--ERG------ELVCPNCGLVLE 30 (43)
T ss_dssp ESBTTTSSSEEEEET--TTT------EEEETTT-BBEE
T ss_pred CCCcCCcCCceEEcC--CCC------eEECCCCCCEee
Confidence 356 78887744432 222 268999987775
No 97
>COG5552 Uncharacterized conserved protein [Function unknown]
Probab=71.36 E-value=17 Score=22.70 Aligned_cols=33 Identities=24% Similarity=0.349 Sum_probs=26.0
Q ss_pred CCccccccCCCCCCHHHHHHHHHHHHHHh----CCCC
Q psy7114 1 MNHFEILNCNVNSTLEEIKENYKSLILKY----HPDK 33 (134)
Q Consensus 1 ~~~Y~vLgv~~~as~~~Ik~ayr~l~~~~----HPDk 33 (134)
||.-+++|+.|-|+..||+.+-++..++. ||..
T Consensus 3 RNIk~LfnfdPPAT~~EvrdAAlQfVRKlSGtT~PS~ 39 (88)
T COG5552 3 RNIKELFNFDPPATPVEVRDAALQFVRKLSGTTHPSA 39 (88)
T ss_pred cchHHHhCCCCCCCcHHHHHHHHHHHHHhcCCCCcch
Confidence 35557889999999999999887766665 6664
No 98
>KOG3442|consensus
Probab=69.56 E-value=9.4 Score=26.20 Aligned_cols=47 Identities=26% Similarity=0.220 Sum_probs=34.5
Q ss_pred ccccCCCCCCHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhcc
Q psy7114 5 EILNCNVNSTLEEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILK 52 (134)
Q Consensus 5 ~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~ 52 (134)
.||+|++.-+.++|-+.|..|...+.+.|. ++.-.-.+|-.|-+-|.
T Consensus 63 qILnV~~~ln~eei~k~yehLFevNdkskG-GSFYLQSKVfRAkErld 109 (132)
T KOG3442|consen 63 QILNVKEPLNREEIEKRYEHLFEVNDKSKG-GSFYLQSKVFRAKERLD 109 (132)
T ss_pred hHhCCCCCCCHHHHHHHHHHHHhccCcccC-cceeehHHHHHHHHHHH
Confidence 589999999999999999999999988872 12222234555555554
No 99
>PF02216 B: B domain; InterPro: IPR003132 This entry represents the immunoglobulin-binding domain found in the Staphylococcus aureus virulence factor protein A (SpA). Protein A contains five highly homologous Ig-binding domains in tandem (designated domains E, D, A, B and C), which share a common structure consisting of three helices in a closed left-handed twist. Protein A can exist in both secreted and membrane-bound forms, and has two distinct Ig-binding activities: each domain can bind Fc-gamma (the constant region of IgG involved in effector functions) and Fab (the Ig fragment responsible for antigen recognition) [].; GO: 0019865 immunoglobulin binding, 0009405 pathogenesis; PDB: 1EDL_A 1EDI_A 1EDJ_A 1EDK_A 2B88_A 2B87_A 2B89_A 1FC2_C 1DEE_H 1ZXG_A ....
Probab=68.12 E-value=13 Score=21.65 Aligned_cols=31 Identities=26% Similarity=0.445 Sum_probs=19.7
Q ss_pred ccccccCCCCCCHHHHHHHHHHHHHHhCCCCCHH
Q psy7114 3 HFEILNCNVNSTLEEIKENYKSLILKYHPDKNSE 36 (134)
Q Consensus 3 ~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk~~~ 36 (134)
||+||+++. -.++=|..|-+ .++-||+....
T Consensus 13 FY~vl~~~n--LteeQrn~yI~-~lKddPs~sq~ 43 (54)
T PF02216_consen 13 FYEVLHMPN--LTEEQRNGYIQ-SLKDDPSRSQN 43 (54)
T ss_dssp HHHHHCSTT--S-HHHHHHHHH-HHHH-GCCHHH
T ss_pred HHHHHcCCC--cCHHHHHhHHH-HHhhChHHHHH
Confidence 799998865 33455677766 46688987433
No 100
>KOG1779|consensus
Probab=67.83 E-value=13 Score=23.37 Aligned_cols=38 Identities=21% Similarity=0.430 Sum_probs=25.7
Q ss_pred eeEEeeCCCCCeEEEecccccCCCCCCceeEeCCCCcceEEEe
Q psy7114 90 QMFTYPCRCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEIT 132 (134)
Q Consensus 90 ~~~~~~crcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~ 132 (134)
+.|++.-+|++-|.||-- + +....+|.|.+|+..+...
T Consensus 29 nsyFm~VkC~gc~~iT~v--f---SHaqtvVvc~~c~~il~~~ 66 (84)
T KOG1779|consen 29 NSYFMDVKCPGCFKITTV--F---SHAQTVVVCEGCSTILCQP 66 (84)
T ss_pred CceEEEEEcCCceEEEEE--e---ecCceEEEcCCCceEEEEe
Confidence 345566688887877643 1 1224579999999988764
No 101
>KOG2324|consensus
Probab=66.99 E-value=17 Score=29.74 Aligned_cols=29 Identities=21% Similarity=0.645 Sum_probs=17.0
Q ss_pred EEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 92 FTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 92 ~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
+-+.| +|| |.-..+++.. .-+..|+.|.-
T Consensus 226 ~l~~C~~C~--~s~n~e~~~~-----sk~~~Cp~C~~ 255 (457)
T KOG2324|consen 226 TLMSCPSCG--YSKNSEDLDL-----SKIASCPKCNE 255 (457)
T ss_pred ceeecCcCC--ccCchhhhcC-----CccccCCcccC
Confidence 45788 887 3333333332 23478999975
No 102
>PF12434 Malate_DH: Malate dehydrogenase enzyme
Probab=66.96 E-value=9.3 Score=19.07 Aligned_cols=18 Identities=22% Similarity=0.471 Sum_probs=15.0
Q ss_pred CHHHHHHHHHHHHHHhCC
Q psy7114 14 TLEEIKENYKSLILKYHP 31 (134)
Q Consensus 14 s~~~Ik~ayr~l~~~~HP 31 (134)
..++.|.+.|+.++.||-
T Consensus 9 ~~~~~r~~lR~AALeYHe 26 (28)
T PF12434_consen 9 NKEDKRAQLRQAALEYHE 26 (28)
T ss_pred chHHHHHHHHHHHHHhcc
Confidence 347789999999999993
No 103
>PF07709 SRR: Seven Residue Repeat; InterPro: IPR011714 This repeat is found in some Plasmodium and Theileria proteins.
Probab=65.80 E-value=5.7 Score=16.48 Aligned_cols=13 Identities=31% Similarity=0.524 Sum_probs=9.1
Q ss_pred HHHHHHHHHHhcc
Q psy7114 40 MFLKINEAWNILK 52 (134)
Q Consensus 40 ~f~~i~~Ay~~L~ 52 (134)
.|..|..||+.|+
T Consensus 2 ~~~~V~~aY~~l~ 14 (14)
T PF07709_consen 2 KFEKVKNAYEQLS 14 (14)
T ss_pred cHHHHHHHHHhcC
Confidence 4677788887664
No 104
>PF09297 zf-NADH-PPase: NADH pyrophosphatase zinc ribbon domain; InterPro: IPR015376 This domain has a zinc ribbon structure and is often found between two NUDIX domains.; GO: 0016787 hydrolase activity, 0046872 metal ion binding; PDB: 1VK6_A 2GB5_A.
Probab=61.70 E-value=8.7 Score=19.50 Aligned_cols=26 Identities=27% Similarity=0.443 Sum_probs=14.1
Q ss_pred eC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 95 PC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 95 ~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
.| +||+.-.....+.. ..|+.|-+.+
T Consensus 5 fC~~CG~~t~~~~~g~~---------r~C~~Cg~~~ 31 (32)
T PF09297_consen 5 FCGRCGAPTKPAPGGWA---------RRCPSCGHEH 31 (32)
T ss_dssp B-TTT--BEEE-SSSS----------EEESSSS-EE
T ss_pred ccCcCCccccCCCCcCE---------eECCCCcCEe
Confidence 46 89988887777443 4788887643
No 105
>TIGR00577 fpg formamidopyrimidine-DNA glycosylase (fpg). All proteins in the FPG family with known functions are FAPY-DNA glycosylases that function in base excision repair. Homologous to endonuclease VIII (nei). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=60.76 E-value=24 Score=27.13 Aligned_cols=26 Identities=31% Similarity=0.659 Sum_probs=16.0
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs 126 (134)
.+| +||....-..- .| -....|+.|-
T Consensus 246 ~pC~~Cg~~I~~~~~---~g----R~t~~CP~CQ 272 (272)
T TIGR00577 246 EPCRRCGTPIEKIKV---GG----RGTHFCPQCQ 272 (272)
T ss_pred CCCCCCCCeeEEEEE---CC----CCCEECCCCC
Confidence 378 89976554322 12 3457899983
No 106
>COG3195 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=59.71 E-value=34 Score=24.72 Aligned_cols=40 Identities=20% Similarity=0.330 Sum_probs=28.9
Q ss_pred CCCCCHHHHHHHHHHH-----------HHHhCCCC--------------------------CHHHHHHHHHHHHHHH
Q psy7114 10 NVNSTLEEIKENYKSL-----------ILKYHPDK--------------------------NSEHQEMFLKINEAWN 49 (134)
Q Consensus 10 ~~~as~~~Ik~ayr~l-----------~~~~HPDk--------------------------~~~~~~~f~~i~~Ay~ 49 (134)
.|.+|.+.+..+..+. .++-|||. .+...++|..+|+||.
T Consensus 41 ~pf~s~~~l~~am~~~v~~A~~~~rl~liraHPdLAgk~a~a~elta~S~~EQasAGLd~Ls~~E~a~f~~LN~aY~ 117 (176)
T COG3195 41 RPFASAEGLHAAMCRAVRAASEEERLALIRAHPDLAGKAAIAGELTAESTSEQASAGLDRLSPEEFARFTELNAAYV 117 (176)
T ss_pred CCcCCHHHHHHHHHHHHHcCCHHHHHHHHHhChhhHHHHHHHHHhhhhhHHHHHhcCcccCCHHHHHHHHHHHHHHH
Confidence 5677878777776443 45679997 4455678999999884
No 107
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=58.19 E-value=3.5 Score=31.35 Aligned_cols=33 Identities=18% Similarity=0.354 Sum_probs=23.4
Q ss_pred CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 97 RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 97 rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
+|||=+.+-.....+....+..+|.||.|+-.+
T Consensus 199 ~C~GC~m~l~~~~~~~V~~~d~iv~CP~CgRIL 231 (239)
T COG1579 199 VCGGCHMKLPSQTLSKVRKKDEIVFCPYCGRIL 231 (239)
T ss_pred cccCCeeeecHHHHHHHhcCCCCccCCccchHH
Confidence 677656555555666555567899999998654
No 108
>PF13831 PHD_2: PHD-finger; PDB: 2L43_A 2KU3_A.
Probab=57.47 E-value=9.1 Score=20.23 Aligned_cols=14 Identities=29% Similarity=0.759 Sum_probs=9.0
Q ss_pred eeEeCCCCcceEEE
Q psy7114 118 ILIACDTCSLLLEI 131 (134)
Q Consensus 118 ~~v~C~~Csl~i~v 131 (134)
.++.|..|++.|+-
T Consensus 3 ~ll~C~~C~v~VH~ 16 (36)
T PF13831_consen 3 PLLFCDNCNVAVHQ 16 (36)
T ss_dssp EEEE-SSS--EEEH
T ss_pred ceEEeCCCCCcCCh
Confidence 47899999999873
No 109
>PRK13945 formamidopyrimidine-DNA glycosylase; Provisional
Probab=57.36 E-value=35 Score=26.38 Aligned_cols=26 Identities=23% Similarity=0.583 Sum_probs=16.3
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs 126 (134)
.+| +||....-..- .| -....|+.|-
T Consensus 255 ~pC~~Cg~~I~~~~~---~g----R~t~~CP~CQ 281 (282)
T PRK13945 255 KPCRKCGTPIERIKL---AG----RSTHWCPNCQ 281 (282)
T ss_pred CCCCcCCCeeEEEEE---CC----CccEECCCCc
Confidence 489 99976543221 22 3458899994
No 110
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=55.78 E-value=16 Score=25.20 Aligned_cols=13 Identities=31% Similarity=1.032 Sum_probs=10.3
Q ss_pred EeeC-CCCCeEEEe
Q psy7114 93 TYPC-RCGAEFCIE 105 (134)
Q Consensus 93 ~~~c-rcg~~~~v~ 105 (134)
...| .||..|.+.
T Consensus 70 ~~~C~~CG~~~~~~ 83 (135)
T PRK03824 70 VLKCRNCGNEWSLK 83 (135)
T ss_pred EEECCCCCCEEecc
Confidence 3778 799888886
No 111
>PRK14811 formamidopyrimidine-DNA glycosylase; Provisional
Probab=55.15 E-value=43 Score=25.70 Aligned_cols=27 Identities=26% Similarity=0.501 Sum_probs=17.1
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
.+| +||....-..-. | -....|+.|-.
T Consensus 236 ~pC~~Cg~~I~~~~~~---g----R~ty~Cp~CQ~ 263 (269)
T PRK14811 236 QPCPRCGTPIEKIVVG---G----RGTHFCPQCQP 263 (269)
T ss_pred CCCCcCCCeeEEEEEC---C----CCcEECCCCcC
Confidence 479 999765543221 2 34589999964
No 112
>PRK01103 formamidopyrimidine/5-formyluracil/ 5-hydroxymethyluracil DNA glycosylase; Validated
Probab=54.97 E-value=43 Score=25.68 Aligned_cols=26 Identities=35% Similarity=0.786 Sum_probs=16.4
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs 126 (134)
.+| +||....-..- .| -....|+.|-
T Consensus 246 ~pC~~Cg~~I~~~~~---~g----R~t~~CP~CQ 272 (274)
T PRK01103 246 EPCRRCGTPIEKIKQ---GG----RSTFFCPRCQ 272 (274)
T ss_pred CCCCCCCCeeEEEEE---CC----CCcEECcCCC
Confidence 478 89977543221 12 3458999995
No 113
>PF10041 DUF2277: Uncharacterized conserved protein (DUF2277); InterPro: IPR018735 Members of this family of hypothetical bacterial proteins have no known function.
Probab=52.96 E-value=55 Score=20.49 Aligned_cols=32 Identities=22% Similarity=0.091 Sum_probs=25.2
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCC
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk 33 (134)
|.-.+.|+.|-|+.+||+.|-.+..++..--.
T Consensus 4 nI~~L~~fePpaT~~EI~aAAlQyVRKvSG~~ 35 (78)
T PF10041_consen 4 NIKTLRNFEPPATDEEIRAAALQYVRKVSGFR 35 (78)
T ss_pred chhhhcCCCCCCCHHHHHHHHHHHHHHHccCC
Confidence 44456789999999999999888888775433
No 114
>PRK14810 formamidopyrimidine-DNA glycosylase; Provisional
Probab=51.65 E-value=55 Score=25.16 Aligned_cols=26 Identities=19% Similarity=0.425 Sum_probs=15.8
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs 126 (134)
.+| +||....-..- .| -....|+.|.
T Consensus 245 ~pCprCG~~I~~~~~---~g----R~t~~CP~CQ 271 (272)
T PRK14810 245 EPCLNCKTPIRRVVV---AG----RSSHYCPHCQ 271 (272)
T ss_pred CcCCCCCCeeEEEEE---CC----CccEECcCCc
Confidence 578 89976532221 12 3457899994
No 115
>COG0266 Nei Formamidopyrimidine-DNA glycosylase [DNA replication, recombination, and repair]
Probab=50.80 E-value=58 Score=25.34 Aligned_cols=26 Identities=27% Similarity=0.636 Sum_probs=15.1
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs 126 (134)
-+| +||+...-..-.- -...-|+.|-
T Consensus 246 epC~~CGt~I~k~~~~g-------R~t~~CP~CQ 272 (273)
T COG0266 246 EPCRRCGTPIEKIKLGG-------RSTFYCPVCQ 272 (273)
T ss_pred CCCCccCCEeEEEEEcC-------CcCEeCCCCC
Confidence 378 8997654332211 2337899984
No 116
>COG5415 Predicted integral membrane metal-binding protein [General function prediction only]
Probab=50.03 E-value=57 Score=24.59 Aligned_cols=29 Identities=17% Similarity=0.471 Sum_probs=17.9
Q ss_pred EeeC-CCC---CeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 93 TYPC-RCG---AEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 93 ~~~c-rcg---~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
.-.| .|- +-+...+.++.. ..|+.|..|+
T Consensus 192 alIC~~C~hhngl~~~~ek~~~e--------fiC~~Cn~~n 224 (251)
T COG5415 192 ALICPQCHHHNGLYRLAEKPIIE--------FICPHCNHKN 224 (251)
T ss_pred hhccccccccccccccccccchh--------eecccchhhc
Confidence 3456 553 444444444443 6999999998
No 117
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=49.22 E-value=18 Score=25.45 Aligned_cols=27 Identities=30% Similarity=0.657 Sum_probs=20.2
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLL 128 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~ 128 (134)
.+.| .||....++.. ..+-+|+.|..-
T Consensus 112 ~l~C~~Cg~~~~~~~~---------~~l~~Cp~C~~~ 139 (146)
T PF07295_consen 112 TLVCENCGHEVELTHP---------ERLPPCPKCGHT 139 (146)
T ss_pred eEecccCCCEEEecCC---------CcCCCCCCCCCC
Confidence 5789 99988888754 235699999753
No 118
>PF04216 FdhE: Protein involved in formate dehydrogenase formation; InterPro: IPR006452 This family of sequences describe an accessory protein required for the assembly of formate dehydrogenase of certain proteobacteria although not present in the final complex []. The exact nature of the function of FdhE in the assembly of the complex is unknown, but considering the presence of selenocysteine, molybdopterin, iron-sulphur clusters and cytochrome b556, it is likely to be involved in the insertion of cofactors. ; GO: 0005737 cytoplasm; PDB: 2FIY_B.
Probab=48.96 E-value=11 Score=29.16 Aligned_cols=17 Identities=18% Similarity=0.495 Sum_probs=14.5
Q ss_pred ceeEeCCCCcceEEEec
Q psy7114 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~Csl~i~v~~ 133 (134)
.-+-.|.+|.-++++..
T Consensus 236 ~rve~C~~C~~YlK~vd 252 (290)
T PF04216_consen 236 YRVEVCESCGSYLKTVD 252 (290)
T ss_dssp EEEEEETTTTEEEEEEE
T ss_pred EEEEECCcccchHHHHh
Confidence 56799999999999864
No 119
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=46.53 E-value=11 Score=27.32 Aligned_cols=29 Identities=24% Similarity=0.667 Sum_probs=20.9
Q ss_pred EEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7114 92 FTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLL 128 (134)
Q Consensus 92 ~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~ 128 (134)
-+|.| +|+..|...++ +..+ ..|+.|.-.
T Consensus 116 ~~Y~Cp~C~~rytf~eA-~~~~-------F~Cp~Cg~~ 145 (178)
T PRK06266 116 MFFFCPNCHIRFTFDEA-MEYG-------FRCPQCGEM 145 (178)
T ss_pred CEEECCCCCcEEeHHHH-hhcC-------CcCCCCCCC
Confidence 34788 89988888766 3334 699999643
No 120
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=44.32 E-value=16 Score=25.39 Aligned_cols=37 Identities=24% Similarity=0.454 Sum_probs=23.7
Q ss_pred EEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEEE
Q psy7114 92 FTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLEI 131 (134)
Q Consensus 92 ~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v 131 (134)
..|.| +||..|....+..... ......|+.|.-.|..
T Consensus 98 ~~Y~Cp~C~~~y~~~ea~~~~d---~~~~f~Cp~Cg~~l~~ 135 (147)
T smart00531 98 AYYKCPNCQSKYTFLEANQLLD---MDGTFTCPRCGEELEE 135 (147)
T ss_pred cEEECcCCCCEeeHHHHHHhcC---CCCcEECCCCCCEEEE
Confidence 35888 9998888654422211 1233899999877643
No 121
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=43.96 E-value=14 Score=18.80 Aligned_cols=23 Identities=30% Similarity=0.762 Sum_probs=11.3
Q ss_pred CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 97 RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 97 rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
.||..|.... ..+..|++|..-+
T Consensus 7 ~C~se~~y~D----------~~~~vCp~C~~ew 29 (30)
T PF08274_consen 7 LCGSEYTYED----------GELLVCPECGHEW 29 (30)
T ss_dssp TT-----EE-----------SSSEEETTTTEEE
T ss_pred CCCCcceecc----------CCEEeCCcccccC
Confidence 7887777722 2457899997643
No 122
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=43.56 E-value=17 Score=24.69 Aligned_cols=35 Identities=26% Similarity=0.581 Sum_probs=19.0
Q ss_pred eeCCCCCeEEEecccccCCCCCCceeEeCCCCc-ceEEEe
Q psy7114 94 YPCRCGAEFCIEEQDTEGDGSDDNILIACDTCS-LLLEIT 132 (134)
Q Consensus 94 ~~crcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs-l~i~v~ 132 (134)
..|.||..|.+...++..- .....||.|- .-+.|.
T Consensus 71 ~~C~Cg~~~~~~~~~~~~~----~~~~~CP~Cgs~~~~i~ 106 (124)
T PRK00762 71 IECECGYEGVVDEDEIDHY----AAVIECPVCGNKRAHIL 106 (124)
T ss_pred EEeeCcCcccccccchhcc----ccCCcCcCCCCCCCEEe
Confidence 4568887777654322110 0124799994 455543
No 123
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=43.23 E-value=71 Score=18.96 Aligned_cols=43 Identities=21% Similarity=0.420 Sum_probs=30.1
Q ss_pred HHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHHh
Q psy7114 20 ENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 20 ~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
+..+...+.-||+. ...+..+.|.+.|..|++.++...++...
T Consensus 14 ~~~r~~~~~~~p~~--~~~eisk~l~~~Wk~ls~~eK~~y~~~a~ 56 (72)
T cd01388 14 KRHRRKVLQEYPLK--ENRAISKILGDRWKALSNEEKQPYYEEAK 56 (72)
T ss_pred HHHHHHHHHHCCCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 34455666789985 33466778999999999888766555443
No 124
>TIGR00143 hypF [NiFe] hydrogenase maturation protein HypF. A previously described regulatory effect of HypF mutatation is attributable to loss of activity of a regulatory hydrogenase. A zinc finger-like region CXXCX(18)CXXCX(24)CXXCX(18)CXXC region further supported the regulatory hypothesis. However, more recent work (PUBMED:11375153) shows the direct effect is on the activity of expressed hydrogenases with nickel/iron centers, rather than on expression.
Probab=42.40 E-value=16 Score=32.16 Aligned_cols=26 Identities=27% Similarity=0.608 Sum_probs=17.6
Q ss_pred CCceeEEe---eC-CCCCeEEEecccccCC
Q psy7114 87 GESQMFTY---PC-RCGAEFCIEEQDTEGD 112 (134)
Q Consensus 87 ~~~~~~~~---~c-rcg~~~~v~~~~l~~g 112 (134)
..+.+|.| .| .||..|.|...--+..
T Consensus 81 p~~Rry~YpF~nCt~CGPr~~i~~~lpydr 110 (711)
T TIGR00143 81 KNDRRYLYPFISCTHCGPRFTIIEALPYDR 110 (711)
T ss_pred CCcccccCCcccccCCCCCeEEeecCCCCC
Confidence 34445544 48 9999999998844443
No 125
>PF04502 DUF572: Family of unknown function (DUF572) ; InterPro: IPR007590 This entry represents eukaryotic proteins with undetermined function belonging to the CWC16 family.
Probab=41.47 E-value=39 Score=26.69 Aligned_cols=15 Identities=33% Similarity=0.671 Sum_probs=12.6
Q ss_pred eEeCCCCcceEEEec
Q psy7114 119 LIACDTCSLLLEITA 133 (134)
Q Consensus 119 ~v~C~~Csl~i~v~~ 133 (134)
.+.|+.|+-+|.+.+
T Consensus 77 ~~kC~~C~~~i~~kT 91 (324)
T PF04502_consen 77 YIKCPRCSNEIEFKT 91 (324)
T ss_pred EEEcCCCCCEEeeec
Confidence 499999999998754
No 126
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=41.24 E-value=14 Score=26.18 Aligned_cols=29 Identities=24% Similarity=0.571 Sum_probs=20.0
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
.|.| +|+..|...++- ..+ ..|+.|.-.+
T Consensus 109 ~Y~Cp~c~~r~tf~eA~-~~~-------F~Cp~Cg~~L 138 (158)
T TIGR00373 109 FFICPNMCVRFTFNEAM-ELN-------FTCPRCGAML 138 (158)
T ss_pred eEECCCCCcEeeHHHHH-HcC-------CcCCCCCCEe
Confidence 4778 898888776663 333 6899996543
No 127
>PF11331 DUF3133: Protein of unknown function (DUF3133); InterPro: IPR021480 This eukaryotic family of proteins has no known function.
Probab=40.78 E-value=41 Score=18.93 Aligned_cols=17 Identities=24% Similarity=0.575 Sum_probs=13.5
Q ss_pred ceeEeCCCCcceEEEec
Q psy7114 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~Csl~i~v~~ 133 (134)
...+.|..||-.|.+.-
T Consensus 29 ~~klrCGaCs~vl~~s~ 45 (46)
T PF11331_consen 29 QQKLRCGACSEVLSFSL 45 (46)
T ss_pred eeEEeCCCCceeEEEec
Confidence 45689999999998753
No 128
>PF14354 Lar_restr_allev: Restriction alleviation protein Lar
Probab=40.68 E-value=42 Score=19.28 Aligned_cols=33 Identities=27% Similarity=0.540 Sum_probs=17.0
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
.+| .||..-.....+...+ ......|.|..|..
T Consensus 4 kPCPFCG~~~~~~~~~~~~~-~~~~~~V~C~~Cga 37 (61)
T PF14354_consen 4 KPCPFCGSADVLIRQDEGFD-YGMYYYVECTDCGA 37 (61)
T ss_pred cCCCCCCCcceEeecccCCC-CCCEEEEEcCCCCC
Confidence 467 6874444444332221 00116688998865
No 129
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=38.87 E-value=11 Score=18.19 Aligned_cols=14 Identities=21% Similarity=0.826 Sum_probs=11.1
Q ss_pred eeCCCCCeEEEecc
Q psy7114 94 YPCRCGAEFCIEEQ 107 (134)
Q Consensus 94 ~~crcg~~~~v~~~ 107 (134)
|.|.|...|.++.+
T Consensus 2 y~C~C~~Gy~l~~d 15 (24)
T PF12662_consen 2 YTCSCPPGYQLSPD 15 (24)
T ss_pred EEeeCCCCCcCCCC
Confidence 67899988887654
No 130
>PF07739 TipAS: TipAS antibiotic-recognition domain; InterPro: IPR012925 TipAL is a bacterial transcriptional regulator of the MerR family. The tipA gene can be expressed as a long form, TipAL, and a short form, TipAS, which constitutes the C-terminal part of TipAL. TipAS forms the antibiotic-recognition domain []. This domain, which has an alpha-helical globin-like fold, is also found at the C terminus of other MerR family transcription factors, including Mta, a central regulator of multidrug resistance in Bacillus subtilis [], and SkgA from Caulobacter crescentus []. ; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 1NY9_A 3HH0_A 3QAO_A.
Probab=38.30 E-value=96 Score=19.89 Aligned_cols=49 Identities=16% Similarity=0.396 Sum_probs=27.9
Q ss_pred cCCCCCCH-HHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHH
Q psy7114 8 NCNVNSTL-EEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYES 62 (134)
Q Consensus 8 gv~~~as~-~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~ 62 (134)
|++|+... .+|-+.+..++..+++. +.+.+..|.+.| +.||.-+..||.
T Consensus 51 g~~p~s~evq~l~~~~~~~~~~~~~~----~~~~~~~l~~~y--~~~~~~~~~~~~ 100 (118)
T PF07739_consen 51 GVDPDSPEVQELAERWMELINQFTGG----DPELLRGLAQMY--VEDPRFAAMYDK 100 (118)
T ss_dssp T--TT-HHHHHHHHHHHHHHHHSS-------HHHHHHHHHHT--TSTHHHHHHHG-
T ss_pred CCCcCCHHHHHHHHHHHHHHHHHhCC----CHHHHHHHHHHH--HcCHHHHhhccc
Confidence 45555433 34566666666666662 345677777777 677887777773
No 131
>COG0089 RplW Ribosomal protein L23 [Translation, ribosomal structure and biogenesis]
Probab=37.46 E-value=33 Score=22.32 Aligned_cols=21 Identities=24% Similarity=0.346 Sum_probs=17.7
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7114 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ayr~l~ 26 (134)
+|-|.++|+..+||+|...+.
T Consensus 25 vF~V~~~AtK~~IK~AvE~lF 45 (94)
T COG0089 25 VFIVDPDATKPEIKAAVEELF 45 (94)
T ss_pred EEEECCCCCHHHHHHHHHHHh
Confidence 467899999999999988764
No 132
>COG0068 HypF Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=37.32 E-value=24 Score=31.05 Aligned_cols=28 Identities=32% Similarity=0.666 Sum_probs=19.5
Q ss_pred ccCCceeEEee---C-CCCCeEEEecccccCC
Q psy7114 85 HKGESQMFTYP---C-RCGAEFCIEEQDTEGD 112 (134)
Q Consensus 85 ~~~~~~~~~~~---c-rcg~~~~v~~~~l~~g 112 (134)
|+..+.+|-|| | .||..|+|-..--|..
T Consensus 112 ~dp~~rrY~YPF~~CT~CGPRfTIi~alPYDR 143 (750)
T COG0068 112 FDPNSRRYLYPFINCTNCGPRFTIIEALPYDR 143 (750)
T ss_pred cCCCCcceeccccccCCCCcceeeeccCCCCc
Confidence 34455565554 8 9999999988866654
No 133
>PRK11032 hypothetical protein; Provisional
Probab=36.92 E-value=1.1e+02 Score=21.80 Aligned_cols=26 Identities=19% Similarity=0.618 Sum_probs=19.0
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
...| .||....+..- ..|-+|+.|..
T Consensus 124 ~LvC~~Cg~~~~~~~p---------~~i~pCp~C~~ 150 (160)
T PRK11032 124 NLVCEKCHHHLAFYTP---------EVLPLCPKCGH 150 (160)
T ss_pred eEEecCCCCEEEecCC---------CcCCCCCCCCC
Confidence 4678 89988877544 45568999964
No 134
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=36.42 E-value=31 Score=27.20 Aligned_cols=17 Identities=29% Similarity=0.438 Sum_probs=13.9
Q ss_pred ceeEeCCCCcceEEEec
Q psy7114 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~Csl~i~v~~ 133 (134)
.-+-.|.+|.-++++..
T Consensus 250 ~r~e~C~~C~~YlK~~~ 266 (305)
T TIGR01562 250 LKAETCDSCQGYLKILY 266 (305)
T ss_pred eEEeeccccccchhhhc
Confidence 45679999999998864
No 135
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=35.93 E-value=23 Score=23.49 Aligned_cols=29 Identities=38% Similarity=0.740 Sum_probs=18.3
Q ss_pred EeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcce-EEEe
Q psy7114 93 TYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLL-LEIT 132 (134)
Q Consensus 93 ~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~-i~v~ 132 (134)
...| .||..|.++.... .||.|... +.|.
T Consensus 70 ~~~C~~Cg~~~~~~~~~~-----------~CP~Cgs~~~~i~ 100 (113)
T PF01155_consen 70 RARCRDCGHEFEPDEFDF-----------SCPRCGSPDVEII 100 (113)
T ss_dssp EEEETTTS-EEECHHCCH-----------H-SSSSSS-EEEE
T ss_pred cEECCCCCCEEecCCCCC-----------CCcCCcCCCcEEc
Confidence 3568 8998888877652 48888765 4443
No 136
>PF09855 DUF2082: Nucleic-acid-binding protein containing Zn-ribbon domain (DUF2082); InterPro: IPR018652 This family of proteins contains various hypothetical prokaryotic proteins as well as some Zn-ribbon nucleic-acid-binding proteins.
Probab=34.78 E-value=49 Score=19.91 Aligned_cols=10 Identities=20% Similarity=0.873 Sum_probs=5.9
Q ss_pred eeEeCCCCcc
Q psy7114 118 ILIACDTCSL 127 (134)
Q Consensus 118 ~~v~C~~Csl 127 (134)
..+.|+.|..
T Consensus 35 ~~v~C~~CGY 44 (64)
T PF09855_consen 35 TTVSCTNCGY 44 (64)
T ss_pred EEEECCCCCC
Confidence 3466666654
No 137
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=34.02 E-value=95 Score=17.78 Aligned_cols=42 Identities=17% Similarity=0.349 Sum_probs=29.2
Q ss_pred HHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHH
Q psy7114 20 ENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQ 63 (134)
Q Consensus 20 ~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~ 63 (134)
...+...+.-||+.. ..+..+.|.+.|.-|++.++....+..
T Consensus 13 ~~~~~~~k~~~p~~~--~~~i~~~~~~~W~~l~~~eK~~y~~~a 54 (69)
T PF00505_consen 13 KEKRAKLKEENPDLS--NKEISKILAQMWKNLSEEEKAPYKEEA 54 (69)
T ss_dssp HHHHHHHHHHSTTST--HHHHHHHHHHHHHCSHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhcccc--cccchhhHHHHHhcCCHHHHHHHHHHH
Confidence 444555667799865 456778999999999866655544444
No 138
>PRK13798 putative OHCU decarboxylase; Provisional
Probab=33.95 E-value=1.7e+02 Score=20.78 Aligned_cols=24 Identities=13% Similarity=0.072 Sum_probs=15.7
Q ss_pred CCCCCHHHHHHHHHHH-----------HHHhCCCC
Q psy7114 10 NVNSTLEEIKENYKSL-----------ILKYHPDK 33 (134)
Q Consensus 10 ~~~as~~~Ik~ayr~l-----------~~~~HPDk 33 (134)
.|-+|.+++..+.... ++..|||-
T Consensus 40 RPf~s~~~L~~a~~~~~~~~~~~~~~~~l~~HP~l 74 (166)
T PRK13798 40 RPFADHDALLAAADEALAGLSEADIDEALAGHPRI 74 (166)
T ss_pred CCCCCHHHHHHHHHHHHHcCCHHHHHHHHHhCCcc
Confidence 3556666666665443 66789997
No 139
>PF14706 Tnp_DNA_bind: Transposase DNA-binding; PDB: 3ECP_A 4DM0_A 1MUS_A 1MUH_A 1MM8_A.
Probab=33.71 E-value=71 Score=18.77 Aligned_cols=39 Identities=8% Similarity=0.199 Sum_probs=26.5
Q ss_pred HHHHHHHHHHH--HhCCCC-CHHHHHHHHHHHHHHHhccChH
Q psy7114 17 EIKENYKSLIL--KYHPDK-NSEHQEMFLKINEAWNILKDEK 55 (134)
Q Consensus 17 ~Ik~ayr~l~~--~~HPDk-~~~~~~~f~~i~~Ay~~L~d~~ 55 (134)
-+.+...+++. --||.. -+.+......+.-||..|.+|.
T Consensus 16 Rl~~Rl~~l~~~la~~p~~Sip~a~~~wa~tkaAYRF~~N~~ 57 (58)
T PF14706_consen 16 RLTRRLVKLAESLAEKPGASIPQACQDWAETKAAYRFFRNPR 57 (58)
T ss_dssp HHHHHHHHHHHHHHHTTTS-HHHHTT-HHHHHHHHHHHT-TT
T ss_pred hHHHHHHHHHHHHHHCCCCccchhccCHHHHHHHHHhhcCCC
Confidence 45666666654 458887 5566667778899999998874
No 140
>TIGR03180 UraD_2 OHCU decarboxylase. Previously thought to only proceed spontaneously, the decarboxylation of 2-oxo-4-hydroxy-4-carboxy--5-ureidoimidazoline (OHCU) has been recently been shown to be catalyzed by this enzyme in Mus musculus. Homologs of this enzyme are found adjacent to and fused with uricase in a number of prokaryotes and are represented by this model. This model is a separate (but related) clade from that represented by TIGR3164. This model places a second homolog in streptomyces species which (are not in the vicinity of other urate catabolism associated genes) below the trusted cutoff.
Probab=33.31 E-value=1.7e+02 Score=20.58 Aligned_cols=24 Identities=17% Similarity=0.115 Sum_probs=15.4
Q ss_pred CCCCCHHHHHHHHHHH-----------HHHhCCCC
Q psy7114 10 NVNSTLEEIKENYKSL-----------ILKYHPDK 33 (134)
Q Consensus 10 ~~~as~~~Ik~ayr~l-----------~~~~HPDk 33 (134)
.|-+|.+++..+.... ++..|||-
T Consensus 30 RPf~s~~~L~~a~~~~~~~~~~~~~~~~l~~HP~l 64 (158)
T TIGR03180 30 RPFASAEALLAAADQAWQNLSEQDLFEALAGHPRI 64 (158)
T ss_pred CCCCCHHHHHHHHHHHHHcCCHHHHHHHHHhCCcc
Confidence 3556666666665443 66779987
No 141
>PF06107 DUF951: Bacterial protein of unknown function (DUF951); InterPro: IPR009296 This family consists of several short hypothetical bacterial proteins of unknown function.
Probab=33.12 E-value=69 Score=18.89 Aligned_cols=30 Identities=23% Similarity=0.462 Sum_probs=19.3
Q ss_pred CCC-CeEEEecccccCCCCCCceeEeCCCCcceEEEec
Q psy7114 97 RCG-AEFCIEEQDTEGDGSDDNILIACDTCSLLLEITA 133 (134)
Q Consensus 97 rcg-~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~ 133 (134)
.|| ..+.|..-.+.-. +.|.+|-..|.+.+
T Consensus 15 PCG~~~Wei~R~GaDik-------ikC~gCg~~imlpR 45 (57)
T PF06107_consen 15 PCGSNEWEIIRIGADIK-------IKCLGCGRQIMLPR 45 (57)
T ss_pred CCCCCEEEEEEccCcEE-------EEECCCCCEEEEeH
Confidence 455 4566655444443 77999998887653
No 142
>cd01780 PLC_epsilon_RA Ubiquitin-like domain of Phosphatidylinositide-specific phospholipase. PLC_epsilon_RA Phosphatidylinositide-specific phospholipase C (PLC) is a signaling enzyme that hydrolyzes membrane phospholipids to generate inositol triphosphate. PLC-epsilon represents a novel forth class of PLC that has a PLC catalytic core domain, a CDC25 guanine nucleotide exchange factor domain and two RA (Ras-association) domains of which the second is critical for Ras activation of the enzyme.
Probab=32.56 E-value=55 Score=21.25 Aligned_cols=32 Identities=16% Similarity=0.230 Sum_probs=24.9
Q ss_pred CccccccCCCCCCHHHHHHHHHHHHHHhCCCC
Q psy7114 2 NHFEILNCNVNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~Ik~ayr~l~~~~HPDk 33 (134)
.+|.||-++...|..+|-+.-...|++-+||-
T Consensus 12 qP~~il~a~~~STa~Dvi~Qal~KA~rs~~~~ 43 (93)
T cd01780 12 QPYAILRAPRVSTAQDVIQQTLCKARRSNPNP 43 (93)
T ss_pred CCeeEEEccccccHHHHHHHHHHHhccCCCCc
Confidence 47999999999998887666556667778884
No 143
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=31.99 E-value=1e+02 Score=17.44 Aligned_cols=41 Identities=24% Similarity=0.375 Sum_probs=28.1
Q ss_pred HHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHHh
Q psy7114 22 YKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQL 64 (134)
Q Consensus 22 yr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~~ 64 (134)
.+...+.-||+. ...+..+.|.+.|..|++.++....+...
T Consensus 15 ~r~~~~~~~p~~--~~~~i~~~~~~~W~~ls~~eK~~y~~~a~ 55 (66)
T cd01390 15 QRPKLKKENPDA--SVTEVTKILGEKWKELSEEEKKKYEEKAE 55 (66)
T ss_pred HHHHHHHHCcCC--CHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 345556678874 34567789999999999777655544443
No 144
>PTZ00043 cytochrome c oxidase subunit; Provisional
Probab=31.75 E-value=2.4e+02 Score=21.63 Aligned_cols=37 Identities=27% Similarity=0.265 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhcc
Q psy7114 16 EEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILK 52 (134)
Q Consensus 16 ~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~ 52 (134)
.-++++|++||+-.-=.......+....+-+-|++|.
T Consensus 96 ~l~~ksykqla~d~gmqi~~~~~~hm~~~le~y~~Lk 132 (268)
T PTZ00043 96 TLPKKSYKQLARDMGMQIVNEPSEHMLGLLELYEYLK 132 (268)
T ss_pred hcchHHHHHHHHHhCceecCCchHHHHHHHHHHHhcC
Confidence 3478999999885543332233455667777787776
No 145
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=31.34 E-value=1e+02 Score=17.26 Aligned_cols=43 Identities=19% Similarity=0.327 Sum_probs=29.8
Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHH
Q psy7114 19 KENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQ 63 (134)
Q Consensus 19 k~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~ 63 (134)
...++...+.-||+.. ..+..+.+.+.|..|++.++....+..
T Consensus 12 ~~~~~~~~~~~~~~~~--~~~i~~~~~~~W~~l~~~~k~~y~~~a 54 (66)
T cd00084 12 SQEHRAEVKAENPGLS--VGEISKILGEMWKSLSEEEKKKYEEKA 54 (66)
T ss_pred HHHHHHHHHHHCcCCC--HHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 3455666777888843 355677899999999987665554444
No 146
>PF05876 Terminase_GpA: Phage terminase large subunit (GpA); InterPro: IPR008866 This entry is represented by Bacteriophage lambda, GpA. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry consists of several phage terminase large subunit proteins as well as related sequences from several bacterial species. The DNA packaging enzyme of bacteriophage lambda, terminase, is a heteromultimer composed of a small subunit, gpNu1, and a large subunit, gpA, products of the Nu1 and A genes, respectively. Terminase is involved in the site-specific binding and cutting of the DNA in the initial stages of packaging. It is now known that gpA is actively involved in late stages of packaging, including DNA translocation, and that this enzyme contains separate functional domains for its early and late packaging activities [].
Probab=31.24 E-value=41 Score=28.64 Aligned_cols=43 Identities=21% Similarity=0.549 Sum_probs=29.2
Q ss_pred CCceeEEeeC-CCCCeEEEecccccCC--CCCCceeEeCCCCcceE
Q psy7114 87 GESQMFTYPC-RCGAEFCIEEQDTEGD--GSDDNILIACDTCSLLL 129 (134)
Q Consensus 87 ~~~~~~~~~c-rcg~~~~v~~~~l~~g--~~~~~~~v~C~~Csl~i 129 (134)
.+...|..+| .||..+.+..+.+.-. ...+.+...|+.|-..|
T Consensus 194 sdqr~~~vpCPhCg~~~~l~~~~l~w~~~~~~~~a~y~C~~Cg~~i 239 (557)
T PF05876_consen 194 SDQRRYYVPCPHCGEEQVLEWENLKWDKGEAPETARYVCPHCGCEI 239 (557)
T ss_pred CCceEEEccCCCCCCCccccccceeecCCCCccceEEECCCCcCCC
Confidence 3445677889 8998888776655432 12345779999997654
No 147
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.16 E-value=9.1 Score=24.02 Aligned_cols=33 Identities=30% Similarity=0.817 Sum_probs=20.7
Q ss_pred ceeEEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 89 SQMFTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 89 ~~~~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
.+.|.|.| .||..|.|...-- +.....|+.|+-
T Consensus 8 MPtY~Y~c~~cg~~~dvvq~~~------ddplt~ce~c~a 41 (82)
T COG2331 8 MPTYSYECTECGNRFDVVQAMT------DDPLTTCEECGA 41 (82)
T ss_pred ccceEEeecccchHHHHHHhcc------cCccccChhhCh
Confidence 45788999 8997776543311 123467888864
No 148
>COG2879 Uncharacterized small protein [Function unknown]
Probab=30.57 E-value=94 Score=18.75 Aligned_cols=21 Identities=43% Similarity=0.559 Sum_probs=13.7
Q ss_pred HHHHHHHHhCCCCCHHHHHHH
Q psy7114 21 NYKSLILKYHPDKNSEHQEMF 41 (134)
Q Consensus 21 ayr~l~~~~HPDk~~~~~~~f 41 (134)
.|-.-.++.|||+.+-..+.|
T Consensus 27 nYVehmr~~hPd~p~mT~~EF 47 (65)
T COG2879 27 NYVEHMRKKHPDKPPMTYEEF 47 (65)
T ss_pred HHHHHHHHhCcCCCcccHHHH
Confidence 466668899999954333333
No 149
>PF06827 zf-FPG_IleRS: Zinc finger found in FPG and IleRS; InterPro: IPR010663 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger domain found at the C-terminal in both DNA glycosylase/AP lyase enzymes and in isoleucyl tRNA synthetase. In these two types of enzymes, the C-terminal domain forms a zinc finger. Some related proteins may not bind zinc. DNA glycosylase/AP lyase enzymes are involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. These enzymes have both DNA glycosylase activity (3.2.2 from EC) and AP lyase activity (4.2.99.18 from EC) []. Examples include formamidopyrimidine-DNA glycosylases (Fpg; MutM) and endonuclease VIII (Nei). Formamidopyrimidine-DNA glycosylases (Fpg, MutM) is a trifunctional DNA base excision repair enzyme that removes a wide range of oxidation-damaged bases (N-glycosylase activity; 3.2.2.23 from EC) and cleaves both the 3'- and 5'-phosphodiester bonds of the resulting apurinic/apyrimidinic site (AP lyase activity; 4.2.99.18 from EC). Fpg has a preference for oxidised purines, excising oxidized purine bases such as 7,8-dihydro-8-oxoguanine (8-oxoG). ITs AP (apurinic/apyrimidinic) lyase activity introduces nicks in the DNA strand, cleaving the DNA backbone by beta-delta elimination to generate a single-strand break at the site of the removed base with both 3'- and 5'-phosphates. Fpg is a monomer composed of 2 domains connected by a flexible hinge []. The two DNA-binding motifs (a zinc finger and the helix-two-turns-helix motifs) suggest that the oxidized base is flipped out from double-stranded DNA in the binding mode and excised by a catalytic mechanism similar to that of bifunctional base excision repair enzymes []. Fpg binds one ion of zinc at the C terminus, which contains four conserved and essential cysteines []. Endonuclease VIII (Nei) has the same enzyme activities as Fpg above, but with a preference for oxidized pyrimidines, such as thymine glycol, 5,6-dihydrouracil and 5,6-dihydrothymine [, ]. An Fpg-type zinc finger is also found at the C terminus of isoleucyl tRNA synthetase (6.1.1.5 from EC) [, ]. This enzyme catalyses the attachment of isoleucine to tRNA(Ile). As IleRS can inadvertently accommodate and process structurally similar amino acids such as valine, to avoid such errors it has two additional distinct tRNA(Ile)-dependent editing activities. One activity is designated as 'pre-transfer' editing and involves the hydrolysis of activated Val-AMP. The other activity is designated 'post-transfer' editing and involves deacylation of mischarged Val-tRNA(Ile) []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003824 catalytic activity; PDB: 1K82_C 1Q39_A 2OQ4_B 2OPF_A 1K3X_A 1K3W_A 1Q3B_A 2EA0_A 1Q3C_A 2XZF_A ....
Probab=30.11 E-value=32 Score=16.98 Aligned_cols=9 Identities=22% Similarity=0.641 Sum_probs=4.7
Q ss_pred eeEeCCCCc
Q psy7114 118 ILIACDTCS 126 (134)
Q Consensus 118 ~~v~C~~Cs 126 (134)
....|+.|.
T Consensus 20 ~~~~C~rCq 28 (30)
T PF06827_consen 20 STYLCPRCQ 28 (30)
T ss_dssp EEEE-TTTC
T ss_pred CCeECcCCc
Confidence 346777764
No 150
>KOG0527|consensus
Probab=29.98 E-value=1.1e+02 Score=24.57 Aligned_cols=42 Identities=17% Similarity=0.469 Sum_probs=33.2
Q ss_pred HHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHH
Q psy7114 20 ENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQ 63 (134)
Q Consensus 20 ~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~ 63 (134)
+..|+.+.+-.||. ...+..|+|-+-|+.|++.++|-.+|..
T Consensus 75 q~~RRkma~qnP~m--HNSEISK~LG~~WK~Lse~EKrPFi~EA 116 (331)
T KOG0527|consen 75 QGQRRKLAKQNPKM--HNSEISKRLGAEWKLLSEEEKRPFVDEA 116 (331)
T ss_pred HHHHHHHHHhCcch--hhHHHHHHHHHHHhhcCHhhhccHHHHH
Confidence 34567777778885 3356788999999999999999998854
No 151
>PF14353 CpXC: CpXC protein
Probab=29.81 E-value=61 Score=21.63 Aligned_cols=26 Identities=27% Similarity=0.560 Sum_probs=11.6
Q ss_pred EEeeC-CCCCeEEEecccccCCCCCCc
Q psy7114 92 FTYPC-RCGAEFCIEEQDTEGDGSDDN 117 (134)
Q Consensus 92 ~~~~c-rcg~~~~v~~~~l~~g~~~~~ 117 (134)
|.+.| .||..+.+.-.-++.....++
T Consensus 37 ~~~~CP~Cg~~~~~~~p~lY~D~~~~~ 63 (128)
T PF14353_consen 37 FSFTCPSCGHKFRLEYPLLYHDPEKKF 63 (128)
T ss_pred CEEECCCCCCceecCCCEEEEcCCCCE
Confidence 34445 555544444444444333333
No 152
>CHL00030 rpl23 ribosomal protein L23
Probab=29.76 E-value=54 Score=21.19 Aligned_cols=21 Identities=24% Similarity=0.267 Sum_probs=17.2
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7114 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ayr~l~ 26 (134)
++-|+++|+..+||+|..++.
T Consensus 23 ~F~V~~~anK~eIK~avE~lf 43 (93)
T CHL00030 23 TFDVDSGSTKTEIKHWIELFF 43 (93)
T ss_pred EEEECCCCCHHHHHHHHHHHh
Confidence 456889999999999987764
No 153
>KOG3960|consensus
Probab=29.75 E-value=35 Score=26.33 Aligned_cols=16 Identities=38% Similarity=0.563 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHhccC
Q psy7114 38 QEMFLKINEAWNILKD 53 (134)
Q Consensus 38 ~~~f~~i~~Ay~~L~d 53 (134)
..+.++||+|+++|..
T Consensus 127 RRRLkKVNEAFE~LKR 142 (284)
T KOG3960|consen 127 RRRLKKVNEAFETLKR 142 (284)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4679999999999864
No 154
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=29.28 E-value=62 Score=17.76 Aligned_cols=9 Identities=22% Similarity=0.796 Sum_probs=3.9
Q ss_pred EeCCCCcce
Q psy7114 120 IACDTCSLL 128 (134)
Q Consensus 120 v~C~~Csl~ 128 (134)
..|+.|...
T Consensus 21 ~vC~~Cg~~ 29 (52)
T smart00661 21 FVCRKCGYE 29 (52)
T ss_pred EECCcCCCe
Confidence 344444433
No 155
>PF14951 DUF4503: Domain of unknown function (DUF4503)
Probab=28.53 E-value=32 Score=27.86 Aligned_cols=29 Identities=28% Similarity=0.670 Sum_probs=16.9
Q ss_pred EeeC-CCC-CeEEEecccccCCCCCCceeEeCCCCcceE
Q psy7114 93 TYPC-RCG-AEFCIEEQDTEGDGSDDNILIACDTCSLLL 129 (134)
Q Consensus 93 ~~~c-rcg-~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i 129 (134)
|-.| ||| +..+...++-.. ..|..|+..|
T Consensus 274 WPvCd~CGn~rLe~~pe~rg~--------~~C~~Cs~~V 304 (389)
T PF14951_consen 274 WPVCDRCGNGRLEQSPEDRGA--------FSCGDCSRVV 304 (389)
T ss_pred CccccccCCccceeCccCCCc--------eeccchhhhc
Confidence 4458 999 566655555333 4566665543
No 156
>PF15178 TOM_sub5: Mitochondrial import receptor subunit TOM5 homolog
Probab=28.32 E-value=1.2e+02 Score=17.19 Aligned_cols=24 Identities=21% Similarity=0.306 Sum_probs=19.1
Q ss_pred cccccCCCCCCHHHHHHHHHHHHH
Q psy7114 4 FEILNCNVNSTLEEIKENYKSLIL 27 (134)
Q Consensus 4 Y~vLgv~~~as~~~Ik~ayr~l~~ 27 (134)
|.+=|+.|..+++|.|+.-|+-.+
T Consensus 2 ~~~egl~pk~DPeE~k~kmR~dvi 25 (51)
T PF15178_consen 2 FRIEGLGPKMDPEEMKRKMREDVI 25 (51)
T ss_pred cccccCCCCCCHHHHHHHHHHHHH
Confidence 456789999999999998876433
No 157
>TIGR03164 UHCUDC OHCU decarboxylase. Previously thought to only proceed spontaneously, the decarboxylation of 2-oxo-4-hydroxy-4-carboxy--5-ureidoimidazoline (OHCU) has been recently been shown to be catalyzed by this enzyme in Mus musculus. Homologs of this enzyme are found adjacent to and fused with uricase in a number of prokaryotes and are represented by this model.
Probab=28.21 E-value=2.2e+02 Score=20.06 Aligned_cols=24 Identities=17% Similarity=0.407 Sum_probs=14.9
Q ss_pred CCCCCHHHHHHHHHHH-----------HHHhCCCC
Q psy7114 10 NVNSTLEEIKENYKSL-----------ILKYHPDK 33 (134)
Q Consensus 10 ~~~as~~~Ik~ayr~l-----------~~~~HPDk 33 (134)
.|-+|.+.+..+.... ++..|||-
T Consensus 28 rPf~s~~~L~~a~~~~~~~~~~~~~~~ll~~HP~L 62 (157)
T TIGR03164 28 RPFDSIEDLHAAMVGAVRAASPEQQLALIRAHPDL 62 (157)
T ss_pred CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHhCCcc
Confidence 3456666666655433 56678887
No 158
>PF08996 zf-DNA_Pol: DNA Polymerase alpha zinc finger; InterPro: IPR015088 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The DNA Polymerase alpha zinc finger domain adopts an alpha-helix-like structure, followed by three turns, all of which involve proline. The resulting motif is a helix-turn-helix motif, in contrast to other zinc finger domains, which show anti-parallel sheet and helix conformation. Zinc binding occurs due to the presence of four cysteine residues positioned to bind the metal centre in a tetrahedral coordination geometry. The function of this domain is uncertain: it has been proposed that the zinc finger motif may be an essential part of the DNA binding domain []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0001882 nucleoside binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication; PDB: 3FLO_D 1N5G_A 1K0P_A 1K18_A.
Probab=28.18 E-value=38 Score=24.51 Aligned_cols=40 Identities=25% Similarity=0.452 Sum_probs=18.6
Q ss_pred eEEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCcceEE
Q psy7114 91 MFTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLLLE 130 (134)
Q Consensus 91 ~~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~ 130 (134)
.+...| .||..+.+..---..+.........|+.|+-.+.
T Consensus 16 ~l~~~C~~C~~~~~f~g~~~~~~~~~~~~~~~C~~C~~~~~ 56 (188)
T PF08996_consen 16 PLKLTCPSCGTEFEFPGVFEEDGDDVSPSGLQCPNCSTPLS 56 (188)
T ss_dssp -EEEE-TTT--EEEE-SSS--SSEEEETTEEEETTT--B--
T ss_pred ceEeECCCCCCCccccccccCCccccccCcCcCCCCCCcCC
Confidence 467888 8998888766433222222234699999987543
No 159
>PTZ00199 high mobility group protein; Provisional
Probab=27.98 E-value=1.6e+02 Score=18.59 Aligned_cols=40 Identities=20% Similarity=0.359 Sum_probs=26.8
Q ss_pred HHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHH
Q psy7114 23 KSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYES 62 (134)
Q Consensus 23 r~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~ 62 (134)
|.....-||+......+..+.|.+.|..|++.++...++.
T Consensus 38 R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~ 77 (94)
T PTZ00199 38 RAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKK 77 (94)
T ss_pred HHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 4445567898643335567889999999997665544443
No 160
>COG5134 Uncharacterized conserved protein [Function unknown]
Probab=27.95 E-value=32 Score=25.97 Aligned_cols=13 Identities=31% Similarity=0.795 Sum_probs=10.9
Q ss_pred EeCCCCcceEEEe
Q psy7114 120 IACDTCSLLLEIT 132 (134)
Q Consensus 120 v~C~~Csl~i~v~ 132 (134)
+.|+.|+-.|.|-
T Consensus 80 I~C~~C~n~i~~R 92 (272)
T COG5134 80 IKCHLCSNPIDVR 92 (272)
T ss_pred EEccCCCCceeee
Confidence 8999999888763
No 161
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=27.60 E-value=64 Score=20.07 Aligned_cols=21 Identities=19% Similarity=0.248 Sum_probs=17.5
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7114 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ayr~l~ 26 (134)
++-|+++|+..+||+|..++.
T Consensus 18 ~F~V~~~anK~eIK~avE~lf 38 (77)
T TIGR03636 18 TFIVDRKATKGDIKRAVEKLF 38 (77)
T ss_pred EEEECCCCCHHHHHHHHHHHh
Confidence 456889999999999988763
No 162
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=27.40 E-value=74 Score=17.30 Aligned_cols=15 Identities=27% Similarity=0.643 Sum_probs=12.8
Q ss_pred ceeEeCCCCcceEEE
Q psy7114 117 NILIACDTCSLLLEI 131 (134)
Q Consensus 117 ~~~v~C~~Csl~i~v 131 (134)
..+|.|++|..|++.
T Consensus 12 ~~~i~C~~C~~~~H~ 26 (51)
T PF00628_consen 12 GDMIQCDSCNRWYHQ 26 (51)
T ss_dssp SSEEEBSTTSCEEET
T ss_pred CCeEEcCCCChhhCc
Confidence 467999999999885
No 163
>PRK10445 endonuclease VIII; Provisional
Probab=27.34 E-value=62 Score=24.75 Aligned_cols=26 Identities=23% Similarity=0.533 Sum_probs=16.1
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCS 126 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs 126 (134)
.+| +||+...-..- .| -....|+.|.
T Consensus 236 ~~Cp~Cg~~I~~~~~---~g----R~t~~CP~CQ 262 (263)
T PRK10445 236 EACERCGGIIEKTTL---SS----RPFYWCPGCQ 262 (263)
T ss_pred CCCCCCCCEeEEEEE---CC----CCcEECCCCc
Confidence 578 99976543322 12 3457899994
No 164
>PF07227 DUF1423: Protein of unknown function (DUF1423); InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=27.12 E-value=46 Score=27.71 Aligned_cols=15 Identities=27% Similarity=0.609 Sum_probs=13.2
Q ss_pred ceeEeCCCCcceEEE
Q psy7114 117 NILIACDTCSLLLEI 131 (134)
Q Consensus 117 ~~~v~C~~Csl~i~v 131 (134)
..++.|+.|++|-++
T Consensus 144 ~~Wi~Cd~CgH~cH~ 158 (446)
T PF07227_consen 144 CSWIGCDVCGHWCHL 158 (446)
T ss_pred eeEEeccCCCceehh
Confidence 678999999999875
No 165
>COG5525 Bacteriophage tail assembly protein [General function prediction only]
Probab=27.09 E-value=61 Score=28.03 Aligned_cols=42 Identities=24% Similarity=0.622 Sum_probs=28.5
Q ss_pred ceeEEeeC-CCCCeEEEecccccCCCCCC-----ceeEeCCCCcceEE
Q psy7114 89 SQMFTYPC-RCGAEFCIEEQDTEGDGSDD-----NILIACDTCSLLLE 130 (134)
Q Consensus 89 ~~~~~~~c-rcg~~~~v~~~~l~~g~~~~-----~~~v~C~~Csl~i~ 130 (134)
...|.-+| .||..+....++...-.... ...+.|+.|...|+
T Consensus 223 ~rr~yvpCPHCGe~q~l~~~e~~~~~g~~~~~~~~~~~~c~h~~~~i~ 270 (611)
T COG5525 223 QRRFYVPCPHCGEEQQLKFGEKSGPRGLKDTPAEAAFIQCEHCGCVIR 270 (611)
T ss_pred ceeEEeeCCCCCchhhccccccCCCcCcccchhhhhhhhccccCceee
Confidence 35677788 99988887776554421111 23579999999886
No 166
>cd03031 GRX_GRX_like Glutaredoxin (GRX) family, GRX-like domain containing protein subfamily; composed of uncharacterized eukaryotic proteins containing a GRX-like domain having only one conserved cysteine, aligning to the C-terminal cysteine of the CXXC motif of GRXs. This subfamily is predominantly composed of plant proteins. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins via a redox active CXXC motif using a similar dithiol mechanism employed by TRXs. GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. Proteins containing only the C-terminal cysteine are generally redox inactive.
Probab=25.51 E-value=99 Score=21.59 Aligned_cols=31 Identities=16% Similarity=0.519 Sum_probs=19.7
Q ss_pred EEeeC-CCCCeEEEecccccCCCCCCceeEeCCCCc
Q psy7114 92 FTYPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCS 126 (134)
Q Consensus 92 ~~~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Cs 126 (134)
.+-+| .|++.-.+..++-. .....+.|+.|.
T Consensus 109 rfv~C~~C~Gs~k~~~~~~~----~~~~~~rC~~Cn 140 (147)
T cd03031 109 RFVPCSECNGSCKVFAENAT----AAGGFLRCPECN 140 (147)
T ss_pred CeEECCCCCCcceEEeccCc----ccccEEECCCCC
Confidence 45677 78776666554422 124568999996
No 167
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=24.96 E-value=1e+02 Score=15.98 Aligned_cols=28 Identities=18% Similarity=0.667 Sum_probs=11.4
Q ss_pred C-CCCCeEEEecccccCCCCCCceeEeCCCCcce
Q psy7114 96 C-RCGAEFCIEEQDTEGDGSDDNILIACDTCSLL 128 (134)
Q Consensus 96 c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl~ 128 (134)
| .||+......-+-+. ..-..|+.|..+
T Consensus 3 C~~CG~~l~~~ip~gd~-----r~R~vC~~Cg~I 31 (34)
T PF14803_consen 3 CPQCGGPLERRIPEGDD-----RERLVCPACGFI 31 (34)
T ss_dssp -TTT--B-EEE--TT-S-----S-EEEETTTTEE
T ss_pred cccccChhhhhcCCCCC-----ccceECCCCCCE
Confidence 5 788654433221111 233689999764
No 168
>PRK03564 formate dehydrogenase accessory protein FdhE; Provisional
Probab=24.79 E-value=89 Score=24.76 Aligned_cols=17 Identities=24% Similarity=0.409 Sum_probs=14.0
Q ss_pred ceeEeCCCCcceEEEec
Q psy7114 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~Csl~i~v~~ 133 (134)
.-+-.|.+|.-++++..
T Consensus 250 ~r~e~C~~C~~YlK~~~ 266 (309)
T PRK03564 250 VKAESCGDCGTYLKILY 266 (309)
T ss_pred eEeeecccccccceecc
Confidence 45689999999999864
No 169
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=24.63 E-value=53 Score=21.87 Aligned_cols=23 Identities=30% Similarity=0.661 Sum_probs=15.5
Q ss_pred eeC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 94 YPC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 94 ~~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
..| .||..|.+... ...||.|.-
T Consensus 71 ~~C~~Cg~~~~~~~~-----------~~~CP~Cgs 94 (115)
T TIGR00100 71 CECEDCSEEVSPEID-----------LYRCPKCHG 94 (115)
T ss_pred EEcccCCCEEecCCc-----------CccCcCCcC
Confidence 568 78877776543 146888875
No 170
>PRK05738 rplW 50S ribosomal protein L23; Reviewed
Probab=23.96 E-value=79 Score=20.23 Aligned_cols=21 Identities=24% Similarity=0.363 Sum_probs=16.8
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7114 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ayr~l~ 26 (134)
++-|++.|+..|||++..++-
T Consensus 24 ~F~V~~~a~K~eIK~aie~lf 44 (92)
T PRK05738 24 VFEVAPDATKPEIKAAVEKLF 44 (92)
T ss_pred EEEECCCCCHHHHHHHHHHHc
Confidence 356888999999999887763
No 171
>PF04967 HTH_10: HTH DNA binding domain; InterPro: IPR007050 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. This entry represents the HTH DNA binding domain found in Halobacterium salinarium (Halobacterium halobium) and described as a putative bacterio-opsin activator.
Probab=23.74 E-value=32 Score=19.84 Aligned_cols=21 Identities=19% Similarity=0.284 Sum_probs=12.8
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7114 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ayr~l~ 26 (134)
.|||++.+-...||++-+++.
T Consensus 32 ~lgis~st~~~~LRrae~kli 52 (53)
T PF04967_consen 32 ELGISKSTVSEHLRRAERKLI 52 (53)
T ss_pred HhCCCHHHHHHHHHHHHHHHh
Confidence 466666655666666666653
No 172
>PF14616 DUF4451: Domain of unknown function (DUF4451)
Probab=23.51 E-value=54 Score=22.21 Aligned_cols=16 Identities=25% Similarity=0.490 Sum_probs=13.6
Q ss_pred eeEeCCCCcceEEEec
Q psy7114 118 ILIACDTCSLLLEITA 133 (134)
Q Consensus 118 ~~v~C~~Csl~i~v~~ 133 (134)
..+.|+.|.-||.|..
T Consensus 88 ~~g~C~~C~~wI~l~~ 103 (124)
T PF14616_consen 88 IEGLCPHCNEWIELGC 103 (124)
T ss_pred EEEECCchhheeeeee
Confidence 4599999999999864
No 173
>PF03820 Mtc: Tricarboxylate carrier; InterPro: IPR004686 The MTC family consists of a limited number of homologues, all from eukaryotes. One member of the family has been functionally characterised as a tricarboxylate carrier from rat liver mitochondria. The rat liver mitochondrial tricarboxylate carrier has been reported to transport citrate, cis-aconitate, threo-D-isocitrate, D- and L-tartrate, malate, succinate and phosphoenolpyruvate. It presumably functions by a proton symport mechanism. The rest of the characterised proteins appear to be sideroflexins involved in iron transport.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016020 membrane
Probab=23.39 E-value=1.7e+02 Score=23.17 Aligned_cols=23 Identities=26% Similarity=0.258 Sum_probs=20.0
Q ss_pred CCCCHHHHHHHHHHHHHHhCCCC
Q psy7114 11 VNSTLEEIKENYKSLILKYHPDK 33 (134)
Q Consensus 11 ~~as~~~Ik~ayr~l~~~~HPDk 33 (134)
++.+.+++-+|.+..--.+|||.
T Consensus 46 ~~~~~~~lw~Ak~l~~Sa~HPDT 68 (308)
T PF03820_consen 46 PGLTDDELWKAKKLYDSAFHPDT 68 (308)
T ss_pred CCCCHHHHHHHHHHhhcccCCCC
Confidence 34588999999999999999997
No 174
>PRK02935 hypothetical protein; Provisional
Probab=23.08 E-value=43 Score=22.34 Aligned_cols=17 Identities=12% Similarity=0.485 Sum_probs=10.9
Q ss_pred ceeEeCCCCcceEEEec
Q psy7114 117 NILIACDTCSLLLEITA 133 (134)
Q Consensus 117 ~~~v~C~~Csl~i~v~~ 133 (134)
.+.|.||+|.-...+++
T Consensus 68 avqV~CP~C~K~TKmLG 84 (110)
T PRK02935 68 AVQVICPSCEKPTKMLG 84 (110)
T ss_pred ceeeECCCCCchhhhcc
Confidence 35578888876655543
No 175
>KOG2899|consensus
Probab=22.72 E-value=61 Score=25.18 Aligned_cols=35 Identities=11% Similarity=0.147 Sum_probs=29.5
Q ss_pred CeEEEecccccCCCCCCceeEeCCCCcceEEEecC
Q psy7114 100 AEFCIEEQDTEGDGSDDNILIACDTCSLLLEITAR 134 (134)
Q Consensus 100 ~~~~v~~~~l~~g~~~~~~~v~C~~Csl~i~v~~~ 134 (134)
..|+.+.+|+......++.++.|-+=+-||+++++
T Consensus 150 ~n~vle~~dfl~~~~~~fDiIlcLSiTkWIHLNwg 184 (288)
T KOG2899|consen 150 ENYVLESDDFLDMIQPEFDIILCLSITKWIHLNWG 184 (288)
T ss_pred ccEEEecchhhhhccccccEEEEEEeeeeEecccc
Confidence 67888888777766778899999999999999874
No 176
>PF10475 DUF2450: Protein of unknown function N-terminal domain (DUF2450) ; InterPro: IPR019515 This entry represents Vacuolar protein sorting-associated protein 54, and is thought to be involved in retrograde transport from early and late endosomes to late Golgi found in eukaryotes, but its function is not known.
Probab=22.46 E-value=1.3e+02 Score=23.19 Aligned_cols=33 Identities=21% Similarity=0.562 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChH
Q psy7114 16 EEIKENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEK 55 (134)
Q Consensus 16 ~~Ik~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~ 55 (134)
+.+...+.+++..+.|+ .|..|.+||..|++..
T Consensus 182 ~~ld~~l~~~~~~Fd~~-------~Y~~v~~AY~lLgk~~ 214 (291)
T PF10475_consen 182 EQLDSDLSKVCQDFDPD-------KYSKVQEAYQLLGKTQ 214 (291)
T ss_pred HHHHHHHHHHHHhCCHH-------HHHHHHHHHHHHhhhH
Confidence 34556666777766665 7889999999999753
No 177
>PF11023 DUF2614: Protein of unknown function (DUF2614); InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=22.29 E-value=34 Score=23.02 Aligned_cols=15 Identities=13% Similarity=0.561 Sum_probs=9.6
Q ss_pred eeEeCCCCcceEEEe
Q psy7114 118 ILIACDTCSLLLEIT 132 (134)
Q Consensus 118 ~~v~C~~Csl~i~v~ 132 (134)
..|.||+|.-...+.
T Consensus 68 v~V~CP~C~K~TKmL 82 (114)
T PF11023_consen 68 VQVECPNCGKQTKML 82 (114)
T ss_pred eeeECCCCCChHhhh
Confidence 457788887655443
No 178
>PF08447 PAS_3: PAS fold; InterPro: IPR013655 The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs []. The PAS fold appears in archaea, eubacteria and eukarya. The PAS domain contains a sensory box, or S-box domain that occupies the central portion of the PAS domain but is more widely distributed. It is often tandemly repeated. Known prosthetic groups bound in the S-box domain include haem in the oxygen sensor FixL [], FAD in the redox potential sensor NifL [], and a 4-hydroxycinnamyl chromophore in photoactive yellow protein []. Proteins containing the domain often contain other regulatory domains such as response regulator or sensor histidine kinase domains. Other S-box proteins include phytochromes and the aryl hydrocarbon receptor nuclear translocator. This domain has been found in the gene product of the madA gene of the filamentous zygomycete fungus Phycomyces blakesleeanus. It has been shown that MadA encodes a blue-light photoreceptor for phototropism and other light responses. The gene is involved in the phototropic responses associated with sporangiophore growth; they exhibit phototropism by bending toward near-UV and blue wavelengths and away from far-UV wavelengths in a manner that is physiologically similar to plant phototropic responses [].; GO: 0005515 protein binding; PDB: 3NJA_D 3H9W_A 3GDI_B 3ICY_A 3EEH_A 3MR0_B.
Probab=21.77 E-value=45 Score=19.88 Aligned_cols=28 Identities=36% Similarity=0.716 Sum_probs=18.6
Q ss_pred CccccccCCCCCCHHHH-HHHHHHHHHHhCCCC
Q psy7114 2 NHFEILNCNVNSTLEEI-KENYKSLILKYHPDK 33 (134)
Q Consensus 2 ~~Y~vLgv~~~as~~~I-k~ayr~l~~~~HPDk 33 (134)
+++++||.++ +++ ......+....|||=
T Consensus 7 ~~~~i~G~~~----~~~~~~~~~~~~~~ihpdD 35 (91)
T PF08447_consen 7 NFYEIFGYSP----EEIGKPDFEEWLERIHPDD 35 (91)
T ss_dssp HHHHHHTS-H----HHHTCBEHHHHHHHB-TTT
T ss_pred HHHHHhCCCH----HHhccCCHHHHHhhcCHHH
Confidence 4678888854 566 556667778899993
No 179
>PF05191 ADK_lid: Adenylate kinase, active site lid; InterPro: IPR007862 Adenylate kinases (ADK; 2.7.4.3 from EC) are phosphotransferases that catalyse the Mg-dependent reversible conversion of ATP and AMP to two molecules of ADP, an essential reaction for many processes in living cells. In large variants of adenylate kinase, the AMP and ATP substrates are buried in a domain that undergoes conformational changes from an open to a closed state when bound to substrate; the ligand is then contained within a highly specific environment required for catalysis. Adenylate kinase is a 3-domain protein consisting of a large central CORE domain flanked by a LID domain on one side and the AMP-binding NMPbind domain on the other []. The LID domain binds ATP and covers the phosphates at the active site. The substrates first bind the CORE domain, followed by closure of the active site by the LID and NMPbind domains. Comparisons of adenylate kinases have revealed a particular divergence in the active site lid. In some organisms, particularly the Gram-positive bacteria, residues in the lid domain have been mutated to cysteines and these cysteine residues (two CX(n)C motifs) are responsible for the binding of a zinc ion. The bound zinc ion in the lid domain is clearly structurally homologous to Zinc-finger domains. However, it is unclear whether the adenylate kinase lid is a novel zinc-finger DNA/RNA binding domain, or that the lid bound zinc serves a purely structural function [].; GO: 0004017 adenylate kinase activity; PDB: 3BE4_A 2OSB_B 2ORI_A 2EU8_A 3DL0_A 1P3J_A 2QAJ_A 2OO7_A 2P3S_A 3DKV_A ....
Probab=21.65 E-value=47 Score=17.50 Aligned_cols=26 Identities=31% Similarity=0.698 Sum_probs=13.9
Q ss_pred eC-CCCCeEEEecccccCCCCCCceeEeCCCCcc
Q psy7114 95 PC-RCGAEFCIEEQDTEGDGSDDNILIACDTCSL 127 (134)
Q Consensus 95 ~c-rcg~~~~v~~~~l~~g~~~~~~~v~C~~Csl 127 (134)
.| .||..|.+...--... ..|+.|--
T Consensus 3 ~C~~Cg~~Yh~~~~pP~~~-------~~Cd~cg~ 29 (36)
T PF05191_consen 3 ICPKCGRIYHIEFNPPKVE-------GVCDNCGG 29 (36)
T ss_dssp EETTTTEEEETTTB--SST-------TBCTTTTE
T ss_pred CcCCCCCccccccCCCCCC-------CccCCCCC
Confidence 35 6787777655532222 36766643
No 180
>PF08209 Sgf11: Sgf11 (transcriptional regulation protein); InterPro: IPR013246 The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae (Baker's yeast). The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation [].; PDB: 3M99_B 2LO2_A 3MHH_C 3MHS_C.
Probab=21.61 E-value=1.1e+02 Score=15.89 Aligned_cols=14 Identities=14% Similarity=0.482 Sum_probs=9.3
Q ss_pred eeEeCCCCcceEEE
Q psy7114 118 ILIACDTCSLLLEI 131 (134)
Q Consensus 118 ~~v~C~~Csl~i~v 131 (134)
..+.|+.|.-.|.-
T Consensus 3 ~~~~C~nC~R~v~a 16 (33)
T PF08209_consen 3 PYVECPNCGRPVAA 16 (33)
T ss_dssp -EEE-TTTSSEEEG
T ss_pred CeEECCCCcCCcch
Confidence 46899999887753
No 181
>smart00778 Prim_Zn_Ribbon Zinc-binding domain of primase-helicase. This region represents the zinc binding domain. It is found in the N-terminal region of the bacteriophage P4 alpha protein, which is a multifunctional protein with origin recognition, helicase and primase activities.
Probab=21.51 E-value=85 Score=16.69 Aligned_cols=12 Identities=42% Similarity=1.353 Sum_probs=6.0
Q ss_pred eC-CCC--CeEEEec
Q psy7114 95 PC-RCG--AEFCIEE 106 (134)
Q Consensus 95 ~c-rcg--~~~~v~~ 106 (134)
+| .|| +.|.++.
T Consensus 5 pCP~CGG~DrFr~~d 19 (37)
T smart00778 5 PCPNCGGSDRFRFDD 19 (37)
T ss_pred CCCCCCCcccccccc
Confidence 45 566 3555433
No 182
>PF11126 Phage_DsbA: Transcriptional regulator DsbA; InterPro: IPR020313 DsbA is a double stranded binding protein found in bacteriophage T4 which is involved in transcriptional regulation. DsbA, along with other viral proteins, interacts with the host RNA polymerase core enzyme enabling initiation of transcription. DsbA acts as an enhancer protein of late genes in vitro. The protein consists of mainly alpha helices [].
Probab=21.47 E-value=1.6e+02 Score=18.08 Aligned_cols=31 Identities=16% Similarity=0.338 Sum_probs=18.8
Q ss_pred HHHHHHHHhCCCCCHHHHHHHHHHHHHHHhc
Q psy7114 21 NYKSLILKYHPDKNSEHQEMFLKINEAWNIL 51 (134)
Q Consensus 21 ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L 51 (134)
-|.++...||-+-...-++.-.++.+.|+.+
T Consensus 36 ~Fnkl~~lyHk~~Re~fE~e~ee~~elYD~~ 66 (69)
T PF11126_consen 36 MFNKLLKLYHKQEREEFEAENEEVVELYDAV 66 (69)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 4677888888776444444444565555543
No 183
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=21.39 E-value=2e+02 Score=17.18 Aligned_cols=43 Identities=14% Similarity=0.187 Sum_probs=30.2
Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHhccChHHHHHHHHH
Q psy7114 19 KENYKSLILKYHPDKNSEHQEMFLKINEAWNILKDEKERKLYESQ 63 (134)
Q Consensus 19 k~ayr~l~~~~HPDk~~~~~~~f~~i~~Ay~~L~d~~~R~~Yd~~ 63 (134)
.+.++...+.-||+.. ..+..+.|.+.|..|++..+....+..
T Consensus 13 ~~~~r~~~~~~~p~~~--~~eisk~~g~~Wk~ls~eeK~~y~~~A 55 (77)
T cd01389 13 RQDKHAQLKTENPGLT--NNEISRIIGRMWRSESPEVKAYYKELA 55 (77)
T ss_pred HHHHHHHHHHHCCCCC--HHHHHHHHHHHHhhCCHHHHHHHHHHH
Confidence 3456677777899863 346677899999999977665444443
No 184
>PF14606 Lipase_GDSL_3: GDSL-like Lipase/Acylhydrolase family; PDB: 3SKV_B.
Probab=21.16 E-value=91 Score=22.63 Aligned_cols=28 Identities=36% Similarity=0.621 Sum_probs=18.2
Q ss_pred cccCCCCCCHHHHHHHH---HHHHHHhCCCC
Q psy7114 6 ILNCNVNSTLEEIKENY---KSLILKYHPDK 33 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ay---r~l~~~~HPDk 33 (134)
+|.+.++.+.++++++. -+..+.-||++
T Consensus 64 ~ld~~~N~~~~~~~~~~~~fv~~iR~~hP~t 94 (178)
T PF14606_consen 64 VLDCGPNMSPEEFRERLDGFVKTIREAHPDT 94 (178)
T ss_dssp EEEESHHCCTTTHHHHHHHHHHHHHTT-SSS
T ss_pred EEEeecCCCHHHHHHHHHHHHHHHHHhCCCC
Confidence 44555556777777654 55677789999
No 185
>PF01096 TFIIS_C: Transcription factor S-II (TFIIS); InterPro: IPR001222 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIs (TFIIS). In eukaryotes the initiation of transcription of protein encoding genes by polymerase II (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least eight different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, -IIH and -IIS []. During mRNA elongation, Pol II can encounter DNA sequences that cause reverse movement of the enzyme. Such backtracking involves extrusion of the RNA 3'-end into the pore, and can lead to transcriptional arrest. Escape from arrest requires cleavage of the extruded RNA with the help of TFIIS, which induces mRNA cleavage by enhancing the intrinsic nuclease activity of RNA polymerase (Pol) II, past template-encoded pause sites []. TFIIS extends from the polymerase surface via a pore to the internal active site. Two essential and invariant acidic residues in a TFIIS loop complement the Pol II active site and could position a metal ion and a water molecule for hydrolytic RNA cleavage. TFIIS also induces extensive structural changes in Pol II that would realign nucleic acids in the active centre. TFIIS is a protein of about 300 amino acids. It contains three regions: a variable N-terminal domain not required for TFIIS activity; a conserved central domain required for Pol II binding; and a conserved C-terminal C4-type zinc finger essential for RNA cleavage. The zinc finger folds in a conformation termed a zinc ribbon [] characterised by a three-stranded antiparallel beta-sheet and two beta-hairpins. A backbone model for Pol II-TFIIS complex was obtained from X-ray analysis. It shows that a beta hairpin protrudes from the zinc finger and complements the pol II active site []. Some viral proteins also contain the TFIIS zinc ribbon C-terminal domain. The Vaccinia virus protein, unlike its eukaryotic homologue, is an integral RNA polymerase subunit rather than a readily separable transcription factor []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding, 0006351 transcription, DNA-dependent; PDB: 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I 3I4M_I ....
Probab=21.06 E-value=1.5e+02 Score=15.65 Aligned_cols=10 Identities=20% Similarity=0.766 Sum_probs=5.8
Q ss_pred eeEeCCCCcc
Q psy7114 118 ILIACDTCSL 127 (134)
Q Consensus 118 ~~v~C~~Csl 127 (134)
....|..|..
T Consensus 27 ~fy~C~~C~~ 36 (39)
T PF01096_consen 27 LFYVCCNCGH 36 (39)
T ss_dssp EEEEESSSTE
T ss_pred EEEEeCCCCC
Confidence 3466666654
No 186
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=21.03 E-value=76 Score=17.74 Aligned_cols=20 Identities=20% Similarity=0.316 Sum_probs=15.7
Q ss_pred cccCCCCCCHHHHHHHHHHH
Q psy7114 6 ILNCNVNSTLEEIKENYKSL 25 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ayr~l 25 (134)
|=||+++.+.++|+..+.+.
T Consensus 3 v~nlp~~~t~~~l~~~f~~~ 22 (70)
T PF00076_consen 3 VGNLPPDVTEEELRDFFSQF 22 (70)
T ss_dssp EESETTTSSHHHHHHHHHTT
T ss_pred EcCCCCcCCHHHHHHHHHHh
Confidence 34788889999999887764
No 187
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=21.01 E-value=1e+02 Score=19.55 Aligned_cols=21 Identities=19% Similarity=0.246 Sum_probs=17.3
Q ss_pred cccCCCCCCHHHHHHHHHHHH
Q psy7114 6 ILNCNVNSTLEEIKENYKSLI 26 (134)
Q Consensus 6 vLgv~~~as~~~Ik~ayr~l~ 26 (134)
.+-|++.|+..+||+|..++.
T Consensus 25 ~F~V~~~anK~eIK~AvE~lf 45 (84)
T PRK14548 25 TFIVDRRATKPDIKRAVEELF 45 (84)
T ss_pred EEEECCCCCHHHHHHHHHHHh
Confidence 356889999999999988763
No 188
>KOG2907|consensus
Probab=20.82 E-value=70 Score=21.58 Aligned_cols=40 Identities=15% Similarity=0.519 Sum_probs=21.6
Q ss_pred ceeEEeeC-CCCC-eEEEecccccCCCCCCceeEeCCCCcce
Q psy7114 89 SQMFTYPC-RCGA-EFCIEEQDTEGDGSDDNILIACDTCSLL 128 (134)
Q Consensus 89 ~~~~~~~c-rcg~-~~~v~~~~l~~g~~~~~~~v~C~~Csl~ 128 (134)
+......| +||. .+...-..+.+.-....+...|+.|...
T Consensus 70 ga~I~~kCpkCghe~m~Y~T~QlRSADEGQTVFYTC~kC~~k 111 (116)
T KOG2907|consen 70 GAVIKHKCPKCGHEEMSYHTLQLRSADEGQTVFYTCPKCKYK 111 (116)
T ss_pred ccchhccCcccCCchhhhhhhhcccccCCceEEEEcCcccee
Confidence 34556777 8883 3333333333322224566888888754
No 189
>COG3755 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.44 E-value=1.9e+02 Score=19.83 Aligned_cols=38 Identities=21% Similarity=0.448 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHHHHhC-CCCCHHHHHHHHHHHHHHHhccChHH
Q psy7114 15 LEEIKENYKSLILKYH-PDKNSEHQEMFLKINEAWNILKDEKE 56 (134)
Q Consensus 15 ~~~Ik~ayr~l~~~~H-PDk~~~~~~~f~~i~~Ay~~L~d~~~ 56 (134)
..++.++|+.+....+ |++.. ..++-+.||=...|..-
T Consensus 50 Da~LN~AY~~ll~~l~~~~~~~----aL~kaQRAWi~fRDadC 88 (127)
T COG3755 50 DAELNKAYKALLKRLQDSPRTK----ALQKAQRAWIAFRDADC 88 (127)
T ss_pred HHHHHHHHHHHHHHhccChHHH----HHHHHHHHHHHHhhHhH
Confidence 4578899999988888 66422 46677888877776654
Done!