Query psy8325
Match_columns 237
No_of_seqs 236 out of 1461
Neff 6.2
Searched_HMMs 46136
Date Sat Aug 17 00:49:31 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8325.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8325hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK01773 hscB co-chaperone Hsc 100.0 9.4E-46 2E-50 311.7 21.0 165 67-231 2-171 (173)
2 PRK03578 hscB co-chaperone Hsc 100.0 9E-45 2E-49 306.6 21.7 166 66-231 5-175 (176)
3 PRK05014 hscB co-chaperone Hsc 100.0 1.4E-43 3.1E-48 298.1 19.8 165 67-231 1-170 (171)
4 PRK01356 hscB co-chaperone Hsc 100.0 3.5E-43 7.5E-48 294.5 20.9 160 67-228 2-163 (166)
5 PRK00294 hscB co-chaperone Hsc 100.0 1.8E-42 4E-47 291.7 19.8 166 66-231 3-172 (173)
6 TIGR00714 hscB Fe-S protein as 100.0 5.5E-39 1.2E-43 266.9 20.1 152 79-230 1-156 (157)
7 KOG3192|consensus 100.0 1.1E-35 2.4E-40 243.1 15.7 166 61-226 2-167 (168)
8 COG1076 DjlA DnaJ-domain-conta 99.9 2.4E-22 5.2E-27 169.4 7.5 166 67-232 1-172 (174)
9 PF07743 HSCB_C: HSCB C-termin 99.8 7.6E-18 1.7E-22 124.1 10.6 74 152-225 2-78 (78)
10 COG0484 DnaJ DnaJ-class molecu 99.7 1.5E-17 3.2E-22 154.2 7.2 71 66-145 3-73 (371)
11 PF00226 DnaJ: DnaJ domain; I 99.7 1.8E-16 3.9E-21 111.8 6.3 64 68-137 1-64 (64)
12 KOG0713|consensus 99.6 2.3E-16 5E-21 143.6 7.4 72 61-139 10-81 (336)
13 PRK14288 chaperone protein Dna 99.6 2.6E-15 5.6E-20 140.1 7.6 65 67-138 3-67 (369)
14 KOG0718|consensus 99.6 7.1E-15 1.5E-19 138.4 8.9 68 67-138 9-76 (546)
15 smart00271 DnaJ DnaJ molecular 99.5 1.7E-14 3.7E-19 100.1 7.5 60 67-132 1-60 (60)
16 PRK14296 chaperone protein Dna 99.5 8.7E-15 1.9E-19 136.8 7.2 64 67-138 4-67 (372)
17 PRK14286 chaperone protein Dna 99.5 1.1E-14 2.4E-19 136.0 7.5 65 67-138 4-68 (372)
18 PRK14282 chaperone protein Dna 99.5 1.9E-14 4.1E-19 134.2 7.8 67 66-138 3-69 (369)
19 PRK14279 chaperone protein Dna 99.5 1.8E-14 3.9E-19 135.5 7.5 65 67-138 9-73 (392)
20 PRK14295 chaperone protein Dna 99.5 1.9E-14 4.2E-19 135.1 7.6 65 67-138 9-73 (389)
21 PRK14297 chaperone protein Dna 99.5 1.7E-14 3.7E-19 135.1 7.1 65 67-138 4-68 (380)
22 PRK14301 chaperone protein Dna 99.5 2.1E-14 4.6E-19 134.2 7.0 65 67-138 4-68 (373)
23 cd06257 DnaJ DnaJ domain or J- 99.5 6E-14 1.3E-18 95.6 7.2 55 68-129 1-55 (55)
24 PRK14294 chaperone protein Dna 99.5 3.3E-14 7.1E-19 132.5 7.4 66 66-138 3-68 (366)
25 KOG0716|consensus 99.5 5E-14 1.1E-18 125.1 8.0 67 66-139 30-96 (279)
26 PRK14287 chaperone protein Dna 99.5 3.3E-14 7.2E-19 132.8 7.2 64 67-138 4-67 (371)
27 PRK14277 chaperone protein Dna 99.5 3.5E-14 7.7E-19 133.2 7.4 65 67-138 5-69 (386)
28 PRK14298 chaperone protein Dna 99.5 3.5E-14 7.6E-19 132.9 7.0 64 67-138 5-68 (377)
29 PRK14299 chaperone protein Dna 99.5 4.9E-14 1.1E-18 127.6 7.2 64 67-138 4-67 (291)
30 PRK14284 chaperone protein Dna 99.5 5.6E-14 1.2E-18 132.1 7.4 65 67-138 1-65 (391)
31 PRK14276 chaperone protein Dna 99.5 5.5E-14 1.2E-18 131.7 7.3 64 67-138 4-67 (380)
32 PRK14281 chaperone protein Dna 99.5 5.1E-14 1.1E-18 132.6 7.0 65 67-138 3-67 (397)
33 PRK14280 chaperone protein Dna 99.5 6.2E-14 1.3E-18 131.1 7.4 64 67-138 4-67 (376)
34 PRK14291 chaperone protein Dna 99.5 6.4E-14 1.4E-18 131.3 7.5 64 67-138 3-66 (382)
35 PRK14285 chaperone protein Dna 99.5 7.5E-14 1.6E-18 130.2 7.6 65 67-138 3-67 (365)
36 PRK10767 chaperone protein Dna 99.5 8E-14 1.7E-18 130.1 7.4 65 67-138 4-68 (371)
37 PTZ00341 Ring-infected erythro 99.5 1.6E-12 3.5E-17 131.6 16.3 83 66-158 572-654 (1136)
38 PTZ00037 DnaJ_C chaperone prot 99.5 8.5E-14 1.8E-18 132.0 6.6 62 66-138 27-88 (421)
39 PRK14292 chaperone protein Dna 99.4 1.3E-13 2.8E-18 128.6 7.2 64 67-138 2-65 (371)
40 PRK14290 chaperone protein Dna 99.4 1.4E-13 3.1E-18 128.2 7.4 66 67-138 3-68 (365)
41 PRK14293 chaperone protein Dna 99.4 1.5E-13 3.1E-18 128.5 7.4 64 67-138 3-66 (374)
42 PRK14278 chaperone protein Dna 99.4 1.1E-13 2.5E-18 129.5 6.6 64 67-138 3-66 (378)
43 PRK14283 chaperone protein Dna 99.4 1.6E-13 3.5E-18 128.4 7.3 65 66-138 4-68 (378)
44 PRK14300 chaperone protein Dna 99.4 1.7E-13 3.6E-18 128.1 7.3 64 67-138 3-66 (372)
45 PRK14289 chaperone protein Dna 99.4 2.3E-13 5E-18 127.6 7.5 65 67-138 5-69 (386)
46 PRK10266 curved DNA-binding pr 99.4 2.5E-13 5.3E-18 123.8 7.3 64 67-138 4-67 (306)
47 TIGR02349 DnaJ_bact chaperone 99.4 2.4E-13 5.2E-18 126.0 7.2 63 68-138 1-63 (354)
48 KOG0712|consensus 99.4 2.4E-13 5.1E-18 124.9 6.3 63 66-138 3-65 (337)
49 KOG0719|consensus 99.4 1.6E-11 3.4E-16 107.3 13.6 68 66-138 13-80 (264)
50 KOG0715|consensus 99.4 1.6E-12 3.4E-17 117.9 7.6 70 65-142 41-110 (288)
51 KOG0717|consensus 99.3 1.9E-12 4E-17 122.1 6.9 69 65-139 6-74 (508)
52 KOG0691|consensus 99.3 9.3E-12 2E-16 113.0 11.2 69 66-141 4-72 (296)
53 TIGR03835 termin_org_DnaJ term 99.3 5.3E-12 1.1E-16 125.4 9.4 65 67-139 2-66 (871)
54 COG2214 CbpA DnaJ-class molecu 99.3 5.2E-12 1.1E-16 105.6 7.2 67 66-138 5-71 (237)
55 KOG0721|consensus 99.2 1.7E-11 3.7E-16 106.0 7.3 70 66-144 98-167 (230)
56 PHA03102 Small T antigen; Revi 99.2 1.8E-11 3.9E-16 101.5 7.0 63 67-138 5-67 (153)
57 KOG0624|consensus 99.1 2.8E-10 6.1E-15 105.1 7.3 69 65-137 392-460 (504)
58 PRK09430 djlA Dna-J like membr 99.1 3.7E-10 8E-15 101.4 7.3 63 65-129 198-262 (267)
59 KOG0722|consensus 99.0 2.4E-10 5.3E-15 101.2 4.6 68 66-141 32-99 (329)
60 KOG0720|consensus 99.0 3.7E-10 8E-15 106.7 5.5 68 65-140 233-300 (490)
61 PTZ00100 DnaJ chaperone protei 98.9 3.7E-09 7.9E-14 83.9 5.9 57 61-128 59-115 (116)
62 COG5407 SEC63 Preprotein trans 98.7 1.4E-08 3E-13 96.2 6.1 72 66-139 97-168 (610)
63 PHA02624 large T antigen; Prov 98.7 1.3E-08 2.9E-13 100.1 5.8 63 66-137 10-72 (647)
64 KOG0550|consensus 98.7 2.5E-08 5.5E-13 93.7 5.8 68 65-138 371-438 (486)
65 KOG0714|consensus 98.7 1.6E-08 3.5E-13 88.5 4.2 68 66-139 2-69 (306)
66 KOG1150|consensus 98.6 1.1E-07 2.3E-12 81.9 6.2 64 66-135 52-115 (250)
67 COG5269 ZUO1 Ribosome-associat 98.4 3.2E-07 6.9E-12 82.3 6.0 88 66-156 42-130 (379)
68 KOG0568|consensus 97.5 0.00017 3.6E-09 63.8 5.8 57 67-131 47-104 (342)
69 KOG1789|consensus 97.4 0.00023 4.9E-09 73.9 5.8 54 67-128 1281-1336(2235)
70 KOG0723|consensus 96.9 0.002 4.3E-08 50.5 5.3 60 60-130 49-108 (112)
71 PF13248 zf-ribbon_3: zinc-rib 96.4 0.0017 3.7E-08 38.1 1.5 22 38-59 3-24 (26)
72 KOG0431|consensus 96.1 0.0098 2.1E-07 57.5 5.6 48 81-128 400-449 (453)
73 PF13240 zinc_ribbon_2: zinc-r 96.0 0.0033 7.2E-08 36.0 1.2 21 39-59 1-21 (23)
74 COG1076 DjlA DnaJ-domain-conta 94.9 0.034 7.4E-07 46.8 4.0 59 67-127 113-173 (174)
75 PF10571 UPF0547: Uncharacteri 93.9 0.039 8.5E-07 32.5 1.7 22 39-60 2-23 (26)
76 PF14687 DUF4460: Domain of un 93.5 0.2 4.4E-06 39.6 5.6 51 81-132 6-56 (112)
77 PF09538 FYDLN_acid: Protein o 93.3 0.06 1.3E-06 42.3 2.3 28 37-64 9-39 (108)
78 TIGR02300 FYDLN_acid conserved 91.1 0.16 3.5E-06 41.0 2.3 29 36-64 8-39 (129)
79 PRK00420 hypothetical protein; 91.0 0.15 3.2E-06 40.4 2.0 26 36-61 22-50 (112)
80 PF09862 DUF2089: Protein of u 88.1 0.29 6.2E-06 38.8 1.6 35 40-74 1-35 (113)
81 PF07754 DUF1610: Domain of un 87.8 0.33 7.2E-06 28.1 1.3 19 40-58 1-23 (24)
82 PF12773 DZR: Double zinc ribb 87.1 0.68 1.5E-05 30.6 2.8 27 35-61 10-39 (50)
83 COG1645 Uncharacterized Zn-fin 86.1 0.36 7.9E-06 39.2 1.2 23 37-59 28-52 (131)
84 PF12773 DZR: Double zinc ribb 85.2 0.52 1.1E-05 31.2 1.4 23 40-62 1-23 (50)
85 PRK14559 putative protein seri 84.7 0.72 1.6E-05 46.7 2.8 38 23-60 12-50 (645)
86 PF09889 DUF2116: Uncharacteri 84.5 0.32 6.8E-06 34.3 0.1 22 38-59 4-26 (59)
87 PF14803 Nudix_N_2: Nudix N-te 84.1 0.36 7.9E-06 30.2 0.3 22 39-60 2-31 (34)
88 PF08271 TF_Zn_Ribbon: TFIIB z 81.8 0.72 1.6E-05 29.9 1.0 24 39-62 2-30 (43)
89 PRK14811 formamidopyrimidine-D 79.9 1.2 2.6E-05 40.2 2.1 27 37-63 235-267 (269)
90 PF06677 Auto_anti-p27: Sjogre 78.6 1.3 2.8E-05 28.9 1.4 22 37-58 17-41 (41)
91 PF14353 CpXC: CpXC protein 78.4 1 2.2E-05 35.7 1.1 21 50-70 37-57 (128)
92 PRK14890 putative Zn-ribbon RN 77.8 1.5 3.2E-05 30.9 1.6 21 39-59 9-33 (59)
93 PF07282 OrfB_Zn_ribbon: Putat 77.1 1.5 3.3E-05 30.8 1.5 26 36-61 27-56 (69)
94 smart00661 RPOL9 RNA polymeras 76.8 1.7 3.7E-05 28.7 1.7 28 39-66 2-35 (52)
95 PRK01103 formamidopyrimidine/5 76.4 1.4 3E-05 39.7 1.5 23 37-59 245-273 (274)
96 PF08792 A2L_zn_ribbon: A2L zi 76.3 1.6 3.4E-05 27.1 1.3 25 38-62 4-32 (33)
97 PF03357 Snf7: Snf7; InterPro 76.0 16 0.00035 29.5 7.6 56 175-230 12-67 (171)
98 PRK00398 rpoP DNA-directed RNA 75.1 1.8 3.8E-05 28.4 1.3 21 39-59 5-29 (46)
99 COG0675 Transposase and inacti 74.8 1.6 3.5E-05 38.8 1.4 28 34-62 306-333 (364)
100 COG1439 Predicted nucleic acid 74.4 1.5 3.3E-05 37.3 1.2 23 37-59 139-161 (177)
101 COG4306 Uncharacterized protei 73.9 1.3 2.8E-05 36.0 0.5 37 27-63 29-80 (160)
102 PRK14559 putative protein seri 73.2 3.2 6.9E-05 42.1 3.2 36 28-65 6-41 (645)
103 CHL00174 accD acetyl-CoA carbo 72.5 1.9 4.2E-05 39.6 1.4 44 21-64 22-70 (296)
104 PF13877 RPAP3_C: Potential Mo 71.9 13 0.00028 27.7 5.6 49 118-166 8-58 (94)
105 PTZ00446 vacuolar sorting prot 70.6 37 0.0008 29.3 8.7 69 156-230 25-93 (191)
106 COG2888 Predicted Zn-ribbon RN 68.8 3.4 7.3E-05 29.2 1.6 11 49-59 48-58 (61)
107 PF03656 Pam16: Pam16; InterP 68.1 13 0.00029 30.0 5.2 53 68-131 59-111 (127)
108 PF14205 Cys_rich_KTR: Cystein 68.0 3.5 7.5E-05 28.6 1.5 21 39-59 6-36 (55)
109 COG1592 Rubrerythrin [Energy p 67.2 3.5 7.5E-05 34.9 1.7 30 38-73 135-165 (166)
110 TIGR01384 TFS_arch transcripti 66.8 4.3 9.4E-05 30.9 2.1 26 39-64 2-29 (104)
111 PRK14890 putative Zn-ribbon RN 66.2 4.1 9E-05 28.7 1.6 25 37-61 25-58 (59)
112 PRK12380 hydrogenase nickel in 66.0 3.1 6.7E-05 32.7 1.1 23 38-60 71-95 (113)
113 COG2888 Predicted Zn-ribbon RN 65.6 3.1 6.6E-05 29.4 0.9 25 37-61 9-37 (61)
114 PRK03681 hypA hydrogenase nick 65.4 3.3 7.1E-05 32.6 1.1 23 39-61 72-97 (114)
115 TIGR02098 MJ0042_CXXC MJ0042 f 65.2 3.2 6.8E-05 25.8 0.9 23 38-60 3-34 (38)
116 PRK04023 DNA polymerase II lar 65.1 11 0.00023 40.3 5.0 34 27-61 627-661 (1121)
117 PRK14892 putative transcriptio 64.4 4.3 9.3E-05 31.4 1.6 22 38-59 22-50 (99)
118 COG0375 HybF Zn finger protein 63.9 3.2 6.9E-05 33.0 0.8 22 39-60 72-95 (115)
119 KOG4623|consensus 63.7 2.9 6.2E-05 41.1 0.6 26 36-61 27-57 (611)
120 COG4640 Predicted membrane pro 63.3 4.1 8.9E-05 38.9 1.6 22 38-59 2-23 (465)
121 PF09297 zf-NADH-PPase: NADH p 62.4 4 8.7E-05 24.7 0.9 24 37-60 3-30 (32)
122 PRK12495 hypothetical protein; 61.9 4.5 9.7E-05 35.7 1.4 25 37-61 42-68 (226)
123 PF09779 Ima1_N: Ima1 N-termin 61.4 5.1 0.00011 32.3 1.6 25 38-62 1-31 (131)
124 KOG2910|consensus 61.0 60 0.0013 28.2 8.1 57 174-230 26-82 (209)
125 smart00685 DM14 Repeats in fly 60.6 55 0.0012 23.0 6.5 43 182-224 3-45 (59)
126 TIGR00373 conserved hypothetic 60.4 4 8.7E-05 33.9 0.9 23 38-60 110-137 (158)
127 PF08772 NOB1_Zn_bind: Nin one 60.3 4.2 9E-05 29.8 0.8 21 39-59 11-32 (73)
128 PF02150 RNA_POL_M_15KD: RNA p 60.0 4 8.8E-05 25.5 0.6 26 39-64 3-33 (35)
129 PF06044 DRP: Dam-replacing fa 60.0 3.4 7.3E-05 37.0 0.4 23 37-59 31-61 (254)
130 PRK06266 transcription initiat 59.0 5 0.00011 34.1 1.3 24 38-61 118-146 (178)
131 PRK11827 hypothetical protein; 58.9 7.1 0.00015 27.5 1.8 27 39-65 10-40 (60)
132 COG4031 Predicted metal-bindin 58.6 5.5 0.00012 34.5 1.4 61 39-102 2-74 (227)
133 PF06827 zf-FPG_IleRS: Zinc fi 58.6 2.9 6.2E-05 24.8 -0.2 21 39-59 3-29 (30)
134 PRK03824 hypA hydrogenase nick 57.9 5.2 0.00011 32.4 1.1 10 38-47 71-80 (135)
135 COG2093 DNA-directed RNA polym 57.5 7.6 0.00017 27.7 1.7 34 37-73 4-37 (64)
136 COG3024 Uncharacterized protei 57.3 5.7 0.00012 28.4 1.1 24 37-60 7-36 (65)
137 TIGR00100 hypA hydrogenase nic 56.0 6.1 0.00013 31.1 1.2 24 38-61 71-96 (115)
138 PHA02942 putative transposase; 55.8 7.1 0.00015 37.0 1.8 26 36-61 324-352 (383)
139 PRK12286 rpmF 50S ribosomal pr 54.6 7.7 0.00017 27.0 1.4 20 39-59 29-48 (57)
140 smart00531 TFIIE Transcription 54.2 5.3 0.00011 32.6 0.6 25 37-61 99-133 (147)
141 PF10083 DUF2321: Uncharacteri 54.2 3.1 6.8E-05 34.8 -0.7 21 27-47 29-49 (158)
142 PRK00418 DNA gyrase inhibitor; 53.8 6.8 0.00015 27.9 1.0 23 37-59 6-34 (62)
143 cd04476 RPA1_DBD_C RPA1_DBD_C: 53.5 6.2 0.00013 32.4 0.9 26 35-60 32-60 (166)
144 PRK14714 DNA polymerase II lar 53.2 10 0.00023 41.1 2.7 26 38-64 680-705 (1337)
145 PHA00626 hypothetical protein 52.9 10 0.00022 26.5 1.7 23 39-61 2-33 (59)
146 PRK01343 zinc-binding protein; 52.5 8.8 0.00019 26.8 1.4 24 37-60 9-34 (57)
147 PF09567 RE_MamI: MamI restric 52.4 7.9 0.00017 35.0 1.4 38 39-77 84-121 (314)
148 TIGR01031 rpmF_bact ribosomal 52.2 8.2 0.00018 26.6 1.2 20 39-59 28-47 (55)
149 PRK14714 DNA polymerase II lar 52.1 11 0.00025 40.8 2.8 9 39-47 694-702 (1337)
150 TIGR00686 phnA alkylphosphonat 51.9 10 0.00022 29.9 1.8 28 37-64 2-32 (109)
151 smart00834 CxxC_CXXC_SSSS Puta 51.9 10 0.00023 23.5 1.6 21 39-59 7-34 (41)
152 PF12172 DUF35_N: Rubredoxin-l 51.8 4.9 0.00011 25.0 0.0 21 39-59 13-33 (37)
153 PF01783 Ribosomal_L32p: Ribos 51.6 9.7 0.00021 26.2 1.5 21 38-59 27-47 (56)
154 PRK11788 tetratricopeptide rep 51.0 9.3 0.0002 34.6 1.7 25 38-62 355-379 (389)
155 PRK00564 hypA hydrogenase nick 51.0 8.9 0.00019 30.3 1.4 23 38-60 72-97 (117)
156 PF08274 PhnA_Zn_Ribbon: PhnA 50.2 7.1 0.00015 23.7 0.6 22 38-59 3-27 (30)
157 COG1096 Predicted RNA-binding 50.1 8 0.00017 33.3 1.1 25 37-61 149-175 (188)
158 TIGR00201 comF comF family pro 50.1 8.5 0.00019 32.4 1.2 21 40-60 1-21 (190)
159 PF13719 zinc_ribbon_5: zinc-r 49.9 9.3 0.0002 24.0 1.1 24 38-61 3-35 (37)
160 COG1198 PriA Primosomal protei 49.7 11 0.00023 38.9 2.1 27 38-64 445-475 (730)
161 PRK12366 replication factor A; 49.4 7.8 0.00017 39.2 1.0 26 37-62 532-559 (637)
162 TIGR00622 ssl1 transcription f 48.9 11 0.00025 29.8 1.7 40 39-78 3-44 (112)
163 TIGR02642 phage_xxxx uncharact 48.8 13 0.00027 32.1 2.1 59 37-97 99-162 (186)
164 COG3877 Uncharacterized protei 48.5 7.5 0.00016 30.7 0.6 37 37-73 6-42 (122)
165 PF02151 UVR: UvrB/uvrC motif; 48.3 54 0.0012 20.3 4.4 30 185-214 2-31 (36)
166 PF11023 DUF2614: Protein of u 47.6 9.5 0.00021 30.3 1.0 24 36-59 68-93 (114)
167 PRK10220 hypothetical protein; 47.2 13 0.00029 29.3 1.8 28 37-64 3-33 (111)
168 TIGR00595 priA primosomal prot 46.9 15 0.00032 36.1 2.5 29 36-64 221-253 (505)
169 PF09986 DUF2225: Uncharacteri 46.8 1.9E+02 0.0042 25.0 11.2 73 50-136 47-119 (214)
170 PRK09710 lar restriction allev 46.2 13 0.00028 26.6 1.5 28 38-65 7-41 (64)
171 PF13453 zf-TFIIB: Transcripti 45.6 8.5 0.00018 24.5 0.4 22 39-60 1-28 (41)
172 PRK04023 DNA polymerase II lar 45.2 14 0.0003 39.4 2.1 24 35-60 624-647 (1121)
173 PRK00432 30S ribosomal protein 45.1 15 0.00033 24.7 1.6 24 36-59 19-45 (50)
174 PF14260 zf-C4pol: C4-type zin 45.0 25 0.00053 24.9 2.8 46 40-96 1-47 (73)
175 PF01155 HypA: Hydrogenase exp 45.0 5.8 0.00013 31.0 -0.5 22 39-60 72-95 (113)
176 PRK08351 DNA-directed RNA poly 44.6 14 0.00031 26.1 1.5 33 38-75 4-36 (61)
177 smart00659 RPOLCX RNA polymera 44.6 14 0.00031 24.2 1.4 21 39-59 4-27 (44)
178 PF08447 PAS_3: PAS fold; Int 44.3 6 0.00013 27.9 -0.5 30 66-101 5-35 (91)
179 PRK00423 tfb transcription ini 44.3 14 0.00031 33.8 1.9 26 37-62 11-41 (310)
180 COG1545 Predicted nucleic-acid 43.6 12 0.00027 30.4 1.2 24 37-60 29-52 (140)
181 KOG4191|consensus 43.6 1.7E+02 0.0037 28.8 8.9 69 131-199 379-460 (516)
182 PRK14873 primosome assembly pr 43.5 17 0.00036 37.1 2.3 40 36-75 391-438 (665)
183 TIGR03338 phnR_burk phosphonat 43.0 1.2E+02 0.0026 25.3 7.3 52 119-170 39-95 (212)
184 PRK02935 hypothetical protein; 43.0 15 0.00033 28.9 1.5 25 35-59 68-94 (110)
185 PF00098 zf-CCHC: Zinc knuckle 42.5 25 0.00054 18.6 1.9 8 39-46 2-9 (18)
186 PF13717 zinc_ribbon_4: zinc-r 41.7 17 0.00037 22.7 1.4 22 39-60 4-34 (36)
187 PF07191 zinc-ribbons_6: zinc- 40.9 8.5 0.00018 28.0 -0.1 24 38-61 31-60 (70)
188 COG4307 Uncharacterized protei 40.5 9 0.0002 35.0 -0.1 25 38-62 4-28 (349)
189 COG5595 Zn-ribbon-containing, 40.4 13 0.00029 32.4 1.0 30 33-62 214-250 (256)
190 PRK05580 primosome assembly pr 40.2 20 0.00043 36.5 2.3 29 36-64 389-421 (679)
191 KOG1818|consensus 39.9 29 0.00063 35.2 3.3 71 2-78 137-226 (634)
192 PRK05654 acetyl-CoA carboxylas 39.3 13 0.00027 34.1 0.7 28 37-64 27-59 (292)
193 cd00729 rubredoxin_SM Rubredox 39.0 22 0.00048 21.9 1.6 21 39-59 4-26 (34)
194 PRK14102 nifW nitrogenase stab 38.9 81 0.0018 24.7 5.0 61 65-128 13-76 (105)
195 KOG1984|consensus 38.9 16 0.00034 38.4 1.3 66 35-103 336-421 (1007)
196 KOG0724|consensus 38.8 40 0.00087 30.8 3.9 51 82-133 5-55 (335)
197 COG2956 Predicted N-acetylgluc 38.6 18 0.0004 34.1 1.6 32 36-67 353-384 (389)
198 PF11781 RRN7: RNA polymerase 38.3 14 0.0003 23.2 0.6 23 39-61 10-35 (36)
199 PF07729 FCD: FCD domain; Int 37.1 36 0.00079 24.8 2.8 9 160-168 2-10 (125)
200 PF03206 NifW: Nitrogen fixati 36.9 1.2E+02 0.0026 23.7 5.7 63 64-130 12-77 (105)
201 PRK09521 exosome complex RNA-b 36.8 16 0.00035 30.9 0.9 24 39-62 151-177 (189)
202 PF14446 Prok-RING_1: Prokaryo 36.7 22 0.00048 24.6 1.4 23 37-59 5-29 (54)
203 PF07295 DUF1451: Protein of u 36.4 21 0.00045 29.5 1.5 22 39-60 114-139 (146)
204 PRK11414 colanic acid/biofilm 36.4 2.3E+02 0.005 23.9 8.1 51 120-170 40-95 (221)
205 KOG3795|consensus 36.0 17 0.00037 31.3 0.9 37 35-76 13-49 (230)
206 TIGR00515 accD acetyl-CoA carb 35.8 18 0.00039 33.1 1.1 28 37-64 26-58 (285)
207 PF03604 DNA_RNApol_7kD: DNA d 35.8 27 0.00058 21.4 1.5 20 40-59 3-25 (32)
208 KOG2906|consensus 35.7 13 0.00028 28.9 0.1 26 39-64 3-34 (105)
209 PF10071 DUF2310: Zn-ribbon-co 35.5 19 0.00041 32.6 1.2 29 35-63 218-253 (258)
210 COG1998 RPS31 Ribosomal protei 35.2 24 0.00051 24.1 1.3 26 36-61 18-47 (51)
211 PF11101 DUF2884: Protein of u 35.2 1.9E+02 0.0041 25.3 7.4 46 185-230 174-222 (229)
212 PRK08402 replication factor A; 35.0 18 0.00039 34.1 1.0 28 36-63 211-242 (355)
213 smart00709 Zpr1 Duplicated dom 34.8 16 0.00034 30.6 0.5 21 39-59 2-37 (160)
214 COG1110 Reverse gyrase [DNA re 34.7 18 0.0004 38.7 1.1 22 38-59 9-34 (1187)
215 TIGR00375 conserved hypothetic 34.5 64 0.0014 30.7 4.6 37 66-102 290-332 (374)
216 PRK13130 H/ACA RNA-protein com 34.3 36 0.00077 23.7 2.1 24 37-62 5-28 (56)
217 TIGR00340 zpr1_rel ZPR1-relate 34.2 18 0.00038 30.4 0.7 10 50-59 27-36 (163)
218 PRK12496 hypothetical protein; 34.2 28 0.0006 29.1 1.9 21 39-59 129-151 (164)
219 TIGR01206 lysW lysine biosynth 34.0 23 0.0005 24.4 1.1 24 38-61 3-32 (54)
220 PF14319 Zn_Tnp_IS91: Transpos 33.9 24 0.00052 27.6 1.4 25 37-61 42-70 (111)
221 TIGR00310 ZPR1_znf ZPR1 zinc f 33.7 20 0.00043 30.9 1.0 9 51-59 30-38 (192)
222 PRK07218 replication factor A; 33.1 26 0.00056 33.9 1.7 24 38-63 298-321 (423)
223 cd00350 rubredoxin_like Rubred 33.1 30 0.00066 21.0 1.5 21 39-59 3-25 (33)
224 PF04161 Arv1: Arv1-like famil 32.7 20 0.00043 31.0 0.8 51 39-96 2-63 (208)
225 PF03833 PolC_DP2: DNA polymer 32.7 15 0.00032 38.5 0.0 19 85-103 715-733 (900)
226 PF08646 Rep_fac-A_C: Replicat 32.7 15 0.00031 29.6 -0.0 24 37-60 18-46 (146)
227 COG3809 Uncharacterized protei 32.6 22 0.00048 26.6 0.9 23 39-61 3-31 (88)
228 COG2835 Uncharacterized conser 32.2 25 0.00055 24.8 1.1 27 39-65 10-40 (60)
229 PRK11032 hypothetical protein; 32.0 25 0.00055 29.5 1.3 22 38-59 125-150 (160)
230 PRK05978 hypothetical protein; 31.8 43 0.00093 27.8 2.6 37 37-73 33-77 (148)
231 PF12128 DUF3584: Protein of u 31.7 5.7E+02 0.012 28.0 11.7 58 77-140 596-653 (1201)
232 PRK06393 rpoE DNA-directed RNA 31.6 26 0.00057 25.0 1.1 34 37-75 5-38 (64)
233 PRK03988 translation initiatio 31.3 27 0.00059 28.5 1.4 28 37-64 102-136 (138)
234 PF01627 Hpt: Hpt domain; Int 30.8 1.5E+02 0.0033 20.5 5.2 27 188-214 11-40 (90)
235 PF13097 CENP-U: CENP-A nucleo 30.8 3.4E+02 0.0074 23.2 8.0 63 154-229 111-174 (175)
236 COG1802 GntR Transcriptional r 30.7 3.2E+02 0.0069 23.2 8.1 52 119-170 44-100 (230)
237 PRK00241 nudC NADH pyrophospha 30.5 37 0.00081 30.3 2.2 28 36-63 98-129 (256)
238 PF10005 DUF2248: Uncharacteri 30.3 30 0.00065 32.5 1.6 21 39-59 1-21 (343)
239 PF06556 ASFV_p27: IAP-like pr 30.3 12 0.00027 29.4 -0.8 34 38-76 94-128 (131)
240 COG1579 Zn-ribbon protein, pos 30.2 27 0.00058 31.3 1.2 27 35-61 195-231 (239)
241 PF13446 RPT: A repeated domai 29.9 1.8E+02 0.0039 19.7 5.9 26 68-95 6-31 (62)
242 PRK09401 reverse gyrase; Revie 29.8 24 0.00052 38.4 1.0 22 38-59 8-33 (1176)
243 KOG1598|consensus 29.8 28 0.00061 34.4 1.4 24 39-62 2-30 (521)
244 TIGR01054 rgy reverse gyrase. 29.4 27 0.00057 38.0 1.3 21 38-58 8-32 (1171)
245 PRK08665 ribonucleotide-diphos 29.4 33 0.00072 35.5 1.9 22 38-59 725-748 (752)
246 COG1594 RPB9 DNA-directed RNA 29.3 32 0.00069 27.1 1.4 28 38-65 3-36 (113)
247 TIGR00311 aIF-2beta translatio 29.0 32 0.0007 27.9 1.4 29 36-64 96-131 (133)
248 PF12644 DUF3782: Protein of u 28.8 1.8E+02 0.0038 19.8 5.0 28 159-186 9-39 (64)
249 PF01396 zf-C4_Topoisom: Topoi 28.8 42 0.00091 21.2 1.7 26 39-64 3-37 (39)
250 PF03833 PolC_DP2: DNA polymer 28.7 19 0.00041 37.7 0.0 20 39-59 669-688 (900)
251 PF03461 TRCF: TRCF domain; I 28.6 1.2E+02 0.0025 23.1 4.4 39 158-196 17-55 (101)
252 PF03993 DUF349: Domain of Unk 28.5 2E+02 0.0044 19.9 6.1 41 174-214 23-63 (77)
253 COG1571 Predicted DNA-binding 28.5 39 0.00085 32.6 2.1 28 36-63 349-379 (421)
254 COG2816 NPY1 NTP pyrophosphohy 27.9 36 0.00078 31.1 1.7 28 36-63 110-141 (279)
255 PF10122 Mu-like_Com: Mu-like 27.9 17 0.00038 24.8 -0.3 26 38-63 5-36 (51)
256 PRK08579 anaerobic ribonucleos 27.6 37 0.00081 34.4 1.9 23 37-59 568-590 (625)
257 COG1656 Uncharacterized conser 27.3 24 0.00052 29.8 0.4 11 37-47 97-107 (165)
258 PF07709 SRR: Seven Residue Re 27.2 38 0.00082 16.7 1.0 12 117-128 3-14 (14)
259 PRK00762 hypA hydrogenase nick 27.1 34 0.00073 27.2 1.2 23 38-61 71-102 (124)
260 PF09723 Zn-ribbon_8: Zinc rib 27.0 42 0.00092 21.5 1.4 21 39-59 7-34 (42)
261 PF03119 DNA_ligase_ZBD: NAD-d 26.4 26 0.00057 20.7 0.3 9 39-47 1-9 (28)
262 COG0777 AccD Acetyl-CoA carbox 26.4 30 0.00066 31.7 0.9 28 37-64 28-60 (294)
263 KOG2577|consensus 26.3 1.4E+02 0.0031 28.2 5.3 100 78-200 68-180 (354)
264 PF07361 Cytochrom_B562: Cytoc 26.3 3E+02 0.0065 21.0 6.6 25 192-216 70-94 (103)
265 PF14276 DUF4363: Domain of un 25.8 1.2E+02 0.0026 23.4 4.1 22 191-212 32-53 (121)
266 PF14369 zf-RING_3: zinc-finge 25.7 50 0.0011 20.5 1.6 20 40-59 5-29 (35)
267 PRK06386 replication factor A; 25.6 41 0.00088 31.8 1.6 20 38-59 237-256 (358)
268 TIGR03826 YvyF flagellar opero 25.6 46 0.001 27.2 1.7 24 35-59 79-102 (137)
269 TIGR03655 anti_R_Lar restricti 25.5 47 0.001 22.2 1.5 25 39-63 3-38 (53)
270 PF03367 zf-ZPR1: ZPR1 zinc-fi 25.3 19 0.00042 30.0 -0.5 21 39-59 3-38 (161)
271 PF06221 zf-C2HC5: Putative zi 25.3 61 0.0013 22.6 2.1 23 37-59 18-43 (57)
272 PF08312 cwf21: cwf21 domain; 25.1 1.8E+02 0.0038 19.3 4.2 14 158-171 11-24 (46)
273 PRK08271 anaerobic ribonucleos 25.1 45 0.00098 33.8 2.0 23 37-59 566-588 (623)
274 PF02591 DUF164: Putative zinc 24.9 41 0.00088 22.7 1.1 24 37-60 22-55 (56)
275 KOG0687|consensus 24.4 5.6E+02 0.012 24.4 8.8 16 150-165 66-81 (393)
276 PF13838 Clathrin_H_link: Clat 24.2 1.2E+02 0.0027 21.7 3.5 25 188-212 7-31 (66)
277 KOG0687|consensus 24.2 2.7E+02 0.0059 26.5 6.7 49 173-221 70-119 (393)
278 TIGR01666 YCCS hypothetical me 24.1 8.1E+02 0.018 25.3 11.8 47 82-128 159-215 (704)
279 COG2260 Predicted Zn-ribbon RN 24.0 1.2E+02 0.0025 21.4 3.2 24 37-62 5-28 (59)
280 PRK06342 transcription elongat 23.8 2.5E+02 0.0053 23.4 5.9 47 172-222 32-82 (160)
281 PF13812 PPR_3: Pentatricopept 23.8 1.2E+02 0.0025 17.1 2.9 23 193-215 7-29 (34)
282 TIGR00020 prfB peptide chain r 23.7 2.9E+02 0.0063 26.2 7.0 17 205-221 50-66 (364)
283 cd02340 ZZ_NBR1_like Zinc fing 23.5 58 0.0013 21.1 1.6 20 39-58 2-21 (43)
284 PF11278 DUF3079: Protein of u 23.3 47 0.001 22.6 1.1 29 35-63 11-40 (52)
285 PRK00415 rps27e 30S ribosomal 23.3 42 0.00092 23.6 1.0 25 37-61 11-40 (59)
286 smart00154 ZnF_AN1 AN1-like Zi 23.2 58 0.0013 20.7 1.5 20 40-59 1-20 (39)
287 COG1601 GCD7 Translation initi 23.0 22 0.00049 29.5 -0.5 33 37-69 105-144 (151)
288 COG0497 RecN ATPase involved i 23.0 5.8E+02 0.012 25.8 9.1 83 151-233 293-390 (557)
289 PRK14701 reverse gyrase; Provi 23.0 37 0.00079 38.3 0.9 21 38-58 7-31 (1638)
290 COG1326 Uncharacterized archae 22.9 31 0.00067 30.0 0.3 27 37-63 6-42 (201)
291 TIGR00354 polC DNA polymerase, 22.7 46 0.001 35.5 1.5 22 37-60 625-646 (1095)
292 TIGR01562 FdhE formate dehydro 22.5 43 0.00094 30.9 1.2 33 38-74 211-243 (305)
293 COG1675 TFA1 Transcription ini 22.5 46 0.001 28.4 1.3 25 37-61 113-142 (176)
294 COG0264 Tsf Translation elonga 22.5 1.3E+02 0.0028 27.8 4.2 39 172-212 5-43 (296)
295 cd00088 HPT Histidine Phosphot 22.5 2.1E+02 0.0046 20.4 4.8 14 200-213 29-42 (94)
296 PHA02998 RNA polymerase subuni 22.3 55 0.0012 28.2 1.6 28 36-63 142-184 (195)
297 PF10279 Latarcin: Latarcin pr 22.2 20 0.00044 26.6 -0.9 43 183-225 36-79 (81)
298 PF05600 DUF773: Protein of un 21.8 2.6E+02 0.0057 27.6 6.5 23 119-141 339-361 (507)
299 PRK11823 DNA repair protein Ra 21.8 54 0.0012 31.6 1.7 26 38-63 8-33 (446)
300 cd01121 Sms Sms (bacterial rad 21.8 54 0.0012 30.9 1.7 23 39-61 2-24 (372)
301 PF01895 PhoU: PhoU domain; I 21.4 2.8E+02 0.006 19.0 7.2 50 176-225 30-83 (88)
302 PF09845 DUF2072: Zn-ribbon co 21.4 64 0.0014 26.3 1.8 22 38-59 2-27 (131)
303 PRK00810 nifW nitrogenase stab 21.3 2.2E+02 0.0047 22.6 4.7 60 65-129 17-79 (113)
304 PF08882 Acetone_carb_G: Aceto 21.2 50 0.0011 26.2 1.1 17 49-65 72-88 (112)
305 PRK06556 vitamin B12-dependent 21.0 54 0.0012 35.0 1.6 26 35-60 922-949 (953)
306 PRK08270 anaerobic ribonucleos 20.8 58 0.0013 33.2 1.8 22 37-59 626-647 (656)
307 PF14122 YokU: YokU-like prote 20.7 47 0.001 25.2 0.8 30 48-77 32-80 (87)
308 COG5349 Uncharacterized protei 20.6 44 0.00096 26.9 0.7 39 37-75 21-67 (126)
309 TIGR02827 RNR_anaer_Bdell anae 20.4 61 0.0013 32.7 1.8 23 37-59 532-554 (586)
310 PF13597 NRDD: Anaerobic ribon 20.4 36 0.00077 33.9 0.1 32 39-71 493-526 (546)
311 PRK12336 translation initiatio 20.2 54 0.0012 28.3 1.2 29 37-65 98-133 (201)
312 PRK11595 DNA utilization prote 20.1 50 0.0011 28.6 1.0 23 37-59 34-56 (227)
No 1
>PRK01773 hscB co-chaperone HscB; Provisional
Probab=100.00 E-value=9.4e-46 Score=311.71 Aligned_cols=165 Identities=25% Similarity=0.466 Sum_probs=155.9
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhc-CCCc
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQ-NISI 145 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~-g~~~ 145 (237)
.|||++||||++|+||...|+++|++||+.+|||+|+++++.|+++|.++|+.||+||+||+||++||.|+|.++ |+++
T Consensus 2 ~nyF~lf~lp~~F~iD~~~L~~~y~~Lq~~~HPD~f~~~~~~eq~~a~~~ss~iN~AY~tLkdPl~RA~YLL~L~~g~~~ 81 (173)
T PRK01773 2 NNPFALFDLPVDFQLDNALLSERYLALQKSLHPDNFANSSAQEQRLAMQKSAEVNDALQILKDPILRAEAIIALNTGEQQ 81 (173)
T ss_pred CChHHhcCCCCCCCCCHHHHHHHHHHHHHHhCcCcccCCCHHHHHHHHHHHHHHHHHHHHHCChHHHHHHHHHhccCCCC
Confidence 599999999999999999999999999999999999999999999999999999999999999999999999999 8875
Q ss_pred c-ccccCCCHHHHHHHHHHHHHHhccC---CHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHH
Q psy8325 146 E-EDTKGTDQKLLMEILMLNEELDEAS---SEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVS 221 (237)
Q Consensus 146 ~-e~~~~~d~efLmeimE~rE~leea~---~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~ 221 (237)
. +..+..||+|||+||||||+|+++. +.++|..+..++++++.++...|.++|+.+||+.|+..++||+||.||.+
T Consensus 82 ~~e~~~~~d~~fLme~ME~rE~lee~~~~~d~~~L~~l~~~v~~~~~~~~~~l~~~~~~~d~~~A~~~~~rL~y~~kl~~ 161 (173)
T PRK01773 82 NLEEKSTQDMAFLMQQMEWREQLEEIEQQQDEDALTAFSKEIKQEQQAILTELSTALNSQQWQQASQINDRLRFIKKLII 161 (173)
T ss_pred CcccccCCCHHHHHHHHHHHHHHHhhcccCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHH
Confidence 4 4467889999999999999999864 56779999999999999999999999999999999999999999999999
Q ss_pred HHHHHHhhcc
Q psy8325 222 KVRDKKNQLE 231 (237)
Q Consensus 222 ~ik~k~~~l~ 231 (237)
+|++..++|+
T Consensus 162 ei~~~~~~l~ 171 (173)
T PRK01773 162 EIERVEEKLF 171 (173)
T ss_pred HHHHHHHHhh
Confidence 9999888763
No 2
>PRK03578 hscB co-chaperone HscB; Provisional
Probab=100.00 E-value=9e-45 Score=306.60 Aligned_cols=166 Identities=20% Similarity=0.459 Sum_probs=157.9
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCCc
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNISI 145 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~~ 145 (237)
..|||++||||++|++|..+|+++||+||+++|||+|++.++.++.+|.++|+.||+||+||+||.+|+.|++.++|+++
T Consensus 5 ~~dyf~llglp~~f~~d~~~L~~~yr~lq~~~HPD~~~~~~~~e~~~a~~~s~~iN~AY~tL~~p~~Ra~Yll~l~G~~~ 84 (176)
T PRK03578 5 KDDHFSLFGLPARFALDEAALDAAYRTVQAQVHPDRFAAAGDAEKRVAMQWATRANEAYQTLRDPLKRARYLLHLRGVDV 84 (176)
T ss_pred CCCHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCCCCCHHHHHHHHHHHHHHHHHHHHhCChhhHHHHHHHhcCCCC
Confidence 47999999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred ccc-ccCCCHHHHHHHHHHHHHHhccC---CHHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccHHHHHHHHHHHHhHHHHH
Q psy8325 146 EED-TKGTDQKLLMEILMLNEELDEAS---SEEDLEKLQTLIQATIDDLTKNVNASFEQ-KDFNQAKDLLIRMKYFTTLV 220 (237)
Q Consensus 146 ~e~-~~~~d~efLmeimE~rE~leea~---~~~~l~~l~~~~~~~i~~~~~~l~~~f~~-~d~~~A~~~l~rlkY~~~l~ 220 (237)
..+ .+..||+||||||||||+|+++. +.++|..|..+++++++++.+.|.++|+. +||+.|+..+++|+||.||.
T Consensus 85 ~~e~~~~~d~~fLme~mE~rE~lee~~~~~d~~~L~~l~~e~~~~~~~~~~~l~~~~~~~~d~~~A~~~~~kL~y~~kl~ 164 (176)
T PRK03578 85 QAENNTAMPPAFLMQQMEWREAIEDARAARDVDALDALLAELRDERRERYAELGALLDSRGDDQAAAEAVRQLMFIEKLA 164 (176)
T ss_pred ccccCCCCCHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHccccHHHHHHHHHHHHHHHHHH
Confidence 754 67889999999999999999886 67889999999999999999999999988 89999999999999999999
Q ss_pred HHHHHHHhhcc
Q psy8325 221 SKVRDKKNQLE 231 (237)
Q Consensus 221 ~~ik~k~~~l~ 231 (237)
++|+++++.|.
T Consensus 165 ~ei~~~~~~l~ 175 (176)
T PRK03578 165 QEIGAAIERLE 175 (176)
T ss_pred HHHHHHHHHhh
Confidence 99999988763
No 3
>PRK05014 hscB co-chaperone HscB; Provisional
Probab=100.00 E-value=1.4e-43 Score=298.08 Aligned_cols=165 Identities=28% Similarity=0.528 Sum_probs=155.3
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCCcc
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNISIE 146 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~~~ 146 (237)
.|||++||||++|++|..+|+++||++|+++|||++++.++.++..|.++|+.||+||+||+||.+|+.|++.++|+++.
T Consensus 1 ~~yf~llgl~~~~~~d~~~L~~~yr~l~~~~HPDk~~~~~~~~~~~a~~~s~~iN~AY~~L~dp~~Ra~Yll~l~g~~~~ 80 (171)
T PRK05014 1 MDYFTLFGLPARYDIDTQLLASRYQELQRQFHPDKFANASERERLLAVQQAATINDAYQTLKHPLKRAEYLLSLHGFDLA 80 (171)
T ss_pred CCHHHHCCCCCCCCCCHHHHHHHHHHHHHHhCcCCCCCCcHHHHHHHHHHHHHHHHHHHHHCChhHHHHHHHHhcCCccc
Confidence 48999999999999999999999999999999999999989999999999999999999999999999999999999987
Q ss_pred cc-ccCCCHHHHHHHHHHHHHHhccCC----HHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHH
Q psy8325 147 ED-TKGTDQKLLMEILMLNEELDEASS----EEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVS 221 (237)
Q Consensus 147 e~-~~~~d~efLmeimE~rE~leea~~----~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~ 221 (237)
.+ .+..||+|||+||||||+|+++.+ .+.+..|..+++++++++.+.|..+|+.+||+.|+..+++|+||.||.+
T Consensus 81 ~~~~~~~d~efLme~me~rE~le~~~~~~d~~~~l~~l~~~~~~~~~~~~~~l~~~~~~~d~~~A~~~~~~Lky~~kl~~ 160 (171)
T PRK05014 81 HEQHTVRDTAFLMEQMELREELEDIEQSKDPEAALESFIKRVKKMFKTRLQQMVEQLDNEAWDAAADTVRKLKFLDKLRS 160 (171)
T ss_pred cccCCcCCHHHHHHHHHHHHHHHhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHH
Confidence 55 567899999999999999999863 3449999999999999999999999999999999999999999999999
Q ss_pred HHHHHHhhcc
Q psy8325 222 KVRDKKNQLE 231 (237)
Q Consensus 222 ~ik~k~~~l~ 231 (237)
+|+++.+.|+
T Consensus 161 ei~~~~~~l~ 170 (171)
T PRK05014 161 EVEQLEEKLL 170 (171)
T ss_pred HHHHHHHHhh
Confidence 9999988763
No 4
>PRK01356 hscB co-chaperone HscB; Provisional
Probab=100.00 E-value=3.5e-43 Score=294.46 Aligned_cols=160 Identities=28% Similarity=0.406 Sum_probs=151.0
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCCcc
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNISIE 146 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~~~ 146 (237)
.|||++||||++|++|...|+++||+||+++|||++++ ..++..+.+.|+.||+||+||+||.+||.|+|.++|+++.
T Consensus 2 ~~yf~llgl~~~f~id~~~L~~aYr~lq~~~HPDk~~~--~~~k~~~~~~s~~in~AY~~L~dp~~Ra~YlL~l~g~~~~ 79 (166)
T PRK01356 2 QNYFQLLGLPQEYNIDLKILEKQYFAMQVKYHPDKAKT--LQEKEQNLIIASELNNAYSTLKDALKRAEYMLLLQNINLN 79 (166)
T ss_pred CCHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCCC--HHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHccCCCCC
Confidence 59999999999999999999999999999999999975 4566677789999999999999999999999999999987
Q ss_pred cc--ccCCCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHHHHH
Q psy8325 147 ED--TKGTDQKLLMEILMLNEELDEASSEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVSKVR 224 (237)
Q Consensus 147 e~--~~~~d~efLmeimE~rE~leea~~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~~ik 224 (237)
++ .+.+||+|||++||+||+||++.+.++|++|..+++.++.++.+.|.++|+.+||+.|+..+++|+||.||.++|+
T Consensus 80 ~~~~~~~~d~~fLme~me~rE~le~~~~~~~L~~l~~~~~~~~~~~~~~l~~~f~~~d~~~A~~~~~~L~y~~kl~~~i~ 159 (166)
T PRK01356 80 DEKTRSLLSPLELSIFWDEMERIENTILFSDLEKIKNKYELMYKNEIDSLKQAFEEQNLSDATIKTSKLKYIGTLLNKLQ 159 (166)
T ss_pred CccccccCCHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 65 3678999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHh
Q psy8325 225 DKKN 228 (237)
Q Consensus 225 ~k~~ 228 (237)
+|+.
T Consensus 160 ~k~~ 163 (166)
T PRK01356 160 EKIK 163 (166)
T ss_pred HHhh
Confidence 9875
No 5
>PRK00294 hscB co-chaperone HscB; Provisional
Probab=100.00 E-value=1.8e-42 Score=291.73 Aligned_cols=166 Identities=28% Similarity=0.533 Sum_probs=145.3
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCCc
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNISI 145 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~~ 145 (237)
..|||++||||++|+||..+|+++||+||+++|||+|++.++.|+..|.++|+.||+||+||+||.+|+.|++.++|+++
T Consensus 3 ~~~~F~l~~l~~~f~id~~~L~~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~IN~AY~~L~~p~~Ra~YlL~l~g~~~ 82 (173)
T PRK00294 3 TPCHFALFDLQPSFRLDLDQLATRYRELAREVHPDRFADAPEREQRLALERSASLNEAYQTLKSPPRRARYLLALSGHEV 82 (173)
T ss_pred CCChhhhcCcCCCCCCCHHHHHHHHHHHHHHHCcCCCCCCcHHHHHHHHHHHHHHHHHHHHhCChhhhHHHHHHhcCCCC
Confidence 47999999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred cccccCCCHHHHHHHHHHHHHHhccCCHHHHHHH---HHHHHHHHHHHHHHHHHHhcc-ccHHHHHHHHHHHHhHHHHHH
Q psy8325 146 EEDTKGTDQKLLMEILMLNEELDEASSEEDLEKL---QTLIQATIDDLTKNVNASFEQ-KDFNQAKDLLIRMKYFTTLVS 221 (237)
Q Consensus 146 ~e~~~~~d~efLmeimE~rE~leea~~~~~l~~l---~~~~~~~i~~~~~~l~~~f~~-~d~~~A~~~l~rlkY~~~l~~ 221 (237)
.++.+..||+|||++|||||+|+++.+.+++.+| ..+++.....+...+..+|+. +||+.|+..++||+||.||.+
T Consensus 83 ~~~~~~~d~~fLme~me~rE~le~~~~~~d~~~l~~~~~~i~~~~~~l~~~~~~~~~~~~~~~~A~~~v~kl~f~~kl~~ 162 (173)
T PRK00294 83 PLEVTVHDPEFLLQQMQLREELEELQDEADLAGVATFKRRLKAAQDELNESFAACWDDAARREEAERLMRRMQFLDKLAQ 162 (173)
T ss_pred CcccCCCCHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHH
Confidence 7666778999999999999999999876655555 444444444444555555554 569999999999999999999
Q ss_pred HHHHHHhhcc
Q psy8325 222 KVRDKKNQLE 231 (237)
Q Consensus 222 ~ik~k~~~l~ 231 (237)
+|.+..+.|+
T Consensus 163 ei~~~~~~l~ 172 (173)
T PRK00294 163 EVRQLEERLD 172 (173)
T ss_pred HHHHHHHHhh
Confidence 9998887763
No 6
>TIGR00714 hscB Fe-S protein assembly co-chaperone HscB. This model describes the small subunit, Hsc20 (20K heat shock cognate protein) of a pair of proteins Hsc66-Hsc20, related to the DnaK-DnaJ heat shock proteins, which also serve as molecular chaperones. Hsc20, unlike DnaJ, appears not to have chaperone activity on its own, but to act solely as a regulatory subunit for Hsc66 (i.e., to be a co-chaperone). The gene for Hsc20 in E. coli, hscB, is not induced by heat shock.
Probab=100.00 E-value=5.5e-39 Score=266.86 Aligned_cols=152 Identities=30% Similarity=0.501 Sum_probs=143.5
Q ss_pred CCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCCcccc-ccCCCHHHH
Q psy8325 79 YLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNISIEED-TKGTDQKLL 157 (237)
Q Consensus 79 f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~~~e~-~~~~d~efL 157 (237)
|+||..+|+++||+||+++|||+|++.++.++..+.++|+.||+||+||+||.+||.|++.++|+++.++ .+..||+||
T Consensus 1 f~iD~~~L~~~yr~lq~~~HPD~~~~~~~~~~~~a~~~s~~iN~AY~~L~~p~~Ra~ylL~l~g~~~~~e~~~~~d~~fL 80 (157)
T TIGR00714 1 YQLDTQALSLRYQDLQRQYHPDKFASGSAQEQLAAVQQSTTLNQAYQTLKDPLMRAEYMLSLHGIDLASEQHSVRDTAFL 80 (157)
T ss_pred CCCCHHHHHHHHHHHHHHHCcCCCCCCChhhhHHHHHHHHHHHHHHHHhCChhhhHHHHHHhcCCCCCcccCCCCCHHHH
Confidence 6899999999999999999999999999889999999999999999999999999999999999998765 678899999
Q ss_pred HHHHHHHHHHhcc---CCHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHHHHHHHHhhc
Q psy8325 158 MEILMLNEELDEA---SSEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVSKVRDKKNQL 230 (237)
Q Consensus 158 meimE~rE~leea---~~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~~ik~k~~~l 230 (237)
|+||||||++|++ .+.++|..+..++++++.++.+.|.++|+.+||+.|+..+++||||.||.++|+++.+.|
T Consensus 81 me~Me~rE~lee~~~~~d~~~L~~l~~~~~~~~~~~~~~l~~~~~~~d~~~A~~~~~kLky~~kl~~~i~~~~~~l 156 (157)
T TIGR00714 81 MEQLELREELDEIEQAKDEARLESFIKRVKKMFQTRHQLLVEQLDNQTWAAAADYTRKLRFLDKLRSSAEQLEEKL 156 (157)
T ss_pred HHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 9999999998765 466789999999999999999999999999999999999999999999999999998765
No 7
>KOG3192|consensus
Probab=100.00 E-value=1.1e-35 Score=243.07 Aligned_cols=166 Identities=31% Similarity=0.543 Sum_probs=156.4
Q ss_pred cCCCCCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhh
Q psy8325 61 QKPDSQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSL 140 (237)
Q Consensus 61 qp~~~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l 140 (237)
++.+...+||.+||....|.+++..+...|...++++|||+...+...+...|.+.|+.+|+||.||+||++||.|++.+
T Consensus 2 ~~~~~~~~ff~~Fg~e~~~~~~p~~l~~~~~~~skkL~~d~~~~~~~~~~d~a~eqSa~lnkAY~TLk~pL~RA~Yilkl 81 (168)
T KOG3192|consen 2 LKMGSPSRFFDIFGMELSFKIDPDKLKEKYTDISKKLHPDRPGLSFAGDTDQASEQSAELNKAYDTLKDPLARARYLLKL 81 (168)
T ss_pred cccchHHHHHHHhccccCCCCCcchhhHHHHHHHHhhCcccccccccccchhHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 45556689999999999999999999999999999999999988888888899999999999999999999999999999
Q ss_pred cCCCccccccCCCHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHH
Q psy8325 141 QNISIEEDTKGTDQKLLMEILMLNEELDEASSEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLV 220 (237)
Q Consensus 141 ~g~~~~e~~~~~d~efLmeimE~rE~leea~~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~ 220 (237)
.|+...++....||+|||||||++|+|.++.+.+++..+.++|+.++.++.++|.++|+.++|++|...+.||+||+++.
T Consensus 82 ~g~e~~sne~stDpe~Lmevle~~E~IS~~~De~~l~~lk~q~q~ri~q~~~qlge~~esk~~~~Al~~i~rlrY~~~~~ 161 (168)
T KOG3192|consen 82 KGQEQTSNELSTDPEFLMEVLEYHEAISEMDDEEDLKQLKSQNQERIAQCKQQLGEAFESKKYDEALKKILRLRYWYELR 161 (168)
T ss_pred hCCCCchhhhccCHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHHH
Confidence 99988877556699999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHH
Q psy8325 221 SKVRDK 226 (237)
Q Consensus 221 ~~ik~k 226 (237)
..|..+
T Consensus 162 k~v~kk 167 (168)
T KOG3192|consen 162 KNVHKK 167 (168)
T ss_pred HHHhhc
Confidence 998754
No 8
>COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
Probab=99.87 E-value=2.4e-22 Score=169.45 Aligned_cols=166 Identities=28% Similarity=0.415 Sum_probs=154.0
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhc-CCCc
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQ-NISI 145 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~-g~~~ 145 (237)
.|||..||+++.|.+|.+.++..|+.+++.+|||++...+..++..+.+.++.+|.||.||++|+.|+.|++.+. |...
T Consensus 1 ~~~~~~~~~~~~f~~~~~~l~~~~~~~~~~~~~dr~~~~~~~~~~~~l~~~~~~~~a~~tLk~~l~ra~~~lal~~g~~~ 80 (174)
T COG1076 1 SDGFVLFGLPRAFQIDLDALKLQYRELQRAYHPDRFGKASEAEQRKALQQSAEVNPAYQTLKDPLLRAEYLLALADGLDH 80 (174)
T ss_pred CCcccccccHHHHHHHHhHhhhhHHHHHHhhCcccccccchHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHhcccccc
Confidence 379999999999999999999999999999999999999999999999999999999999999999999999998 7765
Q ss_pred cc-cccCCCHHHHHHHHHHHHHHhccCC----HHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHH
Q psy8325 146 EE-DTKGTDQKLLMEILMLNEELDEASS----EEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLV 220 (237)
Q Consensus 146 ~e-~~~~~d~efLmeimE~rE~leea~~----~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~ 220 (237)
.. ..+..++.|+|++|+++|+++.+.. .+.+..+...+...+.++.+.+..++++..|+.|.....+++|+.++.
T Consensus 81 ~~e~~~l~~~~~l~~~~~~~~e~~~~~~~~~~~~~l~~l~~~~~~~~~~i~~~~r~l~~e~~~d~a~~~~~~~e~~~~~~ 160 (174)
T COG1076 81 AKERQTLRDIAFLLEQSELREELEEAREQLDREDALKVLGVEIKADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLK 160 (174)
T ss_pred hhHHHHHHHHHHHHHhhHHHHHHHHHHHcccchhHHHHhcCchhhhHHHHHHHHHHHHHhcCHHHHHHhcCCHHHHHHHH
Confidence 53 3678899999999999999998753 456889999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhhccc
Q psy8325 221 SKVRDKKNQLEL 232 (237)
Q Consensus 221 ~~ik~k~~~l~~ 232 (237)
+++++..+.+++
T Consensus 161 ~~~~~i~~a~~~ 172 (174)
T COG1076 161 EKLQEIQEAYED 172 (174)
T ss_pred HHHHHHHHHHHh
Confidence 999999887764
No 9
>PF07743 HSCB_C: HSCB C-terminal oligomerisation domain; InterPro: IPR009073 This entry represents the C-terminal oligomerisation domain found in HscB (heat shock cognate protein B), which is also known as HSC20 (20K heat shock cognate protein). HscB acts as a co-chaperone to regulate the ATPase activity and peptide-binding specificity of the molecular chaperone HscA, also known as HSC66 (HSP70 class). HscB proteins contain two domains, an N-terminal J-domain, which is involved in interactions with HscA, connected by a short loop to the C-terminal oligomerisation domain; the two domains make contact through a hydrophobic interface. The core of the oligomerisation domain is thought to bind and target proteins to HscA and consists of an open, three-helical bundle []. HscB, along with HscA, has been shown to play a role in the biogenesis of iron-sulphur proteins.; GO: 0006457 protein folding; PDB: 1FPO_C 3BVO_B 3HHO_A 3UO2_B 3UO3_B.
Probab=99.76 E-value=7.6e-18 Score=124.10 Aligned_cols=74 Identities=36% Similarity=0.670 Sum_probs=69.6
Q ss_pred CCHHHHHHHHHHHHHHhcc---CCHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHHHHHH
Q psy8325 152 TDQKLLMEILMLNEELDEA---SSEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVSKVRD 225 (237)
Q Consensus 152 ~d~efLmeimE~rE~leea---~~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~~ik~ 225 (237)
.||+|||+||||||+|+++ .+.+.|..+..+++.++.++.+.|..+|+.+||+.|+..+++|+||.||.++|++
T Consensus 2 ~d~eFLme~mE~rE~le~~~~~~~~~~L~~l~~~~~~~~~~~~~~l~~~f~~~d~~~A~~~~~kLky~~kl~~~ik~ 78 (78)
T PF07743_consen 2 MDPEFLMEQMELREELEEAQNSDDEAELEELKKEIEERIKELIKELAEAFDAKDWEEAKEALRKLKYLQKLLEEIKQ 78 (78)
T ss_dssp S-HHHHHHHHHHHHHHHHHCCCTSHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHhcC
Confidence 6899999999999999999 7789999999999999999999999999999999999999999999999999985
No 10
>COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]
Probab=99.71 E-value=1.5e-17 Score=154.25 Aligned_cols=71 Identities=21% Similarity=0.462 Sum_probs=61.7
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCCc
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNISI 145 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~~ 145 (237)
..|||++|||++++ +.++||+|||+|+++||||+++...+ |++.|++||+||++|+||.+||.|+ .+|...
T Consensus 3 ~~dyYeiLGV~k~A--s~~EIKkAYRkLA~kyHPD~n~g~~~-----AeeKFKEI~eAYEVLsD~eKRa~YD--~fG~~~ 73 (371)
T COG0484 3 KRDYYEILGVSKDA--SEEEIKKAYRKLAKKYHPDRNPGDKE-----AEEKFKEINEAYEVLSDPEKRAAYD--QFGHAG 73 (371)
T ss_pred ccchhhhcCCCCCC--CHHHHHHHHHHHHHHhCCCCCCCCHH-----HHHHHHHHHHHHHHhCCHHHHHHhh--ccCccc
Confidence 47999999999986 88999999999999999999986433 4588999999999999999999997 455443
No 11
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=99.66 E-value=1.8e-16 Score=111.84 Aligned_cols=64 Identities=33% Similarity=0.535 Sum_probs=58.4
Q ss_pred ChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHH
Q psy8325 68 NYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYL 137 (237)
Q Consensus 68 nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yl 137 (237)
|||++|||+++ .+.++|+++|+++++.+|||++++.+ ..+.+.+..||+||++|+||.+|+.|+
T Consensus 1 ~~y~iLgl~~~--~~~~eik~~y~~l~~~~HPD~~~~~~----~~~~~~~~~i~~Ay~~L~~~~~R~~YD 64 (64)
T PF00226_consen 1 NPYEILGLPPD--ASDEEIKKAYRRLSKQYHPDKNSGDE----AEAEEKFARINEAYEILSDPERRRRYD 64 (64)
T ss_dssp HHHHHCTSTTT--SSHHHHHHHHHHHHHHTSTTTGTSTH----HHHHHHHHHHHHHHHHHHSHHHHHHHH
T ss_pred ChHHHCCCCCC--CCHHHHHHHHHhhhhccccccchhhh----hhhhHHHHHHHHHHHHhCCHHHHHhcC
Confidence 69999999996 58999999999999999999998865 446689999999999999999999995
No 12
>KOG0713|consensus
Probab=99.65 E-value=2.3e-16 Score=143.62 Aligned_cols=72 Identities=21% Similarity=0.409 Sum_probs=63.7
Q ss_pred cCCCCCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHh
Q psy8325 61 QKPDSQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLS 139 (237)
Q Consensus 61 qp~~~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~ 139 (237)
+++....|||+|||+|.++ +..+||+|||+|++++||||+++.+.+ .+.|..||.||++|+||.+|..|+-.
T Consensus 10 ~~v~~~rDfYelLgV~k~A--sd~eIKkAYRKLALk~HPDkNpddp~A-----~e~F~~in~AYEVLsDpekRk~YD~~ 81 (336)
T KOG0713|consen 10 EAVLAGRDFYELLGVPKNA--SDQEIKKAYRKLALKYHPDKNPDDPNA-----NEKFKEINAAYEVLSDPEKRKHYDTY 81 (336)
T ss_pred hhhhcCCCHHHHhCCCCCC--CHHHHHHHHHHHHHHhCCCCCCCCHHH-----HHHHHHHHHHHHHhcCHHHHHHHHhh
Confidence 3455568999999999986 788999999999999999999987653 57899999999999999999999753
No 13
>PRK14288 chaperone protein DnaJ; Provisional
Probab=99.58 E-value=2.6e-15 Score=140.12 Aligned_cols=65 Identities=25% Similarity=0.535 Sum_probs=58.2
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++..... |++.|+.||+||++|+||.+|+.|+.
T Consensus 3 ~dyY~vLgv~~~A--s~~eIkkayrkla~k~HPD~~~~~~~-----a~~~f~~i~~AYevLsd~~kR~~YD~ 67 (369)
T PRK14288 3 LSYYEILEVEKHS--NQETIKKSYRKLALKYHPDRNAGDKE-----AEEKFKLINEAYGVLSDEKKRALYDR 67 (369)
T ss_pred CChHHHcCCCCCC--CHHHHHHHHHHHHHHHCCCCCCCccH-----HHHHHHHHHHHHHHhccHHHHHHHHH
Confidence 6999999999985 78899999999999999999865332 45789999999999999999999985
No 14
>KOG0718|consensus
Probab=99.57 E-value=7.1e-15 Score=138.38 Aligned_cols=68 Identities=24% Similarity=0.521 Sum_probs=62.5
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.+||.+||||+.+ +.++|+++||++.+.|||||+..+ .+++.|++.|..|..||++|+||.+|+.|+.
T Consensus 9 ~e~Ya~LNlpkdA--t~eeI~~AYrr~~~lfHPDkh~dp--d~K~~AE~~F~~i~~AyEVLsDp~kRaIYD~ 76 (546)
T KOG0718|consen 9 IELYALLNLPKDA--TDEEIKKAYRRLSRLFHPDKHTDP--DQKKAAEEKFQRIQRAYEVLSDPQKRAIYDN 76 (546)
T ss_pred hhHHHHhCCCccc--CHHHHHHHHHHHHHhcCCcccCCh--hHHHHHHHHHHHHHHHHHHhcChHHHHHHHH
Confidence 5999999999986 778999999999999999999875 4567889999999999999999999999975
No 15
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=99.55 E-value=1.7e-14 Score=100.07 Aligned_cols=60 Identities=23% Similarity=0.502 Sum_probs=53.3
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcch
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLE 132 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~ 132 (237)
.|||++|||+++ .+.++|+++|+++++.+|||++++. ...+.+.+..|++||++|+||.+
T Consensus 1 ~~~y~vLgl~~~--~~~~~ik~ay~~l~~~~HPD~~~~~----~~~~~~~~~~l~~Ay~~L~~~~~ 60 (60)
T smart00271 1 TDYYEILGVPRD--ASLDEIKKAYRKLALKYHPDKNPGD----KEEAEEKFKEINEAYEVLSDPEK 60 (60)
T ss_pred CCHHHHcCCCCC--CCHHHHHHHHHHHHHHHCcCCCCCc----hHHHHHHHHHHHHHHHHHcCCCC
Confidence 489999999997 5889999999999999999999865 34567899999999999999953
No 16
>PRK14296 chaperone protein DnaJ; Provisional
Probab=99.54 E-value=8.7e-15 Score=136.75 Aligned_cols=64 Identities=20% Similarity=0.388 Sum_probs=57.4
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++..+. |++.|+.||+||++|+||.+|+.|+.
T Consensus 4 ~dyY~~Lgv~~~a--~~~eik~ayrkla~~~HPD~n~~~~------a~~~F~~i~~AyevLsD~~KR~~YD~ 67 (372)
T PRK14296 4 KDYYEVLGVSKTA--SEQEIRQAYRKLAKQYHPDLNKSPD------AHDKMVEINEAADVLLDKDKRKQYDQ 67 (372)
T ss_pred CCHHHhcCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCch------HHHHHHHHHHHHHHhcCHHHhhhhhh
Confidence 6999999999985 7889999999999999999986421 45789999999999999999999985
No 17
>PRK14286 chaperone protein DnaJ; Provisional
Probab=99.54 E-value=1.1e-14 Score=135.97 Aligned_cols=65 Identities=28% Similarity=0.522 Sum_probs=58.1
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++.+..+ |++.|++||+||++|+||.+|+.|+.
T Consensus 4 ~d~y~~Lgv~~~a--~~~eik~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~kR~~YD~ 68 (372)
T PRK14286 4 RSYYDILGVSKSA--NDEEIKSAYRKLAIKYHPDKNKGNKE-----SEEKFKEATEAYEILRDPKKRQAYDQ 68 (372)
T ss_pred CCHHHhcCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCchH-----HHHHHHHHHHHHHHhccHHHHHHHHH
Confidence 6999999999986 67899999999999999999865322 45789999999999999999999985
No 18
>PRK14282 chaperone protein DnaJ; Provisional
Probab=99.52 E-value=1.9e-14 Score=134.24 Aligned_cols=67 Identities=21% Similarity=0.526 Sum_probs=58.7
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
..|||++|||++++ +.++|+++||+|+++||||+++.. +..|++.|+.||+||++|+||.+|+.|+.
T Consensus 3 ~~d~y~~lgv~~~a--~~~eik~ayr~la~~~HPD~~~~~----~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~ 69 (369)
T PRK14282 3 KKDYYEILGVSRNA--TQEEIKRAYKRLVKEWHPDRHPEN----RKEAEQKFKEIQEAYEVLSDPQKRAMYDR 69 (369)
T ss_pred CCChHHhcCCCCCC--CHHHHHHHHHHHHHHHCCCCCccc----hhHHHHHHHHHHHHHHHhcChhhHHHHhh
Confidence 36999999999975 788999999999999999998643 12256889999999999999999999985
No 19
>PRK14279 chaperone protein DnaJ; Provisional
Probab=99.52 E-value=1.8e-14 Score=135.48 Aligned_cols=65 Identities=20% Similarity=0.384 Sum_probs=58.3
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++.+... |.+.|..|++||++|+||.+|+.|+.
T Consensus 9 ~Dyy~~Lgv~~~a--~~~eik~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vLsD~~KR~~YD~ 73 (392)
T PRK14279 9 KDFYKELGVSSDA--SAEEIKKAYRKLARELHPDANPGDPA-----AEERFKAVSEAHDVLSDPAKRKEYDE 73 (392)
T ss_pred cCHHHhcCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCChH-----HHHHHHHHHHHHHHhcchhhhhHHHH
Confidence 6999999999985 78899999999999999999865322 45789999999999999999999986
No 20
>PRK14295 chaperone protein DnaJ; Provisional
Probab=99.52 E-value=1.9e-14 Score=135.15 Aligned_cols=65 Identities=20% Similarity=0.430 Sum_probs=58.0
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++..... |++.|+.||+||++|+||.+|+.|+.
T Consensus 9 ~d~y~~Lgv~~~a--~~~eik~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~~r~~yD~ 73 (389)
T PRK14295 9 KDYYKVLGVPKDA--TEAEIKKAYRKLAREYHPDANKGDAK-----AEERFKEISEAYDVLSDEKKRKEYDE 73 (389)
T ss_pred cCHHHhcCCCCCC--CHHHHHHHHHHHHHHHCCCcCCCchh-----HHHHHHHHHHHHHHHCchhhHHHHHH
Confidence 5999999999975 78899999999999999999865432 45788999999999999999999985
No 21
>PRK14297 chaperone protein DnaJ; Provisional
Probab=99.52 E-value=1.7e-14 Score=135.06 Aligned_cols=65 Identities=26% Similarity=0.542 Sum_probs=58.2
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||+++.... |++.|+.||+||++|+||.+|+.|+.
T Consensus 4 ~d~y~~Lgv~~~a--~~~~ik~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~~r~~yD~ 68 (380)
T PRK14297 4 KDYYEVLGLEKGA--SDDEIKKAFRKLAIKYHPDKNKGNKE-----AEEKFKEINEAYQVLSDPQKKAQYDQ 68 (380)
T ss_pred CChHHhhCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCcHH-----HHHHHHHHHHHHHHhcCHhhhCchhh
Confidence 6999999999986 77899999999999999999875432 45789999999999999999999985
No 22
>PRK14301 chaperone protein DnaJ; Provisional
Probab=99.51 E-value=2.1e-14 Score=134.18 Aligned_cols=65 Identities=22% Similarity=0.443 Sum_probs=58.1
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++++..+ |++.|+.|++||++|+||.+|+.|+.
T Consensus 4 ~~~y~~Lgv~~~a--~~~~ik~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~kr~~yD~ 68 (373)
T PRK14301 4 RDYYEVLGVSRDA--SEDEIKKAYRKLALQYHPDRNPDNPE-----AEQKFKEAAEAYEVLRDAEKRARYDR 68 (373)
T ss_pred CChHHhcCCCCCC--CHHHHHHHHHHHHHHhCCCcCCCChH-----HHHHHHHHHHHHHHhcchhhhhhhhh
Confidence 6999999999975 78899999999999999999876433 44678999999999999999999985
No 23
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=99.51 E-value=6e-14 Score=95.57 Aligned_cols=55 Identities=25% Similarity=0.555 Sum_probs=49.4
Q ss_pred ChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcC
Q psy8325 68 NYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQN 129 (237)
Q Consensus 68 nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~d 129 (237)
|||++||++++ .+.++|+++|++|++.+|||++++. ..+.+.+..|++||++|+|
T Consensus 1 ~~y~vLgl~~~--~~~~~ik~~y~~l~~~~HPD~~~~~-----~~~~~~~~~l~~Ay~~L~d 55 (55)
T cd06257 1 DYYDILGVPPD--ASDEEIKKAYRKLALKYHPDKNPDD-----PEAEEKFKEINEAYEVLSD 55 (55)
T ss_pred ChHHHcCCCCC--CCHHHHHHHHHHHHHHHCcCCCCCc-----HHHHHHHHHHHHHHHHhcC
Confidence 79999999976 5889999999999999999999875 4456889999999999986
No 24
>PRK14294 chaperone protein DnaJ; Provisional
Probab=99.50 E-value=3.3e-14 Score=132.54 Aligned_cols=66 Identities=23% Similarity=0.562 Sum_probs=58.4
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
..|||++|||+++ ++.++|+++||+|+++||||+++...+ +.+.|+.|++||++|+||.+|+.|+.
T Consensus 3 ~~d~y~~lgv~~~--a~~~eik~ayr~la~~~HPD~~~~~~~-----~~~~f~~~~~Ay~vL~d~~~r~~yD~ 68 (366)
T PRK14294 3 KRDYYEILGVTRD--ASEEEIKKSYRKLAMKYHPDRNPGDKE-----AEELFKEAAEAYEVLSDPKKRGIYDQ 68 (366)
T ss_pred CCChHHHhCCCCC--CCHHHHHHHHHHHHHHHCCCCCCCchH-----HHHHHHHHHHHHHHhccHHHHHHHHh
Confidence 3699999999996 588999999999999999999875432 45679999999999999999999985
No 25
>KOG0716|consensus
Probab=99.50 E-value=5e-14 Score=125.08 Aligned_cols=67 Identities=27% Similarity=0.484 Sum_probs=59.9
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHh
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLS 139 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~ 139 (237)
..|+|++||||++ ++.++||++||.|++++|||+.+..++. .+.|..||+||++|+||.+|-.|+--
T Consensus 30 ~~~LYdVLgl~k~--at~d~IKKaYR~L~~k~HPD~~gd~P~~-----~dkf~eIN~Ay~ILsD~~kR~~YD~~ 96 (279)
T KOG0716|consen 30 RLDLYDVLGLPKT--ATKDEIKKAYRKLALKYHPDKNGDNPEA-----TDKFKEINTAYAILSDPTKRNVYDEY 96 (279)
T ss_pred hhHHHHHhCCCcc--cchHHHHHHHHHHHHHhCCCcCCCCchh-----HHHHHHHHHHHHHhcChhhhhhHHHh
Confidence 5799999999996 4788999999999999999999987543 47899999999999999999999753
No 26
>PRK14287 chaperone protein DnaJ; Provisional
Probab=99.50 E-value=3.3e-14 Score=132.78 Aligned_cols=64 Identities=20% Similarity=0.428 Sum_probs=56.8
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++..+ . +++.|+.||+||++|+||.+|+.|+.
T Consensus 4 ~d~y~~Lgv~~~a--~~~eik~ayr~la~~~HpD~~~~~-~-----~~~~f~~i~~Ay~~L~d~~kR~~YD~ 67 (371)
T PRK14287 4 RDYYEVLGVDRNA--SVDEVKKAYRKLARKYHPDVNKAP-D-----AEDKFKEVKEAYDTLSDPQKKAHYDQ 67 (371)
T ss_pred CCHHHhcCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCh-h-----HHHHHHHHHHHHHHhCcHhHHHHHHh
Confidence 5999999999986 677999999999999999997632 1 44688999999999999999999985
No 27
>PRK14277 chaperone protein DnaJ; Provisional
Probab=99.50 E-value=3.5e-14 Score=133.19 Aligned_cols=65 Identities=25% Similarity=0.532 Sum_probs=58.2
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||+++ ++.++|+++||+|+++||||++..... |++.|+.||+||++|+||.+|+.|+.
T Consensus 5 ~d~y~~Lgv~~~--a~~~eik~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~kr~~yD~ 69 (386)
T PRK14277 5 KDYYEILGVDRN--ATEEEIKKAYRRLAKKYHPDLNPGDKE-----AEQKFKEINEAYEILSDPQKRAQYDQ 69 (386)
T ss_pred CCHHHhcCCCCC--CCHHHHHHHHHHHHHHHCCCcCCCchH-----HHHHHHHHHHHHHHhCCHHHHHHHHh
Confidence 699999999997 588999999999999999999875432 45789999999999999999999986
No 28
>PRK14298 chaperone protein DnaJ; Provisional
Probab=99.49 E-value=3.5e-14 Score=132.91 Aligned_cols=64 Identities=25% Similarity=0.486 Sum_probs=57.0
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++.+ +.++|+++||+|+++||||+++.+. +.+.|..||+||++|+||.+|+.|+.
T Consensus 5 ~d~y~iLgv~~~a--~~~eik~ayr~la~~~HPD~~~~~~------~~~~f~~i~~Ay~vL~d~~kR~~YD~ 68 (377)
T PRK14298 5 RDYYEILGLSKDA--SVEDIKKAYRKLAMKYHPDKNKEPD------AEEKFKEISEAYAVLSDAEKRAQYDR 68 (377)
T ss_pred CCHHHhhCCCCCC--CHHHHHHHHHHHHHHhCccccCChh------HHHHHHHHHHHHHHhcchHhhhhhhh
Confidence 5999999999975 7889999999999999999986422 34688999999999999999999985
No 29
>PRK14299 chaperone protein DnaJ; Provisional
Probab=99.49 E-value=4.9e-14 Score=127.62 Aligned_cols=64 Identities=27% Similarity=0.517 Sum_probs=57.3
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|++++|||++.++. +.+.|+.||+||++|+||.+|+.|+.
T Consensus 4 ~d~y~vLgv~~~a--~~~eik~ayr~la~~~HPD~~~~~~------~~~~f~~i~~Ay~~L~d~~kr~~yD~ 67 (291)
T PRK14299 4 KDYYAILGVPKNA--SQDEIKKAFKKLARKYHPDVNKSPG------AEEKFKEINEAYTVLSDPEKRRIYDT 67 (291)
T ss_pred CCHHHHcCCCCCC--CHHHHHHHHHHHHHHHCcCCCCChh------HHHHHHHHHHHHHHhcCHHHHHHHHh
Confidence 6999999999975 8899999999999999999986422 34678999999999999999999986
No 30
>PRK14284 chaperone protein DnaJ; Provisional
Probab=99.48 E-value=5.6e-14 Score=132.06 Aligned_cols=65 Identities=23% Similarity=0.456 Sum_probs=58.0
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||+++ ++.++|+++||+|+++||||++++... |++.|+.||+||++|+||.+|+.|+.
T Consensus 1 ~d~y~iLgv~~~--a~~~eikkayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~kR~~YD~ 65 (391)
T PRK14284 1 MDYYTILGVSKT--ASPEEIKKAYRKLAVKYHPDKNPGDAE-----AEKRFKEVSEAYEVLSDAQKRESYDR 65 (391)
T ss_pred CCHHHhcCCCCC--CCHHHHHHHHHHHHHHHCcCCCCCchH-----HHHHHHHHHHHHHHhcCHHHHHHHHh
Confidence 489999999997 488999999999999999999876433 45778999999999999999999985
No 31
>PRK14276 chaperone protein DnaJ; Provisional
Probab=99.48 E-value=5.5e-14 Score=131.66 Aligned_cols=64 Identities=23% Similarity=0.457 Sum_probs=57.4
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++.+ +.++|+++||+|+++||||+++++. |.+.|+.||+||++|+||.+|+.|+.
T Consensus 4 ~d~y~~Lgv~~~a--~~~eik~ayr~la~~~HpD~~~~~~------a~~~f~~i~~Ay~vL~d~~kR~~YD~ 67 (380)
T PRK14276 4 TEYYDRLGVSKDA--SQDEIKKAYRKLSKKYHPDINKEPG------AEEKYKEVQEAYETLSDPQKRAAYDQ 67 (380)
T ss_pred CCHHHhhCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCcC------HHHHHHHHHHHHHHhcCHhhhhhHhh
Confidence 5999999999975 7889999999999999999987532 34678999999999999999999985
No 32
>PRK14281 chaperone protein DnaJ; Provisional
Probab=99.48 E-value=5.1e-14 Score=132.63 Aligned_cols=65 Identities=25% Similarity=0.483 Sum_probs=58.2
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++++... |++.|+.||+||++|+||.+|+.|+.
T Consensus 3 ~d~y~iLgv~~~a--~~~eikkayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~~r~~yD~ 67 (397)
T PRK14281 3 RDYYEVLGVSRSA--DKDEIKKAYRKLALKYHPDKNPDNKE-----AEEHFKEVNEAYEVLSNDDKRRRYDQ 67 (397)
T ss_pred CChhhhcCCCCCC--CHHHHHHHHHHHHHHHCCCcCCCchH-----HHHHHHHHHHHHHHhhhhhhhhhhhh
Confidence 5999999999985 88999999999999999999875422 45789999999999999999999985
No 33
>PRK14280 chaperone protein DnaJ; Provisional
Probab=99.48 E-value=6.2e-14 Score=131.14 Aligned_cols=64 Identities=19% Similarity=0.454 Sum_probs=57.4
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++..+. |.+.|+.||+||++|+||.+|+.|+.
T Consensus 4 ~~~y~iLgv~~~a--~~~eik~ayr~la~~~HpD~~~~~~------a~~~f~~i~~Ay~vL~d~~kr~~yD~ 67 (376)
T PRK14280 4 RDYYEVLGVSKSA--SKDEIKKAYRKLSKKYHPDINKEEG------ADEKFKEISEAYEVLSDDQKRAQYDQ 67 (376)
T ss_pred CChHHhhCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCcc------HHHHHHHHHHHHHHhccHhHHHHHHh
Confidence 5999999999975 7889999999999999999986532 34678999999999999999999986
No 34
>PRK14291 chaperone protein DnaJ; Provisional
Probab=99.48 E-value=6.4e-14 Score=131.29 Aligned_cols=64 Identities=27% Similarity=0.534 Sum_probs=57.5
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||+++++. +.+.|+.|++||++|+||.+|+.|+.
T Consensus 3 ~d~Y~~Lgv~~~a--~~~~ik~ayr~la~~~HPD~~~~~~------~~~~f~~i~~Ay~vLsd~~kR~~YD~ 66 (382)
T PRK14291 3 KDYYEILGVSRNA--TQEEIKKAYRRLARKYHPDFNKNPE------AEEKFKEINEAYQVLSDPEKRKLYDQ 66 (382)
T ss_pred CCHHHhhCCCCCC--CHHHHHHHHHHHHHHHCCCCCCCcc------HHHHHHHHHHHHHHhcCHHHHHHHhh
Confidence 6999999999975 7889999999999999999987532 34678999999999999999999986
No 35
>PRK14285 chaperone protein DnaJ; Provisional
Probab=99.47 E-value=7.5e-14 Score=130.17 Aligned_cols=65 Identities=22% Similarity=0.442 Sum_probs=58.0
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++....+ |.+.|..|++||++|+||.+|+.|+.
T Consensus 3 ~d~y~iLgv~~~a--~~~eIk~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~vL~d~~kr~~yd~ 67 (365)
T PRK14285 3 RDYYEILGLSKGA--SKDEIKKAYRKIAIKYHPDKNKGNKE-----AESIFKEATEAYEVLIDDNKRAQYDR 67 (365)
T ss_pred CCHHHhcCCCCCC--CHHHHHHHHHHHHHHHCCCCCCCCHH-----HHHHHHHHHHHHHHHcCcchhHHHHh
Confidence 5999999999975 88899999999999999999865432 45789999999999999999999986
No 36
>PRK10767 chaperone protein DnaJ; Provisional
Probab=99.47 E-value=8e-14 Score=130.06 Aligned_cols=65 Identities=22% Similarity=0.512 Sum_probs=58.1
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++++... |.+.|..||+||++|+||.+|+.|+.
T Consensus 4 ~d~y~iLgv~~~a--s~~eik~ayr~la~~~HPD~~~~~~~-----a~~~f~~i~~Ay~~L~d~~~r~~yd~ 68 (371)
T PRK10767 4 RDYYEVLGVSRNA--SEDEIKKAYRKLAMKYHPDRNPGDKE-----AEEKFKEIKEAYEVLSDPQKRAAYDQ 68 (371)
T ss_pred CChHHhcCCCCCC--CHHHHHHHHHHHHHHHCCCCCCCcHH-----HHHHHHHHHHHHHHhcchhhhhHhhh
Confidence 6999999999974 78899999999999999999865322 45789999999999999999999985
No 37
>PTZ00341 Ring-infected erythrocyte surface antigen; Provisional
Probab=99.46 E-value=1.6e-12 Score=131.60 Aligned_cols=83 Identities=16% Similarity=0.244 Sum_probs=66.1
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCCc
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNISI 145 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~~ 145 (237)
..+||++|||++++ +..+|+++||+|++++|||+++++ . |...|+.|++||++|+||.+|+.|+. +|...
T Consensus 572 d~dYYdILGVs~dA--S~~EIKKAYRKLAlkyHPDKN~~~-~-----A~ekFq~I~EAYeVLSDp~kRk~YD~--~G~~G 641 (1136)
T PTZ00341 572 DTLFYDILGVGVNA--DMKEISERYFKLAENYYPPKRSGN-E-----GFHKFKKINEAYQILGDIDKKKMYNK--FGYDG 641 (1136)
T ss_pred CCChHHHcCCCCCC--CHHHHHHHHHHHHHHhCCCCCCCc-h-----HHHHHHHHHHHHHHhCCHHHHHHHhh--ccccc
Confidence 47999999999986 888999999999999999998753 1 45789999999999999999999975 45432
Q ss_pred cccccCCCHHHHH
Q psy8325 146 EEDTKGTDQKLLM 158 (237)
Q Consensus 146 ~e~~~~~d~efLm 158 (237)
..+....||.+..
T Consensus 642 l~~~~~iDP~~Ff 654 (1136)
T PTZ00341 642 IKGVNFIHPSIFY 654 (1136)
T ss_pred cCCCCccCHHHHH
Confidence 2223456665433
No 38
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=99.45 E-value=8.5e-14 Score=132.04 Aligned_cols=62 Identities=23% Similarity=0.482 Sum_probs=56.0
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
..|||++|||++++ +.++||++||+|+++||||++.+ .+.|..||+||++|+||.+|+.|+.
T Consensus 27 ~~d~Y~vLGV~~~A--s~~eIKkAYrkla~k~HPDk~~~---------~e~F~~i~~AYevLsD~~kR~~YD~ 88 (421)
T PTZ00037 27 NEKLYEVLNLSKDC--TTSEIKKAYRKLAIKHHPDKGGD---------PEKFKEISRAYEVLSDPEKRKIYDE 88 (421)
T ss_pred chhHHHHcCCCCCC--CHHHHHHHHHHHHHHHCCCCCch---------HHHHHHHHHHHHHhccHHHHHHHhh
Confidence 46999999999975 88999999999999999999742 2578999999999999999999985
No 39
>PRK14292 chaperone protein DnaJ; Provisional
Probab=99.45 E-value=1.3e-13 Score=128.64 Aligned_cols=64 Identities=23% Similarity=0.474 Sum_probs=57.4
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|++++|||++..+. |.+.|..||+||++|+||.+|+.|+.
T Consensus 2 ~d~y~~Lgv~~~a--~~~~ik~ayr~l~~~~hpD~~~~~~------a~~~~~~i~~Ay~vL~d~~~r~~yd~ 65 (371)
T PRK14292 2 MDYYELLGVSRTA--SADEIKSAYRKLALKYHPDRNKEKG------AAEKFAQINEAYAVLSDAEKRAHYDR 65 (371)
T ss_pred CChHHHcCCCCCC--CHHHHHHHHHHHHHHHCCCCCCChh------HHHHHHHHHHHHHHhcchhhhhhHhh
Confidence 5999999999985 7889999999999999999987422 45788999999999999999999976
No 40
>PRK14290 chaperone protein DnaJ; Provisional
Probab=99.45 E-value=1.4e-13 Score=128.23 Aligned_cols=66 Identities=26% Similarity=0.514 Sum_probs=58.4
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||+++ .+.++|+++||+|++++|||+++.. ...|.+.|+.||+||++|+||.+|+.|+.
T Consensus 3 ~d~y~vLgv~~~--a~~~eik~ayr~la~~~HPD~~~~~----~~~a~~~f~~i~~Ay~~L~d~~~r~~yd~ 68 (365)
T PRK14290 3 KDYYKILGVDRN--ASQEDIKKAFRELAKKWHPDLHPGN----KAEAEEKFKEISEAYEVLSDPQKRRQYDQ 68 (365)
T ss_pred CChhhhcCCCCC--CCHHHHHHHHHHHHHHHCcCCCCCc----hhHHHHHHHHHHHHHHHhcChhhhhhhcc
Confidence 599999999997 4889999999999999999997643 22356889999999999999999999975
No 41
>PRK14293 chaperone protein DnaJ; Provisional
Probab=99.45 E-value=1.5e-13 Score=128.55 Aligned_cols=64 Identities=23% Similarity=0.521 Sum_probs=57.4
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++. ++.++|+++||+|++++|||+++.+. +.+.|+.||+||++|+||.+|+.|+.
T Consensus 3 ~d~y~vLgv~~~--a~~~eik~ayr~la~~~HPD~~~~~~------a~~~f~~i~~Ay~vL~~~~~R~~yd~ 66 (374)
T PRK14293 3 ADYYEILGVSRD--ADKDELKRAYRRLARKYHPDVNKEPG------AEDRFKEINRAYEVLSDPETRARYDQ 66 (374)
T ss_pred CChhhhcCCCCC--CCHHHHHHHHHHHHHHHCCCCCCCcC------HHHHHHHHHHHHHHHhchHHHHHHhh
Confidence 599999999996 58899999999999999999987533 34678999999999999999999986
No 42
>PRK14278 chaperone protein DnaJ; Provisional
Probab=99.44 E-value=1.1e-13 Score=129.49 Aligned_cols=64 Identities=22% Similarity=0.494 Sum_probs=57.3
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++..+ .|++.|+.||+||++|+||.+|+.|+.
T Consensus 3 ~d~y~iLgv~~~a--~~~eik~ayr~la~~~hpD~~~~~------~a~~~f~~i~~Ay~vL~d~~~r~~YD~ 66 (378)
T PRK14278 3 RDYYGLLGVSRNA--SDAEIKRAYRKLARELHPDVNPDE------EAQEKFKEISVAYEVLSDPEKRRIVDL 66 (378)
T ss_pred CCcceecCCCCCC--CHHHHHHHHHHHHHHHCCCCCCcH------HHHHHHHHHHHHHHHhchhhhhhhhhc
Confidence 5999999999975 789999999999999999998631 245789999999999999999999985
No 43
>PRK14283 chaperone protein DnaJ; Provisional
Probab=99.44 E-value=1.6e-13 Score=128.43 Aligned_cols=65 Identities=20% Similarity=0.432 Sum_probs=58.0
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
..|||++|||++++ +.++|+++||+|+++||||++..+ .|.+.|+.|++||++|+||.+|+.|+.
T Consensus 4 ~~d~y~~Lgv~~~a--~~~eik~ayr~la~~~HPD~~~~~------~a~~~f~~i~~Ay~~Lsd~~kR~~YD~ 68 (378)
T PRK14283 4 KRDYYEVLGVDRNA--DKKEIKKAYRKLARKYHPDVSEEE------GAEEKFKEISEAYAVLSDDEKRQRYDQ 68 (378)
T ss_pred cCChHHhhCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCc------cHHHHHHHHHHHHHHhchhHHHHHHhh
Confidence 36999999999975 789999999999999999997642 245789999999999999999999986
No 44
>PRK14300 chaperone protein DnaJ; Provisional
Probab=99.44 E-value=1.7e-13 Score=128.11 Aligned_cols=64 Identities=30% Similarity=0.521 Sum_probs=57.1
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||++.... +++.|+.||+||++|+||.+|+.|+.
T Consensus 3 ~~~y~iLgv~~~a--s~~eik~ayr~la~~~HPD~~~~~~------~~~~f~~i~~Ay~~L~d~~~r~~yD~ 66 (372)
T PRK14300 3 QDYYQILGVSKTA--SQADLKKAYLKLAKQYHPDTTDAKD------AEKKFKEINAAYDVLKDEQKRAAYDR 66 (372)
T ss_pred CChHHHcCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCcC------HHHHHHHHHHHHHHhhhHhHhhHHHh
Confidence 5999999999974 7899999999999999999986432 34678999999999999999999986
No 45
>PRK14289 chaperone protein DnaJ; Provisional
Probab=99.43 E-value=2.3e-13 Score=127.61 Aligned_cols=65 Identities=25% Similarity=0.437 Sum_probs=58.2
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++++ +.++|+++||+|+++||||+++.... |.+.|+.|++||++|+||.+|+.|+.
T Consensus 5 ~~~y~~Lgv~~~a--~~~eik~ayr~la~~~HpD~~~~~~~-----a~~~f~~i~~Ay~~L~d~~~R~~yD~ 69 (386)
T PRK14289 5 RDYYEVLGVSKTA--TVDEIKKAYRKKAIQYHPDKNPGDKE-----AEEKFKEAAEAYDVLSDPDKRSRYDQ 69 (386)
T ss_pred CCHHHHcCCCCCC--CHHHHHHHHHHHHHHHCCCCCCCChH-----HHHHHHHHHHHHHHhcCHHHHHHHHH
Confidence 6999999999986 77899999999999999999875432 45788999999999999999999986
No 46
>PRK10266 curved DNA-binding protein CbpA; Provisional
Probab=99.43 E-value=2.5e-13 Score=123.84 Aligned_cols=64 Identities=20% Similarity=0.362 Sum_probs=57.1
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
.|||++|||++.+ +.++|+++||+|++++|||++... .+.+.|+.||+||++|+||.+|+.|+.
T Consensus 4 ~d~y~~Lgv~~~a--~~~eik~ayr~la~k~HPD~~~~~------~~~~~f~~i~~Ay~~L~~~~kr~~yD~ 67 (306)
T PRK10266 4 KDYYAIMGVKPTD--DLKTIKTAYRRLARKYHPDVSKEP------DAEARFKEVAEAWEVLSDEQRRAEYDQ 67 (306)
T ss_pred CChHHHcCCCCCC--CHHHHHHHHHHHHHHHCcCCCCCc------cHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 5999999999975 788999999999999999996532 245789999999999999999999986
No 47
>TIGR02349 DnaJ_bact chaperone protein DnaJ. This model represents bacterial forms of DnaJ, part of the DnaK-DnaJ-GrpE chaperone system. The three components typically are encoded by consecutive genes. DnaJ homologs occur in many genomes, typically not near DnaK and GrpE-like genes; most such genes are not included by this family. Eukaryotic (mitochondrial and chloroplast) forms are not included in the scope of this family.
Probab=99.42 E-value=2.4e-13 Score=126.01 Aligned_cols=63 Identities=25% Similarity=0.566 Sum_probs=56.2
Q ss_pred ChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 68 NYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 68 nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
|||++|||++++ +.++|+++||+|++++|||+++.+. +.+.|+.||+||++|+||.+|+.|+.
T Consensus 1 d~y~~Lgv~~~a--~~~~ik~ayr~la~~~HPD~~~~~~------~~~~f~~i~~Ay~vL~d~~~R~~yd~ 63 (354)
T TIGR02349 1 DYYEILGVSKDA--SEEEIKKAYRKLAKKYHPDRNKDKE------AEEKFKEINEAYEVLSDPEKRAQYDQ 63 (354)
T ss_pred ChHHhCCCCCCC--CHHHHHHHHHHHHHHHCCCCCCCcc------HHHHHHHHHHHHHHhhChHHHHhhhh
Confidence 799999999975 8889999999999999999987321 34688999999999999999999975
No 48
>KOG0712|consensus
Probab=99.41 E-value=2.4e-13 Score=124.89 Aligned_cols=63 Identities=22% Similarity=0.489 Sum_probs=57.6
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
...||++|||++.+ +.++||++||+|+++|||||+++ +.+.|+.|+.||++|+||.+|..|+.
T Consensus 3 ~~~~y~il~v~~~A--s~~eikkayrkla~k~HpDkn~~--------~~ekfkei~~AyevLsd~ekr~~yD~ 65 (337)
T KOG0712|consen 3 NTKLYDILGVSPDA--SEEEIKKAYRKLALKYHPDKNPD--------AGEKFKEISQAYEVLSDPEKREIYDQ 65 (337)
T ss_pred ccccceeeccCCCc--CHHHHHHHHHHHHHHhCCCCCcc--------HHHHHHHHHHHHHHhcCHHHHHHHHh
Confidence 35899999999964 88999999999999999999988 34789999999999999999999975
No 49
>KOG0719|consensus
Probab=99.35 E-value=1.6e-11 Score=107.32 Aligned_cols=68 Identities=25% Similarity=0.464 Sum_probs=61.5
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
..|.|++|||.+++ +..+|+++||+|++++|||+.. +..+..|.+.|..+++||++|+|..+|+.|+-
T Consensus 13 ~~d~YevLGVer~a--~~~eIrkAY~klal~~HPDk~~---eed~~ea~~kFq~l~k~y~iLsDeekR~~YDe 80 (264)
T KOG0719|consen 13 KKDLYEVLGVERDA--TDKEIRKAYHKLALRLHPDKNH---EEDKVEATEKFQQLQKAYQILSDEEKRAVYDE 80 (264)
T ss_pred ccCHHHHhhhcccC--CHHHHHHHHHHHHHHhCCCcch---hhhHHHHHHHHHHHHHHHHHhhHHHHHHHHhc
Confidence 35999999999985 8899999999999999999986 45566788999999999999999999999964
No 50
>KOG0715|consensus
Probab=99.35 E-value=1.6e-12 Score=117.92 Aligned_cols=70 Identities=21% Similarity=0.362 Sum_probs=62.3
Q ss_pred CCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcC
Q psy8325 65 SQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQN 142 (237)
Q Consensus 65 ~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g 142 (237)
...|||++||+++++ +..+||.+|++|+++||||-+..+ .+.+.|..|.+||++|+|+.+|.+|+..+.+
T Consensus 41 ~~~d~Y~vLgv~~~A--t~~EIK~Af~~LaKkyHPD~n~~~------~a~~kF~eI~~AYEiLsd~eKR~~YD~~~~~ 110 (288)
T KOG0715|consen 41 SKEDYYKVLGVSRNA--TLSEIKSAFRKLAKKYHPDVNKDK------EASKKFKEISEAYEILSDEEKRQEYDVYGLE 110 (288)
T ss_pred CCcchhhhhCcCCCC--CHHHHHHHHHHHHHhhCCCCCCCc------chhhHHHHHHHHHHHhcCHHHHHHHHHhhhh
Confidence 334999999999986 788999999999999999999877 2457899999999999999999999987654
No 51
>KOG0717|consensus
Probab=99.33 E-value=1.9e-12 Score=122.06 Aligned_cols=69 Identities=23% Similarity=0.478 Sum_probs=60.1
Q ss_pred CCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHh
Q psy8325 65 SQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLS 139 (237)
Q Consensus 65 ~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~ 139 (237)
....||++|||.... |..+|+++||+|+++||||+++..- +-|.+.|++|+.||++|+||..|+=|+-.
T Consensus 6 ~~~c~YE~L~v~~~a--~d~eik~~YRklALq~HPDknpd~i----eeat~~F~~i~aAYeVLSdp~eR~wyd~h 74 (508)
T KOG0717|consen 6 KKRCYYEVLGVERDA--DDDEIKKNYRKLALQYHPDKNPDRI----EEATQQFQLIQAAYEVLSDPQERAWYDSH 74 (508)
T ss_pred hhhHHHHHhcccccC--CHHHHHHHHHHHHHhhCCCCCCccH----HHHHHHHHHHHHHHHHhcChHhhhhHHHH
Confidence 357899999999975 7889999999999999999976542 23679999999999999999999999753
No 52
>KOG0691|consensus
Probab=99.33 E-value=9.3e-12 Score=112.99 Aligned_cols=69 Identities=25% Similarity=0.429 Sum_probs=62.0
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhc
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQ 141 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~ 141 (237)
..|||++||+.+.. +..+|+++||..++.+||||+++.+. |.+.|..+.+||++|+||..|+.|+--+.
T Consensus 4 ~~dyY~lLgi~~~a--t~~eIkKaYr~kaL~~HPDKNp~dP~-----A~ekFq~L~eAy~VL~D~~~R~~YDk~~k 72 (296)
T KOG0691|consen 4 DTDYYDLLGISEDA--TDAEIKKAYRKKALQYHPDKNPGDPQ-----AAEKFQELSEAYEVLSDEESRAAYDKLRK 72 (296)
T ss_pred cchHHHHhCCCCCC--CHHHHHHHHHHHHHhcCCCCCCCChH-----HHHHHHHHHHHHHHhcCHHHHHHHHHHhh
Confidence 57999999999875 78999999999999999999998765 45889999999999999999999986543
No 53
>TIGR03835 termin_org_DnaJ terminal organelle assembly protein TopJ. This model describes TopJ (MG_200, CbpA), a DnaJ homolog and probable assembly protein of the Mycoplasma terminal organelle. The terminal organelle is involved in both cytadherence and gliding motility.
Probab=99.32 E-value=5.3e-12 Score=125.43 Aligned_cols=65 Identities=23% Similarity=0.492 Sum_probs=57.5
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHh
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLS 139 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~ 139 (237)
.|||++|||+++ ++..+|+++||+|++++|||++.+. . +.+.|..||+||++|+||.+|+.|+..
T Consensus 2 ~DYYeVLGVs~d--AS~eEIKKAYRKLAKKyHPDKn~~~-e-----AeekFqeINEAYEVLSDP~KRa~YD~f 66 (871)
T TIGR03835 2 RDYYEVLGIDRD--ADEQEIKKAFRKLAKKYHPDRNKAP-D-----AASIFAEINEANDVLSNPKKRANYDKY 66 (871)
T ss_pred CChhHhcCCCCC--CCHHHHHHHHHHHHHHHCcCCCCCh-h-----HHHHHHHHHHHHHHhCCHHHHHHHhhh
Confidence 599999999996 4889999999999999999998762 1 346789999999999999999999864
No 54
>COG2214 CbpA DnaJ-class molecular chaperone [Posttranslational modification, protein turnover, chaperones]
Probab=99.30 E-value=5.2e-12 Score=105.59 Aligned_cols=67 Identities=24% Similarity=0.474 Sum_probs=60.7
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
..+||++|||+++ .+..+|+++||++++++|||++..... .+.+.|..||+||.+|+||..|+.|+.
T Consensus 5 ~~~~y~iLgv~~~--as~~eik~ayrkla~~~HPD~~~~~~~----~a~~~f~~i~~Ay~vLsd~~~r~~yd~ 71 (237)
T COG2214 5 LLDYYEILGVPPN--ASLEEIKKAYRKLALKYHPDRNPGDPK----VAEEKFKEINEAYEILSDPERRAEYDK 71 (237)
T ss_pred hhhHHHHhCCCCC--CCHHHHHHHHHHHHHHhCCCCCCCchh----HHHHHHHHHHHHHHHhhCHHHHHHhhh
Confidence 4799999999976 488999999999999999999987654 456889999999999999999999987
No 55
>KOG0721|consensus
Probab=99.24 E-value=1.7e-11 Score=106.01 Aligned_cols=70 Identities=20% Similarity=0.359 Sum_probs=60.4
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCC
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNIS 144 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~ 144 (237)
..|+|++|||+++. +.++||++||+|+.++||||.+..+..| +.+..|++||+.|.|+.+|..|.. .|.+
T Consensus 98 ~fDPyEILGl~pga--s~~eIKkaYR~LSik~HPDK~~~~~~~e-----~~~~~I~KAY~aLTD~~sreN~ek--YG~P 167 (230)
T KOG0721|consen 98 KFDPYEILGLDPGA--SEKEIKKAYRRLSIKYHPDKQPPEEGDE-----EFFEAIAKAYQALTDKKSRENWEK--YGNP 167 (230)
T ss_pred cCCcHHhhCCCCCC--CHHHHHHHHHHhhhhhCCCcCCCcchhH-----HHHHHHHHHHHHhcchhhHHHHHH--hCCC
Confidence 57999999999985 7889999999999999999998764433 677899999999999999999954 4543
No 56
>PHA03102 Small T antigen; Reviewed
Probab=99.23 E-value=1.8e-11 Score=101.48 Aligned_cols=63 Identities=22% Similarity=0.192 Sum_probs=54.3
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
..++++|||++++--+..+|+++||++++++|||+.++ .+.++.||+||++|+|+.+|+.|+.
T Consensus 5 ~~l~~vLGl~~~A~~s~~eIKkAYr~la~~~HPDkgg~---------~e~~k~in~Ay~~L~d~~~r~~yd~ 67 (153)
T PHA03102 5 KELMDLLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGGD---------EEKMKELNTLYKKFRESVKSLRDLD 67 (153)
T ss_pred HHHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCch---------hHHHHHHHHHHHHHhhHHHhccccc
Confidence 46899999999853478899999999999999999532 1467999999999999999999943
No 57
>KOG0624|consensus
Probab=99.07 E-value=2.8e-10 Score=105.07 Aligned_cols=69 Identities=23% Similarity=0.499 Sum_probs=64.0
Q ss_pred CCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHH
Q psy8325 65 SQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYL 137 (237)
Q Consensus 65 ~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yl 137 (237)
...|||.+||+.+++ +..+|.++||+++.+||||.|++ ++|+..|+..|.-|..|-++|+||.+|.+|+
T Consensus 392 ~kRDYYKILGVkRnA--sKqEI~KAYRKlAqkWHPDNFqd--EeEKKkAEKKFIDIAAAKEVLsd~EkRrqFD 460 (504)
T KOG0624|consen 392 GKRDYYKILGVKRNA--SKQEITKAYRKLAQKWHPDNFQD--EEEKKKAEKKFIDIAAAKEVLSDPEKRRQFD 460 (504)
T ss_pred ccchHHHHhhhcccc--cHHHHHHHHHHHHHhcCCccccC--HHHHHHHHHhhhhHHHHHHhhcCHHHHhhcc
Confidence 457999999999985 78999999999999999999987 5778889999999999999999999999986
No 58
>PRK09430 djlA Dna-J like membrane chaperone protein; Provisional
Probab=99.05 E-value=3.7e-10 Score=101.43 Aligned_cols=63 Identities=17% Similarity=0.433 Sum_probs=55.4
Q ss_pred CCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCC--CHHHHHHHHHHHHHHHHHHHHhcC
Q psy8325 65 SQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNK--NQEEQAISETYSSYLNKAYSILQN 129 (237)
Q Consensus 65 ~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~--~~~e~~~a~~~s~~In~AY~tL~d 129 (237)
...++|++||++++ .+.++||++||+|++++|||++.++ ++...+.+.+.+..||+||++|+.
T Consensus 198 ~~~~ay~vLgv~~~--as~~eIk~aYr~L~~~~HPDk~~~~g~~~~~~~~a~ek~~~I~~AYe~L~~ 262 (267)
T PRK09430 198 TLEDAYKVLGVSES--DDDQEIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKEKAQEIQAAYELIKK 262 (267)
T ss_pred cHHhHHHHcCCCCC--CCHHHHHHHHHHHHHHhCcCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Confidence 34799999999997 4899999999999999999999754 455677889999999999999985
No 59
>KOG0722|consensus
Probab=99.02 E-value=2.4e-10 Score=101.20 Aligned_cols=68 Identities=25% Similarity=0.451 Sum_probs=61.3
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhc
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQ 141 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~ 141 (237)
..|-|++||+.++. +.++|.++||+|++++|||++.++..+ ..|..|..||++|+|...|+.|+..+.
T Consensus 32 ~enCYdVLgV~Rea--~KseIakAYRqLARrhHPDr~r~~e~k------~~F~~iAtayeilkd~e~rt~ydyald 99 (329)
T KOG0722|consen 32 AENCYDVLGVAREA--NKSEIAKAYRQLARRHHPDRNRDPESK------KLFVKIATAYEILKDNETRTQYDYALD 99 (329)
T ss_pred chhHHHHhhhhhhc--cHHHHHHHHHHHHHHhCCcccCCchhh------hhhhhhhcccccccchhhHHhHHHHhc
Confidence 36999999999986 889999999999999999999887654 678999999999999999999987654
No 60
>KOG0720|consensus
Probab=99.01 E-value=3.7e-10 Score=106.67 Aligned_cols=68 Identities=24% Similarity=0.349 Sum_probs=60.9
Q ss_pred CCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhh
Q psy8325 65 SQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSL 140 (237)
Q Consensus 65 ~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l 140 (237)
...|+|.+||||++ +|.++||+.||+++..+||||+..+ .|++.|..+.-||++|+|+.+|.+|++++
T Consensus 233 ~~~daYsvlGl~~d--~sd~~lKk~Yrk~A~LVhPDKn~~~------~A~Eafk~Lq~Afevig~~~kR~eYd~e~ 300 (490)
T KOG0720|consen 233 NILDAYSALGLPSD--CSDADLKKNYRKKAMLVHPDKNMIP------RAEEAFKKLQVAFEVIGDSVKRKEYDLEL 300 (490)
T ss_pred cCCCchhhcCCCCC--CCHHHHHHHHHhhceEeCCCccCCh------hHHHHHHHHHHHHHHhcchhhhhHHHHHH
Confidence 35899999999964 7999999999999999999999854 24588999999999999999999998765
No 61
>PTZ00100 DnaJ chaperone protein; Provisional
Probab=98.87 E-value=3.7e-09 Score=83.90 Aligned_cols=57 Identities=26% Similarity=0.367 Sum_probs=46.3
Q ss_pred cCCCCCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhc
Q psy8325 61 QKPDSQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQ 128 (237)
Q Consensus 61 qp~~~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~ 128 (237)
++.....++|++||+++++ +.++|+++||+|++++|||+.. +. ..++.||+||++|.
T Consensus 59 ~~~Ms~~eAy~ILGv~~~A--s~~eIkkaYRrLa~~~HPDkgG--s~-------~~~~kIneAyevL~ 115 (116)
T PTZ00100 59 ENPMSKSEAYKILNISPTA--SKERIREAHKQLMLRNHPDNGG--ST-------YIASKVNEAKDLLL 115 (116)
T ss_pred cCCCCHHHHHHHcCCCCCC--CHHHHHHHHHHHHHHhCCCCCC--CH-------HHHHHHHHHHHHHh
Confidence 4444457999999999975 7889999999999999999842 11 35678999999985
No 62
>COG5407 SEC63 Preprotein translocase subunit Sec63 [Intracellular trafficking and secretion]
Probab=98.74 E-value=1.4e-08 Score=96.19 Aligned_cols=72 Identities=24% Similarity=0.375 Sum_probs=66.0
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHh
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLS 139 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~ 139 (237)
..|+||+||+..+. +..+||++||.|+.+|||||.+.-...++..-++....|++||..|.|...|..|+..
T Consensus 97 ~fDPyEILGI~~~t--s~rdik~~yr~Ls~KfhpdK~~~mvn~~rse~Ee~y~~ItkAY~~lTd~k~renyl~y 168 (610)
T COG5407 97 GFDPYEILGIDQDT--SERDIKKRYRMLSMKFHPDKAPPMVNELRSEYEEKYKTITKAYGLLTDKKRRENYLNY 168 (610)
T ss_pred CCChHHhhcccCCC--cHHHHHHHHHhheeecChhhcCCCChhHHHHHHHHHHHHHHHHHhhhhHHHHHHHHhc
Confidence 57999999999986 5679999999999999999999877788888889999999999999999999999864
No 63
>PHA02624 large T antigen; Provisional
Probab=98.73 E-value=1.3e-08 Score=100.09 Aligned_cols=63 Identities=24% Similarity=0.247 Sum_probs=54.2
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYL 137 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yl 137 (237)
..++|++|||++++--+..+|+++||++++++|||+.++ .+.++.||.||++|+|+.+|+.|.
T Consensus 10 ~~elyelLGL~~~A~gs~~eIKkAYRkLAkkyHPDKgGd---------eekfk~Ln~AYevL~d~~k~~r~~ 72 (647)
T PHA02624 10 SKELMDLLGLPMAAWGNLPLMRKAYLRKCKEYHPDKGGD---------EEKMKRLNSLYKKLQEGVKSARQS 72 (647)
T ss_pred HHHHHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCCc---------HHHHHHHHHHHHHHhcHHHhhhcc
Confidence 358999999999742267899999999999999999432 257899999999999999999994
No 64
>KOG0550|consensus
Probab=98.68 E-value=2.5e-08 Score=93.72 Aligned_cols=68 Identities=22% Similarity=0.423 Sum_probs=60.2
Q ss_pred CCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHH
Q psy8325 65 SQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLL 138 (237)
Q Consensus 65 ~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll 138 (237)
...+||.+||+....+ ..+|+++||+++..+|||+++.. +..++..|..|-+||.+|+||.+|.+|+-
T Consensus 371 kRkd~ykilGi~~~as--~~eikkayrk~AL~~Hpd~~ags----q~eaE~kFkevgeAy~il~d~~kr~r~ds 438 (486)
T KOG0550|consen 371 KRKDWYKILGISRNAS--DDEIKKAYRKLALVHHPDKNAGS----QKEAEAKFKEVGEAYTILSDPMKRVRFDS 438 (486)
T ss_pred hhhhHHHHhhhhhhcc--cchhhhHHHHHHHHhCCCcCcch----hHHHHHHHHHHHHHHHHhcCHHHHhhccc
Confidence 3689999999998864 45999999999999999999886 44567899999999999999999999974
No 65
>KOG0714|consensus
Probab=98.68 E-value=1.6e-08 Score=88.50 Aligned_cols=68 Identities=24% Similarity=0.492 Sum_probs=55.9
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHh
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLS 139 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~ 139 (237)
..+||.+||++++.. ..+|+++|+.+++++|||++... ...+...+.+|++||++|+||.+|..|+..
T Consensus 2 ~~d~~~~l~i~~~as--~~~i~ka~~~~a~~~hpdk~~~~----~~~~~~~~~~~~ea~~~ls~~~kr~~~d~~ 69 (306)
T KOG0714|consen 2 GKDYYKILGIARSAS--EEDIKKAYRKLALKYHPDKNPSP----KEVAEAKFKEIAEAYEVLSDPKKRKIYDQY 69 (306)
T ss_pred cccHHHHhCcccccc--HHHHHHHHHHHHHhhCCCCCCCc----hhhHHHHHhhhhccccccCCHHHhhhcccc
Confidence 369999999998763 34999999999999999995443 223344788999999999999999999753
No 66
>KOG1150|consensus
Probab=98.58 E-value=1.1e-07 Score=81.93 Aligned_cols=64 Identities=22% Similarity=0.441 Sum_probs=55.5
Q ss_pred CCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhH
Q psy8325 66 QSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGL 135 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~ 135 (237)
+.|.|++|.+.|. ++.++|+++||+|+..+|||+++...+ .|...|--|.+||..|-|+..|..
T Consensus 52 nLNpfeVLqIdpe--v~~edikkryRklSilVHPDKN~Dd~~----rAqkAFdivkKA~k~l~n~~~rkr 115 (250)
T KOG1150|consen 52 NLNPFEVLQIDPE--VTDEDIKKRYRKLSILVHPDKNPDDAE----RAQKAFDIVKKAYKLLENDKIRKR 115 (250)
T ss_pred ccChHHHHhcCCC--CCHHHHHHHHHhhheeecCCCCcccHH----HHHHHHHHHHHHHHHHhCHHHHHH
Confidence 4799999999997 578899999999999999999987633 356789999999999999996553
No 67
>COG5269 ZUO1 Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones]
Probab=98.45 E-value=3.2e-07 Score=82.30 Aligned_cols=88 Identities=17% Similarity=0.167 Sum_probs=68.2
Q ss_pred CCChhhhcCCcc-cCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhhcCCC
Q psy8325 66 QSNYYDLFDLKL-TYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSLQNIS 144 (237)
Q Consensus 66 ~~nyf~llgl~~-~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l~g~~ 144 (237)
..|+|.+|||+. .+..++..|.++.+....+||||+.+..+.... ...|..|.+||+||+||..|.+|+-.-...+
T Consensus 42 ~~DlYa~lgLskyR~ka~~~qi~kah~kkv~kyHPDk~aa~g~~~~---d~fFk~iqkA~evL~D~~~R~qyDS~df~ad 118 (379)
T COG5269 42 KVDLYALLGLSKYRTKAIPPQILKAHKKKVYKYHPDKTAAGGNKGC---DEFFKLIQKAREVLGDRKLRLQYDSNDFDAD 118 (379)
T ss_pred hhhHHHHhchHhhhcCCCcHHHHHHHHHHHHHhCccchhccCCCCc---HHHHHHHHHHHHHhccHHHHhhccccccccC
Confidence 369999999985 456678888889999999999999855443322 2689999999999999999999987666666
Q ss_pred ccccccCCCHHH
Q psy8325 145 IEEDTKGTDQKL 156 (237)
Q Consensus 145 ~~e~~~~~d~ef 156 (237)
+++.....+.+|
T Consensus 119 vppp~~~t~~~F 130 (379)
T COG5269 119 VPPPRIYTPDEF 130 (379)
T ss_pred CCCccCCCchhH
Confidence 665433444444
No 68
>KOG0568|consensus
Probab=97.53 E-value=0.00017 Score=63.81 Aligned_cols=57 Identities=18% Similarity=0.369 Sum_probs=46.9
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHH-HhcCcc
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYS-ILQNPL 131 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~-tL~dp~ 131 (237)
+.||.+||++.++ |..+++.+|..|++.+|||.-+...+ .+.|.+|.+||+ +|..-.
T Consensus 47 ~e~fril~v~e~~--~adevr~af~~lakq~hpdsgs~~ad------aa~f~qideafrkvlq~~~ 104 (342)
T KOG0568|consen 47 MECFRILGVEEGA--DADEVREAFHDLAKQVHPDSGSEEAD------AARFIQIDEAFRKVLQEKF 104 (342)
T ss_pred HHHHHHhcccccC--chhHHHHHHHHHHHHcCCCCCCcccc------HHHHHHHHHHHHHHHHHHH
Confidence 5899999999987 66799999999999999998765433 367899999998 665433
No 69
>KOG1789|consensus
Probab=97.41 E-value=0.00023 Score=73.88 Aligned_cols=54 Identities=22% Similarity=0.375 Sum_probs=43.8
Q ss_pred CChhhhcCCccc--CCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhc
Q psy8325 67 SNYYDLFDLKLT--YLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQ 128 (237)
Q Consensus 67 ~nyf~llgl~~~--f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~ 128 (237)
.+-+++|.++-+ -.-+++.|+++|++|+.+|||||++.. .+.|-.||+||+.|.
T Consensus 1281 d~A~eiL~i~l~n~~hD~~~KirrqY~kLA~kYHPDKNPEG--------RemFe~VnKAYE~L~ 1336 (2235)
T KOG1789|consen 1281 DLAREILSVDLTNEEHDKPAKIRRQYYKLAAKYHPDKNPEG--------REMFERVNKAYELLS 1336 (2235)
T ss_pred HHHHHHhccccCCCCcccHHHHHHHHHHHHHHhCCCCCchH--------HHHHHHHHHHHHHHH
Confidence 367889988743 223558999999999999999998642 267899999999998
No 70
>KOG0723|consensus
Probab=96.91 E-value=0.002 Score=50.46 Aligned_cols=60 Identities=13% Similarity=0.228 Sum_probs=47.7
Q ss_pred ccCCCCCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCc
Q psy8325 60 VQKPDSQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNP 130 (237)
Q Consensus 60 ~qp~~~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp 130 (237)
++|--....--.+||+.++ ++.+.|+.++|+....-|||+-.++ =..+.||+|++.|...
T Consensus 49 F~~kMsr~EA~lIL~v~~s--~~k~KikeaHrriM~~NHPD~GGSP---------YlAsKINEAKdlLe~~ 108 (112)
T KOG0723|consen 49 FEPKMSRREAALILGVTPS--LDKDKIKEAHRRIMLANHPDRGGSP---------YLASKINEAKDLLEGT 108 (112)
T ss_pred cccccchHHHHHHhCCCcc--ccHHHHHHHHHHHHHcCCCcCCCCH---------HHHHHHHHHHHHHhcc
Confidence 3444445566679999986 5899999999999999999998765 2347899999998643
No 71
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=96.43 E-value=0.0017 Score=38.15 Aligned_cols=22 Identities=23% Similarity=0.668 Sum_probs=20.1
Q ss_pred CCCCcCCCCCCCCCccCcCCCc
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
..||+|+..+....-||++||.
T Consensus 3 ~~Cp~Cg~~~~~~~~fC~~CG~ 24 (26)
T PF13248_consen 3 MFCPNCGAEIDPDAKFCPNCGA 24 (26)
T ss_pred CCCcccCCcCCcccccChhhCC
Confidence 4799999998889999999986
No 72
>KOG0431|consensus
Probab=96.09 E-value=0.0098 Score=57.47 Aligned_cols=48 Identities=23% Similarity=0.425 Sum_probs=40.2
Q ss_pred CCHHHHHHHHHHHHHHhCCCCCCCC--CHHHHHHHHHHHHHHHHHHHHhc
Q psy8325 81 INNVDLSKKFKQLQSQLHPDKFSNK--NQEEQAISETYSSYLNKAYSILQ 128 (237)
Q Consensus 81 id~~~Lk~~Yr~Lq~~~HPDk~~~~--~~~e~~~a~~~s~~In~AY~tL~ 128 (237)
|+.+.+|++||+..+.+||||.++. +...+-++++.|..+++|++--.
T Consensus 400 Vtp~~VKKaYrKA~L~VHPDKlqq~gas~~qK~Iaekvfd~l~eawn~f~ 449 (453)
T KOG0431|consen 400 VTPAQVKKAYRKAVLCVHPDKLQQKGASLEQKYIAEKVFDALSEAWNKFN 449 (453)
T ss_pred cCHHHHHHHHHhhhheeCcccccCCcccHHHHHHHHHHHHHHHHHHHhhh
Confidence 7899999999999999999999876 34456677888999999887544
No 73
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=96.02 E-value=0.0033 Score=35.99 Aligned_cols=21 Identities=24% Similarity=0.703 Sum_probs=19.2
Q ss_pred CCCcCCCCCCCCCccCcCCCc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
.|++|+..+....-||+.||.
T Consensus 1 ~Cp~CG~~~~~~~~fC~~CG~ 21 (23)
T PF13240_consen 1 YCPNCGAEIEDDAKFCPNCGT 21 (23)
T ss_pred CCcccCCCCCCcCcchhhhCC
Confidence 499999999989999999986
No 74
>COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
Probab=94.86 E-value=0.034 Score=46.82 Aligned_cols=59 Identities=19% Similarity=0.294 Sum_probs=50.0
Q ss_pred CChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCC--HHHHHHHHHHHHHHHHHHHHh
Q psy8325 67 SNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKN--QEEQAISETYSSYLNKAYSIL 127 (237)
Q Consensus 67 ~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~--~~e~~~a~~~s~~In~AY~tL 127 (237)
.+.+.+||++++- +..+++++|+++....|||+-...+ .+--+.+.+..+.||.||..+
T Consensus 113 ~~~l~~l~~~~~~--~~~~i~~~~r~l~~e~~~d~a~~~~~~~e~~~~~~~~~~~i~~a~~~~ 173 (174)
T COG1076 113 EDALKVLGVEIKA--DQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYEDI 173 (174)
T ss_pred hhHHHHhcCchhh--hHHHHHHHHHHHHHhcCHHHHHHhcCCHHHHHHHHHHHHHHHHHHHhc
Confidence 6889999999975 7889999999999999999987655 444566778999999999864
No 75
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=93.92 E-value=0.039 Score=32.51 Aligned_cols=22 Identities=23% Similarity=0.572 Sum_probs=20.2
Q ss_pred CCCcCCCCCCCCCccCcCCCcc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSSV 60 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~~ 60 (237)
.|+.|+..|+...-+||.||..
T Consensus 2 ~CP~C~~~V~~~~~~Cp~CG~~ 23 (26)
T PF10571_consen 2 TCPECGAEVPESAKFCPHCGYD 23 (26)
T ss_pred cCCCCcCCchhhcCcCCCCCCC
Confidence 6999999999999999999974
No 76
>PF14687 DUF4460: Domain of unknown function (DUF4460)
Probab=93.54 E-value=0.2 Score=39.57 Aligned_cols=51 Identities=22% Similarity=0.372 Sum_probs=42.5
Q ss_pred CCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcch
Q psy8325 81 INNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLE 132 (237)
Q Consensus 81 id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~ 132 (237)
+...+++.+.|.+-+++|||.|++.+. +++.-++-.+.+|.-.+.|..+..
T Consensus 6 ~~~~~l~~aLr~Fy~~VHPDlF~~~P~-~k~~Ne~SLk~Ln~~Ld~l~~~~~ 56 (112)
T PF14687_consen 6 LSSPDLRSALRPFYFAVHPDLFGQHPE-EKQVNEESLKLLNSYLDSLKKRKS 56 (112)
T ss_pred hhhHHHHHHHHHHHHHhCCcccccChH-HHHhhHHHHHHHHHHHHHHhccCC
Confidence 345689999999999999999998654 788877888889988888887653
No 77
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=93.33 E-value=0.06 Score=42.35 Aligned_cols=28 Identities=21% Similarity=0.564 Sum_probs=24.1
Q ss_pred CCCCCcCCCC---CCCCCccCcCCCcccCCC
Q psy8325 37 ELKCWKCLKN---LSGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 37 ~~~Cw~C~~~---~~~~~~fC~~C~~~qp~~ 64 (237)
...|++||.. ++.....||.||..+++.
T Consensus 9 KR~Cp~CG~kFYDLnk~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 9 KRTCPSCGAKFYDLNKDPIVCPKCGTEFPPE 39 (108)
T ss_pred cccCCCCcchhccCCCCCccCCCCCCccCcc
Confidence 5589999996 467888999999999887
No 78
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=91.10 E-value=0.16 Score=40.98 Aligned_cols=29 Identities=17% Similarity=0.240 Sum_probs=24.3
Q ss_pred CCCCCCcCCCC---CCCCCccCcCCCcccCCC
Q psy8325 36 TELKCWKCLKN---LSGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 36 ~~~~Cw~C~~~---~~~~~~fC~~C~~~qp~~ 64 (237)
+...|++|++. ++.....||.||..+|+.
T Consensus 8 tKr~Cp~cg~kFYDLnk~p~vcP~cg~~~~~~ 39 (129)
T TIGR02300 8 TKRICPNTGSKFYDLNRRPAVSPYTGEQFPPE 39 (129)
T ss_pred ccccCCCcCccccccCCCCccCCCcCCccCcc
Confidence 35589999996 468899999999999775
No 79
>PRK00420 hypothetical protein; Validated
Probab=91.00 E-value=0.15 Score=40.41 Aligned_cols=26 Identities=19% Similarity=0.300 Sum_probs=21.5
Q ss_pred CCCCCCcCCCCC---CCCCccCcCCCccc
Q psy8325 36 TELKCWKCLKNL---SGKNLFCQYCSSVQ 61 (237)
Q Consensus 36 ~~~~Cw~C~~~~---~~~~~fC~~C~~~q 61 (237)
.+..||.|+.|+ .....|||.|+.+-
T Consensus 22 l~~~CP~Cg~pLf~lk~g~~~Cp~Cg~~~ 50 (112)
T PRK00420 22 LSKHCPVCGLPLFELKDGEVVCPVHGKVY 50 (112)
T ss_pred ccCCCCCCCCcceecCCCceECCCCCCee
Confidence 346899999987 37899999999844
No 80
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=88.10 E-value=0.29 Score=38.84 Aligned_cols=35 Identities=17% Similarity=0.372 Sum_probs=25.7
Q ss_pred CCcCCCCCCCCCccCcCCCcccCCCCCCChhhhcC
Q psy8325 40 CWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLFD 74 (237)
Q Consensus 40 Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~llg 74 (237)
||.|++++....+.|++|+..-.-.=..++|..|.
T Consensus 1 CPvCg~~l~vt~l~C~~C~t~i~G~F~l~~~~~L~ 35 (113)
T PF09862_consen 1 CPVCGGELVVTRLKCPSCGTEIEGEFELPWFARLS 35 (113)
T ss_pred CCCCCCceEEEEEEcCCCCCEEEeeeccchhhcCC
Confidence 99999998888899999988654433445555543
No 81
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=87.79 E-value=0.33 Score=28.11 Aligned_cols=19 Identities=21% Similarity=0.681 Sum_probs=14.3
Q ss_pred CCcCCCCCC----CCCccCcCCC
Q psy8325 40 CWKCLKNLS----GKNLFCQYCS 58 (237)
Q Consensus 40 Cw~C~~~~~----~~~~fC~~C~ 58 (237)
|-+|+.++. ...|.||+|+
T Consensus 1 C~sC~~~i~~r~~~v~f~CPnCG 23 (24)
T PF07754_consen 1 CTSCGRPIAPREQAVPFPCPNCG 23 (24)
T ss_pred CccCCCcccCcccCceEeCCCCC
Confidence 678877662 5678999997
No 82
>PF12773 DZR: Double zinc ribbon
Probab=87.12 E-value=0.68 Score=30.59 Aligned_cols=27 Identities=19% Similarity=0.412 Sum_probs=15.9
Q ss_pred cCCCCCCcCCCCCC---CCCccCcCCCccc
Q psy8325 35 HTELKCWKCLKNLS---GKNLFCQYCSSVQ 61 (237)
Q Consensus 35 ~~~~~Cw~C~~~~~---~~~~fC~~C~~~q 61 (237)
.....|++|+.++. ....+|++|+...
T Consensus 10 ~~~~fC~~CG~~l~~~~~~~~~C~~Cg~~~ 39 (50)
T PF12773_consen 10 DDAKFCPHCGTPLPPPDQSKKICPNCGAEN 39 (50)
T ss_pred ccccCChhhcCChhhccCCCCCCcCCcCCC
Confidence 34556777776654 3456677776643
No 83
>COG1645 Uncharacterized Zn-finger containing protein [General function prediction only]
Probab=86.14 E-value=0.36 Score=39.20 Aligned_cols=23 Identities=30% Similarity=0.689 Sum_probs=19.8
Q ss_pred CCCCCcCCCCC--CCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNL--SGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~--~~~~~fC~~C~~ 59 (237)
...|+.|+.|+ .....|||.|+.
T Consensus 28 ~~hCp~Cg~PLF~KdG~v~CPvC~~ 52 (131)
T COG1645 28 AKHCPKCGTPLFRKDGEVFCPVCGY 52 (131)
T ss_pred HhhCcccCCcceeeCCeEECCCCCc
Confidence 45899999998 378899999985
No 84
>PF12773 DZR: Double zinc ribbon
Probab=85.17 E-value=0.52 Score=31.16 Aligned_cols=23 Identities=22% Similarity=0.501 Sum_probs=20.7
Q ss_pred CCcCCCCCCCCCccCcCCCcccC
Q psy8325 40 CWKCLKNLSGKNLFCQYCSSVQK 62 (237)
Q Consensus 40 Cw~C~~~~~~~~~fC~~C~~~qp 62 (237)
|++|+.++....-||+.|+...+
T Consensus 1 Cp~Cg~~~~~~~~fC~~CG~~l~ 23 (50)
T PF12773_consen 1 CPHCGTPNPDDAKFCPHCGTPLP 23 (50)
T ss_pred CCCcCCcCCccccCChhhcCChh
Confidence 88999999889999999998665
No 85
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=84.66 E-value=0.72 Score=46.65 Aligned_cols=38 Identities=16% Similarity=0.398 Sum_probs=25.9
Q ss_pred ccccccccccccc-CCCCCCcCCCCCCCCCccCcCCCcc
Q psy8325 23 ATLQNLCTDTVKH-TELKCWKCLKNLSGKNLFCQYCSSV 60 (237)
Q Consensus 23 ~~~~~~~~~~~~~-~~~~Cw~C~~~~~~~~~fC~~C~~~ 60 (237)
+....||..=..+ ....|++|+..++...-||+.||.-
T Consensus 12 ~~~akFC~~CG~~l~~~~Cp~CG~~~~~~~~fC~~CG~~ 50 (645)
T PRK14559 12 PNNNRFCQKCGTSLTHKPCPQCGTEVPVDEAHCPNCGAE 50 (645)
T ss_pred CCCCccccccCCCCCCCcCCCCCCCCCcccccccccCCc
Confidence 3344566442222 2257999999998889999999873
No 86
>PF09889 DUF2116: Uncharacterized protein containing a Zn-ribbon (DUF2116); InterPro: IPR019216 This entry contains various hypothetical prokaryotic proteins whose functions are unknown. They contain a conserved zinc ribbon motif in the N-terminal part and a predicted transmembrane segment in the C-terminal part.
Probab=84.46 E-value=0.32 Score=34.27 Aligned_cols=22 Identities=27% Similarity=0.632 Sum_probs=18.5
Q ss_pred CCCCcCCCCCCCCCccCc-CCCc
Q psy8325 38 LKCWKCLKNLSGKNLFCQ-YCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~-~C~~ 59 (237)
..|+.||.+++.+..||+ .|+.
T Consensus 4 kHC~~CG~~Ip~~~~fCS~~C~~ 26 (59)
T PF09889_consen 4 KHCPVCGKPIPPDESFCSPKCRE 26 (59)
T ss_pred CcCCcCCCcCCcchhhhCHHHHH
Confidence 479999999998999994 7754
No 87
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=84.10 E-value=0.36 Score=30.20 Aligned_cols=22 Identities=27% Similarity=0.609 Sum_probs=13.2
Q ss_pred CCCcCCCCC--------CCCCccCcCCCcc
Q psy8325 39 KCWKCLKNL--------SGKNLFCQYCSSV 60 (237)
Q Consensus 39 ~Cw~C~~~~--------~~~~~fC~~C~~~ 60 (237)
.|+.|+.++ ...-++|++|+.|
T Consensus 2 fC~~CG~~l~~~ip~gd~r~R~vC~~Cg~I 31 (34)
T PF14803_consen 2 FCPQCGGPLERRIPEGDDRERLVCPACGFI 31 (34)
T ss_dssp B-TTT--B-EEE--TT-SS-EEEETTTTEE
T ss_pred ccccccChhhhhcCCCCCccceECCCCCCE
Confidence 589999976 2456799999876
No 88
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=81.83 E-value=0.72 Score=29.86 Aligned_cols=24 Identities=29% Similarity=0.577 Sum_probs=18.1
Q ss_pred CCCcCCCCC-----CCCCccCcCCCcccC
Q psy8325 39 KCWKCLKNL-----SGKNLFCQYCSSVQK 62 (237)
Q Consensus 39 ~Cw~C~~~~-----~~~~~fC~~C~~~qp 62 (237)
.|++|+... ....++|+.||.|..
T Consensus 2 ~Cp~Cg~~~~~~D~~~g~~vC~~CG~Vl~ 30 (43)
T PF08271_consen 2 KCPNCGSKEIVFDPERGELVCPNCGLVLE 30 (43)
T ss_dssp SBTTTSSSEEEEETTTTEEEETTT-BBEE
T ss_pred CCcCCcCCceEEcCCCCeEECCCCCCEee
Confidence 599999853 367889999998764
No 89
>PRK14811 formamidopyrimidine-DNA glycosylase; Provisional
Probab=79.90 E-value=1.2 Score=40.15 Aligned_cols=27 Identities=19% Similarity=0.511 Sum_probs=21.0
Q ss_pred CCCCCcCCCCC------CCCCccCcCCCcccCC
Q psy8325 37 ELKCWKCLKNL------SGKNLFCQYCSSVQKP 63 (237)
Q Consensus 37 ~~~Cw~C~~~~------~~~~~fC~~C~~~qp~ 63 (237)
...|+.|+.++ ....+|||.|...-|.
T Consensus 235 g~pC~~Cg~~I~~~~~~gR~ty~Cp~CQ~~~~~ 267 (269)
T PRK14811 235 GQPCPRCGTPIEKIVVGGRGTHFCPQCQPLRPL 267 (269)
T ss_pred cCCCCcCCCeeEEEEECCCCcEECCCCcCCCCC
Confidence 34899999987 3788999999665443
No 90
>PF06677 Auto_anti-p27: Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27); InterPro: IPR009563 The proteins in this entry are functionally uncharacterised and include several proteins that characterise Sjogren's syndrome/scleroderma autoantigen 1 (Autoantigen p27). It is thought that the potential association of anti-p27 with anti-centromere antibodies suggests that autoantigen p27 might play a role in mitosis [].
Probab=78.62 E-value=1.3 Score=28.91 Aligned_cols=22 Identities=23% Similarity=0.691 Sum_probs=17.7
Q ss_pred CCCCCcCCCCC---CCCCccCcCCC
Q psy8325 37 ELKCWKCLKNL---SGKNLFCQYCS 58 (237)
Q Consensus 37 ~~~Cw~C~~~~---~~~~~fC~~C~ 58 (237)
...|+.|+.|+ .....||+.|+
T Consensus 17 ~~~Cp~C~~PL~~~k~g~~~Cv~C~ 41 (41)
T PF06677_consen 17 DEHCPDCGTPLMRDKDGKIYCVSCG 41 (41)
T ss_pred cCccCCCCCeeEEecCCCEECCCCC
Confidence 44899999987 36789999995
No 91
>PF14353 CpXC: CpXC protein
Probab=78.42 E-value=1 Score=35.68 Aligned_cols=21 Identities=14% Similarity=0.251 Sum_probs=15.3
Q ss_pred CCccCcCCCcccCCCCCCChh
Q psy8325 50 KNLFCQYCSSVQKPDSQSNYY 70 (237)
Q Consensus 50 ~~~fC~~C~~~qp~~~~~nyf 70 (237)
..+.||+||....++...-|+
T Consensus 37 ~~~~CP~Cg~~~~~~~p~lY~ 57 (128)
T PF14353_consen 37 FSFTCPSCGHKFRLEYPLLYH 57 (128)
T ss_pred CEEECCCCCCceecCCCEEEE
Confidence 368899999988776554443
No 92
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=77.82 E-value=1.5 Score=30.91 Aligned_cols=21 Identities=24% Similarity=0.741 Sum_probs=10.6
Q ss_pred CCCcCCCCCC----CCCccCcCCCc
Q psy8325 39 KCWKCLKNLS----GKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~----~~~~fC~~C~~ 59 (237)
.|-+|+..+. ...|.||+|+.
T Consensus 9 ~CtSCg~~i~~~~~~~~F~CPnCG~ 33 (59)
T PRK14890 9 KCTSCGIEIAPREKAVKFLCPNCGE 33 (59)
T ss_pred cccCCCCcccCCCccCEeeCCCCCC
Confidence 4555555442 33455555554
No 93
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=77.05 E-value=1.5 Score=30.83 Aligned_cols=26 Identities=27% Similarity=0.521 Sum_probs=21.0
Q ss_pred CCCCCCcCCCCC----CCCCccCcCCCccc
Q psy8325 36 TELKCWKCLKNL----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 36 ~~~~Cw~C~~~~----~~~~~fC~~C~~~q 61 (237)
++..|+.|+... ....+.|+.|+...
T Consensus 27 TSq~C~~CG~~~~~~~~~r~~~C~~Cg~~~ 56 (69)
T PF07282_consen 27 TSQTCPRCGHRNKKRRSGRVFTCPNCGFEM 56 (69)
T ss_pred CccCccCcccccccccccceEEcCCCCCEE
Confidence 566899999865 46789999998864
No 94
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=76.85 E-value=1.7 Score=28.71 Aligned_cols=28 Identities=21% Similarity=0.483 Sum_probs=21.1
Q ss_pred CCCcCCCCCC-----C-CCccCcCCCcccCCCCC
Q psy8325 39 KCWKCLKNLS-----G-KNLFCQYCSSVQKPDSQ 66 (237)
Q Consensus 39 ~Cw~C~~~~~-----~-~~~fC~~C~~~qp~~~~ 66 (237)
.|+.|+..+. . ..+.|+.|+...+.++.
T Consensus 2 FCp~Cg~~l~~~~~~~~~~~vC~~Cg~~~~~~~~ 35 (52)
T smart00661 2 FCPKCGNMLIPKEGKEKRRFVCRKCGYEEPIEQK 35 (52)
T ss_pred CCCCCCCccccccCCCCCEEECCcCCCeEECCCc
Confidence 6999998662 1 36889999998877653
No 95
>PRK01103 formamidopyrimidine/5-formyluracil/ 5-hydroxymethyluracil DNA glycosylase; Validated
Probab=76.41 E-value=1.4 Score=39.68 Aligned_cols=23 Identities=22% Similarity=0.693 Sum_probs=18.6
Q ss_pred CCCCCcCCCCC------CCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNL------SGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~------~~~~~fC~~C~~ 59 (237)
...|+.|+.++ ....+|||.|..
T Consensus 245 g~pC~~Cg~~I~~~~~~gR~t~~CP~CQ~ 273 (274)
T PRK01103 245 GEPCRRCGTPIEKIKQGGRSTFFCPRCQK 273 (274)
T ss_pred CCCCCCCCCeeEEEEECCCCcEECcCCCC
Confidence 34799999987 368899999953
No 96
>PF08792 A2L_zn_ribbon: A2L zinc ribbon domain; InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors [].
Probab=76.25 E-value=1.6 Score=27.08 Aligned_cols=25 Identities=32% Similarity=0.523 Sum_probs=20.2
Q ss_pred CCCCcCCCCC----CCCCccCcCCCcccC
Q psy8325 38 LKCWKCLKNL----SGKNLFCQYCSSVQK 62 (237)
Q Consensus 38 ~~Cw~C~~~~----~~~~~fC~~C~~~qp 62 (237)
..|..|+++. ...-.||..|+.++|
T Consensus 4 ~~C~~C~~~~i~~~~~~~~~C~~Cg~~~~ 32 (33)
T PF08792_consen 4 KKCSKCGGNGIVNKEDDYEVCIFCGSSFP 32 (33)
T ss_pred eEcCCCCCCeEEEecCCeEEcccCCcEee
Confidence 4688999975 367799999999875
No 97
>PF03357 Snf7: Snf7; InterPro: IPR005024 This is a family of eukaryotic proteins which are variously described as either hypothetical protein, developmental protein or related to yeast SNF7. The family contains human CHMP1. CHMP1 (CHromatin Modifying Protein; CHarged Multivesicular body Protein), is encoded by an alternative open reading frame in the PRSM1 gene [] and is conserved in both complex and simple eukaryotes. CHMP1 contains a predicted bipartite nuclear localisation signal and distributes as distinct forms to the cytoplasm and the nuclear matrix in all cell lines tested. Human CHMP1 is strongly implicated in multivesicular body formation. A multivesicular body is a vesicle-filled endosome that targets proteins to the interior of lysosomes. Immunocytochemistry and biochemical fractionation localise CHMP1 to early endosomes and CHMP1 physically interacts with SKD1/VPS4, a highly conserved protein directly linked to multivesicular body sorting in yeast. Similar to the action of a mutant SKD1 protein, over expression of a fusion derivative of human CHMP1 dilates endosomal compartments and disrupts the normal distribution of several endosomal markers. Genetic studies in Saccharomyces cerevisiae (Baker's yeast) further support a conserved role of CHMP1 in vesicle trafficking. Deletion of CHM1, the budding yeast homologue of CHMP1, results in defective sorting of carboxypeptidases S and Y and produces abnormal, multi-lamellar prevacuolar compartments. This phenotype classifies CHM1 as a member of the class E vacuolar protein sorting genes []. ; GO: 0015031 protein transport; PDB: 2V6X_B 2W2U_D 2GD5_D 3FRT_B 3FRV_A 4ABM_D 3EAB_H 3HTU_D.
Probab=76.02 E-value=16 Score=29.51 Aligned_cols=56 Identities=23% Similarity=0.289 Sum_probs=46.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHHHHHHHHhhc
Q psy8325 175 DLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVSKVRDKKNQL 230 (237)
Q Consensus 175 ~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~~ik~k~~~l 230 (237)
.|..-...++.+++.+...+..+...|+-+.|+.++.+++.+.+..+.+......|
T Consensus 12 ~L~~~~~~le~~i~~~~~~~k~~~~~~~~~~A~~~lk~~k~~~k~~~~~~~~~~~l 67 (171)
T PF03357_consen 12 RLEKQIKRLEKKIKKLEKKAKKAIKKGNKERAKIYLKRKKRLEKQLEKLLNQLSNL 67 (171)
T ss_dssp HHHHHHHHHHHHHHHCHHHHHHHHCTT-HHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCChHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45566667888889999999999999999999999999999999888887666554
No 98
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=75.11 E-value=1.8 Score=28.40 Aligned_cols=21 Identities=33% Similarity=0.939 Sum_probs=16.3
Q ss_pred CCCcCCCCC----CCCCccCcCCCc
Q psy8325 39 KCWKCLKNL----SGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~----~~~~~fC~~C~~ 59 (237)
.|.+|+..+ ....+.||.|+.
T Consensus 5 ~C~~CG~~~~~~~~~~~~~Cp~CG~ 29 (46)
T PRK00398 5 KCARCGREVELDEYGTGVRCPYCGY 29 (46)
T ss_pred ECCCCCCEEEECCCCCceECCCCCC
Confidence 699999975 233789999986
No 99
>COG0675 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=74.81 E-value=1.6 Score=38.77 Aligned_cols=28 Identities=32% Similarity=0.509 Sum_probs=22.9
Q ss_pred ccCCCCCCcCCCCCCCCCccCcCCCcccC
Q psy8325 34 KHTELKCWKCLKNLSGKNLFCQYCSSVQK 62 (237)
Q Consensus 34 ~~~~~~Cw~C~~~~~~~~~fC~~C~~~qp 62 (237)
..+|..|+.||. +....++|+.|+....
T Consensus 306 ~~tS~~C~~cg~-~~~r~~~C~~cg~~~~ 333 (364)
T COG0675 306 YYTSKTCPCCGH-LSGRLFKCPRCGFVHD 333 (364)
T ss_pred CCCcccccccCC-ccceeEECCCCCCeeh
Confidence 355679999999 6678899999998753
No 100
>COG1439 Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module [General function prediction only]
Probab=74.43 E-value=1.5 Score=37.33 Aligned_cols=23 Identities=30% Similarity=0.804 Sum_probs=19.9
Q ss_pred CCCCCcCCCCCCCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
...||.|....+.+.-|||.||.
T Consensus 139 ~~rC~GC~~~f~~~~~~Cp~CG~ 161 (177)
T COG1439 139 RLRCHGCKRIFPEPKDFCPICGS 161 (177)
T ss_pred eEEEecCceecCCCCCcCCCCCC
Confidence 34799999987778999999987
No 101
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=73.92 E-value=1.3 Score=36.01 Aligned_cols=37 Identities=22% Similarity=0.405 Sum_probs=26.3
Q ss_pred cccccccccCCCCCCcCCCCC---------------CCCCccCcCCCcccCC
Q psy8325 27 NLCTDTVKHTELKCWKCLKNL---------------SGKNLFCQYCSSVQKP 63 (237)
Q Consensus 27 ~~~~~~~~~~~~~Cw~C~~~~---------------~~~~~fC~~C~~~qp~ 63 (237)
.||+.-...+-..||.|+.++ ....-||++|++--|-
T Consensus 29 afcskcgeati~qcp~csasirgd~~vegvlglg~dye~psfchncgs~fpw 80 (160)
T COG4306 29 AFCSKCGEATITQCPICSASIRGDYYVEGVLGLGGDYEPPSFCHNCGSRFPW 80 (160)
T ss_pred HHHhhhchHHHhcCCccCCcccccceeeeeeccCCCCCCcchhhcCCCCCCc
Confidence 467665555566899999875 2456799999986653
No 102
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=73.17 E-value=3.2 Score=42.09 Aligned_cols=36 Identities=22% Similarity=0.372 Sum_probs=25.7
Q ss_pred ccccccccCCCCCCcCCCCCCCCCccCcCCCcccCCCC
Q psy8325 28 LCTDTVKHTELKCWKCLKNLSGKNLFCQYCSSVQKPDS 65 (237)
Q Consensus 28 ~~~~~~~~~~~~Cw~C~~~~~~~~~fC~~C~~~qp~~~ 65 (237)
.|...-..+...|++||.++.. -+|+.|+..-|++.
T Consensus 6 ~Cg~~n~~~akFC~~CG~~l~~--~~Cp~CG~~~~~~~ 41 (645)
T PRK14559 6 QCQFENPNNNRFCQKCGTSLTH--KPCPQCGTEVPVDE 41 (645)
T ss_pred CCCCcCCCCCccccccCCCCCC--CcCCCCCCCCCccc
Confidence 3555445667789999988742 47999999877753
No 103
>CHL00174 accD acetyl-CoA carboxylase beta subunit; Reviewed
Probab=72.49 E-value=1.9 Score=39.59 Aligned_cols=44 Identities=23% Similarity=0.268 Sum_probs=28.8
Q ss_pred hccccccccccccccCCCCCCcCCCCC-----CCCCccCcCCCcccCCC
Q psy8325 21 ANATLQNLCTDTVKHTELKCWKCLKNL-----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 21 ~~~~~~~~~~~~~~~~~~~Cw~C~~~~-----~~~~~fC~~C~~~qp~~ 64 (237)
.+....+.|.......-..|++|+..+ ..+...||.|+.-.+..
T Consensus 22 ~~~~~~~~~~~~p~~lw~kc~~C~~~~~~~~l~~~~~vcp~c~~h~rlt 70 (296)
T CHL00174 22 MYDTKYSWNTQKYKHLWVQCENCYGLNYKKFLKSKMNICEQCGYHLKMS 70 (296)
T ss_pred ccccccccCCCCCCCCeeECCCccchhhHHHHHHcCCCCCCCCCCcCCC
Confidence 333444555443333455899999976 36678999999866654
No 104
>PF13877 RPAP3_C: Potential Monad-binding region of RPAP3
Probab=71.88 E-value=13 Score=27.68 Aligned_cols=49 Identities=27% Similarity=0.315 Sum_probs=35.5
Q ss_pred HHHHHHHHHh-cCcchhhHHHHhhcCCCcccc-ccCCCHHHHHHHHHHHHH
Q psy8325 118 SYLNKAYSIL-QNPLERGLYLLSLQNISIEED-TKGTDQKLLMEILMLNEE 166 (237)
Q Consensus 118 ~~In~AY~tL-~dp~~Ra~Yll~l~g~~~~e~-~~~~d~efLmeimE~rE~ 166 (237)
...-.+|+.+ ++|..|+.|+..+....+..= ...+++++|.++++.=..
T Consensus 8 ~eF~~~w~~~~~~~~~~~~yL~~i~p~~l~~if~~~l~~~~L~~il~~l~~ 58 (94)
T PF13877_consen 8 YEFERDWRRLKKDPEERYEYLKSIPPDSLPKIFKNSLEPEFLSEILEALNE 58 (94)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHhCChHHHHHHHHccCCHHHHHHHHHHHHH
Confidence 3456677777 889999999987754444322 458899999999887554
No 105
>PTZ00446 vacuolar sorting protein SNF7-like; Provisional
Probab=70.59 E-value=37 Score=29.33 Aligned_cols=69 Identities=26% Similarity=0.358 Sum_probs=51.5
Q ss_pred HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHHHHHHHHhhc
Q psy8325 156 LLMEILMLNEELDEASSEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVSKVRDKKNQL 230 (237)
Q Consensus 156 fLmeimE~rE~leea~~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~~ik~k~~~l 230 (237)
--..|+.++++++.. ..-...++.+|+............++-..|..+|.|-|||.+..+.+...+..|
T Consensus 25 ~~~AIl~Lk~~~~~L------~krq~~Le~kIe~e~~~Ak~~~~~~kk~~Al~~LkrKK~~E~ql~q~~~ql~nL 93 (191)
T PTZ00446 25 IYKAILKNREAIDAL------EKKQVQVEKKIKQLEIEAKQKVEQNQMSNAKILLKRKKLYEQEIENILNNRLTL 93 (191)
T ss_pred HHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346788888887765 333445666777766666666677888899999999999999999887766554
No 106
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=68.82 E-value=3.4 Score=29.20 Aligned_cols=11 Identities=27% Similarity=0.655 Sum_probs=6.0
Q ss_pred CCCccCcCCCc
Q psy8325 49 GKNLFCQYCSS 59 (237)
Q Consensus 49 ~~~~fC~~C~~ 59 (237)
...+.||+|+-
T Consensus 48 g~~Y~Cp~CGF 58 (61)
T COG2888 48 GNPYRCPKCGF 58 (61)
T ss_pred CCceECCCcCc
Confidence 44555665554
No 107
>PF03656 Pam16: Pam16; InterPro: IPR005341 The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM) []. In Saccharomyces cerevisiae (Baker's yeast), Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane [].; PDB: 2GUZ_J.
Probab=68.08 E-value=13 Score=29.97 Aligned_cols=53 Identities=19% Similarity=0.253 Sum_probs=35.7
Q ss_pred ChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcc
Q psy8325 68 NYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPL 131 (237)
Q Consensus 68 nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~ 131 (237)
.-..||||++. ++.++|.++|..|-..-+|++-++- - .=+.|..|.+.|...+
T Consensus 59 EA~~ILnv~~~--~~~eeI~k~y~~Lf~~Nd~~kGGSf--Y-------LQSKV~rAKErl~~El 111 (127)
T PF03656_consen 59 EARQILNVKEE--LSREEIQKRYKHLFKANDPSKGGSF--Y-------LQSKVFRAKERLEQEL 111 (127)
T ss_dssp HHHHHHT--G----SHHHHHHHHHHHHHHT-CCCTS-H--H-------HHHHHHHHHHHHHHHH
T ss_pred HHHHHcCCCCc--cCHHHHHHHHHHHHhccCCCcCCCH--H-------HHHHHHHHHHHHHHHH
Confidence 56789999994 6899999999999999999975431 1 1256777877776433
No 108
>PF14205 Cys_rich_KTR: Cysteine-rich KTR
Probab=68.04 E-value=3.5 Score=28.58 Aligned_cols=21 Identities=24% Similarity=0.573 Sum_probs=16.2
Q ss_pred CCCcCCCCC----------CCCCccCcCCCc
Q psy8325 39 KCWKCLKNL----------SGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~----------~~~~~fC~~C~~ 59 (237)
.||.|+... ....+|||.|..
T Consensus 6 ~CP~CgnKTR~kir~DT~LkNfPlyCpKCK~ 36 (55)
T PF14205_consen 6 LCPICGNKTRLKIREDTVLKNFPLYCPKCKQ 36 (55)
T ss_pred ECCCCCCccceeeecCceeccccccCCCCCc
Confidence 699999642 356799999976
No 109
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=67.23 E-value=3.5 Score=34.88 Aligned_cols=30 Identities=17% Similarity=0.390 Sum_probs=23.1
Q ss_pred CCCCcCCCCC-CCCCccCcCCCcccCCCCCCChhhhc
Q psy8325 38 LKCWKCLKNL-SGKNLFCQYCSSVQKPDSQSNYYDLF 73 (237)
Q Consensus 38 ~~Cw~C~~~~-~~~~~fC~~C~~~qp~~~~~nyf~ll 73 (237)
-.|+-||... ...+-.||.|++. ..+|+.|
T Consensus 135 ~vC~vCGy~~~ge~P~~CPiCga~------k~~F~~f 165 (166)
T COG1592 135 WVCPVCGYTHEGEAPEVCPICGAP------KEKFEKF 165 (166)
T ss_pred EEcCCCCCcccCCCCCcCCCCCCh------HHHhhcc
Confidence 3799999976 3677899999873 3677765
No 110
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=66.76 E-value=4.3 Score=30.90 Aligned_cols=26 Identities=19% Similarity=0.414 Sum_probs=21.0
Q ss_pred CCCcCCCCCC--CCCccCcCCCcccCCC
Q psy8325 39 KCWKCLKNLS--GKNLFCQYCSSVQKPD 64 (237)
Q Consensus 39 ~Cw~C~~~~~--~~~~fC~~C~~~qp~~ 64 (237)
.|++|+..+. ...+.|+.|+...+..
T Consensus 2 fC~~Cg~~l~~~~~~~~C~~C~~~~~~~ 29 (104)
T TIGR01384 2 FCPKCGSLMTPKNGVYVCPSCGYEKEKK 29 (104)
T ss_pred CCcccCcccccCCCeEECcCCCCccccc
Confidence 6999999873 4689999999876654
No 111
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=66.18 E-value=4.1 Score=28.66 Aligned_cols=25 Identities=16% Similarity=0.414 Sum_probs=18.6
Q ss_pred CCCCCcCCCC-C--------CCCCccCcCCCccc
Q psy8325 37 ELKCWKCLKN-L--------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~-~--------~~~~~fC~~C~~~q 61 (237)
.-.|++|+.. + ....+.||+|+-.-
T Consensus 25 ~F~CPnCG~~~I~RC~~CRk~~~~Y~CP~CGF~G 58 (59)
T PRK14890 25 KFLCPNCGEVIIYRCEKCRKQSNPYTCPKCGFEG 58 (59)
T ss_pred EeeCCCCCCeeEeechhHHhcCCceECCCCCCcC
Confidence 3479999987 3 36788899988643
No 112
>PRK12380 hydrogenase nickel incorporation protein HybF; Provisional
Probab=66.03 E-value=3.1 Score=32.70 Aligned_cols=23 Identities=22% Similarity=0.712 Sum_probs=17.1
Q ss_pred CCCCcCCCCCC--CCCccCcCCCcc
Q psy8325 38 LKCWKCLKNLS--GKNLFCQYCSSV 60 (237)
Q Consensus 38 ~~Cw~C~~~~~--~~~~fC~~C~~~ 60 (237)
..|..|+.... ...+.||.|+..
T Consensus 71 ~~C~~Cg~~~~~~~~~~~CP~Cgs~ 95 (113)
T PRK12380 71 AWCWDCSQVVEIHQHDAQCPHCHGE 95 (113)
T ss_pred EEcccCCCEEecCCcCccCcCCCCC
Confidence 37999998653 355669999864
No 113
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=65.58 E-value=3.1 Score=29.40 Aligned_cols=25 Identities=20% Similarity=0.581 Sum_probs=18.6
Q ss_pred CCCCCcCCCCC----CCCCccCcCCCccc
Q psy8325 37 ELKCWKCLKNL----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~~----~~~~~fC~~C~~~q 61 (237)
...|.+|+..+ ....|-||+|+.+.
T Consensus 9 ~~~CtSCg~~i~p~e~~v~F~CPnCGe~~ 37 (61)
T COG2888 9 PPVCTSCGREIAPGETAVKFPCPNCGEVE 37 (61)
T ss_pred CceeccCCCEeccCCceeEeeCCCCCcee
Confidence 46888998876 35678899998543
No 114
>PRK03681 hypA hydrogenase nickel incorporation protein; Validated
Probab=65.39 E-value=3.3 Score=32.63 Aligned_cols=23 Identities=17% Similarity=0.385 Sum_probs=16.7
Q ss_pred CCCcCCCCCCC--CC-ccCcCCCccc
Q psy8325 39 KCWKCLKNLSG--KN-LFCQYCSSVQ 61 (237)
Q Consensus 39 ~Cw~C~~~~~~--~~-~fC~~C~~~q 61 (237)
.|..|+..... .. +.||.|+...
T Consensus 72 ~C~~Cg~~~~~~~~~~~~CP~Cgs~~ 97 (114)
T PRK03681 72 WCETCQQYVTLLTQRVRRCPQCHGDM 97 (114)
T ss_pred EcccCCCeeecCCccCCcCcCcCCCC
Confidence 79999986643 23 5599998743
No 115
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=65.21 E-value=3.2 Score=25.84 Aligned_cols=23 Identities=26% Similarity=0.772 Sum_probs=16.2
Q ss_pred CCCCcCCCCC---------CCCCccCcCCCcc
Q psy8325 38 LKCWKCLKNL---------SGKNLFCQYCSSV 60 (237)
Q Consensus 38 ~~Cw~C~~~~---------~~~~~fC~~C~~~ 60 (237)
..|++|+... ....+.|++|+..
T Consensus 3 ~~CP~C~~~~~v~~~~~~~~~~~v~C~~C~~~ 34 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLGANGGKVRCGKCGHV 34 (38)
T ss_pred EECCCCCCEEEeCHHHcCCCCCEEECCCCCCE
Confidence 3699998832 1335899999874
No 116
>PRK04023 DNA polymerase II large subunit; Validated
Probab=65.14 E-value=11 Score=40.25 Aligned_cols=34 Identities=18% Similarity=0.244 Sum_probs=19.3
Q ss_pred cccccccc-cCCCCCCcCCCCCCCCCccCcCCCccc
Q psy8325 27 NLCTDTVK-HTELKCWKCLKNLSGKNLFCQYCSSVQ 61 (237)
Q Consensus 27 ~~~~~~~~-~~~~~Cw~C~~~~~~~~~fC~~C~~~q 61 (237)
.+|.+-.. .....|++|+... ....||+.|+...
T Consensus 627 RfCpsCG~~t~~frCP~CG~~T-e~i~fCP~CG~~~ 661 (1121)
T PRK04023 627 RKCPSCGKETFYRRCPFCGTHT-EPVYRCPRCGIEV 661 (1121)
T ss_pred ccCCCCCCcCCcccCCCCCCCC-CcceeCccccCcC
Confidence 45554222 2344788887753 3456777776644
No 117
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=64.43 E-value=4.3 Score=31.44 Aligned_cols=22 Identities=23% Similarity=0.431 Sum_probs=16.3
Q ss_pred CCCCcCCCCCC-------CCCccCcCCCc
Q psy8325 38 LKCWKCLKNLS-------GKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~-------~~~~fC~~C~~ 59 (237)
..|++|+.... .....|+.|+.
T Consensus 22 f~CP~Cge~~v~v~~~k~~~h~~C~~CG~ 50 (99)
T PRK14892 22 FECPRCGKVSISVKIKKNIAIITCGNCGL 50 (99)
T ss_pred eECCCCCCeEeeeecCCCcceEECCCCCC
Confidence 37999996432 25789999976
No 118
>COG0375 HybF Zn finger protein HypA/HybF (possibly regulating hydrogenase expression) [General function prediction only]
Probab=63.95 E-value=3.2 Score=33.04 Aligned_cols=22 Identities=23% Similarity=0.634 Sum_probs=17.0
Q ss_pred CCCcCCCCCC--CCCccCcCCCcc
Q psy8325 39 KCWKCLKNLS--GKNLFCQYCSSV 60 (237)
Q Consensus 39 ~Cw~C~~~~~--~~~~fC~~C~~~ 60 (237)
.||.|+.++. ...+.||.|++.
T Consensus 72 ~C~~C~~~~~~e~~~~~CP~C~s~ 95 (115)
T COG0375 72 WCLDCGQEVELEELDYRCPKCGSI 95 (115)
T ss_pred EeccCCCeecchhheeECCCCCCC
Confidence 7999998762 455679999864
No 119
>KOG4623|consensus
Probab=63.66 E-value=2.9 Score=41.08 Aligned_cols=26 Identities=27% Similarity=0.740 Sum_probs=20.3
Q ss_pred CCCCCCcCCCCC-----CCCCccCcCCCccc
Q psy8325 36 TELKCWKCLKNL-----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 36 ~~~~Cw~C~~~~-----~~~~~fC~~C~~~q 61 (237)
..+.||-|++.+ ..+.+.||+|..--
T Consensus 27 t~VnCwFCnk~t~vpyq~rNswTCpsCEQyN 57 (611)
T KOG4623|consen 27 TTVNCWFCNKKTEVPYQGRNSWTCPSCEQYN 57 (611)
T ss_pred ceEEEEEecCcceeccCCCCCCcCCcHHhhC
Confidence 456899999965 47889999996543
No 120
>COG4640 Predicted membrane protein [Function unknown]
Probab=63.30 E-value=4.1 Score=38.95 Aligned_cols=22 Identities=23% Similarity=0.513 Sum_probs=18.0
Q ss_pred CCCCcCCCCCCCCCccCcCCCc
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
..|++||..-.....+|++||+
T Consensus 2 ~fC~kcG~qk~Ed~~qC~qCG~ 23 (465)
T COG4640 2 KFCPKCGSQKAEDDVQCTQCGH 23 (465)
T ss_pred CcccccccccccccccccccCC
Confidence 4799999765667788999998
No 121
>PF09297 zf-NADH-PPase: NADH pyrophosphatase zinc ribbon domain; InterPro: IPR015376 This domain has a zinc ribbon structure and is often found between two NUDIX domains.; GO: 0016787 hydrolase activity, 0046872 metal ion binding; PDB: 1VK6_A 2GB5_A.
Probab=62.39 E-value=4 Score=24.65 Aligned_cols=24 Identities=21% Similarity=0.371 Sum_probs=13.6
Q ss_pred CCCCCcCCCCC----CCCCccCcCCCcc
Q psy8325 37 ELKCWKCLKNL----SGKNLFCQYCSSV 60 (237)
Q Consensus 37 ~~~Cw~C~~~~----~~~~~fC~~C~~~ 60 (237)
...|..|+.++ ......|++|+..
T Consensus 3 ~rfC~~CG~~t~~~~~g~~r~C~~Cg~~ 30 (32)
T PF09297_consen 3 HRFCGRCGAPTKPAPGGWARRCPSCGHE 30 (32)
T ss_dssp TSB-TTT--BEEE-SSSS-EEESSSS-E
T ss_pred CcccCcCCccccCCCCcCEeECCCCcCE
Confidence 34788999876 3567788888764
No 122
>PRK12495 hypothetical protein; Provisional
Probab=61.87 E-value=4.5 Score=35.73 Aligned_cols=25 Identities=20% Similarity=0.486 Sum_probs=20.6
Q ss_pred CCCCCcCCCCCC--CCCccCcCCCccc
Q psy8325 37 ELKCWKCLKNLS--GKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~~~--~~~~fC~~C~~~q 61 (237)
...|..|+.||+ ....||+.|+.+-
T Consensus 42 a~hC~~CG~PIpa~pG~~~Cp~CQ~~~ 68 (226)
T PRK12495 42 NAHCDECGDPIFRHDGQEFCPTCQQPV 68 (226)
T ss_pred hhhcccccCcccCCCCeeECCCCCCcc
Confidence 458999999984 6778999998764
No 123
>PF09779 Ima1_N: Ima1 N-terminal domain; InterPro: IPR018617 Members of this family of uncharacterised novel proteins have no known function.
Probab=61.39 E-value=5.1 Score=32.29 Aligned_cols=25 Identities=28% Similarity=0.752 Sum_probs=17.7
Q ss_pred CCCCcCCCCC-----CCC-CccCcCCCcccC
Q psy8325 38 LKCWKCLKNL-----SGK-NLFCQYCSSVQK 62 (237)
Q Consensus 38 ~~Cw~C~~~~-----~~~-~~fC~~C~~~qp 62 (237)
..||-|+... ... .+.|+.|+..--
T Consensus 1 v~C~fC~~~s~~~~~~~~~~w~C~~C~q~N~ 31 (131)
T PF09779_consen 1 VNCWFCGQNSKVPYDNRNSNWTCPHCEQYNG 31 (131)
T ss_pred CeeccCCCCCCCCCCCCCCeeECCCCCCccC
Confidence 3699999853 233 499999987433
No 124
>KOG2910|consensus
Probab=60.96 E-value=60 Score=28.23 Aligned_cols=57 Identities=14% Similarity=0.223 Sum_probs=44.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHHHHHHHHhhc
Q psy8325 174 EDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVSKVRDKKNQL 230 (237)
Q Consensus 174 ~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~~ik~k~~~l 230 (237)
+.|....+++...++.-......++..|+-+.|.-+|.+-||...|.....+++..+
T Consensus 26 dkl~qyqkR~e~~le~Er~~Ar~lird~rKdrAlllLKkKryQE~Ll~qt~~qL~nl 82 (209)
T KOG2910|consen 26 DKLKQYQKRLEKQLEAERQLARDLIRDGRKDRALLLLKKKRYQEELLTQTDNQLINL 82 (209)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345566666666666656666777788999999999999999999999998887655
No 125
>smart00685 DM14 Repeats in fly CG4713, worm Y37H9A.3 and human FLJ20241.
Probab=60.64 E-value=55 Score=22.96 Aligned_cols=43 Identities=19% Similarity=0.199 Sum_probs=34.6
Q ss_pred HHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhHHHHHHHHH
Q psy8325 182 LIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYFTTLVSKVR 224 (237)
Q Consensus 182 ~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~~~l~~~ik 224 (237)
.++++.+.+.....++=..||.+.|+..++-.+-|+...+.++
T Consensus 3 ~L~~R~~~yk~Aa~~AK~~gd~~kAr~~~R~~K~~~~~I~~~~ 45 (59)
T smart00685 3 LLQQRQEQYKQAALQAKRAGDEEKARRHLRIAKQFDDAIKAAR 45 (59)
T ss_pred HHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHhhHHHHHHHHH
Confidence 3566777777777778889999999999999998887776654
No 126
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=60.36 E-value=4 Score=33.89 Aligned_cols=23 Identities=17% Similarity=0.418 Sum_probs=18.3
Q ss_pred CCCCcCCCCC-----CCCCccCcCCCcc
Q psy8325 38 LKCWKCLKNL-----SGKNLFCQYCSSV 60 (237)
Q Consensus 38 ~~Cw~C~~~~-----~~~~~fC~~C~~~ 60 (237)
-.|++|+... ....|.||.||..
T Consensus 110 Y~Cp~c~~r~tf~eA~~~~F~Cp~Cg~~ 137 (158)
T TIGR00373 110 FICPNMCVRFTFNEAMELNFTCPRCGAM 137 (158)
T ss_pred EECCCCCcEeeHHHHHHcCCcCCCCCCE
Confidence 3799999854 2467999999985
No 127
>PF08772 NOB1_Zn_bind: Nin one binding (NOB1) Zn-ribbon like; InterPro: IPR014881 This entry corresponds to a zinc ribbon and is found on the RNA binding protein NOB1. ; PDB: 2CON_A.
Probab=60.33 E-value=4.2 Score=29.81 Aligned_cols=21 Identities=29% Similarity=0.801 Sum_probs=10.0
Q ss_pred CCCcCCCCCC-CCCccCcCCCc
Q psy8325 39 KCWKCLKNLS-GKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~-~~~~fC~~C~~ 59 (237)
.|..|-.... ....|||+||.
T Consensus 11 rC~aCf~~t~~~~k~FCp~CGn 32 (73)
T PF08772_consen 11 RCHACFKITKDMTKQFCPKCGN 32 (73)
T ss_dssp E-SSS--EES-SS--S-SSS--
T ss_pred EccccccCcCCCCceeCcccCC
Confidence 6888887664 57899999976
No 128
>PF02150 RNA_POL_M_15KD: RNA polymerases M/15 Kd subunit; InterPro: IPR001529 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. In archaebacteria, there is generally a single form of RNA polymerase which also consist of an oligomeric assemblage of 10 to 13 polypeptides. It has recently been shown [], [] that small subunits of about 15 kDa, found in polymerase types I and II, are highly conserved. These proteins contain a probable zinc finger in their N-terminal region and a C-terminal zinc ribbon domain (see IPR001222 from INTERPRO).; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 3H0G_I 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I ....
Probab=60.05 E-value=4 Score=25.46 Aligned_cols=26 Identities=23% Similarity=0.434 Sum_probs=16.4
Q ss_pred CCCcCCCCC-----CCCCccCcCCCcccCCC
Q psy8325 39 KCWKCLKNL-----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 39 ~Cw~C~~~~-----~~~~~fC~~C~~~qp~~ 64 (237)
.|+.|+..+ ......|..|+.+.|++
T Consensus 3 FCp~C~nlL~p~~~~~~~~~C~~C~Y~~~~~ 33 (35)
T PF02150_consen 3 FCPECGNLLYPKEDKEKRVACRTCGYEEPIS 33 (35)
T ss_dssp BETTTTSBEEEEEETTTTEEESSSS-EEE-S
T ss_pred eCCCCCccceEcCCCccCcCCCCCCCccCCC
Confidence 688998865 12222699999988765
No 129
>PF06044 DRP: Dam-replacing family; InterPro: IPR010324 Dam-replacing protein (DRP) is a restriction endonuclease that is flanked by pseudo-transposable small repeat elements. The replacement of Dam-methylase by DRP allows phase variation through slippage-like mechanisms in several pathogenic isolates of Neisseria meningitidis [].; PDB: 4ESJ_A.
Probab=60.04 E-value=3.4 Score=37.04 Aligned_cols=23 Identities=22% Similarity=0.716 Sum_probs=9.7
Q ss_pred CCCCCcCCCC-C-------CCCCccCcCCCc
Q psy8325 37 ELKCWKCLKN-L-------SGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~-~-------~~~~~fC~~C~~ 59 (237)
...||+|+.. + +...|+|++|+.
T Consensus 31 n~yCP~Cg~~~L~~f~NN~PVaDF~C~~C~e 61 (254)
T PF06044_consen 31 NMYCPNCGSKPLSKFENNRPVADFYCPNCNE 61 (254)
T ss_dssp H---TTT--SS-EE--------EEE-TTT--
T ss_pred CCcCCCCCChhHhhccCCCccceeECCCCch
Confidence 3479999985 3 467899999975
No 130
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=59.04 E-value=5 Score=34.06 Aligned_cols=24 Identities=17% Similarity=0.451 Sum_probs=18.8
Q ss_pred CCCCcCCCCC-----CCCCccCcCCCccc
Q psy8325 38 LKCWKCLKNL-----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 38 ~~Cw~C~~~~-----~~~~~fC~~C~~~q 61 (237)
-.|++|+... ....|.||.||..-
T Consensus 118 Y~Cp~C~~rytf~eA~~~~F~Cp~Cg~~L 146 (178)
T PRK06266 118 FFCPNCHIRFTFDEAMEYGFRCPQCGEML 146 (178)
T ss_pred EECCCCCcEEeHHHHhhcCCcCCCCCCCC
Confidence 3799999854 25689999998844
No 131
>PRK11827 hypothetical protein; Provisional
Probab=58.90 E-value=7.1 Score=27.55 Aligned_cols=27 Identities=19% Similarity=0.285 Sum_probs=22.4
Q ss_pred CCCcCCCCCC----CCCccCcCCCcccCCCC
Q psy8325 39 KCWKCLKNLS----GKNLFCQYCSSVQKPDS 65 (237)
Q Consensus 39 ~Cw~C~~~~~----~~~~fC~~C~~~qp~~~ 65 (237)
.||.|++++. ...+.|..|+..-|+..
T Consensus 10 aCP~ckg~L~~~~~~~~Lic~~~~laYPI~d 40 (60)
T PRK11827 10 ACPVCNGKLWYNQEKQELICKLDNLAFPLRD 40 (60)
T ss_pred ECCCCCCcCeEcCCCCeEECCccCeeccccC
Confidence 7999999872 46799999999888854
No 132
>COG4031 Predicted metal-binding protein [General function prediction only]
Probab=58.62 E-value=5.5 Score=34.53 Aligned_cols=61 Identities=18% Similarity=0.394 Sum_probs=35.9
Q ss_pred CCCcCCCCCCCCCccCcCCCcccCCC-----CCCChh-hhcC------CcccCCCCHHHHHHHHHHHHHHhCCCCC
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSSVQKPD-----SQSNYY-DLFD------LKLTYLINNVDLSKKFKQLQSQLHPDKF 102 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~~qp~~-----~~~nyf-~llg------l~~~f~id~~~Lk~~Yr~Lq~~~HPDk~ 102 (237)
.| .|+.++..+ .||+.|+.-..+. ....|| .++| ++...+-|- .|+.-|-.|+-..|.-+.
T Consensus 2 ~C-rCG~~l~~p-~~Cl~Cg~~~av~~~vy~~~~r~~v~~vge~D~~t~~~~gDedg-~lRNl~erlae~i~s~rp 74 (227)
T COG4031 2 IC-RCGAELSSP-AFCLNCGRRHAVGCGVYVSESRVFVRIVGERDFETFSIRGDEDG-SLRNLYERLAERIYSYRP 74 (227)
T ss_pred cc-ccCCccccc-chhcccCCcceeEeeeeccccEEEEEEeccccceeecccCCCcc-hHHHHHHHHHHHHHhcCc
Confidence 57 899988765 8999997632221 111122 1222 122444333 888888888888876554
No 133
>PF06827 zf-FPG_IleRS: Zinc finger found in FPG and IleRS; InterPro: IPR010663 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger domain found at the C-terminal in both DNA glycosylase/AP lyase enzymes and in isoleucyl tRNA synthetase. In these two types of enzymes, the C-terminal domain forms a zinc finger. Some related proteins may not bind zinc. DNA glycosylase/AP lyase enzymes are involved in base excision repair of DNA damaged by oxidation or by mutagenic agents. These enzymes have both DNA glycosylase activity (3.2.2 from EC) and AP lyase activity (4.2.99.18 from EC) []. Examples include formamidopyrimidine-DNA glycosylases (Fpg; MutM) and endonuclease VIII (Nei). Formamidopyrimidine-DNA glycosylases (Fpg, MutM) is a trifunctional DNA base excision repair enzyme that removes a wide range of oxidation-damaged bases (N-glycosylase activity; 3.2.2.23 from EC) and cleaves both the 3'- and 5'-phosphodiester bonds of the resulting apurinic/apyrimidinic site (AP lyase activity; 4.2.99.18 from EC). Fpg has a preference for oxidised purines, excising oxidized purine bases such as 7,8-dihydro-8-oxoguanine (8-oxoG). ITs AP (apurinic/apyrimidinic) lyase activity introduces nicks in the DNA strand, cleaving the DNA backbone by beta-delta elimination to generate a single-strand break at the site of the removed base with both 3'- and 5'-phosphates. Fpg is a monomer composed of 2 domains connected by a flexible hinge []. The two DNA-binding motifs (a zinc finger and the helix-two-turns-helix motifs) suggest that the oxidized base is flipped out from double-stranded DNA in the binding mode and excised by a catalytic mechanism similar to that of bifunctional base excision repair enzymes []. Fpg binds one ion of zinc at the C terminus, which contains four conserved and essential cysteines []. Endonuclease VIII (Nei) has the same enzyme activities as Fpg above, but with a preference for oxidized pyrimidines, such as thymine glycol, 5,6-dihydrouracil and 5,6-dihydrothymine [, ]. An Fpg-type zinc finger is also found at the C terminus of isoleucyl tRNA synthetase (6.1.1.5 from EC) [, ]. This enzyme catalyses the attachment of isoleucine to tRNA(Ile). As IleRS can inadvertently accommodate and process structurally similar amino acids such as valine, to avoid such errors it has two additional distinct tRNA(Ile)-dependent editing activities. One activity is designated as 'pre-transfer' editing and involves the hydrolysis of activated Val-AMP. The other activity is designated 'post-transfer' editing and involves deacylation of mischarged Val-tRNA(Ile) []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003824 catalytic activity; PDB: 1K82_C 1Q39_A 2OQ4_B 2OPF_A 1K3X_A 1K3W_A 1Q3B_A 2EA0_A 1Q3C_A 2XZF_A ....
Probab=58.60 E-value=2.9 Score=24.84 Aligned_cols=21 Identities=24% Similarity=0.773 Sum_probs=12.1
Q ss_pred CCCcCCCCC------CCCCccCcCCCc
Q psy8325 39 KCWKCLKNL------SGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~------~~~~~fC~~C~~ 59 (237)
.|..|+..+ ....++|+.|..
T Consensus 3 ~C~rC~~~~~~~~~~~r~~~~C~rCq~ 29 (30)
T PF06827_consen 3 KCPRCWNYIEDIGINGRSTYLCPRCQK 29 (30)
T ss_dssp B-TTT--BBEEEEETTEEEEE-TTTCC
T ss_pred cCccCCCcceEeEecCCCCeECcCCcC
Confidence 577787765 356799999954
No 134
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=57.92 E-value=5.2 Score=32.40 Aligned_cols=10 Identities=40% Similarity=0.866 Sum_probs=7.9
Q ss_pred CCCCcCCCCC
Q psy8325 38 LKCWKCLKNL 47 (237)
Q Consensus 38 ~~Cw~C~~~~ 47 (237)
..|.+|+...
T Consensus 71 ~~C~~CG~~~ 80 (135)
T PRK03824 71 LKCRNCGNEW 80 (135)
T ss_pred EECCCCCCEE
Confidence 3799999755
No 135
>COG2093 DNA-directed RNA polymerase, subunit E'' [Transcription]
Probab=57.51 E-value=7.6 Score=27.68 Aligned_cols=34 Identities=21% Similarity=0.550 Sum_probs=24.0
Q ss_pred CCCCCcCCCCCCCCCccCcCCCcccCCCCCCChhhhc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLF 73 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~ll 73 (237)
...|-+|+..++...-.||.|+.-+ -+.+||-++
T Consensus 4 ~kAC~~Ck~l~~~d~e~CP~Cgs~~---~te~W~G~~ 37 (64)
T COG2093 4 EKACKNCKRLTPEDTEICPVCGSTD---LTEEWFGLL 37 (64)
T ss_pred hHHHhhccccCCCCCccCCCCCCcc---cchhhccEE
Confidence 4579999998887778899998733 234554443
No 136
>COG3024 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=57.31 E-value=5.7 Score=28.42 Aligned_cols=24 Identities=25% Similarity=0.563 Sum_probs=16.1
Q ss_pred CCCCCcCCCCCC-----CCCccCc-CCCcc
Q psy8325 37 ELKCWKCLKNLS-----GKNLFCQ-YCSSV 60 (237)
Q Consensus 37 ~~~Cw~C~~~~~-----~~~~fC~-~C~~~ 60 (237)
...|+-|++++. ..-+||. .|..|
T Consensus 7 ~v~CP~Cgkpv~w~~~s~frPFCSkRCklI 36 (65)
T COG3024 7 TVPCPTCGKPVVWGEESPFRPFCSKRCKLI 36 (65)
T ss_pred cccCCCCCCcccccccCCcCcchhHhhhhc
Confidence 468999999872 3445665 56543
No 137
>TIGR00100 hypA hydrogenase nickel insertion protein HypA. In Hpylori, hypA mutant abolished hydrogenase activity and decrease in urease activity. Nickel supplementation in media restored urease activity and partial hydrogenase activity. HypA probably involved in inserting Ni in enzymes.
Probab=55.98 E-value=6.1 Score=31.07 Aligned_cols=24 Identities=21% Similarity=0.497 Sum_probs=17.6
Q ss_pred CCCCcCCCCCC--CCCccCcCCCccc
Q psy8325 38 LKCWKCLKNLS--GKNLFCQYCSSVQ 61 (237)
Q Consensus 38 ~~Cw~C~~~~~--~~~~fC~~C~~~q 61 (237)
..|.+|+.... ...+.||.|+...
T Consensus 71 ~~C~~Cg~~~~~~~~~~~CP~Cgs~~ 96 (115)
T TIGR00100 71 CECEDCSEEVSPEIDLYRCPKCHGIM 96 (115)
T ss_pred EEcccCCCEEecCCcCccCcCCcCCC
Confidence 37999998653 3456699998743
No 138
>PHA02942 putative transposase; Provisional
Probab=55.80 E-value=7.1 Score=37.00 Aligned_cols=26 Identities=19% Similarity=0.586 Sum_probs=20.1
Q ss_pred CCCCCCcCCCCC---CCCCccCcCCCccc
Q psy8325 36 TELKCWKCLKNL---SGKNLFCQYCSSVQ 61 (237)
Q Consensus 36 ~~~~Cw~C~~~~---~~~~~fC~~C~~~q 61 (237)
++..|+.|+... ....|.|+.||..-
T Consensus 324 TSq~Cs~CG~~~~~l~~r~f~C~~CG~~~ 352 (383)
T PHA02942 324 SSVSCPKCGHKMVEIAHRYFHCPSCGYEN 352 (383)
T ss_pred CCccCCCCCCccCcCCCCEEECCCCCCEe
Confidence 567899999854 34679999999854
No 139
>PRK12286 rpmF 50S ribosomal protein L32; Reviewed
Probab=54.55 E-value=7.7 Score=26.96 Aligned_cols=20 Identities=20% Similarity=0.494 Sum_probs=16.1
Q ss_pred CCCcCCCCCCCCCccCcCCCc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
.|++||.... +...|++||.
T Consensus 29 ~C~~CG~~~~-~H~vC~~CG~ 48 (57)
T PRK12286 29 ECPNCGEPKL-PHRVCPSCGY 48 (57)
T ss_pred ECCCCCCccC-CeEECCCCCc
Confidence 6999998754 6788999975
No 140
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=54.23 E-value=5.3 Score=32.61 Aligned_cols=25 Identities=20% Similarity=0.462 Sum_probs=17.6
Q ss_pred CCCCCcCCCCC----------CCCCccCcCCCccc
Q psy8325 37 ELKCWKCLKNL----------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~~----------~~~~~fC~~C~~~q 61 (237)
.-.|++|+... ....|.||.|+..-
T Consensus 99 ~Y~Cp~C~~~y~~~ea~~~~d~~~~f~Cp~Cg~~l 133 (147)
T smart00531 99 YYKCPNCQSKYTFLEANQLLDMDGTFTCPRCGEEL 133 (147)
T ss_pred EEECcCCCCEeeHHHHHHhcCCCCcEECCCCCCEE
Confidence 34799999753 12239999998854
No 141
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=54.16 E-value=3.1 Score=34.76 Aligned_cols=21 Identities=19% Similarity=0.477 Sum_probs=13.3
Q ss_pred cccccccccCCCCCCcCCCCC
Q psy8325 27 NLCTDTVKHTELKCWKCLKNL 47 (237)
Q Consensus 27 ~~~~~~~~~~~~~Cw~C~~~~ 47 (237)
.||+.-...+-..|++|+.++
T Consensus 29 ~fC~kCG~~tI~~Cp~C~~~I 49 (158)
T PF10083_consen 29 KFCSKCGAKTITSCPNCSTPI 49 (158)
T ss_pred HHHHHhhHHHHHHCcCCCCCC
Confidence 455544444455788888876
No 142
>PRK00418 DNA gyrase inhibitor; Reviewed
Probab=53.81 E-value=6.8 Score=27.86 Aligned_cols=23 Identities=30% Similarity=0.610 Sum_probs=16.0
Q ss_pred CCCCCcCCCCC-----CCCCccCc-CCCc
Q psy8325 37 ELKCWKCLKNL-----SGKNLFCQ-YCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~-----~~~~~fC~-~C~~ 59 (237)
...|+.|++++ +...+||+ .|..
T Consensus 6 ~v~CP~C~k~~~w~~~~~~rPFCS~RCk~ 34 (62)
T PRK00418 6 TVNCPTCGKPVEWGEISPFRPFCSKRCQL 34 (62)
T ss_pred cccCCCCCCcccccCCCCcCCcccHHHHh
Confidence 45899999986 24456886 5654
No 143
>cd04476 RPA1_DBD_C RPA1_DBD_C: A subfamily of OB folds corresponding to the C-terminal OB fold, the ssDNA-binding domain (DBD)-C, of human RPA1 (also called RPA70). RPA1 is the large subunit of Replication protein A (RPA). RPA is a nuclear ssDNA-binding protein (SSB) which appears to be involved in all aspects of DNA metabolism including replication, recombination, and repair. RPA also mediates specific interactions of various nuclear proteins. In animals, plants, and fungi, RPA is a heterotrimer with subunits of 70KDa (RPA1), 32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-C, RPA1 contains three other OB folds: DBD-A, DBD-B, and RPA1N. The major DNA binding activity of RPA is associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is involved in DNA binding and trimerization. It contains two structural insertions not found to date in other OB-folds: a zinc ribbon and a three-helix bundle. RPA1 DBD-C also contains a Cys4-type zinc-binding motif, which plays a role in the ssDNA binding fun
Probab=53.53 E-value=6.2 Score=32.43 Aligned_cols=26 Identities=19% Similarity=0.348 Sum_probs=20.6
Q ss_pred cCCCCCCcCCCCCCC---CCccCcCCCcc
Q psy8325 35 HTELKCWKCLKNLSG---KNLFCQYCSSV 60 (237)
Q Consensus 35 ~~~~~Cw~C~~~~~~---~~~fC~~C~~~ 60 (237)
.-+..|+.|++.+.. ..++|+.|+..
T Consensus 32 ~~Y~aC~~C~kkv~~~~~~~~~C~~C~~~ 60 (166)
T cd04476 32 WWYPACPGCNKKVVEEGNGTYRCEKCNKS 60 (166)
T ss_pred eEEccccccCcccEeCCCCcEECCCCCCc
Confidence 346689999998842 57999999885
No 144
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=53.18 E-value=10 Score=41.13 Aligned_cols=26 Identities=23% Similarity=0.528 Sum_probs=16.7
Q ss_pred CCCCcCCCCCCCCCccCcCCCcccCCC
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~~qp~~ 64 (237)
..||+|+.++. ..++|++|+.-.+++
T Consensus 680 ~fCP~CGs~te-~vy~CPsCGaev~~d 705 (1337)
T PRK14714 680 NRCPDCGTHTE-PVYVCPDCGAEVPPD 705 (1337)
T ss_pred ccCcccCCcCC-CceeCccCCCccCCC
Confidence 37777777653 345777777755544
No 145
>PHA00626 hypothetical protein
Probab=52.94 E-value=10 Score=26.53 Aligned_cols=23 Identities=22% Similarity=0.509 Sum_probs=17.1
Q ss_pred CCCcCCCC-C--------CCCCccCcCCCccc
Q psy8325 39 KCWKCLKN-L--------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 39 ~Cw~C~~~-~--------~~~~~fC~~C~~~q 61 (237)
.||+|+.. + .+..+.|+.|+.--
T Consensus 2 ~CP~CGS~~Ivrcg~cr~~snrYkCkdCGY~f 33 (59)
T PHA00626 2 SCPKCGSGNIAKEKTMRGWSDDYVCCDCGYND 33 (59)
T ss_pred CCCCCCCceeeeeceecccCcceEcCCCCCee
Confidence 58888873 2 25789999998854
No 146
>PRK01343 zinc-binding protein; Provisional
Probab=52.49 E-value=8.8 Score=26.83 Aligned_cols=24 Identities=25% Similarity=0.563 Sum_probs=16.8
Q ss_pred CCCCCcCCCCCC-CCCccCc-CCCcc
Q psy8325 37 ELKCWKCLKNLS-GKNLFCQ-YCSSV 60 (237)
Q Consensus 37 ~~~Cw~C~~~~~-~~~~fC~-~C~~~ 60 (237)
...|+.|++++. ..-.||+ .|..|
T Consensus 9 ~~~CP~C~k~~~~~~rPFCS~RC~~i 34 (57)
T PRK01343 9 TRPCPECGKPSTREAYPFCSERCRDI 34 (57)
T ss_pred CCcCCCCCCcCcCCCCcccCHHHhhh
Confidence 458999999863 4557886 56543
No 147
>PF09567 RE_MamI: MamI restriction endonuclease; InterPro: IPR019067 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry includes the MamI restriction endonuclease which recognises and cleaves GATNN^NNATC. ; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=52.42 E-value=7.9 Score=35.02 Aligned_cols=38 Identities=26% Similarity=0.520 Sum_probs=27.2
Q ss_pred CCCcCCCCCCCCCccCcCCCcccCCCCCCChhhhcCCcc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLFDLKL 77 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~llgl~~ 77 (237)
.|-+|+..+..-..-||+|++.--.......+ +||+..
T Consensus 84 ~C~~CGa~V~~~e~~Cp~C~StnI~r~DdSkW-l~~ir~ 121 (314)
T PF09567_consen 84 KCNNCGANVSRLEESCPNCGSTNIKRKDDSKW-LFGIRS 121 (314)
T ss_pred hhccccceeeehhhcCCCCCcccccccCCcce-eccccc
Confidence 79999999988889999998854333333333 676644
No 148
>TIGR01031 rpmF_bact ribosomal protein L32. This protein describes bacterial ribosomal protein L32. The noise cutoff is set low enough to include the equivalent protein from mitochondria and chloroplasts. No related proteins from the Archaea nor from the eukaryotic cytosol are detected by this model. This model is a fragment model; the putative L32 of some species shows similarity only toward the N-terminus.
Probab=52.21 E-value=8.2 Score=26.61 Aligned_cols=20 Identities=20% Similarity=0.409 Sum_probs=16.1
Q ss_pred CCCcCCCCCCCCCccCcCCCc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
.|++|+.... +...|+.||.
T Consensus 28 ~C~~cG~~~~-~H~vc~~cG~ 47 (55)
T TIGR01031 28 VCPNCGEFKL-PHRVCPSCGY 47 (55)
T ss_pred ECCCCCCccc-CeeECCccCe
Confidence 6999998654 6789999974
No 149
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=52.13 E-value=11 Score=40.85 Aligned_cols=9 Identities=22% Similarity=0.545 Sum_probs=4.2
Q ss_pred CCCcCCCCC
Q psy8325 39 KCWKCLKNL 47 (237)
Q Consensus 39 ~Cw~C~~~~ 47 (237)
.|++|+..+
T Consensus 694 ~CPsCGaev 702 (1337)
T PRK14714 694 VCPDCGAEV 702 (1337)
T ss_pred eCccCCCcc
Confidence 455554433
No 150
>TIGR00686 phnA alkylphosphonate utilization operon protein PhnA. The protein family includes an uncharacterized member designated phnA in Escherichia coli, part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage. This protein is not related to the characterized phosphonoacetate hydrolase designated PhnA by Kulakova, et al. (2001, 1997).
Probab=51.94 E-value=10 Score=29.94 Aligned_cols=28 Identities=25% Similarity=0.491 Sum_probs=20.8
Q ss_pred CCCCCcCCCCC---CCCCccCcCCCcccCCC
Q psy8325 37 ELKCWKCLKNL---SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 37 ~~~Cw~C~~~~---~~~~~fC~~C~~~qp~~ 64 (237)
.+.||.|+..- ....+.||.|++--++.
T Consensus 2 lp~CP~C~seytY~dg~~~iCpeC~~EW~~~ 32 (109)
T TIGR00686 2 LPPCPKCNSEYTYHDGTQLICPSCLYEWNEN 32 (109)
T ss_pred CCcCCcCCCcceEecCCeeECcccccccccc
Confidence 36899998854 36679999998855443
No 151
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=51.86 E-value=10 Score=23.51 Aligned_cols=21 Identities=24% Similarity=0.585 Sum_probs=14.6
Q ss_pred CCCcCCCCC------C-CCCccCcCCCc
Q psy8325 39 KCWKCLKNL------S-GKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~------~-~~~~fC~~C~~ 59 (237)
.|+.|+... . .....||.|+.
T Consensus 7 ~C~~Cg~~fe~~~~~~~~~~~~CP~Cg~ 34 (41)
T smart00834 7 RCEDCGHTFEVLQKISDDPLATCPECGG 34 (41)
T ss_pred EcCCCCCEEEEEEecCCCCCCCCCCCCC
Confidence 688888843 1 34567888887
No 152
>PF12172 DUF35_N: Rubredoxin-like zinc ribbon domain (DUF35_N); InterPro: IPR022002 This domain has no known function and is found in conserved hypothetical archaeal and bacterial proteins. The domain is duplicated in O53566 from SWISSPROT. The structure of a DUF35 representative reveals two long N-terminal helices followed by a rubredoxin-like zinc ribbon domain represented in this family and a C-terminal OB fold domain. Zinc is chelated by the four conserved cysteines in the alignment. ; PDB: 3IRB_A.
Probab=51.81 E-value=4.9 Score=24.97 Aligned_cols=21 Identities=24% Similarity=0.562 Sum_probs=12.2
Q ss_pred CCCcCCCCCCCCCccCcCCCc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
.|.+|+.....+..+|+.|+.
T Consensus 13 rC~~Cg~~~~pPr~~Cp~C~s 33 (37)
T PF12172_consen 13 RCRDCGRVQFPPRPVCPHCGS 33 (37)
T ss_dssp E-TTT--EEES--SEETTTT-
T ss_pred EcCCCCCEecCCCcCCCCcCc
Confidence 688999876556789999975
No 153
>PF01783 Ribosomal_L32p: Ribosomal L32p protein family; InterPro: IPR002677 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L32p is part of the 50S ribosomal subunit. This family is found in both prokaryotes and eukaryotes. Ribosomal protein L32 of yeast binds to and regulates the splicing and the translation of the transcript of its own gene [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit; PDB: 3PYT_2 3F1F_5 3PYV_2 3D5B_5 3MRZ_2 3D5D_5 3F1H_5 1VSP_Y 3PYR_2 3MS1_2 ....
Probab=51.56 E-value=9.7 Score=26.17 Aligned_cols=21 Identities=24% Similarity=0.494 Sum_probs=16.3
Q ss_pred CCCCcCCCCCCCCCccCcCCCc
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
..|++|+... .+...|++||.
T Consensus 27 ~~c~~cg~~~-~~H~vc~~cG~ 47 (56)
T PF01783_consen 27 VKCPNCGEPK-LPHRVCPSCGY 47 (56)
T ss_dssp EESSSSSSEE-STTSBCTTTBB
T ss_pred eeeccCCCEe-cccEeeCCCCe
Confidence 4799999754 46789999974
No 154
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=51.01 E-value=9.3 Score=34.59 Aligned_cols=25 Identities=16% Similarity=0.430 Sum_probs=20.8
Q ss_pred CCCCcCCCCCCCCCccCcCCCcccC
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSSVQK 62 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~~qp 62 (237)
-.|-+|+.....-.+.||+|+.+-.
T Consensus 355 ~~c~~cg~~~~~~~~~c~~c~~~~~ 379 (389)
T PRK11788 355 YRCRNCGFTARTLYWHCPSCKAWET 379 (389)
T ss_pred EECCCCCCCCccceeECcCCCCccC
Confidence 3699999887788899999998763
No 155
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=50.98 E-value=8.9 Score=30.27 Aligned_cols=23 Identities=26% Similarity=0.608 Sum_probs=16.5
Q ss_pred CCCCcCCCCCC--CCCcc-CcCCCcc
Q psy8325 38 LKCWKCLKNLS--GKNLF-CQYCSSV 60 (237)
Q Consensus 38 ~~Cw~C~~~~~--~~~~f-C~~C~~~ 60 (237)
..|..|+.... ...++ ||.|+..
T Consensus 72 ~~C~~Cg~~~~~~~~~~~~CP~Cgs~ 97 (117)
T PRK00564 72 LECKDCSHVFKPNALDYGVCEKCHSK 97 (117)
T ss_pred EEhhhCCCccccCCccCCcCcCCCCC
Confidence 37999998653 23455 9999874
No 156
>PF08274 PhnA_Zn_Ribbon: PhnA Zinc-Ribbon ; InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=50.20 E-value=7.1 Score=23.73 Aligned_cols=22 Identities=32% Similarity=0.653 Sum_probs=12.0
Q ss_pred CCCCcCCCCC---CCCCccCcCCCc
Q psy8325 38 LKCWKCLKNL---SGKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~---~~~~~fC~~C~~ 59 (237)
+.|+.|+..- ....+.|+.|+.
T Consensus 3 p~Cp~C~se~~y~D~~~~vCp~C~~ 27 (30)
T PF08274_consen 3 PKCPLCGSEYTYEDGELLVCPECGH 27 (30)
T ss_dssp ---TTT-----EE-SSSEEETTTTE
T ss_pred CCCCCCCCcceeccCCEEeCCcccc
Confidence 5799998854 467899999975
No 157
>COG1096 Predicted RNA-binding protein (consists of S1 domain and a Zn-ribbon domain) [Translation, ribosomal structure and biogenesis]
Probab=50.15 E-value=8 Score=33.30 Aligned_cols=25 Identities=28% Similarity=0.695 Sum_probs=20.7
Q ss_pred CCCCCcCCCCC--CCCCccCcCCCccc
Q psy8325 37 ELKCWKCLKNL--SGKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~~--~~~~~fC~~C~~~q 61 (237)
...|.+|+.++ ....+.||.|+..-
T Consensus 149 ~A~CsrC~~~L~~~~~~l~Cp~Cg~tE 175 (188)
T COG1096 149 YARCSRCRAPLVKKGNMLKCPNCGNTE 175 (188)
T ss_pred EEEccCCCcceEEcCcEEECCCCCCEE
Confidence 34799999987 57889999998854
No 158
>TIGR00201 comF comF family protein. This protein is found in species that do (Bacillus subtilis, Haemophilus influenzae) or do not (E. coli, Borrelia burgdorferi) have described systems for natural transformation with exogenous DNA. It is involved in competence for transformation in Bacillus subtilis.
Probab=50.05 E-value=8.5 Score=32.42 Aligned_cols=21 Identities=24% Similarity=0.615 Sum_probs=17.0
Q ss_pred CCcCCCCCCCCCccCcCCCcc
Q psy8325 40 CWKCLKNLSGKNLFCQYCSSV 60 (237)
Q Consensus 40 Cw~C~~~~~~~~~fC~~C~~~ 60 (237)
|+.|+.++.....+|+.|..-
T Consensus 1 C~~C~~~~~~~~~~C~~C~~~ 21 (190)
T TIGR00201 1 CSLCGRPYQSVHALCRQCGSW 21 (190)
T ss_pred CCccccccccccCCchhhCCc
Confidence 899999875567899999763
No 159
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=49.94 E-value=9.3 Score=23.96 Aligned_cols=24 Identities=25% Similarity=0.740 Sum_probs=17.1
Q ss_pred CCCCcCCCCC---------CCCCccCcCCCccc
Q psy8325 38 LKCWKCLKNL---------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 38 ~~Cw~C~~~~---------~~~~~fC~~C~~~q 61 (237)
..|++|+... .....-|+.|+.+-
T Consensus 3 i~CP~C~~~f~v~~~~l~~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 3 ITCPNCQTRFRVPDDKLPAGGRKVRCPKCGHVF 35 (37)
T ss_pred EECCCCCceEEcCHHHcccCCcEEECCCCCcEe
Confidence 3699998742 35578899998753
No 160
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=49.72 E-value=11 Score=38.90 Aligned_cols=27 Identities=26% Similarity=0.559 Sum_probs=19.6
Q ss_pred CCCCcCCCCC----CCCCccCcCCCcccCCC
Q psy8325 38 LKCWKCLKNL----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 38 ~~Cw~C~~~~----~~~~~fC~~C~~~qp~~ 64 (237)
..|++|+.++ ....++|+.|+.-+|+.
T Consensus 445 ~~Cp~Cd~~lt~H~~~~~L~CH~Cg~~~~~p 475 (730)
T COG1198 445 AECPNCDSPLTLHKATGQLRCHYCGYQEPIP 475 (730)
T ss_pred ccCCCCCcceEEecCCCeeEeCCCCCCCCCC
Confidence 3788888765 46778888888876654
No 161
>PRK12366 replication factor A; Reviewed
Probab=49.45 E-value=7.8 Score=39.19 Aligned_cols=26 Identities=23% Similarity=0.680 Sum_probs=20.8
Q ss_pred CCCCCcCCCCCC--CCCccCcCCCcccC
Q psy8325 37 ELKCWKCLKNLS--GKNLFCQYCSSVQK 62 (237)
Q Consensus 37 ~~~Cw~C~~~~~--~~~~fC~~C~~~qp 62 (237)
...||.|++.+. ...+.|+.|+...|
T Consensus 532 y~aCp~CnkKv~~~~g~~~C~~c~~~~p 559 (637)
T PRK12366 532 LYLCPNCRKRVEEVDGEYICEFCGEVEP 559 (637)
T ss_pred EecccccCeEeEcCCCcEECCCCCCCCC
Confidence 568999999873 45689999998854
No 162
>TIGR00622 ssl1 transcription factor ssl1. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=48.86 E-value=11 Score=29.78 Aligned_cols=40 Identities=23% Similarity=0.429 Sum_probs=29.7
Q ss_pred CCCcCCCCCCCCCccCcCCCc--ccCCCCCCChhhhcCCccc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSS--VQKPDSQSNYYDLFDLKLT 78 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~--~qp~~~~~nyf~llgl~~~ 78 (237)
.|+.|+..+-.-...|+.|+. |.++.-...|.-||.+++=
T Consensus 3 ~CPrC~skvC~LP~~CpiCgLtLVss~HLARSyHHLfPl~~f 44 (112)
T TIGR00622 3 FCPQCRAKVCELPVECPICGLTLILSTHLARSYHHLFPLKAF 44 (112)
T ss_pred cCCCCCCCccCCCCcCCcCCCEEeccchHHHhhhccCCCccc
Confidence 588888877555678888876 4455556789999998863
No 163
>TIGR02642 phage_xxxx uncharacterized phage protein. This uncharacterized protein is found in prophage regions of Shewanella oneidensis MR-1, Vibrio vulnificus YJ016, Yersinia pseudotuberculosis IP 32953, and Aeromonas hydrophila ATCC7966. It appears to have regions of sequence similarity to phage lambda antitermination protein Q.
Probab=48.77 E-value=13 Score=32.06 Aligned_cols=59 Identities=14% Similarity=0.159 Sum_probs=40.3
Q ss_pred CCCCCcCCCCC--CCCCccCcCCCc---ccCCCCCCChhhhcCCcccCCCCHHHHHHHHHHHHHHh
Q psy8325 37 ELKCWKCLKNL--SGKNLFCQYCSS---VQKPDSQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQL 97 (237)
Q Consensus 37 ~~~Cw~C~~~~--~~~~~fC~~C~~---~qp~~~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~ 97 (237)
...|+.|++.- ....--|+.|+- +.+- ....+..||+..+..|..+.-+..|..+...+
T Consensus 99 ~~~C~~C~G~G~~i~~~~~C~~C~G~G~v~~~--~~~~~k~~g~~~g~~it~~~~~~~~~~~~~~~ 162 (186)
T TIGR02642 99 SCKCPRCRGTGLIQRRQRECDTCAGTGRFRPT--VEDLLKSFGVDSGAAIVLKDNWPQVVTSVRAQ 162 (186)
T ss_pred CCcCCCCCCeeEEecCCCCCCCCCCccEEeee--EEEEEEeeeccCCceeeHHHhHHHHHHHHHHH
Confidence 56899999863 222356888854 3322 34677889988777788877777777766543
No 164
>COG3877 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.53 E-value=7.5 Score=30.66 Aligned_cols=37 Identities=16% Similarity=0.399 Sum_probs=27.3
Q ss_pred CCCCCcCCCCCCCCCccCcCCCcccCCCCCCChhhhc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLF 73 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~ll 73 (237)
...||-|+.......+.|+.|+....-.=..+-|+.|
T Consensus 6 ~~~cPvcg~~~iVTeL~c~~~etTVrg~F~~s~F~~L 42 (122)
T COG3877 6 INRCPVCGRKLIVTELKCSNCETTVRGNFKMSKFEYL 42 (122)
T ss_pred CCCCCcccccceeEEEecCCCCceEecceeccccccc
Confidence 4589999998777789999998855443345666655
No 165
>PF02151 UVR: UvrB/uvrC motif; InterPro: IPR001943 During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products []. UvrB and UvrC share a common domain of around 35 amino acids, the so called UVR domain. This domain in UvrB can interact with the homologous domain in UvrC throughout a putative coiled coil structure. This interaction is important for the incision of the damaged strand [].; GO: 0003677 DNA binding, 0004518 nuclease activity, 0006289 nucleotide-excision repair; PDB: 3PXG_D 3PXI_C 1E52_B 1QOJ_B 2D7D_B 2NMV_B.
Probab=48.33 E-value=54 Score=20.26 Aligned_cols=30 Identities=27% Similarity=0.463 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHHhccccHHHHHHHHHHHH
Q psy8325 185 ATIDDLTKNVNASFEQKDFNQAKDLLIRMK 214 (237)
Q Consensus 185 ~~i~~~~~~l~~~f~~~d~~~A~~~l~rlk 214 (237)
..+.++...+..+-...||+.|..+=-++.
T Consensus 2 ~~i~~l~~~m~~a~~~~dfE~Aa~~Rd~i~ 31 (36)
T PF02151_consen 2 KLIKELEEKMEEAVENEDFEKAARLRDQIK 31 (36)
T ss_dssp HHHHHHHHHHHHHHHTT-CHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHcCHHHHHHHHHHHH
Confidence 467788888889999999999976544443
No 166
>PF11023 DUF2614: Protein of unknown function (DUF2614); InterPro: IPR020912 This entry describes proteins of unknown function, which are thought to be membrane proteins.; GO: 0005887 integral to plasma membrane
Probab=47.57 E-value=9.5 Score=30.27 Aligned_cols=24 Identities=25% Similarity=0.563 Sum_probs=19.3
Q ss_pred CCCCCCcCCCCC--CCCCccCcCCCc
Q psy8325 36 TELKCWKCLKNL--SGKNLFCQYCSS 59 (237)
Q Consensus 36 ~~~~Cw~C~~~~--~~~~~fC~~C~~ 59 (237)
....||+|++++ .....-|..|+.
T Consensus 68 v~V~CP~C~K~TKmLGr~D~CM~C~~ 93 (114)
T PF11023_consen 68 VQVECPNCGKQTKMLGRVDACMHCKE 93 (114)
T ss_pred eeeECCCCCChHhhhchhhccCcCCC
Confidence 456899999986 466679999986
No 167
>PRK10220 hypothetical protein; Provisional
Probab=47.23 E-value=13 Score=29.31 Aligned_cols=28 Identities=18% Similarity=0.353 Sum_probs=21.0
Q ss_pred CCCCCcCCCCC---CCCCccCcCCCcccCCC
Q psy8325 37 ELKCWKCLKNL---SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 37 ~~~Cw~C~~~~---~~~~~fC~~C~~~qp~~ 64 (237)
.+.||.|+..- ....+.||.|+.--++.
T Consensus 3 lP~CP~C~seytY~d~~~~vCpeC~hEW~~~ 33 (111)
T PRK10220 3 LPHCPKCNSEYTYEDNGMYICPECAHEWNDA 33 (111)
T ss_pred CCcCCCCCCcceEcCCCeEECCcccCcCCcc
Confidence 36899998864 36779999998855443
No 168
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=46.95 E-value=15 Score=36.08 Aligned_cols=29 Identities=28% Similarity=0.474 Sum_probs=21.8
Q ss_pred CCCCCCcCCCCC----CCCCccCcCCCcccCCC
Q psy8325 36 TELKCWKCLKNL----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 36 ~~~~Cw~C~~~~----~~~~~fC~~C~~~qp~~ 64 (237)
....|++|+.++ ....+.|+.|+..++++
T Consensus 221 ~~~~C~~C~~~l~~h~~~~~l~Ch~Cg~~~~~~ 253 (505)
T TIGR00595 221 YILCCPNCDVSLTYHKKEGKLRCHYCGYQEPIP 253 (505)
T ss_pred CccCCCCCCCceEEecCCCeEEcCCCcCcCCCC
Confidence 345788888876 36678899999887743
No 169
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=46.82 E-value=1.9e+02 Score=24.99 Aligned_cols=73 Identities=12% Similarity=0.145 Sum_probs=42.2
Q ss_pred CCccCcCCCcccCCCCCCChhhhcCCcccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcC
Q psy8325 50 KNLFCQYCSSVQKPDSQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQN 129 (237)
Q Consensus 50 ~~~fC~~C~~~qp~~~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~d 129 (237)
...+||+||.-- ..+.|. .++....++-...+..+++|.-|...-.-+. |.+...+.---|+....
T Consensus 47 ~V~vCP~CgyA~----~~~~F~--------~l~~~~~~~i~~~i~~~~~~~~~~~~Rt~~~--ai~~YkLAll~~~~~~~ 112 (214)
T PF09986_consen 47 EVWVCPHCGYAA----FEEDFE--------KLSPEQKEKIKENISSRWKPRDFSGERTLEE--AIESYKLALLCAQIKKE 112 (214)
T ss_pred eEEECCCCCCcc----cccccc--------cCCHHHHHHHHHHHHhhcccCCCCCCCCHHH--HHHHHHHHHHHHHHhCC
Confidence 345677776633 122333 2455566666677888899998876543333 34555555555777775
Q ss_pred cchhhHH
Q psy8325 130 PLERGLY 136 (237)
Q Consensus 130 p~~Ra~Y 136 (237)
+.....+
T Consensus 113 ~~s~~A~ 119 (214)
T PF09986_consen 113 KPSKKAG 119 (214)
T ss_pred CHHHHHH
Confidence 5544443
No 170
>PRK09710 lar restriction alleviation and modification protein; Reviewed
Probab=46.15 E-value=13 Score=26.61 Aligned_cols=28 Identities=18% Similarity=0.101 Sum_probs=19.8
Q ss_pred CCCCcCCCCC-------CCCCccCcCCCcccCCCC
Q psy8325 38 LKCWKCLKNL-------SGKNLFCQYCSSVQKPDS 65 (237)
Q Consensus 38 ~~Cw~C~~~~-------~~~~~fC~~C~~~qp~~~ 65 (237)
.+||.|+... ....++|..|+..-|...
T Consensus 7 KPCPFCG~~~~~v~~~~g~~~v~C~~CgA~~~~~~ 41 (64)
T PRK09710 7 KPCPFCGCPSVTVKAISGYYRAKCNGCESRTGYGG 41 (64)
T ss_pred cCCCCCCCceeEEEecCceEEEEcCCCCcCccccc
Confidence 4899999864 234578999988655443
No 171
>PF13453 zf-TFIIB: Transcription factor zinc-finger
Probab=45.60 E-value=8.5 Score=24.51 Aligned_cols=22 Identities=27% Similarity=0.617 Sum_probs=13.3
Q ss_pred CCCcCCCCCC------CCCccCcCCCcc
Q psy8325 39 KCWKCLKNLS------GKNLFCQYCSSV 60 (237)
Q Consensus 39 ~Cw~C~~~~~------~~~~fC~~C~~~ 60 (237)
.||+|+..+. ..-..|++|+-+
T Consensus 1 ~CP~C~~~l~~~~~~~~~id~C~~C~G~ 28 (41)
T PF13453_consen 1 KCPRCGTELEPVRLGDVEIDVCPSCGGI 28 (41)
T ss_pred CcCCCCcccceEEECCEEEEECCCCCeE
Confidence 4888877551 234457777653
No 172
>PRK04023 DNA polymerase II large subunit; Validated
Probab=45.19 E-value=14 Score=39.36 Aligned_cols=24 Identities=25% Similarity=0.450 Sum_probs=17.7
Q ss_pred cCCCCCCcCCCCCCCCCccCcCCCcc
Q psy8325 35 HTELKCWKCLKNLSGKNLFCQYCSSV 60 (237)
Q Consensus 35 ~~~~~Cw~C~~~~~~~~~fC~~C~~~ 60 (237)
.....|++|+..+ ..++||.||..
T Consensus 624 Vg~RfCpsCG~~t--~~frCP~CG~~ 647 (1121)
T PRK04023 624 IGRRKCPSCGKET--FYRRCPFCGTH 647 (1121)
T ss_pred ccCccCCCCCCcC--CcccCCCCCCC
Confidence 3455888888864 56888888875
No 173
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=45.12 E-value=15 Score=24.74 Aligned_cols=24 Identities=17% Similarity=0.355 Sum_probs=17.7
Q ss_pred CCCCCCcCCC-CC--CCCCccCcCCCc
Q psy8325 36 TELKCWKCLK-NL--SGKNLFCQYCSS 59 (237)
Q Consensus 36 ~~~~Cw~C~~-~~--~~~~~fC~~C~~ 59 (237)
....||+|+. -+ ....+.|+.|+.
T Consensus 19 ~~~fCP~Cg~~~m~~~~~r~~C~~Cgy 45 (50)
T PRK00432 19 KNKFCPRCGSGFMAEHLDRWHCGKCGY 45 (50)
T ss_pred ccCcCcCCCcchheccCCcEECCCcCC
Confidence 4458999988 33 356789999975
No 174
>PF14260 zf-C4pol: C4-type zinc-finger of DNA polymerase delta
Probab=44.97 E-value=25 Score=24.93 Aligned_cols=46 Identities=22% Similarity=0.527 Sum_probs=28.4
Q ss_pred CCcCCCCCCCCC-ccCcCCCcccCCCCCCChhhhcCCcccCCCCHHHHHHHHHHHHHH
Q psy8325 40 CWKCLKNLSGKN-LFCQYCSSVQKPDSQSNYYDLFDLKLTYLINNVDLSKKFKQLQSQ 96 (237)
Q Consensus 40 Cw~C~~~~~~~~-~fC~~C~~~qp~~~~~nyf~llgl~~~f~id~~~Lk~~Yr~Lq~~ 96 (237)
|..|+..+.... .+|+.|.. .|. ...|.++. ....++++|.++...
T Consensus 1 C~~C~~~~~~~~~~lC~~C~~-~~~---~~~~~l~~-------~~~~~E~~~~~l~~i 47 (73)
T PF14260_consen 1 CLVCGAKTQEGESPLCSNCRS-DPQ---QTIYNLLS-------RLRELERRFNELWTI 47 (73)
T ss_pred CCCCCCcCCCCCCCcCcccCc-ChH---HHHHHHHH-------HHHHHHHHHHHHHHH
Confidence 778999886444 99999976 322 33444442 244566666666543
No 175
>PF01155 HypA: Hydrogenase expression/synthesis hypA family; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions []. One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanocaldococcus jannaschii (Methanococcus jannaschii).; GO: 0016151 nickel ion binding, 0006464 protein modification process; PDB: 2KDX_A 3A44_D 3A43_B.
Probab=44.96 E-value=5.8 Score=31.05 Aligned_cols=22 Identities=23% Similarity=0.656 Sum_probs=15.2
Q ss_pred CCCcCCCCC--CCCCccCcCCCcc
Q psy8325 39 KCWKCLKNL--SGKNLFCQYCSSV 60 (237)
Q Consensus 39 ~Cw~C~~~~--~~~~~fC~~C~~~ 60 (237)
.|..|+... ....+.||.|+..
T Consensus 72 ~C~~Cg~~~~~~~~~~~CP~Cgs~ 95 (113)
T PF01155_consen 72 RCRDCGHEFEPDEFDFSCPRCGSP 95 (113)
T ss_dssp EETTTS-EEECHHCCHH-SSSSSS
T ss_pred ECCCCCCEEecCCCCCCCcCCcCC
Confidence 799999965 2445779999874
No 176
>PRK08351 DNA-directed RNA polymerase subunit E''; Validated
Probab=44.65 E-value=14 Score=26.13 Aligned_cols=33 Identities=21% Similarity=0.469 Sum_probs=22.3
Q ss_pred CCCCcCCCCCCCCCccCcCCCcccCCCCCCChhhhcCC
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLFDL 75 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~llgl 75 (237)
..|-+|+..+. .-.||.|+... -+.|||.++=+
T Consensus 4 kAC~~C~~i~~--~~~CP~Cgs~~---~T~~W~G~viI 36 (61)
T PRK08351 4 KACRHCHYITT--EDRCPVCGSRD---LSDEWFDLVII 36 (61)
T ss_pred hhhhhCCcccC--CCcCCCCcCCc---cccccccEEEE
Confidence 37999998764 33799998843 34567765443
No 177
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=44.62 E-value=14 Score=24.25 Aligned_cols=21 Identities=24% Similarity=0.518 Sum_probs=13.2
Q ss_pred CCCcCCCCC---CCCCccCcCCCc
Q psy8325 39 KCWKCLKNL---SGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~---~~~~~fC~~C~~ 59 (237)
.|-.|+..+ ......|+.|+.
T Consensus 4 ~C~~Cg~~~~~~~~~~irC~~CG~ 27 (44)
T smart00659 4 ICGECGRENEIKSKDVVRCRECGY 27 (44)
T ss_pred ECCCCCCEeecCCCCceECCCCCc
Confidence 466677654 245677777765
No 178
>PF08447 PAS_3: PAS fold; InterPro: IPR013655 The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs []. The PAS fold appears in archaea, eubacteria and eukarya. The PAS domain contains a sensory box, or S-box domain that occupies the central portion of the PAS domain but is more widely distributed. It is often tandemly repeated. Known prosthetic groups bound in the S-box domain include haem in the oxygen sensor FixL [], FAD in the redox potential sensor NifL [], and a 4-hydroxycinnamyl chromophore in photoactive yellow protein []. Proteins containing the domain often contain other regulatory domains such as response regulator or sensor histidine kinase domains. Other S-box proteins include phytochromes and the aryl hydrocarbon receptor nuclear translocator. This domain has been found in the gene product of the madA gene of the filamentous zygomycete fungus Phycomyces blakesleeanus. It has been shown that MadA encodes a blue-light photoreceptor for phototropism and other light responses. The gene is involved in the phototropic responses associated with sporangiophore growth; they exhibit phototropism by bending toward near-UV and blue wavelengths and away from far-UV wavelengths in a manner that is physiologically similar to plant phototropic responses [].; GO: 0005515 protein binding; PDB: 3NJA_D 3H9W_A 3GDI_B 3ICY_A 3EEH_A 3MR0_B.
Probab=44.35 E-value=6 Score=27.92 Aligned_cols=30 Identities=23% Similarity=0.712 Sum_probs=20.7
Q ss_pred CCChhhhcCCcccCCCCHHHH-HHHHHHHHHHhCCCC
Q psy8325 66 QSNYYDLFDLKLTYLINNVDL-SKKFKQLQSQLHPDK 101 (237)
Q Consensus 66 ~~nyf~llgl~~~f~id~~~L-k~~Yr~Lq~~~HPDk 101 (237)
+.+++++||+++. ++ ...+..+...+|||=
T Consensus 5 s~~~~~i~G~~~~------~~~~~~~~~~~~~ihpdD 35 (91)
T PF08447_consen 5 SDNFYEIFGYSPE------EIGKPDFEEWLERIHPDD 35 (91)
T ss_dssp -THHHHHHTS-HH------HHTCBEHHHHHHHB-TTT
T ss_pred eHHHHHHhCCCHH------HhccCCHHHHHhhcCHHH
Confidence 3579999999874 44 445667888999986
No 179
>PRK00423 tfb transcription initiation factor IIB; Reviewed
Probab=44.30 E-value=14 Score=33.78 Aligned_cols=26 Identities=27% Similarity=0.609 Sum_probs=20.3
Q ss_pred CCCCCcCCCC-C----CCCCccCcCCCcccC
Q psy8325 37 ELKCWKCLKN-L----SGKNLFCQYCSSVQK 62 (237)
Q Consensus 37 ~~~Cw~C~~~-~----~~~~~fC~~C~~~qp 62 (237)
...|+.|+.+ + .....+|..||.|-.
T Consensus 11 ~~~Cp~Cg~~~iv~d~~~Ge~vC~~CG~Vl~ 41 (310)
T PRK00423 11 KLVCPECGSDKLIYDYERGEIVCADCGLVIE 41 (310)
T ss_pred CCcCcCCCCCCeeEECCCCeEeecccCCccc
Confidence 4479999973 2 468899999999764
No 180
>COG1545 Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only]
Probab=43.57 E-value=12 Score=30.41 Aligned_cols=24 Identities=29% Similarity=0.563 Sum_probs=19.2
Q ss_pred CCCCCcCCCCCCCCCccCcCCCcc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSSV 60 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~~ 60 (237)
...|.+|++..-.+.-+|+.|+.-
T Consensus 29 g~kC~~CG~v~~PPr~~Cp~C~~~ 52 (140)
T COG1545 29 GTKCKKCGRVYFPPRAYCPKCGSE 52 (140)
T ss_pred EEEcCCCCeEEcCCcccCCCCCCC
Confidence 458999999765678899999873
No 181
>KOG4191|consensus
Probab=43.57 E-value=1.7e+02 Score=28.76 Aligned_cols=69 Identities=28% Similarity=0.365 Sum_probs=43.9
Q ss_pred chhhHHHHhhcCCCcccc--ccCCCHHHHHHHHHHHHHHhccCC--HHH---HHHH------HHHHHHHHHHHHHHHHHH
Q psy8325 131 LERGLYLLSLQNISIEED--TKGTDQKLLMEILMLNEELDEASS--EED---LEKL------QTLIQATIDDLTKNVNAS 197 (237)
Q Consensus 131 ~~Ra~Yll~l~g~~~~e~--~~~~d~efLmeimE~rE~leea~~--~~~---l~~l------~~~~~~~i~~~~~~l~~~ 197 (237)
..|-.|-|.-.|+--+++ -...|.|.|||+-+++-+|-.++. ... |-.+ .+++++.++.+-++|.+|
T Consensus 379 E~RLK~eL~~~Gll~ded~pasdddDEvlaeLR~lqaeLk~vS~~N~k~k~~Ll~la~eE~a~qe~~q~lddlDkqI~qa 458 (516)
T KOG4191|consen 379 ESRLKEELIAQGLLEDEDRPASDDDDEVLAELRKLQAELKAVSAHNRKKKHDLLRLAPEEMARQEFQQVLDDLDKQIEQA 458 (516)
T ss_pred HHHHHHHHHHhccccccCCCcccchHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 347777777777655444 234478999999999998887653 222 2111 345666666677777666
Q ss_pred hc
Q psy8325 198 FE 199 (237)
Q Consensus 198 f~ 199 (237)
|.
T Consensus 459 Yv 460 (516)
T KOG4191|consen 459 YV 460 (516)
T ss_pred HH
Confidence 64
No 182
>PRK14873 primosome assembly protein PriA; Provisional
Probab=43.46 E-value=17 Score=37.14 Aligned_cols=40 Identities=20% Similarity=0.391 Sum_probs=24.9
Q ss_pred CCCCCCcCCCCCC----CCCccCcCCCcccCCCCC----CChhhhcCC
Q psy8325 36 TELKCWKCLKNLS----GKNLFCQYCSSVQKPDSQ----SNYYDLFDL 75 (237)
Q Consensus 36 ~~~~Cw~C~~~~~----~~~~fC~~C~~~qp~~~~----~nyf~llgl 75 (237)
....|++|+.++. ...+.|+.|+...++..+ ...+..+|+
T Consensus 391 ~~~~C~~C~~~L~~h~~~~~l~Ch~CG~~~~p~~Cp~Cgs~~l~~~g~ 438 (665)
T PRK14873 391 TPARCRHCTGPLGLPSAGGTPRCRWCGRAAPDWRCPRCGSDRLRAVVV 438 (665)
T ss_pred CeeECCCCCCceeEecCCCeeECCCCcCCCcCccCCCCcCCcceeeec
Confidence 3457888888762 456889999886654322 234555554
No 183
>TIGR03338 phnR_burk phosphonate utilization associated transcriptional regulator. This family of proteins are members of the GntR family (pfam00392) containing an N-terminal helix-turn-helix (HTH) motif. This clade is found adjacent to or inside of operons for the degradation of 2-aminoethylphosphonate (AEP) in Polaromonas, Burkholderia, Ralstonia and Verminephrobacter.
Probab=43.04 E-value=1.2e+02 Score=25.33 Aligned_cols=52 Identities=17% Similarity=0.191 Sum_probs=35.1
Q ss_pred HHHHHHHHhcCcchhhHHHHhhcCCC-ccc--c--ccCCCHHHHHHHHHHHHHHhcc
Q psy8325 119 YLNKAYSILQNPLERGLYLLSLQNIS-IEE--D--TKGTDQKLLMEILMLNEELDEA 170 (237)
Q Consensus 119 ~In~AY~tL~dp~~Ra~Yll~l~g~~-~~e--~--~~~~d~efLmeimE~rE~leea 170 (237)
.+.+-|.+=..|++.|.-.|.-.|+- ... + -...+++.+.+++++|..||..
T Consensus 39 ~La~~lgVSRtpVReAL~~L~~eGlv~~~~~~G~~V~~~~~~~~~ei~~~R~~lE~~ 95 (212)
T TIGR03338 39 DIAARLGVSRGPVREAFRALEEAGLVRNEKNRGVFVREISLAEADEIYELRAVLDEI 95 (212)
T ss_pred HHHHHhCCChHHHHHHHHHHHHCCCEEEecCCCeEEecCCHHHHHHHHHHHHHHHHH
Confidence 35555666667777777766666642 221 1 2456788899999999999954
No 184
>PRK02935 hypothetical protein; Provisional
Probab=42.98 E-value=15 Score=28.88 Aligned_cols=25 Identities=24% Similarity=0.487 Sum_probs=19.7
Q ss_pred cCCCCCCcCCCCC--CCCCccCcCCCc
Q psy8325 35 HTELKCWKCLKNL--SGKNLFCQYCSS 59 (237)
Q Consensus 35 ~~~~~Cw~C~~~~--~~~~~fC~~C~~ 59 (237)
.....||+|++++ .....-|..|+.
T Consensus 68 avqV~CP~C~K~TKmLGrvD~CM~C~~ 94 (110)
T PRK02935 68 AVQVICPSCEKPTKMLGRVDACMHCNQ 94 (110)
T ss_pred ceeeECCCCCchhhhccceeecCcCCC
Confidence 3456899999986 467788999976
No 185
>PF00098 zf-CCHC: Zinc knuckle; InterPro: IPR001878 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the CysCysHisCys (CCHC) type zinc finger domains, and have the sequence: C-X2-C-X4-H-X4-C where X can be any amino acid, and number indicates the number of residues. These 18 residues CCHC zinc finger domains are mainly found in the nucleocapsid protein of retroviruses. It is required for viral genome packaging and for early infection process [, , ]. It is also found in eukaryotic proteins involved in RNA binding or single-stranded DNA binding []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding; PDB: 2L44_A 1A1T_A 1WWG_A 1U6P_A 1WWD_A 1WWE_A 1A6B_B 1F6U_A 1MFS_A 1NCP_C ....
Probab=42.46 E-value=25 Score=18.62 Aligned_cols=8 Identities=38% Similarity=1.103 Sum_probs=6.5
Q ss_pred CCCcCCCC
Q psy8325 39 KCWKCLKN 46 (237)
Q Consensus 39 ~Cw~C~~~ 46 (237)
.||+|+..
T Consensus 2 ~C~~C~~~ 9 (18)
T PF00098_consen 2 KCFNCGEP 9 (18)
T ss_dssp BCTTTSCS
T ss_pred cCcCCCCc
Confidence 69999875
No 186
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=41.73 E-value=17 Score=22.66 Aligned_cols=22 Identities=27% Similarity=0.818 Sum_probs=15.6
Q ss_pred CCCcCCCC-------C--CCCCccCcCCCcc
Q psy8325 39 KCWKCLKN-------L--SGKNLFCQYCSSV 60 (237)
Q Consensus 39 ~Cw~C~~~-------~--~~~~~fC~~C~~~ 60 (237)
.|++|+.. + .....-|+.|+.+
T Consensus 4 ~Cp~C~~~y~i~d~~ip~~g~~v~C~~C~~~ 34 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKIPPKGRKVRCSKCGHV 34 (36)
T ss_pred ECCCCCCEEeCCHHHCCCCCcEEECCCCCCE
Confidence 68888873 1 3556888888764
No 187
>PF07191 zinc-ribbons_6: zinc-ribbons; InterPro: IPR010807 This family consists of several short, hypothetical bacterial proteins of around 70 residues in length. Members of this family 8 highly conserved cysteine residues. The function of the family is unknown.; PDB: 2JRP_A 2JNE_A.
Probab=40.87 E-value=8.5 Score=28.02 Aligned_cols=24 Identities=25% Similarity=0.614 Sum_probs=12.8
Q ss_pred CCCCcCCCCC------CCCCccCcCCCccc
Q psy8325 38 LKCWKCLKNL------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 38 ~~Cw~C~~~~------~~~~~fC~~C~~~q 61 (237)
..||.|+.++ ..-.+||..|..+.
T Consensus 31 a~CPdC~~~Le~LkACGAvdYFC~~c~gLi 60 (70)
T PF07191_consen 31 AFCPDCGQPLEVLKACGAVDYFCNHCHGLI 60 (70)
T ss_dssp EE-TTT-SB-EEEEETTEEEEE-TTTT-EE
T ss_pred ccCCCcccHHHHHHHhcccceeeccCCcee
Confidence 3688888876 23568888887654
No 188
>COG4307 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.47 E-value=9 Score=35.02 Aligned_cols=25 Identities=24% Similarity=0.489 Sum_probs=21.0
Q ss_pred CCCCcCCCCCCCCCccCcCCCcccC
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSSVQK 62 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~~qp 62 (237)
-.|++|+.++.-...+|.+|++..-
T Consensus 4 FhC~~CgQ~v~FeN~~C~~Cg~~Lg 28 (349)
T COG4307 4 FHCPNCGQRVAFENSACLSCGSALG 28 (349)
T ss_pred ccCCCCCCeeeecchHHHhhhhHhh
Confidence 3799999999777899999988553
No 189
>COG5595 Zn-ribbon-containing, possibly nucleic-acid-binding protein [General function prediction only]
Probab=40.40 E-value=13 Score=32.43 Aligned_cols=30 Identities=20% Similarity=0.367 Sum_probs=22.7
Q ss_pred cccCCCCCCcCCCCC-------CCCCccCcCCCcccC
Q psy8325 33 VKHTELKCWKCLKNL-------SGKNLFCQYCSSVQK 62 (237)
Q Consensus 33 ~~~~~~~Cw~C~~~~-------~~~~~fC~~C~~~qp 62 (237)
+.+.+..||+|++.+ ....|.|..|+.|..
T Consensus 214 Aqek~r~CPsC~k~Wqlk~~i~d~fhfkcd~crlvsn 250 (256)
T COG5595 214 AQEKYRCCPSCGKDWQLKNPIFDTFHFKCDTCRLVSN 250 (256)
T ss_pred hhhccCCCCcccccceeccchhhheeeecccceeecc
Confidence 446788999999965 255678999987653
No 190
>PRK05580 primosome assembly protein PriA; Validated
Probab=40.17 E-value=20 Score=36.50 Aligned_cols=29 Identities=21% Similarity=0.352 Sum_probs=21.4
Q ss_pred CCCCCCcCCCCC----CCCCccCcCCCcccCCC
Q psy8325 36 TELKCWKCLKNL----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 36 ~~~~Cw~C~~~~----~~~~~fC~~C~~~qp~~ 64 (237)
....|++|+.++ ....+.|+.|+...++.
T Consensus 389 ~~~~C~~C~~~l~~h~~~~~l~Ch~Cg~~~~~~ 421 (679)
T PRK05580 389 WVAECPHCDASLTLHRFQRRLRCHHCGYQEPIP 421 (679)
T ss_pred CccCCCCCCCceeEECCCCeEECCCCcCCCCCC
Confidence 345788888876 35678899999877643
No 191
>KOG1818|consensus
Probab=39.94 E-value=29 Score=35.21 Aligned_cols=71 Identities=23% Similarity=0.394 Sum_probs=41.0
Q ss_pred cccccccc-ccchhH-HHhhhhcccccc--c---cccccc--cCCCCCCcCCCCCCCCCccCcCCCcccCCCC-------
Q psy8325 2 LRVRGIVQ-NLTVNS-LFSRIANATLQN--L---CTDTVK--HTELKCWKCLKNLSGKNLFCQYCSSVQKPDS------- 65 (237)
Q Consensus 2 ~~~~~~~~-~~~~~~-l~~~~~~~~~~~--~---~~~~~~--~~~~~Cw~C~~~~~~~~~fC~~C~~~qp~~~------- 65 (237)
|.+.|+++ -+.=|+ +|..-+.++--. . |..... ..+..|-+||. .||.+|....-+-+
T Consensus 137 lk~~g~~Fpe~~e~d~mf~~~~~pdW~D~~~C~rCr~~F~~~~rkHHCr~CG~------vFC~qcss~s~~lP~~Gi~~~ 210 (634)
T KOG1818|consen 137 LKGGGHVFPELDENDAMFDAETAPDWIDSEECLRCRVKFGLTNRKHHCRNCGQ------VFCGQCSSKSLTLPKLGIEKP 210 (634)
T ss_pred HhcCCccccccccchhhhcccCCcccccccccceeeeeeeeccccccccccch------hhccCccccccCccccccccc
Confidence 34678776 333334 888888766542 2 332222 23448999975 78888866443222
Q ss_pred ---CCChhhhcCCccc
Q psy8325 66 ---QSNYYDLFDLKLT 78 (237)
Q Consensus 66 ---~~nyf~llgl~~~ 78 (237)
...-|+++..+..
T Consensus 211 VRVCd~C~E~l~~~s~ 226 (634)
T KOG1818|consen 211 VRVCDSCYELLTRASV 226 (634)
T ss_pred ceehhhhHHHhhhccc
Confidence 3455677766543
No 192
>PRK05654 acetyl-CoA carboxylase subunit beta; Validated
Probab=39.29 E-value=13 Score=34.14 Aligned_cols=28 Identities=21% Similarity=0.343 Sum_probs=21.4
Q ss_pred CCCCCcCCCCC-----CCCCccCcCCCcccCCC
Q psy8325 37 ELKCWKCLKNL-----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 37 ~~~Cw~C~~~~-----~~~~~fC~~C~~~qp~~ 64 (237)
-..|++|+..+ ..+...||.|++-.+..
T Consensus 27 ~~~c~~c~~~~~~~~l~~~~~vc~~c~~h~rl~ 59 (292)
T PRK05654 27 WTKCPSCGQVLYRKELEANLNVCPKCGHHMRIS 59 (292)
T ss_pred eeECCCccchhhHHHHHhcCCCCCCCCCCeeCC
Confidence 44899999976 35668999999866653
No 193
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=39.05 E-value=22 Score=21.91 Aligned_cols=21 Identities=19% Similarity=0.303 Sum_probs=15.2
Q ss_pred CCCcCCCCCC--CCCccCcCCCc
Q psy8325 39 KCWKCLKNLS--GKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~--~~~~fC~~C~~ 59 (237)
.|..|+.... ...-.||.|++
T Consensus 4 ~C~~CG~i~~g~~~p~~CP~Cg~ 26 (34)
T cd00729 4 VCPVCGYIHEGEEAPEKCPICGA 26 (34)
T ss_pred ECCCCCCEeECCcCCCcCcCCCC
Confidence 5888988642 34568999987
No 194
>PRK14102 nifW nitrogenase stabilizing/protective protein; Provisional
Probab=38.94 E-value=81 Score=24.72 Aligned_cols=61 Identities=25% Similarity=0.497 Sum_probs=39.5
Q ss_pred CCCChhhhcCCcccC---CCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhc
Q psy8325 65 SQSNYYDLFDLKLTY---LINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQ 128 (237)
Q Consensus 65 ~~~nyf~llgl~~~f---~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~ 128 (237)
..-+||+.||+|-.. ++..=-|-++|.+.....+.+. ...++.+ .-...-..+.+||+...
T Consensus 13 sAEdFf~ff~v~YDp~vvnV~RLHILkrf~qyl~~~~~~~-~~~~e~~--~~~~yr~~L~~AY~dF~ 76 (105)
T PRK14102 13 DAEDYFQFFELPYDPTVVNVNRLHILKQFSQLIAEIDANF-PDLSEEE--KLEKYQLALEEAYQVFL 76 (105)
T ss_pred cHHHHHHHhCCCCCcchhhHHHHHHHHHHHHHHHHhcccc-CCCCHHH--HHHHHHHHHHHHHHHHc
Confidence 346999999997432 3333357788888887766554 2333333 33456678899998764
No 195
>KOG1984|consensus
Probab=38.93 E-value=16 Score=38.40 Aligned_cols=66 Identities=15% Similarity=0.447 Sum_probs=43.5
Q ss_pred cCCCCCCcCCCCC--------CCCCccCcCCCcccCCCCCCChhhhcCCcc-cCC-----------CCHHHHHHHHHHHH
Q psy8325 35 HTELKCWKCLKNL--------SGKNLFCQYCSSVQKPDSQSNYYDLFDLKL-TYL-----------INNVDLSKKFKQLQ 94 (237)
Q Consensus 35 ~~~~~Cw~C~~~~--------~~~~~fC~~C~~~qp~~~~~nyf~llgl~~-~f~-----------id~~~Lk~~Yr~Lq 94 (237)
+.+..||.|.+-+ ..+.|.|.-|+..-.+. .+||.-||=.- .-+ +|. ...+-|-+..
T Consensus 336 sgPvRC~RCkaYinPFmqF~~~gr~f~Cn~C~~~n~vp--~~yf~~L~~~grr~D~~erpEL~~Gt~df-vatk~Y~~~~ 412 (1007)
T KOG1984|consen 336 SGPVRCNRCKAYINPFMQFIDGGRKFICNFCGSKNQVP--DDYFNHLGPTGRRVDVEERPELCLGTVDF-VATKDYCRKT 412 (1007)
T ss_pred CCCcchhhhhhhcCcceEEecCCceEEecCCCccccCC--hhhcccCCCcccccccccCchhcccccce-eeehhhhhcC
Confidence 5577999999854 35679999998865543 47888887521 111 111 2346677777
Q ss_pred HHhCCCCCC
Q psy8325 95 SQLHPDKFS 103 (237)
Q Consensus 95 ~~~HPDk~~ 103 (237)
+-.||-.|.
T Consensus 413 k~p~ppafv 421 (1007)
T KOG1984|consen 413 KPPKPPAFV 421 (1007)
T ss_pred CCCCCceEE
Confidence 788888763
No 196
>KOG0724|consensus
Probab=38.76 E-value=40 Score=30.82 Aligned_cols=51 Identities=22% Similarity=0.284 Sum_probs=36.8
Q ss_pred CHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchh
Q psy8325 82 NNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLER 133 (237)
Q Consensus 82 d~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~R 133 (237)
+...+...|+......||++... .........+.+..|.+||.+|++...|
T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~k~i~ka~~i~~~~~~~ 55 (335)
T KOG0724|consen 5 SEDELRLAYREMALKSHPEKKSF-YEKLSLWTEEEFKKIEKALAILDDDEPR 55 (335)
T ss_pred cHHHHHHHHHHHhhhcCcHHHHH-HHHhhhhHHHHHHHHHHHHHHHhccccc
Confidence 45578999999999999999753 0111112346789999999999985543
No 197
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=38.64 E-value=18 Score=34.06 Aligned_cols=32 Identities=13% Similarity=0.288 Sum_probs=23.8
Q ss_pred CCCCCCcCCCCCCCCCccCcCCCcccCCCCCC
Q psy8325 36 TELKCWKCLKNLSGKNLFCQYCSSVQKPDSQS 67 (237)
Q Consensus 36 ~~~~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~ 67 (237)
..-.|-+||-..-.----||+|+..+.+.|..
T Consensus 353 ~~YRC~~CGF~a~~l~W~CPsC~~W~TikPir 384 (389)
T COG2956 353 PRYRCQNCGFTAHTLYWHCPSCRAWETIKPIR 384 (389)
T ss_pred CCceecccCCcceeeeeeCCCcccccccCCcc
Confidence 34489999987655667899998877665543
No 198
>PF11781 RRN7: RNA polymerase I-specific transcription initiation factor Rrn7; InterPro: IPR021752 Rrn7 is a transcription binding factor that associates strongly with both Rrn6 and Rrn11 to form a complex which itself binds the TATA-binding protein and is required for transcription by the core domain of the RNA PolI promoter [],[].
Probab=38.34 E-value=14 Score=23.23 Aligned_cols=23 Identities=17% Similarity=0.486 Sum_probs=18.3
Q ss_pred CCCcCCCCC---CCCCccCcCCCccc
Q psy8325 39 KCWKCLKNL---SGKNLFCQYCSSVQ 61 (237)
Q Consensus 39 ~Cw~C~~~~---~~~~~fC~~C~~~q 61 (237)
.|..|+..+ ....++|..|++++
T Consensus 10 ~C~~C~~~~~~~~dG~~yC~~cG~~~ 35 (36)
T PF11781_consen 10 PCPVCGSRWFYSDDGFYYCDRCGHQS 35 (36)
T ss_pred cCCCCCCeEeEccCCEEEhhhCceEc
Confidence 599998864 47789999999864
No 199
>PF07729 FCD: FCD domain; InterPro: IPR011711 Many bacterial transcription regulation proteins bind DNA through a helix-turn-helix (HTH) motif, which can be classified into subfamilies on the basis of sequence similarities. The HTH GntR family has many members distributed among diverse bacterial groups that regulate various biological processes. It was named GntR after the Bacillus subtilis repressor of the gluconate operon []. In general, these proteins contain a DNA-binding HTH domain at the N terminus, and an effector binding or oligomerisation domain at the C terminus. The winged-helix DNA-binding domain is well conserved in structure for the whole of the GntR family (IPR000524 from INTERPRO), and is similar in structure to other transcriptional regulator families. The C-terminal effector-binding and oligomerisation domains are more variable and are consequently used to define the subfamilies. Based on the sequence and structure of the C-terminal domains, the GtnR family can be divided into four major groups, as represented by FadR (IPR008920 from INTERPRO), HutC, MocR and YtrA, as well as some minor groups such as those represented by AraR and PlmA []. This entry represents the C-terminal ligand binding domain of many members of the GntR family. This domain probably binds to a range of effector molecules that regulate the transcription of genes through the action of the N-terminal DNA-binding domain. This domain is found in P45427 from SWISSPROT and P31460 from SWISSPROT that are regulators of sugar biosynthesis operons.; PDB: 3SXK_A 3SXY_A 3SXM_B 3SXZ_A 3FMS_A 2DI3_B 2HS5_A 3IHU_B 3C7J_A.
Probab=37.11 E-value=36 Score=24.82 Aligned_cols=9 Identities=33% Similarity=0.431 Sum_probs=4.1
Q ss_pred HHHHHHHHh
Q psy8325 160 ILMLNEELD 168 (237)
Q Consensus 160 imE~rE~le 168 (237)
++++|.-||
T Consensus 2 i~~~R~~Le 10 (125)
T PF07729_consen 2 IYEVRELLE 10 (125)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHH
Confidence 444444444
No 200
>PF03206 NifW: Nitrogen fixation protein NifW; InterPro: IPR004893 Nitrogenase is a complex metalloenzyme composed of two proteins designated the Fe-protein and the MoFe-protein. Apart from these two proteins, a number of accessory proteins are essential for the maturation and assembly of nitrogenase. Even though experimental evidence suggests that these accessory proteins are required for nitrogenase activity, the exact roles played by many of these proteins in the functions of nitrogenase are unclear []. Using yeast two-hybrid screening it has been shown that NifW can interact with itself as well as NifZ. ; GO: 0009399 nitrogen fixation
Probab=36.94 E-value=1.2e+02 Score=23.68 Aligned_cols=63 Identities=21% Similarity=0.343 Sum_probs=40.7
Q ss_pred CCCCChhhhcCCcccC---CCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCc
Q psy8325 64 DSQSNYYDLFDLKLTY---LINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNP 130 (237)
Q Consensus 64 ~~~~nyf~llgl~~~f---~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp 130 (237)
...-+||+.||||-+. .+..=-|=++|.+.....++. ...++.+ .-...-..+.+||+....-
T Consensus 12 ~sAEdFf~fF~V~YDp~vv~V~RLHILkrF~~yL~~~~~~--~~~~e~~--~~~~~R~~L~~AY~dFv~S 77 (105)
T PF03206_consen 12 SSAEDFFDFFGVPYDPKVVNVNRLHILKRFGQYLRAADFA--PGLSEEE--DWAAYRRALERAYQDFVTS 77 (105)
T ss_pred cCHHHHHHHhCCCcchhHHHHhhHHHHHHHHHHHHhccCC--CCCCHHH--HHHHHHHHHHHHHHHHhcC
Confidence 3456999999998432 233335778888888888665 3333333 2235567789999876543
No 201
>PRK09521 exosome complex RNA-binding protein Csl4; Provisional
Probab=36.82 E-value=16 Score=30.88 Aligned_cols=24 Identities=25% Similarity=0.717 Sum_probs=20.3
Q ss_pred CCCcCCCCC---CCCCccCcCCCcccC
Q psy8325 39 KCWKCLKNL---SGKNLFCQYCSSVQK 62 (237)
Q Consensus 39 ~Cw~C~~~~---~~~~~fC~~C~~~qp 62 (237)
.|..|+.++ ....+.|+.|+...+
T Consensus 151 ~~~~~g~~~~~~~~~~~~c~~~~~~e~ 177 (189)
T PRK09521 151 MCSRCRTPLVKKGENELKCPNCGNIET 177 (189)
T ss_pred EccccCCceEECCCCEEECCCCCCEEe
Confidence 699999976 458899999998775
No 202
>PF14446 Prok-RING_1: Prokaryotic RING finger family 1
Probab=36.65 E-value=22 Score=24.56 Aligned_cols=23 Identities=26% Similarity=0.587 Sum_probs=19.3
Q ss_pred CCCCCcCCCCC--CCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNL--SGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~--~~~~~fC~~C~~ 59 (237)
...|..|+.++ ...-.+|+.|++
T Consensus 5 ~~~C~~Cg~~~~~~dDiVvCp~Cga 29 (54)
T PF14446_consen 5 GCKCPVCGKKFKDGDDIVVCPECGA 29 (54)
T ss_pred CccChhhCCcccCCCCEEECCCCCC
Confidence 45799999988 577889999987
No 203
>PF07295 DUF1451: Protein of unknown function (DUF1451); InterPro: IPR009912 This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown.
Probab=36.44 E-value=21 Score=29.49 Aligned_cols=22 Identities=18% Similarity=0.468 Sum_probs=17.7
Q ss_pred CCCcCCCCC----CCCCccCcCCCcc
Q psy8325 39 KCWKCLKNL----SGKNLFCQYCSSV 60 (237)
Q Consensus 39 ~Cw~C~~~~----~~~~~fC~~C~~~ 60 (237)
.|-+|+..+ +..-+.||.|+..
T Consensus 114 ~C~~Cg~~~~~~~~~~l~~Cp~C~~~ 139 (146)
T PF07295_consen 114 VCENCGHEVELTHPERLPPCPKCGHT 139 (146)
T ss_pred ecccCCCEEEecCCCcCCCCCCCCCC
Confidence 899999965 4667889999763
No 204
>PRK11414 colanic acid/biofilm transcriptional regulator; Provisional
Probab=36.44 E-value=2.3e+02 Score=23.89 Aligned_cols=51 Identities=20% Similarity=0.211 Sum_probs=33.7
Q ss_pred HHHHHHHhcCcchhhHHHHhhcCCC-cccc----ccCCCHHHHHHHHHHHHHHhcc
Q psy8325 120 LNKAYSILQNPLERGLYLLSLQNIS-IEED----TKGTDQKLLMEILMLNEELDEA 170 (237)
Q Consensus 120 In~AY~tL~dp~~Ra~Yll~l~g~~-~~e~----~~~~d~efLmeimE~rE~leea 170 (237)
|.+-|.+=..|++-|.-.|.-.|.- +... ....+.+-+.+++++|..||..
T Consensus 40 La~~lgVSRtpVREAL~~L~~eGLV~~~~~~g~~v~~~~~~~~~ei~~~R~~lE~~ 95 (221)
T PRK11414 40 LAEQLGMSITPVREALLRLVSVNALSVAPAQAFTVPEVSKRQLDEINRIRYELELM 95 (221)
T ss_pred HHHHHCCCchhHHHHHHHHHHCCCEEecCCCceeecCCCHHHHHHHHHHHHHHHHH
Confidence 5556666666777776666655532 2211 2456778889999999999854
No 205
>KOG3795|consensus
Probab=36.03 E-value=17 Score=31.28 Aligned_cols=37 Identities=16% Similarity=0.407 Sum_probs=24.2
Q ss_pred cCCCCCCcCCCCCCCCCccCcCCCcccCCCCCCChhhhcCCc
Q psy8325 35 HTELKCWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLFDLK 76 (237)
Q Consensus 35 ~~~~~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~llgl~ 76 (237)
+....|..|++ +..|||=.|.. |++....+--.+.||
T Consensus 13 eGRs~C~~C~~---SRkFfCY~C~V--PV~vp~E~~P~vkLP 49 (230)
T KOG3795|consen 13 EGRSTCPGCKS---SRKFFCYDCRV--PVPVPGEFTPTVKLP 49 (230)
T ss_pred cccccCCCCCC---cceEEEEeecc--ccCcchhcccccccc
Confidence 34558999997 47899999977 433333444445554
No 206
>TIGR00515 accD acetyl-CoA carboxylase, carboxyl transferase, beta subunit. The enzyme acetyl-CoA carboxylase contains a biotin carboxyl carrier protein or domain, a biotin carboxylase, and a carboxyl transferase. This model represents the beta chain of the carboxyl transferase for cases in which the architecture of the protein is as in E. coli, in which the carboxyltransferase portion consists of two non-identical subnits, alpha and beta.
Probab=35.80 E-value=18 Score=33.06 Aligned_cols=28 Identities=29% Similarity=0.418 Sum_probs=21.9
Q ss_pred CCCCCcCCCCC-----CCCCccCcCCCcccCCC
Q psy8325 37 ELKCWKCLKNL-----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 37 ~~~Cw~C~~~~-----~~~~~fC~~C~~~qp~~ 64 (237)
-..|++|+..+ ..+...||.|++-.+..
T Consensus 26 ~~~c~~c~~~~~~~~l~~~~~vc~~c~~h~rl~ 58 (285)
T TIGR00515 26 WTKCPKCGQVLYTKELERNLEVCPKCDHHMRMD 58 (285)
T ss_pred eeECCCCcchhhHHHHHhhCCCCCCCCCcCcCC
Confidence 34899999976 35678999999877664
No 207
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=35.77 E-value=27 Score=21.45 Aligned_cols=20 Identities=25% Similarity=0.743 Sum_probs=10.3
Q ss_pred CCcCCCCC---CCCCccCcCCCc
Q psy8325 40 CWKCLKNL---SGKNLFCQYCSS 59 (237)
Q Consensus 40 Cw~C~~~~---~~~~~fC~~C~~ 59 (237)
|-.|+..+ .....-|+.|+.
T Consensus 3 C~~Cg~~~~~~~~~~irC~~CG~ 25 (32)
T PF03604_consen 3 CGECGAEVELKPGDPIRCPECGH 25 (32)
T ss_dssp ESSSSSSE-BSTSSTSSBSSSS-
T ss_pred CCcCCCeeEcCCCCcEECCcCCC
Confidence 55565543 244556666654
No 208
>KOG2906|consensus
Probab=35.75 E-value=13 Score=28.93 Aligned_cols=26 Identities=27% Similarity=0.420 Sum_probs=21.1
Q ss_pred CCCcCCCCC------CCCCccCcCCCcccCCC
Q psy8325 39 KCWKCLKNL------SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 39 ~Cw~C~~~~------~~~~~fC~~C~~~qp~~ 64 (237)
.||.|+..+ ....|+|..|..+-|+.
T Consensus 3 FCP~Cgn~Live~g~~~~rf~C~tCpY~~~I~ 34 (105)
T KOG2906|consen 3 FCPTCGNMLIVESGESCNRFSCRTCPYVFPIS 34 (105)
T ss_pred ccCCCCCEEEEecCCeEeeEEcCCCCceeeEe
Confidence 699999854 24679999999998876
No 209
>PF10071 DUF2310: Zn-ribbon-containing, possibly nucleic-acid-binding protein (DUF2310); InterPro: IPR016908 This group represents uncharacterised conserved proteins.
Probab=35.52 E-value=19 Score=32.56 Aligned_cols=29 Identities=21% Similarity=0.343 Sum_probs=21.9
Q ss_pred cCCCCCCcCCCCC-------CCCCccCcCCCcccCC
Q psy8325 35 HTELKCWKCLKNL-------SGKNLFCQYCSSVQKP 63 (237)
Q Consensus 35 ~~~~~Cw~C~~~~-------~~~~~fC~~C~~~qp~ 63 (237)
.....||+|++.+ ..-.|.|..|+-|.-+
T Consensus 218 e~~r~CP~Cg~~W~L~~plh~iFdFKCD~CRLVSnl 253 (258)
T PF10071_consen 218 EQARKCPSCGGDWRLKEPLHDIFDFKCDPCRLVSNL 253 (258)
T ss_pred hhCCCCCCCCCccccCCchhhceeccCCcceeeccc
Confidence 5677999999975 2456789999877633
No 210
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=35.17 E-value=24 Score=24.09 Aligned_cols=26 Identities=15% Similarity=0.386 Sum_probs=18.4
Q ss_pred CCCCCCcCCCCC----CCCCccCcCCCccc
Q psy8325 36 TELKCWKCLKNL----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 36 ~~~~Cw~C~~~~----~~~~~fC~~C~~~q 61 (237)
....||+|+..+ -..-..|..|+..+
T Consensus 18 k~~~CPrCG~gvfmA~H~dR~~CGkCgyTe 47 (51)
T COG1998 18 KNRFCPRCGPGVFMADHKDRWACGKCGYTE 47 (51)
T ss_pred ccccCCCCCCcchhhhcCceeEeccccceE
Confidence 345899999643 14568899998754
No 211
>PF11101 DUF2884: Protein of unknown function (DUF2884); InterPro: IPR021307 Some members in this bacterial family of proteins are annotated as YggN which currently has no known function.
Probab=35.15 E-value=1.9e+02 Score=25.34 Aligned_cols=46 Identities=11% Similarity=0.314 Sum_probs=32.8
Q ss_pred HHHHHHHHHHHHHhcc--ccHH-HHHHHHHHHHhHHHHHHHHHHHHhhc
Q psy8325 185 ATIDDLTKNVNASFEQ--KDFN-QAKDLLIRMKYFTTLVSKVRDKKNQL 230 (237)
Q Consensus 185 ~~i~~~~~~l~~~f~~--~d~~-~A~~~l~rlkY~~~l~~~ik~k~~~l 230 (237)
.++..+...|..-++. .+++ .|.....+++-+......+..++++|
T Consensus 174 ~~m~~l~~~ie~~~~~q~~~le~~a~~lC~~l~~L~~~E~~L~~~IP~L 222 (229)
T PF11101_consen 174 QRMEGLQQQIEQEMEAQAQELEQKAQALCDSLQQLDQQEQQLQQRIPEL 222 (229)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 4444444444444432 2333 68899999999999999999999887
No 212
>PRK08402 replication factor A; Reviewed
Probab=35.02 E-value=18 Score=34.06 Aligned_cols=28 Identities=14% Similarity=0.308 Sum_probs=21.7
Q ss_pred CCCCCCcCCCCCC----CCCccCcCCCcccCC
Q psy8325 36 TELKCWKCLKNLS----GKNLFCQYCSSVQKP 63 (237)
Q Consensus 36 ~~~~Cw~C~~~~~----~~~~fC~~C~~~qp~ 63 (237)
..+.|++|++.+. .....|+.|+.+.|.
T Consensus 211 ~y~aCp~CnKkv~~~~~~~~~~Ce~~~~v~p~ 242 (355)
T PRK08402 211 VYDACPECRRKVDYDPATDTWICPEHGEVEPI 242 (355)
T ss_pred eEecCCCCCeEEEEecCCCCEeCCCCCCcCcc
Confidence 3569999999873 357899999876654
No 213
>smart00709 Zpr1 Duplicated domain in the epidermal growth factor- and elongation factor-1alpha-binding protein Zpr1. Also present in archaeal proteins.
Probab=34.83 E-value=16 Score=30.58 Aligned_cols=21 Identities=24% Similarity=0.395 Sum_probs=15.4
Q ss_pred CCCcCCCCC---------------CCCCccCcCCCc
Q psy8325 39 KCWKCLKNL---------------SGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~---------------~~~~~fC~~C~~ 59 (237)
.|++|+.+. -...|.|+.||.
T Consensus 2 ~Cp~C~~~~~~~~~~~~IP~F~evii~sf~C~~CGy 37 (160)
T smart00709 2 DCPSCGGNGTTRMLLTSIPYFREVIIMSFECEHCGY 37 (160)
T ss_pred cCCCCCCCCEEEEEEecCCCcceEEEEEEECCCCCC
Confidence 599998642 134788999986
No 214
>COG1110 Reverse gyrase [DNA replication, recombination, and repair]
Probab=34.72 E-value=18 Score=38.69 Aligned_cols=22 Identities=32% Similarity=0.647 Sum_probs=16.6
Q ss_pred CCCCcCCCCCC----CCCccCcCCCc
Q psy8325 38 LKCWKCLKNLS----GKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~----~~~~fC~~C~~ 59 (237)
..||||+++++ .....|+.|-.
T Consensus 9 ~~CpNCGG~isseRL~~glpCe~CLp 34 (1187)
T COG1110 9 SSCPNCGGDISSERLEKGLPCERCLP 34 (1187)
T ss_pred ccCCCCCCcCcHHHHhcCCCchhccC
Confidence 48999999873 45566999944
No 215
>TIGR00375 conserved hypothetical protein TIGR00375. The member of this family from Methanococcus jannaschii, MJ0043, is considerably longer and appears to contain an intein N-terminal to the region of homology.
Probab=34.50 E-value=64 Score=30.68 Aligned_cols=37 Identities=14% Similarity=0.159 Sum_probs=26.6
Q ss_pred CCChhhhcCCc------ccCCCCHHHHHHHHHHHHHHhCCCCC
Q psy8325 66 QSNYYDLFDLK------LTYLINNVDLSKKFKQLQSQLHPDKF 102 (237)
Q Consensus 66 ~~nyf~llgl~------~~f~id~~~Lk~~Yr~Lq~~~HPDk~ 102 (237)
...|+.+..|+ +.-.+..+.++..|.+|...|-|.-.
T Consensus 290 rppy~~~iPL~ei~~~~~~~~~~~k~v~~~~~~l~~~fG~E~~ 332 (374)
T TIGR00375 290 RPPYVHLIPLAEVIGVGPKKGIFTKAVQSLWEKLKKAFGTEIA 332 (374)
T ss_pred CCCeeeeCCHHHHHhhhcCCCCccHHHHHHHHHHHHHhccHHH
Confidence 45677766654 33345677888999999999888765
No 216
>PRK13130 H/ACA RNA-protein complex component Nop10p; Reviewed
Probab=34.27 E-value=36 Score=23.66 Aligned_cols=24 Identities=29% Similarity=0.440 Sum_probs=17.4
Q ss_pred CCCCCcCCCCCCCCCccCcCCCcccC
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSSVQK 62 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~~qp 62 (237)
-..|.+|+.-.. .-.||.||....
T Consensus 5 mr~C~~CgvYTL--k~~CP~CG~~t~ 28 (56)
T PRK13130 5 IRKCPKCGVYTL--KEICPVCGGKTK 28 (56)
T ss_pred ceECCCCCCEEc--cccCcCCCCCCC
Confidence 347999987543 678999987543
No 217
>TIGR00340 zpr1_rel ZPR1-related zinc finger protein. A model ZPR1_znf (TIGR00310) has been created to describe the domain shared by this protein and ZPR1.
Probab=34.25 E-value=18 Score=30.44 Aligned_cols=10 Identities=20% Similarity=0.757 Sum_probs=7.9
Q ss_pred CCccCcCCCc
Q psy8325 50 KNLFCQYCSS 59 (237)
Q Consensus 50 ~~~fC~~C~~ 59 (237)
..|.|+.||.
T Consensus 27 ~sf~C~~CGy 36 (163)
T TIGR00340 27 STYICEKCGY 36 (163)
T ss_pred EEEECCCCCC
Confidence 4688999976
No 218
>PRK12496 hypothetical protein; Provisional
Probab=34.16 E-value=28 Score=29.12 Aligned_cols=21 Identities=29% Similarity=0.624 Sum_probs=16.0
Q ss_pred CCCcCCCCCC--CCCccCcCCCc
Q psy8325 39 KCWKCLKNLS--GKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~--~~~~fC~~C~~ 59 (237)
.|..|+.... ...-|||-||+
T Consensus 129 ~C~gC~~~~~~~~~~~~C~~CG~ 151 (164)
T PRK12496 129 VCKGCKKKYPEDYPDDVCEICGS 151 (164)
T ss_pred ECCCCCccccCCCCCCcCCCCCC
Confidence 5999998653 34468999996
No 219
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=34.04 E-value=23 Score=24.40 Aligned_cols=24 Identities=21% Similarity=0.630 Sum_probs=17.4
Q ss_pred CCCCcCCCCC------CCCCccCcCCCccc
Q psy8325 38 LKCWKCLKNL------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 38 ~~Cw~C~~~~------~~~~~fC~~C~~~q 61 (237)
..|+.|+..+ ...-.-|+.|++--
T Consensus 3 ~~CP~CG~~iev~~~~~GeiV~Cp~CGael 32 (54)
T TIGR01206 3 FECPDCGAEIELENPELGELVICDECGAEL 32 (54)
T ss_pred cCCCCCCCEEecCCCccCCEEeCCCCCCEE
Confidence 3799999865 24557899998743
No 220
>PF14319 Zn_Tnp_IS91: Transposase zinc-binding domain
Probab=33.85 E-value=24 Score=27.56 Aligned_cols=25 Identities=28% Similarity=0.567 Sum_probs=18.1
Q ss_pred CCCCCcCCCCC----CCCCccCcCCCccc
Q psy8325 37 ELKCWKCLKNL----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~~----~~~~~fC~~C~~~q 61 (237)
...|.+|+... +-..-+||+|+...
T Consensus 42 ~~~C~~Cg~~~~~~~SCk~R~CP~C~~~~ 70 (111)
T PF14319_consen 42 RYRCEDCGHEKIVYNSCKNRHCPSCQAKA 70 (111)
T ss_pred eeecCCCCceEEecCcccCcCCCCCCChH
Confidence 44899998853 45556999998754
No 221
>TIGR00310 ZPR1_znf ZPR1 zinc finger domain.
Probab=33.70 E-value=20 Score=30.91 Aligned_cols=9 Identities=22% Similarity=0.800 Sum_probs=7.3
Q ss_pred CccCcCCCc
Q psy8325 51 NLFCQYCSS 59 (237)
Q Consensus 51 ~~fC~~C~~ 59 (237)
.|.|+.||.
T Consensus 30 sf~C~~CGy 38 (192)
T TIGR00310 30 STICEHCGY 38 (192)
T ss_pred EEECCCCCC
Confidence 678999976
No 222
>PRK07218 replication factor A; Provisional
Probab=33.13 E-value=26 Score=33.86 Aligned_cols=24 Identities=17% Similarity=0.373 Sum_probs=19.9
Q ss_pred CCCCcCCCCCCCCCccCcCCCcccCC
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSSVQKP 63 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~~qp~ 63 (237)
..||.|++.+. ...|+.|+.+.|.
T Consensus 298 ~rCP~C~r~v~--~~~C~~hG~ve~~ 321 (423)
T PRK07218 298 ERCPECGRVIQ--KGQCRSHGAVEGE 321 (423)
T ss_pred ecCcCcccccc--CCcCCCCCCcCCe
Confidence 48999999884 3799999998664
No 223
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=33.07 E-value=30 Score=20.97 Aligned_cols=21 Identities=19% Similarity=0.414 Sum_probs=14.9
Q ss_pred CCCcCCCCC--CCCCccCcCCCc
Q psy8325 39 KCWKCLKNL--SGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~--~~~~~fC~~C~~ 59 (237)
.|..|+... ......||.|++
T Consensus 3 ~C~~CGy~y~~~~~~~~CP~Cg~ 25 (33)
T cd00350 3 VCPVCGYIYDGEEAPWVCPVCGA 25 (33)
T ss_pred ECCCCCCEECCCcCCCcCcCCCC
Confidence 477888753 235778999976
No 224
>PF04161 Arv1: Arv1-like family ; InterPro: IPR007290 Arv1 is a transmembrane protein, with potential zinc-binding motifs, that mediates sterol homeostasis. Its action is important in lipid homeostasis, which prevents free sterol toxicity []. Arv1 contains a homology domain (AHD), which consists of an N-terminal cysteine-rich subdomain with a putative zinc-binding motif, followed by a C-terminal subdomain of 33 amino acids. The C-terminal subdomain of the AHD is critical for the protein's function []. In yeast, Arv1p is important for the delivery of an early glycosylphosphatidylinositol GPI intermediate, GlcN-acylPI, to the first mannosyltransferase of GPI synthesis in the ER lumen []. It is important for the traffic of sterol in yeast and in humans. In eukaryotic cells, it may fuction in the sphingolipid metabolic pathway as a transporter of ceramides between the ER and Golgi [].
Probab=32.72 E-value=20 Score=30.96 Aligned_cols=51 Identities=20% Similarity=0.333 Sum_probs=29.5
Q ss_pred CCCcCCCCCC----------CCCccCcCCCcccCCCCCCChhhhcCCcccCCCCHHHHH-HHHHHHHHH
Q psy8325 39 KCWKCLKNLS----------GKNLFCQYCSSVQKPDSQSNYYDLFDLKLTYLINNVDLS-KKFKQLQSQ 96 (237)
Q Consensus 39 ~Cw~C~~~~~----------~~~~fC~~C~~~qp~~~~~nyf~llgl~~~f~id~~~Lk-~~Yr~Lq~~ 96 (237)
.|-+|+.|+. ..--.|++|+.+...=---|..-+| ||.=-+| ++||.+.=-
T Consensus 2 iCIeCg~~v~~Ly~~Ys~~~irLt~C~~C~~vaDkYiE~d~vil~-------IDLlLlK~~AYRHllfN 63 (208)
T PF04161_consen 2 ICIECGHPVKSLYRQYSPGNIRLTKCPNCGKVADKYIEYDNVILF-------IDLLLLKPQAYRHLLFN 63 (208)
T ss_pred EeccCCCcchhhhhccCCCcEEEeeccccCCcccceeccccHHHH-------HHHHHcchhhHHHhhcc
Confidence 4888888762 2235799998766221112222222 5665565 788887643
No 225
>PF03833 PolC_DP2: DNA polymerase II large subunit DP2; InterPro: IPR016033 DP2 is the large subunit of a two-subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit. This entry represents the N-terminal ~950 residue component of DP2.; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 3O59_X.
Probab=32.72 E-value=15 Score=38.50 Aligned_cols=19 Identities=16% Similarity=0.494 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHhCCCCCC
Q psy8325 85 DLSKKFKQLQSQLHPDKFS 103 (237)
Q Consensus 85 ~Lk~~Yr~Lq~~~HPDk~~ 103 (237)
.+.+++.++--.-.||.+.
T Consensus 715 ~~~~A~e~lg~r~~~~~~K 733 (900)
T PF03833_consen 715 EYDRALENLGERENPDELK 733 (900)
T ss_dssp -------------------
T ss_pred HHHHHHHhhccccchhhhh
Confidence 4556666665555566553
No 226
>PF08646 Rep_fac-A_C: Replication factor-A C terminal domain; InterPro: IPR013955 Replication factor A (RP-A) binds and subsequently stabilises single-stranded DNA intermediates and thus prevents complementary DNA from reannealing. It also plays an essential role in several cellular processes in DNA metabolism including replication, recombination and repair of DNA []. Replication factor-A protein is also known as Replication protein A 70 kDa DNA-binding subunit. This entry is found at the C terminus of Replication factor A.; PDB: 1L1O_F 3U50_C.
Probab=32.67 E-value=15 Score=29.59 Aligned_cols=24 Identities=25% Similarity=0.562 Sum_probs=16.6
Q ss_pred CCCCC--cCCCCCC---CCCccCcCCCcc
Q psy8325 37 ELKCW--KCLKNLS---GKNLFCQYCSSV 60 (237)
Q Consensus 37 ~~~Cw--~C~~~~~---~~~~fC~~C~~~ 60 (237)
.+.|+ .|++.+. ...++|++|+..
T Consensus 18 Y~aC~~~~C~kKv~~~~~~~y~C~~C~~~ 46 (146)
T PF08646_consen 18 YPACPNEKCNKKVTENGDGSYRCEKCNKT 46 (146)
T ss_dssp EEE-TSTTTS-B-EEETTTEEEETTTTEE
T ss_pred ECCCCCccCCCEeecCCCcEEECCCCCCc
Confidence 45899 9999763 457899999875
No 227
>COG3809 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.63 E-value=22 Score=26.61 Aligned_cols=23 Identities=26% Similarity=0.452 Sum_probs=15.7
Q ss_pred CCCcCCCCC------CCCCccCcCCCccc
Q psy8325 39 KCWKCLKNL------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 39 ~Cw~C~~~~------~~~~~fC~~C~~~q 61 (237)
.||+|+..+ ...--.||.|+.|-
T Consensus 3 lCP~C~v~l~~~~rs~vEiD~CPrCrGVW 31 (88)
T COG3809 3 LCPICGVELVMSVRSGVEIDYCPRCRGVW 31 (88)
T ss_pred ccCcCCceeeeeeecCceeeeCCccccEe
Confidence 699998854 23345799997654
No 228
>COG2835 Uncharacterized conserved protein [Function unknown]
Probab=32.24 E-value=25 Score=24.81 Aligned_cols=27 Identities=22% Similarity=0.454 Sum_probs=23.1
Q ss_pred CCCcCCCCC----CCCCccCcCCCcccCCCC
Q psy8325 39 KCWKCLKNL----SGKNLFCQYCSSVQKPDS 65 (237)
Q Consensus 39 ~Cw~C~~~~----~~~~~fC~~C~~~qp~~~ 65 (237)
.|+-|.+++ ....+.|+.|+..-|+..
T Consensus 10 aCP~~kg~L~~~~~~~~L~c~~~~~aYpI~d 40 (60)
T COG2835 10 ACPVCKGPLVYDEEKQELICPRCKLAYPIRD 40 (60)
T ss_pred eccCcCCcceEeccCCEEEecccCceeeccc
Confidence 799999987 477899999999888764
No 229
>PRK11032 hypothetical protein; Provisional
Probab=32.05 E-value=25 Score=29.50 Aligned_cols=22 Identities=32% Similarity=0.661 Sum_probs=18.0
Q ss_pred CCCCcCCCCC----CCCCccCcCCCc
Q psy8325 38 LKCWKCLKNL----SGKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~----~~~~~fC~~C~~ 59 (237)
..|-+|+..+ +..-.-||.|+.
T Consensus 125 LvC~~Cg~~~~~~~p~~i~pCp~C~~ 150 (160)
T PRK11032 125 LVCEKCHHHLAFYTPEVLPLCPKCGH 150 (160)
T ss_pred EEecCCCCEEEecCCCcCCCCCCCCC
Confidence 3899999965 567788999987
No 230
>PRK05978 hypothetical protein; Provisional
Probab=31.80 E-value=43 Score=27.77 Aligned_cols=37 Identities=19% Similarity=0.374 Sum_probs=23.2
Q ss_pred CCCCCcCCCC-C----CCCCccCcCCCcc---cCCCCCCChhhhc
Q psy8325 37 ELKCWKCLKN-L----SGKNLFCQYCSSV---QKPDSQSNYYDLF 73 (237)
Q Consensus 37 ~~~Cw~C~~~-~----~~~~~fC~~C~~~---qp~~~~~nyf~ll 73 (237)
...||+|+.- + .+-...|+.|+.= .|.+-..-||.++
T Consensus 33 ~grCP~CG~G~LF~g~Lkv~~~C~~CG~~~~~~~a~DgpAy~~i~ 77 (148)
T PRK05978 33 RGRCPACGEGKLFRAFLKPVDHCAACGEDFTHHRADDLPAYLVIV 77 (148)
T ss_pred cCcCCCCCCCcccccccccCCCccccCCccccCCccccCcchhHH
Confidence 4489999984 2 3667889999872 2222223365555
No 231
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=31.71 E-value=5.7e+02 Score=27.96 Aligned_cols=58 Identities=21% Similarity=0.243 Sum_probs=37.3
Q ss_pred ccCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHhh
Q psy8325 77 LTYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLSL 140 (237)
Q Consensus 77 ~~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~l 140 (237)
|.|..+.+.|+.+...+...+.- ...+.+.+++....++++|+.+......+.|-+..
T Consensus 596 pd~~~~ee~L~~~l~~~~~~l~~------~~~~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~ 653 (1201)
T PF12128_consen 596 PDYAASEEELRERLEQAEDQLQS------AEERQEELEKQLKQINKKIEELKREITQAEQELKQ 653 (1201)
T ss_pred chhhcChHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 44556777787776666555411 11223334556677899999999888888886543
No 232
>PRK06393 rpoE DNA-directed RNA polymerase subunit E''; Validated
Probab=31.65 E-value=26 Score=25.04 Aligned_cols=34 Identities=12% Similarity=0.347 Sum_probs=22.4
Q ss_pred CCCCCcCCCCCCCCCccCcCCCcccCCCCCCChhhhcCC
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLFDL 75 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~llgl 75 (237)
...|-+|+..+. .-.||.|+.- .-+.||.-+.-+
T Consensus 5 ~~AC~~C~~i~~--~~~Cp~Cgs~---~~S~~w~G~v~i 38 (64)
T PRK06393 5 YRACKKCKRLTP--EKTCPVHGDE---KTTTEWFGFLII 38 (64)
T ss_pred hhhHhhCCcccC--CCcCCCCCCC---cCCcCcceEEEE
Confidence 457999998763 3499999883 234455555544
No 233
>PRK03988 translation initiation factor IF-2 subunit beta; Validated
Probab=31.33 E-value=27 Score=28.51 Aligned_cols=28 Identities=18% Similarity=0.210 Sum_probs=21.8
Q ss_pred CCCCCcCCCCC-------CCCCccCcCCCcccCCC
Q psy8325 37 ELKCWKCLKNL-------SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 37 ~~~Cw~C~~~~-------~~~~~fC~~C~~~qp~~ 64 (237)
...|+.|+.|= ......|..||+..|+.
T Consensus 102 yVlC~~C~spdT~l~k~~r~~~l~C~ACGa~~~V~ 136 (138)
T PRK03988 102 YVICPECGSPDTKLIKEGRIWVLKCEACGAETPVK 136 (138)
T ss_pred cEECCCCCCCCcEEEEcCCeEEEEcccCCCCCcCC
Confidence 56899999972 23467899999988874
No 234
>PF01627 Hpt: Hpt domain; InterPro: IPR008207 Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. Signal transducing histidine kinases are the key elements in two-component signal transduction systems, which control complex processes such as the initiation of development in microorganisms [, ]. Examples of histidine kinases are EnvZ, which plays a central role in osmoregulation [], and CheA, which plays a central role in the chemotaxis system []. Histidine kinases usually have an N-terminal ligand-binding domain and a C-terminal kinase domain, but other domains may also be present. The kinase domain is responsible for the autophosphorylation of the histidine with ATP, the phosphotransfer from the kinase to an aspartate of the response regulator, and (with bifunctional enzymes) the phosphotransfer from aspartyl phosphate back to ADP or to water []. The kinase core has a unique fold, distinct from that of the Ser/Thr/Tyr kinase superfamily. HKs can be roughly divided into two classes: orthodox and hybrid kinases [, ]. Most orthodox HKs, typified by the Escherichia coli EnvZ protein, function as periplasmic membrane receptors and have a signal peptide and transmembrane segment(s) that separate the protein into a periplasmic N-terminal sensing domain and a highly conserved cytoplasmic C-terminal kinase core. Members of this family, however, have an integral membrane sensor domain. Not all orthodox kinases are membrane bound, e.g., the nitrogen regulatory kinase NtrB (GlnL) is a soluble cytoplasmic HK []. Hybrid kinases contain multiple phosphodonor and phosphoacceptor sites and use multi-step phospho-relay schemes instead of promoting a single phosphoryl transfer. In addition to the sensor domain and kinase core, they contain a CheY-like receiver domain and a His-containing phosphotransfer (HPt) domain. This entry represents a domain present at the N terminus in proteins which undergo autophosphorylation. The group includes, the gliding motility regulatory protein from Myxococcus xanthus and a number of bacterial chemotaxis proteins.; GO: 0004871 signal transducer activity, 0000160 two-component signal transduction system (phosphorelay); PDB: 3KYJ_A 3KYI_A 3IQT_A 1Y6D_A 2LD6_A 1TQG_A 2R25_A 1OXB_A 1QSP_B 1C03_B ....
Probab=30.83 E-value=1.5e+02 Score=20.46 Aligned_cols=27 Identities=22% Similarity=0.434 Sum_probs=13.0
Q ss_pred HHHHHHHHHHh---ccccHHHHHHHHHHHH
Q psy8325 188 DDLTKNVNASF---EQKDFNQAKDLLIRMK 214 (237)
Q Consensus 188 ~~~~~~l~~~f---~~~d~~~A~~~l~rlk 214 (237)
.+....|..++ ..+|++.....+.+||
T Consensus 11 ~~~~~~l~~~~~~~~~~d~~~l~~~~H~lk 40 (90)
T PF01627_consen 11 PEDLEQLEQALQALEQEDWEELRRLAHRLK 40 (90)
T ss_dssp HHHHHHHHHHHCSSHHCHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhHhhHHHHHHHHHHHh
Confidence 33333444444 3455555555555544
No 235
>PF13097 CENP-U: CENP-A nucleosome associated complex (NAC) subunit
Probab=30.77 E-value=3.4e+02 Score=23.18 Aligned_cols=63 Identities=13% Similarity=0.252 Sum_probs=42.0
Q ss_pred HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHhH-HHHHHHHHHHHhh
Q psy8325 154 QKLLMEILMLNEELDEASSEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMKYF-TTLVSKVRDKKNQ 229 (237)
Q Consensus 154 ~efLmeimE~rE~leea~~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlkY~-~~l~~~ik~k~~~ 229 (237)
.+|---++|.++.||.-.-...+..+-..+++++-.++.+. +.|.-|++= .++...|+++...
T Consensus 111 ~~FEk~~~eYkq~ieS~~cr~AI~~F~~~~keqL~~~i~ev-------------q~lK~lkrkNakv~~~i~kkrqr 174 (175)
T PF13097_consen 111 SAFEKTALEYKQSIESKICRKAINKFYSNFKEQLIEMIKEV-------------QELKNLKRKNAKVISDINKKRQR 174 (175)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHhhc
Confidence 47777888999988876556677777777776666666554 334555554 5666666666544
No 236
>COG1802 GntR Transcriptional regulators [Transcription]
Probab=30.66 E-value=3.2e+02 Score=23.23 Aligned_cols=52 Identities=21% Similarity=0.250 Sum_probs=38.7
Q ss_pred HHHHHHHHhcCcchhhHHHHhhcCCC-ccc--c--ccCCCHHHHHHHHHHHHHHhcc
Q psy8325 119 YLNKAYSILQNPLERGLYLLSLQNIS-IEE--D--TKGTDQKLLMEILMLNEELDEA 170 (237)
Q Consensus 119 ~In~AY~tL~dp~~Ra~Yll~l~g~~-~~e--~--~~~~d~efLmeimE~rE~leea 170 (237)
.|.+.|.+=.-|++.|.--|.-.|.- ... + -...+.+.+.+++++|..||..
T Consensus 44 ~La~~~gvSrtPVReAL~rL~~eGlv~~~p~rG~~V~~~~~~~~~ei~~~R~~lE~~ 100 (230)
T COG1802 44 ELAEELGVSRTPVREALRRLEAEGLVEIEPNRGAFVAPLSLAEAREIFEVRELLEGA 100 (230)
T ss_pred HHHHHhCCCCccHHHHHHHHHHCCCeEecCCCCCeeCCCCHHHHHHHHHHHHHHHHH
Confidence 36677788888888887777666632 221 2 3578899999999999999965
No 237
>PRK00241 nudC NADH pyrophosphatase; Reviewed
Probab=30.50 E-value=37 Score=30.30 Aligned_cols=28 Identities=18% Similarity=0.424 Sum_probs=21.0
Q ss_pred CCCCCCcCCCCC----CCCCccCcCCCcccCC
Q psy8325 36 TELKCWKCLKNL----SGKNLFCQYCSSVQKP 63 (237)
Q Consensus 36 ~~~~Cw~C~~~~----~~~~~fC~~C~~~qp~ 63 (237)
....|+.|+.++ ......|++|+.+.=|
T Consensus 98 ~~~fC~~CG~~~~~~~~~~~~~C~~c~~~~yp 129 (256)
T PRK00241 98 SHRFCGYCGHPMHPSKTEWAMLCPHCRERYYP 129 (256)
T ss_pred cCccccccCCCCeecCCceeEECCCCCCEECC
Confidence 456899999986 3556789999875544
No 238
>PF10005 DUF2248: Uncharacterized protein conserved in bacteria (DUF2248); InterPro: IPR011201 This is a family of uncharacterised bacterial proteins.
Probab=30.31 E-value=30 Score=32.52 Aligned_cols=21 Identities=29% Similarity=0.501 Sum_probs=13.9
Q ss_pred CCCcCCCCCCCCCccCcCCCc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
.|.+|+.++.-..-.|.+|+.
T Consensus 1 ~C~~Cg~~v~FeNt~C~~Cg~ 21 (343)
T PF10005_consen 1 SCPNCGQPVFFENTRCLSCGS 21 (343)
T ss_pred CCCCCCCcceeCCCccccCCc
Confidence 377777776555566777765
No 239
>PF06556 ASFV_p27: IAP-like protein p27 C-terminus; InterPro: IPR010549 This entry represents the C-terminal region of the African swine fever virus (ASFV) IAP-like protein p27. This domain is found in conjunction with IPR001370 from INTERPRO. It has been suggested that the domain may be incoded by the gene involved in aspects of infection in the arthropod host, ticks of the genus Ornithodoros [].
Probab=30.30 E-value=12 Score=29.42 Aligned_cols=34 Identities=18% Similarity=0.595 Sum_probs=26.0
Q ss_pred CCCCcCCC-CCCCCCccCcCCCcccCCCCCCChhhhcCCc
Q psy8325 38 LKCWKCLK-NLSGKNLFCQYCSSVQKPDSQSNYYDLFDLK 76 (237)
Q Consensus 38 ~~Cw~C~~-~~~~~~~fC~~C~~~qp~~~~~nyf~llgl~ 76 (237)
.-|-.|+- ++....-||..|..+- -.||..|+.+
T Consensus 94 tycmtcgie~i~kdenfc~acktln-----yk~yk~lnfs 128 (131)
T PF06556_consen 94 TYCMTCGIEQINKDENFCNACKTLN-----YKHYKMLNFS 128 (131)
T ss_pred EEEEeeCceecccchhhhhhhcccc-----hhheeeeeeE
Confidence 36999987 5678899999998754 5677777654
No 240
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=30.15 E-value=27 Score=31.26 Aligned_cols=27 Identities=26% Similarity=0.647 Sum_probs=20.2
Q ss_pred cCCCCCCcCCCCC----------CCCCccCcCCCccc
Q psy8325 35 HTELKCWKCLKNL----------SGKNLFCQYCSSVQ 61 (237)
Q Consensus 35 ~~~~~Cw~C~~~~----------~~~~~fC~~C~~~q 61 (237)
-....|-.|+-.+ ...-.|||.||.+.
T Consensus 195 l~g~~C~GC~m~l~~~~~~~V~~~d~iv~CP~CgRIL 231 (239)
T COG1579 195 LEGRVCGGCHMKLPSQTLSKVRKKDEIVFCPYCGRIL 231 (239)
T ss_pred ecCCcccCCeeeecHHHHHHHhcCCCCccCCccchHH
Confidence 4456899998743 35678999999875
No 241
>PF13446 RPT: A repeated domain in UCH-protein
Probab=29.92 E-value=1.8e+02 Score=19.72 Aligned_cols=26 Identities=8% Similarity=0.229 Sum_probs=21.7
Q ss_pred ChhhhcCCcccCCCCHHHHHHHHHHHHH
Q psy8325 68 NYYDLFDLKLTYLINNVDLSKKFKQLQS 95 (237)
Q Consensus 68 nyf~llgl~~~f~id~~~Lk~~Yr~Lq~ 95 (237)
+-|+.||++++ ++...|-..|+....
T Consensus 6 ~Ay~~Lgi~~~--~~Dd~Ii~~f~~~~~ 31 (62)
T PF13446_consen 6 EAYEILGIDED--TDDDFIISAFQSKVN 31 (62)
T ss_pred HHHHHhCcCCC--CCHHHHHHHHHHHHH
Confidence 46899999876 578899999998777
No 242
>PRK09401 reverse gyrase; Reviewed
Probab=29.82 E-value=24 Score=38.38 Aligned_cols=22 Identities=32% Similarity=0.629 Sum_probs=17.2
Q ss_pred CCCCcCCCCCC----CCCccCcCCCc
Q psy8325 38 LKCWKCLKNLS----GKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~----~~~~fC~~C~~ 59 (237)
..||||++++. .....|+.|--
T Consensus 8 ~~cpnc~g~i~~~rl~~g~~c~~cl~ 33 (1176)
T PRK09401 8 NSCPNCGGDISDERLEKGLPCEKCLP 33 (1176)
T ss_pred ccCCCCCCcCcHhHHhcCCcChhhCC
Confidence 37999999883 45688999943
No 243
>KOG1598|consensus
Probab=29.82 E-value=28 Score=34.43 Aligned_cols=24 Identities=29% Similarity=0.674 Sum_probs=19.7
Q ss_pred CCCcCCCCC-----CCCCccCcCCCcccC
Q psy8325 39 KCWKCLKNL-----SGKNLFCQYCSSVQK 62 (237)
Q Consensus 39 ~Cw~C~~~~-----~~~~~fC~~C~~~qp 62 (237)
.|.+|++.. ......|..||.|+-
T Consensus 2 ~C~~C~~s~fe~d~a~g~~~C~~CG~v~E 30 (521)
T KOG1598|consen 2 VCKNCGGSNFERDEATGNLYCTACGTVLE 30 (521)
T ss_pred cCCCCCCCCcccccccCCceeccccceee
Confidence 599999864 477899999999875
No 244
>TIGR01054 rgy reverse gyrase. Generally, these gyrases are encoded as a single polypeptide. An exception was found in Methanopyrus kandleri, where enzyme is split within the topoisomerase domain, yielding a heterodimer of gene products designated RgyB and RgyA.
Probab=29.43 E-value=27 Score=38.02 Aligned_cols=21 Identities=29% Similarity=0.504 Sum_probs=17.0
Q ss_pred CCCCcCCCCC----CCCCccCcCCC
Q psy8325 38 LKCWKCLKNL----SGKNLFCQYCS 58 (237)
Q Consensus 38 ~~Cw~C~~~~----~~~~~fC~~C~ 58 (237)
..||||++++ ......|..|-
T Consensus 8 ~~CPnCgg~i~~~rl~~~~~c~~Cl 32 (1171)
T TIGR01054 8 NLCPNCGGEISSERLEKGLPCARCL 32 (1171)
T ss_pred CCCCCCCCccchhHhhcCCCccccC
Confidence 4899999987 25678899994
No 245
>PRK08665 ribonucleotide-diphosphate reductase subunit alpha; Validated
Probab=29.43 E-value=33 Score=35.51 Aligned_cols=22 Identities=23% Similarity=0.463 Sum_probs=18.3
Q ss_pred CCCCcCCCCC--CCCCccCcCCCc
Q psy8325 38 LKCWKCLKNL--SGKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~--~~~~~fC~~C~~ 59 (237)
..||.|+.++ ......|.+||.
T Consensus 725 ~~Cp~Cg~~l~~~~GC~~C~~CG~ 748 (752)
T PRK08665 725 GACPECGSILEHEEGCVVCHSCGY 748 (752)
T ss_pred CCCCCCCcccEECCCCCcCCCCCC
Confidence 4699999876 478899999986
No 246
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=29.31 E-value=32 Score=27.08 Aligned_cols=28 Identities=29% Similarity=0.555 Sum_probs=22.3
Q ss_pred CCCCcCCCCC------CCCCccCcCCCcccCCCC
Q psy8325 38 LKCWKCLKNL------SGKNLFCQYCSSVQKPDS 65 (237)
Q Consensus 38 ~~Cw~C~~~~------~~~~~fC~~C~~~qp~~~ 65 (237)
..|+.|+..+ ....+.|++|+...+...
T Consensus 3 ~FCp~Cgsll~p~~~~~~~~l~C~kCgye~~~~~ 36 (113)
T COG1594 3 RFCPKCGSLLYPKKDDEGGKLVCRKCGYEEEASN 36 (113)
T ss_pred cccCCccCeeEEeEcCCCcEEECCCCCcchhccc
Confidence 4799999854 256899999999888764
No 247
>TIGR00311 aIF-2beta translation initiation factor aIF-2, beta subunit, putative.
Probab=29.02 E-value=32 Score=27.92 Aligned_cols=29 Identities=17% Similarity=0.320 Sum_probs=21.4
Q ss_pred CCCCCCcCCCCC-------CCCCccCcCCCcccCCC
Q psy8325 36 TELKCWKCLKNL-------SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 36 ~~~~Cw~C~~~~-------~~~~~fC~~C~~~qp~~ 64 (237)
....|+.|+.|= ....+.|..||+..|+.
T Consensus 96 ~yVlC~~C~sPdT~l~k~~r~~~l~C~ACGa~~~v~ 131 (133)
T TIGR00311 96 KYVICRECNRPDTRIIKEGRVSLLKCEACGAKAPLR 131 (133)
T ss_pred heEECCCCCCCCcEEEEeCCeEEEecccCCCCCccC
Confidence 356899999972 12245799999988874
No 248
>PF12644 DUF3782: Protein of unknown function (DUF3782); InterPro: IPR024271 This functionally uncharacterised family of proteins is found in bacteria and archaea. Proteins in this family are typically between 91 and 186 amino acids in length.
Probab=28.83 E-value=1.8e+02 Score=19.82 Aligned_cols=28 Identities=39% Similarity=0.570 Sum_probs=17.1
Q ss_pred HHHHHHHHHhcc---CCHHHHHHHHHHHHHH
Q psy8325 159 EILMLNEELDEA---SSEEDLEKLQTLIQAT 186 (237)
Q Consensus 159 eimE~rE~leea---~~~~~l~~l~~~~~~~ 186 (237)
++|.+.|++++. -+.++...+..-+...
T Consensus 9 ~i~a~~e~l~~~~~~lt~e~~~~l~~~~~al 39 (64)
T PF12644_consen 9 EIMATKEELEELEERLTKEDKKRLEEYIDAL 39 (64)
T ss_pred HHHHHHHHHHHHHhhcCHHHHHHHHHHHHHH
Confidence 677777777776 5555555555444433
No 249
>PF01396 zf-C4_Topoisom: Topoisomerase DNA binding C4 zinc finger; InterPro: IPR013498 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. This entry represents the zinc-finger domain found in type IA topoisomerases, including bacterial and archaeal topoisomerase I and III enzymes, and in eukaryotic topoisomerase III enzymes. Escherichia coli topoisomerase I proteins contain five copies of a zinc-ribbon-like domain at their C terminus, two of which have lost their cysteine residues and are therefore probably not able to bind zinc []. This domain is still considered to be a member of the zinc-ribbon superfamily despite not being able to bind zinc. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change, 0005694 chromosome
Probab=28.80 E-value=42 Score=21.24 Aligned_cols=26 Identities=38% Similarity=0.586 Sum_probs=15.7
Q ss_pred CCCcCCCCC------CCCCccCcC---CCcccCCC
Q psy8325 39 KCWKCLKNL------SGKNLFCQY---CSSVQKPD 64 (237)
Q Consensus 39 ~Cw~C~~~~------~~~~~fC~~---C~~~qp~~ 64 (237)
.|+.|++.+ ...-+.|++ |..+.+..
T Consensus 3 ~CP~Cg~~lv~r~~k~g~F~~Cs~yP~C~~~~~~~ 37 (39)
T PF01396_consen 3 KCPKCGGPLVLRRGKKGKFLGCSNYPECKYTEPLP 37 (39)
T ss_pred CCCCCCceeEEEECCCCCEEECCCCCCcCCeEeCC
Confidence 689998865 123345654 76665543
No 250
>PF03833 PolC_DP2: DNA polymerase II large subunit DP2; InterPro: IPR016033 DP2 is the large subunit of a two-subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit. This entry represents the N-terminal ~950 residue component of DP2.; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 3O59_X.
Probab=28.70 E-value=19 Score=37.73 Aligned_cols=20 Identities=20% Similarity=0.544 Sum_probs=0.0
Q ss_pred CCCcCCCCCCCCCccCcCCCc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
.||.|+..+ ...++|+.|+.
T Consensus 669 ~Cp~CG~~T-~~~~~Cp~C~~ 688 (900)
T PF03833_consen 669 RCPECGSHT-EPVYVCPDCGI 688 (900)
T ss_dssp ---------------------
T ss_pred cCcccCCcc-ccceecccccc
Confidence 455555432 23445555544
No 251
>PF03461 TRCF: TRCF domain; InterPro: IPR005118 This domain is found in proteins necessary for strand-specific repair in DNA such as TRCF in Escherichia coli. A lesion in the template strand blocks the RNA polymerase complex (RNAP). The RNAP-DNA-RNA complex is specifically recognised by the transcription-repair-coupling factor (TRCF) which releases RNAP and the truncated transcript.; GO: 0003684 damaged DNA binding, 0004386 helicase activity, 0005524 ATP binding, 0006281 DNA repair; PDB: 2QSR_A 2EYQ_A.
Probab=28.65 E-value=1.2e+02 Score=23.05 Aligned_cols=39 Identities=33% Similarity=0.377 Sum_probs=25.6
Q ss_pred HHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHH
Q psy8325 158 MEILMLNEELDEASSEEDLEKLQTLIQATIDDLTKNVNA 196 (237)
Q Consensus 158 meimE~rE~leea~~~~~l~~l~~~~~~~i~~~~~~l~~ 196 (237)
.+=|++.-+|..+.+.+++.++..++..+.-..=.++..
T Consensus 17 ~~Rl~~Yrrl~~~~~~~el~~l~~El~DRFG~~P~ev~~ 55 (101)
T PF03461_consen 17 DERLELYRRLASAESEEELEDLREELIDRFGPLPEEVEN 55 (101)
T ss_dssp HHHHHHHHHHHC--SHHHHHHHHHHHHHHH-S--HHHHH
T ss_pred HHHHHHHHHHhhCCCHHHHHHHHHHHHHHcCCCcHHHHH
Confidence 467889999999999999888888777665444444433
No 252
>PF03993 DUF349: Domain of Unknown Function (DUF349); InterPro: IPR007139 This motif is found singly or as up to five tandem repeats in a small set of bacterial proteins. There are two or three alpha-helices, and possibly a beta-strand.
Probab=28.54 E-value=2e+02 Score=19.89 Aligned_cols=41 Identities=15% Similarity=0.217 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHH
Q psy8325 174 EDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIRMK 214 (237)
Q Consensus 174 ~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rlk 214 (237)
+++.....++-.+..+++.++.......||..+...+..|+
T Consensus 23 ~~~~~~~~~n~~~K~~Li~~~~~l~~~~d~~~~~~~~k~l~ 63 (77)
T PF03993_consen 23 EEQDAEREENLEKKEALIEEAEALAESEDWKEAAEEIKELQ 63 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHH
Confidence 34555566677777788888877777778877776665553
No 253
>COG1571 Predicted DNA-binding protein containing a Zn-ribbon domain [General function prediction only]
Probab=28.51 E-value=39 Score=32.65 Aligned_cols=28 Identities=14% Similarity=0.460 Sum_probs=21.5
Q ss_pred CCCCCCcCCCCCC---CCCccCcCCCcccCC
Q psy8325 36 TELKCWKCLKNLS---GKNLFCQYCSSVQKP 63 (237)
Q Consensus 36 ~~~~Cw~C~~~~~---~~~~fC~~C~~~qp~ 63 (237)
..+.|+.|++.+. ..-|-|+.|+.--|-
T Consensus 349 ~~p~Cp~Cg~~m~S~G~~g~rC~kCg~~~~~ 379 (421)
T COG1571 349 VNPVCPRCGGRMKSAGRNGFRCKKCGTRARE 379 (421)
T ss_pred cCCCCCccCCchhhcCCCCcccccccccCCc
Confidence 3568999999872 238999999986654
No 254
>COG2816 NPY1 NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding [DNA replication, recombination, and repair]
Probab=27.93 E-value=36 Score=31.15 Aligned_cols=28 Identities=21% Similarity=0.464 Sum_probs=21.5
Q ss_pred CCCCCCcCCCCC----CCCCccCcCCCcccCC
Q psy8325 36 TELKCWKCLKNL----SGKNLFCQYCSSVQKP 63 (237)
Q Consensus 36 ~~~~Cw~C~~~~----~~~~~fC~~C~~~qp~ 63 (237)
....|..|+.++ ......|++|+..+=|
T Consensus 110 ~~RFCg~CG~~~~~~~~g~~~~C~~cg~~~fP 141 (279)
T COG2816 110 SHRFCGRCGTKTYPREGGWARVCPKCGHEHFP 141 (279)
T ss_pred hCcCCCCCCCcCccccCceeeeCCCCCCccCC
Confidence 355899999976 3677899999986544
No 255
>PF10122 Mu-like_Com: Mu-like prophage protein Com; InterPro: IPR019294 Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ].
Probab=27.92 E-value=17 Score=24.81 Aligned_cols=26 Identities=23% Similarity=0.643 Sum_probs=18.4
Q ss_pred CCCCcCCCCCC------CCCccCcCCCcccCC
Q psy8325 38 LKCWKCLKNLS------GKNLFCQYCSSVQKP 63 (237)
Q Consensus 38 ~~Cw~C~~~~~------~~~~fC~~C~~~qp~ 63 (237)
..|.+|++.+. .-..-||.|+.+.-+
T Consensus 5 iRC~~CnklLa~~g~~~~leIKCpRC~tiN~~ 36 (51)
T PF10122_consen 5 IRCGHCNKLLAKAGEVIELEIKCPRCKTINHV 36 (51)
T ss_pred eeccchhHHHhhhcCccEEEEECCCCCccceE
Confidence 47889988652 246779999886644
No 256
>PRK08579 anaerobic ribonucleoside triphosphate reductase; Provisional
Probab=27.62 E-value=37 Score=34.41 Aligned_cols=23 Identities=30% Similarity=0.559 Sum_probs=18.4
Q ss_pred CCCCCcCCCCCCCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
-..|..|+.........||.|+.
T Consensus 568 ~~~C~~CG~~~~g~~~~CP~CGs 590 (625)
T PRK08579 568 ITVCNKCGRSTTGLYTRCPRCGS 590 (625)
T ss_pred CccCCCCCCccCCCCCcCcCCCC
Confidence 44899999855556789999986
No 257
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=27.31 E-value=24 Score=29.83 Aligned_cols=11 Identities=27% Similarity=0.507 Sum_probs=8.8
Q ss_pred CCCCCcCCCCC
Q psy8325 37 ELKCWKCLKNL 47 (237)
Q Consensus 37 ~~~Cw~C~~~~ 47 (237)
...|+-|++++
T Consensus 97 ~~RCp~CN~~L 107 (165)
T COG1656 97 FSRCPECNGEL 107 (165)
T ss_pred cccCcccCCEe
Confidence 45899999975
No 258
>PF07709 SRR: Seven Residue Repeat; InterPro: IPR011714 This repeat is found in some Plasmodium and Theileria proteins.
Probab=27.23 E-value=38 Score=16.73 Aligned_cols=12 Identities=25% Similarity=0.318 Sum_probs=8.6
Q ss_pred HHHHHHHHHHhc
Q psy8325 117 SSYLNKAYSILQ 128 (237)
Q Consensus 117 s~~In~AY~tL~ 128 (237)
+..|..||+.|+
T Consensus 3 ~~~V~~aY~~l~ 14 (14)
T PF07709_consen 3 FEKVKNAYEQLS 14 (14)
T ss_pred HHHHHHHHHhcC
Confidence 566788888764
No 259
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=27.06 E-value=34 Score=27.23 Aligned_cols=23 Identities=17% Similarity=0.479 Sum_probs=15.0
Q ss_pred CCCCcCCCCCCC---------CCccCcCCCccc
Q psy8325 38 LKCWKCLKNLSG---------KNLFCQYCSSVQ 61 (237)
Q Consensus 38 ~~Cw~C~~~~~~---------~~~fC~~C~~~q 61 (237)
..| +|+..... ..+.||.|+...
T Consensus 71 ~~C-~Cg~~~~~~~~~~~~~~~~~~CP~Cgs~~ 102 (124)
T PRK00762 71 IEC-ECGYEGVVDEDEIDHYAAVIECPVCGNKR 102 (124)
T ss_pred EEe-eCcCcccccccchhccccCCcCcCCCCCC
Confidence 379 99976321 124699998643
No 260
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=26.97 E-value=42 Score=21.47 Aligned_cols=21 Identities=29% Similarity=0.703 Sum_probs=12.0
Q ss_pred CCCcCCCCC------C-CCCccCcCCCc
Q psy8325 39 KCWKCLKNL------S-GKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~------~-~~~~fC~~C~~ 59 (237)
.|.+|+... . .....||.|+.
T Consensus 7 ~C~~Cg~~fe~~~~~~~~~~~~CP~Cg~ 34 (42)
T PF09723_consen 7 RCEECGHEFEVLQSISEDDPVPCPECGS 34 (42)
T ss_pred EeCCCCCEEEEEEEcCCCCCCcCCCCCC
Confidence 466676521 1 34667777765
No 261
>PF03119 DNA_ligase_ZBD: NAD-dependent DNA ligase C4 zinc finger domain; InterPro: IPR004149 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the zinc finger domain found in NAD-dependent DNA ligases. DNA ligases catalyse the crucial step of joining the breaks in duplex DNA during DNA replication, repair and recombination, utilizing either ATP or NAD(+) as a cofactor []. This domain is a small zinc binding motif that is presumably DNA binding. It is found only in NAD-dependent DNA ligases. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003911 DNA ligase (NAD+) activity, 0006260 DNA replication, 0006281 DNA repair; PDB: 1DGS_A 1V9P_B 2OWO_A.
Probab=26.38 E-value=26 Score=20.69 Aligned_cols=9 Identities=33% Similarity=0.626 Sum_probs=3.9
Q ss_pred CCCcCCCCC
Q psy8325 39 KCWKCLKNL 47 (237)
Q Consensus 39 ~Cw~C~~~~ 47 (237)
.||.|+.++
T Consensus 1 ~CP~C~s~l 9 (28)
T PF03119_consen 1 TCPVCGSKL 9 (28)
T ss_dssp B-TTT--BE
T ss_pred CcCCCCCEe
Confidence 377777765
No 262
>COG0777 AccD Acetyl-CoA carboxylase beta subunit [Lipid metabolism]
Probab=26.37 E-value=30 Score=31.68 Aligned_cols=28 Identities=21% Similarity=0.349 Sum_probs=22.3
Q ss_pred CCCCCcCCCCC-----CCCCccCcCCCcccCCC
Q psy8325 37 ELKCWKCLKNL-----SGKNLFCQYCSSVQKPD 64 (237)
Q Consensus 37 ~~~Cw~C~~~~-----~~~~~fC~~C~~~qp~~ 64 (237)
-..|++|+..+ ..+...||+|++--+..
T Consensus 28 w~KCp~c~~~~y~~eL~~n~~vcp~c~~h~ri~ 60 (294)
T COG0777 28 WTKCPSCGEMLYRKELESNLKVCPKCGHHMRIS 60 (294)
T ss_pred eeECCCccceeeHHHHHhhhhcccccCcccccC
Confidence 34899999865 46789999999877664
No 263
>KOG2577|consensus
Probab=26.26 E-value=1.4e+02 Score=28.24 Aligned_cols=100 Identities=25% Similarity=0.288 Sum_probs=63.9
Q ss_pred cCCCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcCcchhhHHHHh--hcCCCcccccc-----
Q psy8325 78 TYLINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQNPLERGLYLLS--LQNISIEEDTK----- 150 (237)
Q Consensus 78 ~f~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~dp~~Ra~Yll~--l~g~~~~e~~~----- 150 (237)
.++-+..-|.++|-.|.+. -||.. --+|+|-++|.=- +|..|++. |.|+.+-+..+
T Consensus 68 R~d~SLglLTkKFv~Llq~-s~dGv---------------ldLn~aA~~L~Vq-KRRIYDITNVLEGI~LIeKksKN~Iq 130 (354)
T KOG2577|consen 68 RSDTSLGLLTKKFVDLLQE-SPDGV---------------LDLNKAAEVLNVQ-KRRIYDITNVLEGIGLIEKKSKNNIQ 130 (354)
T ss_pred cchhhHHHHHHHHHHHHHh-CCCce---------------eeHHHHHHHhccc-cceeeehhhhhhcccceeecccccee
Confidence 3455677899999988654 45554 3389999999866 99999985 77888755411
Q ss_pred ----CCC--HHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Q psy8325 151 ----GTD--QKLLMEILMLNEELDEASSEEDLEKLQTLIQATIDDLTKNVNASFEQ 200 (237)
Q Consensus 151 ----~~d--~efLmeimE~rE~leea~~~~~l~~l~~~~~~~i~~~~~~l~~~f~~ 200 (237)
..+ .+...++=.++.+++ +|....+++++.|..|...|...++.
T Consensus 131 W~G~~~~~~~~~~e~~~~l~~e~~------~L~~~E~~LD~~i~~~q~~L~~lted 180 (354)
T KOG2577|consen 131 WIGGDFNSTGGVPERLNGLEAEVE------DLSQEEDDLDQLIRDCQQNLRLLTED 180 (354)
T ss_pred eecCCCccccccHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 111 111122222334333 45555667888888888888777754
No 264
>PF07361 Cytochrom_B562: Cytochrome b562; InterPro: IPR009155 Cytochrome b562 is a haem-containing protein that is expressed in the periplasm of Escherichia coli. In b-type cytochromes, the haem atom is not covalently attached to the polypeptide. Cytochrome b562 has a four-helical bundle structure that is structurally similar to that found in members of the cytochrome c family (IPR002321 from INTERPRO). Cytochrome b562 has a reduction potential of 167 mV, which sets the energy yield possible in metabolism and is also a key determinant of the rate at which redox reactions proceed [].; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0042597 periplasmic space; PDB: 4ER9_A 3IQ6_G 2QLA_B 3FOO_A 3M79_C 256B_A 3NMI_F 3HNK_A 3NMK_D 2BC5_A ....
Probab=26.26 E-value=3e+02 Score=21.04 Aligned_cols=25 Identities=20% Similarity=0.271 Sum_probs=19.0
Q ss_pred HHHHHHhccccHHHHHHHHHHHHhH
Q psy8325 192 KNVNASFEQKDFNQAKDLLIRMKYF 216 (237)
Q Consensus 192 ~~l~~~f~~~d~~~A~~~l~rlkY~ 216 (237)
..+..+.+.|++++|+..+.++.=+
T Consensus 70 d~a~~~~~~G~l~~AK~~l~~l~~l 94 (103)
T PF07361_consen 70 DKAEALAEAGKLDEAKAALKKLDDL 94 (103)
T ss_dssp HHHHHHHHTTHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 3455667789999999998887533
No 265
>PF14276 DUF4363: Domain of unknown function (DUF4363)
Probab=25.82 E-value=1.2e+02 Score=23.45 Aligned_cols=22 Identities=14% Similarity=0.264 Sum_probs=10.2
Q ss_pred HHHHHHHhccccHHHHHHHHHH
Q psy8325 191 TKNVNASFEQKDFNQAKDLLIR 212 (237)
Q Consensus 191 ~~~l~~~f~~~d~~~A~~~l~r 212 (237)
...+.+...++||+.|.+.+.+
T Consensus 32 l~~i~~~i~~~dW~~A~~~~~~ 53 (121)
T PF14276_consen 32 LEQIEEAIENEDWEKAYKETEE 53 (121)
T ss_pred HHHHHHHHHhCCHHHHHHHHHH
Confidence 3334444444555555544433
No 266
>PF14369 zf-RING_3: zinc-finger
Probab=25.73 E-value=50 Score=20.51 Aligned_cols=20 Identities=20% Similarity=0.750 Sum_probs=10.2
Q ss_pred CCcCCCCCC-----CCCccCcCCCc
Q psy8325 40 CWKCLKNLS-----GKNLFCQYCSS 59 (237)
Q Consensus 40 Cw~C~~~~~-----~~~~fC~~C~~ 59 (237)
|-.|...+. ....+||.|+.
T Consensus 5 Ch~C~~~V~~~~~~~~~~~CP~C~~ 29 (35)
T PF14369_consen 5 CHQCNRFVRIAPSPDSDVACPRCHG 29 (35)
T ss_pred CccCCCEeEeCcCCCCCcCCcCCCC
Confidence 556665431 22334777753
No 267
>PRK06386 replication factor A; Reviewed
Probab=25.63 E-value=41 Score=31.83 Aligned_cols=20 Identities=25% Similarity=0.433 Sum_probs=16.9
Q ss_pred CCCCcCCCCCCCCCccCcCCCc
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
..||.|++.+.. -+|+.|+.
T Consensus 237 ~rCP~C~R~l~~--g~C~~HG~ 256 (358)
T PRK06386 237 TKCSVCNKIIED--GVCKDHPD 256 (358)
T ss_pred ecCcCCCeEccC--CcCCCCCC
Confidence 489999998863 49999997
No 268
>TIGR03826 YvyF flagellar operon protein TIGR03826. This gene is found in flagellar operons of Bacillus-related organisms. Its function has not been determined and an official gene symbol has not been assigned, although the gene is designated yvyF in B. subtilus. A tentative assignment as a regulator is suggested in the NCBI record GI:16080597.
Probab=25.56 E-value=46 Score=27.21 Aligned_cols=24 Identities=17% Similarity=0.563 Sum_probs=19.2
Q ss_pred cCCCCCCcCCCCCCCCCccCcCCCc
Q psy8325 35 HTELKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 35 ~~~~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
.....|-+||.++... -+|++|..
T Consensus 79 nl~~~CE~CG~~I~~G-r~C~~C~~ 102 (137)
T TIGR03826 79 NLGYPCERCGTSIREG-RLCDSCAG 102 (137)
T ss_pred CCcCcccccCCcCCCC-CccHHHHH
Confidence 4566999999998754 89999954
No 269
>TIGR03655 anti_R_Lar restriction alleviation protein, Lar family. Restriction alleviation proteins provide a countermeasure to host cell restriction enzyme defense against foreign DNA such as phage or plasmids. This family consists of homologs to the phage antirestriction protein Lar, and most members belong to phage genomes or prophage regions of bacterial genomes.
Probab=25.54 E-value=47 Score=22.22 Aligned_cols=25 Identities=16% Similarity=0.196 Sum_probs=16.0
Q ss_pred CCCcCCCCCC----------C-CCccCcCCCcccCC
Q psy8325 39 KCWKCLKNLS----------G-KNLFCQYCSSVQKP 63 (237)
Q Consensus 39 ~Cw~C~~~~~----------~-~~~fC~~C~~~qp~ 63 (237)
.||.|++.-. . .-+.|+.|+...|.
T Consensus 3 PCPfCGg~~~~~~~~~~~~~~~~~~~C~~Cga~~~~ 38 (53)
T TIGR03655 3 PCPFCGGADVYLRRGFDPLDLSHYFECSTCGASGPV 38 (53)
T ss_pred CCCCCCCcceeeEeccCCCCCEEEEECCCCCCCccc
Confidence 6888877321 1 12359999987765
No 270
>PF03367 zf-ZPR1: ZPR1 zinc-finger domain; InterPro: IPR004457 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents ZPR1-type zinc finger domains. An orthologous protein found once in each of the completed archaeal genomes corresponds to a zinc finger-containing domain repeated as the N-terminal and C-terminal halves of the mouse protein ZPR1. ZPR1 is an experimentally proven zinc-binding protein that binds the tyrosine kinase domain of the epidermal growth factor receptor (EGFR); binding is inhibited by EGF stimulation and tyrosine phosphorylation, and activation by EGF is followed by some redistribution of ZPR1 to the nucleus. By analogy, other proteins with the ZPR1 zinc finger domain may be regulatory proteins that sense protein phosphorylation state and/or participate in signal transduction (see also IPR004470 from INTERPRO). Deficiencies in ZPR1 may contribute to neurodegenerative disorders. ZPR1 appears to be down-regulated in patients with spinal muscular atrophy (SMA), a disease characterised by degeneration of the alpha-motor neurons in the spinal cord that can arise from mutations affecting the expression of Survival Motor Neurons (SMN) []. ZPR1 interacts with complexes formed by SMN [], and may act as a modifier that effects the severity of SMA. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2QKD_A.
Probab=25.33 E-value=19 Score=30.03 Aligned_cols=21 Identities=24% Similarity=0.401 Sum_probs=12.6
Q ss_pred CCCcCCCCC---------------CCCCccCcCCCc
Q psy8325 39 KCWKCLKNL---------------SGKNLFCQYCSS 59 (237)
Q Consensus 39 ~Cw~C~~~~---------------~~~~~fC~~C~~ 59 (237)
.|++|+.+. .-..|.|+.||.
T Consensus 3 ~Cp~C~~~~~~~~~~~~IP~F~evii~sf~C~~CGy 38 (161)
T PF03367_consen 3 LCPNCGENGTTRILLTDIPYFKEVIIMSFECEHCGY 38 (161)
T ss_dssp E-TTTSSCCEEEEEEEEETTTEEEEEEEEE-TTT--
T ss_pred cCCCCCCCcEEEEEEEcCCCCceEEEEEeECCCCCC
Confidence 699999862 245689999986
No 271
>PF06221 zf-C2HC5: Putative zinc finger motif, C2HC5-type; InterPro: IPR009349 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This zinc finger appears to be common in activating signal cointegrator 1/thyroid receptor interacting protein 4. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=25.28 E-value=61 Score=22.59 Aligned_cols=23 Identities=22% Similarity=0.541 Sum_probs=16.8
Q ss_pred CCCCCcCCCCC--CCC-CccCcCCCc
Q psy8325 37 ELKCWKCLKNL--SGK-NLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~--~~~-~~fC~~C~~ 59 (237)
.++|-+||+-+ ... .-.|+.|+.
T Consensus 18 ~~NCl~CGkIiC~~Eg~~~pC~fCg~ 43 (57)
T PF06221_consen 18 APNCLNCGKIICEQEGPLGPCPFCGT 43 (57)
T ss_pred cccccccChhhcccccCcCcCCCCCC
Confidence 56899999976 233 577888874
No 272
>PF08312 cwf21: cwf21 domain; InterPro: IPR013170 The cwf21 domain is found in proteins involved in mRNA splicing. Proteins containing this domain have been isolated as a subcomplex of the splicosome in Schizosaccharomyces pombe (Fission yeast) []. In yeast, this domain binds the protein Prp8p [], a large and highly conserved U5 snRNP protein which has been proposed as a protein cofactor at the spliceosomal catalytic centre []. The cwf21 domain is found in, amongst others, the small Cwc21p protein in yeast as well as in the much larger human ortholog SRm300 (serine/arginine repetitive matrix protein). ; PDB: 2E62_A.
Probab=25.15 E-value=1.8e+02 Score=19.29 Aligned_cols=14 Identities=36% Similarity=0.620 Sum_probs=11.5
Q ss_pred HHHHHHHHHHhccC
Q psy8325 158 MEILMLNEELDEAS 171 (237)
Q Consensus 158 meimE~rE~leea~ 171 (237)
..+|++++++++-.
T Consensus 11 lk~~elrd~LEe~g 24 (46)
T PF08312_consen 11 LKCLELRDELEEQG 24 (46)
T ss_dssp HHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHhCC
Confidence 46899999999854
No 273
>PRK08271 anaerobic ribonucleoside triphosphate reductase; Provisional
Probab=25.13 E-value=45 Score=33.83 Aligned_cols=23 Identities=22% Similarity=0.359 Sum_probs=18.2
Q ss_pred CCCCCcCCCCCCCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
-..|..|+.........||.|+.
T Consensus 566 ~~iC~~CG~~~~g~~~~CP~CGs 588 (623)
T PRK08271 566 ITICNDCHHIDKRTGKRCPICGS 588 (623)
T ss_pred CccCCCCCCcCCCCCcCCcCCCC
Confidence 44899999864456799999986
No 274
>PF02591 DUF164: Putative zinc ribbon domain; InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=24.92 E-value=41 Score=22.71 Aligned_cols=24 Identities=25% Similarity=0.660 Sum_probs=16.6
Q ss_pred CCCCCcCCCCC----------CCCCccCcCCCcc
Q psy8325 37 ELKCWKCLKNL----------SGKNLFCQYCSSV 60 (237)
Q Consensus 37 ~~~Cw~C~~~~----------~~~~~fC~~C~~~ 60 (237)
...|-.|+-.+ ...-.+||+|+.+
T Consensus 22 ~~~C~gC~~~l~~~~~~~i~~~~~i~~Cp~CgRi 55 (56)
T PF02591_consen 22 GGTCSGCHMELPPQELNEIRKGDEIVFCPNCGRI 55 (56)
T ss_pred CCccCCCCEEcCHHHHHHHHcCCCeEECcCCCcc
Confidence 34788887754 2356899999865
No 275
>KOG0687|consensus
Probab=24.38 E-value=5.6e+02 Score=24.43 Aligned_cols=16 Identities=38% Similarity=0.306 Sum_probs=10.6
Q ss_pred cCCCHHHHHHHHHHHH
Q psy8325 150 KGTDQKLLMEILMLNE 165 (237)
Q Consensus 150 ~~~d~efLmeimE~rE 165 (237)
...|.++|..+...||
T Consensus 66 i~~D~~~l~~m~~~ne 81 (393)
T KOG0687|consen 66 IKLDQDLLNSMKKANE 81 (393)
T ss_pred eeccHHHHHHHHHhhH
Confidence 4567777777766654
No 276
>PF13838 Clathrin_H_link: Clathrin-H-link; PDB: 2XZG_A 3GD1_I 1BPO_C 1C9I_B 1C9L_A.
Probab=24.25 E-value=1.2e+02 Score=21.67 Aligned_cols=25 Identities=20% Similarity=0.331 Sum_probs=18.9
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHH
Q psy8325 188 DDLTKNVNASFEQKDFNQAKDLLIR 212 (237)
Q Consensus 188 ~~~~~~l~~~f~~~d~~~A~~~l~r 212 (237)
+....++.+.|..|+|++|++....
T Consensus 7 ~l~~~~F~~l~~~g~y~eAA~~AA~ 31 (66)
T PF13838_consen 7 DLYVQQFNELFSQGQYEEAAKVAAN 31 (66)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHh
Confidence 3456788999999999999876653
No 277
>KOG0687|consensus
Probab=24.25 E-value=2.7e+02 Score=26.47 Aligned_cols=49 Identities=12% Similarity=0.304 Sum_probs=28.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccHHHHHHHHHHHHhHHHHHH
Q psy8325 173 EEDLEKLQTLIQATIDDLTKNVNASFEQ-KDFNQAKDLLIRMKYFTTLVS 221 (237)
Q Consensus 173 ~~~l~~l~~~~~~~i~~~~~~l~~~f~~-~d~~~A~~~l~rlkY~~~l~~ 221 (237)
..-+..+.+.++++++++.+.++.+=+. |..+.....+.+--||-+|-+
T Consensus 70 ~~~l~~m~~~neeki~eld~~iedaeenlGE~ev~ea~~~kaeYycqigD 119 (393)
T KOG0687|consen 70 QDLLNSMKKANEEKIKELDEKIEDAEENLGESEVREAMLRKAEYYCQIGD 119 (393)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHhcc
Confidence 3445666666666666666666655543 555555555666666666544
No 278
>TIGR01666 YCCS hypothetical membrane protein, TIGR01666. This model represents a clade of sequences from gamma and beta proteobacteria. These proteins are 700 amino acids long and many have been annotated as putative membrane proteins. The gene from Salmonella has been annotated as a putative efflux transporter. The gene from E. coli has the name yccS.
Probab=24.06 E-value=8.1e+02 Score=25.31 Aligned_cols=47 Identities=21% Similarity=0.306 Sum_probs=31.6
Q ss_pred CHHHHHHHHHHHHH-------HhCCCCCCCCCHHHHHHH---HHHHHHHHHHHHHhc
Q psy8325 82 NNVDLSKKFKQLQS-------QLHPDKFSNKNQEEQAIS---ETYSSYLNKAYSILQ 128 (237)
Q Consensus 82 d~~~Lk~~Yr~Lq~-------~~HPDk~~~~~~~e~~~a---~~~s~~In~AY~tL~ 128 (237)
..+.|-+.|+.|+. .|+||............+ ......+|+|+++|.
T Consensus 159 ~q~~LA~~y~~La~yL~aka~lf~p~~~~~~~~~~~~~a~~n~~lv~~ln~ar~~Ll 215 (704)
T TIGR01666 159 VQENLAKAFCQLAEYLETKSCFFDPDEVAEIQKKHLNFAMKNANVVTALNQVKTALL 215 (704)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhCCCCccchhhhHHHHHHHHhHHHHHHHHHHHHHHH
Confidence 47789999998765 578988655433333333 345677888888883
No 279
>COG2260 Predicted Zn-ribbon RNA-binding protein [Translation, ribosomal structure and biogenesis]
Probab=23.99 E-value=1.2e+02 Score=21.42 Aligned_cols=24 Identities=29% Similarity=0.425 Sum_probs=16.7
Q ss_pred CCCCCcCCCCCCCCCccCcCCCcccC
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSSVQK 62 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~~qp 62 (237)
-..|.+|+.-+. .-.||.||...-
T Consensus 5 ~rkC~~cg~YTL--ke~Cp~CG~~t~ 28 (59)
T COG2260 5 IRKCPKCGRYTL--KEKCPVCGGDTK 28 (59)
T ss_pred hhcCcCCCceee--cccCCCCCCccc
Confidence 347999987543 267999987553
No 280
>PRK06342 transcription elongation factor regulatory protein; Validated
Probab=23.84 E-value=2.5e+02 Score=23.38 Aligned_cols=47 Identities=15% Similarity=0.273 Sum_probs=27.5
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHH----HHHHhHHHHHHH
Q psy8325 172 SEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLL----IRMKYFTTLVSK 222 (237)
Q Consensus 172 ~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l----~rlkY~~~l~~~ 222 (237)
+.+-+++|..++. .+...+..|-..||+.+|++.. .|++||.+..+.
T Consensus 32 T~~G~~~L~~El~----~L~~~i~~Ar~~GDlsEak~~~~~~e~rI~~L~~~L~~ 82 (160)
T PRK06342 32 TEAGLKALEDQLA----QARAAYEAAQAIEDVNERRRQMARPLRDLRYLAARRRT 82 (160)
T ss_pred CHHHHHHHHHHHH----HHHHHHHHHHHCCChhHHHHHHHHHHHHHHHHHHHHcc
Confidence 3444554444433 3446788888889988876443 455666555443
No 281
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=23.77 E-value=1.2e+02 Score=17.07 Aligned_cols=23 Identities=26% Similarity=0.157 Sum_probs=17.7
Q ss_pred HHHHHhccccHHHHHHHHHHHHh
Q psy8325 193 NVNASFEQKDFNQAKDLLIRMKY 215 (237)
Q Consensus 193 ~l~~~f~~~d~~~A~~~l~rlkY 215 (237)
-|......++++.|...+..|+-
T Consensus 7 ll~a~~~~g~~~~a~~~~~~M~~ 29 (34)
T PF13812_consen 7 LLRACAKAGDPDAALQLFDEMKE 29 (34)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHH
Confidence 34555678999999998888874
No 282
>TIGR00020 prfB peptide chain release factor 2. In many but not all taxa, there is a conserved real translational frameshift at a TGA codon. RF-2 helps terminate translation at TGA codons and can therefore regulate its own production by readthrough when RF-2 is insufficient. There is a Pfam model called "RF-1" for the superfamily of RF-1, RF-2, mitochondrial, RF-H, etc.
Probab=23.69 E-value=2.9e+02 Score=26.20 Aligned_cols=17 Identities=6% Similarity=0.219 Sum_probs=6.5
Q ss_pred HHHHHHHHHHhHHHHHH
Q psy8325 205 QAKDLLIRMKYFTTLVS 221 (237)
Q Consensus 205 ~A~~~l~rlkY~~~l~~ 221 (237)
.+.++..++..+..+.+
T Consensus 50 ~~~~~~ke~~~l~~~v~ 66 (364)
T TIGR00020 50 RAQAVIKERSSLEAVLD 66 (364)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33333333333333333
No 283
>cd02340 ZZ_NBR1_like Zinc finger, ZZ type. Zinc finger present in Drosophila ref(2)P, NBR1, Human sequestosome 1 and related proteins. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Drosophila ref(2)P appears to control the multiplication of sigma rhabdovirus. NBR1 (Next to BRCA1 gene 1 protein) interacts with fasciculation and elongation protein zeta-1 (FEZ1) and calcium and integrin binding protein (CIB), and may function in cell signalling pathways. Sequestosome 1 is a phosphotyrosine independent ligand for the Lck SH2 domain and binds noncovalently to ubiquitin via its UBA domain.
Probab=23.51 E-value=58 Score=21.06 Aligned_cols=20 Identities=25% Similarity=0.591 Sum_probs=15.3
Q ss_pred CCCcCCCCCCCCCccCcCCC
Q psy8325 39 KCWKCLKNLSGKNLFCQYCS 58 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~ 58 (237)
.|..|+.++...-+.|..|.
T Consensus 2 ~Cd~C~~~i~G~ry~C~~C~ 21 (43)
T cd02340 2 ICDGCQGPIVGVRYKCLVCP 21 (43)
T ss_pred CCCCCCCcCcCCeEECCCCC
Confidence 58888888776777787774
No 284
>PF11278 DUF3079: Protein of unknown function (DUF3079); InterPro: IPR021430 This family of proteins with unknown function appears to be restricted to Proteobacteria.
Probab=23.31 E-value=47 Score=22.56 Aligned_cols=29 Identities=34% Similarity=0.729 Sum_probs=19.3
Q ss_pred cCCCCCCcCCCCCCCCCccCcC-CCcccCC
Q psy8325 35 HTELKCWKCLKNLSGKNLFCQY-CSSVQKP 63 (237)
Q Consensus 35 ~~~~~Cw~C~~~~~~~~~fC~~-C~~~qp~ 63 (237)
+....||.|.+--+...+-|.+ +...|-|
T Consensus 11 hPERiCWGCD~YC~a~~l~CGNGs~RtqHP 40 (52)
T PF11278_consen 11 HPERICWGCDRYCPADSLACGNGSSRTQHP 40 (52)
T ss_pred CccceeeccccccChhhhcccCCcccccCc
Confidence 4556899999866667777874 3334433
No 285
>PRK00415 rps27e 30S ribosomal protein S27e; Reviewed
Probab=23.28 E-value=42 Score=23.61 Aligned_cols=25 Identities=20% Similarity=0.502 Sum_probs=18.8
Q ss_pred CCCCCcCCCCC-----CCCCccCcCCCccc
Q psy8325 37 ELKCWKCLKNL-----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~~-----~~~~~fC~~C~~~q 61 (237)
...|+.|+... +.....|..|+.+.
T Consensus 11 ~VkCp~C~n~q~vFsha~t~V~C~~Cg~~L 40 (59)
T PRK00415 11 KVKCPDCGNEQVVFSHASTVVRCLVCGKTL 40 (59)
T ss_pred EEECCCCCCeEEEEecCCcEEECcccCCCc
Confidence 34799998854 57778899998854
No 286
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=23.23 E-value=58 Score=20.66 Aligned_cols=20 Identities=25% Similarity=0.688 Sum_probs=12.8
Q ss_pred CCcCCCCCCCCCccCcCCCc
Q psy8325 40 CWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 40 Cw~C~~~~~~~~~fC~~C~~ 59 (237)
|+.|+....-..+.|..|+.
T Consensus 1 C~~C~~~~~l~~f~C~~C~~ 20 (39)
T smart00154 1 CHFCRKKVGLTGFKCRHCGN 20 (39)
T ss_pred CcccCCcccccCeECCccCC
Confidence 66777655433677777764
No 287
>COG1601 GCD7 Translation initiation factor 2, beta subunit (eIF-2beta)/eIF-5 N-terminal domain [Translation, ribosomal structure and biogenesis]
Probab=23.05 E-value=22 Score=29.55 Aligned_cols=33 Identities=21% Similarity=0.376 Sum_probs=25.0
Q ss_pred CCCCCcCCCCC-------CCCCccCcCCCcccCCCCCCCh
Q psy8325 37 ELKCWKCLKNL-------SGKNLFCQYCSSVQKPDSQSNY 69 (237)
Q Consensus 37 ~~~Cw~C~~~~-------~~~~~fC~~C~~~qp~~~~~ny 69 (237)
...|.-|+.+- ....+.|..||+++|+....-|
T Consensus 105 yv~C~~c~s~dt~l~~~~R~~~l~c~acGa~~pv~~~~~~ 144 (151)
T COG1601 105 YVKCKECGSPDTELIKEERLLFLKCEACGAIRPVKLMKEY 144 (151)
T ss_pred eeEeccCCCCchhhhhhhhhHhhHHHHhCCcccchhhhhc
Confidence 56899999863 2456789999999998754443
No 288
>COG0497 RecN ATPase involved in DNA repair [DNA replication, recombination, and repair]
Probab=23.03 E-value=5.8e+02 Score=25.76 Aligned_cols=83 Identities=20% Similarity=0.231 Sum_probs=43.2
Q ss_pred CCCHHHHHHHHHHHHHHhccC-----CHHHHHHHHHHHHHHHHHHHHH------HHHHhc--cccHHHHHHHH--HHHHh
Q psy8325 151 GTDQKLLMEILMLNEELDEAS-----SEEDLEKLQTLIQATIDDLTKN------VNASFE--QKDFNQAKDLL--IRMKY 215 (237)
Q Consensus 151 ~~d~efLmeimE~rE~leea~-----~~~~l~~l~~~~~~~i~~~~~~------l~~~f~--~~d~~~A~~~l--~rlkY 215 (237)
..||+-|.++-+.-..|-... +.+++-+...++.++++.+... |...+. ...|.+++..| .|-++
T Consensus 293 e~Dp~~L~~ve~Rl~~L~~l~RKY~~~~~~l~~~~~~~~~el~~L~~~~~~~~~Le~~~~~l~~~~~~~A~~Ls~~R~~~ 372 (557)
T COG0497 293 EFDPNRLEEVEERLFALKSLARKYGVTIEDLLEYLDKIKEELAQLDNSEESLEALEKEVKKLKAELLEAAEALSAIRKKA 372 (557)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455555555544444443321 2344444444444444433321 111111 13455555554 45678
Q ss_pred HHHHHHHHHHHHhhcccc
Q psy8325 216 FTTLVSKVRDKKNQLELD 233 (237)
Q Consensus 216 ~~~l~~~ik~k~~~l~~~ 233 (237)
-.+|...|.+.+..|-+.
T Consensus 373 A~~L~~~v~~eL~~L~Me 390 (557)
T COG0497 373 AKELEKEVTAELKALAME 390 (557)
T ss_pred HHHHHHHHHHHHHhcCCC
Confidence 889999999998888654
No 289
>PRK14701 reverse gyrase; Provisional
Probab=22.97 E-value=37 Score=38.32 Aligned_cols=21 Identities=19% Similarity=0.545 Sum_probs=17.0
Q ss_pred CCCCcCCCCCC----CCCccCcCCC
Q psy8325 38 LKCWKCLKNLS----GKNLFCQYCS 58 (237)
Q Consensus 38 ~~Cw~C~~~~~----~~~~fC~~C~ 58 (237)
..||||++++. ...+.|+.|-
T Consensus 7 ~~cpnc~g~~~~~rl~~g~~c~~cl 31 (1638)
T PRK14701 7 EMCPNCGGDITDERLAEKGVCEKCL 31 (1638)
T ss_pred ccCCCCCCccchhHHhcCCCchhhc
Confidence 37999999883 5668999994
No 290
>COG1326 Uncharacterized archaeal Zn-finger protein [General function prediction only]
Probab=22.85 E-value=31 Score=29.98 Aligned_cols=27 Identities=19% Similarity=0.495 Sum_probs=19.4
Q ss_pred CCCCCcCCC-CC---------CCCCccCcCCCcccCC
Q psy8325 37 ELKCWKCLK-NL---------SGKNLFCQYCSSVQKP 63 (237)
Q Consensus 37 ~~~Cw~C~~-~~---------~~~~~fC~~C~~~qp~ 63 (237)
...|++|+. .+ .....-|+.||.+-|.
T Consensus 6 y~~Cp~Cg~eev~hEVik~~g~~~lvrC~eCG~V~~~ 42 (201)
T COG1326 6 YIECPSCGSEEVSHEVIKERGREPLVRCEECGTVHPA 42 (201)
T ss_pred EEECCCCCcchhhHHHHHhcCCceEEEccCCCcEeec
Confidence 457999992 11 2347889999999854
No 291
>TIGR00354 polC DNA polymerase, archaeal type II, large subunit. This model represents the large subunit, DP2, of a two subunit novel Archaeal replicative DNA polymerase first characterized for Pyrococcus furiosus. Structure of DP2 appears to be organized as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit.
Probab=22.68 E-value=46 Score=35.49 Aligned_cols=22 Identities=32% Similarity=0.543 Sum_probs=17.5
Q ss_pred CCCCCcCCCCCCCCCccCcCCCcc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSSV 60 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~~ 60 (237)
...|++|+.. +....||.|+.-
T Consensus 625 ~RKCPkCG~y--Tlk~rCP~CG~~ 646 (1095)
T TIGR00354 625 IRKCPQCGKE--SFWLKCPVCGEL 646 (1095)
T ss_pred EEECCCCCcc--cccccCCCCCCc
Confidence 3479999985 467899999874
No 292
>TIGR01562 FdhE formate dehydrogenase accessory protein FdhE. The only sequence scoring between trusted and noise is that from Aquifex aeolicus, which shows certain structural differences from the proteobacterial forms in the alignment. However it is notable that A. aeolicus also has a sequence scoring above trusted to the alpha subunit of formate dehydrogenase (TIGR01553).
Probab=22.54 E-value=43 Score=30.94 Aligned_cols=33 Identities=21% Similarity=0.423 Sum_probs=25.1
Q ss_pred CCCCcCCCCCCCCCccCcCCCcccCCCCCCChhhhcC
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSSVQKPDSQSNYYDLFD 74 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~~qp~~~~~nyf~llg 74 (237)
..|.-|+..+...-..|+.|+.- ...+||.+=+
T Consensus 211 L~CslC~teW~~~R~~C~~Cg~~----~~l~y~~~e~ 243 (305)
T TIGR01562 211 LSCSLCATEWHYVRVKCSHCEES----KHLAYLSLEH 243 (305)
T ss_pred EEcCCCCCcccccCccCCCCCCC----CceeeEeecC
Confidence 37999999887778899999872 3467776654
No 293
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=22.50 E-value=46 Score=28.37 Aligned_cols=25 Identities=16% Similarity=0.325 Sum_probs=18.5
Q ss_pred CCCCCcCCCCC-----CCCCccCcCCCccc
Q psy8325 37 ELKCWKCLKNL-----SGKNLFCQYCSSVQ 61 (237)
Q Consensus 37 ~~~Cw~C~~~~-----~~~~~fC~~C~~~q 61 (237)
.-.|++|+-.. ....|.||.||..-
T Consensus 113 ~y~C~~~~~r~sfdeA~~~~F~Cp~Cg~~L 142 (176)
T COG1675 113 YYVCPNCHVKYSFDEAMELGFTCPKCGEDL 142 (176)
T ss_pred ceeCCCCCCcccHHHHHHhCCCCCCCCchh
Confidence 44798888765 25669999998843
No 294
>COG0264 Tsf Translation elongation factor Ts [Translation, ribosomal structure and biogenesis]
Probab=22.48 E-value=1.3e+02 Score=27.81 Aligned_cols=39 Identities=18% Similarity=0.226 Sum_probs=33.5
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHH
Q psy8325 172 SEEDLEKLQTLIQATIDDLTKNVNASFEQKDFNQAKDLLIR 212 (237)
Q Consensus 172 ~~~~l~~l~~~~~~~i~~~~~~l~~~f~~~d~~~A~~~l~r 212 (237)
+.+.+.+|++.-..=+-+|.+.|.++ .||++.|.++|++
T Consensus 5 ta~~VKeLRe~TgAGMmdCKkAL~E~--~Gd~EkAie~LR~ 43 (296)
T COG0264 5 TAALVKELREKTGAGMMDCKKALEEA--NGDIEKAIEWLRE 43 (296)
T ss_pred cHHHHHHHHHHhCCcHHHHHHHHHHc--CCCHHHHHHHHHH
Confidence 45677888888888888999999887 8999999999987
No 295
>cd00088 HPT Histidine Phosphotransfer domain, involved in signalling through a two part component systems in which an autophosphorylating histidine protein kinase serves as a phosphoryl donor to a response regulator protein; the response regulator protein is modulated by phosphorylation and dephosphorylation of a conserved aspartic acid residue; two-component proteins are abundant in most eubacteria; In E. coli there are 62 two-component proteins involved in a variety of processes such as chemotaxis, osmoregulation, metabolism and transport 1; also present in both Gram positive and Gram negative pathogenic bacteria where they regulate basic housekeeping functions and control expression of toxins and other proteins important for pathogenesis; in archaea and eukaryotes, two-component pathways constitute a very small number of all signaling systems; in fungi they mediate environmental stress responses and, in pathogenic yeast, hyphal development. In Dictyostelium and in plants, they are i
Probab=22.48 E-value=2.1e+02 Score=20.45 Aligned_cols=14 Identities=14% Similarity=0.164 Sum_probs=6.4
Q ss_pred cccHHHHHHHHHHH
Q psy8325 200 QKDFNQAKDLLIRM 213 (237)
Q Consensus 200 ~~d~~~A~~~l~rl 213 (237)
.+|++.+...+.+|
T Consensus 29 ~~d~~~l~~~~H~L 42 (94)
T cd00088 29 AEDLNEIFRAAHTL 42 (94)
T ss_pred HHHHHHHHHHHHhh
Confidence 34444444444444
No 296
>PHA02998 RNA polymerase subunit; Provisional
Probab=22.28 E-value=55 Score=28.18 Aligned_cols=28 Identities=18% Similarity=0.389 Sum_probs=19.2
Q ss_pred CCCCCCcCCCCC--------------CCCCccCcCCCc-ccCC
Q psy8325 36 TELKCWKCLKNL--------------SGKNLFCQYCSS-VQKP 63 (237)
Q Consensus 36 ~~~~Cw~C~~~~--------------~~~~~fC~~C~~-~qp~ 63 (237)
....|+.|++.- .+..+.|..|+. ..||
T Consensus 142 t~v~CPkCg~~~A~f~qlQTRSADEPmT~FYkC~~CG~~wkpp 184 (195)
T PHA02998 142 YNTPCPNCKSKNTTPMMIQTRAADEPPLVRHACRDCKKHFKPP 184 (195)
T ss_pred cCCCCCCCCCCceEEEEEeeccCCCCceEEEEcCCCCCccCCc
Confidence 346899999841 144568999987 5555
No 297
>PF10279 Latarcin: Latarcin precursor; InterPro: IPR018802 This entry represents the precursor proteins for a number of short antimicrobial peptides called Latarcins. Latarcins were discovered in the venom of the spider Lachesana tarabaevi []. Latarcins are likely to adopt amphipathic alpha-helical structure in the plasma membrane. ; PDB: 2PCO_A 2G9P_A.
Probab=22.16 E-value=20 Score=26.58 Aligned_cols=43 Identities=12% Similarity=0.293 Sum_probs=7.7
Q ss_pred HHHHHHHHHHHHHHHhccccHHHHHHHHHHH-HhHHHHHHHHHH
Q psy8325 183 IQATIDDLTKNVNASFEQKDFNQAKDLLIRM-KYFTTLVSKVRD 225 (237)
Q Consensus 183 ~~~~i~~~~~~l~~~f~~~d~~~A~~~l~rl-kY~~~l~~~ik~ 225 (237)
...++.+..+++...+.-.|.++|+.....| .+|.++...+++
T Consensus 36 ~~~rl~e~~e~l~k~~~~~~~EEARg~~~~~K~k~kk~~~~lkk 79 (81)
T PF10279_consen 36 VQGRLMEMVEQLRKVQGREDTEEARGAFQTFKPKFKKMREKLKK 79 (81)
T ss_dssp --------------------------CHCHCHHHCCHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhCcccHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 5566778888888888888888888733333 245555555544
No 298
>PF05600 DUF773: Protein of unknown function (DUF773); InterPro: IPR008491 This family contains several eukaryotic sequences which are thought to be CDK5 activator-binding proteins, however, the function of this family is unknown.
Probab=21.82 E-value=2.6e+02 Score=27.63 Aligned_cols=23 Identities=30% Similarity=0.416 Sum_probs=17.7
Q ss_pred HHHHHHHHhcCcchhhHHHHhhc
Q psy8325 119 YLNKAYSILQNPLERGLYLLSLQ 141 (237)
Q Consensus 119 ~In~AY~tL~dp~~Ra~Yll~l~ 141 (237)
.-.+|+..|-||..|..++-+|+
T Consensus 339 ~g~~a~tlLe~~~~R~~fldeL~ 361 (507)
T PF05600_consen 339 RGDDALTLLENPETRNQFLDELL 361 (507)
T ss_pred cCchhhhhcCCHhHHHHHHHHHH
Confidence 34578888888999998876654
No 299
>PRK11823 DNA repair protein RadA; Provisional
Probab=21.80 E-value=54 Score=31.62 Aligned_cols=26 Identities=15% Similarity=0.213 Sum_probs=19.3
Q ss_pred CCCCcCCCCCCCCCccCcCCCcccCC
Q psy8325 38 LKCWKCLKNLSGKNLFCQYCSSVQKP 63 (237)
Q Consensus 38 ~~Cw~C~~~~~~~~~fC~~C~~~qp~ 63 (237)
-.|-+|+.....-.-.||+|+..-..
T Consensus 8 y~C~~Cg~~~~~~~g~Cp~C~~w~t~ 33 (446)
T PRK11823 8 YVCQECGAESPKWLGRCPECGAWNTL 33 (446)
T ss_pred EECCcCCCCCcccCeeCcCCCCccce
Confidence 37999998776667789999775543
No 300
>cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules.
Probab=21.79 E-value=54 Score=30.93 Aligned_cols=23 Identities=17% Similarity=0.180 Sum_probs=14.8
Q ss_pred CCCcCCCCCCCCCccCcCCCccc
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSSVQ 61 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~~q 61 (237)
.|-+|+.....-.-.||+|+..-
T Consensus 2 ~c~~cg~~~~~~~g~cp~c~~w~ 24 (372)
T cd01121 2 VCSECGYVSPKWLGKCPECGEWN 24 (372)
T ss_pred CCCCCCCCCCCccEECcCCCCce
Confidence 47777776555556677776543
No 301
>PF01895 PhoU: PhoU domain; InterPro: IPR008170 This family contains phosphate regulatory proteins including PhoU. PhoU proteins are known to play a role in the regulation of phosphate uptake. The PhoU domain is composed of a three helix bundle []. The PhoU protein contains two copies of this domain. The domain binds to an iron cluster via its conserved E/DXXXD motif. Deletion of PhoU activates constitutive expression of the phosphate ABC transporter and allows phosphate transport, but causes a growth defect; suggesting that the protein has some secondary function []. ; PDB: 2I0M_A 1T72_B 1T8B_A 1SUM_B 1VCT_A 2BKN_A 2BKP_A 2BKO_A.
Probab=21.42 E-value=2.8e+02 Score=18.95 Aligned_cols=50 Identities=8% Similarity=0.161 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcc-cc---HHHHHHHHHHHHhHHHHHHHHHH
Q psy8325 176 LEKLQTLIQATIDDLTKNVNASFEQ-KD---FNQAKDLLIRMKYFTTLVSKVRD 225 (237)
Q Consensus 176 l~~l~~~~~~~i~~~~~~l~~~f~~-~d---~~~A~~~l~rlkY~~~l~~~ik~ 225 (237)
+......+........+.+-..... ++ .......+.-++++.++-+.+.+
T Consensus 30 i~~~e~~id~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~lERigD~~~n 83 (88)
T PF01895_consen 30 IIQLEEEIDELYREIRRQILKILKNQNPLEELRELVGLLRIARDLERIGDHAVN 83 (88)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCGHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444555555554444444432 23 44466777788888888776654
No 302
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=21.41 E-value=64 Score=26.28 Aligned_cols=22 Identities=23% Similarity=0.459 Sum_probs=13.9
Q ss_pred CCCCcCCCCCCCC----CccCcCCCc
Q psy8325 38 LKCWKCLKNLSGK----NLFCQYCSS 59 (237)
Q Consensus 38 ~~Cw~C~~~~~~~----~~fC~~C~~ 59 (237)
..|.+|+...... .--||.||.
T Consensus 2 H~Ct~Cg~~f~dgs~eil~GCP~CGg 27 (131)
T PF09845_consen 2 HQCTKCGRVFEDGSKEILSGCPECGG 27 (131)
T ss_pred cccCcCCCCcCCCcHHHHccCcccCC
Confidence 4688888865322 246888854
No 303
>PRK00810 nifW nitrogenase stabilizing/protective protein; Provisional
Probab=21.33 E-value=2.2e+02 Score=22.59 Aligned_cols=60 Identities=17% Similarity=0.308 Sum_probs=36.3
Q ss_pred CCCChhhhcCCcccC---CCCHHHHHHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHHHHHHHHHhcC
Q psy8325 65 SQSNYYDLFDLKLTY---LINNVDLSKKFKQLQSQLHPDKFSNKNQEEQAISETYSSYLNKAYSILQN 129 (237)
Q Consensus 65 ~~~nyf~llgl~~~f---~id~~~Lk~~Yr~Lq~~~HPDk~~~~~~~e~~~a~~~s~~In~AY~tL~d 129 (237)
...+||+.||++-.- ++..--|=++|.+...... +.+.++.+. ....-..+.+||+....
T Consensus 17 sAEdff~ff~V~YDp~vvnV~RLHILKrF~~yL~~~~---~~~~~e~~~--~~~yr~aL~~AY~dF~~ 79 (113)
T PRK00810 17 SAEEFFQLLGVPYDPKVVNVARLHILKRMGQYLAQED---FAGLPEAEA--RARCRAVLERAYADFVA 79 (113)
T ss_pred cHHHHHHHhCCCCCHHHHHHhHHHHHHHHHHHHHhcc---cCCCCHHHH--HHHHHHHHHHHHHHHcc
Confidence 346899999997431 2333346677777665433 444444443 23556778899987643
No 304
>PF08882 Acetone_carb_G: Acetone carboxylase gamma subunit; InterPro: IPR014979 Acetone carboxylase is the key enzyme of bacterial acetone metabolism, catalysing the condensation of acetone and CO2 to form acetoacetate [] according to the following reaction: CH3COCH3 + CO2 + ATP = CH3COCH2COO- + AMP + 2P(i) + H+ It has the subunit composition: (alpha(2)beta(2)gamma(2) multimers of 85kDa, 78kDa, and 20kDa subunits). It is expressed to high levels (17 to 25% of soluble protein) in cells grown with acetone as the carbon source but are not present at detectable levels in cells grown with other carbon sources []. Acetone carboxylase may enable Helicobacter pylori to survive off acetone in the stomach of humans and other mammals where it is the etiological agent of peptic ulcer disease []. This entry represents the family of gamma subunit-related acetone carboxylase proteins.
Probab=21.15 E-value=50 Score=26.15 Aligned_cols=17 Identities=12% Similarity=0.491 Sum_probs=12.6
Q ss_pred CCCccCcCCCcccCCCC
Q psy8325 49 GKNLFCQYCSSVQKPDS 65 (237)
Q Consensus 49 ~~~~fC~~C~~~qp~~~ 65 (237)
...|+||+|+.+..+..
T Consensus 72 irEyyCP~Cgt~levE~ 88 (112)
T PF08882_consen 72 IREYYCPGCGTQLEVEA 88 (112)
T ss_pred EEEEECCCCcceeEEcc
Confidence 35789999999765543
No 305
>PRK06556 vitamin B12-dependent ribonucleotide reductase; Validated
Probab=20.98 E-value=54 Score=34.98 Aligned_cols=26 Identities=23% Similarity=0.531 Sum_probs=20.5
Q ss_pred cCCCCCCcCCCCC--CCCCccCcCCCcc
Q psy8325 35 HTELKCWKCLKNL--SGKNLFCQYCSSV 60 (237)
Q Consensus 35 ~~~~~Cw~C~~~~--~~~~~fC~~C~~~ 60 (237)
...+.|.+|+..+ +...+.|..||..
T Consensus 922 ~~~~~c~~c~~~~~~~g~c~~c~~cg~t 949 (953)
T PRK06556 922 ADAPLCPTCGTKMVRNGSCYVCEGCGST 949 (953)
T ss_pred ccCCcCCCccCeeeECCceEeccCCCCC
Confidence 3466799998765 5788999999874
No 306
>PRK08270 anaerobic ribonucleoside triphosphate reductase; Provisional
Probab=20.85 E-value=58 Score=33.23 Aligned_cols=22 Identities=32% Similarity=0.418 Sum_probs=17.4
Q ss_pred CCCCCcCCCCCCCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
...|..|+.. ....+.||.||.
T Consensus 626 ~~~C~~CG~~-~g~~~~CP~CG~ 647 (656)
T PRK08270 626 FSICPKHGYL-SGEHEFCPKCGE 647 (656)
T ss_pred CcccCCCCCc-CCCCCCCcCCcC
Confidence 4489999983 456799999984
No 307
>PF14122 YokU: YokU-like protein
Probab=20.68 E-value=47 Score=25.15 Aligned_cols=30 Identities=17% Similarity=0.400 Sum_probs=20.9
Q ss_pred CCCCccCcCCCcccCC-------------------CCCCChhhhcCCcc
Q psy8325 48 SGKNLFCQYCSSVQKP-------------------DSQSNYYDLFDLKL 77 (237)
Q Consensus 48 ~~~~~fC~~C~~~qp~-------------------~~~~nyf~llgl~~ 77 (237)
..+...|.+||.+--. +.+.+|=+++++|+
T Consensus 32 ~tP~i~C~~CgmvYq~d~vi~EIEdql~Lidtk~l~~sity~~LM~~~r 80 (87)
T PF14122_consen 32 DTPAIICSNCGMVYQDDEVIKEIEDQLFLIDTKKLPNSITYEELMNQPR 80 (87)
T ss_pred CCceeeecCCCcEEehhHHHHHHhhheEEEecccCCccccHHHHhhchh
Confidence 4778999999875332 23567888887765
No 308
>COG5349 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.62 E-value=44 Score=26.93 Aligned_cols=39 Identities=15% Similarity=0.354 Sum_probs=29.0
Q ss_pred CCCCCcCCCCC-----CCCCccCcCCCc---ccCCCCCCChhhhcCC
Q psy8325 37 ELKCWKCLKNL-----SGKNLFCQYCSS---VQKPDSQSNYYDLFDL 75 (237)
Q Consensus 37 ~~~Cw~C~~~~-----~~~~~fC~~C~~---~qp~~~~~nyf~llgl 75 (237)
...|++|+.-- .+....|.+|+. ..+++...-||-++-+
T Consensus 21 ~grCP~CGeGrLF~gFLK~~p~C~aCG~dyg~~~a~DgPa~fvI~Iv 67 (126)
T COG5349 21 RGRCPRCGEGRLFRGFLKVVPACEACGLDYGFADADDGPAYFVILIV 67 (126)
T ss_pred cCCCCCCCCchhhhhhcccCchhhhccccccCCcccCCCcEEEEEee
Confidence 44899998742 467788999976 5666666788888765
No 309
>TIGR02827 RNR_anaer_Bdell anaerobic ribonucleoside-triphosphate reductase. Members of this family belong to the class III anaerobic ribonucleoside-triphosphate reductases (RNR). These glycine-radical-containing enzymes are oxygen-sensitive and operate under anaerobic conditions. The genes for this family are pair with genes for an acitivating protein that creates a glycine radical. Members of this family, though related, fall outside the scope of TIGR02487, a functionally equivalent protein set; no genome has members in both familes. Identification as RNR is supported by gene pairing with the activating protein, lack of other anaerobic RNR, and presence of an upstream regulatory element strongly conserved upstream of most RNR operons.
Probab=20.45 E-value=61 Score=32.68 Aligned_cols=23 Identities=22% Similarity=0.307 Sum_probs=16.5
Q ss_pred CCCCCcCCCCCCCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
-..|..|+.........||.|+.
T Consensus 532 ~siC~~CGy~~g~~~~~CP~CGs 554 (586)
T TIGR02827 532 ITICNDCHHIDKRTLHRCPVCGS 554 (586)
T ss_pred CeecCCCCCcCCCcCCcCcCCCC
Confidence 44899999832223489999985
No 310
>PF13597 NRDD: Anaerobic ribonucleoside-triphosphate reductase; PDB: 1HK8_A 1H78_A 1H7A_A 1H79_A 1H7B_A.
Probab=20.35 E-value=36 Score=33.85 Aligned_cols=32 Identities=19% Similarity=0.230 Sum_probs=13.7
Q ss_pred CCCcCCCCCCCCCccCcCCCcc--cCCCCCCChhh
Q psy8325 39 KCWKCLKNLSGKNLFCQYCSSV--QKPDSQSNYYD 71 (237)
Q Consensus 39 ~Cw~C~~~~~~~~~fC~~C~~~--qp~~~~~nyf~ 71 (237)
.|.+|+..... ...||.|+.- .-..+-.-||.
T Consensus 493 ~C~~CG~~~~~-~~~CP~CGs~~~~~~~Rv~GYl~ 526 (546)
T PF13597_consen 493 ICPDCGYIGGE-GDKCPKCGSENIEVYSRVTGYLR 526 (546)
T ss_dssp EETTT---S---EEE-CCC----EEEEB-SSSS-B
T ss_pred cccCCCcCCCC-CCCCCCCCCcccceEEEeecccc
Confidence 78899886544 7889999762 22233345555
No 311
>PRK12336 translation initiation factor IF-2 subunit beta; Provisional
Probab=20.18 E-value=54 Score=28.31 Aligned_cols=29 Identities=17% Similarity=0.236 Sum_probs=22.1
Q ss_pred CCCCCcCCCCC-------CCCCccCcCCCcccCCCC
Q psy8325 37 ELKCWKCLKNL-------SGKNLFCQYCSSVQKPDS 65 (237)
Q Consensus 37 ~~~Cw~C~~~~-------~~~~~fC~~C~~~qp~~~ 65 (237)
...|+.|+.|= ....+.|..||+..|+..
T Consensus 98 yV~C~~C~~pdT~l~k~~~~~~l~C~aCGa~~~v~~ 133 (201)
T PRK12336 98 YVICSECGLPDTRLVKEDRVLMLRCDACGAHRPVKK 133 (201)
T ss_pred eEECCCCCCCCcEEEEcCCeEEEEcccCCCCccccc
Confidence 56899999972 233467999999988864
No 312
>PRK11595 DNA utilization protein GntX; Provisional
Probab=20.12 E-value=50 Score=28.64 Aligned_cols=23 Identities=22% Similarity=0.395 Sum_probs=15.5
Q ss_pred CCCCCcCCCCCCCCCccCcCCCc
Q psy8325 37 ELKCWKCLKNLSGKNLFCQYCSS 59 (237)
Q Consensus 37 ~~~Cw~C~~~~~~~~~fC~~C~~ 59 (237)
...|+.|+.+.......|+.|..
T Consensus 34 ~~~C~~Cg~~~~~~~~~C~~C~~ 56 (227)
T PRK11595 34 KTCCPQCGLPATHPHLPCGRCLQ 56 (227)
T ss_pred cCcCccCCCcCCCCCCCcHHHHc
Confidence 34688888876444456888855
Done!