Query psy5736
Match_columns 346
No_of_seqs 336 out of 1547
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 16:42:00 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5736.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5736hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2084|consensus 99.6 3.5E-15 7.6E-20 149.8 16.9 180 136-333 197-377 (482)
2 PF00856 SET: SET domain; Int 99.4 3.1E-13 6.7E-18 114.5 5.2 50 135-184 109-162 (162)
3 KOG1517|consensus 99.2 1.2E-11 2.7E-16 129.7 5.6 77 6-86 1086-1164(1387)
4 smart00317 SET SET (Su(var)3-9 99.1 6E-11 1.3E-15 95.9 3.9 46 138-183 67-116 (116)
5 KOG2589|consensus 98.6 1.6E-08 3.4E-13 96.4 3.0 60 144-215 196-256 (453)
6 COG2940 Proteins containing SE 98.0 4.2E-06 9.1E-11 85.3 3.3 71 143-215 404-478 (480)
7 KOG4442|consensus 97.9 7E-06 1.5E-10 84.3 3.6 41 145-185 194-238 (729)
8 KOG1080|consensus 97.8 1.1E-05 2.4E-10 87.4 3.6 41 144-184 939-983 (1005)
9 KOG1082|consensus 97.2 0.00026 5.6E-09 69.8 3.0 42 146-187 274-323 (364)
10 KOG1141|consensus 96.4 0.0012 2.5E-08 69.3 1.2 41 145-185 1190-1238(1262)
11 KOG1083|consensus 96.3 0.0023 5E-08 68.8 2.9 41 145-185 1251-1295(1306)
12 KOG1337|consensus 95.9 0.0067 1.4E-07 61.8 3.5 61 138-206 230-291 (472)
13 KOG1085|consensus 95.7 0.0074 1.6E-07 56.6 2.6 41 145-185 334-378 (392)
14 KOG0294|consensus 95.0 0.036 7.8E-07 52.8 4.8 65 3-74 226-292 (362)
15 KOG0282|consensus 94.2 0.071 1.5E-06 53.3 5.1 53 2-63 276-328 (503)
16 KOG0296|consensus 94.1 0.048 1E-06 52.8 3.6 51 2-63 304-354 (399)
17 PF13424 TPR_12: Tetratricopep 94.1 0.19 4.2E-06 37.3 6.2 51 283-334 12-62 (78)
18 KOG1079|consensus 93.3 0.074 1.6E-06 55.3 3.5 42 144-185 665-710 (739)
19 PLN00181 protein SPA1-RELATED; 92.8 0.19 4.1E-06 54.4 6.0 54 3-64 552-605 (793)
20 PF13374 TPR_10: Tetratricopep 92.3 0.32 6.9E-06 31.2 4.5 34 283-316 9-42 (42)
21 KOG0302|consensus 91.3 0.3 6.4E-06 47.8 4.8 57 3-67 277-336 (440)
22 KOG1188|consensus 91.3 0.3 6.5E-06 47.1 4.7 51 5-62 49-99 (376)
23 KOG0646|consensus 90.8 0.19 4.1E-06 50.1 3.0 51 3-62 100-150 (476)
24 KOG0283|consensus 90.6 0.31 6.6E-06 51.5 4.4 54 3-65 387-440 (712)
25 PF00400 WD40: WD domain, G-be 90.3 0.4 8.7E-06 30.4 3.3 31 31-63 9-39 (39)
26 PTZ00421 coronin; Provisional 90.3 0.47 1E-05 48.8 5.4 60 3-63 95-154 (493)
27 PTZ00421 coronin; Provisional 90.2 0.54 1.2E-05 48.3 5.8 52 3-63 145-196 (493)
28 KOG0264|consensus 89.8 0.51 1.1E-05 46.8 5.0 57 3-65 247-305 (422)
29 KOG0291|consensus 89.7 0.43 9.3E-06 50.4 4.5 51 5-64 329-379 (893)
30 PF13512 TPR_18: Tetratricopep 88.2 1.6 3.4E-05 37.4 6.2 52 277-333 11-62 (142)
31 PF13525 YfiO: Outer membrane 87.9 2 4.2E-05 38.5 7.2 54 276-334 5-58 (203)
32 PTZ00420 coronin; Provisional 87.8 1.2 2.6E-05 46.6 6.4 55 3-65 51-105 (568)
33 KOG0294|consensus 87.7 0.67 1.5E-05 44.4 4.1 55 2-63 59-113 (362)
34 KOG0319|consensus 87.3 0.8 1.7E-05 48.2 4.7 55 2-63 123-177 (775)
35 KOG1188|consensus 87.2 0.69 1.5E-05 44.7 3.9 55 3-64 140-195 (376)
36 KOG0302|consensus 87.2 0.85 1.9E-05 44.7 4.5 55 3-65 321-378 (440)
37 KOG0281|consensus 86.8 0.41 8.9E-06 46.5 2.1 61 4-64 255-347 (499)
38 KOG0269|consensus 86.4 0.57 1.2E-05 49.5 3.1 52 4-63 108-162 (839)
39 PTZ00420 coronin; Provisional 86.3 1.4 2.9E-05 46.2 5.8 52 3-64 145-196 (568)
40 KOG0270|consensus 86.1 1.2 2.6E-05 44.4 4.9 53 2-62 262-314 (463)
41 KOG0316|consensus 85.9 1.4 3.1E-05 40.8 5.0 51 3-62 78-128 (307)
42 KOG1081|consensus 85.7 0.33 7.2E-06 49.4 1.0 50 136-185 362-416 (463)
43 KOG0273|consensus 85.6 1.6 3.5E-05 43.9 5.6 60 2-63 294-387 (524)
44 KOG0276|consensus 85.5 0.96 2.1E-05 47.0 4.1 50 4-62 33-82 (794)
45 cd00200 WD40 WD40 domain, foun 85.0 2 4.3E-05 37.8 5.6 54 3-65 70-123 (289)
46 KOG1338|consensus 83.9 0.93 2E-05 44.7 3.1 53 133-185 207-260 (466)
47 KOG0306|consensus 83.7 1 2.2E-05 47.7 3.5 54 5-65 42-95 (888)
48 KOG0293|consensus 82.6 1.4 3E-05 43.8 3.7 54 4-65 460-513 (519)
49 KOG1334|consensus 82.0 1.3 2.8E-05 44.8 3.3 57 3-65 161-217 (559)
50 KOG0274|consensus 81.8 2.7 5.9E-05 43.7 5.8 50 3-63 308-357 (537)
51 PF13432 TPR_16: Tetratricopep 81.8 2.9 6.3E-05 29.6 4.4 46 281-334 2-47 (65)
52 PLN00181 protein SPA1-RELATED; 81.7 2.8 6E-05 45.5 6.1 53 3-64 595-647 (793)
53 COG1729 Uncharacterized protei 81.6 3 6.6E-05 39.2 5.5 52 278-334 143-194 (262)
54 KOG0263|consensus 81.4 2.9 6.2E-05 44.3 5.7 54 3-65 554-607 (707)
55 KOG0310|consensus 81.3 1.7 3.8E-05 43.7 3.9 49 6-63 217-266 (487)
56 KOG0271|consensus 81.3 2.3 4.9E-05 42.0 4.6 54 4-63 177-233 (480)
57 TIGR02795 tol_pal_ybgF tol-pal 81.1 4.3 9.4E-05 31.6 5.6 48 282-334 45-92 (119)
58 KOG0266|consensus 81.1 3.1 6.6E-05 42.2 5.8 51 3-62 222-273 (456)
59 KOG0266|consensus 80.4 2.9 6.4E-05 42.3 5.4 55 3-66 265-319 (456)
60 KOG1332|consensus 80.4 2.3 5.1E-05 39.6 4.2 55 4-63 78-132 (299)
61 KOG0272|consensus 80.4 2.2 4.7E-05 42.4 4.2 52 3-62 364-415 (459)
62 KOG2461|consensus 80.0 0.77 1.7E-05 45.8 1.0 33 155-187 114-146 (396)
63 PF14559 TPR_19: Tetratricopep 80.0 3.3 7.2E-05 29.4 4.2 40 287-334 2-41 (68)
64 KOG0289|consensus 79.9 3.4 7.4E-05 41.3 5.3 50 4-62 367-416 (506)
65 PF13414 TPR_11: TPR repeat; P 79.8 9.4 0.0002 27.1 6.6 49 277-333 4-52 (69)
66 KOG0292|consensus 79.5 2.5 5.3E-05 45.8 4.5 49 5-62 114-162 (1202)
67 cd00200 WD40 WD40 domain, foun 79.3 4.5 9.7E-05 35.4 5.7 51 5-64 114-164 (289)
68 PRK10866 outer membrane biogen 79.1 7.2 0.00016 36.1 7.1 51 278-333 34-84 (243)
69 KOG0263|consensus 78.8 2.2 4.9E-05 45.0 4.0 51 3-62 596-646 (707)
70 KOG0647|consensus 78.7 3 6.5E-05 39.9 4.4 55 3-65 91-145 (347)
71 KOG0276|consensus 78.5 3.3 7.3E-05 43.2 5.0 53 3-62 160-212 (794)
72 KOG0308|consensus 78.0 3.1 6.8E-05 43.5 4.6 53 3-64 190-242 (735)
73 smart00320 WD40 WD40 repeats. 77.9 2.6 5.5E-05 24.3 2.6 30 32-63 11-40 (40)
74 KOG1310|consensus 77.8 3.9 8.4E-05 42.2 5.1 55 5-65 71-125 (758)
75 KOG0278|consensus 77.3 4.9 0.00011 37.7 5.2 53 3-63 243-295 (334)
76 PRK10803 tol-pal system protei 76.4 7.9 0.00017 36.4 6.7 51 279-334 145-196 (263)
77 KOG0273|consensus 76.4 4.4 9.5E-05 40.9 5.0 51 3-62 429-479 (524)
78 KOG0277|consensus 75.9 4.8 0.0001 37.8 4.8 56 4-68 168-226 (311)
79 PF13174 TPR_6: Tetratricopept 75.6 5 0.00011 24.0 3.5 29 279-307 3-31 (33)
80 KOG0275|consensus 75.5 1.6 3.4E-05 42.1 1.6 60 3-65 232-293 (508)
81 KOG1034|consensus 75.5 3.7 7.9E-05 39.8 4.0 53 2-62 111-163 (385)
82 KOG0272|consensus 75.2 5.7 0.00012 39.6 5.4 57 3-66 194-250 (459)
83 KOG0265|consensus 75.0 4.4 9.6E-05 38.7 4.4 52 3-62 109-160 (338)
84 KOG0274|consensus 74.6 5.5 0.00012 41.5 5.5 54 3-67 430-483 (537)
85 TIGR03302 OM_YfiO outer membra 74.4 5.7 0.00012 35.7 5.0 50 279-333 36-85 (235)
86 KOG1517|consensus 74.2 6.3 0.00014 43.6 5.9 69 2-77 1183-1251(1387)
87 PF10516 SHNi-TPR: SHNi-TPR; 73.5 4.6 0.0001 26.4 3.0 27 286-312 11-37 (38)
88 PF09986 DUF2225: Uncharacteri 73.5 10 0.00022 34.5 6.4 80 254-334 48-134 (214)
89 cd02681 MIT_calpain7_1 MIT: do 73.3 24 0.00052 26.8 7.4 57 275-331 5-64 (76)
90 KOG0275|consensus 72.4 3.3 7.2E-05 40.0 3.0 51 4-62 283-333 (508)
91 KOG2055|consensus 72.3 7.4 0.00016 39.2 5.4 65 3-76 363-431 (514)
92 KOG0308|consensus 72.0 6 0.00013 41.4 4.9 59 2-62 136-198 (735)
93 KOG0271|consensus 71.7 7.2 0.00016 38.6 5.1 54 3-66 224-277 (480)
94 KOG1840|consensus 70.6 9.3 0.0002 39.5 6.0 54 283-336 248-301 (508)
95 COG4105 ComL DNA uptake lipopr 70.3 13 0.00027 34.9 6.2 52 276-332 34-85 (254)
96 KOG0268|consensus 70.2 5.5 0.00012 39.1 3.9 53 6-66 167-219 (433)
97 KOG0315|consensus 70.2 9.7 0.00021 35.7 5.4 62 3-74 102-164 (311)
98 KOG0305|consensus 70.1 6.7 0.00015 40.2 4.8 54 4-65 366-419 (484)
99 TIGR02795 tol_pal_ybgF tol-pal 69.6 16 0.00034 28.4 6.0 27 279-305 5-31 (119)
100 KOG0646|consensus 69.4 7.4 0.00016 39.2 4.7 61 3-65 142-206 (476)
101 KOG0296|consensus 69.3 11 0.00023 37.1 5.6 53 5-66 169-221 (399)
102 cd02678 MIT_VPS4 MIT: domain c 69.0 35 0.00076 25.4 7.4 58 273-330 3-62 (75)
103 cd02684 MIT_2 MIT: domain cont 68.7 34 0.00073 25.7 7.3 58 273-330 3-62 (75)
104 PLN03098 LPA1 LOW PSII ACCUMUL 67.9 8.9 0.00019 38.9 5.0 53 277-334 76-128 (453)
105 PF00515 TPR_1: Tetratricopept 66.6 16 0.00034 22.1 4.4 29 278-306 3-31 (34)
106 KOG0281|consensus 66.6 7.2 0.00016 38.2 3.9 49 4-63 338-386 (499)
107 KOG0285|consensus 66.1 13 0.00027 36.7 5.4 54 3-65 254-307 (460)
108 cd02680 MIT_calpain7_2 MIT: do 65.7 13 0.00028 28.2 4.5 36 273-308 3-38 (75)
109 KOG1007|consensus 65.7 6 0.00013 37.8 3.1 53 4-63 191-243 (370)
110 KOG0291|consensus 65.2 13 0.00027 39.9 5.6 51 3-62 369-419 (893)
111 PF10579 Rapsyn_N: Rapsyn N-te 65.2 35 0.00075 26.3 6.6 54 276-334 6-59 (80)
112 KOG0306|consensus 64.6 9.7 0.00021 40.7 4.7 51 4-63 85-135 (888)
113 PF07719 TPR_2: Tetratricopept 64.2 18 0.00039 21.6 4.3 28 279-306 4-31 (34)
114 KOG0284|consensus 64.0 6.6 0.00014 39.0 3.2 73 3-84 241-322 (464)
115 KOG4328|consensus 64.0 7.9 0.00017 39.0 3.7 51 4-62 208-262 (498)
116 PRK00398 rpoP DNA-directed RNA 63.9 5.2 0.00011 27.0 1.8 30 228-265 3-32 (46)
117 cd02682 MIT_AAA_Arch MIT: doma 63.6 53 0.0012 24.9 7.4 57 275-331 5-63 (75)
118 KOG0772|consensus 63.6 6.4 0.00014 40.3 3.1 56 4-64 337-393 (641)
119 PF14783 BBS2_Mid: Ciliary BBS 63.5 8 0.00017 31.6 3.1 30 36-65 2-33 (111)
120 KOG2110|consensus 63.0 14 0.00031 36.2 5.2 49 5-62 152-201 (391)
121 KOG1840|consensus 62.6 19 0.00042 37.2 6.4 57 287-343 336-392 (508)
122 KOG4328|consensus 61.8 8.2 0.00018 38.9 3.4 56 4-67 299-357 (498)
123 KOG0279|consensus 61.8 15 0.00032 35.0 4.9 59 3-63 251-311 (315)
124 cd02677 MIT_SNX15 MIT: domain 61.6 55 0.0012 24.7 7.2 44 273-316 3-47 (75)
125 KOG0771|consensus 61.6 8.4 0.00018 38.2 3.4 53 3-65 163-215 (398)
126 KOG0277|consensus 61.5 15 0.00033 34.5 4.9 52 4-63 125-176 (311)
127 KOG0279|consensus 61.4 13 0.00029 35.3 4.5 54 3-65 82-135 (315)
128 KOG0268|consensus 60.1 7.9 0.00017 38.0 2.9 57 3-67 291-347 (433)
129 KOG0640|consensus 59.4 16 0.00035 35.3 4.8 54 7-66 239-292 (430)
130 PF13717 zinc_ribbon_4: zinc-r 58.0 5.6 0.00012 25.6 1.1 33 229-264 3-35 (36)
131 KOG0313|consensus 57.8 13 0.00028 36.7 4.0 51 3-63 278-328 (423)
132 TIGR03866 PQQ_ABC_repeats PQQ- 57.1 29 0.00062 31.4 6.1 52 3-63 8-59 (300)
133 KOG0639|consensus 57.0 12 0.00027 38.2 3.8 52 4-64 529-580 (705)
134 PF04212 MIT: MIT (microtubule 56.9 25 0.00054 25.6 4.6 37 274-310 3-39 (69)
135 KOG0643|consensus 55.7 17 0.00037 34.4 4.2 51 4-62 167-217 (327)
136 cd02656 MIT MIT: domain contai 55.7 81 0.0017 23.2 7.7 57 274-330 4-62 (75)
137 CHL00033 ycf3 photosystem I as 55.5 54 0.0012 27.8 7.3 53 277-334 36-88 (168)
138 cd02683 MIT_1 MIT: domain cont 55.2 73 0.0016 24.1 7.0 56 275-330 5-62 (77)
139 KOG0290|consensus 55.2 16 0.00034 35.1 3.9 55 3-62 170-224 (364)
140 KOG0286|consensus 54.8 26 0.00057 33.6 5.4 50 5-62 165-214 (343)
141 PRK10803 tol-pal system protei 54.8 36 0.00078 32.0 6.4 50 282-336 186-235 (263)
142 KOG1446|consensus 54.7 27 0.00058 33.6 5.4 48 5-62 253-300 (311)
143 PRK00420 hypothetical protein; 54.6 60 0.0013 26.6 6.8 10 255-264 41-50 (112)
144 PRK06266 transcription initiat 53.4 13 0.00029 32.8 3.1 10 255-264 137-146 (178)
145 KOG0303|consensus 53.1 26 0.00056 34.9 5.2 52 2-63 150-201 (472)
146 KOG4714|consensus 53.1 9.9 0.00021 35.9 2.2 54 3-63 199-252 (319)
147 PF13719 zinc_ribbon_5: zinc-r 52.7 7 0.00015 25.3 0.9 33 229-264 3-35 (37)
148 PF13240 zinc_ribbon_2: zinc-r 52.4 8.9 0.00019 22.2 1.2 9 255-263 14-22 (23)
149 PF13248 zf-ribbon_3: zinc-rib 52.3 8.7 0.00019 22.7 1.2 23 229-263 3-25 (26)
150 PF14783 BBS2_Mid: Ciliary BBS 52.0 32 0.0007 28.1 4.8 47 5-61 62-110 (111)
151 PF08666 SAF: SAF domain; Int 51.6 9.5 0.00021 26.9 1.5 18 166-183 3-20 (63)
152 COG2888 Predicted Zn-ribbon RN 51.6 13 0.00029 26.8 2.2 49 208-263 11-59 (61)
153 PF04810 zf-Sec23_Sec24: Sec23 51.3 6.5 0.00014 25.9 0.6 32 229-264 3-34 (40)
154 KOG1587|consensus 51.2 25 0.00053 36.8 5.0 55 5-62 264-320 (555)
155 KOG0299|consensus 51.0 28 0.0006 35.2 5.1 60 4-65 346-410 (479)
156 smart00661 RPOL9 RNA polymeras 51.0 12 0.00025 25.6 1.8 30 230-265 2-31 (52)
157 TIGR00373 conserved hypothetic 50.9 8.8 0.00019 33.3 1.5 10 255-264 129-138 (158)
158 KOG2445|consensus 50.7 29 0.00062 33.5 4.9 34 33-66 112-145 (361)
159 KOG2096|consensus 50.6 14 0.0003 35.9 2.8 42 30-73 83-126 (420)
160 KOG0299|consensus 50.3 26 0.00055 35.4 4.7 52 3-65 305-356 (479)
161 TIGR02098 MJ0042_CXXC MJ0042 f 49.8 7.7 0.00017 24.9 0.7 33 229-264 3-35 (38)
162 smart00745 MIT Microtubule Int 49.7 1E+02 0.0022 22.7 7.6 38 273-310 5-42 (77)
163 PRK02603 photosystem I assembl 49.6 73 0.0016 27.2 7.1 50 279-333 38-87 (172)
164 KOG0318|consensus 49.5 23 0.00049 36.5 4.3 17 4-20 79-95 (603)
165 KOG0318|consensus 49.4 37 0.0008 35.0 5.7 56 2-63 208-263 (603)
166 COG2956 Predicted N-acetylgluc 49.1 36 0.00077 33.3 5.3 45 290-334 221-265 (389)
167 KOG0286|consensus 48.5 32 0.0007 33.0 4.8 53 3-64 206-258 (343)
168 KOG2111|consensus 48.4 45 0.00097 32.3 5.8 54 8-68 206-259 (346)
169 PF10571 UPF0547: Uncharacteri 48.2 13 0.00027 22.3 1.4 23 230-264 2-24 (26)
170 PF09577 Spore_YpjB: Sporulati 47.7 43 0.00093 31.0 5.5 57 275-331 4-60 (232)
171 PF13181 TPR_8: Tetratricopept 47.4 51 0.0011 19.6 4.3 26 281-306 6-31 (34)
172 TIGR03302 OM_YfiO outer membra 47.2 50 0.0011 29.4 6.0 48 283-335 173-220 (235)
173 PF14561 TPR_20: Tetratricopep 47.1 58 0.0013 25.3 5.5 51 277-333 23-73 (90)
174 KOG0315|consensus 46.4 30 0.00065 32.6 4.2 50 5-63 60-111 (311)
175 PF09976 TPR_21: Tetratricopep 45.9 1.5E+02 0.0033 24.4 8.4 49 281-334 53-101 (145)
176 TIGR02059 swm_rep_I cyanobacte 45.8 32 0.00068 27.6 3.7 23 163-185 75-97 (101)
177 KOG0772|consensus 44.2 29 0.00062 35.8 4.1 56 2-63 287-345 (641)
178 PF07754 DUF1610: Domain of un 43.9 18 0.00039 21.3 1.6 9 254-262 16-24 (24)
179 KOG0319|consensus 43.4 36 0.00077 36.4 4.7 50 4-62 525-574 (775)
180 KOG0293|consensus 43.1 23 0.00049 35.5 3.1 55 5-65 416-470 (519)
181 PF02150 RNA_POL_M_15KD: RNA p 43.1 8.4 0.00018 24.7 0.1 30 230-266 3-32 (35)
182 PF11817 Foie-gras_1: Foie gra 43.1 82 0.0018 29.1 6.8 51 283-335 185-235 (247)
183 KOG0289|consensus 42.8 47 0.001 33.5 5.2 56 3-65 322-377 (506)
184 cd00350 rubredoxin_like Rubred 42.7 19 0.00042 22.5 1.7 25 229-263 2-26 (33)
185 KOG0640|consensus 42.6 18 0.00038 35.1 2.2 55 4-65 281-335 (430)
186 KOG0645|consensus 42.5 29 0.00062 33.0 3.5 33 31-64 278-310 (312)
187 PF06957 COPI_C: Coatomer (COP 41.8 1E+02 0.0022 31.2 7.5 55 273-327 201-255 (422)
188 PF09538 FYDLN_acid: Protein o 41.8 17 0.00037 29.6 1.7 30 229-267 10-39 (108)
189 KOG0973|consensus 41.3 33 0.00071 37.9 4.2 55 3-66 148-202 (942)
190 PF14276 DUF4363: Domain of un 41.1 1.7E+02 0.0036 23.7 7.6 56 276-331 28-88 (121)
191 PF09976 TPR_21: Tetratricopep 40.6 1.2E+02 0.0027 25.0 7.0 39 287-334 96-134 (145)
192 KOG1539|consensus 40.2 45 0.00098 36.2 4.9 51 4-62 265-315 (910)
193 cd00189 TPR Tetratricopeptide 39.6 62 0.0013 22.3 4.4 16 319-334 69-84 (100)
194 KOG2445|consensus 39.2 52 0.0011 31.8 4.7 59 3-64 32-90 (361)
195 PRK06266 transcription initiat 39.2 22 0.00049 31.4 2.2 33 204-243 117-149 (178)
196 KOG0553|consensus 39.0 86 0.0019 30.2 6.2 33 274-306 79-111 (304)
197 PF08271 TF_Zn_Ribbon: TFIIB z 38.9 16 0.00034 24.3 0.9 11 255-265 20-30 (43)
198 PF14803 Nudix_N_2: Nudix N-te 38.8 17 0.00036 23.2 1.0 30 230-263 2-31 (34)
199 KOG1445|consensus 38.4 40 0.00086 35.7 4.1 60 2-66 599-659 (1012)
200 KOG0278|consensus 38.2 66 0.0014 30.4 5.1 48 4-62 163-210 (334)
201 KOG0305|consensus 38.0 41 0.00089 34.6 4.2 56 2-65 319-376 (484)
202 KOG1240|consensus 37.9 33 0.00072 38.8 3.6 38 26-63 1031-1077(1431)
203 PRK00464 nrdR transcriptional 37.9 1.6E+02 0.0035 25.4 7.3 24 313-336 101-124 (154)
204 KOG1446|consensus 37.0 69 0.0015 30.8 5.2 53 3-64 77-129 (311)
205 PLN03088 SGT1, suppressor of 36.9 74 0.0016 31.1 5.7 44 283-334 43-86 (356)
206 PRK05580 primosome assembly pr 36.8 54 0.0012 35.2 5.1 18 317-334 519-536 (679)
207 PF13371 TPR_9: Tetratricopept 36.7 61 0.0013 23.0 4.0 40 287-334 6-45 (73)
208 KOG0285|consensus 35.9 28 0.00062 34.3 2.5 53 2-63 211-263 (460)
209 PF04423 Rad50_zn_hook: Rad50 35.7 53 0.0012 22.8 3.3 25 255-279 21-45 (54)
210 COG1998 RPS31 Ribosomal protei 35.4 22 0.00047 24.7 1.2 27 229-263 20-46 (51)
211 PRK10866 outer membrane biogen 35.2 1.2E+02 0.0025 28.0 6.5 44 284-332 77-120 (243)
212 KOG1063|consensus 35.1 72 0.0016 34.0 5.4 54 3-63 31-85 (764)
213 PF13525 YfiO: Outer membrane 35.1 1.4E+02 0.003 26.5 6.7 46 281-331 47-92 (203)
214 PF08969 USP8_dimer: USP8 dime 35.1 83 0.0018 25.4 4.9 45 270-316 32-76 (115)
215 KOG0300|consensus 35.0 31 0.00067 33.5 2.5 51 4-63 168-218 (481)
216 PF01599 Ribosomal_S27: Riboso 35.0 24 0.00052 24.3 1.3 30 227-262 17-46 (47)
217 COG2319 FOG: WD40 repeat [Gene 34.9 83 0.0018 28.7 5.5 51 4-62 176-226 (466)
218 KOG0649|consensus 34.7 73 0.0016 30.0 4.8 55 2-65 132-186 (325)
219 PF13176 TPR_7: Tetratricopept 34.7 1E+02 0.0022 19.1 4.2 22 287-308 10-31 (36)
220 PRK10454 PTS system N,N'-diace 34.6 52 0.0011 27.1 3.5 32 275-306 30-61 (115)
221 KOG0647|consensus 34.5 65 0.0014 31.1 4.6 58 3-67 47-104 (347)
222 PF12773 DZR: Double zinc ribb 34.4 32 0.0007 23.2 2.0 41 209-266 1-41 (50)
223 TIGR03866 PQQ_ABC_repeats PQQ- 34.1 1.1E+02 0.0025 27.4 6.3 59 4-63 177-235 (300)
224 PF01485 IBR: IBR domain; Int 34.1 27 0.00058 24.5 1.6 31 229-263 19-49 (64)
225 PF13428 TPR_14: Tetratricopep 34.0 93 0.002 20.1 4.2 29 283-316 8-36 (44)
226 smart00531 TFIIE Transcription 33.3 11 0.00025 32.1 -0.6 10 255-264 124-133 (147)
227 smart00834 CxxC_CXXC_SSSS Puta 32.6 31 0.00068 22.1 1.6 31 228-263 5-35 (41)
228 TIGR03504 FimV_Cterm FimV C-te 32.4 1.4E+02 0.003 20.1 4.8 37 283-325 6-42 (44)
229 KOG1274|consensus 32.3 32 0.0007 37.5 2.4 37 2-43 250-292 (933)
230 PF12688 TPR_5: Tetratrico pep 32.0 2.2E+02 0.0047 23.4 6.9 16 289-304 14-29 (120)
231 KOG0321|consensus 31.9 61 0.0013 34.2 4.3 60 3-62 164-245 (720)
232 KOG0295|consensus 31.5 84 0.0018 31.1 4.9 51 4-63 312-362 (406)
233 KOG2376|consensus 31.4 83 0.0018 33.1 5.1 67 277-343 176-249 (652)
234 KOG0316|consensus 31.3 82 0.0018 29.5 4.5 69 4-86 163-231 (307)
235 KOG1407|consensus 31.2 70 0.0015 30.3 4.2 55 4-64 40-94 (313)
236 PF12862 Apc5: Anaphase-promot 30.9 2.3E+02 0.005 21.7 6.6 51 284-334 6-57 (94)
237 PF14853 Fis1_TPR_C: Fis1 C-te 30.8 1.9E+02 0.0041 20.2 7.2 47 279-330 4-50 (53)
238 PF10058 DUF2296: Predicted in 30.7 29 0.00063 24.5 1.3 33 225-263 19-53 (54)
239 KOG0547|consensus 30.6 96 0.0021 32.1 5.3 59 273-339 112-170 (606)
240 TIGR01206 lysW lysine biosynth 30.4 35 0.00075 24.2 1.6 31 229-265 3-33 (54)
241 COG1594 RPB9 DNA-directed RNA 30.4 27 0.00058 28.6 1.2 32 230-267 4-35 (113)
242 KOG0284|consensus 30.4 37 0.0008 33.9 2.3 58 5-71 159-216 (464)
243 KOG0543|consensus 30.3 1.5E+02 0.0032 29.8 6.4 49 277-330 292-340 (397)
244 KOG1036|consensus 30.1 55 0.0012 31.5 3.3 45 2-55 250-294 (323)
245 PF14828 Amnionless: Amnionles 30.1 29 0.00063 35.2 1.6 31 227-267 207-239 (437)
246 PF02748 PyrI_C: Aspartate car 30.0 45 0.00098 23.4 2.1 17 253-269 34-50 (52)
247 PF13424 TPR_12: Tetratricopep 29.8 1.4E+02 0.003 21.4 5.0 26 283-308 53-78 (78)
248 COG2319 FOG: WD40 repeat [Gene 29.6 1.2E+02 0.0026 27.5 5.7 54 3-65 131-186 (466)
249 PF12895 Apc3: Anaphase-promot 29.5 1.2E+02 0.0027 22.3 4.7 33 300-334 42-74 (84)
250 KOG0292|consensus 29.2 44 0.00095 36.8 2.8 49 5-62 30-78 (1202)
251 PRK00432 30S ribosomal protein 29.2 33 0.00073 23.8 1.3 26 228-263 20-46 (50)
252 KOG0313|consensus 29.2 1.2E+02 0.0025 30.3 5.4 52 4-63 123-175 (423)
253 PF07282 OrfB_Zn_ribbon: Putat 28.9 58 0.0013 23.5 2.7 39 222-268 22-60 (69)
254 PRK14890 putative Zn-ribbon RN 28.9 36 0.00077 24.6 1.4 26 208-239 9-34 (59)
255 KOG0267|consensus 28.8 28 0.0006 37.2 1.2 50 4-62 90-139 (825)
256 KOG4283|consensus 28.6 96 0.0021 30.0 4.6 59 2-63 62-130 (397)
257 PRK11827 hypothetical protein; 28.6 31 0.00068 25.0 1.1 34 227-268 7-40 (60)
258 PF10083 DUF2321: Uncharacteri 28.3 3.5E+02 0.0076 23.5 7.6 24 254-279 68-91 (158)
259 smart00028 TPR Tetratricopepti 28.2 97 0.0021 16.5 3.2 24 282-305 7-30 (34)
260 KOG0307|consensus 28.1 50 0.0011 36.8 3.1 56 3-63 87-145 (1049)
261 smart00858 SAF This domain fam 28.0 41 0.00089 23.5 1.7 16 166-181 3-18 (64)
262 PRK14714 DNA polymerase II lar 27.2 43 0.00094 38.2 2.4 12 254-265 709-720 (1337)
263 PF12895 Apc3: Anaphase-promot 27.1 1.2E+02 0.0027 22.3 4.4 20 282-301 31-50 (84)
264 KOG1034|consensus 27.1 1.3E+02 0.0028 29.5 5.3 55 5-65 157-211 (385)
265 KOG1240|consensus 27.0 71 0.0015 36.4 3.9 59 3-64 1068-1127(1431)
266 TIGR00373 conserved hypothetic 26.9 32 0.00069 29.8 1.1 34 203-243 108-141 (158)
267 KOG1539|consensus 26.8 1.2E+02 0.0026 33.1 5.5 52 4-63 222-273 (910)
268 PF09862 DUF2089: Protein of u 26.5 47 0.001 27.2 2.0 16 292-307 63-78 (113)
269 KOG1036|consensus 26.4 1.2E+02 0.0027 29.2 4.9 49 2-62 112-160 (323)
270 KOG0264|consensus 26.2 1.2E+02 0.0026 30.5 5.0 55 3-65 292-347 (422)
271 KOG0265|consensus 26.1 72 0.0016 30.8 3.3 52 4-65 153-204 (338)
272 KOG3914|consensus 26.1 41 0.00088 33.4 1.8 20 3-22 212-231 (390)
273 smart00659 RPOLCX RNA polymera 26.0 46 0.001 22.4 1.5 9 255-263 20-28 (44)
274 COG1997 RPL43A Ribosomal prote 25.9 41 0.00089 26.3 1.4 11 254-264 53-63 (89)
275 KOG0644|consensus 25.6 37 0.0008 37.0 1.5 51 6-65 212-262 (1113)
276 TIGR02521 type_IV_pilW type IV 25.3 2.3E+02 0.005 23.8 6.4 16 289-304 78-93 (234)
277 TIGR02552 LcrH_SycD type III s 25.3 1.9E+02 0.0042 22.9 5.6 40 287-334 62-101 (135)
278 PF03604 DNA_RNApol_7kD: DNA d 25.0 64 0.0014 20.3 1.9 9 255-263 18-26 (32)
279 COG1198 PriA Primosomal protei 24.9 2.2E+02 0.0048 31.0 7.1 30 311-340 567-596 (730)
280 COG1675 TFA1 Transcription ini 24.7 62 0.0013 28.7 2.5 24 162-185 59-82 (176)
281 TIGR01384 TFS_arch transcripti 24.5 43 0.00094 26.5 1.4 26 230-265 2-27 (104)
282 TIGR00595 priA primosomal prot 24.0 1E+02 0.0022 31.9 4.3 18 317-334 351-368 (505)
283 KOG1523|consensus 23.9 77 0.0017 30.8 3.1 44 19-65 126-176 (361)
284 PF14938 SNAP: Soluble NSF att 23.8 3.7E+02 0.008 25.0 7.9 44 290-335 129-172 (282)
285 CHL00033 ycf3 photosystem I as 23.8 2E+02 0.0044 24.2 5.6 52 281-333 77-128 (168)
286 PRK00564 hypA hydrogenase nick 23.2 85 0.0018 25.7 2.9 41 173-239 57-97 (117)
287 PRK14892 putative transcriptio 23.1 37 0.0008 27.2 0.7 58 227-291 20-77 (99)
288 TIGR02300 FYDLN_acid conserved 23.1 54 0.0012 27.5 1.7 32 229-269 10-41 (129)
289 PRK04016 DNA-directed RNA poly 23.0 1.3E+02 0.0028 22.0 3.4 13 254-266 4-16 (62)
290 PF13913 zf-C2HC_2: zinc-finge 23.0 89 0.0019 18.2 2.2 18 255-272 3-20 (25)
291 TIGR02878 spore_ypjB sporulati 22.7 1.6E+02 0.0034 27.3 4.8 35 275-309 4-38 (233)
292 cd02679 MIT_spastin MIT: domai 22.3 1.8E+02 0.004 22.1 4.4 40 274-313 6-45 (79)
293 KOG2055|consensus 22.3 1.1E+02 0.0024 31.2 3.9 29 33-63 213-241 (514)
294 KOG0321|consensus 22.1 1.1E+02 0.0024 32.4 4.0 52 5-62 121-172 (720)
295 PF01927 Mut7-C: Mut7-C RNAse 22.1 50 0.0011 28.0 1.4 34 229-264 92-134 (147)
296 PF12894 Apc4_WD40: Anaphase-p 22.0 1.2E+02 0.0025 20.7 3.0 29 34-64 12-40 (47)
297 KOG1273|consensus 22.0 1.1E+02 0.0024 29.8 3.8 54 4-66 43-96 (405)
298 KOG4056|consensus 21.9 1.6E+02 0.0036 25.0 4.4 43 281-323 86-138 (143)
299 smart00647 IBR In Between Ring 21.9 64 0.0014 22.5 1.7 32 228-263 18-49 (64)
300 PF04652 DUF605: Vta1 like; I 21.8 1.9E+02 0.0041 28.3 5.7 37 270-306 343-379 (380)
301 KOG0301|consensus 21.3 99 0.0021 33.0 3.5 32 29-63 136-167 (745)
302 KOG4378|consensus 21.2 1.4E+02 0.003 30.9 4.4 54 2-62 183-236 (673)
303 PRK14873 primosome assembly pr 21.0 1.3E+02 0.0028 32.4 4.4 24 311-334 503-526 (665)
304 PLN03086 PRLI-interacting fact 20.9 1.3E+02 0.0029 31.6 4.4 57 206-269 407-468 (567)
305 KOG0310|consensus 20.8 1.7E+02 0.0037 29.9 5.0 52 3-62 130-181 (487)
306 TIGR00990 3a0801s09 mitochondr 20.7 2.8E+02 0.0062 28.9 7.0 51 275-334 126-176 (615)
307 PF02561 FliS: Flagellar prote 20.6 4.3E+02 0.0094 21.3 6.7 45 269-313 22-69 (122)
308 KOG0282|consensus 20.5 74 0.0016 32.4 2.4 37 29-66 210-246 (503)
309 PRK00415 rps27e 30S ribosomal 20.4 70 0.0015 23.1 1.6 33 224-264 7-40 (59)
310 COG2178 Predicted RNA-binding 20.3 5.8E+02 0.013 23.1 7.8 47 266-312 19-68 (204)
311 COG1675 TFA1 Transcription ini 20.3 1.4E+02 0.0029 26.6 3.8 39 196-241 105-143 (176)
312 PF08631 SPO22: Meiosis protei 20.2 4E+02 0.0087 24.8 7.3 57 277-333 36-99 (278)
313 PF10122 Mu-like_Com: Mu-like 20.1 43 0.00093 23.4 0.5 31 227-263 3-33 (51)
No 1
>KOG2084|consensus
Probab=99.65 E-value=3.5e-15 Score=149.75 Aligned_cols=180 Identities=21% Similarity=0.351 Sum_probs=135.6
Q ss_pred eeeeeeccccccCCCCCCCCeEEEEeCCEEEEEEeeccCCCC-ceeecccccccCCCHHHHHHHHHhhCCCceecCCCCC
Q psy5736 136 YVGVGIYPTVSLFNHDCYPAVTRYFNGKNIIVKALRPLKPKE-VVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVE 214 (346)
Q Consensus 136 ~~g~glyp~~s~~NHSC~PN~~~~f~g~~~~v~a~r~I~~Ge-el~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~ 214 (346)
.++.|+||..+++||||.||+...|++....+++...+.+++ +|+++|.+ ..+++..||+.|+..|.|.|.|++|.+
T Consensus 197 ~~~~~l~~~~~~~~hsC~pn~~~~~~~~~~~~~~~~~~~~~~~~l~~~y~~--~~~~~~~r~~~l~~~~~f~c~c~rc~d 274 (482)
T KOG2084|consen 197 FLGRGLFPGSSLFNHSCFPNISVIFDGRGLALLVPAGIDAGEEELTISYTD--PLLSTASRQKQLRQSKLFSCQCPRCLD 274 (482)
T ss_pred cceeeecccchhcccCCCCCeEEEECCceeEEEeecccCCCCCEEEEeecc--cccCHHHHHHHHhhccceeeecCCCCC
Confidence 589999999999999999999999999999999999999987 99999998 689999999999999999999999998
Q ss_pred CCccccccccCccccccCCcCcCCcccccccCCcccccCcccccccCCcCChHHHHHHHHHHHHHHHHHHHHhccccHHH
Q psy5736 215 NWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSCFRE 294 (346)
Q Consensus 215 ~~~~~~~~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~~~~l~~~~~~~~~a~~~~~~~~~~~ 294 (346)
++ +.+.+..+++|.+..|.+.+.+.....+ ...|.|..|........+.......... ..... .
T Consensus 275 ~~----~~~~~~~~~~c~~~~~~~~~~~~~~~~~---~~~w~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~---~ 338 (482)
T KOG2084|consen 275 PT----ELGTFLSSLRCENCTCGGLLGTSFLDKE---DLQWPCTECALVRLKAYVVESREELQNE------LLDAF---S 338 (482)
T ss_pred CC----ccccchhhhhhcCCCCCCccCCCccccc---CCCccccccccchhHHHHHHHHHHHHhh------ccccC---C
Confidence 76 5566678899998888877665543221 2359999999987766555443332211 01111 1
Q ss_pred HHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 295 AVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 295 ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
.....+.+.......+++.|.........+...+....+
T Consensus 339 ~~~~~e~l~~~~~~~~~~~~~~~~~~~~~~~~il~~~~~ 377 (482)
T KOG2084|consen 339 DLLIEELLLLRQESLELPNDFEVLLLKLHLLFILGSLLG 377 (482)
T ss_pred hhhhHHHHHHHHHhhhCcchHHHHHHHHHHHHHHHHHHh
Confidence 122344555666667888888777777766666665544
No 2
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.39 E-value=3.1e-13 Score=114.54 Aligned_cols=50 Identities=32% Similarity=0.672 Sum_probs=45.5
Q ss_pred ceeeeeeccccccCCCCCCCCeEEEEe----CCEEEEEEeeccCCCCceeeccc
Q psy5736 135 NYVGVGIYPTVSLFNHDCYPAVTRYFN----GKNIIVKALRPLKPKEVVAENYG 184 (346)
Q Consensus 135 ~~~g~glyp~~s~~NHSC~PN~~~~f~----g~~~~v~a~r~I~~Geel~~~Y~ 184 (346)
...+.++||.++++||||.|||.+.|. ++.++|+|+|||++||||+++||
T Consensus 109 ~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~isYG 162 (162)
T PF00856_consen 109 DRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFISYG 162 (162)
T ss_dssp EEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEEST
T ss_pred cccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEEEC
Confidence 457899999999999999999999998 79999999999999999999997
No 3
>KOG1517|consensus
Probab=99.20 E-value=1.2e-11 Score=129.67 Aligned_cols=77 Identities=31% Similarity=0.521 Sum_probs=72.0
Q ss_pred ceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc--ccchhhhhhhhhcccc
Q psy5736 6 LKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS--HEHRLVTSFQALTDTS 83 (346)
Q Consensus 6 ~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s--h~~~Lv~a~r~L~~~~ 83 (346)
.+|.||||+.++.+++|.|+.+ ..++||+|++||++|++|+|+|++||.||||++|.. ..++||++|..|+++.
T Consensus 1086 ~~i~vwd~e~~~~l~~F~n~~~----~~t~Vs~l~liNe~D~aLlLtas~dGvIRIwk~y~~~~~~~eLVTaw~~Ls~~~ 1161 (1387)
T KOG1517|consen 1086 ERIRVWDWEKGRLLNGFDNGAF----PDTRVSDLELINEQDDALLLTASSDGVIRIWKDYADKWKKPELVTAWSSLSDQL 1161 (1387)
T ss_pred ceEEEEecccCceeccccCCCC----CCCccceeeeecccchhheeeeccCceEEEecccccccCCceeEEeeccccccC
Confidence 6899999999999999999997 789999999999999999999999999999999987 4689999999998887
Q ss_pred ccc
Q psy5736 84 FVN 86 (346)
Q Consensus 84 y~~ 86 (346)
+-.
T Consensus 1162 ~~~ 1164 (1387)
T KOG1517|consen 1162 PGA 1164 (1387)
T ss_pred ccC
Confidence 765
No 4
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.10 E-value=6e-11 Score=95.90 Aligned_cols=46 Identities=28% Similarity=0.445 Sum_probs=41.0
Q ss_pred eeeeccccccCCCCCCCCeEEEEeC--C--EEEEEEeeccCCCCceeecc
Q psy5736 138 GVGIYPTVSLFNHDCYPAVTRYFNG--K--NIIVKALRPLKPKEVVAENY 183 (346)
Q Consensus 138 g~glyp~~s~~NHSC~PN~~~~f~g--~--~~~v~a~r~I~~Geel~~~Y 183 (346)
+..++|.++++||||.||+...+.. + .+.++|+|+|++||||+++|
T Consensus 67 ~~~~~~~~~~iNHsc~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 67 ARRKGNIARFINHSCEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred CCccCcHHHeeCCCCCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence 3448999999999999999988753 2 69999999999999999998
No 5
>KOG2589|consensus
Probab=98.63 E-value=1.6e-08 Score=96.39 Aligned_cols=60 Identities=32% Similarity=0.560 Sum_probs=52.0
Q ss_pred ccccCCCCCCCCeEEEEeC-CEEEEEEeeccCCCCceeecccccccCCCHHHHHHHHHhhCCCceecCCCCCC
Q psy5736 144 TVSLFNHDCYPAVTRYFNG-KNIIVKALRPLKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVEN 215 (346)
Q Consensus 144 ~~s~~NHSC~PN~~~~f~g-~~~~v~a~r~I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~ 215 (346)
-++++||.|.|||.++-.| +++.|+++|||++|||||.=|+..|... -.-.|.|..|...
T Consensus 196 PaafINHDCrpnCkFvs~g~~tacvkvlRDIePGeEITcFYgs~fFG~------------~N~~CeC~TCER~ 256 (453)
T KOG2589|consen 196 PAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEITCFYGSGFFGE------------NNEECECVTCERR 256 (453)
T ss_pred cHHhhcCCCCCCceeecCCCceeeeehhhcCCCCceeEEeecccccCC------------CCceeEEeecccc
Confidence 4779999999999999887 8999999999999999999999976443 2358999999864
No 6
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=97.98 E-value=4.2e-06 Score=85.29 Aligned_cols=71 Identities=23% Similarity=0.358 Sum_probs=57.3
Q ss_pred cccccCCCCCCCCeEEEEeC----CEEEEEEeeccCCCCceeecccccccCCCHHHHHHHHHhhCCCceecCCCCCC
Q psy5736 143 PTVSLFNHDCYPAVTRYFNG----KNIIVKALRPLKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVEN 215 (346)
Q Consensus 143 p~~s~~NHSC~PN~~~~f~g----~~~~v~a~r~I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~ 215 (346)
..+.++||||.||+.....+ ..+.++|+|||.+||||+..|+.. .....++...+...+...|.|.+|...
T Consensus 404 ~~~r~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (480)
T COG2940 404 DVARFINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPS--LEDNRELKKLLEKRWGCACGEDRCSHT 478 (480)
T ss_pred cccceeecCCCCCcceecccccccceeeecccccchhhhhhccccccc--cccchhhhhhhhhhhccccCCCccCCC
Confidence 34669999999999999764 389999999999999999999873 333333466777789999999999764
No 7
>KOG4442|consensus
Probab=97.91 E-value=7e-06 Score=84.34 Aligned_cols=41 Identities=22% Similarity=0.388 Sum_probs=34.0
Q ss_pred cccCCCCCCCCeEEE---EeC-CEEEEEEeeccCCCCceeecccc
Q psy5736 145 VSLFNHDCYPAVTRY---FNG-KNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 145 ~s~~NHSC~PN~~~~---f~g-~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
+-++||||+|||.+- ..| .+|-+.|.|+|++|||||+.|.-
T Consensus 194 aRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf 238 (729)
T KOG4442|consen 194 ARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQF 238 (729)
T ss_pred HHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEeccc
Confidence 447899999999854 223 57888999999999999999964
No 8
>KOG1080|consensus
Probab=97.83 E-value=1.1e-05 Score=87.36 Aligned_cols=41 Identities=24% Similarity=0.455 Sum_probs=36.1
Q ss_pred ccccCCCCCCCCeEEEEe---C-CEEEEEEeeccCCCCceeeccc
Q psy5736 144 TVSLFNHDCYPAVTRYFN---G-KNIIVKALRPLKPKEVVAENYG 184 (346)
Q Consensus 144 ~~s~~NHSC~PN~~~~f~---g-~~~~v~a~r~I~~Geel~~~Y~ 184 (346)
.|.+|||||.|||..-.. | .+++|.|.|+|++|||||..|-
T Consensus 939 iAr~InHsC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYk 983 (1005)
T KOG1080|consen 939 IARFINHSCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYK 983 (1005)
T ss_pred hhheeecccCCCceeeEEEecCeeEEEEEEecccccCceeeeecc
Confidence 378999999999996653 3 6899999999999999999994
No 9
>KOG1082|consensus
Probab=97.16 E-value=0.00026 Score=69.77 Aligned_cols=42 Identities=29% Similarity=0.460 Sum_probs=35.9
Q ss_pred ccCCCCCCCCeEEEEe--C------CEEEEEEeeccCCCCceeecccccc
Q psy5736 146 SLFNHDCYPAVTRYFN--G------KNIIVKALRPLKPKEVVAENYGLVF 187 (346)
Q Consensus 146 s~~NHSC~PN~~~~f~--g------~~~~v~a~r~I~~Geel~~~Y~~~~ 187 (346)
-++||||.||+.+... + -.+.+.|+++|++|+|||..|+..+
T Consensus 274 RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg~~~ 323 (364)
T KOG1082|consen 274 RFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYGKAY 323 (364)
T ss_pred ccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhcccc
Confidence 3789999999998753 3 3689999999999999999999743
No 10
>KOG1141|consensus
Probab=96.42 E-value=0.0012 Score=69.32 Aligned_cols=41 Identities=17% Similarity=0.265 Sum_probs=33.1
Q ss_pred cccCCCCCCCCeEEE--EeCC------EEEEEEeeccCCCCceeecccc
Q psy5736 145 VSLFNHDCYPAVTRY--FNGK------NIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 145 ~s~~NHSC~PN~~~~--f~g~------~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
+-++||||+||+.+. |.+. -+.+.|.+-|++|+|||..|..
T Consensus 1190 GRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Y 1238 (1262)
T KOG1141|consen 1190 GRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQY 1238 (1262)
T ss_pred hhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccc
Confidence 347899999999865 5442 3567888999999999999975
No 11
>KOG1083|consensus
Probab=96.35 E-value=0.0023 Score=68.78 Aligned_cols=41 Identities=29% Similarity=0.473 Sum_probs=34.4
Q ss_pred cccCCCCCCCCeEEE---EeC-CEEEEEEeeccCCCCceeecccc
Q psy5736 145 VSLFNHDCYPAVTRY---FNG-KNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 145 ~s~~NHSC~PN~~~~---f~g-~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
+-.+||||.|||... .+| .++.|.|.|||++||||+..|--
T Consensus 1251 ~RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~ 1295 (1306)
T KOG1083|consen 1251 ARFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNF 1295 (1306)
T ss_pred ccccccccCCCCccccccccceeeeeeeecCCCCCCceEEEeccc
Confidence 347899999999854 345 68999999999999999999853
No 12
>KOG1337|consensus
Probab=95.87 E-value=0.0067 Score=61.84 Aligned_cols=61 Identities=25% Similarity=0.273 Sum_probs=48.5
Q ss_pred eeeeccccccCCCCCCCCeEEEEe-CCEEEEEEeeccCCCCceeecccccccCCCHHHHHHHHHhhCCCc
Q psy5736 138 GVGIYPTVSLFNHDCYPAVTRYFN-GKNIIVKALRPLKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFE 206 (346)
Q Consensus 138 g~glyp~~s~~NHSC~PN~~~~f~-g~~~~v~a~r~I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~ 206 (346)
-.++.|..-+.||+|.+....+.. +..+.+.+.++|.+||||+++||+. + ...|...|||.
T Consensus 230 ~~~L~P~~D~~NH~~~~~~~~~~~~d~~~~l~~~~~v~~geevfi~YG~~----~----N~eLL~~YGFv 291 (472)
T KOG1337|consen 230 NEALAPLIDLLNHSPEVIKAGYNQEDEAVELVAERDVSAGEEVFINYGPK----S----NAELLLHYGFV 291 (472)
T ss_pred chhhhhhHHhhccCchhccccccCCCCcEEEEEeeeecCCCeEEEecCCC----c----hHHHHHhcCCC
Confidence 367899999999999994443332 4589999999999999999999972 2 23466689995
No 13
>KOG1085|consensus
Probab=95.66 E-value=0.0074 Score=56.58 Aligned_cols=41 Identities=24% Similarity=0.340 Sum_probs=35.6
Q ss_pred cccCCCCCCCCeEEEE---eC-CEEEEEEeeccCCCCceeecccc
Q psy5736 145 VSLFNHDCYPAVTRYF---NG-KNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 145 ~s~~NHSC~PN~~~~f---~g-~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
.-|||||=.+|+.... +| -.+++.|.|+|.+||||+..||+
T Consensus 334 GRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGD 378 (392)
T KOG1085|consen 334 GRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGD 378 (392)
T ss_pred hhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccc
Confidence 4589999999998553 34 58999999999999999999998
No 14
>KOG0294|consensus
Probab=94.97 E-value=0.036 Score=52.83 Aligned_cols=65 Identities=15% Similarity=0.176 Sum_probs=51.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc--ccchhhh
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS--HEHRLVT 74 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s--h~~~Lv~ 74 (346)
|-+++|++||..+...++.|. -|..||+++.++-..+.-++.++++||.|++|+.=+. ..+++++
T Consensus 226 ~d~~~i~~~D~ds~~~~~~~~-------AH~~RVK~i~~~~~~~~~~lvTaSSDG~I~vWd~~~~~k~~~~~l~ 292 (362)
T KOG0294|consen 226 GDNEWISLKDTDSDTPLTEFL-------AHENRVKDIASYTNPEHEYLVTASSDGFIKVWDIDMETKKRPTLLA 292 (362)
T ss_pred cCCceEEEeccCCCccceeee-------cchhheeeeEEEecCCceEEEEeccCceEEEEEccccccCCcceeE
Confidence 346899999999988888773 2789999999877777899999999999999965443 2344444
No 15
>KOG0282|consensus
Probab=94.22 E-value=0.071 Score=53.27 Aligned_cols=53 Identities=19% Similarity=0.340 Sum_probs=44.2
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+|.++.|.+||.|+|+-+.+|..+. ..+++| +++++..+.|+|.+||.|+.|.
T Consensus 276 ~sfD~~lKlwDtETG~~~~~f~~~~--------~~~cvk-f~pd~~n~fl~G~sd~ki~~wD 328 (503)
T KOG0282|consen 276 ASFDRFLKLWDTETGQVLSRFHLDK--------VPTCVK-FHPDNQNIFLVGGSDKKIRQWD 328 (503)
T ss_pred eecceeeeeeccccceEEEEEecCC--------Cceeee-cCCCCCcEEEEecCCCcEEEEe
Confidence 4788999999999999999997665 356666 4556678999999999999994
No 16
>KOG0296|consensus
Probab=94.10 E-value=0.048 Score=52.81 Aligned_cols=51 Identities=20% Similarity=0.211 Sum_probs=39.7
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.+++|+|.|||..+.+. ++.++++ .-|+.|+++| ..+++++..+|.||+|.
T Consensus 304 G~vdG~i~iyD~a~~~~--R~~c~he------~~V~~l~w~~---t~~l~t~c~~g~v~~wD 354 (399)
T KOG0296|consen 304 GSVDGTIAIYDLAASTL--RHICEHE------DGVTKLKWLN---TDYLLTACANGKVRQWD 354 (399)
T ss_pred ccccceEEEEecccchh--heeccCC------CceEEEEEcC---cchheeeccCceEEeee
Confidence 36899999999987744 3344443 3499999999 55668899999999993
No 17
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=94.07 E-value=0.19 Score=37.31 Aligned_cols=51 Identities=16% Similarity=0.114 Sum_probs=43.7
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
|.-....|++++|+..+++.++. .+.+.+.|+.+..+...|+.+|...|+.
T Consensus 12 a~~~~~~~~~~~A~~~~~~al~~-~~~~~~~~~~~a~~~~~lg~~~~~~g~~ 62 (78)
T PF13424_consen 12 ARVYRELGRYDEALDYYEKALDI-EEQLGDDHPDTANTLNNLGECYYRLGDY 62 (78)
T ss_dssp HHHHHHTT-HHHHHHHHHHHHHH-HHHTTTHHHHHHHHHHHHHHHHHHTTHH
T ss_pred HHHHHHcCCHHHHHHHHHHHHHH-HHHHCCCCHHHHHHHHHHHHHHHHcCCH
Confidence 44445679999999999999999 7777889999999999999999999875
No 18
>KOG1079|consensus
Probab=93.27 E-value=0.074 Score=55.26 Aligned_cols=42 Identities=21% Similarity=0.263 Sum_probs=35.1
Q ss_pred ccccCCCCCCCCeEEEE--e-C-CEEEEEEeeccCCCCceeecccc
Q psy5736 144 TVSLFNHDCYPAVTRYF--N-G-KNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 144 ~~s~~NHSC~PN~~~~f--~-g-~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
.+-+.|||=.|||.... . | .++-++|.|.|.+|||||+.|..
T Consensus 665 k~rFANHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrY 710 (739)
T KOG1079|consen 665 KIRFANHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRY 710 (739)
T ss_pred hhhhccCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeecc
Confidence 34578999999998543 2 4 68889999999999999999975
No 19
>PLN00181 protein SPA1-RELATED; Provisional
Probab=92.78 E-value=0.19 Score=54.41 Aligned_cols=54 Identities=20% Similarity=0.343 Sum_probs=43.8
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
+.+|+|.|||.++++.+..+. .|...|+++.+. ..++.++++|+.||.|++|..
T Consensus 552 ~~Dg~v~lWd~~~~~~~~~~~-------~H~~~V~~l~~~-p~~~~~L~Sgs~Dg~v~iWd~ 605 (793)
T PLN00181 552 NFEGVVQVWDVARSQLVTEMK-------EHEKRVWSIDYS-SADPTLLASGSDDGSVKLWSI 605 (793)
T ss_pred eCCCeEEEEECCCCeEEEEec-------CCCCCEEEEEEc-CCCCCEEEEEcCCCEEEEEEC
Confidence 568999999999988777663 266789999964 345678899999999999954
No 20
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=92.35 E-value=0.32 Score=31.19 Aligned_cols=34 Identities=18% Similarity=0.095 Sum_probs=19.9
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHh
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKL 316 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~ 316 (346)
|.-....|++++|++++++.++.+.+++.+.||.
T Consensus 9 a~~~~~~g~~~~A~~~~~~al~~~~~~~G~~Hpd 42 (42)
T PF13374_consen 9 ANAYRAQGRYEEALELLEEALEIRERLLGPDHPD 42 (42)
T ss_dssp HHHHHHCT-HHHHHHHHHHHHHHH----------
T ss_pred HHHHHhhhhcchhhHHHHHHHHHHHHHhcccccC
Confidence 4445677999999999999999999999999984
No 21
>KOG0302|consensus
Probab=91.34 E-value=0.3 Score=47.78 Aligned_cols=57 Identities=26% Similarity=0.473 Sum_probs=40.3
Q ss_pred cccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCCcc
Q psy5736 3 GLFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYAS 67 (346)
Q Consensus 3 ~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~s 67 (346)
+.+|+|.|||..++ ++-+.+. +. |.+.|.-|-+ | ...++++.|.+||.++|| |.+.+
T Consensus 277 S~DgsIrIWDiRs~~~~~~~~~--kA----h~sDVNVISW-n-r~~~lLasG~DdGt~~iwDLR~~~~ 336 (440)
T KOG0302|consen 277 SCDGSIRIWDIRSGPKKAAVST--KA----HNSDVNVISW-N-RREPLLASGGDDGTLSIWDLRQFKS 336 (440)
T ss_pred ecCceEEEEEecCCCccceeEe--ec----cCCceeeEEc-c-CCcceeeecCCCceEEEEEhhhccC
Confidence 46899999999998 3444333 32 6666666554 3 234489999999999999 77754
No 22
>KOG1188|consensus
Probab=91.32 E-value=0.3 Score=47.14 Aligned_cols=51 Identities=14% Similarity=0.236 Sum_probs=42.1
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
-|.|+++||.+++.|--|.- +..-+.-++|++++.-..+.++++||.||+|
T Consensus 49 ngsv~lyd~~tg~~l~~fk~-------~~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~w 99 (376)
T KOG1188|consen 49 NGSVRLYDKGTGQLLEEFKG-------PPATTNGVRFISCDSPHGVISCSSDGTVRLW 99 (376)
T ss_pred CCeEEEEeccchhhhheecC-------CCCcccceEEecCCCCCeeEEeccCCeEEEE
Confidence 48999999999988887752 3455779999999665666889999999999
No 23
>KOG0646|consensus
Probab=90.84 E-value=0.19 Score=50.10 Aligned_cols=51 Identities=24% Similarity=0.343 Sum_probs=45.1
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+..|+|.+|-..+|+.|+.++ .|.-.||.|+|.+ |++++.+|+.||.|.+|
T Consensus 100 ~i~g~lYlWelssG~LL~v~~-------aHYQ~ITcL~fs~--dgs~iiTgskDg~V~vW 150 (476)
T KOG0646|consen 100 TISGNLYLWELSSGILLNVLS-------AHYQSITCLKFSD--DGSHIITGSKDGAVLVW 150 (476)
T ss_pred cccCcEEEEEeccccHHHHHH-------hhccceeEEEEeC--CCcEEEecCCCccEEEE
Confidence 378999999999999998762 3788999999875 88999999999999999
No 24
>KOG0283|consensus
Probab=90.60 E-value=0.31 Score=51.55 Aligned_cols=54 Identities=17% Similarity=0.274 Sum_probs=47.6
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
..|.||.+|+-++..-|+.|. |..-+|+++ +|+-||.+.+.|+=||.+|+|.-.
T Consensus 387 SMDKTVRLWh~~~~~CL~~F~--------HndfVTcVa-FnPvDDryFiSGSLD~KvRiWsI~ 440 (712)
T KOG0283|consen 387 SMDKTVRLWHPGRKECLKVFS--------HNDFVTCVA-FNPVDDRYFISGSLDGKVRLWSIS 440 (712)
T ss_pred cccccEEeecCCCcceeeEEe--------cCCeeEEEE-ecccCCCcEeecccccceEEeecC
Confidence 468899999999999999995 556799999 588999999999999999999544
No 25
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=90.27 E-value=0.4 Score=30.41 Aligned_cols=31 Identities=19% Similarity=0.445 Sum_probs=25.5
Q ss_pred CCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 31 RNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 31 ~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.|...|++|.+... ...+++|+.||.|++|.
T Consensus 9 ~h~~~i~~i~~~~~--~~~~~s~~~D~~i~vwd 39 (39)
T PF00400_consen 9 GHSSSINSIAWSPD--GNFLASGSSDGTIRVWD 39 (39)
T ss_dssp SSSSSEEEEEEETT--SSEEEEEETTSEEEEEE
T ss_pred CCCCcEEEEEEecc--cccceeeCCCCEEEEEC
Confidence 36778999996543 78889999999999984
No 26
>PTZ00421 coronin; Provisional
Probab=90.25 E-value=0.47 Score=48.77 Aligned_cols=60 Identities=7% Similarity=-0.068 Sum_probs=39.3
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+.+|+|.|||..++.........-.....|...|.++.+. +....++++|+.||.|++|.
T Consensus 95 S~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~-P~~~~iLaSgs~DgtVrIWD 154 (493)
T PTZ00421 95 SEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFH-PSAMNVLASAGADMVVNVWD 154 (493)
T ss_pred eCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeC-cCCCCEEEEEeCCCEEEEEE
Confidence 5689999999976532110000000122367789999874 44456889999999999994
No 27
>PTZ00421 coronin; Provisional
Probab=90.20 E-value=0.54 Score=48.32 Aligned_cols=52 Identities=27% Similarity=0.317 Sum_probs=41.8
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
|.+|+|.|||.++++.+..+.. |...|.++.+- .+..++++|+.||.|++|.
T Consensus 145 s~DgtVrIWDl~tg~~~~~l~~-------h~~~V~sla~s--pdG~lLatgs~Dg~IrIwD 196 (493)
T PTZ00421 145 GADMVVNVWDVERGKAVEVIKC-------HSDQITSLEWN--LDGSLLCTTSKDKKLNIID 196 (493)
T ss_pred eCCCEEEEEECCCCeEEEEEcC-------CCCceEEEEEE--CCCCEEEEecCCCEEEEEE
Confidence 5689999999999987776642 44568888863 3567889999999999995
No 28
>KOG0264|consensus
Probab=89.82 E-value=0.51 Score=46.79 Aligned_cols=57 Identities=25% Similarity=0.320 Sum_probs=42.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~ 65 (346)
|-++.+.|||-.++ ...--.. .. .|..++.++. +|++++.++++|++||.|.+| |++
T Consensus 247 ~dd~~L~iwD~R~~-~~~~~~~-~~---ah~~~vn~~~-fnp~~~~ilAT~S~D~tV~LwDlRnL 305 (422)
T KOG0264|consen 247 GDDGKLMIWDTRSN-TSKPSHS-VK---AHSAEVNCVA-FNPFNEFILATGSADKTVALWDLRNL 305 (422)
T ss_pred cCCCeEEEEEcCCC-CCCCccc-cc---ccCCceeEEE-eCCCCCceEEeccCCCcEEEeechhc
Confidence 45678899999864 1111111 12 3678899988 799999999999999999999 665
No 29
>KOG0291|consensus
Probab=89.66 E-value=0.43 Score=50.39 Aligned_cols=51 Identities=27% Similarity=0.476 Sum_probs=39.4
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
-|++-||+|.+..-+ .-|.. |..++++|. =.+|..+..+|.+||.|++|..
T Consensus 329 lgQLlVweWqsEsYV---lKQQg----H~~~i~~l~--YSpDgq~iaTG~eDgKVKvWn~ 379 (893)
T KOG0291|consen 329 LGQLLVWEWQSESYV---LKQQG----HSDRITSLA--YSPDGQLIATGAEDGKVKVWNT 379 (893)
T ss_pred cceEEEEEeecccee---eeccc----cccceeeEE--ECCCCcEEEeccCCCcEEEEec
Confidence 378999999876322 22333 888999988 3478889999999999999953
No 30
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=88.15 E-value=1.6 Score=37.38 Aligned_cols=52 Identities=19% Similarity=0.332 Sum_probs=36.0
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
.++|.+|.+++++|++++|++.++.+...| |..+.-.+++..|+.+|-.-|+
T Consensus 11 ~~ly~~a~~~l~~~~Y~~A~~~le~L~~ry-----P~g~ya~qAqL~l~yayy~~~~ 62 (142)
T PF13512_consen 11 QELYQEAQEALQKGNYEEAIKQLEALDTRY-----PFGEYAEQAQLDLAYAYYKQGD 62 (142)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHHHhcC-----CCCcccHHHHHHHHHHHHHccC
Confidence 457778888888888888888888887766 5555555555555555555443
No 31
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=87.90 E-value=2 Score=38.55 Aligned_cols=54 Identities=19% Similarity=0.341 Sum_probs=42.1
Q ss_pred HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
...+|..|...++.|++.+|++.|++++..+ |.++....+...|+.+|-..|+.
T Consensus 5 ~~~lY~~a~~~~~~g~y~~Ai~~f~~l~~~~-----P~s~~a~~A~l~la~a~y~~~~y 58 (203)
T PF13525_consen 5 AEALYQKALEALQQGDYEEAIKLFEKLIDRY-----PNSPYAPQAQLMLAYAYYKQGDY 58 (203)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHH------TTSTTHHHHHHHHHHHHHHTT-H
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC-----CCChHHHHHHHHHHHHHHHcCCH
Confidence 4567888888899999999999999998886 77788888888888888777653
No 32
>PTZ00420 coronin; Provisional
Probab=87.77 E-value=1.2 Score=46.59 Aligned_cols=55 Identities=16% Similarity=0.236 Sum_probs=43.0
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
|.+|.|.+|+-.+...+..+. .|+..|.++.+ +++++.++++|+.||.|++|..-
T Consensus 51 G~~gvI~L~~~~r~~~v~~L~-------gH~~~V~~laf-sP~~~~lLASgS~DgtIrIWDi~ 105 (568)
T PTZ00420 51 GLIGAIRLENQMRKPPVIKLK-------GHTSSILDLQF-NPCFSEILASGSEDLTIRVWEIP 105 (568)
T ss_pred CceeEEEeeecCCCceEEEEc-------CCCCCEEEEEE-cCCCCCEEEEEeCCCeEEEEECC
Confidence 678999999987766555553 26778999986 55567788999999999999643
No 33
>KOG0294|consensus
Probab=87.74 E-value=0.67 Score=44.42 Aligned_cols=55 Identities=25% Similarity=0.266 Sum_probs=44.8
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+|-+.+|.|||..+...+-... .|...||+|+|.-.-..+=|+.|++||.|-+|+
T Consensus 59 GssDetI~IYDm~k~~qlg~ll-------~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~ 113 (362)
T KOG0294|consen 59 GSSDETIHIYDMRKRKQLGILL-------SHAGSITALKFYPPLSKSHLLSGSDDGHIIIWR 113 (362)
T ss_pred cCCCCcEEEEeccchhhhccee-------ccccceEEEEecCCcchhheeeecCCCcEEEEE
Confidence 4678999999998876665443 256789999998887777889999999999984
No 34
>KOG0319|consensus
Probab=87.28 E-value=0.8 Score=48.20 Aligned_cols=55 Identities=22% Similarity=0.350 Sum_probs=45.6
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.|.+|.|.|||+..+.-.-+|. .|+.-+.++.|..-..--++..|++||.+++|.
T Consensus 123 ggaD~~v~VWdi~~~~~th~fk-------G~gGvVssl~F~~~~~~~lL~sg~~D~~v~vwn 177 (775)
T KOG0319|consen 123 GGADGRVKVWDIKNGYCTHSFK-------GHGGVVSSLLFHPHWNRWLLASGATDGTVRVWN 177 (775)
T ss_pred ccccceEEEEEeeCCEEEEEec-------CCCceEEEEEeCCccchhheeecCCCceEEEEE
Confidence 4789999999999999998884 256678888876555567888999999999994
No 35
>KOG1188|consensus
Probab=87.18 E-value=0.69 Score=44.71 Aligned_cols=55 Identities=24% Similarity=0.334 Sum_probs=39.8
Q ss_pred cccceEEEEecCCCcee-eeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRL-ISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~-~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
+-.-.|-+|||.+.+++ -.+. .+ |.-.||.|+|+-. +..++|+||.||.+.+|..
T Consensus 140 ~s~A~v~lwDvR~~qq~l~~~~---eS---H~DDVT~lrFHP~-~pnlLlSGSvDGLvnlfD~ 195 (376)
T KOG1188|consen 140 RSDASVVLWDVRSEQQLLRQLN---ES---HNDDVTQLRFHPS-DPNLLLSGSVDGLVNLFDT 195 (376)
T ss_pred cCceEEEEEEeccccchhhhhh---hh---ccCcceeEEecCC-CCCeEEeecccceEEeeec
Confidence 33456899999888773 3221 21 6678999998764 4556689999999999943
No 36
>KOG0302|consensus
Probab=87.15 E-value=0.85 Score=44.69 Aligned_cols=55 Identities=20% Similarity=0.353 Sum_probs=42.9
Q ss_pred cccceEEEEecCC---CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHIT---GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~---~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
|=+|+++|||... ++.+-. ++-|+..||++.+ ++++++++++++.|.+|.+|..-
T Consensus 321 ~DdGt~~iwDLR~~~~~~pVA~-------fk~Hk~pItsieW-~p~e~s~iaasg~D~QitiWDls 378 (440)
T KOG0302|consen 321 GDDGTLSIWDLRQFKSGQPVAT-------FKYHKAPITSIEW-HPHEDSVIAASGEDNQITIWDLS 378 (440)
T ss_pred CCCceEEEEEhhhccCCCccee-------EEeccCCeeEEEe-ccccCceEEeccCCCcEEEEEee
Confidence 4579999999742 333333 4468899999996 56889999999999999999654
No 37
>KOG0281|consensus
Probab=86.76 E-value=0.41 Score=46.46 Aligned_cols=61 Identities=21% Similarity=0.323 Sum_probs=42.5
Q ss_pred ccceEEEEecCCCceee------------eeCCCCCC----------CCCCCC-cccceec-------ccc--CCCceEE
Q psy5736 4 LFLKEGIWDHITGSRLI------------SWRPDRGS----------WARNSA-RCTALTL-------LNE--HDDSLLL 51 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~------------~~~~~~~~----------w~~~~~-~~t~l~~-------ine--~~~~lll 51 (346)
-+.+|.||||++|+.++ .|+|+-.- |++..+ .||.+.+ +|. -++.+.+
T Consensus 255 SDsTvrvWDv~tge~l~tlihHceaVLhlrf~ng~mvtcSkDrsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd~kyIV 334 (499)
T KOG0281|consen 255 SDSTVRVWDVNTGEPLNTLIHHCEAVLHLRFSNGYMVTCSKDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDDKYIV 334 (499)
T ss_pred CCceEEEEeccCCchhhHHhhhcceeEEEEEeCCEEEEecCCceeEEEeccCchHHHHHHHHhhhhhheeeeccccceEE
Confidence 36799999999999887 45554321 766553 4555443 344 4577888
Q ss_pred eeecCCceEEEeC
Q psy5736 52 TGYDDGSLSVYRN 64 (346)
Q Consensus 52 ~g~~dG~vri~r~ 64 (346)
+++.|-.+++|..
T Consensus 335 sASgDRTikvW~~ 347 (499)
T KOG0281|consen 335 SASGDRTIKVWST 347 (499)
T ss_pred EecCCceEEEEec
Confidence 8999999999843
No 38
>KOG0269|consensus
Probab=86.37 E-value=0.57 Score=49.53 Aligned_cols=52 Identities=15% Similarity=0.341 Sum_probs=39.5
Q ss_pred ccceEEEEecCC---CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHIT---GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~---~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
--|.|+|||... .+.+..| ..|.+-++.+.|.- ..-.++++|++||.|+.|.
T Consensus 108 ~nG~i~vWdlnk~~rnk~l~~f-------~EH~Rs~~~ldfh~-tep~iliSGSQDg~vK~~D 162 (839)
T KOG0269|consen 108 TNGVISVWDLNKSIRNKLLTVF-------NEHERSANKLDFHS-TEPNILISGSQDGTVKCWD 162 (839)
T ss_pred CCCcEEEEecCccccchhhhHh-------hhhccceeeeeecc-CCccEEEecCCCceEEEEe
Confidence 458999999988 3444444 34778899999655 4456668999999999993
No 39
>PTZ00420 coronin; Provisional
Probab=86.28 E-value=1.4 Score=46.21 Aligned_cols=52 Identities=15% Similarity=0.146 Sum_probs=39.9
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
|.+|+|.|||.++++.+..+. +...|+++.+- .+..++++++.||.+++|..
T Consensus 145 S~DgtIrIWDl~tg~~~~~i~--------~~~~V~Slsws--pdG~lLat~s~D~~IrIwD~ 196 (568)
T PTZ00420 145 GFDSFVNIWDIENEKRAFQIN--------MPKKLSSLKWN--IKGNLLSGTCVGKHMHIIDP 196 (568)
T ss_pred eCCCeEEEEECCCCcEEEEEe--------cCCcEEEEEEC--CCCCEEEEEecCCEEEEEEC
Confidence 678999999999998776553 22468888853 35667788889999999953
No 40
>KOG0270|consensus
Probab=86.07 E-value=1.2 Score=44.40 Aligned_cols=53 Identities=23% Similarity=0.268 Sum_probs=43.8
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+|.+-+|.+||..+|+....+. .|+.+|.+|++.+-.+ .++|+|+-||.++++
T Consensus 262 gsaD~TV~lWD~~~g~p~~s~~-------~~~k~Vq~l~wh~~~p-~~LLsGs~D~~V~l~ 314 (463)
T KOG0270|consen 262 GSADKTVKLWDVDTGKPKSSIT-------HHGKKVQTLEWHPYEP-SVLLSGSYDGTVALK 314 (463)
T ss_pred cCCCceEEEEEcCCCCcceehh-------hcCCceeEEEecCCCc-eEEEeccccceEEee
Confidence 4788999999999998777663 3678999999888544 555779999999998
No 41
>KOG0316|consensus
Probab=85.87 E-value=1.4 Score=40.82 Aligned_cols=51 Identities=25% Similarity=0.373 Sum_probs=42.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
|-+..|.|||.++|+.+-+|. -|..++..++ +|| +.++++.|+-|..+|.|
T Consensus 78 GgDk~v~vwDV~TGkv~Rr~r-------gH~aqVNtV~-fNe-esSVv~SgsfD~s~r~w 128 (307)
T KOG0316|consen 78 GGDKAVQVWDVNTGKVDRRFR-------GHLAQVNTVR-FNE-ESSVVASGSFDSSVRLW 128 (307)
T ss_pred CCCceEEEEEcccCeeeeecc-------cccceeeEEE-ecC-cceEEEeccccceeEEE
Confidence 667899999999999887763 3778999998 464 45667889999999999
No 42
>KOG1081|consensus
Probab=85.74 E-value=0.33 Score=49.40 Aligned_cols=50 Identities=24% Similarity=0.291 Sum_probs=39.9
Q ss_pred eeeeeeccccc-cCCCCCCCCeEEEE---e-CCEEEEEEeeccCCCCceeecccc
Q psy5736 136 YVGVGIYPTVS-LFNHDCYPAVTRYF---N-GKNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 136 ~~g~glyp~~s-~~NHSC~PN~~~~f---~-g~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
.+..+.++..| ++||||.||+.-.- . ...+.+.|.+.|++|+|+|++|..
T Consensus 362 ~id~~~~~n~sr~~nh~~~~~v~~~k~~~~~~t~~~~~a~~~i~~g~e~t~~~n~ 416 (463)
T KOG1081|consen 362 IIDAGPKGNYSRFLNHSCQPNVETEKWQVIGDTRVGLFAPRQIEAGEELTFNYNG 416 (463)
T ss_pred ccccccccchhhhhcccCCCceeechhheecccccccccccccccchhhhheeec
Confidence 34556677666 57999999998653 2 367899999999999999999975
No 43
>KOG0273|consensus
Probab=85.57 E-value=1.6 Score=43.88 Aligned_cols=60 Identities=27% Similarity=0.431 Sum_probs=43.2
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCC-----C-----------------------------CCCCCcccceeccccCCC
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGS-----W-----------------------------ARNSARCTALTLLNEHDD 47 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~-----w-----------------------------~~~~~~~t~l~~ine~~~ 47 (346)
.|.+|++.+||-.+|.....|.-...- | .-|..+|.+||+ | ++.
T Consensus 294 ~~vD~ttilwd~~~g~~~q~f~~~s~~~lDVdW~~~~~F~ts~td~~i~V~kv~~~~P~~t~~GH~g~V~alk~-n-~tg 371 (524)
T KOG0273|consen 294 GGVDGTTILWDAHTGTVKQQFEFHSAPALDVDWQSNDEFATSSTDGCIHVCKVGEDRPVKTFIGHHGEVNALKW-N-PTG 371 (524)
T ss_pred ccCCccEEEEeccCceEEEeeeeccCCccceEEecCceEeecCCCceEEEEEecCCCcceeeecccCceEEEEE-C-CCC
Confidence 367899999999888777655422110 2 335667777773 3 568
Q ss_pred ceEEeeecCCceEEEe
Q psy5736 48 SLLLTGYDDGSLSVYR 63 (346)
Q Consensus 48 ~lll~g~~dG~vri~r 63 (346)
.+|+++++||.++||.
T Consensus 372 ~LLaS~SdD~TlkiWs 387 (524)
T KOG0273|consen 372 SLLASCSDDGTLKIWS 387 (524)
T ss_pred ceEEEecCCCeeEeee
Confidence 8999999999999994
No 44
>KOG0276|consensus
Probab=85.53 E-value=0.96 Score=47.00 Aligned_cols=50 Identities=20% Similarity=0.313 Sum_probs=38.8
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
..|+|.|||++++..+.+|.=.+ ..|-+-|||.- ..-.++|++|+.||+|
T Consensus 33 ynG~V~IWnyetqtmVksfeV~~-------~PvRa~kfiaR--knWiv~GsDD~~IrVf 82 (794)
T KOG0276|consen 33 YNGDVQIWNYETQTMVKSFEVSE-------VPVRAAKFIAR--KNWIVTGSDDMQIRVF 82 (794)
T ss_pred ecCeeEEEecccceeeeeeeecc-------cchhhheeeec--cceEEEecCCceEEEE
Confidence 35999999999999999996444 34556666643 3456889999999998
No 45
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=85.01 E-value=2 Score=37.75 Aligned_cols=54 Identities=13% Similarity=0.212 Sum_probs=40.7
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+.+|.|.|||.++++.+..+.. |...++++.+... ..++++++.||.+++|..-
T Consensus 70 ~~~~~i~i~~~~~~~~~~~~~~-------~~~~i~~~~~~~~--~~~~~~~~~~~~i~~~~~~ 123 (289)
T cd00200 70 SSDKTIRLWDLETGECVRTLTG-------HTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE 123 (289)
T ss_pred cCCCeEEEEEcCcccceEEEec-------cCCcEEEEEEcCC--CCEEEEecCCCeEEEEECC
Confidence 4589999999998877766642 4457888887654 5676767779999999543
No 46
>KOG1338|consensus
Probab=83.86 E-value=0.93 Score=44.73 Aligned_cols=53 Identities=21% Similarity=0.305 Sum_probs=45.7
Q ss_pred ccceeeeeeccccccCCCCCC-CCeEEEEeCCEEEEEEeeccCCCCceeecccc
Q psy5736 133 KINYVGVGIYPTVSLFNHDCY-PAVTRYFNGKNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 133 ~~~~~g~glyp~~s~~NHSC~-PN~~~~f~g~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
+....|-..-|.+-++||.=. -|+.-.+.++.+.+.|.|+|++|+|++.+|+.
T Consensus 207 e~e~ngk~m~p~ad~lNhd~~k~nanl~y~~NcL~mva~r~iekgdev~n~dg~ 260 (466)
T KOG1338|consen 207 EIECNGKLMTPIADFLNHDGLKANANLRYEDNCLEMVADRNIEKGDEVDNSDGL 260 (466)
T ss_pred ccccCcccccchhhhhccchhhcccceeccCcceeeeecCCCCCcccccccccc
Confidence 345678889999999999866 56666778899999999999999999999984
No 47
>KOG0306|consensus
Probab=83.74 E-value=1 Score=47.66 Aligned_cols=54 Identities=30% Similarity=0.388 Sum_probs=39.5
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
-.+|.+||..+|.......-.+ .+.+||++. +.++..++.+|+.||.|++|.+-
T Consensus 42 ~E~vn~WdlRtge~~~~l~~~~-----~k~evt~l~--~~~d~l~lAVGYaDGsVqif~~~ 95 (888)
T KOG0306|consen 42 LEQVNIWDLRTGEIEKKLILLK-----KKAEVTCLR--SSDDILLLAVGYADGSVQIFSLE 95 (888)
T ss_pred cccEeEEeeecchhhhhhhhhc-----ccceEEEee--ccCCcceEEEEecCceEEeeccC
Confidence 4678999999994433332222 235899987 55667778999999999999776
No 48
>KOG0293|consensus
Probab=82.62 E-value=1.4 Score=43.78 Aligned_cols=54 Identities=22% Similarity=0.364 Sum_probs=42.0
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
-++.|.||+-.+|..+...+ | |...|..+. -|+.+-.|...|++||.||||..-
T Consensus 460 ED~kvyIWhr~sgkll~~Ls-G------Hs~~vNcVs-wNP~~p~m~ASasDDgtIRIWg~~ 513 (519)
T KOG0293|consen 460 EDSKVYIWHRISGKLLAVLS-G------HSKTVNCVS-WNPADPEMFASASDDGTIRIWGPS 513 (519)
T ss_pred CCceEEEEEccCCceeEeec-C------CcceeeEEe-cCCCCHHHhhccCCCCeEEEecCC
Confidence 36899999999998886553 2 444455544 588999999999999999999643
No 49
>KOG1334|consensus
Probab=81.99 E-value=1.3 Score=44.80 Aligned_cols=57 Identities=19% Similarity=0.304 Sum_probs=45.6
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+-+.+|-||||.+++...+|..+ |...+---|||--.++.-+.+.+.||.+|.=...
T Consensus 161 SDD~~vv~WdW~~~~~~l~f~SG------H~~NvfQaKFiP~s~d~ti~~~s~dgqvr~s~i~ 217 (559)
T KOG1334|consen 161 SDDLQVVVWDWVSGSPKLSFESG------HCNNVFQAKFIPFSGDRTIVTSSRDGQVRVSEIL 217 (559)
T ss_pred CccceEEeehhhccCcccccccc------cccchhhhhccCCCCCcCceeccccCceeeeeec
Confidence 44678999999999999999765 5667877788888777777778889999875433
No 50
>KOG0274|consensus
Probab=81.81 E-value=2.7 Score=43.67 Aligned_cols=50 Identities=24% Similarity=0.388 Sum_probs=39.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+.+.+|.|||+++++.++-+.- |..-|.++..- +.++++|+.||.|++|+
T Consensus 308 s~D~tVkVW~v~n~~~l~l~~~-------h~~~V~~v~~~----~~~lvsgs~d~~v~VW~ 357 (537)
T KOG0274|consen 308 SRDNTVKVWDVTNGACLNLLRG-------HTGPVNCVQLD----EPLLVSGSYDGTVKVWD 357 (537)
T ss_pred cCCceEEEEeccCcceEEEecc-------ccccEEEEEec----CCEEEEEecCceEEEEE
Confidence 3678999999999999987631 45557776633 78889999999999994
No 51
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=81.76 E-value=2.9 Score=29.58 Aligned_cols=46 Identities=13% Similarity=-0.025 Sum_probs=35.3
Q ss_pred HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
..|...++.|++++|++.++++++.. |.-..+...|..++...|+.
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~l~~~--------P~~~~a~~~lg~~~~~~g~~ 47 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQALKQD--------PDNPEAWYLLGRILYQQGRY 47 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHHHCCS--------TTHHHHHHHHHHHHHHTT-H
T ss_pred hHHHHHHHcCCHHHHHHHHHHHHHHC--------CCCHHHHHHHHHHHHHcCCH
Confidence 34566788899999999998888765 33567888889999888875
No 52
>PLN00181 protein SPA1-RELATED; Provisional
Probab=81.66 E-value=2.8 Score=45.49 Aligned_cols=53 Identities=13% Similarity=0.114 Sum_probs=41.0
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
|.+|+|.|||..++..+..+.. +..+.++.+.. ++..++++|+.||.|++|..
T Consensus 595 s~Dg~v~iWd~~~~~~~~~~~~--------~~~v~~v~~~~-~~g~~latgs~dg~I~iwD~ 647 (793)
T PLN00181 595 SDDGSVKLWSINQGVSIGTIKT--------KANICCVQFPS-ESGRSLAFGSADHKVYYYDL 647 (793)
T ss_pred cCCCEEEEEECCCCcEEEEEec--------CCCeEEEEEeC-CCCCEEEEEeCCCeEEEEEC
Confidence 5689999999999887776642 23577777643 56778899999999999953
No 53
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=81.60 E-value=3 Score=39.22 Aligned_cols=52 Identities=13% Similarity=0.218 Sum_probs=46.0
Q ss_pred HHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 278 EMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 278 ~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
.+|+.|.+.+.+|++.+|...|.++++.| |+....-+|+..|-++|-..|+.
T Consensus 143 ~~Y~~A~~~~ksgdy~~A~~~F~~fi~~Y-----P~s~~~~nA~yWLGe~~y~qg~y 194 (262)
T COG1729 143 KLYNAALDLYKSGDYAEAEQAFQAFIKKY-----PNSTYTPNAYYWLGESLYAQGDY 194 (262)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHcC-----CCCcccchhHHHHHHHHHhcccc
Confidence 37889999999999999999999999999 77788888888888888888775
No 54
>KOG0263|consensus
Probab=81.38 E-value=2.9 Score=44.26 Aligned_cols=54 Identities=20% Similarity=0.298 Sum_probs=45.3
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+.+-+|.+||-.+|..+-.|. + |+..|++|.+- +.+..+.+|+.||.|.+|+.-
T Consensus 554 SsD~tVRlWDv~~G~~VRiF~-G------H~~~V~al~~S--p~Gr~LaSg~ed~~I~iWDl~ 607 (707)
T KOG0263|consen 554 SSDRTVRLWDVSTGNSVRIFT-G------HKGPVTALAFS--PCGRYLASGDEDGLIKIWDLA 607 (707)
T ss_pred CCCceEEEEEcCCCcEEEEec-C------CCCceEEEEEc--CCCceEeecccCCcEEEEEcC
Confidence 467899999999998887772 2 78899999953 588888999999999999654
No 55
>KOG0310|consensus
Probab=81.35 E-value=1.7 Score=43.67 Aligned_cols=49 Identities=31% Similarity=0.419 Sum_probs=38.1
Q ss_pred ceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 6 LKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 6 ~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
..|.|||.-+| +.+..+.| |...+|+|.+-. +...|++|+=||.+++|.
T Consensus 217 n~vkVWDl~~G~qll~~~~~-------H~KtVTcL~l~s--~~~rLlS~sLD~~VKVfd 266 (487)
T KOG0310|consen 217 NSVKVWDLTTGGQLLTSMFN-------HNKTVTCLRLAS--DSTRLLSGSLDRHVKVFD 266 (487)
T ss_pred CeEEEEEecCCceehhhhhc-------ccceEEEEEeec--CCceEeecccccceEEEE
Confidence 47999999966 55555544 345799999655 558889999999999996
No 56
>KOG0271|consensus
Probab=81.26 E-value=2.3 Score=41.95 Aligned_cols=54 Identities=20% Similarity=0.187 Sum_probs=41.6
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceecccc---CCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNE---HDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine---~~~~lll~g~~dG~vri~r 63 (346)
.+|+|.+||=++|+.+-+ +.+.|+.-|++|.+--- ++-.++.+++-||.+|||.
T Consensus 177 ~dg~I~lwdpktg~~~g~------~l~gH~K~It~Lawep~hl~p~~r~las~skDg~vrIWd 233 (480)
T KOG0271|consen 177 KDGSIRLWDPKTGQQIGR------ALRGHKKWITALAWEPLHLVPPCRRLASSSKDGSVRIWD 233 (480)
T ss_pred cCCeEEEecCCCCCcccc------cccCcccceeEEeecccccCCCccceecccCCCCEEEEE
Confidence 689999999999988753 34568889999987433 4455666677799999994
No 57
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=81.12 E-value=4.3 Score=31.65 Aligned_cols=48 Identities=17% Similarity=0.118 Sum_probs=33.7
Q ss_pred HHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 282 RGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 282 ~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
.+...+..+++++|++.+.+++..+ |.++....+...++.+|...|+.
T Consensus 45 l~~~~~~~~~~~~A~~~~~~~~~~~-----p~~~~~~~~~~~~~~~~~~~~~~ 92 (119)
T TIGR02795 45 LGEAYYAQGKYADAAKAFLAVVKKY-----PKSPKAPDALLKLGMSLQELGDK 92 (119)
T ss_pred HHHHHHhhccHHHHHHHHHHHHHHC-----CCCCcccHHHHHHHHHHHHhCCh
Confidence 3444566677777777777777654 67776677788888888877765
No 58
>KOG0266|consensus
Probab=81.06 E-value=3.1 Score=42.21 Aligned_cols=51 Identities=24% Similarity=0.426 Sum_probs=38.9
Q ss_pred cccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+.+++|.|||-+.. .-+-.+. + |...++++.| ++.. .+++.|+.||.||+|
T Consensus 222 s~D~tiriwd~~~~~~~~~~l~-g------H~~~v~~~~f-~p~g-~~i~Sgs~D~tvriW 273 (456)
T KOG0266|consen 222 SDDKTLRIWDLKDDGRNLKTLK-G------HSTYVTSVAF-SPDG-NLLVSGSDDGTVRIW 273 (456)
T ss_pred cCCceEEEeeccCCCeEEEEec-C------CCCceEEEEe-cCCC-CEEEEecCCCcEEEE
Confidence 45889999999444 4444553 2 6778899884 5455 888999999999999
No 59
>KOG0266|consensus
Probab=80.42 E-value=2.9 Score=42.34 Aligned_cols=55 Identities=24% Similarity=0.308 Sum_probs=42.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
+.+++|.|||.++++.+-.+. .|+..|+.+.| + .++.++++++.||.+++|..+.
T Consensus 265 s~D~tvriWd~~~~~~~~~l~-------~hs~~is~~~f-~-~d~~~l~s~s~d~~i~vwd~~~ 319 (456)
T KOG0266|consen 265 SDDGTVRIWDVRTGECVRKLK-------GHSDGISGLAF-S-PDGNLLVSASYDGTIRVWDLET 319 (456)
T ss_pred cCCCcEEEEeccCCeEEEeee-------ccCCceEEEEE-C-CCCCEEEEcCCCccEEEEECCC
Confidence 568999999999987766553 46778998863 3 4666778888899999996653
No 60
>KOG1332|consensus
Probab=80.39 E-value=2.3 Score=39.59 Aligned_cols=55 Identities=25% Similarity=0.234 Sum_probs=40.6
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.+|.|.||..+.| |-+. ..-|..|+..++++.+-=-.-.-+|+.|++||.|++++
T Consensus 78 YDgkVIiWke~~g-~w~k----~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~ 132 (299)
T KOG1332|consen 78 YDGKVIIWKEENG-RWTK----AYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLT 132 (299)
T ss_pred cCceEEEEecCCC-chhh----hhhhhhhcccceeecccccccceEEEEeeCCCcEEEEE
Confidence 5799999999988 4432 22355688889988854323367778889999999984
No 61
>KOG0272|consensus
Probab=80.36 E-value=2.2 Score=42.40 Aligned_cols=52 Identities=15% Similarity=0.213 Sum_probs=43.2
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
|-++++.|||-....-+... |+ |...||.+++.- ....++++++-|+.+++|
T Consensus 364 s~Dnt~kVWDLR~r~~ly~i----pA---H~nlVS~Vk~~p-~~g~fL~TasyD~t~kiW 415 (459)
T KOG0272|consen 364 SSDNTCKVWDLRMRSELYTI----PA---HSNLVSQVKYSP-QEGYFLVTASYDNTVKIW 415 (459)
T ss_pred CCCCcEEEeeecccccceec----cc---ccchhhheEecc-cCCeEEEEcccCcceeee
Confidence 46899999999977666543 54 788899999765 678899999999999999
No 62
>KOG2461|consensus
Probab=80.00 E-value=0.77 Score=45.81 Aligned_cols=33 Identities=24% Similarity=0.517 Sum_probs=27.4
Q ss_pred CeEEEEeCCEEEEEEeeccCCCCceeecccccc
Q psy5736 155 AVTRYFNGKNIIVKALRPLKPKEVVAENYGLVF 187 (346)
Q Consensus 155 N~~~~f~g~~~~v~a~r~I~~Geel~~~Y~~~~ 187 (346)
|....-.+..+..+|+|+|++||||.+.|++.|
T Consensus 114 NL~A~Q~~~~Ifyrt~r~I~p~eELlVWY~~e~ 146 (396)
T KOG2461|consen 114 NLLAFQIGENIFYRTIRDIRPNEELLVWYGSEY 146 (396)
T ss_pred hHHHHhccCceEEEecccCCCCCeEEEEeccch
Confidence 444444578999999999999999999999864
No 63
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=79.99 E-value=3.3 Score=29.43 Aligned_cols=40 Identities=25% Similarity=0.336 Sum_probs=29.8
Q ss_pred hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
++.|++++|++.+++.++.. |.+. .+...|+.||...|+.
T Consensus 2 l~~~~~~~A~~~~~~~l~~~-----p~~~---~~~~~la~~~~~~g~~ 41 (68)
T PF14559_consen 2 LKQGDYDEAIELLEKALQRN-----PDNP---EARLLLAQCYLKQGQY 41 (68)
T ss_dssp HHTTHHHHHHHHHHHHHHHT-----TTSH---HHHHHHHHHHHHTT-H
T ss_pred hhccCHHHHHHHHHHHHHHC-----CCCH---HHHHHHHHHHHHcCCH
Confidence 56788999999999988775 4444 4556788999888875
No 64
>KOG0289|consensus
Probab=79.86 E-value=3.4 Score=41.28 Aligned_cols=50 Identities=20% Similarity=0.321 Sum_probs=40.5
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.+|.|.|||..++.-+..| +.|...|++++|- ..+=.+.++.+||.|++|
T Consensus 367 ~d~~vkiwdlks~~~~a~F-------pght~~vk~i~Fs--ENGY~Lat~add~~V~lw 416 (506)
T KOG0289|consen 367 PDGVVKIWDLKSQTNVAKF-------PGHTGPVKAISFS--ENGYWLATAADDGSVKLW 416 (506)
T ss_pred CCceEEEEEcCCccccccC-------CCCCCceeEEEec--cCceEEEEEecCCeEEEE
Confidence 3789999999999866665 3578899999963 355667889999999999
No 65
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=79.81 E-value=9.4 Score=27.11 Aligned_cols=49 Identities=18% Similarity=0.154 Sum_probs=38.7
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
..++..|...+..+++++|+..+.+.++.- |.|+ .+...++.+|..+|+
T Consensus 4 ~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~-----p~~~---~~~~~~g~~~~~~~~ 52 (69)
T PF13414_consen 4 EAWYNLGQIYFQQGDYEEAIEYFEKAIELD-----PNNA---EAYYNLGLAYMKLGK 52 (69)
T ss_dssp HHHHHHHHHHHHTTHHHHHHHHHHHHHHHS-----TTHH---HHHHHHHHHHHHTTT
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHcC-----CCCH---HHHHHHHHHHHHhCc
Confidence 345566777888999999999999999873 5554 477888889988883
No 66
>KOG0292|consensus
Probab=79.51 E-value=2.5 Score=45.84 Aligned_cols=49 Identities=10% Similarity=0.171 Sum_probs=35.6
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+-||.||||.++.-++...-.| |- +-+-+|+ +++.+..+||-|-.||+|
T Consensus 114 DQTIrIWNwqsr~~iavltGHn-----HY--VMcAqFh--ptEDlIVSaSLDQTVRVW 162 (1202)
T KOG0292|consen 114 DQTIRIWNWQSRKCIAVLTGHN-----HY--VMCAQFH--PTEDLIVSASLDQTVRVW 162 (1202)
T ss_pred CCeEEEEeccCCceEEEEecCc-----eE--EEeeccC--CccceEEEecccceEEEE
Confidence 5799999999998887665444 44 3333333 346666889999999999
No 67
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=79.26 E-value=4.5 Score=35.44 Aligned_cols=51 Identities=16% Similarity=0.340 Sum_probs=38.8
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
+|.|.+||..+++....+. .|...++++.+... ..++++|..||.+++|..
T Consensus 114 ~~~i~~~~~~~~~~~~~~~-------~~~~~i~~~~~~~~--~~~l~~~~~~~~i~i~d~ 164 (289)
T cd00200 114 DKTIKVWDVETGKCLTTLR-------GHTDWVNSVAFSPD--GTFVASSSQDGTIKLWDL 164 (289)
T ss_pred CCeEEEEECCCcEEEEEec-------cCCCcEEEEEEcCc--CCEEEEEcCCCcEEEEEc
Confidence 8999999999887776664 24567888886543 566666666999999954
No 68
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=79.06 E-value=7.2 Score=36.14 Aligned_cols=51 Identities=10% Similarity=0.234 Sum_probs=28.7
Q ss_pred HHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 278 EMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 278 ~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
..|.+|.+.++.|++++|++.|++++..+ |..+....+...|+.+|-..|+
T Consensus 34 ~~Y~~A~~~~~~g~y~~Ai~~f~~l~~~y-----P~s~~a~~a~l~la~ayy~~~~ 84 (243)
T PRK10866 34 EIYATAQQKLQDGNWKQAITQLEALDNRY-----PFGPYSQQVQLDLIYAYYKNAD 84 (243)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHhC-----CCChHHHHHHHHHHHHHHhcCC
Confidence 35556666666666776666666666543 4444444444455555544443
No 69
>KOG0263|consensus
Probab=78.83 E-value=2.2 Score=45.04 Aligned_cols=51 Identities=22% Similarity=0.322 Sum_probs=44.0
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+-+|.|.|||..+|.++-.+. .|+.-|.+|.| .-+..++.+|+.|..|++|
T Consensus 596 ~ed~~I~iWDl~~~~~v~~l~-------~Ht~ti~SlsF--S~dg~vLasgg~DnsV~lW 646 (707)
T KOG0263|consen 596 DEDGLIKIWDLANGSLVKQLK-------GHTGTIYSLSF--SRDGNVLASGGADNSVRLW 646 (707)
T ss_pred ccCCcEEEEEcCCCcchhhhh-------cccCceeEEEE--ecCCCEEEecCCCCeEEEE
Confidence 458999999999999998774 25567899997 6788899999999999999
No 70
>KOG0647|consensus
Probab=78.67 E-value=3 Score=39.92 Aligned_cols=55 Identities=18% Similarity=0.247 Sum_probs=46.1
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
|-++++.+||.++++ +..+. .|..-++.+++++.....+++|||=|..||.|..=
T Consensus 91 ~~Dk~~k~wDL~S~Q-~~~v~-------~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R 145 (347)
T KOG0647|consen 91 GCDKQAKLWDLASGQ-VSQVA-------AHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTR 145 (347)
T ss_pred ccCCceEEEEccCCC-eeeee-------ecccceeEEEEecCCCcceeEecccccceeecccC
Confidence 568999999999994 44442 37788999999999999999999999999999443
No 71
>KOG0276|consensus
Probab=78.55 E-value=3.3 Score=43.17 Aligned_cols=53 Identities=15% Similarity=0.250 Sum_probs=45.8
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.|++||.||...+...-..+. -|.+-+.++.++.-.|.+++.+|++|..+++|
T Consensus 160 sLDrTVKVWslgs~~~nfTl~-------gHekGVN~Vdyy~~gdkpylIsgaDD~tiKvW 212 (794)
T KOG0276|consen 160 SLDRTVKVWSLGSPHPNFTLE-------GHEKGVNCVDYYTGGDKPYLISGADDLTIKVW 212 (794)
T ss_pred eccccEEEEEcCCCCCceeee-------ccccCcceEEeccCCCcceEEecCCCceEEEe
Confidence 589999999999876655553 26678999999999999999999999999999
No 72
>KOG0308|consensus
Probab=77.98 E-value=3.1 Score=43.46 Aligned_cols=53 Identities=23% Similarity=0.435 Sum_probs=42.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
|..+-+.+||..+.+++-+.. + |...|..|- +| +|+..+++|++||.||+|..
T Consensus 190 gtek~lr~wDprt~~kimkLr-G------HTdNVr~ll-~~-dDGt~~ls~sSDgtIrlWdL 242 (735)
T KOG0308|consen 190 GTEKDLRLWDPRTCKKIMKLR-G------HTDNVRVLL-VN-DDGTRLLSASSDGTIRLWDL 242 (735)
T ss_pred CcccceEEeccccccceeeee-c------cccceEEEE-Ec-CCCCeEeecCCCceEEeeec
Confidence 667889999999999998775 2 667788776 33 56777799999999999943
No 73
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=77.94 E-value=2.6 Score=24.32 Aligned_cols=30 Identities=17% Similarity=0.419 Sum_probs=22.5
Q ss_pred CCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 32 NSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 32 ~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
|...++++.+.... .++++|+.||.+++|.
T Consensus 11 ~~~~i~~~~~~~~~--~~~~~~~~d~~~~~~~ 40 (40)
T smart00320 11 HTGPVTSVAFSPDG--KYLASASDDGTIKLWD 40 (40)
T ss_pred cCCceeEEEECCCC--CEEEEecCCCeEEEcC
Confidence 45578888865433 6778899999999873
No 74
>KOG1310|consensus
Probab=77.84 E-value=3.9 Score=42.16 Aligned_cols=55 Identities=15% Similarity=0.223 Sum_probs=47.9
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+-++.|||.-+-+.+-+...+ |...|.+.||+-...+.++++|.-|..|++|..=
T Consensus 71 D~r~ivWd~~~~KllhsI~Tg------HtaNIFsvKFvP~tnnriv~sgAgDk~i~lfdl~ 125 (758)
T KOG1310|consen 71 DTRLIVWDPFEYKLLHSISTG------HTANIFSVKFVPYTNNRIVLSGAGDKLIKLFDLD 125 (758)
T ss_pred cceEEeecchhcceeeeeecc------cccceeEEeeeccCCCeEEEeccCcceEEEEecc
Confidence 457899999988888888766 5689999999999999999999999999999543
No 75
>KOG0278|consensus
Probab=77.30 E-value=4.9 Score=37.70 Aligned_cols=53 Identities=21% Similarity=0.343 Sum_probs=41.6
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
|-++.+..+|+.+|.-+-+|.-| |-..+-+++| .+|..+...||.||.||+|.
T Consensus 243 ged~~~~kfDy~TgeEi~~~nkg------h~gpVhcVrF--SPdGE~yAsGSEDGTirlWQ 295 (334)
T KOG0278|consen 243 GEDFKVYKFDYNTGEEIGSYNKG------HFGPVHCVRF--SPDGELYASGSEDGTIRLWQ 295 (334)
T ss_pred CcceEEEEEeccCCceeeecccC------CCCceEEEEE--CCCCceeeccCCCceEEEEE
Confidence 56788999999999888877333 4456888874 36666778899999999994
No 76
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=76.44 E-value=7.9 Score=36.42 Aligned_cols=51 Identities=18% Similarity=0.148 Sum_probs=31.3
Q ss_pred HHHHHHHH-hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 279 MFYRGIEQ-MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 279 ~~~~a~~~-~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
.|..|..+ ++.+++++|+..|.++++.| |.+.....++..|..+|-..|+.
T Consensus 145 ~Y~~A~~l~~~~~~y~~Ai~af~~fl~~y-----P~s~~a~~A~y~LG~~y~~~g~~ 196 (263)
T PRK10803 145 DYNAAIALVQDKSRQDDAIVAFQNFVKKY-----PDSTYQPNANYWLGQLNYNKGKK 196 (263)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHHC-----cCCcchHHHHHHHHHHHHHcCCH
Confidence 45555554 44567777777777777766 44555555566666666555543
No 77
>KOG0273|consensus
Probab=76.36 E-value=4.4 Score=40.90 Aligned_cols=51 Identities=14% Similarity=0.307 Sum_probs=44.2
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+.+++|.+||-+++..+..|. +|...+-+|+|. ++..++++|+-||.|-+|
T Consensus 429 s~dstV~lwdv~~gv~i~~f~-------kH~~pVysvafS--~~g~ylAsGs~dg~V~iw 479 (524)
T KOG0273|consen 429 SFDSTVKLWDVESGVPIHTLM-------KHQEPVYSVAFS--PNGRYLASGSLDGCVHIW 479 (524)
T ss_pred ecCCeEEEEEccCCceeEeec-------cCCCceEEEEec--CCCcEEEecCCCCeeEec
Confidence 467899999999999999883 367789999965 578999999999999888
No 78
>KOG0277|consensus
Probab=75.94 E-value=4.8 Score=37.77 Aligned_cols=56 Identities=16% Similarity=0.167 Sum_probs=42.9
Q ss_pred ccceEEEEecCC-CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCCccc
Q psy5736 4 LFLKEGIWDHIT-GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYASH 68 (346)
Q Consensus 4 ~~~~~~~w~~~~-~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~sh 68 (346)
-+|+..+||... |+.++ +. .|..||.++.+ |..++.++.||+.|+.||.| |++.-+
T Consensus 168 gd~~l~lwdvr~~gk~~~-i~-------ah~~Eil~cdw-~ky~~~vl~Tg~vd~~vr~wDir~~r~p 226 (311)
T KOG0277|consen 168 GDGTLRLWDVRSPGKFMS-IE-------AHNSEILCCDW-SKYNHNVLATGGVDNLVRGWDIRNLRTP 226 (311)
T ss_pred CCceEEEEEecCCCceeE-EE-------eccceeEeecc-cccCCcEEEecCCCceEEEEehhhcccc
Confidence 478999999854 45554 42 36679998885 66889999999999999999 666433
No 79
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=75.64 E-value=5 Score=23.98 Aligned_cols=29 Identities=10% Similarity=0.168 Sum_probs=22.4
Q ss_pred HHHHHHHHhccccHHHHHHHHHHHHHHhh
Q psy5736 279 MFYRGIEQMNTSCFREAVESLTKFSDQIH 307 (346)
Q Consensus 279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~ 307 (346)
++..|.-....|++++|++.++++++.|-
T Consensus 3 ~~~~a~~~~~~g~~~~A~~~~~~~~~~~P 31 (33)
T PF13174_consen 3 LYRLARCYYKLGDYDEAIEYFQRLIKRYP 31 (33)
T ss_dssp HHHHHHHHHHHCHHHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHHccCHHHHHHHHHHHHHHCc
Confidence 34455555667899999999999999873
No 80
>KOG0275|consensus
Probab=75.46 E-value=1.6 Score=42.12 Aligned_cols=60 Identities=17% Similarity=0.320 Sum_probs=39.5
Q ss_pred cccceEEEEecCCCceee--eeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLI--SWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~--~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
..+|-|-|||+.+|+.-- .|+-|. ++-++..-+-+|.| .-|..|+.+|++||.|++||--
T Consensus 232 SvDGFiEVWny~~GKlrKDLkYQAqd-~fMMmd~aVlci~F--SRDsEMlAsGsqDGkIKvWri~ 293 (508)
T KOG0275|consen 232 SVDGFIEVWNYTTGKLRKDLKYQAQD-NFMMMDDAVLCISF--SRDSEMLASGSQDGKIKVWRIE 293 (508)
T ss_pred cccceeeeehhccchhhhhhhhhhhc-ceeecccceEEEee--cccHHHhhccCcCCcEEEEEEe
Confidence 368899999999994322 343332 23344444444443 2567788999999999999643
No 81
>KOG1034|consensus
Probab=75.46 E-value=3.7 Score=39.77 Aligned_cols=53 Identities=15% Similarity=0.265 Sum_probs=43.0
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+|.-|.|.|-|..+++-..++. -|+..|.+||+.- +.-.++|+||.|..||+|
T Consensus 111 ~G~~GvIrVid~~~~~~~~~~~-------ghG~sINeik~~p-~~~qlvls~SkD~svRlw 163 (385)
T KOG1034|consen 111 GGYLGVIRVIDVVSGQCSKNYR-------GHGGSINEIKFHP-DRPQLVLSASKDHSVRLW 163 (385)
T ss_pred ecceeEEEEEecchhhhcccee-------ccCccchhhhcCC-CCCcEEEEecCCceEEEE
Confidence 5778999999999987666552 2788899998653 556688999999999998
No 82
>KOG0272|consensus
Probab=75.18 E-value=5.7 Score=39.55 Aligned_cols=57 Identities=18% Similarity=0.190 Sum_probs=47.1
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
+..|.+.||+-.+...++.|. -|..++.++.|.-.+++.-+.+|+.||.+++|+.-.
T Consensus 194 swsG~~kvW~~~~~~~~~~l~-------gH~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~~ 250 (459)
T KOG0272|consen 194 SWSGLVKVWSVPQCNLLQTLR-------GHTSRVGAAVFHPVDSDLNLATASADGTVKLWKLSQ 250 (459)
T ss_pred ecCCceeEeecCCcceeEEEe-------ccccceeeEEEccCCCccceeeeccCCceeeeccCC
Confidence 468999999999999998884 378899999988775555568899999999996543
No 83
>KOG0265|consensus
Probab=75.04 E-value=4.4 Score=38.72 Aligned_cols=52 Identities=25% Similarity=0.417 Sum_probs=36.9
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
|-+-+|.+||-++|++.-+|.- |..=+.++. -.--.-.|+-+|++||.+|+|
T Consensus 109 gtDk~v~~wD~~tG~~~rk~k~-------h~~~vNs~~-p~rrg~~lv~SgsdD~t~kl~ 160 (338)
T KOG0265|consen 109 GTDKTVRGWDAETGKRIRKHKG-------HTSFVNSLD-PSRRGPQLVCSGSDDGTLKLW 160 (338)
T ss_pred cCCceEEEEecccceeeehhcc-------ccceeeecC-ccccCCeEEEecCCCceEEEE
Confidence 6688999999999998876532 233234444 222446677788899999999
No 84
>KOG0274|consensus
Probab=74.63 E-value=5.5 Score=41.47 Aligned_cols=54 Identities=24% Similarity=0.313 Sum_probs=42.4
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS 67 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s 67 (346)
.++|+|.+||-+++..+...... |...++++.+. +...+.+++||.+++| ++.+
T Consensus 430 ~aD~~Ik~WD~~~~~~~~~~~~~------~~~~v~~l~~~----~~~il~s~~~~~~~l~-dl~~ 483 (537)
T KOG0274|consen 430 SADGTIKLWDAEEGECLRTLEGR------HVGGVSALALG----KEEILCSSDDGSVKLW-DLRS 483 (537)
T ss_pred cccccEEEeecccCceeeeeccC------CcccEEEeecC----cceEEEEecCCeeEEE-eccc
Confidence 57899999999999988866533 34678888866 6667889999999999 4433
No 85
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=74.41 E-value=5.7 Score=35.70 Aligned_cols=50 Identities=16% Similarity=0.299 Sum_probs=27.4
Q ss_pred HHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 279 MFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
.+..+...+..|++++|+..++++++.+ |.++....+...++.+|...|+
T Consensus 36 ~~~~g~~~~~~~~~~~A~~~~~~~~~~~-----p~~~~~~~a~~~la~~~~~~~~ 85 (235)
T TIGR03302 36 LYEEAKEALDSGDYTEAIKYFEALESRY-----PFSPYAEQAQLDLAYAYYKSGD 85 (235)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhC-----CCchhHHHHHHHHHHHHHhcCC
Confidence 3444555555666666666666655543 4445445555555555555544
No 86
>KOG1517|consensus
Probab=74.24 E-value=6.3 Score=43.64 Aligned_cols=69 Identities=25% Similarity=0.356 Sum_probs=45.4
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccccchhhhhhh
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASHEHRLVTSFQ 77 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh~~~Lv~a~r 77 (346)
||--..|.|||-++-..+-...-+. .+-+|+|-- +.+..++.++|+.||.||+|..=+.....++..+|
T Consensus 1183 tGd~r~IRIWDa~~E~~~~diP~~s------~t~vTaLS~-~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R 1251 (1387)
T KOG1517|consen 1183 TGDVRSIRIWDAHKEQVVADIPYGS------STLVTALSA-DLVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYR 1251 (1387)
T ss_pred cCCeeEEEEEecccceeEeecccCC------Cccceeecc-cccCCceEEEeecCCceEEeecccCCccccceeec
Confidence 3445689999999887777654331 344677763 33447888999999999999433223334555554
No 87
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=73.49 E-value=4.6 Score=26.44 Aligned_cols=27 Identities=26% Similarity=0.429 Sum_probs=24.1
Q ss_pred HhccccHHHHHHHHHHHHHHhhcccCC
Q psy5736 286 QMNTSCFREAVESLTKFSDQIHELVVP 312 (346)
Q Consensus 286 ~~~~~~~~~ai~~l~~~l~~~~~~l~p 312 (346)
.++++++++|++-|+++|+...++++|
T Consensus 11 sle~e~f~qA~~D~~~aL~i~~~l~~~ 37 (38)
T PF10516_consen 11 SLENENFEQAIEDYEKALEIQEELLPP 37 (38)
T ss_pred HHHhccHHHHHHHHHHHHHHHHHhcCC
Confidence 467889999999999999999998876
No 88
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=73.45 E-value=10 Score=34.51 Aligned_cols=80 Identities=11% Similarity=0.041 Sum_probs=47.4
Q ss_pred cccccccCCcCChHHHHH----HHHHHHHHHHHHH---HHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHH
Q psy5736 254 EKKCESCNSTSDLTEIKT----KLSELNEMFYRGI---EQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRN 326 (346)
Q Consensus 254 ~~~C~~C~~~~~~~~~~~----~l~~~~~~~~~a~---~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~ 326 (346)
.|+|+.||-....+.... ..+.+.+....-. +.-..-++++|++.|.-.|-.+ .+...++.....+...|+.
T Consensus 48 V~vCP~CgyA~~~~~F~~l~~~~~~~i~~~i~~~~~~~~~~~~Rt~~~ai~~YkLAll~~-~~~~~~~s~~A~l~LrlAW 126 (214)
T PF09986_consen 48 VWVCPHCGYAAFEEDFEKLSPEQKEKIKENISSRWKPRDFSGERTLEEAIESYKLALLCA-QIKKEKPSKKAGLCLRLAW 126 (214)
T ss_pred EEECCCCCCcccccccccCCHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHH-HHhCCCHHHHHHHHHHHHH
Confidence 599999998654332111 1111111111111 1111234677887776666553 4456777789999999999
Q ss_pred HHHHhCCc
Q psy5736 327 CWSLAGNK 334 (346)
Q Consensus 327 ~y~~~G~~ 334 (346)
+|+..|+.
T Consensus 127 lyR~~~~~ 134 (214)
T PF09986_consen 127 LYRDLGDE 134 (214)
T ss_pred HhhccCCH
Confidence 99999984
No 89
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=73.33 E-value=24 Score=26.76 Aligned_cols=57 Identities=9% Similarity=-0.042 Sum_probs=39.2
Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcc--cC-CCcHhHHHHHHHHHHHHHHh
Q psy5736 275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHEL--VV-PPYKLASLAHEALRNCWSLA 331 (346)
Q Consensus 275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~--l~-p~h~~~~~~~~~L~~~y~~~ 331 (346)
.+.++..+|++.=+.|++++|+.+|.+.++.+... +. +|-......+..+...+.+.
T Consensus 5 ~Ai~~a~~Ave~D~~g~y~eA~~~Y~~aie~l~~~~~~~~~n~~~k~~ir~K~~eYl~RA 64 (76)
T cd02681 5 DAVQFARLAVQRDQEGRYSEAVFYYKEAAQLLIYAEMAGTLNDSHLKTIQEKSNEYLDRA 64 (76)
T ss_pred HHHHHHHHHHHHHHccCHHHHHHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHHHHHHHH
Confidence 45566778888878899999999999999998874 22 22222333466666665554
No 90
>KOG0275|consensus
Probab=72.41 E-value=3.3 Score=39.98 Aligned_cols=51 Identities=24% Similarity=0.300 Sum_probs=37.8
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.+|.|.||..++|.-+-+|. +. |...+|+|.|-. |.+-+|.++-|-.+|+-
T Consensus 283 qDGkIKvWri~tG~ClRrFd--rA----HtkGvt~l~FSr--D~SqiLS~sfD~tvRiH 333 (508)
T KOG0275|consen 283 QDGKIKVWRIETGQCLRRFD--RA----HTKGVTCLSFSR--DNSQILSASFDQTVRIH 333 (508)
T ss_pred cCCcEEEEEEecchHHHHhh--hh----hccCeeEEEEcc--CcchhhcccccceEEEe
Confidence 58999999999999888884 22 566788888643 34444667778878774
No 91
>KOG2055|consensus
Probab=72.28 E-value=7.4 Score=39.23 Aligned_cols=65 Identities=23% Similarity=0.390 Sum_probs=46.8
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCc-ccceeccccCCCceEEeeecCCceEEEeC---Cccccchhhhhh
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSAR-CTALTLLNEHDDSLLLTGYDDGSLSVYRN---YASHEHRLVTSF 76 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~-~t~l~~ine~~~~lll~g~~dG~vri~r~---~~sh~~~Lv~a~ 76 (346)
|-.|.|.|||..+..-+.+|..+.. .|++. |++ .+.+++.+|++.|.|.||.. +.+..|+-+.++
T Consensus 363 ~~~GeV~v~nl~~~~~~~rf~D~G~---v~gts~~~S------~ng~ylA~GS~~GiVNIYd~~s~~~s~~PkPik~~ 431 (514)
T KOG2055|consen 363 GGTGEVYVWNLRQNSCLHRFVDDGS---VHGTSLCIS------LNGSYLATGSDSGIVNIYDGNSCFASTNPKPIKTV 431 (514)
T ss_pred cCCceEEEEecCCcceEEEEeecCc---cceeeeeec------CCCceEEeccCcceEEEeccchhhccCCCCchhhh
Confidence 5578999999999988889987764 34432 233 34569999999999999952 334556666554
No 92
>KOG0308|consensus
Probab=72.01 E-value=6 Score=41.43 Aligned_cols=59 Identities=20% Similarity=0.298 Sum_probs=39.0
Q ss_pred CcccceEEEEecCCC--ceeeeeCCCCCCCCC--CCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 2 TGLFLKEGIWDHITG--SRLISWRPDRGSWAR--NSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~--~~~~~~~~~~~~w~~--~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.||++.|-|||.+++ +.+-+|.|-..+... ++..|=+|. .| ++..+.+.|+..+.+|+|
T Consensus 136 gGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA-~N-~t~t~ivsGgtek~lr~w 198 (735)
T KOG0308|consen 136 GGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLA-MN-QTGTIIVSGGTEKDLRLW 198 (735)
T ss_pred cCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeee-cC-CcceEEEecCcccceEEe
Confidence 589999999999998 445555543322111 233344443 23 445788999999999999
No 93
>KOG0271|consensus
Probab=71.70 E-value=7.2 Score=38.55 Aligned_cols=54 Identities=20% Similarity=0.323 Sum_probs=40.7
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
+-+|+|.|||...++-+-. ..+ |...+|++++= .+.++-+|+.|+.|++|+...
T Consensus 224 skDg~vrIWd~~~~~~~~~-lsg------HT~~VTCvrwG---G~gliySgS~DrtIkvw~a~d 277 (480)
T KOG0271|consen 224 SKDGSVRIWDTKLGTCVRT-LSG------HTASVTCVRWG---GEGLIYSGSQDRTIKVWRALD 277 (480)
T ss_pred cCCCCEEEEEccCceEEEE-ecc------CccceEEEEEc---CCceEEecCCCceEEEEEccc
Confidence 4589999999987754432 222 66789998853 367888999999999997664
No 94
>KOG1840|consensus
Probab=70.62 E-value=9.3 Score=39.48 Aligned_cols=54 Identities=11% Similarity=-0.025 Sum_probs=48.0
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccc
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWV 336 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~ 336 (346)
|.-.+..+++.+|+..|++.+..+..++.+.|+.+..+...|+.+|...|+.-.
T Consensus 248 a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~E 301 (508)
T KOG1840|consen 248 ALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAE 301 (508)
T ss_pred HHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhccCChHH
Confidence 334567788999999999999999999999999999999999999999887643
No 95
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=70.26 E-value=13 Score=34.94 Aligned_cols=52 Identities=15% Similarity=0.332 Sum_probs=38.6
Q ss_pred HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhC
Q psy5736 276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAG 332 (346)
Q Consensus 276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G 332 (346)
+..+|+++++.++.|++++|++.|+++...+ |..+...+++..|+..+-..|
T Consensus 34 ~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~-----p~s~~~~qa~l~l~yA~Yk~~ 85 (254)
T COG4105 34 ASELYNEGLTELQKGNYEEAIKYFEALDSRH-----PFSPYSEQAQLDLAYAYYKNG 85 (254)
T ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHHHcC-----CCCcccHHHHHHHHHHHHhcc
Confidence 4568888999999999999999998887554 666666666666666655544
No 96
>KOG0268|consensus
Probab=70.25 E-value=5.5 Score=39.07 Aligned_cols=53 Identities=19% Similarity=0.252 Sum_probs=42.8
Q ss_pred ceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 6 LKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 6 ~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
.+|.|||.+....+.+|+-+-+ .|.+++ +|+-.-+++.++++|+.|.+|..=+
T Consensus 167 e~i~IWD~~R~~Pv~smswG~D-------ti~svk-fNpvETsILas~~sDrsIvLyD~R~ 219 (433)
T KOG0268|consen 167 EQIDIWDEQRDNPVSSMSWGAD-------SISSVK-FNPVETSILASCASDRSIVLYDLRQ 219 (433)
T ss_pred ceeeecccccCCccceeecCCC-------ceeEEe-cCCCcchheeeeccCCceEEEeccc
Confidence 4699999999999988765543 477777 5888888889999999999995543
No 97
>KOG0315|consensus
Probab=70.19 E-value=9.7 Score=35.72 Aligned_cols=62 Identities=18% Similarity=0.294 Sum_probs=38.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc-ccchhhh
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS-HEHRLVT 74 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s-h~~~Lv~ 74 (346)
|-+|++.|||..+ ..+ |+- ..+.+.++++- +++ ...-|++|-++|.|++|..-++ +..++++
T Consensus 102 seDgt~kIWdlR~--~~~----qR~--~~~~spVn~vv-lhp-nQteLis~dqsg~irvWDl~~~~c~~~liP 164 (311)
T KOG0315|consen 102 SEDGTVKIWDLRS--LSC----QRN--YQHNSPVNTVV-LHP-NQTELISGDQSGNIRVWDLGENSCTHELIP 164 (311)
T ss_pred CCCceEEEEeccC--ccc----chh--ccCCCCcceEE-ecC-CcceEEeecCCCcEEEEEccCCccccccCC
Confidence 5689999999986 222 221 02345677655 343 3445578999999999965543 3344444
No 98
>KOG0305|consensus
Probab=70.13 E-value=6.7 Score=40.16 Aligned_cols=54 Identities=17% Similarity=0.278 Sum_probs=46.6
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
-++.|.+||..+|+++.....+ .-|++|.+.-..-+-+...|++++.|++|+--
T Consensus 366 ~D~~i~fwn~~~g~~i~~vdtg--------sQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~p 419 (484)
T KOG0305|consen 366 ADRCIKFWNTNTGARIDSVDTG--------SQVCSLIWSKKYKELLSTHGYSENQITLWKYP 419 (484)
T ss_pred cccEEEEEEcCCCcEecccccC--------CceeeEEEcCCCCEEEEecCCCCCcEEEEecc
Confidence 4789999999999999876544 46999999998889999999999999999754
No 99
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=69.57 E-value=16 Score=28.37 Aligned_cols=27 Identities=7% Similarity=0.321 Sum_probs=14.8
Q ss_pred HHHHHHHHhccccHHHHHHHHHHHHHH
Q psy5736 279 MFYRGIEQMNTSCFREAVESLTKFSDQ 305 (346)
Q Consensus 279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~ 305 (346)
.+..+...+..+++++|++.+.++++.
T Consensus 5 ~~~~~~~~~~~~~~~~A~~~~~~~~~~ 31 (119)
T TIGR02795 5 YYDAALLVLKAGDYADAIQAFQAFLKK 31 (119)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 344444455556666666666666554
No 100
>KOG0646|consensus
Probab=69.37 E-value=7.4 Score=39.16 Aligned_cols=61 Identities=10% Similarity=0.027 Sum_probs=45.8
Q ss_pred cccceEEEEecCCCceeeeeCCCCCC----CCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGS----WARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~----w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
|-||.|.||+.-.-. .-++.+.+. |..|+.-||+|+.=+-...+.+.+.+.|..+++|..-
T Consensus 142 skDg~V~vW~l~~lv--~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~wdlS 206 (476)
T KOG0646|consen 142 SKDGAVLVWLLTDLV--SADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLWDLS 206 (476)
T ss_pred CCCccEEEEEEEeec--ccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEEEec
Confidence 678999999875431 111111222 8899999999997666789999999999999999543
No 101
>KOG0296|consensus
Probab=69.29 E-value=11 Score=37.09 Aligned_cols=53 Identities=21% Similarity=0.345 Sum_probs=40.4
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
+|.|=+|.-.+++....|+- |..+||+=+|++. ...+++|++||.|++|....
T Consensus 169 DGsvWmw~ip~~~~~kv~~G-------h~~~ct~G~f~pd--GKr~~tgy~dgti~~Wn~kt 221 (399)
T KOG0296|consen 169 DGSVWMWQIPSQALCKVMSG-------HNSPCTCGEFIPD--GKRILTGYDDGTIIVWNPKT 221 (399)
T ss_pred CCcEEEEECCCcceeeEecC-------CCCCcccccccCC--CceEEEEecCceEEEEecCC
Confidence 57777777777655555542 4578999888874 88889999999999997664
No 102
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=69.00 E-value=35 Score=25.45 Aligned_cols=58 Identities=12% Similarity=0.096 Sum_probs=40.7
Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc-CCCcHhH-HHHHHHHHHHHHH
Q psy5736 273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV-VPPYKLA-SLAHEALRNCWSL 330 (346)
Q Consensus 273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l-~p~h~~~-~~~~~~L~~~y~~ 330 (346)
++++..++.+|++.=..|++++|+..|.+.++.+-..+ ..+.+.. ...+..+...+.+
T Consensus 3 ~~~A~~l~~~Av~~D~~g~y~eA~~~Y~~aie~l~~~~k~e~~~~~k~~~~~k~~eyl~R 62 (75)
T cd02678 3 LQKAIELVKKAIEEDNAGNYEEALRLYQHALEYFMHALKYEKNPKSKESIRAKCTEYLDR 62 (75)
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence 45667777888887788999999999999999998865 3344433 3344444444444
No 103
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=68.74 E-value=34 Score=25.75 Aligned_cols=58 Identities=5% Similarity=-0.063 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc-CCCcHhH-HHHHHHHHHHHHH
Q psy5736 273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV-VPPYKLA-SLAHEALRNCWSL 330 (346)
Q Consensus 273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l-~p~h~~~-~~~~~~L~~~y~~ 330 (346)
+.++..++.+|++.-..|++++|+.+|...++.|-..+ ..+++.. ...+..+...+.+
T Consensus 3 l~~Ai~lv~~Av~~D~~g~y~eA~~lY~~ale~~~~~~k~e~~~~~k~~lr~k~~eyl~R 62 (75)
T cd02684 3 LEKAIALVVQAVKKDQRGDAAAALSLYCSALQYFVPALHYETDAQRKEALRQKVLQYVSR 62 (75)
T ss_pred HHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence 45667778888888888999999999999999998866 4455544 3344444444443
No 104
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=67.86 E-value=8.9 Score=38.87 Aligned_cols=53 Identities=11% Similarity=0.125 Sum_probs=42.4
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
...+..+..++..|++++|+..|++.|+.- |.+.....+...++-||+.+|+.
T Consensus 76 ~a~~NLG~AL~~lGryeEAIa~f~rALeL~-----Pd~aeA~~A~yNLAcaya~LGr~ 128 (453)
T PLN03098 76 EDAVNLGLSLFSKGRVKDALAQFETALELN-----PNPDEAQAAYYNKACCHAYREEG 128 (453)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhhC-----CCchHHHHHHHHHHHHHHHcCCH
Confidence 344556677788899999999999999874 77776667788999999998874
No 105
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=66.62 E-value=16 Score=22.14 Aligned_cols=29 Identities=14% Similarity=0.142 Sum_probs=22.4
Q ss_pred HHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736 278 EMFYRGIEQMNTSCFREAVESLTKFSDQI 306 (346)
Q Consensus 278 ~~~~~a~~~~~~~~~~~ai~~l~~~l~~~ 306 (346)
..+..|...+..+++++|+..+++.|+.-
T Consensus 3 ~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 31 (34)
T PF00515_consen 3 AYYNLGNAYFQLGDYEEALEYYQRALELD 31 (34)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHHhCCchHHHHHHHHHHHHC
Confidence 34566777788899999999999999864
No 106
>KOG0281|consensus
Probab=66.57 E-value=7.2 Score=38.15 Aligned_cols=49 Identities=14% Similarity=0.213 Sum_probs=38.7
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
-+.+|.|||..++.-+-. -|+ |++.|.+++ ..+.+.+.|++|..||+|.
T Consensus 338 gDRTikvW~~st~efvRt-l~g------HkRGIAClQ----Yr~rlvVSGSSDntIRlwd 386 (499)
T KOG0281|consen 338 GDRTIKVWSTSTCEFVRT-LNG------HKRGIACLQ----YRDRLVVSGSSDNTIRLWD 386 (499)
T ss_pred CCceEEEEeccceeeehh-hhc------ccccceehh----ccCeEEEecCCCceEEEEe
Confidence 367899999998854422 222 778899987 6789999999999999994
No 107
>KOG0285|consensus
Probab=66.13 E-value=13 Score=36.66 Aligned_cols=54 Identities=15% Similarity=0.242 Sum_probs=40.4
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
|-+-++.|||..+...+..+. -|...++++. .++.+..+ .+||.|+.||+|..-
T Consensus 254 grDst~RvWDiRtr~~V~~l~-------GH~~~V~~V~-~~~~dpqv-it~S~D~tvrlWDl~ 307 (460)
T KOG0285|consen 254 GRDSTIRVWDIRTRASVHVLS-------GHTNPVASVM-CQPTDPQV-ITGSHDSTVRLWDLR 307 (460)
T ss_pred CCcceEEEeeecccceEEEec-------CCCCcceeEE-eecCCCce-EEecCCceEEEeeec
Confidence 567899999999888777663 2566777765 35555555 779999999999543
No 108
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=65.72 E-value=13 Score=28.17 Aligned_cols=36 Identities=14% Similarity=0.127 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhc
Q psy5736 273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHE 308 (346)
Q Consensus 273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~ 308 (346)
+.++..++++|++.=+.|++++|+++|...++.+-.
T Consensus 3 l~kai~Lv~~A~~eD~~gny~eA~~lY~~ale~~~~ 38 (75)
T cd02680 3 LERAHFLVTQAFDEDEKGNAEEAIELYTEAVELCIN 38 (75)
T ss_pred HHHHHHHHHHHHHhhHhhhHHHHHHHHHHHHHHHHH
Confidence 345666777777777789999999999999998866
No 109
>KOG1007|consensus
Probab=65.70 E-value=6 Score=37.78 Aligned_cols=53 Identities=23% Similarity=0.278 Sum_probs=43.7
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
-++++.-||..+..+.++..+- |+-++-+|. +|+.-.-++.++++||-||+|.
T Consensus 191 ~d~tl~~~D~RT~~~~~sI~dA------Hgq~vrdlD-fNpnkq~~lvt~gDdgyvriWD 243 (370)
T KOG1007|consen 191 SDSTLQFWDLRTMKKNNSIEDA------HGQRVRDLD-FNPNKQHILVTCGDDGYVRIWD 243 (370)
T ss_pred CCCcEEEEEccchhhhcchhhh------hcceeeecc-CCCCceEEEEEcCCCccEEEEe
Confidence 4688999999988777766432 777888888 6888888999999999999993
No 110
>KOG0291|consensus
Probab=65.24 E-value=13 Score=39.86 Aligned_cols=51 Identities=18% Similarity=0.354 Sum_probs=42.0
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+-+|.|.|||-.+|--...|.. |.+-+|.++|.- ....+++.+-||.||.|
T Consensus 369 ~eDgKVKvWn~~SgfC~vTFte-------Hts~Vt~v~f~~--~g~~llssSLDGtVRAw 419 (893)
T KOG0291|consen 369 AEDGKVKVWNTQSGFCFVTFTE-------HTSGVTAVQFTA--RGNVLLSSSLDGTVRAW 419 (893)
T ss_pred cCCCcEEEEeccCceEEEEecc-------CCCceEEEEEEe--cCCEEEEeecCCeEEee
Confidence 4589999999999988888863 556799999764 45666889999999998
No 111
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=65.17 E-value=35 Score=26.25 Aligned_cols=54 Identities=11% Similarity=0.059 Sum_probs=41.6
Q ss_pred HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
+...+++++++.++.+.++|+....+.|++..+ .+.-..+--.|+++|.+.|..
T Consensus 6 ak~~ie~GlkLY~~~~~~~Al~~W~~aL~k~~~-----~~~rf~~lG~l~qA~~e~Gky 59 (80)
T PF10579_consen 6 AKQQIEKGLKLYHQNETQQALQKWRKALEKITD-----REDRFRVLGYLIQAHMEWGKY 59 (80)
T ss_pred HHHHHHHHHHHhccchHHHHHHHHHHHHhhcCC-----hHHHHHHHHHHHHHHHHHHHH
Confidence 445567788888888888999999999988744 346777777888888888753
No 112
>KOG0306|consensus
Probab=64.62 E-value=9.7 Score=40.67 Aligned_cols=51 Identities=20% Similarity=0.184 Sum_probs=41.2
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
-+|.|.|||-+++..+..|+ -||..||.|++= ..+..+++|+-||.|-+|.
T Consensus 85 aDGsVqif~~~s~~~~~tfn-------gHK~AVt~l~fd--~~G~rlaSGskDt~IIvwD 135 (888)
T KOG0306|consen 85 ADGSVQIFSLESEEILITFN-------GHKAAVTTLKFD--KIGTRLASGSKDTDIIVWD 135 (888)
T ss_pred cCceEEeeccCCCceeeeec-------ccccceEEEEEc--ccCceEeecCCCccEEEEE
Confidence 37999999999998888772 288999999953 3456678899999998884
No 113
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=64.22 E-value=18 Score=21.63 Aligned_cols=28 Identities=21% Similarity=0.297 Sum_probs=21.3
Q ss_pred HHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736 279 MFYRGIEQMNTSCFREAVESLTKFSDQI 306 (346)
Q Consensus 279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~ 306 (346)
.+..|.-.+..|++++|++.+++.++..
T Consensus 4 ~~~lg~~~~~~~~~~~A~~~~~~al~l~ 31 (34)
T PF07719_consen 4 WYYLGQAYYQLGNYEEAIEYFEKALELD 31 (34)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHC
Confidence 3445566678899999999999999865
No 114
>KOG0284|consensus
Probab=64.03 E-value=6.6 Score=39.04 Aligned_cols=73 Identities=26% Similarity=0.435 Sum_probs=51.3
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--e------CCccccchhhh
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--R------NYASHEHRLVT 74 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r------~~~sh~~~Lv~ 74 (346)
|-+..|.+||-.+|.-+++.. .||.-|..++| |.+. ..||+++-|..+++| | -|..|+.+..+
T Consensus 241 skDnlVKlWDprSg~cl~tlh-------~HKntVl~~~f-~~n~-N~Llt~skD~~~kv~DiR~mkEl~~~r~Hkkdv~~ 311 (464)
T KOG0284|consen 241 SKDNLVKLWDPRSGSCLATLH-------GHKNTVLAVKF-NPNG-NWLLTGSKDQSCKVFDIRTMKELFTYRGHKKDVTS 311 (464)
T ss_pred cCCceeEeecCCCcchhhhhh-------hccceEEEEEE-cCCC-CeeEEccCCceEEEEehhHhHHHHHhhcchhhhee
Confidence 446689999999999888653 37778899985 5454 999999999999998 3 12235555444
Q ss_pred h-hhhhccccc
Q psy5736 75 S-FQALTDTSF 84 (346)
Q Consensus 75 a-~r~L~~~~y 84 (346)
. |+-+....|
T Consensus 312 ~~WhP~~~~lf 322 (464)
T KOG0284|consen 312 LTWHPLNESLF 322 (464)
T ss_pred eccccccccce
Confidence 3 555544433
No 115
>KOG4328|consensus
Probab=63.95 E-value=7.9 Score=38.98 Aligned_cols=51 Identities=14% Similarity=0.236 Sum_probs=35.1
Q ss_pred ccceEEEEecC---CCc-eeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHI---TGS-RLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~---~~~-~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
-+|+|.+||.. ... +.-.| . +|+..|+.|+|. +.+.+..+..|.||.+|.-
T Consensus 208 K~G~VG~Wn~~~~~~d~d~v~~f---~----~hs~~Vs~l~F~-P~n~s~i~ssSyDGtiR~~ 262 (498)
T KOG4328|consen 208 KGGQVGLWNFGTQEKDKDGVYLF---T----PHSGPVSGLKFS-PANTSQIYSSSYDGTIRLQ 262 (498)
T ss_pred CCCcEEEEecCCCCCccCceEEe---c----cCCccccceEec-CCChhheeeeccCceeeee
Confidence 47999999994 222 22233 2 377889999965 4454555668999999874
No 116
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=63.90 E-value=5.2 Score=27.05 Aligned_cols=30 Identities=17% Similarity=0.435 Sum_probs=18.9
Q ss_pred ccccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736 228 RIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD 265 (346)
Q Consensus 228 ~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~ 265 (346)
-++|+ +|+..+...... ..+.|+.||..+.
T Consensus 3 ~y~C~--~CG~~~~~~~~~------~~~~Cp~CG~~~~ 32 (46)
T PRK00398 3 EYKCA--RCGREVELDEYG------TGVRCPYCGYRIL 32 (46)
T ss_pred EEECC--CCCCEEEECCCC------CceECCCCCCeEE
Confidence 46788 888765444221 1478888887643
No 117
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=63.64 E-value=53 Score=24.89 Aligned_cols=57 Identities=7% Similarity=0.081 Sum_probs=40.6
Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc--CCCcHhHHHHHHHHHHHHHHh
Q psy5736 275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV--VPPYKLASLAHEALRNCWSLA 331 (346)
Q Consensus 275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l--~p~h~~~~~~~~~L~~~y~~~ 331 (346)
.+.....+|++.=..|++++|+..|.+.++.+.+++ .|.++.....++.+..+..+.
T Consensus 5 ~A~~~a~~AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~~yr~ki~eY~~Ra 63 (75)
T cd02682 5 MARKYAINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRLIYEQMINEYKRRI 63 (75)
T ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHHHHHHHHHHHHHHH
Confidence 344556677877778999999999999999998865 345555555566665555443
No 118
>KOG0772|consensus
Probab=63.61 E-value=6.4 Score=40.29 Aligned_cols=56 Identities=21% Similarity=0.193 Sum_probs=37.8
Q ss_pred ccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 4 LFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 4 ~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
++|.|.+||..+- .+.+..- . .++. .+..||++.|- .|..+||+-+-|+.+++|.+
T Consensus 337 ~DGSIQ~W~~~~~~v~p~~~v-k-~AH~-~g~~Itsi~FS--~dg~~LlSRg~D~tLKvWDL 393 (641)
T KOG0772|consen 337 LDGSIQIWDKGSRTVRPVMKV-K-DAHL-PGQDITSISFS--YDGNYLLSRGFDDTLKVWDL 393 (641)
T ss_pred cCCceeeeecCCcccccceEe-e-eccC-CCCceeEEEec--cccchhhhccCCCceeeeec
Confidence 6899999998432 2222221 1 2211 24589999963 67888898889999999943
No 119
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=63.50 E-value=8 Score=31.63 Aligned_cols=30 Identities=33% Similarity=0.473 Sum_probs=24.0
Q ss_pred ccceeccccCC--CceEEeeecCCceEEEeCC
Q psy5736 36 CTALTLLNEHD--DSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 36 ~t~l~~ine~~--~~lll~g~~dG~vri~r~~ 65 (346)
+++|-+..-+. ..-|++|++|..||+|++=
T Consensus 2 V~al~~~d~d~dg~~eLlvGs~D~~IRvf~~~ 33 (111)
T PF14783_consen 2 VTALCLFDFDGDGENELLVGSDDFEIRVFKGD 33 (111)
T ss_pred eeEEEEEecCCCCcceEEEecCCcEEEEEeCC
Confidence 67777777644 4778999999999999765
No 120
>KOG2110|consensus
Probab=62.98 E-value=14 Score=36.25 Aligned_cols=49 Identities=20% Similarity=0.289 Sum_probs=41.3
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCce-EEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSL-SVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~v-ri~ 62 (346)
-|.|.|||-.+-+..+.+. . |+..+.+|.| | +++.++.+||+-|.| |+|
T Consensus 152 ~GdV~l~d~~nl~~v~~I~---a----H~~~lAalaf-s-~~G~llATASeKGTVIRVf 201 (391)
T KOG2110|consen 152 SGDVVLFDTINLQPVNTIN---A----HKGPLAALAF-S-PDGTLLATASEKGTVIRVF 201 (391)
T ss_pred CceEEEEEcccceeeeEEE---e----cCCceeEEEE-C-CCCCEEEEeccCceEEEEE
Confidence 4889999999999999885 2 7889999985 3 788999999998876 776
No 121
>KOG1840|consensus
Probab=62.58 E-value=19 Score=37.18 Aligned_cols=57 Identities=14% Similarity=-0.003 Sum_probs=50.7
Q ss_pred hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccccCccccc
Q psy5736 287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWVIPENYSQ 343 (346)
Q Consensus 287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~~~~~~~~ 343 (346)
...+++++|+.++.+.++++.+.+.+.|+.+...+--|+.+|-..|+...--++++.
T Consensus 336 ~~~~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ 392 (508)
T KOG1840|consen 336 QSMNEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKK 392 (508)
T ss_pred HHhcchhHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHH
Confidence 345779999999999999999999999999999999999999999998777666654
No 122
>KOG4328|consensus
Probab=61.84 E-value=8.2 Score=38.88 Aligned_cols=56 Identities=16% Similarity=0.087 Sum_probs=44.0
Q ss_pred ccceEEEEecCCCce-eeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCCcc
Q psy5736 4 LFLKEGIWDHITGSR-LISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYAS 67 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~-~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~s 67 (346)
-+|.++|||...+.. +. .|..|+.+|++|. +|+.-.-++++++.|+..+|| |.+..
T Consensus 299 ~~G~f~~iD~R~~~s~~~-------~~~lh~kKI~sv~-~NP~~p~~laT~s~D~T~kIWD~R~l~~ 357 (498)
T KOG4328|consen 299 NVGNFNVIDLRTDGSEYE-------NLRLHKKKITSVA-LNPVCPWFLATASLDQTAKIWDLRQLRG 357 (498)
T ss_pred cccceEEEEeecCCccch-------hhhhhhcccceee-cCCCCchheeecccCcceeeeehhhhcC
Confidence 467899999977643 32 2567889999998 788777788999999999999 56643
No 123
>KOG0279|consensus
Probab=61.84 E-value=15 Score=34.98 Aligned_cols=59 Identities=22% Similarity=0.328 Sum_probs=39.1
Q ss_pred cccceEEEEecCCCceeeeeCCCCC--CCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRG--SWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~--~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+....|.|||-++++.+-...-.-. +-+....+|.+|.+.+ |+.-|+.|++|+.||+|.
T Consensus 251 at~~sIkIwdl~~~~~v~~l~~d~~g~s~~~~~~~clslaws~--dG~tLf~g~td~~irv~q 311 (315)
T KOG0279|consen 251 ATATSIKIWDLESKAVVEELKLDGIGPSSKAGDPICLSLAWSA--DGQTLFAGYTDNVIRVWQ 311 (315)
T ss_pred ccCCceEEEeccchhhhhhccccccccccccCCcEEEEEEEcC--CCcEEEeeecCCcEEEEE
Confidence 3445688999988877664432111 1122245688877654 677778899999999984
No 124
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=61.59 E-value=55 Score=24.66 Aligned_cols=44 Identities=16% Similarity=0.074 Sum_probs=34.6
Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc-CCCcHh
Q psy5736 273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV-VPPYKL 316 (346)
Q Consensus 273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l-~p~h~~ 316 (346)
+.++..++.+|++.=+.+++++|++.|.+.++.|...+ ..+++.
T Consensus 3 l~~A~~l~~~Ave~d~~~~y~eA~~~Y~~~i~~~~~~~k~e~~~~ 47 (75)
T cd02677 3 LEQAAELIRLALEKEEEGDYEAAFEFYRAGVDLLLKGVQGDSSPE 47 (75)
T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhccCCCHH
Confidence 44566777888888788999999999999999998866 345543
No 125
>KOG0771|consensus
Probab=61.57 E-value=8.4 Score=38.17 Aligned_cols=53 Identities=17% Similarity=0.156 Sum_probs=37.9
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
|-+|+++||+|.+-..+..+. .|+.+|++|.| . .|...+.+=+.| ..++|.--
T Consensus 163 g~dg~lRv~~~Ps~~t~l~e~-------~~~~eV~DL~F-S-~dgk~lasig~d-~~~VW~~~ 215 (398)
T KOG0771|consen 163 GTDGTLRVWEWPSMLTILEEI-------AHHAEVKDLDF-S-PDGKFLASIGAD-SARVWSVN 215 (398)
T ss_pred cccceEEEEecCcchhhhhhH-------hhcCcccccee-C-CCCcEEEEecCC-ceEEEEec
Confidence 679999999999887776542 37889999985 3 344444544455 88999543
No 126
>KOG0277|consensus
Probab=61.55 E-value=15 Score=34.53 Aligned_cols=52 Identities=13% Similarity=0.229 Sum_probs=41.5
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
-+|+|.+||...++-+..|.-. ...|=..+ .+++...+.+.+|.||.+|+|.
T Consensus 125 WD~TiKLW~~~r~~Sv~Tf~gh-------~~~Iy~a~-~sp~~~nlfas~Sgd~~l~lwd 176 (311)
T KOG0277|consen 125 WDGTIKLWDPNRPNSVQTFNGH-------NSCIYQAA-FSPHIPNLFASASGDGTLRLWD 176 (311)
T ss_pred cCCceEeecCCCCcceEeecCC-------ccEEEEEe-cCCCCCCeEEEccCCceEEEEE
Confidence 5899999999999999887533 34453333 5788889999999999999995
No 127
>KOG0279|consensus
Probab=61.41 E-value=13 Score=35.27 Aligned_cols=54 Identities=13% Similarity=0.304 Sum_probs=42.1
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+-+|++.+||-++|+...+|. -|...+.++. |+ .|+.-..+||-|..+.+|.-.
T Consensus 82 swD~~lrlWDl~~g~~t~~f~-------GH~~dVlsva-~s-~dn~qivSGSrDkTiklwnt~ 135 (315)
T KOG0279|consen 82 SWDGTLRLWDLATGESTRRFV-------GHTKDVLSVA-FS-TDNRQIVSGSRDKTIKLWNTL 135 (315)
T ss_pred cccceEEEEEecCCcEEEEEE-------ecCCceEEEE-ec-CCCceeecCCCcceeeeeeec
Confidence 468999999999998888885 2667788877 44 455556889999999999543
No 128
>KOG0268|consensus
Probab=60.07 E-value=7.9 Score=38.01 Aligned_cols=57 Identities=18% Similarity=0.252 Sum_probs=42.5
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS 67 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s 67 (346)
+.+.+|.|+....+.----|.+.++ -.+.++|+ .+|..+.+.||+||+||+|+-+-+
T Consensus 291 syDksIRIf~~~~~~SRdiYhtkRM------q~V~~Vk~--S~Dskyi~SGSdd~nvRlWka~As 347 (433)
T KOG0268|consen 291 SYDKSIRIFPVNHGHSRDIYHTKRM------QHVFCVKY--SMDSKYIISGSDDGNVRLWKAKAS 347 (433)
T ss_pred cccceEEEeecCCCcchhhhhHhhh------heeeEEEE--eccccEEEecCCCcceeeeecchh
Confidence 4577899998877754444555544 35777774 478899999999999999987754
No 129
>KOG0640|consensus
Probab=59.41 E-value=16 Score=35.33 Aligned_cols=54 Identities=20% Similarity=0.332 Sum_probs=40.9
Q ss_pred eEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 7 KEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 7 ~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
++++||.++-+-..+-..+. -|+.-|++++. ..+.++..+|+-||.|++|....
T Consensus 239 ~~rlYdv~T~QcfvsanPd~----qht~ai~~V~Y--s~t~~lYvTaSkDG~IklwDGVS 292 (430)
T KOG0640|consen 239 TLRLYDVNTYQCFVSANPDD----QHTGAITQVRY--SSTGSLYVTASKDGAIKLWDGVS 292 (430)
T ss_pred ceeEEeccceeEeeecCccc----ccccceeEEEe--cCCccEEEEeccCCcEEeecccc
Confidence 57899999776555444333 37778999873 36788889999999999997664
No 130
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=57.96 E-value=5.6 Score=25.63 Aligned_cols=33 Identities=24% Similarity=0.519 Sum_probs=18.5
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS 264 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~ 264 (346)
+.|| +|...+..++..-| ......+|+.|+...
T Consensus 3 i~Cp--~C~~~y~i~d~~ip-~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 3 ITCP--NCQAKYEIDDEKIP-PKGRKVRCSKCGHVF 35 (36)
T ss_pred EECC--CCCCEEeCCHHHCC-CCCcEEECCCCCCEe
Confidence 5677 78765443322111 223458899998753
No 131
>KOG0313|consensus
Probab=57.75 E-value=13 Score=36.66 Aligned_cols=51 Identities=16% Similarity=0.251 Sum_probs=35.9
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+-+-+|.+||.++|.-......++ -++++... ....++++|++|-.+|+|.
T Consensus 278 SwDHTIk~WDletg~~~~~~~~~k--------sl~~i~~~--~~~~Ll~~gssdr~irl~D 328 (423)
T KOG0313|consen 278 SWDHTIKVWDLETGGLKSTLTTNK--------SLNCISYS--PLSKLLASGSSDRHIRLWD 328 (423)
T ss_pred cccceEEEEEeecccceeeeecCc--------ceeEeecc--cccceeeecCCCCceeecC
Confidence 357799999999997665443332 23443322 3678889999999999993
No 132
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=57.08 E-value=29 Score=31.41 Aligned_cols=52 Identities=15% Similarity=0.112 Sum_probs=36.0
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+.+|+|.+||-++++.+..|.... .+..+.+ .++...+.++++.++.+++|.
T Consensus 8 ~~d~~v~~~d~~t~~~~~~~~~~~--------~~~~l~~-~~dg~~l~~~~~~~~~v~~~d 59 (300)
T TIGR03866 8 EKDNTISVIDTATLEVTRTFPVGQ--------RPRGITL-SKDGKLLYVCASDSDTIQVID 59 (300)
T ss_pred cCCCEEEEEECCCCceEEEEECCC--------CCCceEE-CCCCCEEEEEECCCCeEEEEE
Confidence 568999999999998887775322 2445554 334344556778899999985
No 133
>KOG0639|consensus
Probab=57.02 E-value=12 Score=38.22 Aligned_cols=52 Identities=17% Similarity=0.293 Sum_probs=38.1
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
-+|+|.|||..++..+-.|.-.- -+..|-+|+ +|..-+-+|+-|..||-|..
T Consensus 529 sdGnI~vwDLhnq~~VrqfqGht-----DGascIdis----~dGtklWTGGlDntvRcWDl 580 (705)
T KOG0639|consen 529 SDGNIAVWDLHNQTLVRQFQGHT-----DGASCIDIS----KDGTKLWTGGLDNTVRCWDL 580 (705)
T ss_pred cCCcEEEEEcccceeeecccCCC-----CCceeEEec----CCCceeecCCCccceeehhh
Confidence 37999999999998888885332 234444433 56666788999999999943
No 134
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=56.85 E-value=25 Score=25.60 Aligned_cols=37 Identities=14% Similarity=0.241 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc
Q psy5736 274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV 310 (346)
Q Consensus 274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l 310 (346)
+.+.....+|++.=+.|++++|++.|.+.++.+...+
T Consensus 3 ~~A~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~l~~~~ 39 (69)
T PF04212_consen 3 DKAIELIKKAVEADEAGNYEEALELYKEAIEYLMQAL 39 (69)
T ss_dssp HHHHHHHHHHHHHHHTTSHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHh
Confidence 4455666777777778999999999999998887754
No 135
>KOG0643|consensus
Probab=55.73 E-value=17 Score=34.44 Aligned_cols=51 Identities=24% Similarity=0.334 Sum_probs=40.8
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
-+|.|++||-.+|..+..- . ..|...|++|++ ..|..+.++|+.|..-++|
T Consensus 167 e~G~is~~da~~g~~~v~s--~----~~h~~~Ind~q~--s~d~T~FiT~s~Dttakl~ 217 (327)
T KOG0643|consen 167 EDGSISIYDARTGKELVDS--D----EEHSSKINDLQF--SRDRTYFITGSKDTTAKLV 217 (327)
T ss_pred CCCcEEEEEcccCceeeec--h----hhhccccccccc--cCCcceEEecccCccceee
Confidence 4799999999999777632 1 257889999984 4788999999988888776
No 136
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=55.71 E-value=81 Score=23.23 Aligned_cols=57 Identities=11% Similarity=0.158 Sum_probs=36.9
Q ss_pred HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccC-CCcHhH-HHHHHHHHHHHHH
Q psy5736 274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVV-PPYKLA-SLAHEALRNCWSL 330 (346)
Q Consensus 274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~-p~h~~~-~~~~~~L~~~y~~ 330 (346)
..+..+...|++.=+.|++++|+..|...++.+...+. .+.+.. ...+..+...+.+
T Consensus 4 ~~a~~l~~~Av~~D~~g~~~~Al~~Y~~a~e~l~~~~~~~~~~~~k~~l~~k~~~yl~R 62 (75)
T cd02656 4 QQAKELIKQAVKEDEDGNYEEALELYKEALDYLLQALKAEKEPKLRKLLRKKVKEYLDR 62 (75)
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHH
Confidence 34455666676666679999999999999999988663 233332 3334444444443
No 137
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=55.46 E-value=54 Score=27.82 Aligned_cols=53 Identities=9% Similarity=-0.002 Sum_probs=30.1
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
...+..+......+++++|+..+.+.+... |.+.....+...|..+|...|+.
T Consensus 36 ~~~~~~g~~~~~~g~~~~A~~~~~~al~l~-----~~~~~~~~~~~~lg~~~~~~g~~ 88 (168)
T CHL00033 36 FTYYRDGMSAQSEGEYAEALQNYYEAMRLE-----IDPYDRSYILYNIGLIHTSNGEH 88 (168)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhcc-----ccchhhHHHHHHHHHHHHHcCCH
Confidence 334445555566677777777777776652 33333344555666666665543
No 138
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=55.24 E-value=73 Score=24.07 Aligned_cols=56 Identities=13% Similarity=0.128 Sum_probs=37.5
Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccC-CCcH-hHHHHHHHHHHHHHH
Q psy5736 275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVV-PPYK-LASLAHEALRNCWSL 330 (346)
Q Consensus 275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~-p~h~-~~~~~~~~L~~~y~~ 330 (346)
.+..++.+|++.=..|++++|+..|.+.|+.|..++. .+++ .-...+..+...+.+
T Consensus 5 ~a~~l~~~Ave~D~~g~y~eAl~~Y~~aie~l~~~lk~e~d~~~k~~~r~ki~eY~~R 62 (77)
T cd02683 5 AAKEVLKRAVELDQEGRFQEALVCYQEGIDLLMQVLKGTKDEAKKKNLRQKISEYMDR 62 (77)
T ss_pred HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence 3455667777777789999999999999999988653 2333 333334444444443
No 139
>KOG0290|consensus
Probab=55.21 E-value=16 Score=35.10 Aligned_cols=55 Identities=16% Similarity=0.239 Sum_probs=38.1
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.++-|-+|||.|+|.- -+...|=-+ |.+|+-++.|.. ..-.+..+-+.||.+|+|
T Consensus 170 SiDTTCTiWdie~~~~-~~vkTQLIA---HDKEV~DIaf~~-~s~~~FASvgaDGSvRmF 224 (364)
T KOG0290|consen 170 SIDTTCTIWDIETGVS-GTVKTQLIA---HDKEVYDIAFLK-GSRDVFASVGADGSVRMF 224 (364)
T ss_pred cccCeEEEEEEeeccc-cceeeEEEe---cCcceeEEEecc-CccceEEEecCCCcEEEE
Confidence 3577889999999721 011122222 778999999887 445555667789999999
No 140
>KOG0286|consensus
Probab=54.84 E-value=26 Score=33.56 Aligned_cols=50 Identities=20% Similarity=0.350 Sum_probs=42.2
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+.+..+||.|+|+++..|. .|...|-+|.+--+ +.....+|+-|+.-++|
T Consensus 165 D~TCalWDie~g~~~~~f~-------GH~gDV~slsl~p~-~~ntFvSg~cD~~aklW 214 (343)
T KOG0286|consen 165 DMTCALWDIETGQQTQVFH-------GHTGDVMSLSLSPS-DGNTFVSGGCDKSAKLW 214 (343)
T ss_pred CceEEEEEcccceEEEEec-------CCcccEEEEecCCC-CCCeEEecccccceeee
Confidence 5688999999999999985 26788999886554 67777899999999999
No 141
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=54.77 E-value=36 Score=31.97 Aligned_cols=50 Identities=14% Similarity=-0.027 Sum_probs=40.2
Q ss_pred HHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccc
Q psy5736 282 RGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWV 336 (346)
Q Consensus 282 ~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~ 336 (346)
.|.-.+..|++++|+..+.++++.| |.|+....+...+..+|..+|+.-.
T Consensus 186 LG~~y~~~g~~~~A~~~f~~vv~~y-----P~s~~~~dAl~klg~~~~~~g~~~~ 235 (263)
T PRK10803 186 LGQLNYNKGKKDDAAYYFASVVKNY-----PKSPKAADAMFKVGVIMQDKGDTAK 235 (263)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHHC-----CCCcchhHHHHHHHHHHHHcCCHHH
Confidence 3444566788999999999988887 7888888899889999988887643
No 142
>KOG1446|consensus
Probab=54.75 E-value=27 Score=33.57 Aligned_cols=48 Identities=13% Similarity=0.189 Sum_probs=34.5
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+|+|+|||.++|.++..+...+ ...++.++ +|+ --. |-.+.+..+..|
T Consensus 253 dg~i~vw~~~tg~~v~~~~~~~------~~~~~~~~-fnP-~~~--mf~sa~s~l~fw 300 (311)
T KOG1446|consen 253 DGTIHVWNLETGKKVAVLRGPN------GGPVSCVR-FNP-RYA--MFVSASSNLVFW 300 (311)
T ss_pred CCcEEEEEcCCCcEeeEecCCC------CCCccccc-cCC-cee--eeeecCceEEEE
Confidence 5999999999999999997654 45677777 453 122 335566677776
No 143
>PRK00420 hypothetical protein; Validated
Probab=54.60 E-value=60 Score=26.60 Aligned_cols=10 Identities=10% Similarity=0.132 Sum_probs=5.7
Q ss_pred ccccccCCcC
Q psy5736 255 KKCESCNSTS 264 (346)
Q Consensus 255 ~~C~~C~~~~ 264 (346)
..|+.||...
T Consensus 41 ~~Cp~Cg~~~ 50 (112)
T PRK00420 41 VVCPVHGKVY 50 (112)
T ss_pred eECCCCCCee
Confidence 5566666643
No 144
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=53.41 E-value=13 Score=32.85 Aligned_cols=10 Identities=20% Similarity=0.667 Sum_probs=5.7
Q ss_pred ccccccCCcC
Q psy5736 255 KKCESCNSTS 264 (346)
Q Consensus 255 ~~C~~C~~~~ 264 (346)
..|+.||..+
T Consensus 137 F~Cp~Cg~~L 146 (178)
T PRK06266 137 FRCPQCGEML 146 (178)
T ss_pred CcCCCCCCCC
Confidence 5555665554
No 145
>KOG0303|consensus
Probab=53.12 E-value=26 Score=34.92 Aligned_cols=52 Identities=29% Similarity=0.382 Sum_probs=41.2
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.|-+.+|.|||.++|.-+.... |---|.++.| | .+.+++.+..-|..||+|.
T Consensus 150 ag~Dn~v~iWnv~tgeali~l~--------hpd~i~S~sf-n-~dGs~l~TtckDKkvRv~d 201 (472)
T KOG0303|consen 150 AGSDNTVSIWNVGTGEALITLD--------HPDMVYSMSF-N-RDGSLLCTTCKDKKVRVID 201 (472)
T ss_pred ccCCceEEEEeccCCceeeecC--------CCCeEEEEEe-c-cCCceeeeecccceeEEEc
Confidence 4678899999999998887643 3446777763 4 6788888899999999993
No 146
>KOG4714|consensus
Probab=53.08 E-value=9.9 Score=35.88 Aligned_cols=54 Identities=20% Similarity=0.190 Sum_probs=41.8
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+-+|.|++|||...+-..+.- +-|+.+|..+.|.-..++.+ ++.+.||.+=-|.
T Consensus 199 t~dg~~~l~d~rn~~~p~S~l------~ahk~~i~eV~FHpk~p~~L-ft~sedGslw~wd 252 (319)
T KOG4714|consen 199 TDDGIVGLWDARNVAMPVSLL------KAHKAEIWEVHFHPKNPEHL-FTCSEDGSLWHWD 252 (319)
T ss_pred cCCCeEEEEEcccccchHHHH------HHhhhhhhheeccCCCchhe-eEecCCCcEEEEc
Confidence 347899999998874433322 23889999999999888888 6799999987663
No 147
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=52.72 E-value=7 Score=25.27 Aligned_cols=33 Identities=24% Similarity=0.509 Sum_probs=18.6
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS 264 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~ 264 (346)
+.|| +|+..+..+++.-+ .......|++|+...
T Consensus 3 i~CP--~C~~~f~v~~~~l~-~~~~~vrC~~C~~~f 35 (37)
T PF13719_consen 3 ITCP--NCQTRFRVPDDKLP-AGGRKVRCPKCGHVF 35 (37)
T ss_pred EECC--CCCceEEcCHHHcc-cCCcEEECCCCCcEe
Confidence 4688 88765544432101 123358888888754
No 148
>PF13240 zinc_ribbon_2: zinc-ribbon domain
Probab=52.38 E-value=8.9 Score=22.17 Aligned_cols=9 Identities=33% Similarity=0.936 Sum_probs=6.7
Q ss_pred ccccccCCc
Q psy5736 255 KKCESCNST 263 (346)
Q Consensus 255 ~~C~~C~~~ 263 (346)
-.|+.||..
T Consensus 14 ~fC~~CG~~ 22 (23)
T PF13240_consen 14 KFCPNCGTP 22 (23)
T ss_pred cchhhhCCc
Confidence 568888875
No 149
>PF13248 zf-ribbon_3: zinc-ribbon domain
Probab=52.26 E-value=8.7 Score=22.74 Aligned_cols=23 Identities=35% Similarity=0.811 Sum_probs=14.6
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
..|| +|+..+.+. .-.|+.||..
T Consensus 3 ~~Cp--~Cg~~~~~~----------~~fC~~CG~~ 25 (26)
T PF13248_consen 3 MFCP--NCGAEIDPD----------AKFCPNCGAK 25 (26)
T ss_pred CCCc--ccCCcCCcc----------cccChhhCCC
Confidence 4688 887643211 2678889875
No 150
>PF14783 BBS2_Mid: Ciliary BBSome complex subunit 2, middle region
Probab=51.98 E-value=32 Score=28.12 Aligned_cols=47 Identities=19% Similarity=0.400 Sum_probs=33.0
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceecccc--CCCceEEeeecCCceEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNE--HDDSLLLTGYDDGSLSV 61 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine--~~~~lll~g~~dG~vri 61 (346)
.|+|.|||= ..|+=+.. .|.++++|..++. +...-+.+|+++|.|.+
T Consensus 62 NGTVGvY~~--~~RlWRiK--------SK~~~~~~~~~D~~gdG~~eLI~GwsnGkve~ 110 (111)
T PF14783_consen 62 NGTVGVYDR--SQRLWRIK--------SKNQVTSMAFYDINGDGVPELIVGWSNGKVEV 110 (111)
T ss_pred CCEEEEEeC--cceeeeec--------cCCCeEEEEEEcCCCCCceEEEEEecCCeEEe
Confidence 488999854 44554332 2456899998887 44566789999999864
No 151
>PF08666 SAF: SAF domain; InterPro: IPR013974 This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=51.60 E-value=9.5 Score=26.94 Aligned_cols=18 Identities=22% Similarity=0.307 Sum_probs=13.0
Q ss_pred EEEEeeccCCCCceeecc
Q psy5736 166 IVKALRPLKPKEVVAENY 183 (346)
Q Consensus 166 ~v~a~r~I~~Geel~~~Y 183 (346)
++.|.|+|++|+.|+-.-
T Consensus 3 vvVA~~di~~G~~i~~~d 20 (63)
T PF08666_consen 3 VVVAARDIPAGTVITAED 20 (63)
T ss_dssp EEEESSTB-TT-BECTTT
T ss_pred EEEEeCccCCCCEEccCC
Confidence 578999999999996544
No 152
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=51.56 E-value=13 Score=26.82 Aligned_cols=49 Identities=16% Similarity=0.392 Sum_probs=25.7
Q ss_pred ecCCCCCCCccccccccCccccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 208 KCRACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 208 ~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
.|..|-..-+ .+.....|.|| +|+..+...-..-- .......|++||..
T Consensus 11 ~CtSCg~~i~----p~e~~v~F~CP--nCGe~~I~Rc~~CR-k~g~~Y~Cp~CGF~ 59 (61)
T COG2888 11 VCTSCGREIA----PGETAVKFPCP--NCGEVEIYRCAKCR-KLGNPYRCPKCGFE 59 (61)
T ss_pred eeccCCCEec----cCCceeEeeCC--CCCceeeehhhhHH-HcCCceECCCcCcc
Confidence 4666765431 12335788999 89854432210000 00124788888863
No 153
>PF04810 zf-Sec23_Sec24: Sec23/Sec24 zinc finger; InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation []. Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=51.27 E-value=6.5 Score=25.87 Aligned_cols=32 Identities=16% Similarity=0.509 Sum_probs=16.1
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS 264 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~ 264 (346)
++|. .|++.+.|--..+. ....|.|.-|+..-
T Consensus 3 ~rC~--~C~aylNp~~~~~~--~~~~w~C~~C~~~N 34 (40)
T PF04810_consen 3 VRCR--RCRAYLNPFCQFDD--GGKTWICNFCGTKN 34 (40)
T ss_dssp -B-T--TT--BS-TTSEEET--TTTEEEETTT--EE
T ss_pred cccC--CCCCEECCcceEcC--CCCEEECcCCCCcC
Confidence 5788 89988876543211 22469999998853
No 154
>KOG1587|consensus
Probab=51.22 E-value=25 Score=36.84 Aligned_cols=55 Identities=20% Similarity=0.241 Sum_probs=40.0
Q ss_pred cceEEEEecCCCcee--eeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 5 FLKEGIWDHITGSRL--ISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~--~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.|+|-+||-..+.+. ..-+..-. -|...|+.+.+++.....=..++++||.|..|
T Consensus 264 ~GqV~lWD~~~~~~~~~s~ls~~~~---sh~~~v~~vvW~~~~~~~~f~s~ssDG~i~~W 320 (555)
T KOG1587|consen 264 NGQVVLWDLRKGSDTPPSGLSALEV---SHSEPVTAVVWLQNEHNTEFFSLSSDGSICSW 320 (555)
T ss_pred CceEEEEEccCCCCCCCcccccccc---cCCcCeEEEEEeccCCCCceEEEecCCcEeee
Confidence 699999999988773 22222222 36678999998887433445778999999999
No 155
>KOG0299|consensus
Probab=51.00 E-value=28 Score=35.18 Aligned_cols=60 Identities=17% Similarity=0.207 Sum_probs=40.9
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCC--C---CcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARN--S---ARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~--~---~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
-.|+|.+|+.-+.+.+..+.+..+--+.+ . .=||+|-.| ....|+.+|+-+|.||+|+--
T Consensus 346 dnG~IaLWs~~KKkplf~~~~AHgv~~~~~~~~~~~Witsla~i--~~sdL~asGS~~G~vrLW~i~ 410 (479)
T KOG0299|consen 346 DNGSIALWSLLKKKPLFTSRLAHGVIPELDPVNGNFWITSLAVI--PGSDLLASGSWSGCVRLWKIE 410 (479)
T ss_pred CCceEEEeeecccCceeEeeccccccCCccccccccceeeeEec--ccCceEEecCCCCceEEEEec
Confidence 46899999999999988776544321111 0 024555544 456788899999999999644
No 156
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=50.99 E-value=12 Score=25.65 Aligned_cols=30 Identities=20% Similarity=0.538 Sum_probs=19.5
Q ss_pred ccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736 230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD 265 (346)
Q Consensus 230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~ 265 (346)
.|| .|++.+.+.... ....+.|+.||....
T Consensus 2 FCp--~Cg~~l~~~~~~----~~~~~vC~~Cg~~~~ 31 (52)
T smart00661 2 FCP--KCGNMLIPKEGK----EKRRFVCRKCGYEEP 31 (52)
T ss_pred CCC--CCCCccccccCC----CCCEEECCcCCCeEE
Confidence 488 898877665321 112589999998543
No 157
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=50.92 E-value=8.8 Score=33.28 Aligned_cols=10 Identities=20% Similarity=0.597 Sum_probs=5.8
Q ss_pred ccccccCCcC
Q psy5736 255 KKCESCNSTS 264 (346)
Q Consensus 255 ~~C~~C~~~~ 264 (346)
..|+.||..+
T Consensus 129 F~Cp~Cg~~L 138 (158)
T TIGR00373 129 FTCPRCGAML 138 (158)
T ss_pred CcCCCCCCEe
Confidence 5566666654
No 158
>KOG2445|consensus
Probab=50.67 E-value=29 Score=33.55 Aligned_cols=34 Identities=21% Similarity=0.260 Sum_probs=28.4
Q ss_pred CCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 33 SARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 33 ~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
.+.+++++|--.+-+-++.+++.||.+|||...+
T Consensus 112 rssV~DV~FaP~hlGLklA~~~aDG~lRIYEA~d 145 (361)
T KOG2445|consen 112 RSSVTDVKFAPKHLGLKLAAASADGILRIYEAPD 145 (361)
T ss_pred CcceeEEEecchhcceEEEEeccCcEEEEEecCC
Confidence 4578999987778888888999999999997654
No 159
>KOG2096|consensus
Probab=50.64 E-value=14 Score=35.89 Aligned_cols=42 Identities=24% Similarity=0.396 Sum_probs=32.6
Q ss_pred CCCCCcccceeccccCCCceEEeeecCCceEEE--eCCccccchhh
Q psy5736 30 ARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYASHEHRLV 73 (346)
Q Consensus 30 ~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~sh~~~Lv 73 (346)
+-|+.+||+|.|- .|+..+.+.+.|+.||+| ++|+..+++.+
T Consensus 83 KgH~~~vt~~~Fs--SdGK~lat~~~Dr~Ir~w~~~DF~~~eHr~~ 126 (420)
T KOG2096|consen 83 KGHKKEVTDVAFS--SDGKKLATISGDRSIRLWDVRDFENKEHRCI 126 (420)
T ss_pred hccCCceeeeEEc--CCCceeEEEeCCceEEEEecchhhhhhhhHh
Confidence 4588899999864 466777999999999999 78876555443
No 160
>KOG0299|consensus
Probab=50.28 E-value=26 Score=35.42 Aligned_cols=52 Identities=19% Similarity=0.370 Sum_probs=41.3
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
|-+.++.+|+..+..+|.+..+ ...+-+..+||+. - .++|+++|.|-+|...
T Consensus 305 grDrT~rlwKi~eesqlifrg~--------~~sidcv~~In~~--H-fvsGSdnG~IaLWs~~ 356 (479)
T KOG0299|consen 305 GRDRTVRLWKIPEESQLIFRGG--------EGSIDCVAFINDE--H-FVSGSDNGSIALWSLL 356 (479)
T ss_pred cccceeEEEeccccceeeeeCC--------CCCeeeEEEeccc--c-eeeccCCceEEEeeec
Confidence 5789999999999999987543 3457788888853 2 4779999999999665
No 161
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=49.83 E-value=7.7 Score=24.87 Aligned_cols=33 Identities=24% Similarity=0.490 Sum_probs=17.4
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS 264 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~ 264 (346)
+.|| +|+..+....+.-+ .......|++|+..+
T Consensus 3 ~~CP--~C~~~~~v~~~~~~-~~~~~v~C~~C~~~~ 35 (38)
T TIGR02098 3 IQCP--NCKTSFRVVDSQLG-ANGGKVRCGKCGHVW 35 (38)
T ss_pred EECC--CCCCEEEeCHHHcC-CCCCEEECCCCCCEE
Confidence 5677 78765433321100 112247888888764
No 162
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=49.68 E-value=1e+02 Score=22.67 Aligned_cols=38 Identities=21% Similarity=0.294 Sum_probs=29.8
Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc
Q psy5736 273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV 310 (346)
Q Consensus 273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l 310 (346)
+.++..+..+|++.=+.|++++|+..|.+.++.+...+
T Consensus 5 ~~~A~~li~~Av~~d~~g~~~eAl~~Y~~a~e~l~~~~ 42 (77)
T smart00745 5 LSKAKELISKALKADEAGDYEEALELYKKAIEYLLEGI 42 (77)
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHh
Confidence 34555666677766668999999999999999888855
No 163
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=49.61 E-value=73 Score=27.23 Aligned_cols=50 Identities=12% Similarity=0.104 Sum_probs=24.5
Q ss_pred HHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 279 MFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
.+..+......|++++|+..+++.++... .+.....+...++.+|..+|+
T Consensus 38 ~~~lg~~~~~~g~~~~A~~~~~~al~~~~-----~~~~~~~~~~~la~~~~~~g~ 87 (172)
T PRK02603 38 YYRDGMSAQADGEYAEALENYEEALKLEE-----DPNDRSYILYNMGIIYASNGE 87 (172)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHhh-----ccchHHHHHHHHHHHHHHcCC
Confidence 34444444555666666666666655421 111123344555555555554
No 164
>KOG0318|consensus
Probab=49.54 E-value=23 Score=36.47 Aligned_cols=17 Identities=24% Similarity=0.436 Sum_probs=12.7
Q ss_pred ccceEEEEecCCCceee
Q psy5736 4 LFLKEGIWDHITGSRLI 20 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~ 20 (346)
..|+|.||||....++-
T Consensus 79 ~sG~vRIWdtt~~~hiL 95 (603)
T KOG0318|consen 79 VSGKVRIWDTTQKEHIL 95 (603)
T ss_pred CcCcEEEEeccCcceee
Confidence 46899999998755544
No 165
>KOG0318|consensus
Probab=49.42 E-value=37 Score=35.03 Aligned_cols=56 Identities=21% Similarity=0.219 Sum_probs=48.5
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+|-+|+|.|||=.+|.++..+..+.+ |+.-|-+|-+. +|..-+++.+.|..+|+|.
T Consensus 208 ~gsDgki~iyDGktge~vg~l~~~~a----HkGsIfalsWs--PDs~~~~T~SaDkt~KIWd 263 (603)
T KOG0318|consen 208 AGSDGKIYIYDGKTGEKVGELEDSDA----HKGSIFALSWS--PDSTQFLTVSADKTIKIWD 263 (603)
T ss_pred ecCCccEEEEcCCCccEEEEecCCCC----ccccEEEEEEC--CCCceEEEecCCceEEEEE
Confidence 58899999999999999999987654 89999998863 5667778899999999994
No 166
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=49.10 E-value=36 Score=33.32 Aligned_cols=45 Identities=16% Similarity=0.180 Sum_probs=32.8
Q ss_pred ccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 290 SCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 290 ~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
|+..-+...+++.++.+..++-.++-.+.++-+.|..||..+|+.
T Consensus 221 G~v~~~~g~y~~AV~~~e~v~eQn~~yl~evl~~L~~~Y~~lg~~ 265 (389)
T COG2956 221 GRVELAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQLGKP 265 (389)
T ss_pred hHHHHhccchHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHhCCH
Confidence 444444444556666666666678888999999999999999874
No 167
>KOG0286|consensus
Probab=48.48 E-value=32 Score=33.01 Aligned_cols=53 Identities=19% Similarity=0.394 Sum_probs=41.3
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
|-+..-.+||..++.-...|. -|...|.+++|+- ..++. .+|++|+..|+|..
T Consensus 206 ~cD~~aklWD~R~~~c~qtF~-------ghesDINsv~ffP-~G~af-atGSDD~tcRlyDl 258 (343)
T KOG0286|consen 206 GCDKSAKLWDVRSGQCVQTFE-------GHESDINSVRFFP-SGDAF-ATGSDDATCRLYDL 258 (343)
T ss_pred ccccceeeeeccCcceeEeec-------ccccccceEEEcc-CCCee-eecCCCceeEEEee
Confidence 457788999999997777773 3667899999874 33444 78999999999943
No 168
>KOG2111|consensus
Probab=48.44 E-value=45 Score=32.34 Aligned_cols=54 Identities=20% Similarity=0.288 Sum_probs=41.6
Q ss_pred EEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccc
Q psy5736 8 EGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASH 68 (346)
Q Consensus 8 ~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh 68 (346)
|+|||-++|.++.-|.-|-. ...|-+|.| .++.+++.+.|+-|.+.+|......
T Consensus 206 IRIFdt~~g~~l~E~RRG~d-----~A~iy~iaF--Sp~~s~LavsSdKgTlHiF~l~~~~ 259 (346)
T KOG2111|consen 206 IRIFDTEDGTLLQELRRGVD-----RADIYCIAF--SPNSSWLAVSSDKGTLHIFSLRDTE 259 (346)
T ss_pred EEEEEcCCCcEeeeeecCCc-----hheEEEEEe--CCCccEEEEEcCCCeEEEEEeecCC
Confidence 79999999999999876653 345555442 2788888989999999999766543
No 169
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=48.17 E-value=13 Score=22.29 Aligned_cols=23 Identities=26% Similarity=0.696 Sum_probs=14.5
Q ss_pred ccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736 230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS 264 (346)
Q Consensus 230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~ 264 (346)
.|| .|+..+... .-.|+.||...
T Consensus 2 ~CP--~C~~~V~~~----------~~~Cp~CG~~F 24 (26)
T PF10571_consen 2 TCP--ECGAEVPES----------AKFCPHCGYDF 24 (26)
T ss_pred cCC--CCcCCchhh----------cCcCCCCCCCC
Confidence 477 787665322 26788888764
No 170
>PF09577 Spore_YpjB: Sporulation protein YpjB (SpoYpjB); InterPro: IPR014231 Proteins in thie entry, typified by YpjB, are restricted to a subset of the endospore-forming bacteria which includes Bacillus species, but not species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon []. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set.
Probab=47.73 E-value=43 Score=31.02 Aligned_cols=57 Identities=21% Similarity=0.217 Sum_probs=38.2
Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHh
Q psy5736 275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLA 331 (346)
Q Consensus 275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~ 331 (346)
++..+-+++.++.++|++++|.+.++.+-+.|...-...-.....-..+|...|-..
T Consensus 4 eLd~~sd~~lqlvk~~~yeeA~q~l~~fs~~f~~~~~~~~~~t~e~iralT~t~~~a 60 (232)
T PF09577_consen 4 ELDQLSDEALQLVKQGKYEEAKQLLEYFSEQFTSVDFKGRPLTMEEIRALTETIEEA 60 (232)
T ss_pred HHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHhhccccccccCHHHHHHHHHHHHHH
Confidence 445555677888999999999999999998887754443323333444455544443
No 171
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=47.36 E-value=51 Score=19.63 Aligned_cols=26 Identities=19% Similarity=0.068 Sum_probs=19.7
Q ss_pred HHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736 281 YRGIEQMNTSCFREAVESLTKFSDQI 306 (346)
Q Consensus 281 ~~a~~~~~~~~~~~ai~~l~~~l~~~ 306 (346)
..+.-....|++++|++.+++.++.-
T Consensus 6 ~lg~~y~~~~~~~~A~~~~~~a~~~~ 31 (34)
T PF13181_consen 6 NLGKIYEQLGDYEEALEYFEKALELN 31 (34)
T ss_dssp HHHHHHHHTTSHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhC
Confidence 33444566789999999999998764
No 172
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=47.21 E-value=50 Score=29.44 Aligned_cols=48 Identities=13% Similarity=0.104 Sum_probs=40.1
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCcc
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKW 335 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~ 335 (346)
|.-.+..|++.+|+..++++++.+ |.++....+...+...|..+|+.-
T Consensus 173 a~~~~~~g~~~~A~~~~~~al~~~-----p~~~~~~~a~~~l~~~~~~lg~~~ 220 (235)
T TIGR03302 173 ARFYLKRGAYVAAINRFETVVENY-----PDTPATEEALARLVEAYLKLGLKD 220 (235)
T ss_pred HHHHHHcCChHHHHHHHHHHHHHC-----CCCcchHHHHHHHHHHHHHcCCHH
Confidence 344577899999999999999887 556778889999999999999863
No 173
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=47.13 E-value=58 Score=25.27 Aligned_cols=51 Identities=14% Similarity=0.220 Sum_probs=29.3
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
+..+.-|...+..|++++|++.+.++++.-..+ .=..++..|..+|..+|.
T Consensus 23 ~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~------~~~~ar~~ll~~f~~lg~ 73 (90)
T PF14561_consen 23 DARYALADALLAAGDYEEALDQLLELVRRDRDY------EDDAARKRLLDIFELLGP 73 (90)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHCC-TTC------CCCHHHHHHHHHHHHH-T
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccc------cccHHHHHHHHHHHHcCC
Confidence 445566777788899999988887777543221 123345555555555544
No 174
>KOG0315|consensus
Probab=46.41 E-value=30 Score=32.57 Aligned_cols=50 Identities=16% Similarity=0.351 Sum_probs=36.2
Q ss_pred cceEEEEecCCCce--eeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 5 FLKEGIWDHITGSR--LISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 5 ~~~~~~w~~~~~~~--~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
--+|.++|..++.. +-.| .-|.+.++++-|- .|.....+|+.||.+|||.
T Consensus 60 ~qhvRlyD~~S~np~Pv~t~-------e~h~kNVtaVgF~--~dgrWMyTgseDgt~kIWd 111 (311)
T KOG0315|consen 60 NQHVRLYDLNSNNPNPVATF-------EGHTKNVTAVGFQ--CDGRWMYTGSEDGTVKIWD 111 (311)
T ss_pred CCeeEEEEccCCCCCceeEE-------eccCCceEEEEEe--ecCeEEEecCCCceEEEEe
Confidence 34788999887743 2233 3466678887753 5677778899999999993
No 175
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=45.95 E-value=1.5e+02 Score=24.43 Aligned_cols=49 Identities=20% Similarity=0.142 Sum_probs=32.9
Q ss_pred HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
..|...++.|++++|+..|.+++... |.......++..|++++...|+.
T Consensus 53 ~lA~~~~~~g~~~~A~~~l~~~~~~~-----~d~~l~~~a~l~LA~~~~~~~~~ 101 (145)
T PF09976_consen 53 QLAKAAYEQGDYDEAKAALEKALANA-----PDPELKPLARLRLARILLQQGQY 101 (145)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHhhC-----CCHHHHHHHHHHHHHHHHHcCCH
Confidence 34445567788888877777777633 44455666777788888777654
No 176
>TIGR02059 swm_rep_I cyanobacterial long protein repeat. This domain appears in 29 copies in a large (10000 amino protein in Synechococcus sp. WH8102 associated with a novel flagellar system, as one of three different repeats. Similar domains are found in two different large (<3500) proteins of Synechocystis PCC6803.
Probab=45.82 E-value=32 Score=27.61 Aligned_cols=23 Identities=13% Similarity=0.174 Sum_probs=21.4
Q ss_pred CEEEEEEeeccCCCCceeecccc
Q psy5736 163 KNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 163 ~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
+++.+.-.++|+.|++++++|.+
T Consensus 75 ktVTLTL~~~V~~Gq~VTVsYt~ 97 (101)
T TIGR02059 75 TTITLTLAQVVEDGDEVTLSYTK 97 (101)
T ss_pred cEEEEEecccccCCCEEEEEeeC
Confidence 38999999999999999999986
No 177
>KOG0772|consensus
Probab=44.18 E-value=29 Score=35.75 Aligned_cols=56 Identities=25% Similarity=0.351 Sum_probs=36.1
Q ss_pred CcccceEEEEecCCC-ceeeeeCCCCCCCCCCCCc--ccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITG-SRLISWRPDRGSWARNSAR--CTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~--~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
++.+|++.|||.++. .++..|...-. .+.| .|+.. .| .|..+..+|-.||.|.+|.
T Consensus 287 ~s~DgtlRiWdv~~~k~q~qVik~k~~----~g~Rv~~tsC~-~n-rdg~~iAagc~DGSIQ~W~ 345 (641)
T KOG0772|consen 287 CSYDGTLRIWDVNNTKSQLQVIKTKPA----GGKRVPVTSCA-WN-RDGKLIAAGCLDGSIQIWD 345 (641)
T ss_pred ecCCCcEEEEecCCchhheeEEeeccC----CCcccCceeee-cC-CCcchhhhcccCCceeeee
Confidence 478999999999765 44545543221 1233 24433 34 4555568888999999995
No 178
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=43.90 E-value=18 Score=21.27 Aligned_cols=9 Identities=22% Similarity=0.615 Sum_probs=5.3
Q ss_pred cccccccCC
Q psy5736 254 EKKCESCNS 262 (346)
Q Consensus 254 ~~~C~~C~~ 262 (346)
...|+.||+
T Consensus 16 ~f~CPnCG~ 24 (24)
T PF07754_consen 16 PFPCPNCGF 24 (24)
T ss_pred eEeCCCCCC
Confidence 366666663
No 179
>KOG0319|consensus
Probab=43.38 E-value=36 Score=36.38 Aligned_cols=50 Identities=16% Similarity=0.228 Sum_probs=35.9
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
-+.||.||.-++..-+-.|.- |..-+--..|++ .+.-++.+.+||.+++|
T Consensus 525 gD~TvKIW~is~fSClkT~eG-------H~~aVlra~F~~--~~~qliS~~adGliKlW 574 (775)
T KOG0319|consen 525 GDKTVKIWSISTFSCLKTFEG-------HTSAVLRASFIR--NGKQLISAGADGLIKLW 574 (775)
T ss_pred CCceEEEEEeccceeeeeecC-------ccceeEeeeeee--CCcEEEeccCCCcEEEE
Confidence 478999999998888877752 222333334444 55566889999999999
No 180
>KOG0293|consensus
Probab=43.14 E-value=23 Score=35.53 Aligned_cols=55 Identities=9% Similarity=0.208 Sum_probs=39.4
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
.-.|.+||.+..+.+-+|.-++.+ |- -|-+ -++-.++...+.||.|+.|.||..-
T Consensus 416 ~qei~LWDl~e~~lv~kY~Ghkq~---~f-iIrS--CFgg~~~~fiaSGSED~kvyIWhr~ 470 (519)
T KOG0293|consen 416 DQEIHLWDLEENKLVRKYFGHKQG---HF-IIRS--CFGGGNDKFIASGSEDSKVYIWHRI 470 (519)
T ss_pred cCeeEEeecchhhHHHHhhccccc---ce-EEEe--ccCCCCcceEEecCCCceEEEEEcc
Confidence 346899999988777777766542 11 1222 1456888999999999999999443
No 181
>PF02150 RNA_POL_M_15KD: RNA polymerases M/15 Kd subunit; InterPro: IPR001529 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. In archaebacteria, there is generally a single form of RNA polymerase which also consist of an oligomeric assemblage of 10 to 13 polypeptides. It has recently been shown [], [] that small subunits of about 15 kDa, found in polymerase types I and II, are highly conserved. These proteins contain a probable zinc finger in their N-terminal region and a C-terminal zinc ribbon domain (see IPR001222 from INTERPRO).; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 3H0G_I 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I ....
Probab=43.08 E-value=8.4 Score=24.68 Aligned_cols=30 Identities=23% Similarity=0.672 Sum_probs=19.7
Q ss_pred ccCCcCcCCcccccccCCcccccCcccccccCCcCCh
Q psy5736 230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDL 266 (346)
Q Consensus 230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~ 266 (346)
.|| .|++.+.|..+.. . ...|..|+.....
T Consensus 3 FCp--~C~nlL~p~~~~~----~-~~~C~~C~Y~~~~ 32 (35)
T PF02150_consen 3 FCP--ECGNLLYPKEDKE----K-RVACRTCGYEEPI 32 (35)
T ss_dssp BET--TTTSBEEEEEETT----T-TEEESSSS-EEE-
T ss_pred eCC--CCCccceEcCCCc----c-CcCCCCCCCccCC
Confidence 588 9999998875421 1 1269999987544
No 182
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=43.07 E-value=82 Score=29.06 Aligned_cols=51 Identities=14% Similarity=0.065 Sum_probs=42.5
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCcc
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKW 335 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~ 335 (346)
|.+.+..|++++|+++|+.+...|.+- ..+..+..+...|.+|+..+|+.-
T Consensus 185 A~ey~~~g~~~~A~~~l~~~~~~yr~e--gW~~l~~~~l~~l~~Ca~~~~~~~ 235 (247)
T PF11817_consen 185 AEEYFRLGDYDKALKLLEPAASSYRRE--GWWSLLTEVLWRLLECAKRLGDVE 235 (247)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHhC--CcHHHHHHHHHHHHHHHHHhCCHH
Confidence 445577899999999999998888753 566788899999999999999863
No 183
>KOG0289|consensus
Probab=42.79 E-value=47 Score=33.49 Aligned_cols=56 Identities=18% Similarity=0.141 Sum_probs=40.9
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
..+|++..-|.++|..|+.-+-.. ...++|+..|+ +|+-++-+|..||++++|..-
T Consensus 322 s~d~~w~Fsd~~~g~~lt~vs~~~-----s~v~~ts~~fH--pDgLifgtgt~d~~vkiwdlk 377 (506)
T KOG0289|consen 322 SNDGTWAFSDISSGSQLTVVSDET-----SDVEYTSAAFH--PDGLIFGTGTPDGVVKIWDLK 377 (506)
T ss_pred cCCceEEEEEccCCcEEEEEeecc-----ccceeEEeeEc--CCceEEeccCCCceEEEEEcC
Confidence 357889999999999999765431 23567887764 345555667789999999544
No 184
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=42.75 E-value=19 Score=22.52 Aligned_cols=25 Identities=24% Similarity=0.562 Sum_probs=16.9
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
+.|+ .|+-...+.. ..|.|+.||..
T Consensus 2 ~~C~--~CGy~y~~~~--------~~~~CP~Cg~~ 26 (33)
T cd00350 2 YVCP--VCGYIYDGEE--------APWVCPVCGAP 26 (33)
T ss_pred EECC--CCCCEECCCc--------CCCcCcCCCCc
Confidence 5787 8875544332 24999999985
No 185
>KOG0640|consensus
Probab=42.57 E-value=18 Score=35.07 Aligned_cols=55 Identities=7% Similarity=0.159 Sum_probs=45.5
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
-+|.|.+||=-+++-+..|.+-. -+.+|++.+|- .+..++|+-+-|..+++|..-
T Consensus 281 kDG~IklwDGVS~rCv~t~~~AH-----~gsevcSa~Ft--kn~kyiLsSG~DS~vkLWEi~ 335 (430)
T KOG0640|consen 281 KDGAIKLWDGVSNRCVRTIGNAH-----GGSEVCSAVFT--KNGKYILSSGKDSTVKLWEIS 335 (430)
T ss_pred cCCcEEeeccccHHHHHHHHhhc-----CCceeeeEEEc--cCCeEEeecCCcceeeeeeec
Confidence 58999999999999999997664 36899998864 467888888889999999654
No 186
>KOG0645|consensus
Probab=42.46 E-value=29 Score=32.98 Aligned_cols=33 Identities=18% Similarity=0.293 Sum_probs=26.1
Q ss_pred CCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 31 RNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 31 ~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
-|..+|.+++. |+....++++|++||.+++|+.
T Consensus 278 aHe~dVNsV~w-~p~~~~~L~s~~DDG~v~~W~l 310 (312)
T KOG0645|consen 278 AHEVDVNSVQW-NPKVSNRLASGGDDGIVNFWEL 310 (312)
T ss_pred ccccccceEEE-cCCCCCceeecCCCceEEEEEe
Confidence 35667888884 5556788899999999999975
No 187
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=41.85 E-value=1e+02 Score=31.20 Aligned_cols=55 Identities=29% Similarity=0.373 Sum_probs=39.2
Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHH
Q psy5736 273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNC 327 (346)
Q Consensus 273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~ 327 (346)
+..+.+.++.+-+++..|++.+|+..+..+|...--+.........++++.|.-|
T Consensus 201 l~~L~~~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~l~vv~~~~E~~e~~eli~ic 255 (422)
T PF06957_consen 201 LSSLEERLKEGYKLFTAGKFEEAIEIFRSILHSIPLLVVESREEEDEAKELIEIC 255 (422)
T ss_dssp HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHC--BSSCHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhheeeecCHHHHHHHHHHHHHH
Confidence 4445556677888889999999999999998877666677777777777666554
No 188
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=41.76 E-value=17 Score=29.55 Aligned_cols=30 Identities=20% Similarity=0.466 Sum_probs=21.4
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCcCChH
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLT 267 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~ 267 (346)
-.|| +|+..++--+- ...+|++||..+..+
T Consensus 10 R~Cp--~CG~kFYDLnk-------~PivCP~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCP--SCGAKFYDLNK-------DPIVCPKCGTEFPPE 39 (108)
T ss_pred ccCC--CCcchhccCCC-------CCccCCCCCCccCcc
Confidence 4688 89876653321 238899999988776
No 189
>KOG0973|consensus
Probab=41.34 E-value=33 Score=37.91 Aligned_cols=55 Identities=20% Similarity=0.204 Sum_probs=39.4
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
|++++|.|||..+-.++-.+. + |.+-++-+-+ + +.+.++.+-++|+.+++|+..+
T Consensus 148 s~DnsViiwn~~tF~~~~vl~-~------H~s~VKGvs~-D-P~Gky~ASqsdDrtikvwrt~d 202 (942)
T KOG0973|consen 148 SLDNSVIIWNAKTFELLKVLR-G------HQSLVKGVSW-D-PIGKYFASQSDDRTLKVWRTSD 202 (942)
T ss_pred cccceEEEEccccceeeeeee-c------ccccccceEE-C-CccCeeeeecCCceEEEEEccc
Confidence 789999999999986665552 2 3444544331 1 4477888899999999998543
No 190
>PF14276 DUF4363: Domain of unknown function (DUF4363)
Probab=41.05 E-value=1.7e+02 Score=23.71 Aligned_cols=56 Identities=7% Similarity=-0.021 Sum_probs=41.4
Q ss_pred HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccC-----CCcHhHHHHHHHHHHHHHHh
Q psy5736 276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVV-----PPYKLASLAHEALRNCWSLA 331 (346)
Q Consensus 276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~-----p~h~~~~~~~~~L~~~y~~~ 331 (346)
+.+....+.+.++++++++|.+.+.++.+.+.+.-. -.|..+......|+++-+.+
T Consensus 28 i~~~l~~i~~~i~~~dW~~A~~~~~~l~~~W~k~~~~~~~~~~h~eid~i~~sl~rl~~~i 88 (121)
T PF14276_consen 28 IEEQLEQIEEAIENEDWEKAYKETEELEKEWDKNKKRWSILIEHQEIDNIDISLARLKGYI 88 (121)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhchheeeeecHHHHHHHHHHHHHHHHHH
Confidence 334445556677889999999999999988776533 37888888888888776654
No 191
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=40.64 E-value=1.2e+02 Score=24.98 Aligned_cols=39 Identities=23% Similarity=0.429 Sum_probs=22.9
Q ss_pred hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
+..+++++|+..|.. .+....-..+.+.+..+|...|+.
T Consensus 96 ~~~~~~d~Al~~L~~---------~~~~~~~~~~~~~~Gdi~~~~g~~ 134 (145)
T PF09976_consen 96 LQQGQYDEALATLQQ---------IPDEAFKALAAELLGDIYLAQGDY 134 (145)
T ss_pred HHcCCHHHHHHHHHh---------ccCcchHHHHHHHHHHHHHHCCCH
Confidence 445666665555433 233334455677788888887765
No 192
>KOG1539|consensus
Probab=40.19 E-value=45 Score=36.18 Aligned_cols=51 Identities=24% Similarity=0.250 Sum_probs=39.2
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
-+|.+.+||-++.++.+.-.|- |..-++.+.+++ ..+++++++-|..+++|
T Consensus 265 ~~G~m~~wDLe~kkl~~v~~na------h~~sv~~~~fl~--~epVl~ta~~DnSlk~~ 315 (910)
T KOG1539|consen 265 SNGDMAFWDLEKKKLINVTRNA------HYGSVTGATFLP--GEPVLVTAGADNSLKVW 315 (910)
T ss_pred CCceEEEEEcCCCeeeeeeecc------ccCCcccceecC--CCceEeeccCCCceeEE
Confidence 4699999999988887766543 556677777775 56778888888888887
No 193
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=39.56 E-value=62 Score=22.34 Aligned_cols=16 Identities=19% Similarity=-0.002 Sum_probs=9.1
Q ss_pred HHHHHHHHHHHHhCCc
Q psy5736 319 LAHEALRNCWSLAGNK 334 (346)
Q Consensus 319 ~~~~~L~~~y~~~G~~ 334 (346)
.+...++.++...|+.
T Consensus 69 ~~~~~~~~~~~~~~~~ 84 (100)
T cd00189 69 KAYYNLGLAYYKLGKY 84 (100)
T ss_pred hHHHHHHHHHHHHHhH
Confidence 4455566666666553
No 194
>KOG2445|consensus
Probab=39.24 E-value=52 Score=31.84 Aligned_cols=59 Identities=17% Similarity=0.294 Sum_probs=43.4
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
..+++|.|||..++.- .|.+- .+|.-|..-|=.+-+.-+.-+.++.+.+-|+.++||.+
T Consensus 32 SsDq~vkI~d~~~~s~--~W~~T-s~Wrah~~Si~rV~WAhPEfGqvvA~cS~Drtv~iWEE 90 (361)
T KOG2445|consen 32 SSDQTVKIWDSTSDSG--TWSCT-SSWRAHDGSIWRVVWAHPEFGQVVATCSYDRTVSIWEE 90 (361)
T ss_pred cCCCcEEEEeccCCCC--ceEEe-eeEEecCCcEEEEEecCccccceEEEEecCCceeeeee
Confidence 3688999999754422 22222 45788888887777777767888899999999999955
No 195
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=39.17 E-value=22 Score=31.42 Aligned_cols=33 Identities=27% Similarity=0.792 Sum_probs=23.6
Q ss_pred CCceecCCCCCCCccccccccCccccccCCcCcCCccccc
Q psy5736 204 WFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQIIATV 243 (346)
Q Consensus 204 ~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~ 243 (346)
+|.| +.|...+...+.+. ..|.|| .|++.+...
T Consensus 117 ~Y~C--p~C~~rytf~eA~~---~~F~Cp--~Cg~~L~~~ 149 (178)
T PRK06266 117 FFFC--PNCHIRFTFDEAME---YGFRCP--QCGEMLEEY 149 (178)
T ss_pred EEEC--CCCCcEEeHHHHhh---cCCcCC--CCCCCCeec
Confidence 4544 88988776554443 589999 999987654
No 196
>KOG0553|consensus
Probab=38.98 E-value=86 Score=30.18 Aligned_cols=33 Identities=18% Similarity=0.187 Sum_probs=25.9
Q ss_pred HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736 274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQI 306 (346)
Q Consensus 274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~ 306 (346)
..++.+=.++-++|+.++|++|+..|.+.|+.-
T Consensus 79 ~~AE~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~ 111 (304)
T KOG0553|consen 79 ALAESLKNEGNKLMKNKDYQEAVDKYTEAIELD 111 (304)
T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcC
Confidence 345555567788899999999999999998754
No 197
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=38.85 E-value=16 Score=24.30 Aligned_cols=11 Identities=18% Similarity=0.561 Sum_probs=6.3
Q ss_pred ccccccCCcCC
Q psy5736 255 KKCESCNSTSD 265 (346)
Q Consensus 255 ~~C~~C~~~~~ 265 (346)
..|..||..+.
T Consensus 20 ~vC~~CG~Vl~ 30 (43)
T PF08271_consen 20 LVCPNCGLVLE 30 (43)
T ss_dssp EEETTT-BBEE
T ss_pred EECCCCCCEee
Confidence 67777776543
No 198
>PF14803 Nudix_N_2: Nudix N-terminal; PDB: 3CNG_C.
Probab=38.82 E-value=17 Score=23.23 Aligned_cols=30 Identities=17% Similarity=0.406 Sum_probs=13.8
Q ss_pred ccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
.|+ .|++.+...-+.. +....+.|+.||..
T Consensus 2 fC~--~CG~~l~~~ip~g--d~r~R~vC~~Cg~I 31 (34)
T PF14803_consen 2 FCP--QCGGPLERRIPEG--DDRERLVCPACGFI 31 (34)
T ss_dssp B-T--TT--B-EEE--TT---SS-EEEETTTTEE
T ss_pred ccc--cccChhhhhcCCC--CCccceECCCCCCE
Confidence 377 8887765432211 12346899999863
No 199
>KOG1445|consensus
Probab=38.38 E-value=40 Score=35.71 Aligned_cols=60 Identities=27% Similarity=0.264 Sum_probs=43.1
Q ss_pred CcccceEEEEecCCCceeee-eCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 2 TGLFLKEGIWDHITGSRLIS-WRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~-~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
.|--|.|.|++..+--||-. .-.+- -.++-||+|++ |+-|+..+.+|.+||.|++|+.-.
T Consensus 599 ~g~gG~iai~el~~PGrLPDgv~p~l----~Ngt~vtDl~W-dPFD~~rLAVa~ddg~i~lWr~~a 659 (1012)
T KOG1445|consen 599 AGSGGVIAIYELNEPGRLPDGVMPGL----FNGTLVTDLHW-DPFDDERLAVATDDGQINLWRLTA 659 (1012)
T ss_pred cCCCceEEEEEcCCCCCCCccccccc----ccCceeeeccc-CCCChHHeeecccCceEEEEEecc
Confidence 46678999999987656531 11111 14578999985 777777779999999999997553
No 200
>KOG0278|consensus
Probab=38.23 E-value=66 Score=30.39 Aligned_cols=48 Identities=23% Similarity=0.275 Sum_probs=31.8
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
-+++|++||..++.++++..- +..+|+|++.. +..++. -...+.|..|
T Consensus 163 dd~tVRLWD~rTgt~v~sL~~--------~s~VtSlEvs~--dG~ilT-ia~gssV~Fw 210 (334)
T KOG0278|consen 163 DDKTVRLWDHRTGTEVQSLEF--------NSPVTSLEVSQ--DGRILT-IAYGSSVKFW 210 (334)
T ss_pred cCCceEEEEeccCcEEEEEec--------CCCCcceeecc--CCCEEE-EecCceeEEe
Confidence 468999999999999987643 34588888543 334433 3334455555
No 201
>KOG0305|consensus
Probab=37.99 E-value=41 Score=34.56 Aligned_cols=56 Identities=21% Similarity=0.267 Sum_probs=42.2
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeee--cCCceEEEeCC
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGY--DDGSLSVYRNY 65 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~--~dG~vri~r~~ 65 (346)
.|.+..+.|||-.....+-+|.+ |+.-+++|.+ -++...+|.+|. .|+.|++|.-.
T Consensus 319 GgnDN~~~Iwd~~~~~p~~~~~~-------H~aAVKA~aw-cP~q~~lLAsGGGs~D~~i~fwn~~ 376 (484)
T KOG0305|consen 319 GGNDNVVFIWDGLSPEPKFTFTE-------HTAAVKALAW-CPWQSGLLATGGGSADRCIKFWNTN 376 (484)
T ss_pred CCCccceEeccCCCccccEEEec-------cceeeeEeee-CCCccCceEEcCCCcccEEEEEEcC
Confidence 37899999999976666666654 4566888775 457777878776 79999999655
No 202
>KOG1240|consensus
Probab=37.93 E-value=33 Score=38.84 Aligned_cols=38 Identities=24% Similarity=0.478 Sum_probs=26.8
Q ss_pred CCCCCCCCCcccceecccc--------CC-CceEEeeecCCceEEEe
Q psy5736 26 RGSWARNSARCTALTLLNE--------HD-DSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 26 ~~~w~~~~~~~t~l~~ine--------~~-~~lll~g~~dG~vri~r 63 (346)
.+.|.+.|..|+-|....+ .+ .++.++|++||.||+|.
T Consensus 1031 ~~gW~p~G~lVAhL~Ehs~~v~k~a~s~~~~s~FvsgS~DGtVKvW~ 1077 (1431)
T KOG1240|consen 1031 PPGWNPRGILVAHLHEHSSAVIKLAVSSEHTSLFVSGSDDGTVKVWN 1077 (1431)
T ss_pred CCCCCccceEeehhhhccccccceeecCCCCceEEEecCCceEEEee
Confidence 3447777776665544332 33 48999999999999994
No 203
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=37.88 E-value=1.6e+02 Score=25.41 Aligned_cols=24 Identities=4% Similarity=-0.278 Sum_probs=18.0
Q ss_pred CcHhHHHHHHHHHHHHHHhCCccc
Q psy5736 313 PYKLASLAHEALRNCWSLAGNKWV 336 (346)
Q Consensus 313 ~h~~~~~~~~~L~~~y~~~G~~~~ 336 (346)
..+...+.++.+.+.+...|..+.
T Consensus 101 ~~IsveEIqDiVE~~L~~~~~~ay 124 (154)
T PRK00464 101 REVPSKEIGELVMEELKKLDEVAY 124 (154)
T ss_pred CCCCHHHHHHHHHHHHHhcCCEEE
Confidence 357778888888888888777654
No 204
>KOG1446|consensus
Probab=37.00 E-value=69 Score=30.85 Aligned_cols=53 Identities=13% Similarity=0.090 Sum_probs=40.7
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
.-+-+|+--+....+.+-+|. -|+.+|++|. +++.+| ..|.++-|+.||+|..
T Consensus 77 k~d~tIryLsl~dNkylRYF~-------GH~~~V~sL~-~sP~~d-~FlS~S~D~tvrLWDl 129 (311)
T KOG1446|consen 77 KEDDTIRYLSLHDNKYLRYFP-------GHKKRVNSLS-VSPKDD-TFLSSSLDKTVRLWDL 129 (311)
T ss_pred CCCCceEEEEeecCceEEEcC-------CCCceEEEEE-ecCCCC-eEEecccCCeEEeeEe
Confidence 345678888888888887873 3788999988 565554 5588999999999943
No 205
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=36.86 E-value=74 Score=31.11 Aligned_cols=44 Identities=16% Similarity=0.050 Sum_probs=24.6
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
|...+..|++++|+..+.++++.. |.+ ..+...+..+|..+|+.
T Consensus 43 a~~~~~~g~~~eAl~~~~~Al~l~-----P~~---~~a~~~lg~~~~~lg~~ 86 (356)
T PLN03088 43 AQANIKLGNFTEAVADANKAIELD-----PSL---AKAYLRKGTACMKLEEY 86 (356)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHhC-----cCC---HHHHHHHHHHHHHhCCH
Confidence 334455566666666666666542 333 23455666666666654
No 206
>PRK05580 primosome assembly protein PriA; Validated
Probab=36.78 E-value=54 Score=35.16 Aligned_cols=18 Identities=17% Similarity=0.152 Sum_probs=12.3
Q ss_pred HHHHHHHHHHHHHHhCCc
Q psy5736 317 ASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 317 ~~~~~~~L~~~y~~~G~~ 334 (346)
...+.+.+.|..++.|+.
T Consensus 519 ~Er~~~~l~q~~GRagR~ 536 (679)
T PRK05580 519 SERTFQLLTQVAGRAGRA 536 (679)
T ss_pred HHHHHHHHHHHHhhccCC
Confidence 334567788888888773
No 207
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=36.75 E-value=61 Score=22.96 Aligned_cols=40 Identities=18% Similarity=0.170 Sum_probs=22.7
Q ss_pred hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
+..+++++|++.+++.++.. |.+ ..+....+.+|..+|+.
T Consensus 6 ~~~~~~~~A~~~~~~~l~~~-----p~~---~~~~~~~a~~~~~~g~~ 45 (73)
T PF13371_consen 6 LQQEDYEEALEVLERALELD-----PDD---PELWLQRARCLFQLGRY 45 (73)
T ss_pred HhCCCHHHHHHHHHHHHHhC-----ccc---chhhHHHHHHHHHhccH
Confidence 45677777777777777663 222 23344455555555543
No 208
>KOG0285|consensus
Probab=35.94 E-value=28 Score=34.28 Aligned_cols=53 Identities=19% Similarity=0.238 Sum_probs=37.2
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.|.+++|.-||.|..+-+-.|.- |-+-+-+|.+. ++..++++|+.|..+|+|.
T Consensus 211 ~gedk~VKCwDLe~nkvIR~YhG-------HlS~V~~L~lh--PTldvl~t~grDst~RvWD 263 (460)
T KOG0285|consen 211 AGEDKQVKCWDLEYNKVIRHYHG-------HLSGVYCLDLH--PTLDVLVTGGRDSTIRVWD 263 (460)
T ss_pred ecCCCeeEEEechhhhhHHHhcc-------ccceeEEEecc--ccceeEEecCCcceEEEee
Confidence 47899999999998866654431 23333333321 4577889999999999993
No 209
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=35.74 E-value=53 Score=22.78 Aligned_cols=25 Identities=12% Similarity=0.227 Sum_probs=13.5
Q ss_pred ccccccCCcCChHHHHHHHHHHHHH
Q psy5736 255 KKCESCNSTSDLTEIKTKLSELNEM 279 (346)
Q Consensus 255 ~~C~~C~~~~~~~~~~~~l~~~~~~ 279 (346)
..|+.|+..++.+.-...+.+....
T Consensus 21 ~~CPlC~r~l~~e~~~~li~~~~~~ 45 (54)
T PF04423_consen 21 GCCPLCGRPLDEEHRQELIKKYKSE 45 (54)
T ss_dssp EE-TTT--EE-HHHHHHHHHHHHHH
T ss_pred CcCCCCCCCCCHHHHHHHHHHHHHH
Confidence 4899999999887655555444443
No 210
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=35.44 E-value=22 Score=24.75 Aligned_cols=27 Identities=22% Similarity=0.448 Sum_probs=17.1
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
-.|| .|+.++.-.... ..|.|.+||.+
T Consensus 20 ~~CP--rCG~gvfmA~H~------dR~~CGkCgyT 46 (51)
T COG1998 20 RFCP--RCGPGVFMADHK------DRWACGKCGYT 46 (51)
T ss_pred ccCC--CCCCcchhhhcC------ceeEeccccce
Confidence 4688 897544333221 25999999975
No 211
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=35.21 E-value=1.2e+02 Score=28.04 Aligned_cols=44 Identities=7% Similarity=-0.108 Sum_probs=36.0
Q ss_pred HHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhC
Q psy5736 284 IEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAG 332 (346)
Q Consensus 284 ~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G 332 (346)
...+..+++++|+..++++++.+ |.|+....+...+..++..+|
T Consensus 77 ~ayy~~~~y~~A~~~~e~fi~~~-----P~~~~~~~a~Y~~g~~~~~~~ 120 (243)
T PRK10866 77 YAYYKNADLPLAQAAIDRFIRLN-----PTHPNIDYVLYMRGLTNMALD 120 (243)
T ss_pred HHHHhcCCHHHHHHHHHHHHHhC-----cCCCchHHHHHHHHHhhhhcc
Confidence 34467888999999999999876 889988888888888766665
No 212
>KOG1063|consensus
Probab=35.10 E-value=72 Score=34.03 Aligned_cols=54 Identities=15% Similarity=0.094 Sum_probs=40.0
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceecccc-CCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNE-HDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine-~~~~lll~g~~dG~vri~r 63 (346)
|..-+|.|||=++...+. ..+ .|..+|++++++-. .-.+.+++|.+||.|.+|+
T Consensus 31 Ga~~~Iav~dp~k~~i~t-~l~------GH~a~VnC~~~l~~s~~~a~~vsG~sD~~v~lW~ 85 (764)
T KOG1063|consen 31 GAGPAIAVADPEKILIVT-TLD------GHVARVNCVHWLPTSEIVAEMVSGDSDGRVILWK 85 (764)
T ss_pred cCCceEEEeCcccceeEE-ecc------CCccceEEEEEcccccccceEEEccCCCcEEEEE
Confidence 556689999988773333 222 37899999999843 3456789999999999984
No 213
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=35.09 E-value=1.4e+02 Score=26.50 Aligned_cols=46 Identities=15% Similarity=0.127 Sum_probs=33.9
Q ss_pred HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHh
Q psy5736 281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLA 331 (346)
Q Consensus 281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~ 331 (346)
..|......+++++|+..++++++.+ |.|+....+...+..++-..
T Consensus 47 ~la~a~y~~~~y~~A~~~~~~fi~~y-----P~~~~~~~A~Y~~g~~~~~~ 92 (203)
T PF13525_consen 47 MLAYAYYKQGDYEEAIAAYERFIKLY-----PNSPKADYALYMLGLSYYKQ 92 (203)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHH------TT-TTHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHHC-----CCCcchhhHHHHHHHHHHHh
Confidence 33444567789999999999999877 88888888888887776655
No 214
>PF08969 USP8_dimer: USP8 dimerisation domain; InterPro: IPR015063 This domain is predominantly found in the amino terminal region of Ubiquitin carboxyl-terminal hydrolase 8 (USP8). It has no known function. ; PDB: 2XZE_B 2A9U_A.
Probab=35.07 E-value=83 Score=25.39 Aligned_cols=45 Identities=13% Similarity=0.110 Sum_probs=34.3
Q ss_pred HHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHh
Q psy5736 270 KTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKL 316 (346)
Q Consensus 270 ~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~ 316 (346)
...+..+..++++|...+..|+.+.|--+|.+++..+..+ |.|+.
T Consensus 32 ~~y~rsa~~l~~~A~~~~~egd~E~AYvl~~R~~~L~~ki--~~Hpd 76 (115)
T PF08969_consen 32 KRYLRSANKLLREAEEYRQEGDEEQAYVLYMRYLTLVEKI--PKHPD 76 (115)
T ss_dssp HHHHHHHHHHHHHHHHHHHCT-HHHHHHHHHHHHHHHCCH--CCSCC
T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHh--hcCcc
Confidence 3356677788888988899999999999999999999333 44443
No 215
>KOG0300|consensus
Probab=35.03 E-value=31 Score=33.51 Aligned_cols=51 Identities=20% Similarity=0.371 Sum_probs=42.3
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.+-+-.||-.|+|+-|-.|.- |...+.++||.| ...++++|+-|+.-.||+
T Consensus 168 ADhTA~iWs~Esg~CL~~Y~G-------H~GSVNsikfh~--s~~L~lTaSGD~taHIW~ 218 (481)
T KOG0300|consen 168 ADHTARIWSLESGACLATYTG-------HTGSVNSIKFHN--SGLLLLTASGDETAHIWK 218 (481)
T ss_pred cccceeEEeeccccceeeecc-------cccceeeEEecc--ccceEEEccCCcchHHHH
Confidence 356778999999999988742 566799999988 566889999999999986
No 216
>PF01599 Ribosomal_S27: Ribosomal protein S27a; InterPro: IPR002906 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP) (IPR000626 from INTERPRO). The S27a domain compromises the C-terminal half of the protein. The synthesis of ribosomal proteins as extensions of ubiquitin promotes their incorporation into nascent ribosomes by a transient metabolic stabilisation and is required for efficient ribosome biogenesis []. The ribosomal extension protein S27a contains a basic region that is proposed to form a zinc finger; its fusion gene is proposed as a mechanism to maintain a fixed ratio between ubiquitin necessary for degrading proteins and ribosomes a source of proteins [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2K4X_A 3U5C_f 3U5G_f 2XZN_9 2XZM_9.
Probab=35.01 E-value=24 Score=24.31 Aligned_cols=30 Identities=23% Similarity=0.597 Sum_probs=18.6
Q ss_pred cccccCCcCcCCcccccccCCcccccCcccccccCC
Q psy5736 227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNS 262 (346)
Q Consensus 227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~ 262 (346)
..-.||++.|+.++.-... ...+.|.+|+.
T Consensus 17 ~rk~CP~~~CG~GvFMA~H------~dR~~CGKCg~ 46 (47)
T PF01599_consen 17 LRKECPSPRCGAGVFMAEH------KDRHYCGKCGY 46 (47)
T ss_dssp SSEE-TSTTTTSSSEEEE-------SSEEEETTTSS
T ss_pred hhhcCCCcccCCceEeeec------CCCccCCCccc
Confidence 4567998899865433322 12599999985
No 217
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=34.87 E-value=83 Score=28.67 Aligned_cols=51 Identities=22% Similarity=0.381 Sum_probs=37.2
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.++.+.+||-..+..+..+.. |...++.+.+. +....+.++++.||.+++|
T Consensus 176 ~~~~~~~~~~~~~~~~~~~~~-------~~~~v~~~~~~-~~~~~~~~~~~~d~~i~~w 226 (466)
T COG2319 176 LDGTIKLWDLRTGKPLSTLAG-------HTDPVSSLAFS-PDGGLLIASGSSDGTIRLW 226 (466)
T ss_pred CCCceEEEEcCCCceEEeecc-------CCCceEEEEEc-CCcceEEEEecCCCcEEEE
Confidence 378999999998766665543 45688999988 3333354545899999988
No 218
>KOG0649|consensus
Probab=34.72 E-value=73 Score=30.01 Aligned_cols=55 Identities=18% Similarity=0.168 Sum_probs=37.9
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
.|-++.+.-||.|+|+---.|. -|.-.+-++-.=|. ... .++|+.||.+|+|..-
T Consensus 132 AgGD~~~y~~dlE~G~i~r~~r-------GHtDYvH~vv~R~~-~~q-ilsG~EDGtvRvWd~k 186 (325)
T KOG0649|consen 132 AGGDGVIYQVDLEDGRIQREYR-------GHTDYVHSVVGRNA-NGQ-ILSGAEDGTVRVWDTK 186 (325)
T ss_pred ecCCeEEEEEEecCCEEEEEEc-------CCcceeeeeeeccc-Ccc-eeecCCCccEEEEecc
Confidence 4668999999999996555552 25556666553333 233 3689999999999543
No 219
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=34.66 E-value=1e+02 Score=19.14 Aligned_cols=22 Identities=9% Similarity=0.119 Sum_probs=16.5
Q ss_pred hccccHHHHHHHHHHHHHHhhc
Q psy5736 287 MNTSCFREAVESLTKFSDQIHE 308 (346)
Q Consensus 287 ~~~~~~~~ai~~l~~~l~~~~~ 308 (346)
...|++++|++++++.|..-.+
T Consensus 10 ~~~g~~~~Ai~~y~~aL~l~~~ 31 (36)
T PF13176_consen 10 RQQGDYEKAIEYYEQALALARD 31 (36)
T ss_dssp HHCT-HHHHHHHHHHHHHHHHH
T ss_pred HHcCCHHHHHHHHHHHHHhccc
Confidence 4578999999999998865443
No 220
>PRK10454 PTS system N,N'-diacetylchitobiose-specific transporter subunit IIA; Provisional
Probab=34.64 E-value=52 Score=27.09 Aligned_cols=32 Identities=13% Similarity=0.208 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736 275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQI 306 (346)
Q Consensus 275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~ 306 (346)
.+...+-+|++.++.|++++|-+.+.+.-+.+
T Consensus 30 ~ArS~~~eAl~~Ak~gdfe~A~~~l~eA~e~l 61 (115)
T PRK10454 30 QARSLAYAALKQAKQGDFAAAKAMMDQSRMAL 61 (115)
T ss_pred HHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 45566778899999999999877776655443
No 221
>KOG0647|consensus
Probab=34.49 E-value=65 Score=31.11 Aligned_cols=58 Identities=12% Similarity=0.204 Sum_probs=37.7
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS 67 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s 67 (346)
.-+|+|++|+.+..-++.- .-++ .|...+-+..+.+ |.+...+|+.||.+++|..-.+
T Consensus 47 SWD~tVR~wevq~~g~~~~-ka~~----~~~~PvL~v~Wsd--dgskVf~g~~Dk~~k~wDL~S~ 104 (347)
T KOG0647|consen 47 SWDGTVRIWEVQNSGQLVP-KAQQ----SHDGPVLDVCWSD--DGSKVFSGGCDKQAKLWDLASG 104 (347)
T ss_pred ccCCceEEEEEecCCcccc-hhhh----ccCCCeEEEEEcc--CCceEEeeccCCceEEEEccCC
Confidence 4589999999988534432 1111 2444555555543 4466678999999999976644
No 222
>PF12773 DZR: Double zinc ribbon
Probab=34.40 E-value=32 Score=23.23 Aligned_cols=41 Identities=22% Similarity=0.542 Sum_probs=25.2
Q ss_pred cCCCCCCCccccccccCccccccCCcCcCCcccccccCCcccccCcccccccCCcCCh
Q psy5736 209 CRACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDL 266 (346)
Q Consensus 209 C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~ 266 (346)
|+.|..+-+ ..+-.|+ .|+..+.... ...+.|+.|+..+..
T Consensus 1 Cp~Cg~~~~--------~~~~fC~--~CG~~l~~~~-------~~~~~C~~Cg~~~~~ 41 (50)
T PF12773_consen 1 CPHCGTPNP--------DDAKFCP--HCGTPLPPPD-------QSKKICPNCGAENPP 41 (50)
T ss_pred CCCcCCcCC--------ccccCCh--hhcCChhhcc-------CCCCCCcCCcCCCcC
Confidence 566766432 2456788 8887665111 124889999987553
No 223
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=34.09 E-value=1.1e+02 Score=27.37 Aligned_cols=59 Identities=7% Similarity=-0.023 Sum_probs=30.6
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.+|+|.+||.++++.+..+.-..+...........+. ++++...+.++...++.+.+|.
T Consensus 177 ~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~-~s~dg~~~~~~~~~~~~i~v~d 235 (300)
T TIGR03866 177 IGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIK-LTKDGKTAFVALGPANRVAVVD 235 (300)
T ss_pred CCCEEEEEEcCcceeeeeeeecccccccccCCccceE-ECCCCCEEEEEcCCCCeEEEEE
Confidence 4689999999998766554322111111111122333 2334444444445566788774
No 224
>PF01485 IBR: IBR domain; InterPro: IPR002867 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a cysteine-rich (C6HC) zinc finger domain that is present in Triad1, and which is conserved in other proteins encoded by various eukaryotes. The C6HC consensus pattern is: C-x(4)-C-x(14-30)-C-x(1-4)-C-x(4)-C-x(2)-C-x(4)-H-x(4)-C The C6HC zinc finger motif is the fourth family member of the zinc-binding RING, LIM, and LAP/PHD fingers. Strikingly, in most of the proteins the C6HC domain is flanked by two RING finger structures IPR001841 from INTERPRO. The novel C6HC motif has been called DRIL (double RING finger linked). The strong conservation of the larger tripartite TRIAD (twoRING fingers and DRIL) structure indicates that the three subdomains are functionally linked and identifies a novel class of proteins []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2CT7_A 1WD2_A 2JMO_A 1WIM_A.
Probab=34.07 E-value=27 Score=24.49 Aligned_cols=31 Identities=26% Similarity=0.506 Sum_probs=14.5
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
..||++.|..++........ ....|+.|+..
T Consensus 19 ~~Cp~~~C~~~~~~~~~~~~----~~~~C~~C~~~ 49 (64)
T PF01485_consen 19 RWCPNPDCEYIIEKDDGCNS----PIVTCPSCGTE 49 (64)
T ss_dssp C--TTSST---ECS-SSTTS------CCTTSCCSE
T ss_pred cCCCCCCCcccEEecCCCCC----CeeECCCCCCc
Confidence 48998889887765543211 12678877764
No 225
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=33.97 E-value=93 Score=20.14 Aligned_cols=29 Identities=14% Similarity=-0.145 Sum_probs=22.5
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHh
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKL 316 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~ 316 (346)
|...+..|++++|++.|++.++.. |.++.
T Consensus 8 a~~~~~~G~~~~A~~~~~~~l~~~-----P~~~~ 36 (44)
T PF13428_consen 8 ARAYRRLGQPDEAERLLRRALALD-----PDDPE 36 (44)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHHC-----cCCHH
Confidence 344567899999999999999864 66654
No 226
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=33.30 E-value=11 Score=32.07 Aligned_cols=10 Identities=20% Similarity=0.584 Sum_probs=6.0
Q ss_pred ccccccCCcC
Q psy5736 255 KKCESCNSTS 264 (346)
Q Consensus 255 ~~C~~C~~~~ 264 (346)
..|+.||..+
T Consensus 124 f~Cp~Cg~~l 133 (147)
T smart00531 124 FTCPRCGEEL 133 (147)
T ss_pred EECCCCCCEE
Confidence 5666666654
No 227
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=32.58 E-value=31 Score=22.10 Aligned_cols=31 Identities=23% Similarity=0.722 Sum_probs=16.7
Q ss_pred ccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 228 RIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 228 ~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
.++|+ .|+..+....... ......|+.||..
T Consensus 5 ~y~C~--~Cg~~fe~~~~~~---~~~~~~CP~Cg~~ 35 (41)
T smart00834 5 EYRCE--DCGHTFEVLQKIS---DDPLATCPECGGD 35 (41)
T ss_pred EEEcC--CCCCEEEEEEecC---CCCCCCCCCCCCc
Confidence 46777 7776443221110 0224778899874
No 228
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=32.37 E-value=1.4e+02 Score=20.06 Aligned_cols=37 Identities=14% Similarity=0.027 Sum_probs=25.4
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHH
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALR 325 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~ 325 (346)
|...++.|+.+.|++.+++.+.. .+.+.-..|+..|.
T Consensus 6 A~ayie~Gd~e~Ar~lL~evl~~------~~~~q~~eA~~LL~ 42 (44)
T TIGR03504 6 ARAYIEMGDLEGARELLEEVIEE------GDEAQRQEARALLA 42 (44)
T ss_pred HHHHHHcCChHHHHHHHHHHHHc------CCHHHHHHHHHHHh
Confidence 44567889999999999998852 34445555555554
No 229
>KOG1274|consensus
Probab=32.34 E-value=32 Score=37.55 Aligned_cols=37 Identities=19% Similarity=0.217 Sum_probs=25.6
Q ss_pred CcccceEEEEecC------CCceeeeeCCCCCCCCCCCCcccceeccc
Q psy5736 2 TGLFLKEGIWDHI------TGSRLISWRPDRGSWARNSARCTALTLLN 43 (346)
Q Consensus 2 ~~~~~~~~~w~~~------~~~~~~~~~~~~~~w~~~~~~~t~l~~in 43 (346)
.+++|.|.||||+ ...++| ..+|.+....||-+..++
T Consensus 250 s~~~g~I~vWnv~t~~~~~~~~~Vc-----~~aw~p~~n~it~~~~~g 292 (933)
T KOG1274|consen 250 STLDGQILVWNVDTHERHEFKRAVC-----CEAWKPNANAITLITALG 292 (933)
T ss_pred eccCCcEEEEecccchhccccceeE-----EEecCCCCCeeEEEeecc
Confidence 3689999999999 666666 355766666666555443
No 230
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=31.97 E-value=2.2e+02 Score=23.38 Aligned_cols=16 Identities=13% Similarity=-0.045 Sum_probs=8.1
Q ss_pred cccHHHHHHHHHHHHH
Q psy5736 289 TSCFREAVESLTKFSD 304 (346)
Q Consensus 289 ~~~~~~ai~~l~~~l~ 304 (346)
.|+.++|+..|++.+.
T Consensus 14 ~G~~~~Ai~~Y~~Al~ 29 (120)
T PF12688_consen 14 LGREEEAIPLYRRALA 29 (120)
T ss_pred cCCHHHHHHHHHHHHH
Confidence 4455555555555544
No 231
>KOG0321|consensus
Probab=31.92 E-value=61 Score=34.21 Aligned_cols=60 Identities=20% Similarity=0.267 Sum_probs=34.8
Q ss_pred cccceEEEEecCCCc--eeeeeC-------CC-----------CCCCCCCCCcccc-eeccccCCCceEEeeec-CCceE
Q psy5736 3 GLFLKEGIWDHITGS--RLISWR-------PD-----------RGSWARNSARCTA-LTLLNEHDDSLLLTGYD-DGSLS 60 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~--~~~~~~-------~~-----------~~~w~~~~~~~t~-l~~ine~~~~lll~g~~-dG~vr 60 (346)
|-+|.|-|||..-.. .+-.|. |. .+.|..+..-|++ +..+=-.|+..+.+++. ||.|+
T Consensus 164 gRDg~illWD~R~n~~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~ti~ssvTvv~fkDe~tlaSaga~D~~iK 243 (720)
T KOG0321|consen 164 GRDGEILLWDCRCNGVDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNTIFSSVTVVLFKDESTLASAGAADSTIK 243 (720)
T ss_pred cCCCcEEEEEEeccchhhHHHHhhhhhccccCCCCCCchhhccccccccccCceeeeeEEEEEeccceeeeccCCCcceE
Confidence 678999999995443 222222 22 1226666666655 22222245555555554 99999
Q ss_pred EE
Q psy5736 61 VY 62 (346)
Q Consensus 61 i~ 62 (346)
+|
T Consensus 244 VW 245 (720)
T KOG0321|consen 244 VW 245 (720)
T ss_pred EE
Confidence 99
No 232
>KOG0295|consensus
Probab=31.53 E-value=84 Score=31.06 Aligned_cols=51 Identities=22% Similarity=0.272 Sum_probs=37.0
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
-+++|.+||..++.-|-... . |..=|..+-|. +.+.+++...+|+.+|+|.
T Consensus 312 rDktIk~wdv~tg~cL~tL~--g-----hdnwVr~~af~--p~Gkyi~ScaDDktlrvwd 362 (406)
T KOG0295|consen 312 RDKTIKIWDVSTGMCLFTLV--G-----HDNWVRGVAFS--PGGKYILSCADDKTLRVWD 362 (406)
T ss_pred ccceEEEEeccCCeEEEEEe--c-----ccceeeeeEEc--CCCeEEEEEecCCcEEEEE
Confidence 47899999999996655442 1 44555554432 4578889999999999993
No 233
>KOG2376|consensus
Probab=31.42 E-value=83 Score=33.09 Aligned_cols=67 Identities=13% Similarity=0.092 Sum_probs=50.5
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCC-------cHhHHHHHHHHHHHHHHhCCccccCccccc
Q psy5736 277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPP-------YKLASLAHEALRNCWSLAGNKWVIPENYSQ 343 (346)
Q Consensus 277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~-------h~~~~~~~~~L~~~y~~~G~~~~~~~~~~~ 343 (346)
+..|+.|...++.|++.+|++.|++.++...+-|-.. --.+..++-.|+..+-.+|.+-..-++|.+
T Consensus 176 el~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~ 249 (652)
T KOG2376|consen 176 ELLYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVD 249 (652)
T ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHhhcccccchhhHHHHHHHHHHHHHHHHHHhcchHHHHHHHHH
Confidence 4568889999999999999999999987777655432 124666777888888888988655554444
No 234
>KOG0316|consensus
Probab=31.29 E-value=82 Score=29.55 Aligned_cols=69 Identities=13% Similarity=0.163 Sum_probs=48.1
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccccchhhhhhhhhcccc
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASHEHRLVTSFQALTDTS 83 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh~~~Lv~a~r~L~~~~ 83 (346)
.+|++..||...|+....+ .+..||++++- .|....|+|+-|+.+|+... .-.+|....+..+.++
T Consensus 163 ~DGtvRtydiR~G~l~sDy---------~g~pit~vs~s--~d~nc~La~~l~stlrLlDk---~tGklL~sYkGhkn~e 228 (307)
T KOG0316|consen 163 VDGTVRTYDIRKGTLSSDY---------FGHPITSVSFS--KDGNCSLASSLDSTLRLLDK---ETGKLLKSYKGHKNME 228 (307)
T ss_pred cCCcEEEEEeecceeehhh---------cCCcceeEEec--CCCCEEEEeeccceeeeccc---chhHHHHHhcccccce
Confidence 5899999999999766643 24457777754 46777888998998887632 2246666666666666
Q ss_pred ccc
Q psy5736 84 FVN 86 (346)
Q Consensus 84 y~~ 86 (346)
|..
T Consensus 229 ykl 231 (307)
T KOG0316|consen 229 YKL 231 (307)
T ss_pred eee
Confidence 653
No 235
>KOG1407|consensus
Probab=31.16 E-value=70 Score=30.33 Aligned_cols=55 Identities=13% Similarity=0.146 Sum_probs=36.8
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
.++++.||+-+..+++.... +. .|+.-+-.|-.--.+++ ++++++.|..+|+|..
T Consensus 40 ~dktv~v~n~e~~r~~~~~~--~~---gh~~svdql~w~~~~~d-~~atas~dk~ir~wd~ 94 (313)
T KOG1407|consen 40 FDKTVSVWNLERDRFRKELV--YR---GHTDSVDQLCWDPKHPD-LFATASGDKTIRIWDI 94 (313)
T ss_pred cCCceEEEEecchhhhhhhc--cc---CCCcchhhheeCCCCCc-ceEEecCCceEEEEEe
Confidence 46899999999885544332 22 25555655554444444 5588999999999943
No 236
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=30.86 E-value=2.3e+02 Score=21.70 Aligned_cols=51 Identities=14% Similarity=0.158 Sum_probs=36.2
Q ss_pred HHHhccccHHHHHHHHHHHHHHhhcccCCC-cHhHHHHHHHHHHHHHHhCCc
Q psy5736 284 IEQMNTSCFREAVESLTKFSDQIHELVVPP-YKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 284 ~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~-h~~~~~~~~~L~~~y~~~G~~ 334 (346)
...+..+++.+|++.+.++.+....-..+. ...+..+...++......|+.
T Consensus 6 ~~~~~~~dy~~A~d~L~~~fD~~~~~~~~~~~~~~~~all~lA~~~~~~G~~ 57 (94)
T PF12862_consen 6 LNALRSGDYSEALDALHRYFDYAKQSNNSSSNSGLAYALLNLAELHRRFGHY 57 (94)
T ss_pred HHHHHcCCHHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHhCCH
Confidence 455678899999999999888777666555 233445556677777777764
No 237
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=30.78 E-value=1.9e+02 Score=20.19 Aligned_cols=47 Identities=13% Similarity=0.176 Sum_probs=34.2
Q ss_pred HHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHH
Q psy5736 279 MFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSL 330 (346)
Q Consensus 279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~ 330 (346)
+|--|+.....|++++|+++...+|+.= |.|.-....+..+..-...
T Consensus 4 lY~lAig~ykl~~Y~~A~~~~~~lL~~e-----P~N~Qa~~L~~~i~~~i~k 50 (53)
T PF14853_consen 4 LYYLAIGHYKLGEYEKARRYCDALLEIE-----PDNRQAQSLKELIEDKIQK 50 (53)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHT-----TS-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHhhC-----CCcHHHHHHHHHHHHHHhc
Confidence 3455677788899999999999999864 8888877777777655443
No 238
>PF10058 DUF2296: Predicted integral membrane metal-binding protein (DUF2296); InterPro: IPR019273 This domain, found mainly in the eukaryotic lunapark proteins, has no known function [].
Probab=30.74 E-value=29 Score=24.50 Aligned_cols=33 Identities=21% Similarity=0.534 Sum_probs=21.9
Q ss_pred CccccccCCcCcC--CcccccccCCcccccCcccccccCCc
Q psy5736 225 YPIRIRCSNDNCG--QIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 225 ~~~~~~C~~~~C~--g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
...++.|+ +|. +++.|..+.. ...|.|+.||..
T Consensus 19 ~r~aLIC~--~C~~hNGla~~~~~~----~i~y~C~~Cg~~ 53 (54)
T PF10058_consen 19 NRYALICS--KCFSHNGLAPKEEFE----EIQYRCPYCGAL 53 (54)
T ss_pred CceeEECc--ccchhhcccccccCC----ceEEEcCCCCCc
Confidence 35789999 895 4555533221 246999999863
No 239
>KOG0547|consensus
Probab=30.55 E-value=96 Score=32.06 Aligned_cols=59 Identities=14% Similarity=0.141 Sum_probs=42.3
Q ss_pred HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccccCc
Q psy5736 273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWVIPE 339 (346)
Q Consensus 273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~~~~ 339 (346)
+..+.++-.++-.....++|++||++|...|+...+- +.+ ..-++.||..+|++....|
T Consensus 112 ~k~A~~lK~~GN~~f~~kkY~eAIkyY~~AI~l~p~e-----piF---YsNraAcY~~lgd~~~Vie 170 (606)
T KOG0547|consen 112 LKYAAALKTKGNKFFRNKKYDEAIKYYTQAIELCPDE-----PIF---YSNRAACYESLGDWEKVIE 170 (606)
T ss_pred HHHHHHHHhhhhhhhhcccHHHHHHHHHHHHhcCCCC-----chh---hhhHHHHHHHHhhHHHHHH
Confidence 4555555566777788999999999999999987553 222 2456788888888755433
No 240
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=30.42 E-value=35 Score=24.19 Aligned_cols=31 Identities=19% Similarity=0.486 Sum_probs=20.4
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD 265 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~ 265 (346)
+.|| .|+..+..+.+. ......|+.||..+.
T Consensus 3 ~~CP--~CG~~iev~~~~----~GeiV~Cp~CGaele 33 (54)
T TIGR01206 3 FECP--DCGAEIELENPE----LGELVICDECGAELE 33 (54)
T ss_pred cCCC--CCCCEEecCCCc----cCCEEeCCCCCCEEE
Confidence 5788 888776554321 134578999998653
No 241
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=30.42 E-value=27 Score=28.60 Aligned_cols=32 Identities=22% Similarity=0.532 Sum_probs=23.3
Q ss_pred ccCCcCcCCcccccccCCcccccCcccccccCCcCChH
Q psy5736 230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLT 267 (346)
Q Consensus 230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~ 267 (346)
.|| .|++.+.|..... +..++|.+||......
T Consensus 4 FCp--~Cgsll~p~~~~~----~~~l~C~kCgye~~~~ 35 (113)
T COG1594 4 FCP--KCGSLLYPKKDDE----GGKLVCRKCGYEEEAS 35 (113)
T ss_pred ccC--CccCeeEEeEcCC----CcEEECCCCCcchhcc
Confidence 588 9999998875421 2369999999975543
No 242
>KOG0284|consensus
Probab=30.42 E-value=37 Score=33.95 Aligned_cols=58 Identities=21% Similarity=0.243 Sum_probs=37.7
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccccch
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASHEHR 71 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh~~~ 71 (346)
+|.|.+|+=.-.--.. + +. -|...|++|.|- +.|+-.+++++||.|+||.-.+.++.+
T Consensus 159 gG~iKyWqpnmnnVk~-~--~a----hh~eaIRdlafS--pnDskF~t~SdDg~ikiWdf~~~kee~ 216 (464)
T KOG0284|consen 159 GGMIKYWQPNMNNVKI-I--QA----HHAEAIRDLAFS--PNDSKFLTCSDDGTIKIWDFRMPKEER 216 (464)
T ss_pred CceEEecccchhhhHH-h--hH----hhhhhhheeccC--CCCceeEEecCCCeEEEEeccCCchhh
Confidence 6788888765331111 1 11 244678888753 478888999999999999655545433
No 243
>KOG0543|consensus
Probab=30.27 E-value=1.5e+02 Score=29.77 Aligned_cols=49 Identities=16% Similarity=0.140 Sum_probs=37.0
Q ss_pred HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHH
Q psy5736 277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSL 330 (346)
Q Consensus 277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~ 330 (346)
..+|+++...+..++++.|+..|++.++.. |.|.....-.-.|.+.+.+
T Consensus 292 KALyRrG~A~l~~~e~~~A~~df~ka~k~~-----P~Nka~~~el~~l~~k~~~ 340 (397)
T KOG0543|consen 292 KALYRRGQALLALGEYDLARDDFQKALKLE-----PSNKAARAELIKLKQKIRE 340 (397)
T ss_pred hHHHHHHHHHHhhccHHHHHHHHHHHHHhC-----CCcHHHHHHHHHHHHHHHH
Confidence 467888888899999999999999999876 7777655555555554443
No 244
>KOG1036|consensus
Probab=30.10 E-value=55 Score=31.51 Aligned_cols=45 Identities=24% Similarity=0.269 Sum_probs=33.7
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeec
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYD 55 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~ 55 (346)
.|-+|-|.+||-...+||+.|.-- .+.|.+|-| + +|..++.+|++
T Consensus 250 gGsDG~V~~Wd~~~rKrl~q~~~~-------~~SI~slsf-s-~dG~~LAia~s 294 (323)
T KOG1036|consen 250 GGSDGIVNIWDLFNRKRLKQLAKY-------ETSISSLSF-S-MDGSLLAIASS 294 (323)
T ss_pred cCCCceEEEccCcchhhhhhccCC-------CCceEEEEe-c-cCCCeEEEEec
Confidence 377999999999999999987422 355777664 2 67777787773
No 245
>PF14828 Amnionless: Amnionless
Probab=30.10 E-value=29 Score=35.19 Aligned_cols=31 Identities=19% Similarity=0.530 Sum_probs=23.0
Q ss_pred ccc--ccCCcCcCCcccccccCCcccccCcccccccCCcCChH
Q psy5736 227 IRI--RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLT 267 (346)
Q Consensus 227 ~~~--~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~ 267 (346)
..+ +|+.+.|..++.|.. -.|..||..+..+
T Consensus 207 s~v~~~C~~~~C~~pl~P~G----------hCC~iCGa~v~~~ 239 (437)
T PF14828_consen 207 SNVLQRCPKPHCRSPLRPEG----------HCCPICGAIVTLE 239 (437)
T ss_pred HHhhCcCCCCccCCCCCCCC----------CchhhcceEEEEe
Confidence 445 899999988877652 6789999876543
No 246
>PF02748 PyrI_C: Aspartate carbamoyltransferase regulatory chain, metal binding domain; InterPro: IPR020542 Aspartate carbamoyltransferase (aspartate transcarbamylase, ATCase) 2.1.3.2 from EC is an allosteric enzyme that plays a central role in the regulation of the pyrimidine pathway in bacteria. The holoenzyme is a dodecamer composed of six catalytic chains, each with an active site, and six regulatory chains lacking catalytic activity []. The catalytic subunits exist as a dimer of catalytic trimers, (c3)2, while the regulatory subunits exist as a trimer of regulatory dimers, (r2)3, therefore the complete holoenzyme can be represented as (c3)2(r2)3. The association of the catalytic subunits c3 with the regulatory subunits r2 is responsible for the establishment of positive co-operativity between catalytic sites for the binding of aspartate and it dictates the pattern of allosteric response toward nucleotide effectors. ATCase from Escherichia coli is the most extensively studied allosteric enzyme []. The crystal structure of the T-state, the T-state with CTP bound, the R-state with N-phosphonacetyl-L-aspartate (PALA) bound, and the R-state with phosphonoacetamide plus malonate bound have been used in interpreting kinetic and mutational studies. A high-resolution structure of E. coli ATCase in the presence of PALA (a bisubstrate analog) allows a detailed description of the binding at the active site of the enzyme and allows a detailed model of the tetrahedral intermediate to be constructed. The entire regulatory chain has been traced showing that the N-terminal regions of the regulatory chains R1 and R6 are located in close proximity to each other and to the regulatory site. This portion of the molecule may be involved in the observed asymmetry between the regulatory binding sites as well as in the heterotropic response of the enzyme []. The C-terminal domain of the regulatory chains have a rubredoxin-like zinc-bound fold. ATCase from Enterobacter agglomerans (Erwinia herbicola) (Pantoea agglomerans) differs from the other investigated enterobacterial ATCases by its absence of homotropic co-operativity toward the substrate aspartate and its lack of response to ATP which is an allosteric effector (activator) of this family of enzymes. Nevertheless, the E. herbicola ATCase has the same quaternary structure, two trimers of catalytic chains with three dimers of regulatory chains, (c3)2(r2)3, as other enterobacterial ATCases and shows extensive primary structure conservation []. This entry represents the C-terminal domain.; PDB: 2YWW_B 1SKU_D 1Q95_L 8ATC_B 3AT1_D 1RAI_D 4E2F_D 1NBE_B 6AT1_B 2FZC_D ....
Probab=30.05 E-value=45 Score=23.37 Aligned_cols=17 Identities=24% Similarity=0.579 Sum_probs=11.7
Q ss_pred CcccccccCCcCChHHH
Q psy5736 253 VEKKCESCNSTSDLTEI 269 (346)
Q Consensus 253 ~~~~C~~C~~~~~~~~~ 269 (346)
....|.-|++.+..+++
T Consensus 34 ~~~rC~YCe~~~~~~eI 50 (52)
T PF02748_consen 34 IKLRCHYCERIITEDEI 50 (52)
T ss_dssp CEEEETTT--EEEHHHH
T ss_pred CEEEeeCCCCEecccEE
Confidence 45899999998887664
No 247
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=29.81 E-value=1.4e+02 Score=21.44 Aligned_cols=26 Identities=23% Similarity=0.186 Sum_probs=19.6
Q ss_pred HHHHhccccHHHHHHHHHHHHHHhhc
Q psy5736 283 GIEQMNTSCFREAVESLTKFSDQIHE 308 (346)
Q Consensus 283 a~~~~~~~~~~~ai~~l~~~l~~~~~ 308 (346)
|.-....|++++|++++++.++.+++
T Consensus 53 g~~~~~~g~~~~A~~~~~~al~i~~k 78 (78)
T PF13424_consen 53 GECYYRLGDYEEALEYYQKALDIFEK 78 (78)
T ss_dssp HHHHHHTTHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHcCCHHHHHHHHHHHHhhhcC
Confidence 33345678899999999999887753
No 248
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=29.62 E-value=1.2e+02 Score=27.55 Aligned_cols=54 Identities=24% Similarity=0.370 Sum_probs=37.0
Q ss_pred cccceEEEEecCC-CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeec-CCceEEEeCC
Q psy5736 3 GLFLKEGIWDHIT-GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYD-DGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~-~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~-dG~vri~r~~ 65 (346)
+.++.+.+||.+. ......+.. |...|+.+.+..... .+..++. ||.+++|..-
T Consensus 131 ~~d~~~~~~~~~~~~~~~~~~~~-------~~~~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 186 (466)
T COG2319 131 SLDGTVKLWDLSTPGKLIRTLEG-------HSESVTSLAFSPDGK--LLASGSSLDGTIKLWDLR 186 (466)
T ss_pred CCCccEEEEEecCCCeEEEEEec-------CcccEEEEEECCCCC--EEEecCCCCCceEEEEcC
Confidence 4488999999998 555544432 456788777665433 5566665 9999999544
No 249
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=29.51 E-value=1.2e+02 Score=22.27 Aligned_cols=33 Identities=12% Similarity=0.088 Sum_probs=17.6
Q ss_pred HHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 300 TKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 300 ~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
.+.++.+.+ ....+....+...+++|+..+|+.
T Consensus 42 ~~A~~~~~~--~~~~~~~~~~~~l~a~~~~~l~~y 74 (84)
T PF12895_consen 42 EEAIELLQK--LKLDPSNPDIHYLLARCLLKLGKY 74 (84)
T ss_dssp HHHHHHHHC--HTHHHCHHHHHHHHHHHHHHTT-H
T ss_pred HHHHHHHHH--hCCCCCCHHHHHHHHHHHHHhCCH
Confidence 444444444 222233355666778888888763
No 250
>KOG0292|consensus
Probab=29.24 E-value=44 Score=36.78 Aligned_cols=49 Identities=16% Similarity=0.301 Sum_probs=41.0
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.|+|..|||.=+..+.+|. .|...+-.+.|+. +.++.++|++|-.|++|
T Consensus 30 sG~IQlWDYRM~tli~rFd-------eHdGpVRgv~FH~--~qplFVSGGDDykIkVW 78 (1202)
T KOG0292|consen 30 SGVIQLWDYRMGTLIDRFD-------EHDGPVRGVDFHP--TQPLFVSGGDDYKIKVW 78 (1202)
T ss_pred CceeeeehhhhhhHHhhhh-------ccCCccceeeecC--CCCeEEecCCccEEEEE
Confidence 5899999999999998885 3556777777764 57788999999999998
No 251
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=29.23 E-value=33 Score=23.78 Aligned_cols=26 Identities=27% Similarity=0.579 Sum_probs=17.0
Q ss_pred ccccCCcCcCC-cccccccCCcccccCcccccccCCc
Q psy5736 228 RIRCSNDNCGQ-IIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 228 ~~~C~~~~C~g-~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
.-.|| .|+. .+.+.. ..+.|.+||.+
T Consensus 20 ~~fCP--~Cg~~~m~~~~--------~r~~C~~Cgyt 46 (50)
T PRK00432 20 NKFCP--RCGSGFMAEHL--------DRWHCGKCGYT 46 (50)
T ss_pred cCcCc--CCCcchheccC--------CcEECCCcCCE
Confidence 34798 8976 333322 24999999875
No 252
>KOG0313|consensus
Probab=29.17 E-value=1.2e+02 Score=30.26 Aligned_cols=52 Identities=13% Similarity=0.181 Sum_probs=39.4
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCC-CceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHD-DSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~-~~lll~g~~dG~vri~r 63 (346)
.+|.+.|||-........- -|...|++.-.++..+ ...+++|+.|-.+++|+
T Consensus 123 YDg~~riWd~~Gk~~~~~~--------Ght~~ik~v~~v~~n~~~~~fvsas~Dqtl~Lw~ 175 (423)
T KOG0313|consen 123 YDGTSRIWDLKGKSIKTIV--------GHTGPIKSVAWVIKNSSSCLFVSASMDQTLRLWK 175 (423)
T ss_pred cCCeeEEEecCCceEEEEe--------cCCcceeeeEEEecCCccceEEEecCCceEEEEE
Confidence 5899999998855444332 1567898888888755 56788899999999994
No 253
>PF07282 OrfB_Zn_ribbon: Putative transposase DNA-binding domain; InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=28.95 E-value=58 Score=23.54 Aligned_cols=39 Identities=15% Similarity=0.311 Sum_probs=25.9
Q ss_pred cccCccccccCCcCcCCcccccccCCcccccCcccccccCCcCChHH
Q psy5736 222 LEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTE 268 (346)
Q Consensus 222 ~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~ 268 (346)
++....+-.|+ .|+....... ....+.|+.||...+.+.
T Consensus 22 v~~~~TSq~C~--~CG~~~~~~~------~~r~~~C~~Cg~~~~rD~ 60 (69)
T PF07282_consen 22 VDEAYTSQTCP--RCGHRNKKRR------SGRVFTCPNCGFEMDRDV 60 (69)
T ss_pred ECCCCCccCcc--Cccccccccc------ccceEEcCCCCCEECcHH
Confidence 44445777898 8876554311 123599999999977654
No 254
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=28.86 E-value=36 Score=24.61 Aligned_cols=26 Identities=35% Similarity=0.884 Sum_probs=15.0
Q ss_pred ecCCCCCCCccccccccCccccccCCcCcCCc
Q psy5736 208 KCRACVENWPLMESLEKYPIRIRCSNDNCGQI 239 (346)
Q Consensus 208 ~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~ 239 (346)
.|..|-..-. ..+. ...|.|| +|+..
T Consensus 9 ~CtSCg~~i~---~~~~-~~~F~CP--nCG~~ 34 (59)
T PRK14890 9 KCTSCGIEIA---PREK-AVKFLCP--NCGEV 34 (59)
T ss_pred cccCCCCccc---CCCc-cCEeeCC--CCCCe
Confidence 3666655431 2222 4788898 88755
No 255
>KOG0267|consensus
Probab=28.80 E-value=28 Score=37.18 Aligned_cols=50 Identities=12% Similarity=0.203 Sum_probs=33.7
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
-+|+|.|||.|+++.+-... -|...|+++. +++. .-+.+.|++|+.+.+|
T Consensus 90 asgtiK~wDleeAk~vrtLt-------gh~~~~~sv~-f~P~-~~~~a~gStdtd~~iw 139 (825)
T KOG0267|consen 90 ASGTIKVWDLEEAKIVRTLT-------GHLLNITSVD-FHPY-GEFFASGSTDTDLKIW 139 (825)
T ss_pred cCCceeeeehhhhhhhhhhh-------ccccCcceee-eccc-eEEeccccccccceeh
Confidence 46899999999886443221 1456788887 4432 2333778888888887
No 256
>KOG4283|consensus
Probab=28.60 E-value=96 Score=30.01 Aligned_cols=59 Identities=10% Similarity=0.040 Sum_probs=42.3
Q ss_pred CcccceEEEEecCCCc----------eeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 2 TGLFLKEGIWDHITGS----------RLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~----------~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
.|-+|.|.|||.++.. .+|.. .++.=..|+=-|+++.+.- +|..|..+++=|..+++|.
T Consensus 62 Ggadgsi~v~Dl~n~t~~e~s~li~k~~c~v--~~~h~~~Hky~iss~~WyP-~DtGmFtssSFDhtlKVWD 130 (397)
T KOG4283|consen 62 GGADGSIAVFDLQNATDYEASGLIAKHKCIV--AKQHENGHKYAISSAIWYP-IDTGMFTSSSFDHTLKVWD 130 (397)
T ss_pred cCCCccEEEEEeccccchhhccceeheeeec--cccCCccceeeeeeeEEee-ecCceeecccccceEEEee
Confidence 3779999999998765 22322 2332245777799999875 5666768788899999994
No 257
>PRK11827 hypothetical protein; Provisional
Probab=28.58 E-value=31 Score=24.99 Aligned_cols=34 Identities=21% Similarity=0.228 Sum_probs=23.5
Q ss_pred cccccCCcCcCCcccccccCCcccccCcccccccCCcCChHH
Q psy5736 227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTE 268 (346)
Q Consensus 227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~ 268 (346)
.-+.|| .|+|.+....+. ....|..|+......+
T Consensus 7 eILaCP--~ckg~L~~~~~~------~~Lic~~~~laYPI~d 40 (60)
T PRK11827 7 EIIACP--VCNGKLWYNQEK------QELICKLDNLAFPLRD 40 (60)
T ss_pred hheECC--CCCCcCeEcCCC------CeEECCccCeeccccC
Confidence 457899 899887654321 2378999998766543
No 258
>PF10083 DUF2321: Uncharacterized protein conserved in bacteria (DUF2321); InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.30 E-value=3.5e+02 Score=23.53 Aligned_cols=24 Identities=29% Similarity=0.535 Sum_probs=16.3
Q ss_pred cccccccCCcCChHHHHHHHHHHHHH
Q psy5736 254 EKKCESCNSTSDLTEIKTKLSELNEM 279 (346)
Q Consensus 254 ~~~C~~C~~~~~~~~~~~~l~~~~~~ 279 (346)
...|..||+..+..+ ..|+.+.++
T Consensus 68 PsYC~~CGkpyPWt~--~~L~aa~el 91 (158)
T PF10083_consen 68 PSYCHNCGKPYPWTE--NALEAANEL 91 (158)
T ss_pred ChhHHhCCCCCchHH--HHHHHHHHH
Confidence 478999999998754 334444444
No 259
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=28.20 E-value=97 Score=16.49 Aligned_cols=24 Identities=21% Similarity=0.189 Sum_probs=17.2
Q ss_pred HHHHHhccccHHHHHHHHHHHHHH
Q psy5736 282 RGIEQMNTSCFREAVESLTKFSDQ 305 (346)
Q Consensus 282 ~a~~~~~~~~~~~ai~~l~~~l~~ 305 (346)
.+.-.+..+++++|+..+.+.++.
T Consensus 7 ~a~~~~~~~~~~~a~~~~~~~~~~ 30 (34)
T smart00028 7 LGNAYLKLGDYDEALEYYEKALEL 30 (34)
T ss_pred HHHHHHHHhhHHHHHHHHHHHHcc
Confidence 344456667888888888887764
No 260
>KOG0307|consensus
Probab=28.08 E-value=50 Score=36.83 Aligned_cols=56 Identities=16% Similarity=0.214 Sum_probs=42.1
Q ss_pred cccceEEEEecCC---CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 3 GLFLKEGIWDHIT---GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 3 ~~~~~~~~w~~~~---~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
+-+|+|.+||-.+ +.......-.+ .|..-|..|+ +|..-..++..|.+||.|=||.
T Consensus 87 ~edG~I~ly~p~~~~~~~~~~~la~~~----~h~G~V~gLD-fN~~q~nlLASGa~~geI~iWD 145 (1049)
T KOG0307|consen 87 LEDGNIVLYDPASIIANASEEVLATKS----KHTGPVLGLD-FNPFQGNLLASGADDGEILIWD 145 (1049)
T ss_pred ccCCceEEecchhhccCcchHHHhhhc----ccCCceeeee-ccccCCceeeccCCCCcEEEec
Confidence 3479999999875 33333332222 4778898888 6888888999999999999993
No 261
>smart00858 SAF This domain family includes a range of different proteins. Such as antifreeze proteins and flagellar FlgA proteins, and CpaB pilus proteins.
Probab=28.04 E-value=41 Score=23.49 Aligned_cols=16 Identities=38% Similarity=0.486 Sum_probs=14.0
Q ss_pred EEEEeeccCCCCceee
Q psy5736 166 IVKALRPLKPKEVVAE 181 (346)
Q Consensus 166 ~v~a~r~I~~Geel~~ 181 (346)
+++|.++|++|+.|+-
T Consensus 3 v~va~~~i~~G~~i~~ 18 (64)
T smart00858 3 VVVAARDLPAGEVITA 18 (64)
T ss_pred EEEEeCccCCCCCcch
Confidence 5778899999999985
No 262
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=27.22 E-value=43 Score=38.20 Aligned_cols=12 Identities=17% Similarity=0.589 Sum_probs=8.6
Q ss_pred cccccccCCcCC
Q psy5736 254 EKKCESCNSTSD 265 (346)
Q Consensus 254 ~~~C~~C~~~~~ 265 (346)
...|+.|+..+.
T Consensus 709 a~~CP~CGtplv 720 (1337)
T PRK14714 709 RVECPRCDVELT 720 (1337)
T ss_pred cccCCCCCCccc
Confidence 357999987643
No 263
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=27.14 E-value=1.2e+02 Score=22.27 Aligned_cols=20 Identities=25% Similarity=0.250 Sum_probs=13.0
Q ss_pred HHHHHhccccHHHHHHHHHH
Q psy5736 282 RGIEQMNTSCFREAVESLTK 301 (346)
Q Consensus 282 ~a~~~~~~~~~~~ai~~l~~ 301 (346)
.|.-.+..|++++|++.+.+
T Consensus 31 la~~~~~~~~y~~A~~~~~~ 50 (84)
T PF12895_consen 31 LAQCYFQQGKYEEAIELLQK 50 (84)
T ss_dssp HHHHHHHTTHHHHHHHHHHC
T ss_pred HHHHHHHCCCHHHHHHHHHH
Confidence 34455666777777777766
No 264
>KOG1034|consensus
Probab=27.09 E-value=1.3e+02 Score=29.49 Aligned_cols=55 Identities=13% Similarity=0.114 Sum_probs=42.5
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+.+|.+||.++..-+.-|.--- -|.-++=++.+ +.+++ .++.++.|..+.+|+.-
T Consensus 157 D~svRlwnI~~~~Cv~VfGG~e----gHrdeVLSvD~-~~~gd-~i~ScGmDhslk~W~l~ 211 (385)
T KOG1034|consen 157 DHSVRLWNIQTDVCVAVFGGVE----GHRDEVLSVDF-SLDGD-RIASCGMDHSLKLWRLN 211 (385)
T ss_pred CceEEEEeccCCeEEEEecccc----cccCcEEEEEE-cCCCC-eeeccCCcceEEEEecC
Confidence 6789999999999999886433 26678888774 33344 67889999999999765
No 265
>KOG1240|consensus
Probab=26.95 E-value=71 Score=36.38 Aligned_cols=59 Identities=17% Similarity=0.167 Sum_probs=33.7
Q ss_pred cccceEEEEecCCCcee-eeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 3 GLFLKEGIWDHITGSRL-ISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~-~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
+-+|+|.|||..+-.+= ..+.+- --|.+.+++++.+..+. ...-..+|++||.|++.+-
T Consensus 1068 S~DGtVKvW~~~k~~~~~~s~rS~-ltys~~~sr~~~vt~~~--~~~~~Av~t~DG~v~~~~i 1127 (1431)
T KOG1240|consen 1068 SDDGTVKVWNLRKLEGEGGSARSE-LTYSPEGSRVEKVTMCG--NGDQFAVSTKDGSVRVLRI 1127 (1431)
T ss_pred cCCceEEEeeehhhhcCcceeeee-EEEeccCCceEEEEecc--CCCeEEEEcCCCeEEEEEc
Confidence 45899999998654222 122111 11344455555544333 2333466899999998853
No 266
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=26.88 E-value=32 Score=29.80 Aligned_cols=34 Identities=21% Similarity=0.677 Sum_probs=24.6
Q ss_pred CCCceecCCCCCCCccccccccCccccccCCcCcCCccccc
Q psy5736 203 YWFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQIIATV 243 (346)
Q Consensus 203 ~~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~ 243 (346)
-+|.| +.|...+...+.+. ..|.|| .|++.+...
T Consensus 108 ~~Y~C--p~c~~r~tf~eA~~---~~F~Cp--~Cg~~L~~~ 141 (158)
T TIGR00373 108 MFFIC--PNMCVRFTFNEAME---LNFTCP--RCGAMLDYL 141 (158)
T ss_pred CeEEC--CCCCcEeeHHHHHH---cCCcCC--CCCCEeeec
Confidence 45555 88988776555543 589999 999987655
No 267
>KOG1539|consensus
Probab=26.76 E-value=1.2e+02 Score=33.07 Aligned_cols=52 Identities=21% Similarity=0.330 Sum_probs=43.0
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
..|+|.|+|...++.+-+|.-. ..+||+|-| ..+..+++.+|++.|.+.+|.
T Consensus 222 ~~G~ViifNlK~dkil~sFk~d-------~g~VtslSF-rtDG~p~las~~~~G~m~~wD 273 (910)
T KOG1539|consen 222 ENGTVIIFNLKFDKILMSFKQD-------WGRVTSLSF-RTDGNPLLASGRSNGDMAFWD 273 (910)
T ss_pred cCceEEEEEcccCcEEEEEEcc-------ccceeEEEe-ccCCCeeEEeccCCceEEEEE
Confidence 3799999999999999999522 368999885 557788889899999998883
No 268
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=26.48 E-value=47 Score=27.23 Aligned_cols=16 Identities=13% Similarity=0.312 Sum_probs=7.3
Q ss_pred HHHHHHHHHHHHHHhh
Q psy5736 292 FREAVESLTKFSDQIH 307 (346)
Q Consensus 292 ~~~ai~~l~~~l~~~~ 307 (346)
|-.++.-+.++++.+.
T Consensus 63 YPTvR~rLd~ii~~lg 78 (113)
T PF09862_consen 63 YPTVRNRLDKIIEKLG 78 (113)
T ss_pred cHHHHHHHHHHHHHhC
Confidence 3334444555554443
No 269
>KOG1036|consensus
Probab=26.43 E-value=1.2e+02 Score=29.23 Aligned_cols=49 Identities=12% Similarity=0.203 Sum_probs=33.9
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+|.+++|.+||-........+.++ .+|-+|. -.+..+++|.+|..+-+|
T Consensus 112 gsWD~~ik~wD~R~~~~~~~~d~~--------kkVy~~~----v~g~~LvVg~~~r~v~iy 160 (323)
T KOG1036|consen 112 GSWDKTIKFWDPRNKVVVGTFDQG--------KKVYCMD----VSGNRLVVGTSDRKVLIY 160 (323)
T ss_pred cccCccEEEEeccccccccccccC--------ceEEEEe----ccCCEEEEeecCceEEEE
Confidence 477899999999874444444322 2455544 335667889999999999
No 270
>KOG0264|consensus
Probab=26.23 E-value=1.2e+02 Score=30.54 Aligned_cols=55 Identities=15% Similarity=0.222 Sum_probs=43.3
Q ss_pred cccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 3 GLFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 3 ~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+-+++|.+||-.+= +++..|. .|+.+|..+++ .++.+.++.+.++||.+-+|..-
T Consensus 292 S~D~tV~LwDlRnL~~~lh~~e-------~H~dev~~V~W-SPh~etvLASSg~D~rl~vWDls 347 (422)
T KOG0264|consen 292 SADKTVALWDLRNLNKPLHTFE-------GHEDEVFQVEW-SPHNETVLASSGTDRRLNVWDLS 347 (422)
T ss_pred cCCCcEEEeechhcccCceecc-------CCCcceEEEEe-CCCCCceeEecccCCcEEEEecc
Confidence 45899999999653 5555553 37788999884 77899999999999999999543
No 271
>KOG0265|consensus
Probab=26.14 E-value=72 Score=30.78 Aligned_cols=52 Identities=15% Similarity=0.201 Sum_probs=38.1
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
.+|++.|||..+....-.|.| +.++|++-|-.+.+. + ..|.-|+.|++|..-
T Consensus 153 dD~t~kl~D~R~k~~~~t~~~--------kyqltAv~f~d~s~q-v-~sggIdn~ikvWd~r 204 (338)
T KOG0265|consen 153 DDGTLKLWDIRKKEAIKTFEN--------KYQLTAVGFKDTSDQ-V-ISGGIDNDIKVWDLR 204 (338)
T ss_pred CCceEEEEeecccchhhcccc--------ceeEEEEEecccccc-e-eeccccCceeeeccc
Confidence 589999999998777766643 467888886544443 3 468889999999553
No 272
>KOG3914|consensus
Probab=26.09 E-value=41 Score=33.37 Aligned_cols=20 Identities=20% Similarity=0.190 Sum_probs=17.1
Q ss_pred cccceEEEEecCCCceeeee
Q psy5736 3 GLFLKEGIWDHITGSRLISW 22 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~ 22 (346)
|-+++|.+|||.+|+.|..|
T Consensus 212 sGD~tlr~Wd~~sgk~L~t~ 231 (390)
T KOG3914|consen 212 SGDKTLRLWDITSGKLLDTC 231 (390)
T ss_pred CCCCcEEEEecccCCccccc
Confidence 66899999999999888544
No 273
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=26.01 E-value=46 Score=22.44 Aligned_cols=9 Identities=22% Similarity=0.848 Sum_probs=4.7
Q ss_pred ccccccCCc
Q psy5736 255 KKCESCNST 263 (346)
Q Consensus 255 ~~C~~C~~~ 263 (346)
..|+.||..
T Consensus 20 irC~~CG~r 28 (44)
T smart00659 20 VRCRECGYR 28 (44)
T ss_pred eECCCCCce
Confidence 555555543
No 274
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=25.95 E-value=41 Score=26.26 Aligned_cols=11 Identities=27% Similarity=0.586 Sum_probs=9.2
Q ss_pred cccccccCCcC
Q psy5736 254 EKKCESCNSTS 264 (346)
Q Consensus 254 ~~~C~~C~~~~ 264 (346)
.|.|.+||...
T Consensus 53 IW~C~kCg~~f 63 (89)
T COG1997 53 IWKCRKCGAKF 63 (89)
T ss_pred eEEcCCCCCee
Confidence 49999999865
No 275
>KOG0644|consensus
Probab=25.61 E-value=37 Score=37.04 Aligned_cols=51 Identities=24% Similarity=0.329 Sum_probs=39.2
Q ss_pred ceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 6 LKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 6 ~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
--|.||-.+++..+.+.. -|.++||+|.. .....+..+|+.|.+||+|+.-
T Consensus 212 ~lvKiwS~et~~~lAs~r-------Ghs~ditdlav--s~~n~~iaaaS~D~vIrvWrl~ 262 (1113)
T KOG0644|consen 212 RLVKIWSMETARCLASCR-------GHSGDITDLAV--SSNNTMIAAASNDKVIRVWRLP 262 (1113)
T ss_pred ceeeeeeccchhhhccCC-------CCccccchhcc--chhhhhhhhcccCceEEEEecC
Confidence 458899999998887652 27889999873 2445666778899999999764
No 276
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=25.27 E-value=2.3e+02 Score=23.84 Aligned_cols=16 Identities=13% Similarity=0.110 Sum_probs=7.8
Q ss_pred cccHHHHHHHHHHHHH
Q psy5736 289 TSCFREAVESLTKFSD 304 (346)
Q Consensus 289 ~~~~~~ai~~l~~~l~ 304 (346)
.|++++|++.+.+.++
T Consensus 78 ~~~~~~A~~~~~~al~ 93 (234)
T TIGR02521 78 LGELEKAEDSFRRALT 93 (234)
T ss_pred cCCHHHHHHHHHHHHh
Confidence 3445555555544444
No 277
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=25.25 E-value=1.9e+02 Score=22.86 Aligned_cols=40 Identities=13% Similarity=0.119 Sum_probs=21.0
Q ss_pred hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
+..+++++|+..+.+.++. .|. .......++.+|...|+.
T Consensus 62 ~~~~~~~~A~~~~~~~~~~-----~p~---~~~~~~~la~~~~~~g~~ 101 (135)
T TIGR02552 62 QMLKEYEEAIDAYALAAAL-----DPD---DPRPYFHAAECLLALGEP 101 (135)
T ss_pred HHHHHHHHHHHHHHHHHhc-----CCC---ChHHHHHHHHHHHHcCCH
Confidence 3445555555555555443 122 234445666777777664
No 278
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=25.03 E-value=64 Score=20.27 Aligned_cols=9 Identities=22% Similarity=0.862 Sum_probs=4.8
Q ss_pred ccccccCCc
Q psy5736 255 KKCESCNST 263 (346)
Q Consensus 255 ~~C~~C~~~ 263 (346)
..|..||..
T Consensus 18 irC~~CG~R 26 (32)
T PF03604_consen 18 IRCPECGHR 26 (32)
T ss_dssp SSBSSSS-S
T ss_pred EECCcCCCe
Confidence 566666654
No 279
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=24.88 E-value=2.2e+02 Score=30.99 Aligned_cols=30 Identities=17% Similarity=0.120 Sum_probs=22.2
Q ss_pred CCCcHhHHHHHHHHHHHHHHhCCccccCcc
Q psy5736 311 VPPYKLASLAHEALRNCWSLAGNKWVIPEN 340 (346)
Q Consensus 311 ~p~h~~~~~~~~~L~~~y~~~G~~~~~~~~ 340 (346)
-|........-+.|.|.-++.|+.-...+.
T Consensus 567 ~~DfRA~Er~fqll~QvaGRAgR~~~~G~V 596 (730)
T COG1198 567 SPDFRASERTFQLLMQVAGRAGRAGKPGEV 596 (730)
T ss_pred CCCcchHHHHHHHHHHHHhhhccCCCCCeE
Confidence 445556778888999999999997555443
No 280
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=24.67 E-value=62 Score=28.68 Aligned_cols=24 Identities=13% Similarity=-0.201 Sum_probs=15.0
Q ss_pred CCEEEEEEeeccCCCCceeecccc
Q psy5736 162 GKNIIVKALRPLKPKEVVAENYGL 185 (346)
Q Consensus 162 g~~~~v~a~r~I~~Geel~~~Y~~ 185 (346)
.+.+..+-.|+=..|-+.+..|.+
T Consensus 59 ~~li~~~k~rd~~~~~~~y~w~~~ 82 (176)
T COG1675 59 DGLISYRKKRDEESGWEEYTWYIN 82 (176)
T ss_pred CCceEEEeecccCCCcEEEEEEec
Confidence 345555666777777666666654
No 281
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=24.50 E-value=43 Score=26.48 Aligned_cols=26 Identities=23% Similarity=0.639 Sum_probs=19.0
Q ss_pred ccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736 230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD 265 (346)
Q Consensus 230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~ 265 (346)
.|| .|++.+.+.. ..+.|..|+....
T Consensus 2 fC~--~Cg~~l~~~~--------~~~~C~~C~~~~~ 27 (104)
T TIGR01384 2 FCP--KCGSLMTPKN--------GVYVCPSCGYEKE 27 (104)
T ss_pred CCc--ccCcccccCC--------CeEECcCCCCccc
Confidence 588 8998886542 1389999998644
No 282
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=24.05 E-value=1e+02 Score=31.87 Aligned_cols=18 Identities=22% Similarity=0.174 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHhCCc
Q psy5736 317 ASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 317 ~~~~~~~L~~~y~~~G~~ 334 (346)
...+.+.|.|..++.|+.
T Consensus 351 ~E~~~~ll~q~~GRagR~ 368 (505)
T TIGR00595 351 AERGFQLLTQVAGRAGRA 368 (505)
T ss_pred HHHHHHHHHHHHhccCCC
Confidence 345567788888888773
No 283
>KOG1523|consensus
Probab=23.87 E-value=77 Score=30.84 Aligned_cols=44 Identities=23% Similarity=0.470 Sum_probs=30.4
Q ss_pred eeeeCCCCCCC-------CCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736 19 LISWRPDRGSW-------ARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY 65 (346)
Q Consensus 19 ~~~~~~~~~~w-------~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~ 65 (346)
+|||.-+|. | +++.+.|++|.++ ++..++.+|+.||..|+|.-|
T Consensus 126 Vcy~E~ENd-WWVsKhikkPirStv~sldWh--pnnVLlaaGs~D~k~rVfSay 176 (361)
T KOG1523|consen 126 VCYYEQEND-WWVSKHIKKPIRSTVTSLDWH--PNNVLLAAGSTDGKCRVFSAY 176 (361)
T ss_pred EEEEecccc-eehhhhhCCccccceeeeecc--CCcceecccccCcceeEEEEe
Confidence 356665553 5 4445567887765 456677788899999999655
No 284
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=23.77 E-value=3.7e+02 Score=25.02 Aligned_cols=44 Identities=11% Similarity=0.022 Sum_probs=33.3
Q ss_pred ccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCcc
Q psy5736 290 SCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKW 335 (346)
Q Consensus 290 ~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~ 335 (346)
+++++|++.|++.++.|..-- .......+...++.++..+|+.-
T Consensus 129 ~d~e~Ai~~Y~~A~~~y~~e~--~~~~a~~~~~~~A~l~~~l~~y~ 172 (282)
T PF14938_consen 129 GDYEKAIEYYQKAAELYEQEG--SPHSAAECLLKAADLYARLGRYE 172 (282)
T ss_dssp --HHHHHHHHHHHHHHHHHTT---HHHHHHHHHHHHHHHHHTT-HH
T ss_pred CCHHHHHHHHHHHHHHHHHCC--ChhhHHHHHHHHHHHHHHhCCHH
Confidence 589999999999999998764 44556778888999999988653
No 285
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=23.76 E-value=2e+02 Score=24.19 Aligned_cols=52 Identities=8% Similarity=-0.047 Sum_probs=30.2
Q ss_pred HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736 281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~ 333 (346)
.-+.-....|++++|+..+++.+..... ....+..+..+...+.+.+..+|+
T Consensus 77 ~lg~~~~~~g~~~eA~~~~~~Al~~~~~-~~~~~~~la~i~~~~~~~~~~~g~ 128 (168)
T CHL00033 77 NIGLIHTSNGEHTKALEYYFQALERNPF-LPQALNNMAVICHYRGEQAIEQGD 128 (168)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHhCcC-cHHHHHHHHHHHHHhhHHHHHccc
Confidence 3345556678999999999999976322 222333344444444444445555
No 286
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=23.23 E-value=85 Score=25.72 Aligned_cols=41 Identities=12% Similarity=0.216 Sum_probs=26.5
Q ss_pred cCCCCceeecccccccCCCHHHHHHHHHhhCCCceecCCCCCCCccccccccCccccccCCcCcCCc
Q psy5736 173 LKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQI 239 (346)
Q Consensus 173 I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~ 239 (346)
+.+|.+|.|.-.+ ....|..|...++ ... ...++|| .|++.
T Consensus 57 ~~ega~L~Ie~vp-------------------~~~~C~~Cg~~~~----~~~-~~~~~CP--~Cgs~ 97 (117)
T PRK00564 57 VCKDAILDIVDEK-------------------VELECKDCSHVFK----PNA-LDYGVCE--KCHSK 97 (117)
T ss_pred ccCCCEEEEEecC-------------------CEEEhhhCCCccc----cCC-ccCCcCc--CCCCC
Confidence 5678888888776 2677888876653 221 1234588 88764
No 287
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=23.14 E-value=37 Score=27.18 Aligned_cols=58 Identities=14% Similarity=0.308 Sum_probs=30.7
Q ss_pred cccccCCcCcCCcccccccCCcccccCcccccccCCcCChHHHHHHHHHHHHHHHHHHHHhcccc
Q psy5736 227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSC 291 (346)
Q Consensus 227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~~~~l~~~~~~~~~a~~~~~~~~ 291 (346)
..|.|| +|+....+..-. .+.....|..||...... ...|.+..+.|.+-+++...|.
T Consensus 20 t~f~CP--~Cge~~v~v~~~---k~~~h~~C~~CG~y~~~~--V~~l~epIDVY~~wiD~~~eg~ 77 (99)
T PRK14892 20 KIFECP--RCGKVSISVKIK---KNIAIITCGNCGLYTEFE--VPSVYDEVDVYNKFIDLYLEGK 77 (99)
T ss_pred cEeECC--CCCCeEeeeecC---CCcceEECCCCCCccCEE--CCccccchhhHHHHHHHHHhcC
Confidence 567898 888433221100 012358899999865432 2234444445555555554444
No 288
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=23.11 E-value=54 Score=27.46 Aligned_cols=32 Identities=13% Similarity=0.099 Sum_probs=21.4
Q ss_pred cccCCcCcCCcccccccCCcccccCcccccccCCcCChHHH
Q psy5736 229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEI 269 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~ 269 (346)
-.|| +|+..++--+ ....+|++||..+..+.+
T Consensus 10 r~Cp--~cg~kFYDLn-------k~p~vcP~cg~~~~~~~~ 41 (129)
T TIGR02300 10 RICP--NTGSKFYDLN-------RRPAVSPYTGEQFPPEEA 41 (129)
T ss_pred ccCC--CcCccccccC-------CCCccCCCcCCccCcchh
Confidence 4688 8876654332 124899999998776633
No 289
>PRK04016 DNA-directed RNA polymerase subunit N; Provisional
Probab=23.03 E-value=1.3e+02 Score=22.03 Aligned_cols=13 Identities=15% Similarity=0.429 Sum_probs=10.4
Q ss_pred cccccccCCcCCh
Q psy5736 254 EKKCESCNSTSDL 266 (346)
Q Consensus 254 ~~~C~~C~~~~~~ 266 (346)
...|-.||+.+..
T Consensus 4 PvRCFTCGkvi~~ 16 (62)
T PRK04016 4 PVRCFTCGKVIAE 16 (62)
T ss_pred CeEecCCCCChHH
Confidence 3789999998764
No 290
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=23.00 E-value=89 Score=18.21 Aligned_cols=18 Identities=11% Similarity=0.388 Sum_probs=13.2
Q ss_pred ccccccCCcCChHHHHHH
Q psy5736 255 KKCESCNSTSDLTEIKTK 272 (346)
Q Consensus 255 ~~C~~C~~~~~~~~~~~~ 272 (346)
..|+.||.....+.+.+.
T Consensus 3 ~~C~~CgR~F~~~~l~~H 20 (25)
T PF13913_consen 3 VPCPICGRKFNPDRLEKH 20 (25)
T ss_pred CcCCCCCCEECHHHHHHH
Confidence 679999998876665443
No 291
>TIGR02878 spore_ypjB sporulation protein YpjB. Members of this protein, YpjB, family are restricted to a subset of endospore-forming bacteria, including Bacillus species but not CLostridium or some others. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon, where sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect. This protein family is not, however, a part of the endospore formation minimal gene set.
Probab=22.68 E-value=1.6e+02 Score=27.32 Aligned_cols=35 Identities=26% Similarity=0.327 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcc
Q psy5736 275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHEL 309 (346)
Q Consensus 275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~ 309 (346)
++..+-..++++.+++++++|.+.+.-+-+.|-..
T Consensus 4 eL~~lsD~~LQl~K~~~yeeAkqvL~yFs~~F~~~ 38 (233)
T TIGR02878 4 ELNDLSDQVLQLTKQARYEEAKQVLEYFSKQFVSA 38 (233)
T ss_pred hHhhhhHHHHHHHHhhhHHHHHHHHHHHHHHHHhc
Confidence 44455566788889999999999888888777664
No 292
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=22.33 E-value=1.8e+02 Score=22.14 Aligned_cols=40 Identities=13% Similarity=0.063 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCC
Q psy5736 274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPP 313 (346)
Q Consensus 274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~ 313 (346)
..+-+...+|++.=+.|..++|+..|.+.++.+.+-+.-|
T Consensus 6 ~~A~~~I~kaL~~dE~g~~e~Al~~Y~~gi~~l~eg~ai~ 45 (79)
T cd02679 6 KQAFEEISKALRADEWGDKEQALAHYRKGLRELEEGIAVP 45 (79)
T ss_pred HHHHHHHHHHhhhhhcCCHHHHHHHHHHHHHHHHHHcCCC
Confidence 3445555667776677899999999999998888754443
No 293
>KOG2055|consensus
Probab=22.29 E-value=1.1e+02 Score=31.21 Aligned_cols=29 Identities=24% Similarity=0.386 Sum_probs=24.6
Q ss_pred CCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 33 SARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 33 ~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
...|++++|+ ++-.++|+|+-||.+|+|.
T Consensus 213 ~~~I~sv~FH--p~~plllvaG~d~~lrifq 241 (514)
T KOG2055|consen 213 HGGITSVQFH--PTAPLLLVAGLDGTLRIFQ 241 (514)
T ss_pred cCCceEEEec--CCCceEEEecCCCcEEEEE
Confidence 3579999987 4678999999999999994
No 294
>KOG0321|consensus
Probab=22.13 E-value=1.1e+02 Score=32.44 Aligned_cols=52 Identities=13% Similarity=0.166 Sum_probs=35.7
Q ss_pred cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+-++.+||.+.+...-.-.+ -.|..-++++-|.- .+..+..+|+-||.+-+|
T Consensus 121 DsT~r~Wdvk~s~l~G~~~~-----~GH~~SvkS~cf~~-~n~~vF~tGgRDg~illW 172 (720)
T KOG0321|consen 121 DSTIRPWDVKTSRLVGGRLN-----LGHTGSVKSECFMP-TNPAVFCTGGRDGEILLW 172 (720)
T ss_pred Cceeeeeeeccceeecceee-----cccccccchhhhcc-CCCcceeeccCCCcEEEE
Confidence 56899999998744322111 12555567766544 566777889999999999
No 295
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=22.06 E-value=50 Score=28.05 Aligned_cols=34 Identities=24% Similarity=0.433 Sum_probs=19.8
Q ss_pred cccCCcCcCCcccccccCC---------cccccCcccccccCCcC
Q psy5736 229 IRCSNDNCGQIIATVKKLE---------PSAKKVEKKCESCNSTS 264 (346)
Q Consensus 229 ~~C~~~~C~g~~~~~~~~~---------~~~~~~~~~C~~C~~~~ 264 (346)
-+|+ .|.+.+.+....+ ....+.-|.|+.||+..
T Consensus 92 sRC~--~CN~~L~~v~~~~v~~~vp~~v~~~~~~f~~C~~C~kiy 134 (147)
T PF01927_consen 92 SRCP--KCNGPLRPVSKEEVKDRVPPYVYETYDEFWRCPGCGKIY 134 (147)
T ss_pred CccC--CCCcEeeechhhccccccCccccccCCeEEECCCCCCEe
Confidence 3677 7777655432110 01123479999999963
No 296
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=22.03 E-value=1.2e+02 Score=20.66 Aligned_cols=29 Identities=28% Similarity=0.481 Sum_probs=21.3
Q ss_pred CcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736 34 ARCTALTLLNEHDDSLLLTGYDDGSLSVYRN 64 (346)
Q Consensus 34 ~~~t~l~~ine~~~~lll~g~~dG~vri~r~ 64 (346)
.+|+.+.+ + ++-.++.+|+.+|.|.+||.
T Consensus 12 ~~v~~~~w-~-P~mdLiA~~t~~g~v~v~Rl 40 (47)
T PF12894_consen 12 SRVSCMSW-C-PTMDLIALGTEDGEVLVYRL 40 (47)
T ss_pred CcEEEEEE-C-CCCCEEEEEECCCeEEEEEC
Confidence 46776664 3 33447788999999999986
No 297
>KOG1273|consensus
Probab=21.96 E-value=1.1e+02 Score=29.83 Aligned_cols=54 Identities=20% Similarity=0.063 Sum_probs=40.3
Q ss_pred ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
.+|.|-|||..+-..-..++ -|-..|++|-+ ..|+..+|++|.|-.+.+|....
T Consensus 43 ~nG~vvI~D~~T~~iar~ls-------aH~~pi~sl~W--S~dgr~LltsS~D~si~lwDl~~ 96 (405)
T KOG1273|consen 43 ANGRVVIYDFDTFRIARMLS-------AHVRPITSLCW--SRDGRKLLTSSRDWSIKLWDLLK 96 (405)
T ss_pred cCCcEEEEEccccchhhhhh-------ccccceeEEEe--cCCCCEeeeecCCceeEEEeccC
Confidence 37999999998764333331 25667888875 46788889999999999996653
No 298
>KOG4056|consensus
Probab=21.87 E-value=1.6e+02 Score=25.02 Aligned_cols=43 Identities=14% Similarity=0.084 Sum_probs=31.1
Q ss_pred HHHHHHhccccHHHHHHHH----------HHHHHHhhcccCCCcHhHHHHHHH
Q psy5736 281 YRGIEQMNTSCFREAVESL----------TKFSDQIHELVVPPYKLASLAHEA 323 (346)
Q Consensus 281 ~~a~~~~~~~~~~~ai~~l----------~~~l~~~~~~l~p~h~~~~~~~~~ 323 (346)
+.+.+++.+|++++....+ .++|..|++.++|+++.+...+..
T Consensus 86 ~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqLL~vlq~tlp~~if~~i~~~l~ 138 (143)
T KOG4056|consen 86 QLGEELLAQGNEEEGAEHLANAIVVCGQPAQLLQVLQQTLPEAIFAMLLKKLP 138 (143)
T ss_pred HhHHHHHHccCHHHHHHHHHHHHhhcCCHHHHHHHHHhhCCHHHHHHHHHHhh
Confidence 4466677888877754444 457789999999999987776653
No 299
>smart00647 IBR In Between Ring fingers. the domains occurs between pairs og RING fingers
Probab=21.87 E-value=64 Score=22.52 Aligned_cols=32 Identities=16% Similarity=0.369 Sum_probs=18.2
Q ss_pred ccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 228 RIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 228 ~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
..+||+++|...+....+. ......|+.|+..
T Consensus 18 ~~~CP~~~C~~~~~~~~~~----~~~~v~C~~C~~~ 49 (64)
T smart00647 18 LKWCPAPDCSAAIIVTEEE----GCNRVTCPKCGFS 49 (64)
T ss_pred ccCCCCCCCcceEEecCCC----CCCeeECCCCCCe
Confidence 3468888887666544210 1124677777653
No 300
>PF04652 DUF605: Vta1 like; InterPro: IPR006745 This family contains proteins from the Eukaryota; functionally they are uncharacterised.; PDB: 2RKK_B 2RKL_B 3MHV_A.
Probab=21.76 E-value=1.9e+02 Score=28.28 Aligned_cols=37 Identities=14% Similarity=0.147 Sum_probs=30.3
Q ss_pred HHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736 270 KTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQI 306 (346)
Q Consensus 270 ~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~ 306 (346)
.+.+.+++...+=|+.+++-.|+..|++.|++.|+.+
T Consensus 343 ~~~i~~aQK~aK~AiSAL~feDv~tA~~~L~~AL~lL 379 (380)
T PF04652_consen 343 PEQIAKAQKHAKFAISALNFEDVPTAIKELRKALELL 379 (380)
T ss_dssp HHHHHHHHHHHHHHHHHHTTT-HHHHHHHHHHHHHHH
T ss_pred ccccccccccccccccccccccccccccccccccccC
Confidence 3456677777788899999999999999999999876
No 301
>KOG0301|consensus
Probab=21.26 E-value=99 Score=33.00 Aligned_cols=32 Identities=22% Similarity=0.357 Sum_probs=22.7
Q ss_pred CCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736 29 WARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR 63 (346)
Q Consensus 29 w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r 63 (346)
|..|..-+=+++.+++. ..++|+.|..|++|+
T Consensus 136 l~gH~asVWAv~~l~e~---~~vTgsaDKtIklWk 167 (745)
T KOG0301|consen 136 LQGHTASVWAVASLPEN---TYVTGSADKTIKLWK 167 (745)
T ss_pred cCCcchheeeeeecCCC---cEEeccCcceeeecc
Confidence 55566667777777765 557788888888884
No 302
>KOG4378|consensus
Probab=21.19 E-value=1.4e+02 Score=30.89 Aligned_cols=54 Identities=19% Similarity=0.286 Sum_probs=35.2
Q ss_pred CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
.|-.|.|++||......+-.|..+ |+..|..+-| .+..+.++++-+-|..|-+|
T Consensus 183 asd~G~VtlwDv~g~sp~~~~~~~------HsAP~~gicf-spsne~l~vsVG~Dkki~~y 236 (673)
T KOG4378|consen 183 ASDKGAVTLWDVQGMSPIFHASEA------HSAPCRGICF-SPSNEALLVSVGYDKKINIY 236 (673)
T ss_pred eccCCeEEEEeccCCCcccchhhh------ccCCcCccee-cCCccceEEEecccceEEEe
Confidence 356799999999988877766544 5666666654 33445555555556666555
No 303
>PRK14873 primosome assembly protein PriA; Provisional
Probab=20.96 E-value=1.3e+02 Score=32.39 Aligned_cols=24 Identities=8% Similarity=-0.054 Sum_probs=15.5
Q ss_pred CCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 311 VPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 311 ~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
.|....-..+.+.|.+.-++.|..
T Consensus 503 ~pDfRA~Er~~qll~qvagragr~ 526 (665)
T PRK14873 503 RQDLRAAEDTLRRWMAAAALVRPR 526 (665)
T ss_pred CCCcChHHHHHHHHHHHHHhhcCC
Confidence 344445666677777777777664
No 304
>PLN03086 PRLI-interacting factor K; Provisional
Probab=20.93 E-value=1.3e+02 Score=31.56 Aligned_cols=57 Identities=25% Similarity=0.569 Sum_probs=33.3
Q ss_pred ceecCCCCCCCccccccccC-----ccccccCCcCcCCcccccccCCcccccCcccccccCCcCChHHH
Q psy5736 206 ECKCRACVENWPLMESLEKY-----PIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEI 269 (346)
Q Consensus 206 ~C~C~~C~~~~~~~~~~~~~-----~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~ 269 (346)
+..|.-|...-|. ..+..+ ...+.||+..|+..+..... ...|.|..|++.+.....
T Consensus 407 ~V~C~NC~~~i~l-~~l~lHe~~C~r~~V~Cp~~~Cg~v~~r~el------~~H~~C~~Cgk~f~~s~L 468 (567)
T PLN03086 407 TVECRNCKHYIPS-RSIALHEAYCSRHNVVCPHDGCGIVLRVEEA------KNHVHCEKCGQAFQQGEM 468 (567)
T ss_pred eEECCCCCCccch-hHHHHHHhhCCCcceeCCcccccceeecccc------ccCccCCCCCCccchHHH
Confidence 4567778775442 122211 25567875558876643321 235999999998765443
No 305
>KOG0310|consensus
Probab=20.79 E-value=1.7e+02 Score=29.89 Aligned_cols=52 Identities=15% Similarity=0.157 Sum_probs=34.1
Q ss_pred cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736 3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY 62 (346)
Q Consensus 3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~ 62 (346)
+=++.+.+||-.++.- ..--.+ |.-.+-+..++ +..+.++++|+=||.||+|
T Consensus 130 sDd~v~k~~d~s~a~v-~~~l~~------htDYVR~g~~~-~~~~hivvtGsYDg~vrl~ 181 (487)
T KOG0310|consen 130 SDDKVVKYWDLSTAYV-QAELSG------HTDYVRCGDIS-PANDHIVVTGSYDGKVRLW 181 (487)
T ss_pred CCCceEEEEEcCCcEE-EEEecC------CcceeEeeccc-cCCCeEEEecCCCceEEEE
Confidence 3467889999998874 222212 34344443333 3556788999999999999
No 306
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=20.69 E-value=2.8e+02 Score=28.93 Aligned_cols=51 Identities=12% Similarity=0.113 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736 275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK 334 (346)
Q Consensus 275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~ 334 (346)
.+..+...+...+..|++++|++.|.+.++... . + .....++.||..+|+.
T Consensus 126 ~a~~~k~~G~~~~~~~~~~~Ai~~y~~al~~~p-----~-~---~~~~n~a~~~~~l~~~ 176 (615)
T TIGR00990 126 YAAKLKEKGNKAYRNKDFNKAIKLYSKAIECKP-----D-P---VYYSNRAACHNALGDW 176 (615)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCC-----c-h---HHHHHHHHHHHHhCCH
Confidence 345555667778888999999999999886542 1 1 2345566677766654
No 307
>PF02561 FliS: Flagellar protein FliS; InterPro: IPR003713 The fliD operon of several bacteria consists of three flagellar genes, fliD, fliS, and fliT, and is transcribed in this order []. In Bacillus subtilis the operon encoding the flagellar proteins FliD, FliS, and FliT is sigma D-dependent [].; GO: 0009296 flagellum assembly, 0009288 bacterial-type flagellum; PDB: 1VH6_A 3IQC_B 3K1I_B 1ORJ_B 1ORY_A.
Probab=20.60 E-value=4.3e+02 Score=21.35 Aligned_cols=45 Identities=16% Similarity=0.119 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHHhccccHHHHHHHHHH---HHHHhhcccCCC
Q psy5736 269 IKTKLSELNEMFYRGIEQMNTSCFREAVESLTK---FSDQIHELVVPP 313 (346)
Q Consensus 269 ~~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~---~l~~~~~~l~p~ 313 (346)
+.-.++.+...++.|...++.++++.+...+.+ ++..+...|.+.
T Consensus 22 i~~Lyd~ai~~l~~a~~a~~~~~~~~~~~~l~ka~~Ii~~L~~~Ld~e 69 (122)
T PF02561_consen 22 ILMLYDGAIEFLKQAKEAIEQGDIEEKNEALQKAQDIITELQSSLDFE 69 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHHHHHHTCCTT
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHhhcCCC
Confidence 444456667777788888899998876555544 445555556554
No 308
>KOG0282|consensus
Probab=20.54 E-value=74 Score=32.45 Aligned_cols=37 Identities=22% Similarity=0.402 Sum_probs=30.5
Q ss_pred CCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736 29 WARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA 66 (346)
Q Consensus 29 w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~ 66 (346)
|..|..-+++++++- -..-|+|.|+.||.|.+|.-|.
T Consensus 210 ~~gH~kgvsai~~fp-~~~hLlLS~gmD~~vklW~vy~ 246 (503)
T KOG0282|consen 210 LSGHTKGVSAIQWFP-KKGHLLLSGGMDGLVKLWNVYD 246 (503)
T ss_pred ccCCccccchhhhcc-ceeeEEEecCCCceEEEEEEec
Confidence 788888899999886 3466789999999999996553
No 309
>PRK00415 rps27e 30S ribosomal protein S27e; Reviewed
Probab=20.40 E-value=70 Score=23.13 Aligned_cols=33 Identities=18% Similarity=0.511 Sum_probs=21.7
Q ss_pred cCccccccCCcCcCCcc-cccccCCcccccCcccccccCCcC
Q psy5736 224 KYPIRIRCSNDNCGQII-ATVKKLEPSAKKVEKKCESCNSTS 264 (346)
Q Consensus 224 ~~~~~~~C~~~~C~g~~-~~~~~~~~~~~~~~~~C~~C~~~~ 264 (346)
.++..++|| .|.+.- ..+.. .....|..||..+
T Consensus 7 S~F~~VkCp--~C~n~q~vFsha------~t~V~C~~Cg~~L 40 (59)
T PRK00415 7 SRFLKVKCP--DCGNEQVVFSHA------STVVRCLVCGKTL 40 (59)
T ss_pred CeEEEEECC--CCCCeEEEEecC------CcEEECcccCCCc
Confidence 456788999 897542 22322 2348899999876
No 310
>COG2178 Predicted RNA-binding protein of the translin family [Translation, ribosomal structure and biogenesis]
Probab=20.33 E-value=5.8e+02 Score=23.14 Aligned_cols=47 Identities=19% Similarity=0.174 Sum_probs=31.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHH---HHHhhcccCC
Q psy5736 266 LTEIKTKLSELNEMFYRGIEQMNTSCFREAVESLTKF---SDQIHELVVP 312 (346)
Q Consensus 266 ~~~~~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~---l~~~~~~l~p 312 (346)
.+++.+...++..+-..++.++-.|++++|...+.++ ++.+++.+.+
T Consensus 19 REE~l~lsRei~r~s~~aI~~~H~~~~eeA~~~l~~a~~~v~~Lk~~l~~ 68 (204)
T COG2178 19 REEALKLSREIVRLSGEAIFLLHRGDFEEAEKKLKKASEAVEKLKRLLAG 68 (204)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 4556666667777777888889999999886666554 4555544444
No 311
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=20.32 E-value=1.4e+02 Score=26.55 Aligned_cols=39 Identities=15% Similarity=0.566 Sum_probs=18.5
Q ss_pred HHHHHhhCCCceecCCCCCCCccccccccCccccccCCcCcCCccc
Q psy5736 196 QKVLSARYWFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQIIA 241 (346)
Q Consensus 196 ~~~L~~~~~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~ 241 (346)
.+...++-+|.| +.|.-.....+.+ ...|.|| .|++.+.
T Consensus 105 le~~~~~~~y~C--~~~~~r~sfdeA~---~~~F~Cp--~Cg~~L~ 143 (176)
T COG1675 105 LEKETENNYYVC--PNCHVKYSFDEAM---ELGFTCP--KCGEDLE 143 (176)
T ss_pred HHhhccCCceeC--CCCCCcccHHHHH---HhCCCCC--CCCchhh
Confidence 334445566666 5555443221111 2446666 5655443
No 312
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=20.19 E-value=4e+02 Score=24.80 Aligned_cols=57 Identities=18% Similarity=0.165 Sum_probs=39.9
Q ss_pred HHHHHHHHHHhccc-cHHHHHHHHHHHHHHhhc-----ccCCCc-HhHHHHHHHHHHHHHHhCC
Q psy5736 277 NEMFYRGIEQMNTS-CFREAVESLTKFSDQIHE-----LVVPPY-KLASLAHEALRNCWSLAGN 333 (346)
Q Consensus 277 ~~~~~~a~~~~~~~-~~~~ai~~l~~~l~~~~~-----~l~p~h-~~~~~~~~~L~~~y~~~G~ 333 (346)
.-.|.-+..+++.+ ++++|+.++++.++.+.+ -.++.- -....+...|+++|...+.
T Consensus 36 ~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~l~~~~~~~~~~~~~~elr~~iL~~La~~~l~~~~ 99 (278)
T PF08631_consen 36 RVCYNIGKSLLSKKDKYEEAVKWLQRAYDILEKPGKMDKLSPDGSELRLSILRLLANAYLEWDT 99 (278)
T ss_pred HHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHhhhhccccCCcHHHHHHHHHHHHHHHHHcCCC
Confidence 44567788888888 999999999999988754 233322 2455566677777766543
No 313
>PF10122 Mu-like_Com: Mu-like prophage protein Com; InterPro: IPR019294 Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ].
Probab=20.10 E-value=43 Score=23.45 Aligned_cols=31 Identities=29% Similarity=0.787 Sum_probs=17.5
Q ss_pred cccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736 227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST 263 (346)
Q Consensus 227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~ 263 (346)
..+||+ .|...+.-...- ....++|+.|+..
T Consensus 3 ~eiRC~--~CnklLa~~g~~----~~leIKCpRC~ti 33 (51)
T PF10122_consen 3 KEIRCG--HCNKLLAKAGEV----IELEIKCPRCKTI 33 (51)
T ss_pred cceecc--chhHHHhhhcCc----cEEEEECCCCCcc
Confidence 456787 786554432111 1124788888874
Done!