Query psy2347
Match_columns 207
No_of_seqs 422 out of 2324
Neff 7.9
Searched_HMMs 46136
Date Fri Aug 16 22:41:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2347.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2347hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03134 glycine-rich RNA-bind 99.9 1.8E-21 3.9E-26 149.9 15.1 84 5-88 30-118 (144)
2 TIGR01659 sex-lethal sex-letha 99.8 1.1E-17 2.3E-22 145.7 14.0 81 8-88 192-279 (346)
3 KOG0107|consensus 99.8 6.2E-18 1.3E-22 130.9 10.9 85 7-91 8-92 (195)
4 TIGR01659 sex-lethal sex-letha 99.8 6.3E-18 1.4E-22 147.1 11.0 80 5-84 103-187 (346)
5 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.7 2E-17 4.2E-22 143.7 12.2 78 9-86 269-351 (352)
6 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.7 3.8E-17 8.3E-22 141.9 10.9 80 7-86 1-85 (352)
7 KOG0125|consensus 99.7 2.2E-17 4.7E-22 138.7 8.7 83 5-87 92-177 (376)
8 PF00076 RRM_1: RNA recognitio 99.7 4E-17 8.8E-22 109.1 7.7 66 12-77 1-70 (70)
9 TIGR01648 hnRNP-R-Q heterogene 99.7 7.5E-16 1.6E-20 141.2 17.5 79 6-87 230-310 (578)
10 PLN03120 nucleic acid binding 99.7 6.9E-16 1.5E-20 127.8 11.0 74 9-83 4-79 (260)
11 KOG0122|consensus 99.7 6.4E-16 1.4E-20 125.4 9.6 79 7-85 187-270 (270)
12 KOG0149|consensus 99.6 2.7E-16 6E-21 126.9 7.2 75 8-83 11-90 (247)
13 KOG0148|consensus 99.6 6.1E-16 1.3E-20 127.2 9.2 83 4-87 159-241 (321)
14 KOG0121|consensus 99.6 3.8E-16 8.1E-21 115.5 6.5 78 5-82 32-114 (153)
15 KOG0117|consensus 99.6 5E-16 1.1E-20 135.0 7.4 81 4-87 254-334 (506)
16 KOG0105|consensus 99.6 1.8E-15 3.9E-20 118.4 9.7 77 7-83 4-82 (241)
17 PF14259 RRM_6: RNA recognitio 99.6 3.1E-15 6.8E-20 100.6 7.3 66 12-77 1-70 (70)
18 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.6 7.9E-15 1.7E-19 133.1 12.0 78 7-84 273-351 (481)
19 PLN03121 nucleic acid binding 99.6 9.1E-15 2E-19 119.6 10.7 76 6-82 2-79 (243)
20 KOG0113|consensus 99.6 7.5E-15 1.6E-19 122.1 10.2 80 5-84 97-181 (335)
21 TIGR01628 PABP-1234 polyadenyl 99.6 4.7E-15 1E-19 136.7 10.0 81 6-86 282-366 (562)
22 KOG4207|consensus 99.6 5.5E-15 1.2E-19 117.5 8.8 78 6-83 10-92 (256)
23 smart00362 RRM_2 RNA recogniti 99.6 1.3E-14 2.7E-19 95.9 9.3 69 11-79 1-72 (72)
24 TIGR01645 half-pint poly-U bin 99.6 9E-15 1.9E-19 134.6 11.0 80 8-87 203-287 (612)
25 TIGR01648 hnRNP-R-Q heterogene 99.6 7.5E-15 1.6E-19 134.7 10.3 75 6-80 55-134 (578)
26 TIGR01642 U2AF_lg U2 snRNP aux 99.6 2.2E-14 4.7E-19 130.5 12.4 78 7-84 293-375 (509)
27 TIGR01645 half-pint poly-U bin 99.6 1.1E-14 2.3E-19 134.1 10.1 78 5-82 103-185 (612)
28 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.6 1.4E-14 3E-19 131.5 10.8 76 8-84 1-78 (481)
29 KOG0130|consensus 99.6 9.7E-15 2.1E-19 109.0 7.6 79 8-86 71-154 (170)
30 PLN03213 repressor of silencin 99.6 1.4E-14 3.1E-19 127.6 9.9 79 5-83 6-87 (759)
31 TIGR01628 PABP-1234 polyadenyl 99.6 2.2E-14 4.7E-19 132.3 10.8 75 10-84 1-80 (562)
32 TIGR01622 SF-CC1 splicing fact 99.5 3.5E-14 7.5E-19 127.6 11.4 76 8-83 185-265 (457)
33 KOG0117|consensus 99.5 4E-14 8.6E-19 123.3 10.2 78 5-82 79-162 (506)
34 KOG0144|consensus 99.5 9.3E-15 2E-19 126.7 5.8 81 8-88 123-210 (510)
35 TIGR01622 SF-CC1 splicing fact 99.5 5.9E-14 1.3E-18 126.1 10.9 78 6-84 86-168 (457)
36 KOG0109|consensus 99.5 1.5E-14 3.2E-19 120.1 6.4 73 10-85 3-75 (346)
37 smart00360 RRM RNA recognition 99.5 9.2E-14 2E-18 91.4 8.6 66 14-79 1-71 (71)
38 KOG0111|consensus 99.5 1E-14 2.2E-19 117.0 4.7 82 4-85 5-91 (298)
39 cd00590 RRM RRM (RNA recogniti 99.5 1.5E-13 3.3E-18 91.1 9.6 70 11-80 1-74 (74)
40 KOG0131|consensus 99.5 4.7E-14 1E-18 110.1 7.1 80 3-82 3-87 (203)
41 KOG0114|consensus 99.5 2.1E-13 4.6E-18 97.5 8.4 77 7-83 16-94 (124)
42 COG0724 RNA-binding proteins ( 99.5 2.9E-13 6.2E-18 110.7 10.6 74 9-82 115-193 (306)
43 KOG0148|consensus 99.5 9.3E-14 2E-18 114.4 7.5 78 9-86 62-144 (321)
44 PF13893 RRM_5: RNA recognitio 99.5 3.1E-13 6.7E-18 87.2 8.3 56 26-81 1-56 (56)
45 KOG4212|consensus 99.5 3.3E-13 7.2E-18 117.5 10.4 79 6-84 41-124 (608)
46 KOG0144|consensus 99.5 1.2E-13 2.7E-18 119.8 7.7 81 5-85 30-118 (510)
47 KOG0109|consensus 99.4 3.3E-13 7.2E-18 112.1 6.6 82 6-90 75-156 (346)
48 KOG0145|consensus 99.4 8.3E-13 1.8E-17 108.4 8.7 82 6-87 38-124 (360)
49 KOG0146|consensus 99.4 2.3E-13 4.9E-18 112.1 5.1 82 7-88 283-369 (371)
50 KOG0415|consensus 99.4 3.8E-13 8.3E-18 114.4 6.5 80 4-83 234-318 (479)
51 KOG0127|consensus 99.4 1E-12 2.2E-17 117.1 9.1 79 7-85 115-197 (678)
52 KOG0108|consensus 99.4 9.3E-13 2E-17 117.0 8.5 77 10-86 19-100 (435)
53 KOG0116|consensus 99.4 3.6E-12 7.8E-17 112.8 11.9 79 8-87 287-370 (419)
54 TIGR01642 U2AF_lg U2 snRNP aux 99.4 2.2E-12 4.7E-17 117.4 9.7 76 5-82 171-258 (509)
55 KOG0145|consensus 99.4 2.8E-12 6.1E-17 105.4 9.2 77 9-85 278-359 (360)
56 KOG0126|consensus 99.3 1.7E-13 3.6E-18 107.1 -0.3 76 7-82 33-113 (219)
57 KOG0127|consensus 99.3 6.9E-12 1.5E-16 111.8 9.3 78 8-85 291-379 (678)
58 KOG0123|consensus 99.3 6.1E-12 1.3E-16 110.4 8.4 76 11-87 78-156 (369)
59 KOG0132|consensus 99.3 5.2E-12 1.1E-16 116.3 8.1 80 5-85 417-496 (894)
60 KOG4206|consensus 99.3 1.3E-11 2.9E-16 99.4 8.6 82 4-85 4-91 (221)
61 smart00361 RRM_1 RNA recogniti 99.3 1.7E-11 3.7E-16 82.8 7.6 57 23-79 2-70 (70)
62 KOG0153|consensus 99.3 1.8E-11 3.9E-16 104.0 8.7 76 7-83 226-302 (377)
63 KOG0131|consensus 99.2 2.2E-11 4.8E-16 95.3 7.3 85 5-89 92-182 (203)
64 KOG4205|consensus 99.2 1.1E-11 2.4E-16 106.0 5.5 82 8-90 96-182 (311)
65 KOG4205|consensus 99.2 8.6E-12 1.9E-16 106.7 4.8 79 8-87 5-88 (311)
66 KOG4212|consensus 99.2 2.2E-11 4.8E-16 106.2 7.3 78 4-81 531-608 (608)
67 KOG0147|consensus 99.2 3E-11 6.6E-16 107.7 6.5 77 10-86 279-360 (549)
68 KOG0146|consensus 99.2 2.6E-11 5.6E-16 100.1 5.6 80 8-87 18-104 (371)
69 KOG0124|consensus 99.2 2.1E-11 4.5E-16 104.4 4.2 73 9-81 113-190 (544)
70 KOG0110|consensus 99.1 1.3E-10 2.9E-15 106.3 8.6 73 10-82 516-596 (725)
71 KOG4661|consensus 99.1 1E-10 2.2E-15 105.0 7.5 76 8-83 404-484 (940)
72 KOG1457|consensus 99.1 3.7E-10 8E-15 91.1 9.3 85 6-90 31-124 (284)
73 KOG0533|consensus 99.1 8.5E-10 1.8E-14 91.2 9.1 80 7-86 81-164 (243)
74 KOG4454|consensus 99.1 6.3E-11 1.4E-15 95.2 2.3 81 1-81 1-84 (267)
75 KOG4211|consensus 99.0 1.1E-09 2.4E-14 96.8 8.9 79 3-83 4-85 (510)
76 KOG4208|consensus 99.0 1.4E-09 3E-14 86.7 8.1 78 7-84 47-130 (214)
77 KOG0110|consensus 99.0 3.1E-10 6.6E-15 104.0 4.9 75 9-83 613-692 (725)
78 KOG0106|consensus 98.9 9.9E-10 2.1E-14 89.1 5.3 73 10-85 2-74 (216)
79 KOG0123|consensus 98.9 1E-09 2.2E-14 96.5 5.6 82 5-86 266-351 (369)
80 KOG0124|consensus 98.9 3.9E-09 8.4E-14 90.6 7.1 77 8-84 209-290 (544)
81 KOG1548|consensus 98.9 8E-09 1.7E-13 88.0 8.5 76 7-82 132-219 (382)
82 KOG4660|consensus 98.9 1.9E-09 4.1E-14 96.6 4.2 71 7-77 73-143 (549)
83 KOG4209|consensus 98.8 5.6E-09 1.2E-13 86.2 6.4 82 4-86 96-182 (231)
84 PF04059 RRM_2: RNA recognitio 98.8 3.2E-08 7E-13 70.9 8.5 75 10-84 2-87 (97)
85 KOG1995|consensus 98.7 4.7E-08 1E-12 83.7 7.6 81 6-86 63-156 (351)
86 KOG0151|consensus 98.6 5.6E-08 1.2E-12 89.4 6.9 78 6-83 171-256 (877)
87 KOG1190|consensus 98.6 1.2E-07 2.5E-12 82.6 8.4 75 9-83 297-372 (492)
88 KOG4206|consensus 98.6 2.2E-07 4.9E-12 75.1 8.3 80 3-82 140-220 (221)
89 KOG4211|consensus 98.6 1E-07 2.2E-12 84.6 6.6 74 8-82 102-180 (510)
90 KOG0106|consensus 98.6 3.4E-08 7.4E-13 80.2 3.3 72 7-81 97-168 (216)
91 PF08777 RRM_3: RNA binding mo 98.5 1.5E-07 3.3E-12 68.5 5.2 71 10-81 2-77 (105)
92 PF11608 Limkain-b1: Limkain b 98.5 4.5E-07 9.8E-12 62.7 6.7 68 10-82 3-75 (90)
93 KOG1457|consensus 98.5 1.1E-07 2.3E-12 77.0 4.3 63 9-71 210-273 (284)
94 KOG0120|consensus 98.4 2.4E-07 5.2E-12 83.5 4.5 80 4-83 284-368 (500)
95 KOG0226|consensus 98.4 2E-07 4.3E-12 76.7 3.6 79 7-85 188-271 (290)
96 KOG1855|consensus 98.3 3.6E-07 7.9E-12 79.9 3.6 64 7-70 229-310 (484)
97 KOG1456|consensus 98.3 3.9E-06 8.4E-11 72.6 9.4 79 6-84 284-363 (494)
98 KOG1456|consensus 98.3 3.2E-06 7E-11 73.1 8.5 81 4-84 115-199 (494)
99 KOG1190|consensus 98.3 2.7E-06 5.8E-11 74.3 7.1 78 6-83 411-490 (492)
100 PF14605 Nup35_RRM_2: Nup53/35 98.2 3.7E-06 7.9E-11 53.7 5.6 52 10-63 2-53 (53)
101 KOG0105|consensus 98.2 1.7E-05 3.6E-10 62.8 10.1 64 8-73 114-177 (241)
102 PF05172 Nup35_RRM: Nup53/35/4 98.2 1.1E-05 2.3E-10 58.3 7.6 74 8-83 5-91 (100)
103 KOG2202|consensus 98.1 7E-07 1.5E-11 73.6 0.8 58 25-82 84-146 (260)
104 KOG1548|consensus 98.1 2.2E-05 4.8E-10 67.3 9.4 78 6-83 262-351 (382)
105 KOG4307|consensus 98.1 9.2E-06 2E-10 75.1 7.1 74 7-80 864-943 (944)
106 KOG0147|consensus 98.1 8.5E-06 1.9E-10 73.4 6.4 59 24-82 468-526 (549)
107 PF08952 DUF1866: Domain of un 98.0 3.4E-05 7.4E-10 59.0 8.0 76 5-84 23-107 (146)
108 KOG4210|consensus 98.0 3.3E-06 7.1E-11 72.0 2.6 79 8-87 183-267 (285)
109 KOG0129|consensus 98.0 2.7E-05 5.8E-10 69.9 8.2 63 3-65 364-432 (520)
110 KOG0129|consensus 98.0 2.8E-05 6.1E-10 69.7 7.8 59 7-66 257-326 (520)
111 COG5175 MOT2 Transcriptional r 97.9 2.2E-05 4.7E-10 67.3 6.4 73 10-82 115-201 (480)
112 KOG4849|consensus 97.9 1.4E-05 3E-10 68.7 3.7 71 9-79 80-157 (498)
113 KOG0120|consensus 97.8 5.7E-05 1.2E-09 68.4 7.2 57 26-82 426-490 (500)
114 KOG1365|consensus 97.8 3.1E-05 6.6E-10 67.4 5.0 75 9-83 280-361 (508)
115 KOG2314|consensus 97.7 8.6E-05 1.9E-09 67.4 6.8 74 8-81 57-141 (698)
116 KOG3152|consensus 97.7 2.5E-05 5.4E-10 64.5 3.0 68 8-75 73-157 (278)
117 KOG2416|consensus 97.6 5.8E-05 1.2E-09 68.8 4.2 77 6-83 441-521 (718)
118 PF10309 DUF2414: Protein of u 97.6 0.00041 8.9E-09 45.5 7.0 56 8-66 4-62 (62)
119 KOG4676|consensus 97.4 0.00022 4.9E-09 62.2 5.4 70 10-80 8-85 (479)
120 KOG0128|consensus 97.3 7.7E-05 1.7E-09 70.4 1.3 76 9-84 736-815 (881)
121 KOG1996|consensus 97.3 0.00071 1.5E-08 57.2 6.9 60 23-82 300-365 (378)
122 PF08675 RNA_bind: RNA binding 97.3 0.00086 1.9E-08 46.5 5.6 59 6-68 6-64 (87)
123 KOG2591|consensus 97.2 0.00041 8.8E-09 63.0 4.7 74 5-80 171-248 (684)
124 KOG0112|consensus 97.2 0.00044 9.4E-09 65.8 5.0 79 5-84 451-531 (975)
125 KOG0921|consensus 97.2 0.0072 1.6E-07 58.3 12.6 13 183-195 1267-1279(1282)
126 PF03880 DbpA: DbpA RNA bindin 97.2 0.0027 5.8E-08 43.1 7.3 67 11-81 2-74 (74)
127 KOG2193|consensus 97.1 0.00039 8.5E-09 61.3 3.1 76 10-86 2-78 (584)
128 KOG4307|consensus 97.1 0.00029 6.3E-09 65.5 2.3 75 7-81 432-511 (944)
129 KOG1365|consensus 97.0 0.00056 1.2E-08 59.7 3.4 70 10-80 162-239 (508)
130 PF07576 BRAP2: BRCA1-associat 97.0 0.0061 1.3E-07 44.7 8.2 68 6-73 9-81 (110)
131 PF03467 Smg4_UPF3: Smg-4/UPF3 96.8 0.0013 2.8E-08 52.3 3.5 75 7-81 5-95 (176)
132 PF15023 DUF4523: Protein of u 96.8 0.011 2.4E-07 45.2 8.1 74 6-82 83-160 (166)
133 KOG2135|consensus 96.7 0.001 2.2E-08 59.4 2.3 75 8-84 371-446 (526)
134 PF04847 Calcipressin: Calcipr 96.7 0.0084 1.8E-07 47.9 7.3 62 22-84 8-71 (184)
135 KOG0115|consensus 96.6 0.002 4.3E-08 53.5 3.5 72 10-81 32-111 (275)
136 KOG4285|consensus 96.5 0.015 3.3E-07 49.4 8.2 71 10-83 198-269 (350)
137 KOG0112|consensus 96.5 0.00056 1.2E-08 65.2 -0.6 77 6-82 369-449 (975)
138 KOG0128|consensus 96.3 0.00012 2.6E-09 69.2 -5.9 62 9-70 667-733 (881)
139 KOG2068|consensus 96.2 0.0014 3.1E-08 56.2 0.3 75 10-84 78-163 (327)
140 KOG4574|consensus 96.2 0.0045 9.8E-08 58.9 3.5 74 11-85 300-375 (1007)
141 KOG0804|consensus 96.1 0.019 4.1E-07 51.3 6.9 66 8-73 73-142 (493)
142 KOG2253|consensus 96.0 0.0044 9.6E-08 57.4 2.5 74 4-81 35-108 (668)
143 KOG4660|consensus 95.7 0.015 3.3E-07 52.9 4.7 73 10-82 389-471 (549)
144 KOG2318|consensus 95.3 0.11 2.3E-06 48.0 8.5 75 6-80 171-302 (650)
145 PRK11634 ATP-dependent RNA hel 95.2 0.48 1E-05 44.9 13.0 69 11-84 488-563 (629)
146 KOG3973|consensus 95.0 0.24 5.3E-06 43.1 9.4 6 25-30 244-249 (465)
147 PF11767 SET_assoc: Histone ly 94.7 0.22 4.8E-06 33.1 6.8 56 19-78 10-65 (66)
148 KOG4210|consensus 94.5 0.021 4.5E-07 48.8 1.9 76 8-83 87-167 (285)
149 KOG2193|consensus 93.0 0.0033 7.1E-08 55.6 -5.7 76 7-82 78-155 (584)
150 KOG4019|consensus 92.7 0.14 3.1E-06 40.6 3.5 75 9-84 10-90 (193)
151 KOG3973|consensus 92.1 0.82 1.8E-05 39.9 7.6 12 21-32 198-209 (465)
152 KOG4676|consensus 91.2 0.028 6E-07 49.5 -2.1 71 9-80 151-222 (479)
153 KOG3262|consensus 90.6 0.8 1.7E-05 36.5 5.7 6 52-57 84-89 (215)
154 KOG4410|consensus 90.5 0.93 2E-05 38.6 6.3 58 9-67 330-395 (396)
155 KOG4483|consensus 90.3 0.76 1.7E-05 40.8 5.8 55 9-65 391-446 (528)
156 KOG2295|consensus 89.5 0.06 1.3E-06 49.3 -1.5 71 7-77 229-304 (648)
157 PF03468 XS: XS domain; Inter 89.3 0.83 1.8E-05 33.7 4.7 52 11-63 10-74 (116)
158 PF10567 Nab6_mRNP_bdg: RNA-re 88.5 1 2.2E-05 38.4 5.2 75 8-82 14-106 (309)
159 PF00403 HMA: Heavy-metal-asso 85.6 5.8 0.00013 25.0 6.6 54 11-65 1-58 (62)
160 cd06405 PB1_Mekk2_3 The PB1 do 85.0 9.8 0.00021 25.9 7.7 62 16-81 15-77 (79)
161 PF02714 DUF221: Domain of unk 83.1 1.2 2.7E-05 38.2 3.2 33 49-83 1-33 (325)
162 COG0724 RNA-binding proteins ( 82.7 1.6 3.4E-05 34.9 3.6 40 5-44 221-260 (306)
163 PF14893 PNMA: PNMA 82.6 1.1 2.4E-05 39.1 2.8 52 6-57 15-73 (331)
164 PRK10629 EnvZ/OmpR regulon mod 82.1 13 0.00029 27.8 8.0 73 7-82 33-109 (127)
165 KOG2891|consensus 81.6 0.53 1.1E-05 40.0 0.4 37 7-43 147-195 (445)
166 KOG1295|consensus 79.5 2.1 4.7E-05 37.7 3.4 64 8-71 6-77 (376)
167 PRK14548 50S ribosomal protein 77.9 7.7 0.00017 27.0 5.2 55 12-66 23-81 (84)
168 COG5638 Uncharacterized conser 77.8 10 0.00022 34.1 7.1 27 48-74 260-286 (622)
169 PRK08559 nusG transcription an 76.6 10 0.00023 29.1 6.2 34 36-69 36-69 (153)
170 KOG4008|consensus 76.4 2.4 5.2E-05 35.2 2.6 37 4-40 35-71 (261)
171 PF03439 Spt5-NGN: Early trans 75.5 8.5 0.00018 26.5 5.0 36 35-70 33-68 (84)
172 PRK11901 hypothetical protein; 75.3 6.3 0.00014 34.2 5.0 50 20-70 253-308 (327)
173 KOG4365|consensus 74.2 0.34 7.3E-06 43.5 -3.0 73 10-83 4-81 (572)
174 PF07530 PRE_C2HC: Associated 73.8 6.8 0.00015 26.0 3.9 58 24-84 2-65 (68)
175 PF07292 NID: Nmi/IFP 35 domai 73.3 5.6 0.00012 27.9 3.6 32 49-81 1-34 (88)
176 TIGR03636 L23_arch archaeal ri 73.1 9.3 0.0002 26.1 4.6 56 11-66 15-74 (77)
177 PF14111 DUF4283: Domain of un 72.3 1.8 3.8E-05 32.7 0.9 73 10-83 16-91 (153)
178 PF14026 DUF4242: Protein of u 69.6 32 0.00069 23.3 7.3 54 12-65 3-66 (77)
179 KOG4213|consensus 69.1 6.2 0.00013 31.5 3.3 45 21-65 118-169 (205)
180 smart00596 PRE_C2HC PRE_C2HC d 68.8 8.9 0.00019 25.6 3.5 56 24-82 2-63 (69)
181 COG2608 CopZ Copper chaperone 68.4 21 0.00046 23.5 5.5 55 10-65 4-62 (71)
182 PF15513 DUF4651: Domain of un 66.9 13 0.00029 24.2 4.0 20 23-42 8-27 (62)
183 PF07237 DUF1428: Protein of u 65.7 18 0.0004 26.1 4.9 42 25-66 24-85 (103)
184 PF07292 NID: Nmi/IFP 35 domai 65.6 2.5 5.5E-05 29.7 0.5 22 8-29 51-72 (88)
185 PF13037 DUF3898: Domain of un 65.0 6.7 0.00015 27.4 2.5 47 21-67 31-90 (91)
186 COG0079 HisC Histidinol-phosph 62.3 8.5 0.00018 33.9 3.3 49 8-62 145-197 (356)
187 KOG1999|consensus 61.8 25 0.00054 34.9 6.5 33 45-78 209-241 (1024)
188 COG5470 Uncharacterized conser 61.7 18 0.00038 25.8 4.2 40 25-64 24-71 (96)
189 COG3254 Uncharacterized conser 60.9 29 0.00062 25.1 5.2 40 24-64 27-69 (105)
190 PF15063 TC1: Thyroid cancer p 60.3 2.5 5.4E-05 28.7 -0.3 48 12-66 28-78 (79)
191 KOG4454|consensus 59.5 1.9 4.1E-05 35.5 -1.2 66 9-74 80-153 (267)
192 CHL00123 rps6 ribosomal protei 59.3 60 0.0013 22.9 6.8 55 9-65 8-81 (97)
193 COG4010 Uncharacterized protei 59.2 29 0.00064 26.7 5.2 46 16-66 118-163 (170)
194 KOG3262|consensus 59.0 94 0.002 25.0 8.6 16 24-39 89-104 (215)
195 PRK11230 glycolate oxidase sub 57.9 27 0.00058 32.2 5.9 46 22-67 202-255 (499)
196 PF08544 GHMP_kinases_C: GHMP 57.9 52 0.0011 21.7 6.2 42 24-67 37-80 (85)
197 PF08503 DapH_N: Tetrahydrodip 57.4 50 0.0011 22.9 5.8 57 22-79 2-72 (83)
198 smart00666 PB1 PB1 domain. Pho 57.0 38 0.00081 22.5 5.2 54 13-67 13-69 (81)
199 KOG0156|consensus 56.7 27 0.00058 32.2 5.6 59 13-76 36-97 (489)
200 PF10567 Nab6_mRNP_bdg: RNA-re 55.7 91 0.002 26.9 8.1 34 36-69 174-214 (309)
201 COG3102 Uncharacterized protei 55.6 16 0.00035 28.8 3.4 41 47-87 110-154 (185)
202 PRK05772 translation initiatio 54.7 30 0.00066 30.7 5.4 47 21-67 3-57 (363)
203 PRK10905 cell division protein 53.4 24 0.00053 30.6 4.5 48 20-68 255-308 (328)
204 PF09707 Cas_Cas2CT1978: CRISP 52.3 33 0.00072 23.9 4.3 49 5-53 21-71 (86)
205 cd04908 ACT_Bt0572_1 N-termina 51.9 60 0.0013 20.6 8.6 44 22-65 14-59 (66)
206 cd00027 BRCT Breast Cancer Sup 51.7 52 0.0011 20.0 5.0 45 10-57 2-46 (72)
207 cd05992 PB1 The PB1 domain is 51.7 64 0.0014 21.2 5.7 50 17-67 16-69 (81)
208 COG0002 ArgC Acetylglutamate s 50.8 36 0.00078 30.0 5.2 39 17-55 253-302 (349)
209 PF02829 3H: 3H domain; Inter 50.8 50 0.0011 23.6 5.1 51 18-68 6-58 (98)
210 PF10141 ssDNA-exonuc_C: Singl 50.1 38 0.00082 27.1 5.0 58 5-65 48-106 (195)
211 KOG2187|consensus 49.4 14 0.0003 34.3 2.5 39 45-83 62-100 (534)
212 PF11823 DUF3343: Protein of u 49.3 19 0.00041 23.8 2.6 24 48-71 3-26 (73)
213 COG5227 SMT3 Ubiquitin-like pr 49.2 53 0.0011 23.2 4.8 59 9-68 34-100 (103)
214 PF07876 Dabb: Stress responsi 48.8 83 0.0018 21.3 6.1 52 12-63 4-71 (97)
215 PF00564 PB1: PB1 domain; Int 48.6 26 0.00056 23.4 3.3 62 16-80 17-81 (84)
216 PF00398 RrnaAD: Ribosomal RNA 47.9 27 0.00059 29.1 4.0 24 8-31 96-119 (262)
217 PF09869 DUF2096: Uncharacteri 47.7 1.1E+02 0.0024 24.1 6.9 51 11-67 114-164 (169)
218 PF08156 NOP5NT: NOP5NT (NUC12 47.6 6.1 0.00013 26.2 -0.0 39 24-67 27-65 (67)
219 PF08734 GYD: GYD domain; Int 47.3 95 0.0021 21.6 6.3 44 23-66 22-67 (91)
220 KOG3432|consensus 46.7 34 0.00074 25.0 3.7 25 18-42 42-66 (121)
221 PF09250 Prim-Pol: Bifunctiona 45.7 45 0.00097 25.1 4.6 36 21-57 32-67 (163)
222 PF12829 Mhr1: Transcriptional 45.4 33 0.00071 24.2 3.4 51 17-67 20-72 (91)
223 cd01201 Neurobeachin Neurobeac 44.6 1.2E+02 0.0026 22.1 7.2 52 16-67 53-106 (108)
224 PHA01632 hypothetical protein 44.6 25 0.00054 22.5 2.4 19 14-32 21-39 (64)
225 cd04904 ACT_AAAH ACT domain of 44.4 90 0.002 20.5 6.4 47 22-68 13-65 (74)
226 cd06410 PB1_UP2 Uncharacterize 44.3 1.1E+02 0.0025 21.6 6.3 61 17-78 29-94 (97)
227 PF11411 DNA_ligase_IV: DNA li 43.3 19 0.00041 20.9 1.6 16 19-34 19-34 (36)
228 PTZ00191 60S ribosomal protein 42.9 54 0.0012 25.2 4.5 52 13-64 85-140 (145)
229 cd04889 ACT_PDH-BS-like C-term 42.7 76 0.0017 19.2 5.7 40 24-63 13-55 (56)
230 PF14581 SseB_C: SseB protein 42.4 47 0.001 23.6 4.0 49 7-55 3-59 (108)
231 COG5507 Uncharacterized conser 42.3 24 0.00051 25.4 2.3 20 47-66 67-86 (117)
232 KOG0633|consensus 42.3 34 0.00073 29.3 3.6 56 8-69 160-219 (375)
233 PF05036 SPOR: Sporulation rel 42.3 9.8 0.00021 24.6 0.3 16 52-67 50-65 (76)
234 cd00412 pyrophosphatase Inorga 42.2 25 0.00054 27.3 2.6 39 22-66 115-153 (155)
235 COG0150 PurM Phosphoribosylami 42.1 23 0.00051 31.0 2.7 47 23-69 275-322 (345)
236 PF13193 AMP-binding_C: AMP-bi 41.7 69 0.0015 20.7 4.5 47 25-72 1-54 (73)
237 PF09162 Tap-RNA_bind: Tap, RN 41.6 76 0.0017 22.2 4.8 60 21-82 18-81 (88)
238 COG0858 RbfA Ribosome-binding 41.1 37 0.00081 25.0 3.3 33 36-68 36-71 (118)
239 COG0030 KsgA Dimethyladenosine 41.1 42 0.0009 28.4 4.0 45 10-65 96-140 (259)
240 PF06897 DUF1269: Protein of u 40.8 29 0.00063 24.9 2.6 14 21-34 66-79 (102)
241 PRK12450 foldase protein PrsA; 40.3 59 0.0013 28.0 4.9 39 20-67 132-170 (309)
242 KOG0524|consensus 39.5 54 0.0012 28.1 4.4 64 4-69 192-261 (359)
243 PF02033 RBFA: Ribosome-bindin 39.3 40 0.00086 23.9 3.2 34 35-68 30-65 (104)
244 COG5353 Uncharacterized protei 39.0 98 0.0021 24.0 5.3 51 9-59 87-155 (161)
245 KOG3424|consensus 38.4 71 0.0015 23.8 4.3 42 19-61 33-84 (132)
246 COG5193 LHP1 La protein, small 38.3 13 0.00027 33.3 0.5 55 10-64 175-244 (438)
247 TIGR00387 glcD glycolate oxida 38.2 68 0.0015 28.7 5.2 48 20-67 143-198 (413)
248 cd07990 LPLAT_LCLAT1-like Lyso 37.5 2E+02 0.0043 22.4 7.4 61 7-67 22-99 (193)
249 TIGR00405 L26e_arch ribosomal 37.5 1.3E+02 0.0029 22.4 6.1 31 38-68 30-60 (145)
250 PRK11700 hypothetical protein; 37.3 76 0.0017 25.4 4.7 20 18-37 48-67 (187)
251 TIGR00082 rbfA ribosome-bindin 37.2 46 0.001 24.3 3.3 34 35-68 34-69 (114)
252 TIGR00587 nfo apurinic endonuc 37.2 59 0.0013 27.2 4.4 57 9-66 137-201 (274)
253 COG0225 MsrA Peptide methionin 37.1 51 0.0011 26.1 3.6 72 11-82 59-136 (174)
254 PF01071 GARS_A: Phosphoribosy 36.8 94 0.002 25.0 5.3 44 22-66 25-70 (194)
255 cd04931 ACT_PAH ACT domain of 36.8 1.4E+02 0.0031 20.7 7.6 52 16-67 20-79 (90)
256 PF13820 Nucleic_acid_bd: Puta 36.7 63 0.0014 24.9 4.1 58 11-68 6-67 (149)
257 PTZ00481 Membrane attack compl 36.6 34 0.00075 31.1 2.9 51 14-65 210-274 (524)
258 COG0360 RpsF Ribosomal protein 36.6 1.1E+02 0.0023 22.4 5.1 51 12-64 6-75 (112)
259 PLN02805 D-lactate dehydrogena 36.4 94 0.002 29.2 5.9 47 21-67 278-332 (555)
260 PF11491 DUF3213: Protein of u 35.9 26 0.00057 24.3 1.6 67 12-80 3-73 (88)
261 PRK06545 prephenate dehydrogen 35.6 2.1E+02 0.0045 25.0 7.8 63 5-68 286-353 (359)
262 KOG3671|consensus 35.5 49 0.0011 30.6 3.7 50 20-69 89-138 (569)
263 KOG0226|consensus 35.3 13 0.00028 31.3 0.1 70 10-79 97-173 (290)
264 PTZ00146 fibrillarin; Provisio 35.0 1.3E+02 0.0029 25.9 6.1 8 192-199 107-114 (293)
265 COG3444 Phosphotransferase sys 34.9 1.8E+02 0.0038 22.8 6.3 23 45-67 75-97 (159)
266 PRK01178 rps24e 30S ribosomal 34.7 1.4E+02 0.0031 21.3 5.4 42 20-62 30-81 (99)
267 COG3797 Uncharacterized protei 34.5 2.1E+02 0.0045 22.8 6.6 78 7-84 41-144 (178)
268 COG1058 CinA Predicted nucleot 34.3 1.1E+02 0.0024 25.8 5.5 59 9-68 180-244 (255)
269 PF13721 SecD-TM1: SecD export 34.2 1.7E+02 0.0037 20.8 7.5 59 7-68 29-91 (101)
270 PF14528 LAGLIDADG_3: LAGLIDAD 33.9 1.3E+02 0.0029 19.5 6.6 41 24-65 35-75 (77)
271 PF08206 OB_RNB: Ribonuclease 33.9 16 0.00035 23.2 0.3 37 45-82 7-44 (58)
272 PF08502 LeuA_dimer: LeuA allo 33.6 1.4E+02 0.003 22.0 5.5 50 20-69 1-74 (133)
273 TIGR01638 Atha_cystat_rel Arab 33.6 65 0.0014 22.8 3.4 25 36-60 34-63 (92)
274 PF13046 DUF3906: Protein of u 33.4 58 0.0012 21.4 2.9 31 21-51 30-63 (64)
275 PF08002 DUF1697: Protein of u 33.1 2.1E+02 0.0045 21.4 6.9 32 10-42 4-39 (137)
276 cd04917 ACT_AKiii-LysC-EC_2 AC 32.6 1.1E+02 0.0023 19.2 4.2 16 54-69 46-61 (64)
277 PF13689 DUF4154: Domain of un 32.6 84 0.0018 23.6 4.2 35 46-81 26-60 (145)
278 PRK04405 prsA peptidylprolyl i 32.6 93 0.002 26.6 4.9 39 20-67 128-166 (298)
279 PLN02707 Soluble inorganic pyr 32.5 22 0.00047 30.2 1.0 42 23-68 207-249 (267)
280 PTZ00071 40S ribosomal protein 32.3 1.5E+02 0.0032 22.4 5.4 41 20-61 35-86 (132)
281 KOG3938|consensus 32.1 2.7E+02 0.0059 23.8 7.3 53 34-86 125-213 (334)
282 KOG2135|consensus 32.0 20 0.00043 32.8 0.7 53 19-73 207-259 (526)
283 PF02426 MIase: Muconolactone 31.9 1.8E+02 0.004 20.4 7.6 56 16-71 10-76 (91)
284 PRK13014 methionine sulfoxide 31.8 1.5E+02 0.0033 23.7 5.6 71 11-81 61-137 (186)
285 PF05336 DUF718: Domain of unk 31.7 95 0.0021 22.2 4.2 40 23-63 25-67 (106)
286 cd04905 ACT_CM-PDT C-terminal 31.4 1.6E+02 0.0034 19.4 6.1 47 22-68 14-68 (80)
287 KOG2671|consensus 30.9 66 0.0014 28.6 3.7 45 19-63 12-57 (421)
288 PRK12378 hypothetical protein; 30.6 1.7E+02 0.0038 24.3 6.0 56 7-62 89-150 (235)
289 smart00457 MACPF membrane-atta 30.5 1.1E+02 0.0025 24.0 4.8 26 15-40 31-58 (194)
290 CHL00128 psbW photosystem II p 30.5 1.5E+02 0.0032 21.8 4.9 27 36-62 15-44 (113)
291 PRK02302 hypothetical protein; 30.4 2E+02 0.0042 20.2 5.5 54 6-70 6-59 (89)
292 PF01037 AsnC_trans_reg: AsnC 30.3 1.5E+02 0.0032 18.8 8.5 44 22-65 11-55 (74)
293 PRK14054 methionine sulfoxide 30.1 2E+02 0.0043 22.7 6.0 71 11-81 56-132 (172)
294 COG5236 Uncharacterized conser 30.0 1.2E+02 0.0026 26.9 5.0 51 22-75 263-313 (493)
295 PLN02373 soluble inorganic pyr 29.5 65 0.0014 25.9 3.2 37 23-66 138-174 (188)
296 PF08004 DUF1699: Protein of u 29.4 92 0.002 23.4 3.7 36 6-44 16-51 (131)
297 KOG4388|consensus 29.3 87 0.0019 29.9 4.3 58 8-66 788-851 (880)
298 COG1839 Uncharacterized conser 29.1 2.7E+02 0.0059 21.5 6.8 59 6-64 12-73 (162)
299 PF07521 RMMBL: RNA-metabolisi 29.1 88 0.0019 18.4 3.0 33 9-42 6-38 (43)
300 PRK11558 putative ssRNA endonu 29.1 1.1E+02 0.0024 21.8 4.0 51 5-55 23-75 (97)
301 PF14350 Beta_protein: Beta pr 29.1 86 0.0019 27.2 4.2 37 21-60 137-174 (347)
302 KOG3062|consensus 29.0 1.4E+02 0.0029 25.2 5.0 33 12-44 4-39 (281)
303 smart00195 DSPc Dual specifici 28.9 1.4E+02 0.003 21.7 4.8 18 11-28 7-24 (138)
304 PF07045 DUF1330: Protein of u 28.8 1.6E+02 0.0035 18.8 5.4 40 25-64 10-57 (65)
305 KOG0256|consensus 28.8 1E+02 0.0022 28.0 4.6 76 8-86 226-305 (471)
306 KOG2854|consensus 28.3 47 0.001 29.1 2.4 60 6-65 157-231 (343)
307 smart00650 rADc Ribosomal RNA 28.2 89 0.0019 23.8 3.8 24 10-33 78-101 (169)
308 PRK08655 prephenate dehydrogen 28.1 1.1E+02 0.0023 27.7 4.8 26 17-42 369-394 (437)
309 TIGR02542 B_forsyth_147 Bacter 28.1 31 0.00068 25.6 1.1 40 18-57 83-130 (145)
310 PF09122 DUF1930: Domain of un 28.1 1.8E+02 0.0039 19.1 4.5 26 8-33 19-44 (68)
311 KOG3702|consensus 27.8 31 0.00067 32.8 1.2 66 11-77 513-583 (681)
312 PF12993 DUF3877: Domain of un 27.8 1.4E+02 0.003 23.6 4.7 25 19-43 107-131 (175)
313 PTZ00338 dimethyladenosine tra 27.8 66 0.0014 27.5 3.2 27 11-37 103-129 (294)
314 PF03108 DBD_Tnp_Mut: MuDR fam 27.6 68 0.0015 20.6 2.6 16 51-66 8-23 (67)
315 KOG2040|consensus 27.6 23 0.00051 34.0 0.4 56 16-71 460-535 (1001)
316 PF13549 ATP-grasp_5: ATP-gras 27.5 1.5E+02 0.0033 24.3 5.1 48 21-84 10-57 (222)
317 PF02222 ATP-grasp: ATP-grasp 27.3 1.8E+02 0.004 22.7 5.4 63 21-83 15-85 (172)
318 PF14111 DUF4283: Domain of un 27.2 89 0.0019 23.2 3.6 33 12-44 107-140 (153)
319 PF04940 BLUF: Sensors of blue 27.1 2.2E+02 0.0048 19.8 5.3 54 10-65 3-63 (93)
320 KOG4840|consensus 27.1 1.1E+02 0.0023 25.8 4.1 72 7-78 35-112 (299)
321 KOG1635|consensus 27.0 2.3E+02 0.005 22.6 5.7 60 10-69 76-138 (191)
322 PF15643 Tox-PL-2: Papain fold 27.0 52 0.0011 23.6 2.0 29 52-80 18-46 (100)
323 cd04880 ACT_AAAH-PDT-like ACT 26.9 1.8E+02 0.0039 18.7 6.3 47 22-68 12-66 (75)
324 KOG2848|consensus 26.6 3.2E+02 0.007 23.2 6.9 70 5-80 85-168 (276)
325 KOG1232|consensus 26.4 63 0.0014 29.1 2.8 50 15-64 230-287 (511)
326 PRK01688 histidinol-phosphate 26.3 1.1E+02 0.0024 26.4 4.4 51 8-63 145-199 (351)
327 cd04489 ExoVII_LU_OBF ExoVII_L 26.3 23 0.00051 23.2 0.1 21 35-57 7-27 (78)
328 PRK00274 ksgA 16S ribosomal RN 26.2 85 0.0018 26.3 3.6 22 11-32 107-128 (272)
329 PRK04199 rpl10e 50S ribosomal 25.9 3.4E+02 0.0073 21.5 7.0 57 8-65 80-151 (172)
330 cd04859 Prim_Pol Prim_Pol: Pri 25.9 2.4E+02 0.0052 21.0 5.8 42 20-62 24-66 (152)
331 PF01823 MACPF: MAC/Perforin d 25.8 42 0.0009 26.4 1.5 26 14-39 53-81 (212)
332 COG1098 VacB Predicted RNA bin 25.7 1.5E+02 0.0032 22.3 4.2 48 33-80 5-60 (129)
333 TIGR00854 pts-sorbose PTS syst 25.7 2.4E+02 0.0051 21.6 5.7 19 46-64 75-93 (151)
334 PRK13022 secF preprotein trans 25.5 2.2E+02 0.0048 24.2 6.0 51 16-67 48-101 (289)
335 PF04127 DFP: DNA / pantothena 25.2 2.3E+02 0.0051 22.4 5.7 56 11-66 20-79 (185)
336 PF11080 DUF2622: Protein of u 25.2 2.6E+02 0.0056 19.9 7.0 69 12-81 11-86 (96)
337 PF14268 YoaP: YoaP-like 25.1 51 0.0011 20.0 1.4 35 48-82 2-38 (44)
338 cd04882 ACT_Bt0572_2 C-termina 25.1 1.7E+02 0.0037 17.8 5.8 42 24-65 14-59 (65)
339 PRK13817 ribosome-binding fact 24.9 66 0.0014 23.7 2.3 21 34-54 32-53 (119)
340 cd06404 PB1_aPKC PB1 domain is 24.9 2.4E+02 0.0053 19.5 6.4 55 11-67 10-69 (83)
341 PF05711 TylF: Macrocin-O-meth 24.8 49 0.0011 27.7 1.8 58 21-78 140-210 (248)
342 PHA03008 hypothetical protein; 24.6 1.1E+02 0.0023 25.0 3.6 37 7-43 19-55 (234)
343 KOG0862|consensus 24.4 61 0.0013 26.6 2.2 12 45-56 108-119 (216)
344 PRK10773 murF UDP-N-acetylmura 24.4 2.8E+02 0.006 25.0 6.8 57 20-78 298-358 (453)
345 cd06301 PBP1_rhizopine_binding 24.3 2E+02 0.0044 23.0 5.5 22 46-67 96-117 (272)
346 COG4733 Phage-related protein, 24.3 95 0.0021 30.7 3.8 71 6-79 172-244 (952)
347 PRK14173 bifunctional 5,10-met 24.3 2.5E+02 0.0053 24.1 6.0 24 11-34 60-83 (287)
348 PF04865 Baseplate_J: Baseplat 24.3 3.6E+02 0.0079 21.7 7.0 31 14-44 112-143 (243)
349 PF12419 DUF3670: SNF2 Helicas 24.2 2.5E+02 0.0054 21.1 5.5 50 9-68 74-123 (141)
350 PF10281 Ish1: Putative stress 24.2 78 0.0017 18.1 2.1 18 20-37 3-20 (38)
351 PF15407 Spo7_2_N: Sporulation 24.2 15 0.00033 24.3 -1.1 27 5-31 23-49 (67)
352 PRK14176 bifunctional 5,10-met 24.0 2.5E+02 0.0055 24.1 6.0 24 11-34 69-92 (287)
353 PRK10792 bifunctional 5,10-met 24.0 2.4E+02 0.0053 24.1 5.9 24 11-34 64-87 (285)
354 PRK00059 prsA peptidylprolyl i 23.9 91 0.002 26.8 3.4 46 20-67 173-218 (336)
355 COG0217 Uncharacterized conser 23.8 2.8E+02 0.0061 23.2 6.0 54 9-62 94-155 (241)
356 PTZ00136 eukaryotic translatio 23.8 2.2E+02 0.0048 23.9 5.5 49 17-65 72-131 (247)
357 PRK13883 conjugal transfer pro 23.8 1.8E+02 0.0039 22.5 4.6 38 21-58 36-88 (151)
358 PHA00742 hypothetical protein 23.6 50 0.0011 26.2 1.5 26 47-72 133-158 (211)
359 CHL00100 ilvH acetohydroxyacid 23.5 3.7E+02 0.0079 21.2 6.5 59 23-81 57-125 (174)
360 TIGR00755 ksgA dimethyladenosi 23.5 97 0.0021 25.5 3.4 23 11-33 96-118 (253)
361 cd06314 PBP1_tmGBP Periplasmic 23.4 2.8E+02 0.0062 22.2 6.2 21 46-66 93-113 (271)
362 PF00719 Pyrophosphatase: Inor 23.4 1.3E+02 0.0028 23.3 3.8 38 23-66 113-150 (156)
363 KOG0738|consensus 23.4 1.9E+02 0.004 26.4 5.1 23 7-29 348-373 (491)
364 TIGR03380 agmatine_aguA agmati 23.2 1.7E+02 0.0036 26.0 4.9 27 16-42 172-198 (357)
365 cd01676 RNR_II_monomer Class I 23.2 1.8E+02 0.0039 28.0 5.4 60 8-70 569-634 (658)
366 COG1207 GlmU N-acetylglucosami 23.2 2.3E+02 0.005 25.9 5.8 58 11-68 99-173 (460)
367 PRK14168 bifunctional 5,10-met 23.0 3.1E+02 0.0067 23.7 6.4 24 11-34 64-87 (297)
368 cd04883 ACT_AcuB C-terminal AC 23.0 2E+02 0.0044 18.0 8.2 44 22-65 14-63 (72)
369 PRK00033 clpS ATP-dependent Cl 23.0 71 0.0015 22.9 2.1 25 45-69 67-91 (100)
370 PF04800 ETC_C1_NDUFA4: ETC co 22.9 1.5E+02 0.0033 21.2 3.8 20 49-69 51-70 (101)
371 PF13145 Rotamase_2: PPIC-type 22.9 35 0.00075 23.8 0.5 13 21-33 1-13 (121)
372 PF10041 DUF2277: Uncharacteri 22.8 41 0.00089 22.9 0.8 20 14-33 8-27 (78)
373 PF09341 Pcc1: Transcription f 22.8 98 0.0021 20.5 2.7 20 48-67 4-23 (76)
374 PF07693 KAP_NTPase: KAP famil 22.8 87 0.0019 26.4 3.0 40 46-85 203-250 (325)
375 PRK14887 KEOPS complex Pcc1-li 22.8 2.6E+02 0.0057 19.2 5.0 23 46-68 5-27 (84)
376 COG0318 CaiC Acyl-CoA syntheta 22.8 3.4E+02 0.0073 24.8 7.1 47 19-65 432-491 (534)
377 PF08002 DUF1697: Protein of u 22.7 1.7E+02 0.0037 21.9 4.3 30 47-76 100-129 (137)
378 PF05727 UPF0228: Uncharacteri 22.6 1.6E+02 0.0035 22.1 3.9 51 21-81 64-123 (127)
379 PF09902 DUF2129: Uncharacteri 22.5 2.3E+02 0.005 18.9 4.4 38 29-70 16-53 (71)
380 COG5594 Uncharacterized integr 22.5 96 0.0021 30.5 3.4 22 45-66 356-377 (827)
381 cd00495 Ribosomal_L25_TL5_CTC 22.4 1.6E+02 0.0034 20.2 3.8 53 16-80 36-89 (91)
382 KOG1975|consensus 22.4 1.5E+02 0.0033 26.2 4.3 39 19-63 179-220 (389)
383 PLN02655 ent-kaurene oxidase 22.3 2E+02 0.0044 25.7 5.5 49 13-65 9-60 (466)
384 PF04765 DUF616: Protein of un 22.2 3.6E+02 0.0079 23.4 6.6 48 13-61 44-105 (305)
385 PRK14188 bifunctional 5,10-met 22.1 2.9E+02 0.0064 23.7 6.1 24 11-34 63-86 (296)
386 TIGR00401 msrA methionine-S-su 21.9 3.6E+02 0.0079 20.6 6.0 71 11-81 53-129 (149)
387 PF14267 DUF4357: Domain of un 21.8 1.9E+02 0.004 18.3 3.6 14 51-64 23-36 (55)
388 PRK02230 inorganic pyrophospha 21.7 1.7E+02 0.0036 23.5 4.2 37 23-66 118-155 (184)
389 PF09623 Cas_NE0113: CRISPR-as 21.7 1.4E+02 0.003 24.7 3.8 44 38-81 72-120 (224)
390 PTZ00372 endonuclease 4-like p 21.6 1.9E+02 0.0041 26.2 5.0 58 9-67 268-333 (413)
391 PRK11370 YciI-like protein; Re 21.5 2.2E+02 0.0048 19.8 4.5 26 47-73 59-84 (99)
392 cd04879 ACT_3PGDH-like ACT_3PG 21.5 2E+02 0.0044 17.4 4.6 34 21-54 11-46 (71)
393 PRK11611 enhanced serine sensi 21.5 54 0.0012 27.5 1.4 80 5-84 125-215 (246)
394 PRK14093 UDP-N-acetylmuramoyla 21.3 3.9E+02 0.0085 24.2 7.2 60 20-79 307-375 (479)
395 PF11181 YflT: Heat induced st 21.3 86 0.0019 22.1 2.3 29 51-79 4-32 (103)
396 PRK12864 YciI-like protein; Re 21.2 2.8E+02 0.0061 18.9 6.6 28 46-74 49-76 (89)
397 PRK01250 inorganic pyrophospha 21.1 1.6E+02 0.0035 23.3 4.0 39 22-66 131-169 (176)
398 COG0769 MurE UDP-N-acetylmuram 21.1 4.3E+02 0.0092 24.3 7.3 60 19-79 304-365 (475)
399 COG0379 NadA Quinolinate synth 21.0 2.3E+02 0.005 24.7 5.2 60 5-65 87-155 (324)
400 PRK00110 hypothetical protein; 21.0 4.9E+02 0.011 21.7 7.1 55 8-64 93-156 (245)
401 PRK13818 ribosome-binding fact 20.9 1.4E+02 0.003 22.0 3.4 21 34-54 32-53 (121)
402 TIGR01873 cas_CT1978 CRISPR-as 20.8 2E+02 0.0044 20.1 4.0 51 5-55 21-74 (87)
403 PF04008 Adenosine_kin: Adenos 20.7 2.2E+02 0.0048 22.0 4.5 30 6-35 5-34 (155)
404 TIGR01033 DNA-binding regulato 20.7 2.9E+02 0.0062 23.0 5.6 46 7-54 92-143 (238)
405 PF08152 GUCT: GUCT (NUC152) d 20.7 36 0.00077 24.1 0.2 39 24-65 36-77 (97)
406 cd00001 PTS_IIB_man PTS_IIB, P 20.7 3.3E+02 0.0072 20.8 5.6 19 46-64 74-92 (151)
407 PLN02968 Probable N-acetyl-gam 20.7 2.1E+02 0.0046 25.4 5.1 31 11-42 280-311 (381)
408 cd01611 GABARAP Ubiquitin doma 20.7 2.1E+02 0.0045 20.8 4.2 17 17-33 47-63 (112)
409 PRK00642 inorganic pyrophospha 20.7 93 0.002 25.3 2.6 42 22-67 156-197 (205)
410 KOG4357|consensus 20.6 3.8E+02 0.0082 20.2 5.8 25 48-72 115-139 (164)
411 COG1308 EGD2 Transcription fac 20.6 2.1E+02 0.0045 21.3 4.2 14 55-68 98-111 (122)
412 cd01778 RASSF1_RA Ubiquitin-li 20.6 2.5E+02 0.0053 20.0 4.4 38 7-52 16-53 (96)
413 PHA02898 virion envelope prote 20.6 18 0.00039 25.3 -1.3 19 47-65 27-46 (92)
414 PF14844 PH_BEACH: PH domain a 20.5 1.9E+02 0.0041 20.2 4.0 22 45-66 84-105 (106)
415 PF12623 Hen1_L: RNA repair, l 20.5 2.1E+02 0.0045 23.9 4.5 56 10-65 119-182 (245)
416 TIGR01499 folC folylpolyglutam 20.5 5E+02 0.011 22.7 7.5 47 21-67 252-299 (397)
417 COG0045 SucC Succinyl-CoA synt 20.4 5.5E+02 0.012 23.1 7.5 63 21-83 26-97 (387)
418 PF12385 Peptidase_C70: Papain 20.3 2E+02 0.0043 22.6 4.2 19 20-38 95-113 (166)
419 PRK00343 ipk 4-diphosphocytidy 20.2 2.4E+02 0.0052 23.6 5.2 39 25-67 222-260 (271)
420 KOG2916|consensus 20.2 2.7E+02 0.0057 23.8 5.2 46 37-83 179-224 (304)
421 COG2242 CobL Precorrin-6B meth 20.2 4.6E+02 0.01 21.0 7.6 66 8-86 102-167 (187)
422 KOG0677|consensus 20.2 4.2E+02 0.0091 22.8 6.4 30 5-34 97-129 (389)
423 COG1710 Uncharacterized protei 20.2 2.4E+02 0.0052 21.0 4.4 38 3-43 22-59 (139)
424 cd06257 DnaJ DnaJ domain or J- 20.1 96 0.0021 18.5 2.1 19 15-33 6-24 (55)
425 PTZ00380 microtubule-associate 20.1 1.7E+02 0.0037 21.8 3.7 16 17-32 47-62 (121)
426 PF01782 RimM: RimM N-terminal 20.1 1.8E+02 0.0038 19.4 3.6 23 46-69 54-76 (84)
427 KOG3373|consensus 20.1 2E+02 0.0044 22.6 4.2 37 32-68 78-118 (172)
No 1
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.88 E-value=1.8e-21 Score=149.87 Aligned_cols=84 Identities=27% Similarity=0.486 Sum_probs=77.4
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
....+++|||.|||+++||++|+++|++|++|++|+|++| +++||||+|+++++|++||+.||+++|++++|+|+
T Consensus 30 ~~~~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV~F~~~e~A~~Al~~lng~~i~Gr~l~V~ 109 (144)
T PLN03134 30 LRLMSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKELNGRHIRVN 109 (144)
T ss_pred ccCCCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHcCCCEECCEEEEEE
Confidence 3456789999999999999999999999999999999987 68999999999999999999999999999999999
Q ss_pred EeecCCCCC
Q psy2347 80 ISRGRGRGR 88 (207)
Q Consensus 80 ~a~~k~~~~ 88 (207)
++++++...
T Consensus 110 ~a~~~~~~~ 118 (144)
T PLN03134 110 PANDRPSAP 118 (144)
T ss_pred eCCcCCCCC
Confidence 998776543
No 2
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.76 E-value=1.1e-17 Score=145.67 Aligned_cols=81 Identities=27% Similarity=0.525 Sum_probs=73.0
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCC--cEEEEEE
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMG--SKLRVEI 80 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g--~~l~V~~ 80 (207)
+.++|||.|||+++||++|+++|++|++|+.|+|+.| +++||||+|+++++|++||++||+++|.+ ++|+|++
T Consensus 192 ~~~~lfV~nLp~~vtee~L~~~F~~fG~V~~v~i~~d~~tg~~kG~aFV~F~~~e~A~~Ai~~lng~~~~g~~~~l~V~~ 271 (346)
T TIGR01659 192 KDTNLYVTNLPRTITDDQLDTIFGKYGQIVQKNILRDKLTGTPRGVAFVRFNKREEAQEAISALNNVIPEGGSQPLTVRL 271 (346)
T ss_pred ccceeEEeCCCCcccHHHHHHHHHhcCCEEEEEEeecCCCCccceEEEEEECCHHHHHHHHHHhCCCccCCCceeEEEEE
Confidence 4678999999999999999999999999999999987 56899999999999999999999998876 6899999
Q ss_pred eecCCCCC
Q psy2347 81 SRGRGRGR 88 (207)
Q Consensus 81 a~~k~~~~ 88 (207)
++++.+.+
T Consensus 272 a~~~~~~~ 279 (346)
T TIGR01659 272 AEEHGKAK 279 (346)
T ss_pred CCcccccc
Confidence 88765443
No 3
>KOG0107|consensus
Probab=99.76 E-value=6.2e-18 Score=130.94 Aligned_cols=85 Identities=52% Similarity=0.885 Sum_probs=78.7
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecCCC
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGRGR 86 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k~~ 86 (207)
+-.++|||+||+..+++.||+.+|..|++|..|.|...+.+||||||+++.+|++|+..|++++|++..|+|++++-+++
T Consensus 8 ~~~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvArnPPGfAFVEFed~RDA~DAvr~LDG~~~cG~r~rVE~S~G~~r 87 (195)
T KOG0107|consen 8 NGNTKVYVGNLGSRATKRELERAFSKYGPLRSVWVARNPPGFAFVEFEDPRDAEDAVRYLDGKDICGSRIRVELSTGRPR 87 (195)
T ss_pred CCCceEEeccCCCCcchHHHHHHHHhcCcceeEEEeecCCCceEEeccCcccHHHHHhhcCCccccCceEEEEeecCCcc
Confidence 45789999999999999999999999999999999988999999999999999999999999999999999999988776
Q ss_pred CCCCC
Q psy2347 87 GRGGG 91 (207)
Q Consensus 87 ~~~~g 91 (207)
...++
T Consensus 88 ~~r~g 92 (195)
T KOG0107|consen 88 GSRRG 92 (195)
T ss_pred ccccC
Confidence 54433
No 4
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.75 E-value=6.3e-18 Score=147.10 Aligned_cols=80 Identities=24% Similarity=0.428 Sum_probs=74.3
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
.....++|||++||+++||++|+++|++|++|++|+|++| +++||||+|+++++|++||++||+++|.+++|+|+
T Consensus 103 ~~~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~~tg~srGyaFVeF~~~e~A~~Ai~~LnG~~l~gr~i~V~ 182 (346)
T TIGR01659 103 TNNSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDYKTGYSFGYAFVDFGSEADSQRAIKNLNGITVRNKRLKVS 182 (346)
T ss_pred CCCCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCccCcEEEEEEccHHHHHHHHHHcCCCccCCceeeee
Confidence 3456789999999999999999999999999999999987 67999999999999999999999999999999999
Q ss_pred EeecC
Q psy2347 80 ISRGR 84 (207)
Q Consensus 80 ~a~~k 84 (207)
++++.
T Consensus 183 ~a~p~ 187 (346)
T TIGR01659 183 YARPG 187 (346)
T ss_pred ccccc
Confidence 98763
No 5
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.74 E-value=2e-17 Score=143.71 Aligned_cols=78 Identities=24% Similarity=0.314 Sum_probs=74.2
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
..+|||.|||+++++++|+++|++||.|++|+|++| +++||||+|++.++|++||++||+.+|.+++|+|.++.+
T Consensus 269 ~~~lfV~NL~~~~~e~~L~~~F~~fG~v~~v~i~~d~~t~~skG~aFV~F~~~~~A~~Ai~~lnG~~~~gr~i~V~~~~~ 348 (352)
T TIGR01661 269 GYCIFVYNLSPDTDETVLWQLFGPFGAVQNVKIIRDLTTNQCKGYGFVSMTNYDEAAMAILSLNGYTLGNRVLQVSFKTN 348 (352)
T ss_pred CcEEEEeCCCCCCCHHHHHHHHHhCCCeEEEEEeEcCCCCCccceEEEEECCHHHHHHHHHHhCCCEECCeEEEEEEccC
Confidence 447999999999999999999999999999999987 699999999999999999999999999999999999998
Q ss_pred CCC
Q psy2347 84 RGR 86 (207)
Q Consensus 84 k~~ 86 (207)
|+.
T Consensus 349 ~~~ 351 (352)
T TIGR01661 349 KAY 351 (352)
T ss_pred CCC
Confidence 865
No 6
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.72 E-value=3.8e-17 Score=141.87 Aligned_cols=80 Identities=26% Similarity=0.427 Sum_probs=74.7
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
++.++|||+|||.++||++|+++|++||+|++|+|++| +++||||+|+++++|++||+.||+..|.+++|+|+++
T Consensus 1 ~~~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~~~g~s~g~afV~f~~~~~A~~Ai~~l~g~~l~g~~i~v~~a 80 (352)
T TIGR01661 1 ESKTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDKVTGQSLGYGFVNYVRPEDAEKAVNSLNGLRLQNKTIKVSYA 80 (352)
T ss_pred CCCcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcCCCCccceEEEEEECcHHHHHHHHhhcccEEECCeeEEEEee
Confidence 35789999999999999999999999999999999987 6799999999999999999999999999999999999
Q ss_pred ecCCC
Q psy2347 82 RGRGR 86 (207)
Q Consensus 82 ~~k~~ 86 (207)
+++..
T Consensus 81 ~~~~~ 85 (352)
T TIGR01661 81 RPSSD 85 (352)
T ss_pred ccccc
Confidence 87653
No 7
>KOG0125|consensus
Probab=99.71 E-value=2.2e-17 Score=138.68 Aligned_cols=83 Identities=30% Similarity=0.457 Sum_probs=77.0
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
..+..++|+|+|||+...|.||+.+|++||+|.+|+|+.+ +|+|+||+|++.++|++|-++||+..|++|+|+|..|
T Consensus 92 s~~~pkRLhVSNIPFrFRdpDL~aMF~kfG~VldVEIIfNERGSKGFGFVTmen~~dadRARa~LHgt~VEGRkIEVn~A 171 (376)
T KOG0125|consen 92 SKDTPKRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEIIFNERGSKGFGFVTMENPADADRARAELHGTVVEGRKIEVNNA 171 (376)
T ss_pred CCCCCceeEeecCCccccCccHHHHHHhhCceeeEEEEeccCCCCccceEEecChhhHHHHHHHhhcceeeceEEEEecc
Confidence 3455689999999999999999999999999999999988 9999999999999999999999999999999999999
Q ss_pred ecCCCC
Q psy2347 82 RGRGRG 87 (207)
Q Consensus 82 ~~k~~~ 87 (207)
+++-..
T Consensus 172 TarV~n 177 (376)
T KOG0125|consen 172 TARVHN 177 (376)
T ss_pred chhhcc
Confidence 876443
No 8
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.71 E-value=4e-17 Score=109.05 Aligned_cols=66 Identities=42% Similarity=0.712 Sum_probs=63.0
Q ss_pred EEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 12 VYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
|||+|||+++|+++|+++|++|++|+.++++.+ ++++|||+|+++++|++|++.|+++.|.+++|+
T Consensus 1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~a~V~F~~~~~a~~a~~~l~g~~~~~~~ir 70 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRNSSGKSKGYAFVEFESEEDAEKALEELNGKKINGRKIR 70 (70)
T ss_dssp EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEETTSSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred cEEcCCCCcCCHHHHHHHHHHhhhcccccccccccccccceEEEEEcCHHHHHHHHHHcCCCEECccCcC
Confidence 799999999999999999999999999999985 679999999999999999999999999999885
No 9
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.70 E-value=7.5e-16 Score=141.22 Aligned_cols=79 Identities=32% Similarity=0.503 Sum_probs=73.0
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhC--CCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKF--GKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~--G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
+.+.++|||+||++++||++|+++|++| ++|++|++++ .||||+|+++++|++||++||+++|.+++|+|++++|
T Consensus 230 ~~~~k~LfVgNL~~~~tee~L~~~F~~f~~G~I~rV~~~r---gfAFVeF~s~e~A~kAi~~lnG~~i~Gr~I~V~~Akp 306 (578)
T TIGR01648 230 MAKVKILYVRNLMTTTTEEIIEKSFSEFKPGKVERVKKIR---DYAFVHFEDREDAVKAMDELNGKELEGSEIEVTLAKP 306 (578)
T ss_pred cccccEEEEeCCCCCCCHHHHHHHHHhcCCCceEEEEeec---CeEEEEeCCHHHHHHHHHHhCCCEECCEEEEEEEccC
Confidence 4556899999999999999999999999 9999998875 4999999999999999999999999999999999998
Q ss_pred CCCC
Q psy2347 84 RGRG 87 (207)
Q Consensus 84 k~~~ 87 (207)
+...
T Consensus 307 ~~~~ 310 (578)
T TIGR01648 307 VDKK 310 (578)
T ss_pred CCcc
Confidence 7554
No 10
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.66 E-value=6.9e-16 Score=127.83 Aligned_cols=74 Identities=16% Similarity=0.218 Sum_probs=70.2
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
.++|||+|||+.+||++|+++|+.||+|++|+|+.+ +++||||+|+++++|++||. |++..|.++.|+|.++..
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~~~~GfAFVtF~d~eaAe~All-LnG~~l~gr~V~Vt~a~~ 79 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSENERSQIAYVTFKDPQGAETALL-LSGATIVDQSVTITPAED 79 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecCCCCCEEEEEeCcHHHHHHHHH-hcCCeeCCceEEEEeccC
Confidence 579999999999999999999999999999999988 67999999999999999996 899999999999999864
No 11
>KOG0122|consensus
Probab=99.65 E-value=6.4e-16 Score=125.37 Aligned_cols=79 Identities=34% Similarity=0.505 Sum_probs=75.2
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
+..++|-|.||+++++|++|+++|.+|+.|..|.|..| +|+||||+|++.++|++||+.||++-++...|+|+|+
T Consensus 187 ~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK~TG~~kGFAFVtF~sRddA~rAI~~LnG~gyd~LILrvEws 266 (270)
T KOG0122|consen 187 DDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDKETGLSKGFAFVTFESRDDAARAIADLNGYGYDNLILRVEWS 266 (270)
T ss_pred CccceeEEecCccccChhHHHHHhhccCccceeEEEEccccCcccceEEEEEecHHHHHHHHHHccCcccceEEEEEEec
Confidence 45678999999999999999999999999999999998 8999999999999999999999999999999999999
Q ss_pred ecCC
Q psy2347 82 RGRG 85 (207)
Q Consensus 82 ~~k~ 85 (207)
+|+.
T Consensus 267 kP~~ 270 (270)
T KOG0122|consen 267 KPSN 270 (270)
T ss_pred CCCC
Confidence 9863
No 12
>KOG0149|consensus
Probab=99.65 E-value=2.7e-16 Score=126.93 Aligned_cols=75 Identities=29% Similarity=0.491 Sum_probs=69.7
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
.-+||||++|+|+++.++|+++|++||+|++..|++| +|+|+||+|.|.|+|++||+. ..-.|++|+..|.+|.
T Consensus 11 ~~TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~~t~rskGyGfVTf~d~~aa~rAc~d-p~piIdGR~aNcnlA~ 89 (247)
T KOG0149|consen 11 TFTKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDKNTGRSKGYGFVTFRDAEAATRACKD-PNPIIDGRKANCNLAS 89 (247)
T ss_pred eEEEEEEcCcccccchHHHHHHHHHhCceEEEEEEeccCCccccceeeEEeecHHHHHHHhcC-CCCcccccccccchhh
Confidence 5679999999999999999999999999999999999 899999999999999999987 5678999999999875
Q ss_pred c
Q psy2347 83 G 83 (207)
Q Consensus 83 ~ 83 (207)
-
T Consensus 90 l 90 (247)
T KOG0149|consen 90 L 90 (247)
T ss_pred h
Confidence 3
No 13
>KOG0148|consensus
Probab=99.65 E-value=6.1e-16 Score=127.18 Aligned_cols=83 Identities=27% Similarity=0.448 Sum_probs=77.6
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
+.++.+++|||+||+..+||++|+++|++||+|.+|+|.++ ++||||.|+++|+|.+||..||+.+|.++.|+|.|-+.
T Consensus 159 Qssp~NtsVY~G~I~~~lte~~mr~~Fs~fG~I~EVRvFk~-qGYaFVrF~tkEaAahAIv~mNntei~G~~VkCsWGKe 237 (321)
T KOG0148|consen 159 QSSPDNTSVYVGNIASGLTEDLMRQTFSPFGPIQEVRVFKD-QGYAFVRFETKEAAAHAIVQMNNTEIGGQLVRCSWGKE 237 (321)
T ss_pred cCCCCCceEEeCCcCccccHHHHHHhcccCCcceEEEEecc-cceEEEEecchhhHHHHHHHhcCceeCceEEEEecccc
Confidence 45678899999999999999999999999999999999999 99999999999999999999999999999999999877
Q ss_pred CCCC
Q psy2347 84 RGRG 87 (207)
Q Consensus 84 k~~~ 87 (207)
....
T Consensus 238 ~~~~ 241 (321)
T KOG0148|consen 238 GDDG 241 (321)
T ss_pred CCCC
Confidence 5443
No 14
>KOG0121|consensus
Probab=99.64 E-value=3.8e-16 Score=115.46 Aligned_cols=78 Identities=27% Similarity=0.480 Sum_probs=72.7
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
+..++++|||+||+..++|++|.++|+++|+|..|.+-.| +.+||||+|.+.++|+.||+-|++..|+.++|+|.
T Consensus 32 a~r~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpCGFCFVeyy~~~dA~~AlryisgtrLddr~ir~D 111 (153)
T KOG0121|consen 32 ALRKSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPCGFCFVEYYSRDDAEDALRYISGTRLDDRPIRID 111 (153)
T ss_pred HHhhcceEEEeeeeeeecHHHHHHHHHhccchheeEeccccCCcCccceEEEEEecchhHHHHHHHhccCcccccceeee
Confidence 4567899999999999999999999999999999987666 78999999999999999999999999999999999
Q ss_pred Eee
Q psy2347 80 ISR 82 (207)
Q Consensus 80 ~a~ 82 (207)
|.-
T Consensus 112 ~D~ 114 (153)
T KOG0121|consen 112 WDA 114 (153)
T ss_pred ccc
Confidence 874
No 15
>KOG0117|consensus
Probab=99.63 E-value=5e-16 Score=134.97 Aligned_cols=81 Identities=30% Similarity=0.472 Sum_probs=76.0
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
+.+.+.+.|||.||+.+|||+.|+++|++|+.|++|+.++| ||||.|.+.++|.+||+.||+++|++..|+|.+|+|
T Consensus 254 d~ms~VKvLYVRNL~~~tTeE~lk~~F~~~G~veRVkk~rD---YaFVHf~eR~davkAm~~~ngkeldG~~iEvtLAKP 330 (506)
T KOG0117|consen 254 DTMSKVKVLYVRNLMESTTEETLKKLFNEFGKVERVKKPRD---YAFVHFAEREDAVKAMKETNGKELDGSPIEVTLAKP 330 (506)
T ss_pred hhhhheeeeeeeccchhhhHHHHHHHHHhccceEEeecccc---eeEEeecchHHHHHHHHHhcCceecCceEEEEecCC
Confidence 46678899999999999999999999999999999999877 999999999999999999999999999999999998
Q ss_pred CCCC
Q psy2347 84 RGRG 87 (207)
Q Consensus 84 k~~~ 87 (207)
..+.
T Consensus 331 ~~k~ 334 (506)
T KOG0117|consen 331 VDKK 334 (506)
T ss_pred hhhh
Confidence 6544
No 16
>KOG0105|consensus
Probab=99.63 E-value=1.8e-15 Score=118.42 Aligned_cols=77 Identities=38% Similarity=0.670 Sum_probs=71.6
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
..+++|||+|||.++.|.||+++|.+|+.|.+|+|... +..||||+|+++.+|+.||..-++..+++..|+|+++..
T Consensus 4 r~~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r~g~ppfafVeFEd~RDAeDAiygRdGYdydg~rLRVEfprg 82 (241)
T KOG0105|consen 4 RNSRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNRPGPPPFAFVEFEDPRDAEDAIYGRDGYDYDGCRLRVEFPRG 82 (241)
T ss_pred cccceEEecCCCcchhhccHHHHHhhhcceEEEEeccCCCCCCeeEEEecCccchhhhhhcccccccCcceEEEEeccC
Confidence 45789999999999999999999999999999998766 678999999999999999999999999999999999854
No 17
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.60 E-value=3.1e-15 Score=100.55 Aligned_cols=66 Identities=41% Similarity=0.672 Sum_probs=60.4
Q ss_pred EEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 12 VYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
|||.|||+++++++|+++|+.++.|+.|+++.+ .+++|||+|.++++|++|++.+++++|++++|+
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~~~~~~~~~~~a~v~f~~~~~a~~al~~~~~~~~~g~~l~ 70 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLIKNKDGQSRGFAFVEFSSEEDAKRALELLNGKEIDGRKLR 70 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTSSBEEEEEEEESTTSSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred CEEeCCCCCCCHHHHHHHHHhcCCcceEEEEeeeccccCCEEEEEeCCHHHHHHHHHHCCCcEECCEEcC
Confidence 799999999999999999999999999999987 369999999999999999999988999999885
No 18
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.60 E-value=7.9e-15 Score=133.08 Aligned_cols=78 Identities=27% Similarity=0.351 Sum_probs=73.7
Q ss_pred CCCcEEEEcCCCc-CCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 7 ERGTRVYVGGLTE-TVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 7 ~~~~~lfV~nLp~-~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
+++++|||.|||+ .+|+++|+++|++||+|++|+|++++++||||+|+++++|++||+.||++.|.+++|+|++++.+
T Consensus 273 ~~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~~~~g~afV~f~~~~~A~~Ai~~lng~~l~g~~l~v~~s~~~ 351 (481)
T TIGR01649 273 GPGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMKNKKETALIEMADPYQAQLALTHLNGVKLFGKPLRVCPSKQQ 351 (481)
T ss_pred CCCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEeCCCCEEEEEECCHHHHHHHHHHhCCCEECCceEEEEEcccc
Confidence 4678999999998 69999999999999999999999988899999999999999999999999999999999998664
No 19
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.59 E-value=9.1e-15 Score=119.57 Aligned_cols=76 Identities=16% Similarity=0.188 Sum_probs=70.0
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
.+...+|||.||++.+||++|+++|+.||+|++|+|+.| .++||||+|+++++|+.||. |++.+|.+++|.|..+.
T Consensus 2 ~~~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D~et~gfAfVtF~d~~aaetAll-LnGa~l~d~~I~It~~~ 79 (243)
T PLN03121 2 YPGGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRSGEYACTAYVTFKDAYALETAVL-LSGATIVDQRVCITRWG 79 (243)
T ss_pred CCCceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecCCCcceEEEEEECCHHHHHHHHh-cCCCeeCCceEEEEeCc
Confidence 345689999999999999999999999999999999998 56899999999999999995 89999999999998754
No 20
>KOG0113|consensus
Probab=99.59 E-value=7.5e-15 Score=122.14 Aligned_cols=80 Identities=25% Similarity=0.525 Sum_probs=75.2
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
.-++-+||||.-|+.+++|.+|+..|++||+|+.|.|+.| +++||||+|+++.++++|.+..++++|+++.|.|.
T Consensus 97 ~gDPy~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~vTgkskGYAFIeye~erdm~~AYK~adG~~Idgrri~VD 176 (335)
T KOG0113|consen 97 IGDPYKTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDKVTGKSKGYAFIEYEHERDMKAAYKDADGIKIDGRRILVD 176 (335)
T ss_pred cCCccceeeeeeccccccHHHHHHHHHhcCcceeEEEeeecccCCccceEEEEeccHHHHHHHHHhccCceecCcEEEEE
Confidence 4467899999999999999999999999999999999998 89999999999999999999999999999999999
Q ss_pred EeecC
Q psy2347 80 ISRGR 84 (207)
Q Consensus 80 ~a~~k 84 (207)
+...+
T Consensus 177 vERgR 181 (335)
T KOG0113|consen 177 VERGR 181 (335)
T ss_pred ecccc
Confidence 87643
No 21
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.59 E-value=4.7e-15 Score=136.70 Aligned_cols=81 Identities=32% Similarity=0.507 Sum_probs=75.5
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
....++|||.||++++|+++|+++|++||+|++|+|+.| +++||||+|+++++|++||+.||++.|.+++|.|.+|
T Consensus 282 ~~~~~~l~V~nl~~~~~~~~L~~~F~~~G~i~~~~i~~d~~g~~~g~gfV~f~~~~~A~~A~~~~~g~~~~gk~l~V~~a 361 (562)
T TIGR01628 282 KAQGVNLYVKNLDDTVTDEKLRELFSECGEITSAKVMLDEKGVSRGFGFVCFSNPEEANRAVTEMHGRMLGGKPLYVALA 361 (562)
T ss_pred ccCCCEEEEeCCCCccCHHHHHHHHHhcCCeEEEEEEECCCCCcCCeEEEEeCCHHHHHHHHHHhcCCeeCCceeEEEec
Confidence 345678999999999999999999999999999999987 6899999999999999999999999999999999999
Q ss_pred ecCCC
Q psy2347 82 RGRGR 86 (207)
Q Consensus 82 ~~k~~ 86 (207)
+.+..
T Consensus 362 ~~k~~ 366 (562)
T TIGR01628 362 QRKEQ 366 (562)
T ss_pred cCcHH
Confidence 87653
No 22
>KOG4207|consensus
Probab=99.59 E-value=5.5e-15 Score=117.46 Aligned_cols=78 Identities=38% Similarity=0.598 Sum_probs=73.4
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
++-..+|.|-||...||.++|+.+|++||.|.+|.|+.| +++||||.|.+..+|++||++|++.+|+++.|.|++
T Consensus 10 v~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr~Tr~sRgFaFVrf~~k~daedA~damDG~~ldgRelrVq~ 89 (256)
T KOG4207|consen 10 VEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDRYTRQSRGFAFVRFHDKRDAEDALDAMDGAVLDGRELRVQM 89 (256)
T ss_pred cccceeEEecceeccCCHHHHHHHHHHhCcccceecccccccccccceeEEEeeecchHHHHHHhhcceeeccceeeehh
Confidence 344578999999999999999999999999999999998 899999999999999999999999999999999999
Q ss_pred eec
Q psy2347 81 SRG 83 (207)
Q Consensus 81 a~~ 83 (207)
|+-
T Consensus 90 ary 92 (256)
T KOG4207|consen 90 ARY 92 (256)
T ss_pred hhc
Confidence 874
No 23
>smart00362 RRM_2 RNA recognition motif.
Probab=99.59 E-value=1.3e-14 Score=95.94 Aligned_cols=69 Identities=45% Similarity=0.707 Sum_probs=64.4
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
+|||.+||.++++++|+++|++|++|+.+.++.+ ++++|||+|.++++|++|++.|++..|.+++|+|+
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~~~~~~~~~~v~f~~~~~a~~a~~~~~~~~~~~~~i~v~ 72 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKDTGKSKGFAFVEFESEEDAEKAIEALNGTKLGGRPLRVE 72 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecCCCCCCceEEEEeCCHHHHHHHHHHhCCcEECCEEEeeC
Confidence 5899999999999999999999999999999877 36999999999999999999999999999998874
No 24
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.58 E-value=9e-15 Score=134.57 Aligned_cols=80 Identities=29% Similarity=0.417 Sum_probs=74.5
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
..++|||+|||+++++++|+++|++||+|++|+|++| +++||||+|+++++|++||+.||+.+|.++.|+|.++.
T Consensus 203 ~~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~~tgksKGfGFVeFe~~e~A~kAI~amNg~elgGr~LrV~kAi 282 (612)
T TIGR01645 203 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDLGGQYLRVGKCV 282 (612)
T ss_pred ccceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecCCCCCcCCeEEEEECCHHHHHHHHHHhCCCeeCCeEEEEEecC
Confidence 4579999999999999999999999999999999987 68999999999999999999999999999999999998
Q ss_pred cCCCC
Q psy2347 83 GRGRG 87 (207)
Q Consensus 83 ~k~~~ 87 (207)
+++..
T Consensus 283 ~pP~~ 287 (612)
T TIGR01645 283 TPPDA 287 (612)
T ss_pred CCccc
Confidence 75543
No 25
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.58 E-value=7.5e-15 Score=134.69 Aligned_cols=75 Identities=29% Similarity=0.511 Sum_probs=67.0
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeC-CcEEEEEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLM-GSKLRVEI 80 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~-g~~l~V~~ 80 (207)
.+..++|||+|||++++|++|+++|++|++|.+|+|++| +++||||+|+++|+|++||+.||+.+|. ++.|.|.+
T Consensus 55 p~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D~sG~sRGfaFV~F~~~e~A~~Ai~~lng~~i~~Gr~l~V~~ 134 (578)
T TIGR01648 55 PGRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMDFSGQNRGYAFVTFCGKEEAKEAVKLLNNYEIRPGRLLGVCI 134 (578)
T ss_pred CCCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEECCCCCccceEEEEeCCHHHHHHHHHHcCCCeecCCccccccc
Confidence 345689999999999999999999999999999999987 7899999999999999999999998885 56655543
No 26
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.57 E-value=2.2e-14 Score=130.49 Aligned_cols=78 Identities=23% Similarity=0.425 Sum_probs=72.9
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
++.++|||+|||+.+|+++|+++|++||.|+.|.|+++ +++||||+|+++++|++||+.||++.|.+++|.|+++
T Consensus 293 ~~~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~~~g~~~g~afv~f~~~~~a~~A~~~l~g~~~~~~~l~v~~a 372 (509)
T TIGR01642 293 DSKDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDIATGLSKGYAFCEYKDPSVTDVAIAALNGKDTGDNKLHVQRA 372 (509)
T ss_pred CCCCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecCCCCCcCeEEEEEECCHHHHHHHHHHcCCCEECCeEEEEEEC
Confidence 45679999999999999999999999999999999886 6899999999999999999999999999999999998
Q ss_pred ecC
Q psy2347 82 RGR 84 (207)
Q Consensus 82 ~~k 84 (207)
...
T Consensus 373 ~~~ 375 (509)
T TIGR01642 373 CVG 375 (509)
T ss_pred ccC
Confidence 653
No 27
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.57 E-value=1.1e-14 Score=134.12 Aligned_cols=78 Identities=29% Similarity=0.531 Sum_probs=72.5
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
.....++|||+|||++++|++|+++|++||+|++|+|+.| +++||||+|+++++|++||+.||++.|.+++|+|.
T Consensus 103 a~~~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~~TgkskGfAFVeF~s~e~A~~Ai~~lnG~~i~GR~IkV~ 182 (612)
T TIGR01645 103 ALAIMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQMLGGRNIKVG 182 (612)
T ss_pred hhcCCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecCCCCCcCCeEEEEeCcHHHHHHHHHhcCCeEEecceeeec
Confidence 3456789999999999999999999999999999999887 78999999999999999999999999999999998
Q ss_pred Eee
Q psy2347 80 ISR 82 (207)
Q Consensus 80 ~a~ 82 (207)
+..
T Consensus 183 rp~ 185 (612)
T TIGR01645 183 RPS 185 (612)
T ss_pred ccc
Confidence 654
No 28
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.57 E-value=1.4e-14 Score=131.48 Aligned_cols=76 Identities=22% Similarity=0.208 Sum_probs=70.6
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc--CCceeCCcEEEEEEeecC
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM--NDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l--~g~~i~g~~l~V~~a~~k 84 (207)
++++|||+|||+++||++|+++|++||+|++|.|+++ +++|||+|+++++|++||+.| +...|.+++|.|++++.+
T Consensus 1 ps~vv~V~nLp~~~te~~L~~~f~~fG~V~~v~i~~~-k~~afVef~~~e~A~~Ai~~~~~~~~~l~g~~l~v~~s~~~ 78 (481)
T TIGR01649 1 PSPVVHVRNLPQDVVEADLVEALIPFGPVSYVMMLPG-KRQALVEFEDEESAKACVNFATSVPIYIRGQPAFFNYSTSQ 78 (481)
T ss_pred CccEEEEcCCCCCCCHHHHHHHHHhcCCeeEEEEECC-CCEEEEEeCchHHHHHHHHHhhcCCceEcCeEEEEEecCCc
Confidence 5789999999999999999999999999999999986 899999999999999999864 678999999999998754
No 29
>KOG0130|consensus
Probab=99.56 E-value=9.7e-15 Score=108.97 Aligned_cols=79 Identities=27% Similarity=0.578 Sum_probs=74.0
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
....|||.++.+++||++|.+.|..||+|++|++..| .|+||+|+|++.++|++||++||+.+|.+++|.|.|+-
T Consensus 71 EGwIi~VtgvHeEatEedi~d~F~dyGeiKNihLNLDRRtGy~KGYaLvEYet~keAq~A~~~~Ng~~ll~q~v~VDw~F 150 (170)
T KOG0130|consen 71 EGWIIFVTGVHEEATEEDIHDKFADYGEIKNIHLNLDRRTGYVKGYALVEYETLKEAQAAIDALNGAELLGQNVSVDWCF 150 (170)
T ss_pred eeEEEEEeccCcchhHHHHHHHHhhcccccceeeccccccccccceeeeehHhHHHHHHHHHhccchhhhCCceeEEEEE
Confidence 4568999999999999999999999999999999998 78999999999999999999999999999999999987
Q ss_pred cCCC
Q psy2347 83 GRGR 86 (207)
Q Consensus 83 ~k~~ 86 (207)
.+.+
T Consensus 151 v~gp 154 (170)
T KOG0130|consen 151 VKGP 154 (170)
T ss_pred ecCC
Confidence 6544
No 30
>PLN03213 repressor of silencing 3; Provisional
Probab=99.56 E-value=1.4e-14 Score=127.56 Aligned_cols=79 Identities=28% Similarity=0.414 Sum_probs=73.2
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCH--HHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQ--IDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~--e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
......+|||+||++++|+++|+++|.+||.|+.|+|++. -|+||||+|.+. +++++||+.||+.+++++.|+|..|
T Consensus 6 s~~~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIpRETGRGFAFVEMssdddaEeeKAISaLNGAEWKGR~LKVNKA 85 (759)
T PLN03213 6 SGGGGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFVRTKGRSFAYIDFSPSSTNSLTKLFSTYNGCVWKGGRLRLEKA 85 (759)
T ss_pred cCCcceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEecccCCceEEEEecCCcHHHHHHHHHHhcCCeecCceeEEeec
Confidence 3445689999999999999999999999999999999977 789999999987 7899999999999999999999999
Q ss_pred ec
Q psy2347 82 RG 83 (207)
Q Consensus 82 ~~ 83 (207)
+|
T Consensus 86 KP 87 (759)
T PLN03213 86 KE 87 (759)
T ss_pred cH
Confidence 76
No 31
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.55 E-value=2.2e-14 Score=132.31 Aligned_cols=75 Identities=35% Similarity=0.512 Sum_probs=70.3
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
.+|||+|||+++||++|+++|++||+|++|+|++| +++||||+|.++++|++||+.||...|.++.|+|.|+...
T Consensus 1 ~sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~~t~~s~G~afV~F~~~~~A~~Al~~ln~~~i~gk~i~i~~s~~~ 80 (562)
T TIGR01628 1 ASLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDSVTRRSLGYGYVNFQNPADAERALETMNFKRLGGKPIRIMWSQRD 80 (562)
T ss_pred CeEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHhCCCEECCeeEEeeccccc
Confidence 37999999999999999999999999999999987 5689999999999999999999999999999999998643
No 32
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.55 E-value=3.5e-14 Score=127.64 Aligned_cols=76 Identities=28% Similarity=0.540 Sum_probs=71.5
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
..++|||.|||.++|+++|+++|++|+.|+.|.|+.+ +++||||+|.++++|++||+.||+.+|.+++|+|.+++
T Consensus 185 ~~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~~~g~~~g~afV~f~~~e~A~~A~~~l~g~~i~g~~i~v~~a~ 264 (457)
T TIGR01622 185 NFLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDPETGRSKGFGFIQFHDAEEAKEALEVMNGFELAGRPIKVGYAQ 264 (457)
T ss_pred CCCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcCCCCccceEEEEEECCHHHHHHHHHhcCCcEECCEEEEEEEcc
Confidence 3589999999999999999999999999999999976 57899999999999999999999999999999999986
Q ss_pred c
Q psy2347 83 G 83 (207)
Q Consensus 83 ~ 83 (207)
.
T Consensus 265 ~ 265 (457)
T TIGR01622 265 D 265 (457)
T ss_pred C
Confidence 4
No 33
>KOG0117|consensus
Probab=99.54 E-value=4e-14 Score=123.28 Aligned_cols=78 Identities=31% Similarity=0.517 Sum_probs=72.0
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCcee-CCcEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDL-MGSKLRV 78 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i-~g~~l~V 78 (207)
..+..+.|||+.||.++.|+||..+|++.|+|-+++||.| +|+||||+|.++++|++||+.||+++| .++.|.|
T Consensus 79 ~p~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~~sG~nRGYAFVtf~~Ke~Aq~Aik~lnn~Eir~GK~igv 158 (506)
T KOG0117|consen 79 PPPRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDPFSGDNRGYAFVTFCTKEEAQEAIKELNNYEIRPGKLLGV 158 (506)
T ss_pred CCCCCceEEecCCCccccchhhHHHHHhccceeeEEEeecccCCCCcceEEEEeecHHHHHHHHHHhhCccccCCCEeEE
Confidence 3466899999999999999999999999999999999998 899999999999999999999999998 4788888
Q ss_pred EEee
Q psy2347 79 EISR 82 (207)
Q Consensus 79 ~~a~ 82 (207)
+++.
T Consensus 159 c~Sv 162 (506)
T KOG0117|consen 159 CVSV 162 (506)
T ss_pred EEee
Confidence 7764
No 34
>KOG0144|consensus
Probab=99.53 E-value=9.3e-15 Score=126.71 Aligned_cols=81 Identities=22% Similarity=0.426 Sum_probs=73.8
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCce-e--CCcEEEEEE
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQD-L--MGSKLRVEI 80 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~-i--~g~~l~V~~ 80 (207)
..++|||+.|+..+||+||+++|++||.|++|.|++| +|+||||+|+++|.|..||+.||+.. + +..+|.|+|
T Consensus 123 ~e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd~~~~sRGcaFV~fstke~A~~Aika~ng~~tmeGcs~PLVVkF 202 (510)
T KOG0144|consen 123 EERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRDPDGLSRGCAFVKFSTKEMAVAAIKALNGTQTMEGCSQPLVVKF 202 (510)
T ss_pred cchhhhhhhccccccHHHHHHHHHhhCccchhhheecccccccceeEEEEehHHHHHHHHHhhccceeeccCCCceEEEe
Confidence 4789999999999999999999999999999999998 89999999999999999999999864 4 457899999
Q ss_pred eecCCCCC
Q psy2347 81 SRGRGRGR 88 (207)
Q Consensus 81 a~~k~~~~ 88 (207)
|.++.++.
T Consensus 203 ADtqkdk~ 210 (510)
T KOG0144|consen 203 ADTQKDKD 210 (510)
T ss_pred cccCCCch
Confidence 99876653
No 35
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.53 E-value=5.9e-14 Score=126.13 Aligned_cols=78 Identities=27% Similarity=0.334 Sum_probs=71.5
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
..+.++|||.|||.++|+++|+++|++|++|+.|+|++| +++||||+|.++++|++||+ |++..|.+++|.|+.
T Consensus 86 ~~~~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~~~~~skg~afVeF~~~e~A~~Al~-l~g~~~~g~~i~v~~ 164 (457)
T TIGR01622 86 ERDDRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDRNSRRSKGVAYVEFYDVESVIKALA-LTGQMLLGRPIIVQS 164 (457)
T ss_pred ccCCcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecCCCCCcceEEEEEECCHHHHHHHHH-hCCCEECCeeeEEee
Confidence 345789999999999999999999999999999999987 68999999999999999997 899999999999988
Q ss_pred eecC
Q psy2347 81 SRGR 84 (207)
Q Consensus 81 a~~k 84 (207)
+..+
T Consensus 165 ~~~~ 168 (457)
T TIGR01622 165 SQAE 168 (457)
T ss_pred cchh
Confidence 7543
No 36
>KOG0109|consensus
Probab=99.52 E-value=1.5e-14 Score=120.12 Aligned_cols=73 Identities=25% Similarity=0.508 Sum_probs=69.7
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecCC
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGRG 85 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k~ 85 (207)
.+|||+|||.++++.+|+.+|++|++|++|+|+++ |+||..+++++|+.||..||+.+|++..|+|+.++.|.
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlECDIvKN---YgFVHiEdktaaedairNLhgYtLhg~nInVeaSksKs 75 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFEQYGKVLECDIVKN---YGFVHIEDKTAAEDAIRNLHGYTLHGVNINVEASKSKS 75 (346)
T ss_pred cchhccCCCcccchHHHHHHHHhhCceEeeeeecc---cceEEeecccccHHHHhhcccceecceEEEEEeccccC
Confidence 48999999999999999999999999999999976 99999999999999999999999999999999998773
No 37
>smart00360 RRM RNA recognition motif.
Probab=99.52 E-value=9.2e-14 Score=91.35 Aligned_cols=66 Identities=44% Similarity=0.710 Sum_probs=61.2
Q ss_pred EcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 14 VGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 14 V~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
|.+||.++++++|+++|++|+.|+.|.|+.+ ++++|||+|.+.++|++|++.|++..+.+++|+|+
T Consensus 1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~~~~~~~~~a~v~f~~~~~a~~a~~~~~~~~~~~~~~~v~ 71 (71)
T smart00360 1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDKDTGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKVK 71 (71)
T ss_pred CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCCCCCCCCceEEEEeCCHHHHHHHHHHcCCCeeCCcEEEeC
Confidence 5799999999999999999999999999886 36899999999999999999999999999998874
No 38
>KOG0111|consensus
Probab=99.52 E-value=1e-14 Score=116.95 Aligned_cols=82 Identities=34% Similarity=0.661 Sum_probs=76.6
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
.+....++|||+.|..++||..|...|-+||.|+.|+|+.| .++|+||||+..|+|.+||..||..+|.++.|+|
T Consensus 5 ~~a~~KrtlYVGGladeVtekvLhaAFIPFGDI~dIqiPlDyesqkHRgFgFVefe~aEDAaaAiDNMnesEL~GrtirV 84 (298)
T KOG0111|consen 5 QMANQKRTLYVGGLADEVTEKVLHAAFIPFGDIKDIQIPLDYESQKHRGFGFVEFEEAEDAAAAIDNMNESELFGRTIRV 84 (298)
T ss_pred cccccceeEEeccchHHHHHHHHHhccccccchhhcccccchhcccccceeEEEeeccchhHHHhhcCchhhhcceeEEE
Confidence 34556789999999999999999999999999999999998 7899999999999999999999999999999999
Q ss_pred EEeecCC
Q psy2347 79 EISRGRG 85 (207)
Q Consensus 79 ~~a~~k~ 85 (207)
.+|+|..
T Consensus 85 N~AkP~k 91 (298)
T KOG0111|consen 85 NLAKPEK 91 (298)
T ss_pred eecCCcc
Confidence 9998743
No 39
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.51 E-value=1.5e-13 Score=91.13 Aligned_cols=70 Identities=46% Similarity=0.737 Sum_probs=65.5
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCC----CcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNP----PGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~----rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
+|+|.+||+.+++++|+++|+.+++|..+.++.+. +++|||+|.++++|+.|++.++...+.+++|+|++
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~~~~~~~~~~~~v~f~s~~~a~~a~~~~~~~~~~~~~~~v~~ 74 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVRDKDTKSKGFAFVEFEDEEDAEKALEALNGKELGGRPLRVEF 74 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEeeCCCCCcceEEEEEECCHHHHHHHHHHhCCCeECCeEEEEeC
Confidence 58999999999999999999999999999999873 78999999999999999999999999999999864
No 40
>KOG0131|consensus
Probab=99.50 E-value=4.7e-14 Score=110.09 Aligned_cols=80 Identities=28% Similarity=0.377 Sum_probs=75.2
Q ss_pred CCCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 3 DGMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 3 ~~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
....+...+|||+||++.++++.|.++|-+.++|++|+|++| .++||||||.++|+|+-||+.||...|.+++|+
T Consensus 3 ~~~rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~Ef~~eedadYAikiln~VkLYgrpIr 82 (203)
T KOG0131|consen 3 IIERNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFAEFRTEEDADYAIKILNMVKLYGRPIR 82 (203)
T ss_pred ccccCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhhhcccccceeEEEEechhhhHHHHHHHHHHHhcCceeE
Confidence 345677889999999999999999999999999999999988 789999999999999999999999999999999
Q ss_pred EEEee
Q psy2347 78 VEISR 82 (207)
Q Consensus 78 V~~a~ 82 (207)
|..+.
T Consensus 83 v~kas 87 (203)
T KOG0131|consen 83 VNKAS 87 (203)
T ss_pred EEecc
Confidence 99987
No 41
>KOG0114|consensus
Probab=99.48 E-value=2.1e-13 Score=97.48 Aligned_cols=77 Identities=23% Similarity=0.392 Sum_probs=71.6
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
+.++.|||.|||.++|.+++.++|.+|+.|..|+|-.+ .++.|||.|++..+|++|++.|++..++++.|.|-+-.+
T Consensus 16 evnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k~TrGTAFVVYedi~dAk~A~dhlsg~n~~~ryl~vlyyq~ 94 (124)
T KOG0114|consen 16 EVNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTKETRGTAFVVYEDIFDAKKACDHLSGYNVDNRYLVVLYYQP 94 (124)
T ss_pred hhheeEEEecCCccccHHHHHHHhhcccceEEEEecCccCcCceEEEEehHhhhHHHHHHHhcccccCCceEEEEecCH
Confidence 45678999999999999999999999999999999766 789999999999999999999999999999999987654
No 42
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.48 E-value=2.9e-13 Score=110.72 Aligned_cols=74 Identities=38% Similarity=0.634 Sum_probs=70.6
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
.++|||+|||+++|+++|+++|.+|+.|..|.|+.+ +++||||+|+++++|++|++.|++..|.+++|.|.++.
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~f~~~~~~~~a~~~~~~~~~~~~~~~v~~~~ 193 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVEFESEESAEKAIEELNGKELEGRPLRVQKAQ 193 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeeccccCccCceEEEEecCHHHHHHHHHHcCCCeECCceeEeeccc
Confidence 589999999999999999999999999999998877 78999999999999999999999999999999999965
No 43
>KOG0148|consensus
Probab=99.48 E-value=9.3e-14 Score=114.38 Aligned_cols=78 Identities=31% Similarity=0.476 Sum_probs=74.3
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
-..|||+.|..+++.++|++.|.+||+|.+++|++| +|+|+||.|.++++|+.||..||+++|..|.|+-.||+.
T Consensus 62 hfhvfvgdls~eI~~e~lr~aF~pFGevS~akvirD~~T~KsKGYgFVSf~~k~dAEnAI~~MnGqWlG~R~IRTNWATR 141 (321)
T KOG0148|consen 62 HFHVFVGDLSPEIDNEKLREAFAPFGEVSDAKVIRDMNTGKSKGYGFVSFPNKEDAENAIQQMNGQWLGRRTIRTNWATR 141 (321)
T ss_pred ceeEEehhcchhcchHHHHHHhccccccccceEeecccCCcccceeEEeccchHHHHHHHHHhCCeeeccceeecccccc
Confidence 457999999999999999999999999999999999 899999999999999999999999999999999999988
Q ss_pred CCC
Q psy2347 84 RGR 86 (207)
Q Consensus 84 k~~ 86 (207)
|+.
T Consensus 142 Kp~ 144 (321)
T KOG0148|consen 142 KPS 144 (321)
T ss_pred Ccc
Confidence 773
No 44
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=99.47 E-value=3.1e-13 Score=87.21 Aligned_cols=56 Identities=34% Similarity=0.557 Sum_probs=51.8
Q ss_pred HHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 26 LELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 26 L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
|+++|++||+|++|.+..+.+.+|||+|++.++|++|++.||+..+.+++|+|++|
T Consensus 1 L~~~f~~fG~V~~i~~~~~~~~~a~V~f~~~~~A~~a~~~l~~~~~~g~~l~V~~a 56 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKKKRGFAFVEFASVEDAQKAIEQLNGRQFNGRPLKVSYA 56 (56)
T ss_dssp HHHHHTTTS-EEEEEEETTSTTEEEEEESSHHHHHHHHHHHTTSEETTEEEEEEEE
T ss_pred ChHHhCCcccEEEEEEEeCCCCEEEEEECCHHHHHHHHHHhCCCEECCcEEEEEEC
Confidence 78999999999999998875599999999999999999999999999999999986
No 45
>KOG4212|consensus
Probab=99.46 E-value=3.3e-13 Score=117.45 Aligned_cols=79 Identities=22% Similarity=0.365 Sum_probs=72.0
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHh-CCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEK-FGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~-~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
....+.+||.|||+++.|++||++|.+ .++|+.|+++.| +++||.|||+++|.++||++.||.+++++++|+|+.
T Consensus 41 ~~r~R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D~~GK~rGcavVEFk~~E~~qKa~E~lnk~~~~GR~l~vKE 120 (608)
T KOG4212|consen 41 AARDRSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFDESGKARGCAVVEFKDPENVQKALEKLNKYEVNGRELVVKE 120 (608)
T ss_pred ccccceEEEecCcchhhhHhHHHHHHHhcCceEeeeeecccCCCcCCceEEEeeCHHHHHHHHHHhhhccccCceEEEec
Confidence 345567999999999999999999974 899999999998 899999999999999999999999999999999987
Q ss_pred eecC
Q psy2347 81 SRGR 84 (207)
Q Consensus 81 a~~k 84 (207)
....
T Consensus 121 d~d~ 124 (608)
T KOG4212|consen 121 DHDE 124 (608)
T ss_pred cCch
Confidence 7553
No 46
>KOG0144|consensus
Probab=99.46 E-value=1.2e-13 Score=119.82 Aligned_cols=81 Identities=23% Similarity=0.468 Sum_probs=71.4
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCce-e--CCcEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQD-L--MGSKL 76 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~-i--~g~~l 76 (207)
...+.-+|||+-||..++|+||+++|++|+.|.+|.|++| +++||||+|.+.++|.+|+.+||+++ | ...+|
T Consensus 30 ~d~~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk~t~~s~gcCFv~~~trk~a~~a~~Alhn~ktlpG~~~pv 109 (510)
T KOG0144|consen 30 PDGSAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDKSTGQSKGCCFVKYYTRKEADEAINALHNQKTLPGMHHPV 109 (510)
T ss_pred CCchhhhheeccCCccccHHHHHHHHHHhCceeEEEeecccccCcccceEEEEeccHHHHHHHHHHhhcccccCCCCcce
Confidence 3345679999999999999999999999999999999999 89999999999999999999999865 4 35778
Q ss_pred EEEEeecCC
Q psy2347 77 RVEISRGRG 85 (207)
Q Consensus 77 ~V~~a~~k~ 85 (207)
.|++|....
T Consensus 110 qvk~Ad~E~ 118 (510)
T KOG0144|consen 110 QVKYADGER 118 (510)
T ss_pred eecccchhh
Confidence 888886543
No 47
>KOG0109|consensus
Probab=99.41 E-value=3.3e-13 Score=112.15 Aligned_cols=82 Identities=27% Similarity=0.505 Sum_probs=76.0
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecCC
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGRG 85 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k~ 85 (207)
...+++|+|+||.+.++.+||++.|++|++|.+|+|++| ++||.|+-.++|..|++.|++.++.+++|+|++++++-
T Consensus 75 sk~stkl~vgNis~tctn~ElRa~fe~ygpviecdivkd---y~fvh~d~~eda~~air~l~~~~~~gk~m~vq~stsrl 151 (346)
T KOG0109|consen 75 SKASTKLHVGNISPTCTNQELRAKFEKYGPVIECDIVKD---YAFVHFDRAEDAVEAIRGLDNTEFQGKRMHVQLSTSRL 151 (346)
T ss_pred CCCccccccCCCCccccCHHHhhhhcccCCceeeeeecc---eeEEEEeeccchHHHHhcccccccccceeeeeeecccc
Confidence 456789999999999999999999999999999999977 99999999999999999999999999999999999877
Q ss_pred CCCCC
Q psy2347 86 RGRGG 90 (207)
Q Consensus 86 ~~~~~ 90 (207)
+....
T Consensus 152 rtapg 156 (346)
T KOG0109|consen 152 RTAPG 156 (346)
T ss_pred ccCCC
Confidence 65443
No 48
>KOG0145|consensus
Probab=99.41 E-value=8.3e-13 Score=108.44 Aligned_cols=82 Identities=23% Similarity=0.392 Sum_probs=75.3
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
.+..+.|.|--||..+|++||+.+|...|+|++|++++| +.+|.||.|.++++|++||..||+..|..+.|+|++
T Consensus 38 ~~skTNLIvNYLPQ~MTqdE~rSLF~SiGeiEScKLvRDKitGqSLGYGFVNYv~p~DAe~AintlNGLrLQ~KTIKVSy 117 (360)
T KOG0145|consen 38 DESKTNLIVNYLPQNMTQDELRSLFGSIGEIESCKLVRDKITGQSLGYGFVNYVRPKDAEKAINTLNGLRLQNKTIKVSY 117 (360)
T ss_pred CcccceeeeeecccccCHHHHHHHhhcccceeeeeeeeccccccccccceeeecChHHHHHHHhhhcceeeccceEEEEe
Confidence 344567888889999999999999999999999999999 779999999999999999999999999999999999
Q ss_pred eecCCCC
Q psy2347 81 SRGRGRG 87 (207)
Q Consensus 81 a~~k~~~ 87 (207)
|+|....
T Consensus 118 ARPSs~~ 124 (360)
T KOG0145|consen 118 ARPSSDS 124 (360)
T ss_pred ccCChhh
Confidence 9986543
No 49
>KOG0146|consensus
Probab=99.41 E-value=2.3e-13 Score=112.11 Aligned_cols=82 Identities=27% Similarity=0.423 Sum_probs=77.6
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
++.|.|||-.||.+..+.||..+|..||.|++.+|..| +|+|.||.|+++.+|++||.+||+..|.-|+|+|.+.
T Consensus 283 PeGCNlFIYHLPQEFgDaEliQmF~PFGhivSaKVFvDRATNQSKCFGFVSfDNp~SaQaAIqAMNGFQIGMKRLKVQLK 362 (371)
T KOG0146|consen 283 PEGCNLFIYHLPQEFGDAELIQMFLPFGHIVSAKVFVDRATNQSKCFGFVSFDNPASAQAAIQAMNGFQIGMKRLKVQLK 362 (371)
T ss_pred CCcceEEEEeCchhhccHHHHHHhccccceeeeeeeehhccccccceeeEecCCchhHHHHHHHhcchhhhhhhhhhhhc
Confidence 57899999999999999999999999999999999888 8999999999999999999999999999999999999
Q ss_pred ecCCCCC
Q psy2347 82 RGRGRGR 88 (207)
Q Consensus 82 ~~k~~~~ 88 (207)
+||...+
T Consensus 363 RPkdanR 369 (371)
T KOG0146|consen 363 RPKDANR 369 (371)
T ss_pred CccccCC
Confidence 9987653
No 50
>KOG0415|consensus
Probab=99.40 E-value=3.8e-13 Score=114.41 Aligned_cols=80 Identities=28% Similarity=0.424 Sum_probs=75.2
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
.+.+|...|||..|.+-+|+++|+-+|+.||+|+.|+|+.| +..||||+|++++++++|.-+|++..|+.++|.|
T Consensus 234 d~~PPeNVLFVCKLNPVTtDeDLeiIFSrFG~i~sceVIRD~ktgdsLqyaFiEFen~escE~AyFKMdNvLIDDrRIHV 313 (479)
T KOG0415|consen 234 DVKPPENVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDRKTGDSLQYAFIEFENKESCEQAYFKMDNVLIDDRRIHV 313 (479)
T ss_pred ccCCCcceEEEEecCCcccccchhhHHhhcccceeeeEEecccccchhheeeeeecchhhHHHHHhhhcceeeccceEEe
Confidence 45678899999999999999999999999999999999998 6689999999999999999999999999999999
Q ss_pred EEeec
Q psy2347 79 EISRG 83 (207)
Q Consensus 79 ~~a~~ 83 (207)
.++.+
T Consensus 314 DFSQS 318 (479)
T KOG0415|consen 314 DFSQS 318 (479)
T ss_pred ehhhh
Confidence 98764
No 51
>KOG0127|consensus
Probab=99.40 E-value=1e-12 Score=117.05 Aligned_cols=79 Identities=29% Similarity=0.416 Sum_probs=73.8
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
.+-.+|.|.|||+.|.+++|+.+|+.||.|.+|.|++. .++||||+|.+..+|.+||+.+|.++|.+++|.|.||-
T Consensus 115 ~~k~rLIIRNLPf~~k~~dLk~vFs~~G~V~Ei~IP~k~dgklcGFaFV~fk~~~dA~~Al~~~N~~~i~gR~VAVDWAV 194 (678)
T KOG0127|consen 115 LPKWRLIIRNLPFKCKKPDLKNVFSNFGKVVEIVIPRKKDGKLCGFAFVQFKEKKDAEKALEFFNGNKIDGRPVAVDWAV 194 (678)
T ss_pred CccceEEeecCCcccCcHHHHHHHhhcceEEEEEcccCCCCCccceEEEEEeeHHHHHHHHHhccCceecCceeEEeeec
Confidence 34679999999999999999999999999999999976 56999999999999999999999999999999999998
Q ss_pred cCC
Q psy2347 83 GRG 85 (207)
Q Consensus 83 ~k~ 85 (207)
++.
T Consensus 195 ~Kd 197 (678)
T KOG0127|consen 195 DKD 197 (678)
T ss_pred ccc
Confidence 764
No 52
>KOG0108|consensus
Probab=99.39 E-value=9.3e-13 Score=117.04 Aligned_cols=77 Identities=31% Similarity=0.541 Sum_probs=73.6
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
++|||+|||++++|++|.++|+..+.|..++++.| +++|+|++|.++++|++|++.||+.++.+++|+|.++...
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~~~~~~~~~~a~~~lNg~~~~gr~l~v~~~~~~ 98 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCEFTDEETAERAIRNLNGAEFNGRKLRVNYASNR 98 (435)
T ss_pred cceEecCCCCcccHHHHHHHHhccCccceeeecccccCCCcCceeeEecCchhhHHHHHHhcCCcccCCceEEeeccccc
Confidence 89999999999999999999999999999999999 8999999999999999999999999999999999998765
Q ss_pred CC
Q psy2347 85 GR 86 (207)
Q Consensus 85 ~~ 86 (207)
..
T Consensus 99 ~~ 100 (435)
T KOG0108|consen 99 KN 100 (435)
T ss_pred ch
Confidence 44
No 53
>KOG0116|consensus
Probab=99.39 E-value=3.6e-12 Score=112.75 Aligned_cols=79 Identities=30% Similarity=0.489 Sum_probs=68.8
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
...+|||.|||.++++++|+++|.+||+|++..|... ..+|+||+|++.++++.||++ +...|.+++|.|+..+
T Consensus 287 ~~~~i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~~~~~~~~fgFV~f~~~~~~~~~i~A-sp~~ig~~kl~Veek~ 365 (419)
T KOG0116|consen 287 DGLGIFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRSPGGKNPCFGFVEFENAAAVQNAIEA-SPLEIGGRKLNVEEKR 365 (419)
T ss_pred cccceEeecCCCCCCHHHHHHHHhhcccccccceEEeccCCCcCceEEEEEeecchhhhhhhc-CccccCCeeEEEEecc
Confidence 3456999999999999999999999999998877653 238999999999999999997 6888999999999887
Q ss_pred cCCCC
Q psy2347 83 GRGRG 87 (207)
Q Consensus 83 ~k~~~ 87 (207)
+..+.
T Consensus 366 ~~~~g 370 (419)
T KOG0116|consen 366 PGFRG 370 (419)
T ss_pred ccccc
Confidence 75544
No 54
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.37 E-value=2.2e-12 Score=117.41 Aligned_cols=76 Identities=22% Similarity=0.334 Sum_probs=63.3
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhC------------CCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeC
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKF------------GKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLM 72 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~------------G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~ 72 (207)
+....++|||+|||+++|+++|+++|.++ ..|..+.+..+ ++||||+|.+.++|++||+ |+++.|.
T Consensus 171 ~~~~~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~~~~-kg~afVeF~~~e~A~~Al~-l~g~~~~ 248 (509)
T TIGR01642 171 ATRQARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNINKE-KNFAFLEFRTVEEATFAMA-LDSIIYS 248 (509)
T ss_pred CCccccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEECCC-CCEEEEEeCCHHHHhhhhc-CCCeEee
Confidence 34457899999999999999999999975 23455555544 8999999999999999995 9999999
Q ss_pred CcEEEEEEee
Q psy2347 73 GSKLRVEISR 82 (207)
Q Consensus 73 g~~l~V~~a~ 82 (207)
++.|+|...+
T Consensus 249 g~~l~v~r~~ 258 (509)
T TIGR01642 249 NVFLKIRRPH 258 (509)
T ss_pred CceeEecCcc
Confidence 9999987543
No 55
>KOG0145|consensus
Probab=99.37 E-value=2.8e-12 Score=105.36 Aligned_cols=77 Identities=27% Similarity=0.337 Sum_probs=73.4
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
...|||-||.++++|..|+++|.+||.|+.|+|++| .|+|+||++.+.++|..||..||+..|.++.|.|.+.+.
T Consensus 278 g~ciFvYNLspd~de~~LWQlFgpFGAv~nVKvirD~ttnkCKGfgFVtMtNYdEAamAi~sLNGy~lg~rvLQVsFKtn 357 (360)
T KOG0145|consen 278 GWCIFVYNLSPDADESILWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVTMTNYDEAAMAIASLNGYRLGDRVLQVSFKTN 357 (360)
T ss_pred eeEEEEEecCCCchHhHHHHHhCcccceeeEEEEecCCcccccceeEEEecchHHHHHHHHHhcCccccceEEEEEEecC
Confidence 578999999999999999999999999999999999 789999999999999999999999999999999999877
Q ss_pred CC
Q psy2347 84 RG 85 (207)
Q Consensus 84 k~ 85 (207)
|.
T Consensus 358 k~ 359 (360)
T KOG0145|consen 358 KA 359 (360)
T ss_pred CC
Confidence 64
No 56
>KOG0126|consensus
Probab=99.33 E-value=1.7e-13 Score=107.07 Aligned_cols=76 Identities=24% Similarity=0.512 Sum_probs=71.5
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
..+.-|||++||.++||.||..+|++||+|++|.+++| +++|||+.|++..+...|+..||+..|.++.|+|...
T Consensus 33 kdsA~Iyiggl~~~LtEgDil~VFSqyGe~vdinLiRDk~TGKSKGFaFLcYEDQRSTILAVDN~NGiki~gRtirVDHv 112 (219)
T KOG0126|consen 33 KDSAYIYIGGLPYELTEGDILCVFSQYGEIVDINLIRDKKTGKSKGFAFLCYEDQRSTILAVDNLNGIKILGRTIRVDHV 112 (219)
T ss_pred ccceEEEECCCcccccCCcEEEEeeccCceEEEEEEecCCCCcccceEEEEecCccceEEEEeccCCceecceeEEeeec
Confidence 45678999999999999999999999999999999999 8999999999999999999999999999999999865
Q ss_pred e
Q psy2347 82 R 82 (207)
Q Consensus 82 ~ 82 (207)
.
T Consensus 113 ~ 113 (219)
T KOG0126|consen 113 S 113 (219)
T ss_pred c
Confidence 3
No 57
>KOG0127|consensus
Probab=99.32 E-value=6.9e-12 Score=111.81 Aligned_cols=78 Identities=29% Similarity=0.466 Sum_probs=70.0
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcC-----C-ceeCCcEE
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMN-----D-QDLMGSKL 76 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~-----g-~~i~g~~l 76 (207)
...+|||.|||+++||++|.++|++||+|..+.|+.+ ++++|||.|.+..+|++||+... + ..|.++.|
T Consensus 291 ~~~tVFvRNL~fD~tEEel~~~fskFG~v~ya~iV~~k~T~~skGtAFv~Fkt~~~~~~ci~~Aspa~e~g~~ll~GR~L 370 (678)
T KOG0127|consen 291 EGKTVFVRNLPFDTTEEELKEHFSKFGEVKYAIIVKDKDTGHSKGTAFVKFKTQIAAQNCIEAASPASEDGSVLLDGRLL 370 (678)
T ss_pred ccceEEEecCCccccHHHHHHHHHhhccceeEEEEeccCCCCcccceEEEeccHHHHHHHHHhcCccCCCceEEEeccEE
Confidence 4589999999999999999999999999999999887 89999999999999999998752 2 56899999
Q ss_pred EEEEeecCC
Q psy2347 77 RVEISRGRG 85 (207)
Q Consensus 77 ~V~~a~~k~ 85 (207)
.|.+|-++.
T Consensus 371 kv~~Av~Rk 379 (678)
T KOG0127|consen 371 KVTLAVTRK 379 (678)
T ss_pred eeeeccchH
Confidence 999987653
No 58
>KOG0123|consensus
Probab=99.31 E-value=6.1e-12 Score=110.45 Aligned_cols=76 Identities=28% Similarity=0.459 Sum_probs=71.7
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecCCCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGRGRG 87 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k~~~ 87 (207)
.|||.||+++++.++|.++|+.||+|++|+|+++ ++++ ||+|+++++|++|++.||+..+.+++|.|.+..++..+
T Consensus 78 ~~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~~~g~kg~-FV~f~~e~~a~~ai~~~ng~ll~~kki~vg~~~~~~er 156 (369)
T KOG0123|consen 78 LVFIKNLDESIDNKSLYDTFSEFGNILSCKVATDENGSKGY-FVQFESEESAKKAIEKLNGMLLNGKKIYVGLFERKEER 156 (369)
T ss_pred eeeecCCCcccCcHHHHHHHHhhcCeeEEEEEEcCCCceee-EEEeCCHHHHHHHHHHhcCcccCCCeeEEeeccchhhh
Confidence 3999999999999999999999999999999999 8999 99999999999999999999999999999988776554
No 59
>KOG0132|consensus
Probab=99.31 E-value=5.2e-12 Score=116.26 Aligned_cols=80 Identities=25% Similarity=0.512 Sum_probs=73.9
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
++--++||||+.|+..++|.||..+|+.||+|++|.++.. ++||||++...++|++||.+|+.+.+..+.|+|.|+..+
T Consensus 417 isV~SrTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~~-R~cAfI~M~~RqdA~kalqkl~n~kv~~k~Iki~Wa~g~ 495 (894)
T KOG0132|consen 417 ISVCSRTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIPP-RGCAFIKMVRRQDAEKALQKLSNVKVADKTIKIAWAVGK 495 (894)
T ss_pred eeEeeeeeeeccccchhhHHHHHHHHHhcccceeEeeccC-CceeEEEEeehhHHHHHHHHHhcccccceeeEEeeeccC
Confidence 3445789999999999999999999999999999988876 999999999999999999999999999999999999755
Q ss_pred C
Q psy2347 85 G 85 (207)
Q Consensus 85 ~ 85 (207)
.
T Consensus 496 G 496 (894)
T KOG0132|consen 496 G 496 (894)
T ss_pred C
Confidence 3
No 60
>KOG4206|consensus
Probab=99.29 E-value=1.3e-11 Score=99.41 Aligned_cols=82 Identities=24% Similarity=0.410 Sum_probs=74.5
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHH----HHHhCCCeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLEL----EFEKFGKLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~----~F~~~G~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
..+++..||||.||++.+..++|++ +|++||.|..|.+.+. .++-|||.|.+.++|..|+.+|++..+.++.|+
T Consensus 4 ~~~~pn~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~kt~KmRGQA~VvFk~~~~As~A~r~l~gfpFygK~mr 83 (221)
T KOG4206|consen 4 MSVNPNGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAFKTPKMRGQAFVVFKETEAASAALRALQGFPFYGKPMR 83 (221)
T ss_pred cccCCCceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEecCCCCccCceEEEecChhHHHHHHHHhcCCcccCchhh
Confidence 4456667999999999999999888 9999999999988766 789999999999999999999999999999999
Q ss_pred EEEeecCC
Q psy2347 78 VEISRGRG 85 (207)
Q Consensus 78 V~~a~~k~ 85 (207)
|++|+.+.
T Consensus 84 iqyA~s~s 91 (221)
T KOG4206|consen 84 IQYAKSDS 91 (221)
T ss_pred eecccCcc
Confidence 99997654
No 61
>smart00361 RRM_1 RNA recognition motif.
Probab=99.28 E-value=1.7e-11 Score=82.78 Aligned_cols=57 Identities=26% Similarity=0.476 Sum_probs=50.3
Q ss_pred HHHHHHHHH----hCCCeeEEE-EeeC-------CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 23 KEDLELEFE----KFGKLNSVW-VAFN-------PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 23 e~~L~~~F~----~~G~V~~v~-i~~d-------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
+++|+++|+ +||+|.+|. |+.+ +++|+||+|+++++|++|++.||+..+.+++|++.
T Consensus 2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~f~~~~dA~~A~~~l~g~~~~gr~l~~~ 70 (70)
T smart00361 2 DEDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYITFERSEDAARAIVDLNGRYFDGRTVKAE 70 (70)
T ss_pred chhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEEECCHHHHHHHHHHhCCCEECCEEEEeC
Confidence 678888888 999999985 4443 58999999999999999999999999999999863
No 62
>KOG0153|consensus
Probab=99.27 E-value=1.8e-11 Score=103.97 Aligned_cols=76 Identities=28% Similarity=0.507 Sum_probs=68.7
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc-CCceeCCcEEEEEEeec
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM-NDQDLMGSKLRVEISRG 83 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l-~g~~i~g~~l~V~~a~~ 83 (207)
...++|||++|...++|.+|+++|.+||+|+.|.|+.. +.+|||+|.+.++|++|.+++ +...|++++|+|.|.++
T Consensus 226 ~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~~~-~~CAFv~ftTR~aAE~Aae~~~n~lvI~G~Rl~i~Wg~~ 302 (377)
T KOG0153|consen 226 TSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRILPR-KGCAFVTFTTREAAEKAAEKSFNKLVINGFRLKIKWGRP 302 (377)
T ss_pred cceeEEEecccccchhHHHHHHHHhhcCCeeeEEeecc-cccceeeehhhHHHHHHHHhhcceeeecceEEEEEeCCC
Confidence 34679999999999999999999999999999999887 779999999999999998864 44568999999999988
No 63
>KOG0131|consensus
Probab=99.24 E-value=2.2e-11 Score=95.25 Aligned_cols=85 Identities=27% Similarity=0.407 Sum_probs=74.8
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeE-EEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNS-VWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~-v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
.++...+|||+||.++++|..|.++|+.||.|.. -+|+.+ +++|+||.|++.|++.+|++.||++.++.++|+|
T Consensus 92 nl~vganlfvgNLd~~vDe~~L~dtFsafG~l~~~P~i~rd~~tg~~~~~g~i~~~sfeasd~ai~s~ngq~l~nr~itv 171 (203)
T KOG0131|consen 92 NLDVGANLFVGNLDPEVDEKLLYDTFSAFGVLISPPKIMRDPDTGNPKGFGFINYASFEASDAAIGSMNGQYLCNRPITV 171 (203)
T ss_pred cccccccccccccCcchhHHHHHHHHHhccccccCCcccccccCCCCCCCeEEechhHHHHHHHHHHhccchhcCCceEE
Confidence 3455689999999999999999999999999765 366666 7889999999999999999999999999999999
Q ss_pred EEeecCCCCCC
Q psy2347 79 EISRGRGRGRG 89 (207)
Q Consensus 79 ~~a~~k~~~~~ 89 (207)
+++..+..+..
T Consensus 172 ~ya~k~~~kg~ 182 (203)
T KOG0131|consen 172 SYAFKKDTKGE 182 (203)
T ss_pred EEEEecCCCcc
Confidence 99988766543
No 64
>KOG4205|consensus
Probab=99.23 E-value=1.1e-11 Score=106.02 Aligned_cols=82 Identities=28% Similarity=0.525 Sum_probs=75.1
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
..++|||+.||.++++++|+++|++|+.|..+.|+.| +++|+||+|++++++++++.+ +-++|++++|+|+.|.
T Consensus 96 ~tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~~~~~~d~~~~~~rgFgfv~~~~e~sVdkv~~~-~f~~~~gk~vevkrA~ 174 (311)
T KOG4205|consen 96 RTKKIFVGGLPPDTTEEDFKDYFEQFGKVADVVIMYDKTTSRPRGFGFVTFDSEDSVDKVTLQ-KFHDFNGKKVEVKRAI 174 (311)
T ss_pred ceeEEEecCcCCCCchHHHhhhhhccceeEeeEEeecccccccccceeeEeccccccceeccc-ceeeecCceeeEeecc
Confidence 3569999999999999999999999999999999988 789999999999999999975 8899999999999999
Q ss_pred cCCCCCCC
Q psy2347 83 GRGRGRGG 90 (207)
Q Consensus 83 ~k~~~~~~ 90 (207)
||+.....
T Consensus 175 pk~~~~~~ 182 (311)
T KOG4205|consen 175 PKEVMQST 182 (311)
T ss_pred chhhcccc
Confidence 98766543
No 65
>KOG4205|consensus
Probab=99.23 E-value=8.6e-12 Score=106.66 Aligned_cols=79 Identities=23% Similarity=0.440 Sum_probs=73.2
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
+.++|||+.|+|+++|+.|+++|.+|++|.+|.||+| +++|.||+|++++.+.++|.. ..++|+++.|.++.|.
T Consensus 5 ~~~KlfiGgisw~ttee~Lr~yf~~~Gev~d~~vm~d~~t~rsrgFgfv~f~~~~~v~~vl~~-~~h~~dgr~ve~k~av 83 (311)
T KOG4205|consen 5 ESGKLFIGGLSWETTEESLREYFSQFGEVTDCVVMRDPSTGRSRGFGFVTFATPEGVDAVLNA-RTHKLDGRSVEPKRAV 83 (311)
T ss_pred CCcceeecCcCccccHHHHHHHhcccCceeeEEEeccCCCCCcccccceecCCCcchheeecc-cccccCCccccceecc
Confidence 7899999999999999999999999999999999998 789999999999999999875 6789999999999998
Q ss_pred cCCCC
Q psy2347 83 GRGRG 87 (207)
Q Consensus 83 ~k~~~ 87 (207)
++...
T Consensus 84 ~r~~~ 88 (311)
T KOG4205|consen 84 SREDQ 88 (311)
T ss_pred Ccccc
Confidence 87644
No 66
>KOG4212|consensus
Probab=99.23 E-value=2.2e-11 Score=106.22 Aligned_cols=78 Identities=23% Similarity=0.290 Sum_probs=70.6
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
.+..+.|+|||.|||.++||+.|++-|.+|+.|+.++||..-+....|.|.++++|+.||..|++..|+++.|+|.+.
T Consensus 531 gaarKa~qIiirNlP~dfTWqmlrDKfre~G~v~yadime~GkskGVVrF~s~edAEra~a~Mngs~l~Gr~I~V~y~ 608 (608)
T KOG4212|consen 531 GAARKACQIIIRNLPFDFTWQMLRDKFREIGHVLYADIMENGKSKGVVRFFSPEDAERACALMNGSRLDGRNIKVTYF 608 (608)
T ss_pred cccccccEEEEecCCccccHHHHHHHHHhccceehhhhhccCCccceEEecCHHHHHHHHHHhccCcccCceeeeeeC
Confidence 456788999999999999999999999999999999997773444589999999999999999999999999999863
No 67
>KOG0147|consensus
Probab=99.19 E-value=3e-11 Score=107.73 Aligned_cols=77 Identities=32% Similarity=0.558 Sum_probs=71.5
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
.+|||+||..++||++|+.+|+.|+.|+.|.+++| +++|+||+|.+.++|++|+++||+.+|.++.|+|...+.+
T Consensus 279 ~rl~vgnLHfNite~~lr~ifepfg~Ie~v~l~~d~~tG~skgfGfi~f~~~~~ar~a~e~lngfelAGr~ikV~~v~~r 358 (549)
T KOG0147|consen 279 RRLYVGNLHFNITEDMLRGIFEPFGKIENVQLTKDSETGRSKGFGFITFVNKEDARKALEQLNGFELAGRLIKVSVVTER 358 (549)
T ss_pred hhhhhcccccCchHHHHhhhccCcccceeeeeccccccccccCcceEEEecHHHHHHHHHHhccceecCceEEEEEeeee
Confidence 34999999999999999999999999999999998 7999999999999999999999999999999999887655
Q ss_pred CC
Q psy2347 85 GR 86 (207)
Q Consensus 85 ~~ 86 (207)
..
T Consensus 359 ~~ 360 (549)
T KOG0147|consen 359 VD 360 (549)
T ss_pred cc
Confidence 43
No 68
>KOG0146|consensus
Probab=99.19 E-value=2.6e-11 Score=100.06 Aligned_cols=80 Identities=25% Similarity=0.431 Sum_probs=71.0
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCcee---CCcEEEEEE
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDL---MGSKLRVEI 80 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i---~g~~l~V~~ 80 (207)
+.++|||+-|....+|||++.+|..||+|++|.|+.. +|+||||.|.+..+|+.||..||+... ....|.|++
T Consensus 18 ~drklfvgml~kqq~e~dvrrlf~pfG~~~e~tvlrg~dg~sKGCAFVKf~s~~eAqaAI~aLHgSqTmpGASSSLVVK~ 97 (371)
T KOG0146|consen 18 DDRKLFVGMLNKQQSEDDVRRLFQPFGNIEECTVLRGPDGNSKGCAFVKFSSHAEAQAAINALHGSQTMPGASSSLVVKF 97 (371)
T ss_pred cchhhhhhhhcccccHHHHHHHhcccCCcceeEEecCCCCCCCCceEEEeccchHHHHHHHHhcccccCCCCccceEEEe
Confidence 5789999999999999999999999999999999876 899999999999999999999998653 246799999
Q ss_pred eecCCCC
Q psy2347 81 SRGRGRG 87 (207)
Q Consensus 81 a~~k~~~ 87 (207)
+...+++
T Consensus 98 ADTdkER 104 (371)
T KOG0146|consen 98 ADTDKER 104 (371)
T ss_pred ccchHHH
Confidence 8765443
No 69
>KOG0124|consensus
Probab=99.17 E-value=2.1e-11 Score=104.39 Aligned_cols=73 Identities=32% Similarity=0.569 Sum_probs=69.2
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
-|+|||+.|.+++.|+.|+..|..||+|++|.+.+| .|+||||||+-+|.|+.|++.||+..+.++.|+|...
T Consensus 113 McRvYVGSIsfEl~EDtiR~AF~PFGPIKSInMSWDp~T~kHKgFAFVEYEvPEaAqLAlEqMNg~mlGGRNiKVgrP 190 (544)
T KOG0124|consen 113 MCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQMLGGRNIKVGRP 190 (544)
T ss_pred hHheeeeeeEEEechHHHHhhccCCCCcceeecccccccccccceEEEEEeCcHHHHHHHHHhccccccCccccccCC
Confidence 478999999999999999999999999999999888 7899999999999999999999999999999998743
No 70
>KOG0110|consensus
Probab=99.14 E-value=1.3e-10 Score=106.26 Aligned_cols=73 Identities=36% Similarity=0.482 Sum_probs=68.3
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--------PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
++|||.||++++|.++|+.+|.+.+.|..|.|.+. +.+|+||+|.++++|++|+++|+++.|+++.|.|+++
T Consensus 516 t~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFVEF~~~e~A~~a~k~lqgtvldGH~l~lk~S 595 (725)
T KOG0110|consen 516 TKLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFVEFAKPESAQAALKALQGTVLDGHKLELKIS 595 (725)
T ss_pred hhhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEEEecCHHHHHHHHHHhcCceecCceEEEEec
Confidence 34999999999999999999999999999988765 3499999999999999999999999999999999999
Q ss_pred e
Q psy2347 82 R 82 (207)
Q Consensus 82 ~ 82 (207)
.
T Consensus 596 ~ 596 (725)
T KOG0110|consen 596 E 596 (725)
T ss_pred c
Confidence 7
No 71
>KOG4661|consensus
Probab=99.14 E-value=1e-10 Score=105.02 Aligned_cols=76 Identities=28% Similarity=0.490 Sum_probs=71.3
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
-.+.|||..|+..+...+|+.||++|++|+-.+|+++ .++|.||++.+.++|.+||+.||..+|.++.|.|+.++
T Consensus 404 ~gRNlWVSGLSstTRAtDLKnlFSKyGKVvGAKVVTNaRsPGaRCYGfVTMSts~eAtkCI~hLHrTELHGrmISVEkaK 483 (940)
T KOG4661|consen 404 LGRNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNARSPGARCYGFVTMSTSAEATKCIEHLHRTELHGRMISVEKAK 483 (940)
T ss_pred cccceeeeccccchhhhHHHHHHHHhcceeceeeeecCCCCCcceeEEEEecchHHHHHHHHHhhhhhhcceeeeeeecc
Confidence 3578999999999999999999999999999999988 68999999999999999999999999999999999886
Q ss_pred c
Q psy2347 83 G 83 (207)
Q Consensus 83 ~ 83 (207)
.
T Consensus 484 N 484 (940)
T KOG4661|consen 484 N 484 (940)
T ss_pred c
Confidence 4
No 72
>KOG1457|consensus
Probab=99.12 E-value=3.7e-10 Score=91.13 Aligned_cols=85 Identities=21% Similarity=0.344 Sum_probs=71.6
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC------CCcEEEEEEcCHHHHHHHHHhcCCceeC---CcEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN------PPGFAFVEFSNQIDAEAACDSMNDQDLM---GSKL 76 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d------~rG~afV~F~~~e~A~~Al~~l~g~~i~---g~~l 76 (207)
....++|||.+||.++...||..+|..|---+.+.|... .+.++||+|.+..+|++|+++||+..|+ ...|
T Consensus 31 ~~~VRTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~~~~~~pvaFatF~s~q~A~aamnaLNGvrFDpE~~stL 110 (284)
T KOG1457|consen 31 PGAVRTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKGDQVCKPVAFATFTSHQFALAAMNALNGVRFDPETGSTL 110 (284)
T ss_pred ccccceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCCCccccceEEEEecchHHHHHHHHHhcCeeeccccCcee
Confidence 345789999999999999999999999876666655443 3579999999999999999999999985 6789
Q ss_pred EEEEeecCCCCCCC
Q psy2347 77 RVEISRGRGRGRGG 90 (207)
Q Consensus 77 ~V~~a~~k~~~~~~ 90 (207)
+|++|+...+.+++
T Consensus 111 hiElAKSNtK~kr~ 124 (284)
T KOG1457|consen 111 HIELAKSNTKRKRR 124 (284)
T ss_pred EeeehhcCcccccC
Confidence 99999887766554
No 73
>KOG0533|consensus
Probab=99.06 E-value=8.5e-10 Score=91.21 Aligned_cols=80 Identities=34% Similarity=0.449 Sum_probs=72.3
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
...++|.|.|||..|++++|+++|++|++++.+.|..+ +.+.|-|+|+..++|++||+.+|+..|+++.|++.+..
T Consensus 81 ~~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vhy~~~G~s~Gta~v~~~r~~DA~~avk~~~gv~ldG~~mk~~~i~ 160 (243)
T KOG0533|consen 81 TRSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVHYDRAGRSLGTADVSFNRRDDAERAVKKYNGVALDGRPMKIEIIS 160 (243)
T ss_pred CCcceeeeecCCcCcchHHHHHHHHHhccceEEeeccCCCCCCCccceeeecchHhHHHHHHHhcCcccCCceeeeEEec
Confidence 44578999999999999999999999999988888877 77999999999999999999999999999999998876
Q ss_pred cCCC
Q psy2347 83 GRGR 86 (207)
Q Consensus 83 ~k~~ 86 (207)
+...
T Consensus 161 ~~~~ 164 (243)
T KOG0533|consen 161 SPSQ 164 (243)
T ss_pred Cccc
Confidence 5443
No 74
>KOG4454|consensus
Probab=99.06 E-value=6.3e-11 Score=95.23 Aligned_cols=81 Identities=22% Similarity=0.229 Sum_probs=72.9
Q ss_pred CCCCCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 1 MSDGMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 1 ms~~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
|-..+.+...+|||.||...++|+.|.++|-+.|+|.+|.|..+ ...||||+|+++.++.-|++.||+..|.+..|+
T Consensus 1 mgaaaae~drtl~v~n~~~~v~eelL~ElfiqaGPV~kv~ip~~~d~~~kFa~v~f~~E~sv~~a~~L~ng~~l~~~e~q 80 (267)
T KOG4454|consen 1 MGAAAAEMDRTLLVQNMYSGVSEELLSELFIQAGPVYKVGIPSGQDQEQKFAYVFFPNENSVQLAGQLENGDDLEEDEEQ 80 (267)
T ss_pred CCCCCcchhhHHHHHhhhhhhhHHHHHHHhhccCceEEEeCCCCccCCCceeeeecccccchhhhhhhcccchhccchhh
Confidence 45667788899999999999999999999999999999999876 333999999999999999999999999998888
Q ss_pred EEEe
Q psy2347 78 VEIS 81 (207)
Q Consensus 78 V~~a 81 (207)
|++.
T Consensus 81 ~~~r 84 (267)
T KOG4454|consen 81 RTLR 84 (267)
T ss_pred cccc
Confidence 8764
No 75
>KOG4211|consensus
Probab=99.02 E-value=1.1e-09 Score=96.79 Aligned_cols=79 Identities=19% Similarity=0.273 Sum_probs=68.5
Q ss_pred CCCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 3 DGMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 3 ~~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
+...+....|-+..|||++|++||++||+.+ .|+++++.+. +.+.|||||+++|++++||++ +...+..+-|+|-
T Consensus 4 ~~e~~~~~~vr~rGLPwsat~~ei~~Ff~~~-~I~~~~~~r~~Gr~sGeA~Ve~~seedv~~Alkk-dR~~mg~RYIEVf 81 (510)
T KOG4211|consen 4 ENEGSTAFEVRLRGLPWSATEKEILDFFSNC-GIENLEIPRRNGRPSGEAYVEFTSEEDVEKALKK-DRESMGHRYIEVF 81 (510)
T ss_pred ccCCCcceEEEecCCCccccHHHHHHHHhcC-ceeEEEEeccCCCcCcceEEEeechHHHHHHHHh-hHHHhCCceEEEE
Confidence 3456667788889999999999999999999 4888888776 789999999999999999996 8888999999998
Q ss_pred Eeec
Q psy2347 80 ISRG 83 (207)
Q Consensus 80 ~a~~ 83 (207)
.+.+
T Consensus 82 ~~~~ 85 (510)
T KOG4211|consen 82 TAGG 85 (510)
T ss_pred ccCC
Confidence 7754
No 76
>KOG4208|consensus
Probab=99.01 E-value=1.4e-09 Score=86.72 Aligned_cols=78 Identities=27% Similarity=0.388 Sum_probs=69.0
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhC-CCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKF-GKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~-G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
.....++|..||.-+.+.+|..+|.+| +.|..+++.++ +++||||+|+++|.|+-|.+.||+..|.++.|.|.+
T Consensus 47 ~~~g~~~~~~~p~g~~e~~~~~~~~q~~g~v~r~rlsRnkrTGNSKgYAFVEFEs~eVA~IaAETMNNYLl~e~lL~c~v 126 (214)
T KOG4208|consen 47 EIEGVVYVDHIPHGFFETEILNYFRQFGGTVTRFRLSRNKRTGNSKGYAFVEFESEEVAKIAAETMNNYLLMEHLLECHV 126 (214)
T ss_pred CCccceeecccccchhHHHHhhhhhhcCCeeEEEEeecccccCCcCceEEEEeccHHHHHHHHHHhhhhhhhhheeeeEE
Confidence 345678899999999999999999998 67777777565 899999999999999999999999999999999998
Q ss_pred eecC
Q psy2347 81 SRGR 84 (207)
Q Consensus 81 a~~k 84 (207)
..|.
T Consensus 127 mppe 130 (214)
T KOG4208|consen 127 MPPE 130 (214)
T ss_pred eCch
Confidence 7654
No 77
>KOG0110|consensus
Probab=99.01 E-value=3.1e-10 Score=103.97 Aligned_cols=75 Identities=29% Similarity=0.528 Sum_probs=71.2
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
.++|+|.|||+.++..+|+++|..||.|..|+|++. .++||||+|-++++|.+|+++|....|.++.|.++||+.
T Consensus 613 ~tKIlVRNipFeAt~rEVr~LF~aFGqlksvRlPKK~~k~a~rGF~Fv~f~t~~ea~nA~~al~STHlyGRrLVLEwA~~ 692 (725)
T KOG0110|consen 613 GTKILVRNIPFEATKREVRKLFTAFGQLKSVRLPKKIGKGAHRGFGFVDFLTPREAKNAFDALGSTHLYGRRLVLEWAKS 692 (725)
T ss_pred cceeeeeccchHHHHHHHHHHHhcccceeeeccchhhcchhhccceeeeccCcHHHHHHHHhhcccceechhhheehhcc
Confidence 469999999999999999999999999999999876 689999999999999999999999999999999999975
No 78
>KOG0106|consensus
Probab=98.95 E-value=9.9e-10 Score=89.13 Aligned_cols=73 Identities=36% Similarity=0.660 Sum_probs=67.1
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecCC
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGRG 85 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k~ 85 (207)
.+|||++||+.+.+.+|++||..|+.|.+|.+. .+|+||+|++..+|+.||..||+.+|.+.+|.|++++.+.
T Consensus 2 ~rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~mk---~gf~fv~fed~rda~Dav~~l~~~~l~~e~~vve~~r~~~ 74 (216)
T KOG0106|consen 2 PRVYIGRLPYRARERDVERFFKGYGKIPDADMK---NGFGFVEFEDPRDADDAVHDLDGKELCGERLVVEHARGKR 74 (216)
T ss_pred CceeecccCCccchhHHHHHHhhccccccceee---cccceeccCchhhhhcccchhcCceecceeeeeecccccc
Confidence 479999999999999999999999999999886 4699999999999999999999999999998888887643
No 79
>KOG0123|consensus
Probab=98.94 E-value=1e-09 Score=96.49 Aligned_cols=82 Identities=33% Similarity=0.528 Sum_probs=75.4
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
......+|||.||+..++++.|+++|+.+++|+.++|+.+ +++|+||+|++.++|++|+..+|+..|.++.|.|.+
T Consensus 266 ~~~~~~nl~vknld~~~~~e~L~~~f~~~GeI~s~kv~~~~~g~skG~gfV~fs~~eeA~~A~~~~n~~~i~~k~l~vav 345 (369)
T KOG0123|consen 266 VSLQGANLYVKNLDETLSDEKLRKIFSSFGEITSAKVMVDENGKSKGFGFVEFSSPEEAKKAMTEMNGRLIGGKPLYVAV 345 (369)
T ss_pred ccccccccccccCccccchhHHHHHHhcccceeeEEEEeccCCCccceEEEEcCCHHHHHHHHHhhChhhhcCCchhhhH
Confidence 3556789999999999999999999999999999999987 889999999999999999999999999999999998
Q ss_pred eecCCC
Q psy2347 81 SRGRGR 86 (207)
Q Consensus 81 a~~k~~ 86 (207)
+..+..
T Consensus 346 ~qr~~~ 351 (369)
T KOG0123|consen 346 AQRKED 351 (369)
T ss_pred Hhhhcc
Confidence 875443
No 80
>KOG0124|consensus
Probab=98.89 E-value=3.9e-09 Score=90.62 Aligned_cols=77 Identities=30% Similarity=0.439 Sum_probs=71.2
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
+-.+|||..+.++++|+||+.+|+.||+|++|++.++ .|+|.||||++..+...||..||-..|.+.-|+|-.+-
T Consensus 209 ~fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~pt~~~HkGyGfiEy~n~qs~~eAiasMNlFDLGGQyLRVGk~v 288 (544)
T KOG0124|consen 209 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDLGGQYLRVGKCV 288 (544)
T ss_pred hhheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccCCCCCccceeeEEeccccchHHHhhhcchhhcccceEeccccc
Confidence 4579999999999999999999999999999999887 78999999999999999999999999999999997765
Q ss_pred cC
Q psy2347 83 GR 84 (207)
Q Consensus 83 ~k 84 (207)
..
T Consensus 289 TP 290 (544)
T KOG0124|consen 289 TP 290 (544)
T ss_pred CC
Confidence 43
No 81
>KOG1548|consensus
Probab=98.88 E-value=8e-09 Score=87.99 Aligned_cols=76 Identities=26% Similarity=0.370 Sum_probs=68.7
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCee--------EEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCc
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLN--------SVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGS 74 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~--------~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~ 74 (207)
..+++|||.+||.++|.+++.++|++|+.|. .|+|..+ .|+=|+|+|...|+++.||+.|+..+|.++
T Consensus 132 ~~Nt~VYVsgLP~DiT~dE~~~~~sKcGiI~~d~~t~epk~KlYrd~~G~lKGDaLc~y~K~ESVeLA~~ilDe~~~rg~ 211 (382)
T KOG1548|consen 132 KVNTSVYVSGLPLDITVDEFAEVMSKCGIIMRDPQTGEPKVKLYRDNQGKLKGDALCCYIKRESVELAIKILDEDELRGK 211 (382)
T ss_pred ccCceEEecCCCCcccHHHHHHHHHhcceEeccCCCCCeeEEEEecCCCCccCceEEEeecccHHHHHHHHhCcccccCc
Confidence 4567899999999999999999999999875 3666666 789999999999999999999999999999
Q ss_pred EEEEEEee
Q psy2347 75 KLRVEISR 82 (207)
Q Consensus 75 ~l~V~~a~ 82 (207)
+|+|+.|+
T Consensus 212 ~~rVerAk 219 (382)
T KOG1548|consen 212 KLRVERAK 219 (382)
T ss_pred EEEEehhh
Confidence 99999885
No 82
>KOG4660|consensus
Probab=98.86 E-value=1.9e-09 Score=96.56 Aligned_cols=71 Identities=25% Similarity=0.426 Sum_probs=66.2
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
.+..+|+|-|||..|++++|+.+|+.||+|+.|+.....++.+||+|.|..+|++|+++|+..+|.++.|+
T Consensus 73 ~~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir~t~~~~~~~~v~FyDvR~A~~Alk~l~~~~~~~~~~k 143 (549)
T KOG4660|consen 73 MNQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIRETPNKRGIVFVEFYDVRDAERALKALNRREIAGKRIK 143 (549)
T ss_pred CccceEEEEecCCcCCHHHHHHHHHhhcchhhhhcccccCceEEEEEeehHhHHHHHHHHHHHHhhhhhhc
Confidence 35679999999999999999999999999999877666889999999999999999999999999999887
No 83
>KOG4209|consensus
Probab=98.85 E-value=5.6e-09 Score=86.22 Aligned_cols=82 Identities=30% Similarity=0.457 Sum_probs=73.9
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
....+.+.|||+|+...+|.++|+.+|+.|+.|..|.|++| +|+|+||+|.+.+.+++|+. |+...|.++.|+|
T Consensus 96 ~~~~d~~sv~v~nvd~~~t~~~~e~hf~~Cg~i~~~ti~~d~~~~~~k~~~yvef~~~~~~~~ay~-l~gs~i~~~~i~v 174 (231)
T KOG4209|consen 96 QKEVDAPSVWVGNVDFLVTLTKIELHFESCGGINRVTVPKDKFRGHPKGFAYVEFSSYELVEEAYK-LDGSEIPGPAIEV 174 (231)
T ss_pred hhccCCceEEEeccccccccchhhheeeccCCccceeeeccccCCCcceeEEEecccHhhhHHHhh-cCCccccccccee
Confidence 34456789999999999999999999999999998888887 68999999999999999999 8999999999999
Q ss_pred EEeecCCC
Q psy2347 79 EISRGRGR 86 (207)
Q Consensus 79 ~~a~~k~~ 86 (207)
.+.+.+..
T Consensus 175 t~~r~~~p 182 (231)
T KOG4209|consen 175 TLKRTNVP 182 (231)
T ss_pred eeeeeecC
Confidence 98877633
No 84
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=98.81 E-value=3.2e-08 Score=70.89 Aligned_cols=75 Identities=19% Similarity=0.302 Sum_probs=63.1
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhC--CCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeC----CcEEEE
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKF--GKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLM----GSKLRV 78 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~--G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~----g~~l~V 78 (207)
+||.|.|||...|.++|.+++.+. +....+.++.| +.+||||.|.++++|.+..+.++++.+. .+.++|
T Consensus 2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~~N~GYAFVNf~~~~~~~~F~~~f~g~~w~~~~s~Kvc~i 81 (97)
T PF04059_consen 2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNKCNLGYAFVNFTSPQAAIRFYKAFNGKKWPNFNSKKVCEI 81 (97)
T ss_pred eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCCCceEEEEEEcCCHHHHHHHHHHHcCCccccCCCCcEEEE
Confidence 689999999999999999998763 44556666666 6899999999999999999999998874 577888
Q ss_pred EEeecC
Q psy2347 79 EISRGR 84 (207)
Q Consensus 79 ~~a~~k 84 (207)
.+|+-.
T Consensus 82 ~yAriQ 87 (97)
T PF04059_consen 82 SYARIQ 87 (97)
T ss_pred ehhHhh
Confidence 888643
No 85
>KOG1995|consensus
Probab=98.70 E-value=4.7e-08 Score=83.72 Aligned_cols=81 Identities=21% Similarity=0.339 Sum_probs=71.0
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCee--------EEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeC
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLN--------SVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLM 72 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~--------~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~ 72 (207)
.....+|||..||..+++++|.++|.+++.|+ +|+|-+| +|+-|.|+|++..+|++||+.++.+.+.
T Consensus 63 ~s~~~ti~v~g~~d~~~~~~~~~~f~qcg~ikrnK~t~kPki~~y~dkeT~~~KGeatvS~~D~~~akaai~~~agkdf~ 142 (351)
T KOG1995|consen 63 KSDNETIFVWGCPDSVCENDNADFFLQCGVIKRNKRTGKPKIKIYTDKETGAPKGEATVSYEDPPAAKAAIEWFAGKDFC 142 (351)
T ss_pred ccccccceeeccCccchHHHHHHHHhhcceeccCCCCCCcchhccccccccCcCCceeeeecChhhhhhhhhhhcccccc
Confidence 45567999999999999999999999999875 2444444 8899999999999999999999999999
Q ss_pred CcEEEEEEeecCCC
Q psy2347 73 GSKLRVEISRGRGR 86 (207)
Q Consensus 73 g~~l~V~~a~~k~~ 86 (207)
+.+|+|.+|+.+..
T Consensus 143 gn~ikvs~a~~r~~ 156 (351)
T KOG1995|consen 143 GNTIKVSLAERRTG 156 (351)
T ss_pred CCCchhhhhhhccC
Confidence 99999999987663
No 86
>KOG0151|consensus
Probab=98.65 E-value=5.6e-08 Score=89.40 Aligned_cols=78 Identities=28% Similarity=0.429 Sum_probs=72.2
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--------PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
.+..+.|||.||++.++|+.|...|..|++|..|+||.. .+.|+||.|-+..+|++|++.|+++.+....|+
T Consensus 171 DP~TTNlyv~Nlnpsv~E~~ll~tfGrfgPlasvKimwpRtEeEk~r~r~cgfvafmnR~D~era~k~lqg~iv~~~e~K 250 (877)
T KOG0151|consen 171 DPQTTNLYVGNLNPSVDENFLLRTFGRFGPLASVKIMWPRTEEEKRRERNCGFVAFMNRADAERALKELQGIIVMEYEMK 250 (877)
T ss_pred CCcccceeeecCCccccHHHHHHHhcccCcccceeeecccchhhhccccccceeeehhhhhHHHHHHHhcceeeeeeeee
Confidence 456789999999999999999999999999999999876 568999999999999999999999999999999
Q ss_pred EEEeec
Q psy2347 78 VEISRG 83 (207)
Q Consensus 78 V~~a~~ 83 (207)
+-|+++
T Consensus 251 ~gWgk~ 256 (877)
T KOG0151|consen 251 LGWGKA 256 (877)
T ss_pred eccccc
Confidence 998854
No 87
>KOG1190|consensus
Probab=98.65 E-value=1.2e-07 Score=82.62 Aligned_cols=75 Identities=27% Similarity=0.423 Sum_probs=69.8
Q ss_pred CcEEEEcCCCc-CCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 9 GTRVYVGGLTE-TVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 9 ~~~lfV~nLp~-~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
++.|.|.||.+ .+|.+.|..+|.-||+|.+|+|+.+++--|+|.|.+...|+-|++.|+++.|.+++|+|.+++-
T Consensus 297 n~vllvsnln~~~VT~d~LftlFgvYGdVqRVkil~nkkd~ALIQmsd~~qAqLA~~hL~g~~l~gk~lrvt~SKH 372 (492)
T KOG1190|consen 297 NVVLLVSNLNEEAVTPDVLFTLFGVYGDVQRVKILYNKKDNALIQMSDGQQAQLAMEHLEGHKLYGKKLRVTLSKH 372 (492)
T ss_pred ceEEEEecCchhccchhHHHHHHhhhcceEEEEeeecCCcceeeeecchhHHHHHHHHhhcceecCceEEEeeccC
Confidence 57899999955 5899999999999999999999999888999999999999999999999999999999999874
No 88
>KOG4206|consensus
Probab=98.60 E-value=2.2e-07 Score=75.14 Aligned_cols=80 Identities=19% Similarity=0.324 Sum_probs=70.8
Q ss_pred CCCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeC-CcEEEEEEe
Q psy2347 3 DGMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLM-GSKLRVEIS 81 (207)
Q Consensus 3 ~~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~-g~~l~V~~a 81 (207)
+.+.++...||+.|||.+++.+.|..+|++|.-.++|+++...+..|||+|.+...|..|...|++..|. ..+|+|.++
T Consensus 140 ~~~~ppn~ilf~~niP~es~~e~l~~lf~qf~g~keir~i~~~~~iAfve~~~d~~a~~a~~~lq~~~it~~~~m~i~~a 219 (221)
T KOG4206|consen 140 AQMAPPNNILFLTNIPSESESEMLSDLFEQFPGFKEIRLIPPRSGIAFVEFLSDRQASAAQQALQGFKITKKNTMQITFA 219 (221)
T ss_pred ccCCCCceEEEEecCCcchhHHHHHHHHhhCcccceeEeccCCCceeEEecchhhhhHHHhhhhccceeccCceEEeccc
Confidence 3456788899999999999999999999999999999998877889999999999999999999988876 777877766
Q ss_pred e
Q psy2347 82 R 82 (207)
Q Consensus 82 ~ 82 (207)
+
T Consensus 220 ~ 220 (221)
T KOG4206|consen 220 K 220 (221)
T ss_pred C
Confidence 4
No 89
>KOG4211|consensus
Probab=98.59 E-value=1e-07 Score=84.63 Aligned_cols=74 Identities=24% Similarity=0.311 Sum_probs=62.6
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeE-EEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNS-VWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~-v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
....|-|..||+.||++||.+||+-.-.|.. |.++.+ +.+.|||.|++.|+|++||.. |...|..+-|+|..+.
T Consensus 102 ~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d~rgR~tGEAfVqF~sqe~ae~Al~r-hre~iGhRYIEvF~Ss 180 (510)
T KOG4211|consen 102 NDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMDQRGRPTGEAFVQFESQESAEIALGR-HRENIGHRYIEVFRSS 180 (510)
T ss_pred CCceEEecCCCccCcHHHHHHHhcCCcccccceeeeccCCCCcccceEEEecCHHHHHHHHHH-HHHhhccceEEeehhH
Confidence 5678999999999999999999998865555 444555 779999999999999999987 7888888989987664
No 90
>KOG0106|consensus
Probab=98.58 E-value=3.4e-08 Score=80.22 Aligned_cols=72 Identities=33% Similarity=0.450 Sum_probs=63.9
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
...+.|+|.+|+..+.+.+|+++|+++++++...++ +.++||+|+++++|++||+.|++.+|.++.|+|...
T Consensus 97 ~s~~r~~~~~~~~r~~~qdl~d~~~~~g~~~~~~~~---~~~~~v~Fs~~~da~ra~~~l~~~~~~~~~l~~~~~ 168 (216)
T KOG0106|consen 97 RTHFRLIVRNLSLRVSWQDLKDHFRPAGEVTYVDAR---RNFAFVEFSEQEDAKRALEKLDGKKLNGRRISVEKN 168 (216)
T ss_pred cccceeeeccchhhhhHHHHhhhhcccCCCchhhhh---ccccceeehhhhhhhhcchhccchhhcCceeeeccc
Confidence 456789999999999999999999999999665553 569999999999999999999999999999999443
No 91
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=98.53 E-value=1.5e-07 Score=68.54 Aligned_cols=71 Identities=21% Similarity=0.386 Sum_probs=46.2
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCc-----eeCCcEEEEEEe
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQ-----DLMGSKLRVEIS 81 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~-----~i~g~~l~V~~a 81 (207)
+.|+|.+++..++.++|+++|++|++|..|.+... .-.|+|.|.++++|++|++++... .|.+..+++++-
T Consensus 2 ~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G-~~~g~VRf~~~~~A~~a~~~~~~~~~~~~~i~~~~~~~~vL 77 (105)
T PF08777_consen 2 CILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRG-DTEGYVRFKTPEAAQKALEKLKEANDGKLKIKGKEVTLEVL 77 (105)
T ss_dssp -EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT--SEEEEEESS---HHHHHHHHHHTTTS-B-TTSSSEEEE--
T ss_pred eEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCC-CCEEEEEECCcchHHHHHHHHHhccCCceEEcCceEEEEEC
Confidence 57899999999999999999999999999998876 458999999999999999876544 566777776654
No 92
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=98.51 E-value=4.5e-07 Score=62.74 Aligned_cols=68 Identities=31% Similarity=0.417 Sum_probs=47.3
Q ss_pred cEEEEcCCCcCCcHHHH----HHHHHhCC-CeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 10 TRVYVGGLTETVKKEDL----ELEFEKFG-KLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L----~~~F~~~G-~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
+.|+|.|||.+.+...| +.++..+| .|..| ....|+|.|.++|.|++|++.|++..+-+.+|.|++..
T Consensus 3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v-----~~~tAilrF~~~~~A~RA~KRmegEdVfG~kI~v~~~~ 75 (90)
T PF11608_consen 3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSV-----SGGTAILRFPNQEFAERAQKRMEGEDVFGNKISVSFSP 75 (90)
T ss_dssp EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE-------TT-EEEEESSHHHHHHHHHHHTT--SSSS--EEESS-
T ss_pred cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEE-----eCCEEEEEeCCHHHHHHHHHhhcccccccceEEEEEcC
Confidence 47999999998887654 55666675 35554 25799999999999999999999999999999999874
No 93
>KOG1457|consensus
Probab=98.51 E-value=1.1e-07 Score=77.03 Aligned_cols=63 Identities=13% Similarity=0.261 Sum_probs=54.9
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCCcee
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMNDQDL 71 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g~~i 71 (207)
..+|||.||..++||++|+.+|+.|--...++|... -..+|||+|++.+.|..||..|++..|
T Consensus 210 cstlfianl~~~~~ed~l~~~~~~~~gf~~l~~~~~~g~~vaf~~~~~~~~at~am~~lqg~~~ 273 (284)
T KOG1457|consen 210 CSTLFIANLGPNCTEDELKQLLSRYPGFHILKIRARGGMPVAFADFEEIEQATDAMNHLQGNLL 273 (284)
T ss_pred hhhHhhhccCCCCCHHHHHHHHHhCCCceEEEEecCCCcceEeecHHHHHHHHHHHHHhhccee
Confidence 468999999999999999999999988777777654 456899999999999999999887665
No 94
>KOG0120|consensus
Probab=98.42 E-value=2.4e-07 Score=83.55 Aligned_cols=80 Identities=23% Similarity=0.412 Sum_probs=73.7
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
..+....+|||++||..++++++++++..|++++..+++.| +++|||.+|.+......|++.||++.+.+++|.|
T Consensus 284 ~~~~~~~ki~v~~lp~~l~~~q~~Ell~~fg~lk~f~lv~d~~~g~skg~af~ey~dpsvtd~A~agLnGm~lgd~~lvv 363 (500)
T KOG0120|consen 284 DVPDSPNKIFVGGLPLYLTEDQVKELLDSFGPLKAFRLVKDSATGNSKGFAFCEYCDPSVTDQAIAGLNGMQLGDKKLVV 363 (500)
T ss_pred CcccccchhhhccCcCccCHHHHHHHHHhcccchhheeecccccccccceeeeeeeCCcchhhhhcccchhhhcCceeEe
Confidence 34556789999999999999999999999999999988887 7999999999999999999999999999999999
Q ss_pred EEeec
Q psy2347 79 EISRG 83 (207)
Q Consensus 79 ~~a~~ 83 (207)
..|.+
T Consensus 364 q~A~~ 368 (500)
T KOG0120|consen 364 QRAIV 368 (500)
T ss_pred ehhhc
Confidence 98754
No 95
>KOG0226|consensus
Probab=98.42 E-value=2e-07 Score=76.71 Aligned_cols=79 Identities=24% Similarity=0.388 Sum_probs=71.4
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
+...+||.+.|..+++++.|...|.+|-.....++++| +++|.||.|.+.+++..|++.|+++.+..+.|++..+
T Consensus 188 ~~DfRIfcgdlgNevnd~vl~raf~Kfpsf~~akviRdkRTgKSkgygfVSf~~pad~~rAmrem~gkyVgsrpiklRkS 267 (290)
T KOG0226|consen 188 EDDFRIFCGDLGNEVNDDVLARAFKKFPSFQKAKVIRDKRTGKSKGYGFVSFRDPADYVRAMREMNGKYVGSRPIKLRKS 267 (290)
T ss_pred cccceeecccccccccHHHHHHHHHhccchhhccccccccccccccceeeeecCHHHHHHHHHhhcccccccchhHhhhh
Confidence 45679999999999999999999999988888888888 7899999999999999999999999999999988766
Q ss_pred ecCC
Q psy2347 82 RGRG 85 (207)
Q Consensus 82 ~~k~ 85 (207)
..++
T Consensus 268 ~wke 271 (290)
T KOG0226|consen 268 EWKE 271 (290)
T ss_pred hHHh
Confidence 5544
No 96
>KOG1855|consensus
Probab=98.34 E-value=3.6e-07 Score=79.94 Aligned_cols=64 Identities=19% Similarity=0.226 Sum_probs=57.2
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC------------------CCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN------------------PPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d------------------~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.++++|.+.|||.+-.-+.|.+||..+|.|+.|+|++. .+-||+|||+..++|.+|.+.|+.
T Consensus 229 l~srtivaenLP~Dh~~enl~kiFg~~G~IksIRIckPgaip~d~r~~~~~~~~~~tk~~AlvEye~~~~A~KA~e~~~~ 308 (484)
T KOG1855|consen 229 LPSRTIVAENLPLDHSYENLSKIFGTVGSIKSIRICKPGAIPEDVRGFPKKYFELQTKECALVEYEEVEAARKARELLNP 308 (484)
T ss_pred cccceEEEecCCcchHHHHHHHHhhcccceeeeeecCCCCCCcccccCCccchhhhhhhhhhhhhhhhHHHHHHHHhhch
Confidence 47899999999999999999999999999999999765 267999999999999999998865
Q ss_pred ce
Q psy2347 69 QD 70 (207)
Q Consensus 69 ~~ 70 (207)
..
T Consensus 309 e~ 310 (484)
T KOG1855|consen 309 EQ 310 (484)
T ss_pred hh
Confidence 43
No 97
>KOG1456|consensus
Probab=98.32 E-value=3.9e-06 Score=72.61 Aligned_cols=79 Identities=28% Similarity=0.359 Sum_probs=71.5
Q ss_pred CCCCcEEEEcCCCcC-CcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 6 MERGTRVYVGGLTET-VKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~-~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
-.+.+.++|-.|... +..+.|..+|..||.|++|++|+.+.+.|.||+-|+.++++||..||+..+-+.+|.|.+++..
T Consensus 284 ~~~g~VmMVyGLdh~k~N~drlFNl~ClYGNV~rvkFmkTk~gtamVemgd~~aver~v~hLnn~~lfG~kl~v~~SkQ~ 363 (494)
T KOG1456|consen 284 GAPGCVMMVYGLDHGKMNCDRLFNLFCLYGNVERVKFMKTKPGTAMVEMGDAYAVERAVTHLNNIPLFGGKLNVCVSKQN 363 (494)
T ss_pred CCCCcEEEEEeccccccchhhhhhhhhhcCceeeEEEeecccceeEEEcCcHHHHHHHHHHhccCccccceEEEeecccc
Confidence 356789999999764 6678899999999999999999998899999999999999999999999999999999988753
No 98
>KOG1456|consensus
Probab=98.30 E-value=3.2e-06 Score=73.07 Aligned_cols=81 Identities=17% Similarity=0.133 Sum_probs=66.5
Q ss_pred CCCCCCcEEEEcCC--CcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCC--cEEEEE
Q psy2347 4 GMMERGTRVYVGGL--TETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMG--SKLRVE 79 (207)
Q Consensus 4 ~~~~~~~~lfV~nL--p~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g--~~l~V~ 79 (207)
+...+++.|.+.=| -..+|.+.|..+....|+|..|.|++..---|.|||++.+.|++|.++||+..|.. .+|+|+
T Consensus 115 es~~pN~VLl~TIlNp~YpItvDVly~Icnp~GkVlRIvIfkkngVQAmVEFdsv~~AqrAk~alNGADIYsGCCTLKIe 194 (494)
T KOG1456|consen 115 ESATPNKVLLFTILNPQYPITVDVLYTICNPQGKVLRIVIFKKNGVQAMVEFDSVEVAQRAKAALNGADIYSGCCTLKIE 194 (494)
T ss_pred CCCCCCeEEEEEeecCccccchhhhhhhcCCCCceEEEEEEeccceeeEEeechhHHHHHHHhhcccccccccceeEEEE
Confidence 34455666666544 45689999999999999999999998755679999999999999999999999854 678888
Q ss_pred EeecC
Q psy2347 80 ISRGR 84 (207)
Q Consensus 80 ~a~~k 84 (207)
+|+|.
T Consensus 195 yAkP~ 199 (494)
T KOG1456|consen 195 YAKPT 199 (494)
T ss_pred ecCcc
Confidence 88874
No 99
>KOG1190|consensus
Probab=98.25 E-value=2.7e-06 Score=74.26 Aligned_cols=78 Identities=18% Similarity=0.195 Sum_probs=65.8
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEE-EEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCC-cEEEEEEeec
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSV-WVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMG-SKLRVEISRG 83 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v-~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g-~~l~V~~a~~ 83 (207)
.+++.+|++.|||.+++||+|+++|.+-+-+++. +.....+.+|++.+++.|+|..|+..|+.+.+.. ..|+|++++.
T Consensus 411 ~PpsatlHlsnip~svsee~lk~~f~~~g~~vkafkff~kd~kmal~q~~sveeA~~ali~~hnh~lgen~hlRvSFSks 490 (492)
T KOG1190|consen 411 FPPSATLHLSNIPPSVSEEDLKNLFQEPGGQVKAFKFFQKDRKMALPQLESVEEAIQALIDLHNHYLGENHHLRVSFSKS 490 (492)
T ss_pred CCchhheeeccCCcccchhHHHHhhhcCCceEEeeeecCCCcceeecccCChhHhhhhccccccccCCCCceEEEEeecc
Confidence 4678899999999999999999999998876554 3333246799999999999999999999999865 5899999875
No 100
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=98.23 E-value=3.7e-06 Score=53.66 Aligned_cols=52 Identities=25% Similarity=0.501 Sum_probs=43.3
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHH
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAAC 63 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al 63 (207)
+.|-|.+.+++..+..| ++|.+||+|+++.+... ..+.+|+|.++.+|++||
T Consensus 2 ~wI~V~Gf~~~~~~~vl-~~F~~fGeI~~~~~~~~-~~~~~l~y~~~~~ae~Al 53 (53)
T PF14605_consen 2 TWISVSGFPPDLAEEVL-EHFASFGEIVDIYVPES-TNWMYLKYKSRKDAEKAL 53 (53)
T ss_pred cEEEEEeECchHHHHHH-HHHHhcCCEEEEEcCCC-CcEEEEEECCHHHHHhhC
Confidence 56888899887775555 58889999999988743 679999999999999985
No 101
>KOG0105|consensus
Probab=98.21 E-value=1.7e-05 Score=62.79 Aligned_cols=64 Identities=25% Similarity=0.306 Sum_probs=58.5
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCC
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMG 73 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g 73 (207)
...+|.|..||+..++++||++..+.++|...++.+| ++..|+|..+|+++-||.+|+.+++..
T Consensus 114 Se~RVvVsGLp~SgSWQDLKDHmReaGdvCfadv~rD--g~GvV~~~r~eDMkYAvr~ld~~~~~s 177 (241)
T KOG0105|consen 114 SEYRVVVSGLPPSGSWQDLKDHMREAGDVCFADVQRD--GVGVVEYLRKEDMKYAVRKLDDQKFRS 177 (241)
T ss_pred cceeEEEecCCCCCchHHHHHHHHhhCCeeeeeeecc--cceeeeeeehhhHHHHHHhhccccccC
Confidence 4568999999999999999999999999999999987 689999999999999999999877643
No 102
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=98.17 E-value=1.1e-05 Score=58.26 Aligned_cols=74 Identities=19% Similarity=0.249 Sum_probs=52.6
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEE------------EeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcE
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVW------------VAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSK 75 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~------------i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~ 75 (207)
..+.|.|-+.|+..+ ..|.++|++||+|++.. -......+..|+|+++.+|++||.+ |+..|.+..
T Consensus 5 ~~~wVtVFGfp~~~~-~~Vl~~F~~~G~Ile~~~~~~~~~~~~~~~~~~~~NWi~I~Y~~~~~A~rAL~~-NG~i~~g~~ 82 (100)
T PF05172_consen 5 SETWVTVFGFPPSAS-NQVLRHFSSFGTILEHFEVLRSSSGINPYPIPSGGNWIHITYDNPLSAQRALQK-NGTIFSGSL 82 (100)
T ss_dssp GCCEEEEE---GGGH-HHHHHHHHCCS-EECEEGGG----------E-CCTTEEEEEESSHHHHHHHHTT-TTEEETTCE
T ss_pred CCeEEEEEccCHHHH-HHHHHHHHhcceEEEeecccccccccccccCCCCCCEEEEECCCHHHHHHHHHh-CCeEEcCcE
Confidence 456788889998854 56667899999998875 1222467999999999999999986 999998854
Q ss_pred -EEEEEeec
Q psy2347 76 -LRVEISRG 83 (207)
Q Consensus 76 -l~V~~a~~ 83 (207)
|-|.++++
T Consensus 83 mvGV~~~~~ 91 (100)
T PF05172_consen 83 MVGVKPCDP 91 (100)
T ss_dssp EEEEEE-HH
T ss_pred EEEEEEcHH
Confidence 55776644
No 103
>KOG2202|consensus
Probab=98.13 E-value=7e-07 Score=73.56 Aligned_cols=58 Identities=33% Similarity=0.489 Sum_probs=49.6
Q ss_pred HHHHHHH-hCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 25 DLELEFE-KFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 25 ~L~~~F~-~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
+|...|+ +|++|++++|..+ .++-++|.|..+|+|++|++.||+..+.+++|.+++..
T Consensus 84 d~f~E~~~kygEiee~~Vc~Nl~~hl~GNVYV~f~~Ee~ae~a~~~lnnRw~~G~pi~ae~~p 146 (260)
T KOG2202|consen 84 DVFTELEDKYGEIEELNVCDNLGDHLVGNVYVKFRSEEDAEAALEDLNNRWYNGRPIHAELSP 146 (260)
T ss_pred HHHHHHHHHhhhhhhhhhhcccchhhhhhhhhhcccHHHHHHHHHHHcCccccCCcceeeecC
Confidence 3333444 8999999987766 67899999999999999999999999999999998864
No 104
>KOG1548|consensus
Probab=98.10 E-value=2.2e-05 Score=67.31 Aligned_cols=78 Identities=31% Similarity=0.426 Sum_probs=63.2
Q ss_pred CCCCcEEEEcCC--Cc--CCc-------HHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCCceeCC
Q psy2347 6 MERGTRVYVGGL--TE--TVK-------KEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMNDQDLMG 73 (207)
Q Consensus 6 ~~~~~~lfV~nL--p~--~~t-------e~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g~~i~g 73 (207)
+...++|.|.|| |. ..+ +++|++-.++||.|.+|.|... +.+.+-|.|.+.++|+.||+.|++..+.+
T Consensus 262 ~r~~~tVi~kn~Ftp~~~~~~~~l~~dlkedl~eec~K~G~v~~vvv~d~hPdGvvtV~f~n~eeA~~ciq~m~GR~fdg 341 (382)
T KOG1548|consen 262 ARADRTVILKNMFTPEDFEKNPDLLNDLKEDLTEECEKFGQVRKVVVYDRHPDGVVTVSFRNNEEADQCIQTMDGRWFDG 341 (382)
T ss_pred ccCCcEEEeeecCCHHHhccCHHHHHHHHHHHHHHHHHhCCcceEEEeccCCCceeEEEeCChHHHHHHHHHhcCeeecc
Confidence 345678889987 22 222 3456666889999999987754 88999999999999999999999999999
Q ss_pred cEEEEEEeec
Q psy2347 74 SKLRVEISRG 83 (207)
Q Consensus 74 ~~l~V~~a~~ 83 (207)
++|..++..-
T Consensus 342 Rql~A~i~DG 351 (382)
T KOG1548|consen 342 RQLTASIWDG 351 (382)
T ss_pred eEEEEEEeCC
Confidence 9999987644
No 105
>KOG4307|consensus
Probab=98.08 E-value=9.2e-06 Score=75.07 Aligned_cols=74 Identities=22% Similarity=0.296 Sum_probs=62.7
Q ss_pred CCCc-EEEEcCCCcCCcHHHHHHHHHhCCCee-EEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 7 ERGT-RVYVGGLTETVKKEDLELEFEKFGKLN-SVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 7 ~~~~-~lfV~nLp~~~te~~L~~~F~~~G~V~-~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
.+.+ .|-+.|+|++++-+||.+||..|-.+. +|.|..+ ..+.|.|.|++.|+|.+|...|+++.|..++|+|.+
T Consensus 864 ~pGp~V~~~~n~Pf~v~l~dI~~FF~dY~~~p~sI~~r~nd~G~pTGe~mvAfes~~eAr~A~~dl~~~~i~nr~V~l~i 943 (944)
T KOG4307|consen 864 SPGPRVLSCNNFPFDVTLEDIVEFFNDYEPDPNSIRIRRNDDGVPTGECMVAFESQEEARRASMDLDGQKIRNRVVSLRI 943 (944)
T ss_pred CCCCeEEEecCCCccccHHHHHHHhcccccCCCceeEeecCCCCcccceeEeecCHHHHHhhhhccccCcccceeEEEEe
Confidence 3445 777899999999999999999997654 3444333 789999999999999999999999999999999875
No 106
>KOG0147|consensus
Probab=98.06 E-value=8.5e-06 Score=73.39 Aligned_cols=59 Identities=22% Similarity=0.315 Sum_probs=53.2
Q ss_pred HHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 24 EDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 24 ~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
++|++...+|+.|..|.|.+++.++.||.|.+.++|.+|+++||+.+|.++.|++.+-.
T Consensus 468 edV~Eec~k~g~v~hi~vd~ns~g~VYvrc~s~~~A~~a~~alhgrWF~gr~Ita~~~~ 526 (549)
T KOG0147|consen 468 EDVIEECGKHGKVCHIFVDKNSAGCVYVRCPSAEAAGTAVKALHGRWFAGRMITAKYLP 526 (549)
T ss_pred HHHHHHHHhcCCeeEEEEccCCCceEEEecCcHHHHHHHHHHHhhhhhccceeEEEEee
Confidence 45666678999999999988877999999999999999999999999999999998764
No 107
>PF08952 DUF1866: Domain of unknown function (DUF1866) ; InterPro: IPR015047 This domain, found in synaptojanin, has no known function. ; PDB: 1UFW_A 2DNR_A.
Probab=98.01 E-value=3.4e-05 Score=59.01 Aligned_cols=76 Identities=17% Similarity=0.252 Sum_probs=54.1
Q ss_pred CCCCCcEEEEcCCC------cCCcH---HHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcE
Q psy2347 5 MMERGTRVYVGGLT------ETVKK---EDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSK 75 (207)
Q Consensus 5 ~~~~~~~lfV~nLp------~~~te---~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~ 75 (207)
+-++..+|.|.-+. ....+ ++|.+.|.+||+|.-|+++.+ .-+|+|.+-++|.+|+. |++.+|+++.
T Consensus 23 ~GPpDaTVvVsv~~~~~~e~~~Fdd~l~~~ll~~~~~~GevvLvRfv~~---~mwVTF~dg~sALaals-~dg~~v~g~~ 98 (146)
T PF08952_consen 23 QGPPDATVVVSVDSPSEDEDSSFDDNLMDELLQKFAQYGEVVLVRFVGD---TMWVTFRDGQSALAALS-LDGIQVNGRT 98 (146)
T ss_dssp ---TT-EEEEEECS-SCCCCHS--HHHHHHHHHHHHCCS-ECEEEEETT---CEEEEESSCHHHHHHHH-GCCSEETTEE
T ss_pred cCCCCceEEEEecCCCccccCcCCHHHHHHHHHHHHhCCceEEEEEeCC---eEEEEECccHHHHHHHc-cCCcEECCEE
Confidence 34566778776554 12332 367788899999998888865 78999999999999998 6999999999
Q ss_pred EEEEEeecC
Q psy2347 76 LRVEISRGR 84 (207)
Q Consensus 76 l~V~~a~~k 84 (207)
|+|++.+|.
T Consensus 99 l~i~LKtpd 107 (146)
T PF08952_consen 99 LKIRLKTPD 107 (146)
T ss_dssp EEEEE----
T ss_pred EEEEeCCcc
Confidence 999998764
No 108
>KOG4210|consensus
Probab=98.01 E-value=3.3e-06 Score=71.96 Aligned_cols=79 Identities=27% Similarity=0.479 Sum_probs=70.1
Q ss_pred CCcEEE-EcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 8 RGTRVY-VGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 8 ~~~~lf-V~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
+..++| |.+|+..+++++|+++|..++.|+.++++.+ +++|++|+|.+...+++++.. +...|.+++|.|...
T Consensus 183 ~s~~~~~~~~~~f~~~~d~~~~~~~~~~~i~~~r~~~~~~s~~~kg~a~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 261 (285)
T KOG4210|consen 183 PSDTIFFVGELDFSLTRDDLKEHFVSSGEITSVRLPTDEESGDSKGFAYVDFSAGNSKKLALND-QTRSIGGRPLRLEED 261 (285)
T ss_pred ccccceeecccccccchHHHhhhccCcCcceeeccCCCCCccchhhhhhhhhhhchhHHHHhhc-ccCcccCcccccccC
Confidence 344555 9999999999999999999999999999887 789999999999999999987 788899999999988
Q ss_pred ecCCCC
Q psy2347 82 RGRGRG 87 (207)
Q Consensus 82 ~~k~~~ 87 (207)
++++..
T Consensus 262 ~~~~~~ 267 (285)
T KOG4210|consen 262 EPRPKS 267 (285)
T ss_pred CCCccc
Confidence 887654
No 109
>KOG0129|consensus
Probab=97.99 E-value=2.7e-05 Score=69.86 Aligned_cols=63 Identities=30% Similarity=0.430 Sum_probs=57.7
Q ss_pred CCCCCCCcEEEEcCCCcCCcHHHHHHHHH-hCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHh
Q psy2347 3 DGMMERGTRVYVGGLTETVKKEDLELEFE-KFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 3 ~~~~~~~~~lfV~nLp~~~te~~L~~~F~-~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~ 65 (207)
+..+++.+||||+.||.-++.+||..+|+ .||.|+.+-|-+| +++-+=|+|++..+-.+||++
T Consensus 364 sq~lDprrTVFVGgvprpl~A~eLA~imd~lyGgV~yaGIDtD~k~KYPkGaGRVtFsnqqsYi~AIsa 432 (520)
T KOG0129|consen 364 NQPIDPRRTVFVGGLPRPLTAEELAMIMEDLFGGVLYVGIDTDPKLKYPKGAGRVTFSNQQAYIKAISA 432 (520)
T ss_pred CcccCccceEEecCCCCcchHHHHHHHHHHhcCceEEEEeccCcccCCCCCcceeeecccHHHHHHHhh
Confidence 45678899999999999999999999999 5999999988887 789999999999999999985
No 110
>KOG0129|consensus
Probab=97.96 E-value=2.8e-05 Score=69.72 Aligned_cols=59 Identities=22% Similarity=0.485 Sum_probs=48.1
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------CCc---EEEEEEcCHHHHHHHHHhc
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--------PPG---FAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------~rG---~afV~F~~~e~A~~Al~~l 66 (207)
.-+++|||+.||++++|++|.+.|..||.|. |+.... +++ |+|+.|+++.++++.|.++
T Consensus 257 ~~S~KVFvGGlp~dise~~i~~~F~~FGs~~-VdWP~k~~~~~~~ppkGs~~YvflvFe~E~sV~~Ll~aC 326 (520)
T KOG0129|consen 257 RYSRKVFVGGLPWDITEAQINASFGQFGSVK-VDWPGKANSRGRAPPKGSYGYVFLVFEDERSVQSLLSAC 326 (520)
T ss_pred ccccceeecCCCccccHHHHHhhcccccceE-eecCCCccccccCCCCCcccEEEEEecchHHHHHHHHHH
Confidence 3468999999999999999999999999864 333211 677 9999999999998877654
No 111
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=97.94 E-value=2.2e-05 Score=67.26 Aligned_cols=73 Identities=25% Similarity=0.408 Sum_probs=60.1
Q ss_pred cEEEEcCCCcCCcHHH----H--HHHHHhCCCeeEEEEeeC------CCcEE--EEEEcCHHHHHHHHHhcCCceeCCcE
Q psy2347 10 TRVYVGGLTETVKKED----L--ELEFEKFGKLNSVWVAFN------PPGFA--FVEFSNQIDAEAACDSMNDQDLMGSK 75 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~----L--~~~F~~~G~V~~v~i~~d------~rG~a--fV~F~~~e~A~~Al~~l~g~~i~g~~ 75 (207)
.-|||-.||+.+..++ | .++|.+||.|++|.|.+. ....+ ||+|.+.|+|.+||.++++..++++.
T Consensus 115 NLvYVigi~pkva~Ee~~~vLk~~eyFGQyGkI~KIvvNkkt~s~nst~~h~gvYITy~~kedAarcIa~vDgs~~DGr~ 194 (480)
T COG5175 115 NLVYVIGIPPKVADEEVAPVLKRHEYFGQYGKIKKIVVNKKTSSLNSTASHAGVYITYSTKEDAARCIAEVDGSLLDGRV 194 (480)
T ss_pred ceeEEecCCCCCCcccccccccchhhhhhccceeEEEecccccccccccccceEEEEecchHHHHHHHHHhccccccCce
Confidence 4678888988876665 3 378999999999988765 12233 99999999999999999999999999
Q ss_pred EEEEEee
Q psy2347 76 LRVEISR 82 (207)
Q Consensus 76 l~V~~a~ 82 (207)
|++.+-+
T Consensus 195 lkatYGT 201 (480)
T COG5175 195 LKATYGT 201 (480)
T ss_pred EeeecCc
Confidence 9988754
No 112
>KOG4849|consensus
Probab=97.85 E-value=1.4e-05 Score=68.72 Aligned_cols=71 Identities=17% Similarity=0.232 Sum_probs=58.4
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCC--eeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGK--LNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~--V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
...+||+||-|.+|++||.+.+...|. |.++++..+ +|+||+|...+..++++.++.|..++|.++.-+|-
T Consensus 80 k~~~YvGNL~W~TTD~DL~~A~~S~G~~~~~dmKFFENR~NGQSKG~AL~~~~SdAa~Kq~MeiLP~k~iHGQ~P~V~ 157 (498)
T KOG4849|consen 80 KYCCYVGNLLWYTTDADLLKALQSTGLAQFADMKFFENRTNGQSKGYALLVLNSDAAVKQTMEILPTKTIHGQSPTVL 157 (498)
T ss_pred eEEEEecceeEEeccHHHHHHHHhhhHHHHhhhhhhhcccCCcccceEEEEecchHHHHHHHHhcccceecCCCCeee
Confidence 357899999999999999998877653 444444443 89999999999999999999999999988765554
No 113
>KOG0120|consensus
Probab=97.81 E-value=5.7e-05 Score=68.43 Aligned_cols=57 Identities=23% Similarity=0.363 Sum_probs=48.9
Q ss_pred HHHHHHhCCCeeEEEEeeC--------CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 26 LELEFEKFGKLNSVWVAFN--------PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 26 L~~~F~~~G~V~~v~i~~d--------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
|+.-+.+|+.|..|+|..+ ..+..||+|.+.+++++|+++|++.++.+++|.+.+-.
T Consensus 426 vr~ec~k~g~v~~v~ipr~~~~~~~~~G~GkVFVefas~ed~qrA~~~L~GrKF~nRtVvtsYyd 490 (500)
T KOG0120|consen 426 VRTECAKFGAVRSVEIPRPYPDENPVPGTGKVFVEFADTEDSQRAMEELTGRKFANRTVVASYYD 490 (500)
T ss_pred HHHHhcccCceeEEecCCCCCCCCcCCCcccEEEEecChHHHHHHHHHccCceeCCcEEEEEecC
Confidence 3344568899999999876 45789999999999999999999999999999888754
No 114
>KOG1365|consensus
Probab=97.80 E-value=3.1e-05 Score=67.36 Aligned_cols=75 Identities=23% Similarity=0.251 Sum_probs=63.9
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCC-eeE--EEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGK-LNS--VWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~-V~~--v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
..+|-+.+||.+.+.++|.+||..|-. |.. |+++.+ +.+.|||+|.++|+|.+|+.+.+.+..+.+-|+|..+
T Consensus 280 kdcvRLRGLPy~AtvEdIL~FlgdFa~~i~f~gVHmv~N~qGrPSGeAFIqm~nae~a~aaaqk~hk~~mk~RYiEvfp~ 359 (508)
T KOG1365|consen 280 KDCVRLRGLPYEATVEDILDFLGDFATDIRFQGVHMVLNGQGRPSGEAFIQMRNAERARAAAQKCHKKLMKSRYIEVFPC 359 (508)
T ss_pred CCeeEecCCChhhhHHHHHHHHHHHhhhcccceeEEEEcCCCCcChhhhhhhhhhHHHHHHHHHHHHhhcccceEEEeec
Confidence 557889999999999999999999865 333 666666 7899999999999999999988888888999998876
Q ss_pred ec
Q psy2347 82 RG 83 (207)
Q Consensus 82 ~~ 83 (207)
..
T Consensus 360 S~ 361 (508)
T KOG1365|consen 360 SV 361 (508)
T ss_pred cH
Confidence 53
No 115
>KOG2314|consensus
Probab=97.72 E-value=8.6e-05 Score=67.38 Aligned_cols=74 Identities=27% Similarity=0.376 Sum_probs=60.4
Q ss_pred CCcEEEEcCCCcCCc------HHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeC-CcEE
Q psy2347 8 RGTRVYVGGLTETVK------KEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLM-GSKL 76 (207)
Q Consensus 8 ~~~~lfV~nLp~~~t------e~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~-g~~l 76 (207)
-...|+|-++|.--. ...|.++|+++++|+.+.++.+ .++++|++|.+..+|++|++.||++.|+ .+++
T Consensus 57 ~D~vVvv~g~PvV~~~rl~klk~vl~kvfsk~gk~vn~~~P~~e~ggtkG~lf~E~~~~~~A~~aVK~l~G~~ldknHtf 136 (698)
T KOG2314|consen 57 FDSVVVVDGAPVVGPARLEKLKKVLTKVFSKAGKIVNMYYPIDEEGGTKGYLFVEYASMRDAKKAVKSLNGKRLDKNHTF 136 (698)
T ss_pred cceEEEECCCcccChhHHHHHHHHHHHHHHhhccccceeeccCccCCeeeEEEEEecChhhHHHHHHhcccceecccceE
Confidence 356889999985321 2356789999999999999876 7899999999999999999999999874 6777
Q ss_pred EEEEe
Q psy2347 77 RVEIS 81 (207)
Q Consensus 77 ~V~~a 81 (207)
.|..-
T Consensus 137 ~v~~f 141 (698)
T KOG2314|consen 137 FVRLF 141 (698)
T ss_pred Eeehh
Confidence 77653
No 116
>KOG3152|consensus
Probab=97.71 E-value=2.5e-05 Score=64.52 Aligned_cols=68 Identities=18% Similarity=0.313 Sum_probs=59.1
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--C---------------CcEEEEEEcCHHHHHHHHHhcCCce
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--P---------------PGFAFVEFSNQIDAEAACDSMNDQD 70 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--~---------------rG~afV~F~~~e~A~~Al~~l~g~~ 70 (207)
..-.|||++||+.+....|+++|+.||+|-.|.|... + --.++|||.+...|+++.+.||+..
T Consensus 73 k~GVvylS~IPp~m~~~rlReil~~yGeVGRvylqpE~~s~~~~r~~~~~n~~~~y~EGWvEF~~KrvAK~iAe~Lnn~~ 152 (278)
T KOG3152|consen 73 KTGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQPEDDSKRAARKRKGGNYKKLYSEGWVEFISKRVAKRIAELLNNTP 152 (278)
T ss_pred cceEEEeccCCCccCHHHHHHHHHhccccceEEecchhhHHHHHHhhcCCCccccchhHHHHHHHHHHHHHHHHHhCCCc
Confidence 4568999999999999999999999999999988654 1 1236799999999999999999999
Q ss_pred eCCcE
Q psy2347 71 LMGSK 75 (207)
Q Consensus 71 i~g~~ 75 (207)
|.+++
T Consensus 153 Iggkk 157 (278)
T KOG3152|consen 153 IGGKK 157 (278)
T ss_pred cCCCC
Confidence 98765
No 117
>KOG2416|consensus
Probab=97.62 E-value=5.8e-05 Score=68.82 Aligned_cols=77 Identities=13% Similarity=0.172 Sum_probs=63.3
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHh-CCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCcee---CCcEEEEEEe
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEK-FGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDL---MGSKLRVEIS 81 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~-~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i---~g~~l~V~~a 81 (207)
...+..|+|.||-..+|.-+|++++.. .+.|+++ +|..-|..|||.|.+.++|.+.+++||+..+ +.+.|.|.++
T Consensus 441 ~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~-WmDkIKShCyV~yss~eEA~atr~AlhnV~WP~sNPK~L~adf~ 519 (718)
T KOG2416|consen 441 KEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEF-WMDKIKSHCYVSYSSVEEAAATREALHNVQWPPSNPKHLIADFV 519 (718)
T ss_pred CCccceEeeecccccchHHHHHHHHhhccCchHHH-HHHHhhcceeEecccHHHHHHHHHHHhccccCCCCCceeEeeec
Confidence 346789999999999999999999995 5666666 3333477999999999999999999999876 6688888887
Q ss_pred ec
Q psy2347 82 RG 83 (207)
Q Consensus 82 ~~ 83 (207)
..
T Consensus 520 ~~ 521 (718)
T KOG2416|consen 520 RA 521 (718)
T ss_pred ch
Confidence 53
No 118
>PF10309 DUF2414: Protein of unknown function (DUF2414); InterPro: IPR019416 This entry contains proteins that have no known function.
Probab=97.61 E-value=0.00041 Score=45.54 Aligned_cols=56 Identities=16% Similarity=0.178 Sum_probs=45.8
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhC---CCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKF---GKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~---G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l 66 (207)
...+|+|.+|. +++.++|+.+|..| .....|+.+.| --|-|.|.+++.|.+||.+|
T Consensus 4 rpeavhirGvd-~lsT~dI~~y~~~y~~~~~~~~IEWIdD--tScNvvf~d~~~A~~AL~~L 62 (62)
T PF10309_consen 4 RPEAVHIRGVD-ELSTDDIKAYFSEYFDEEGPFRIEWIDD--TSCNVVFKDEETAARALVAL 62 (62)
T ss_pred eeceEEEEcCC-CCCHHHHHHHHHHhcccCCCceEEEecC--CcEEEEECCHHHHHHHHHcC
Confidence 34589999985 58889999999998 23567888887 35789999999999999865
No 119
>KOG4676|consensus
Probab=97.44 E-value=0.00022 Score=62.19 Aligned_cols=70 Identities=16% Similarity=0.267 Sum_probs=58.2
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--------PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
..|.|.||.+.+|.++++.||.-.|+|.++.|+.. ..-.|||.|.+.+.+..|.. |.+.++-++.|.|..
T Consensus 8 ~vIqvanispsat~dqm~tlFg~lGkI~elrlyp~~~d~~~pv~sRtcyVkf~d~~sv~vaQh-Ltntvfvdraliv~p 85 (479)
T KOG4676|consen 8 GVIQVANISPSATKDQMQTLFGNLGKIPELRLYPNVDDSKIPVISRTCYVKFLDSQSVTVAQH-LTNTVFVDRALIVRP 85 (479)
T ss_pred ceeeecccCchhhHHHHHHHHhhccccccccccCCCCCccCcceeeeEEEeccCCcceeHHhh-hccceeeeeeEEEEe
Confidence 38999999999999999999999999999988764 34689999999999998887 566666666555543
No 120
>KOG0128|consensus
Probab=97.33 E-value=7.7e-05 Score=70.43 Aligned_cols=76 Identities=22% Similarity=0.234 Sum_probs=68.2
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
..+|||.++|+..|.++|+.++.+++.++++.+++. +++.|+|.|.++.++.+++..++...+.-..+.|.++.|.
T Consensus 736 K~~v~i~g~pf~gt~e~~k~l~~~~gn~~~~~~vt~r~gkpkg~a~v~y~~ea~~s~~~~s~d~~~~rE~~~~v~vsnp~ 815 (881)
T KOG0128|consen 736 KISVAISGPPFQGTKEELKSLASKTGNVTSLRLVTVRAGKPKGKARVDYNTEADASRKVASVDVAGKRENNGEVQVSNPE 815 (881)
T ss_pred hhhhheeCCCCCCchHHHHhhccccCCccccchhhhhccccccceeccCCCcchhhhhcccchhhhhhhcCccccccCCc
Confidence 468999999999999999999999999999887766 7899999999999999999988888888888888887663
No 121
>KOG1996|consensus
Probab=97.33 E-value=0.00071 Score=57.17 Aligned_cols=60 Identities=23% Similarity=0.242 Sum_probs=51.1
Q ss_pred HHHHHHHHHhCCCeeEEEEeeC------CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFN------PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
|+++++..++|+.|.+|.|..+ ..--.||+|+..++|.+|+..||+..|.++.+...+-.
T Consensus 300 ede~keEceKyg~V~~viifeip~~p~deavRiFveF~r~e~aiKA~VdlnGRyFGGr~v~A~Fyn 365 (378)
T KOG1996|consen 300 EDETKEECEKYGKVGNVIIFEIPSQPEDEAVRIFVEFERVESAIKAVVDLNGRYFGGRVVSACFYN 365 (378)
T ss_pred HHHHHHHHHhhcceeeEEEEecCCCccchhheeeeeeccHHHHHHHHHhcCCceecceeeeheecc
Confidence 4578888999999999988766 23457999999999999999999999999999877653
No 122
>PF08675 RNA_bind: RNA binding domain; InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=97.27 E-value=0.00086 Score=46.49 Aligned_cols=59 Identities=20% Similarity=0.262 Sum_probs=42.8
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.......+|. +|.++...||.++|+.||.|. |..+.| --|||...+.+.|+.|+..++-
T Consensus 6 P~RdHVFhlt-FPkeWK~~DI~qlFspfG~I~-VsWi~d--TSAfV~l~~r~~~~~v~~~~~~ 64 (87)
T PF08675_consen 6 PSRDHVFHLT-FPKEWKTSDIYQLFSPFGQIY-VSWIND--TSAFVALHNRDQAKVVMNTLKK 64 (87)
T ss_dssp -SGCCEEEEE---TT--HHHHHHHCCCCCCEE-EEEECT--TEEEEEECCCHHHHHHHHHHTT
T ss_pred CCcceEEEEe-CchHhhhhhHHHHhccCCcEE-EEEEcC--CcEEEEeecHHHHHHHHHHhcc
Confidence 3445566666 999999999999999999875 444443 4899999999999999988753
No 123
>KOG2591|consensus
Probab=97.23 E-value=0.00041 Score=63.01 Aligned_cols=74 Identities=20% Similarity=0.242 Sum_probs=57.5
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHh--CCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCC--ceeCCcEEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEK--FGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMND--QDLMGSKLRVEI 80 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~--~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g--~~i~g~~l~V~~ 80 (207)
+..+.|.|.|..||+.+.+|+||.||+. +-++++|++..+ .-=||+|++.++|++|.+.|.. ++|.+|.|...+
T Consensus 171 p~~kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa~N--~nWyITfesd~DAQqAykylreevk~fqgKpImARI 248 (684)
T KOG2591|consen 171 PNHKRCIVILREIPETTPIEVVKALFKGENCPKVISCEFAHN--DNWYITFESDTDAQQAYKYLREEVKTFQGKPIMARI 248 (684)
T ss_pred cCcceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeeeec--CceEEEeecchhHHHHHHHHHHHHHhhcCcchhhhh
Confidence 3456788889999999999999999975 667888877654 2358999999999999887653 356677665544
No 124
>KOG0112|consensus
Probab=97.22 E-value=0.00044 Score=65.85 Aligned_cols=79 Identities=25% Similarity=0.410 Sum_probs=69.9
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCC--cEEEEEEee
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMG--SKLRVEISR 82 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g--~~l~V~~a~ 82 (207)
...+.+.|+|+.|..++....|...|..||.|..|.+-.. ..||+|.+++...|+.|++.|.+..|.+ +.|.|.+|.
T Consensus 451 kst~ttr~~sgglg~w~p~~~l~r~fd~fGpir~Idy~hg-q~yayi~yes~~~aq~a~~~~rgap~G~P~~r~rvdla~ 529 (975)
T KOG0112|consen 451 KSTPTTRLQSGGLGPWSPVSRLNREFDRFGPIRIIDYRHG-QPYAYIQYESPPAAQAATHDMRGAPLGGPPRRLRVDLAS 529 (975)
T ss_pred ccccceeeccCCCCCCChHHHHHHHhhccCcceeeecccC-CcceeeecccCccchhhHHHHhcCcCCCCCccccccccc
Confidence 3457789999999999999999999999999999988765 6799999999999999999999988864 679999987
Q ss_pred cC
Q psy2347 83 GR 84 (207)
Q Consensus 83 ~k 84 (207)
+.
T Consensus 530 ~~ 531 (975)
T KOG0112|consen 530 PP 531 (975)
T ss_pred CC
Confidence 53
No 125
>KOG0921|consensus
Probab=97.19 E-value=0.0072 Score=58.30 Aligned_cols=13 Identities=62% Similarity=1.071 Sum_probs=5.5
Q ss_pred CCCCCCCCCCCCC
Q psy2347 183 GGGRGGGGRFRSR 195 (207)
Q Consensus 183 ~g~~~~g~~~~~~ 195 (207)
+|+++++...+.|
T Consensus 1267 gGgfg~G~~Gg~~ 1279 (1282)
T KOG0921|consen 1267 GGGFGGGGRGGNA 1279 (1282)
T ss_pred CCCCCCCCccccc
Confidence 4444444444333
No 126
>PF03880 DbpA: DbpA RNA binding domain ; InterPro: IPR005580 This RNA binding domain is found at the C terminus of a number of DEAD helicase proteins [].; PDB: 2G0C_A 3MOJ_B.
Probab=97.18 E-value=0.0027 Score=43.10 Aligned_cols=67 Identities=24% Similarity=0.387 Sum_probs=41.6
Q ss_pred EEEEc-CCCcCCcHHHHHHHHHhCCC-----eeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 11 RVYVG-GLTETVKKEDLELEFEKFGK-----LNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 11 ~lfV~-nLp~~~te~~L~~~F~~~G~-----V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
+|||. +-...++..+|..++..... |-.|+|..+ |+||+-.. +.|++++++|+...+.+++|+|+.|
T Consensus 2 rl~in~Gr~dg~~~~~iv~~i~~~~gi~~~~IG~I~I~~~---~S~vev~~-~~a~~v~~~l~~~~~~gk~v~ve~A 74 (74)
T PF03880_consen 2 RLFINVGRKDGLTPRDIVGAICNEAGIPGRDIGRIDIFDN---FSFVEVPE-EVAEKVLEALNGKKIKGKKVRVERA 74 (74)
T ss_dssp EEEES-SGGGT--HHHHHHHHHTCTTB-GGGEEEEEE-SS----EEEEE-T-T-HHHHHHHHTT--SSS----EEE-
T ss_pred EEEEEcccccCCCHHHHHHHHHhccCCCHHhEEEEEEeee---EEEEEECH-HHHHHHHHHhcCCCCCCeeEEEEEC
Confidence 45662 12456889999999988754 557777755 99999887 5788999999999999999999875
No 127
>KOG2193|consensus
Probab=97.09 E-value=0.00039 Score=61.31 Aligned_cols=76 Identities=28% Similarity=0.406 Sum_probs=60.4
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCc-eeCCcEEEEEEeecCCC
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQ-DLMGSKLRVEISRGRGR 86 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~-~i~g~~l~V~~a~~k~~ 86 (207)
.+|||.||.+.++.++|+.+|...-.-..-.++. ..+++||.+.+..-|.+|++.++++ ++.++++.|+.+-++..
T Consensus 2 nklyignL~p~~~psdl~svfg~ak~~~~g~fl~-k~gyafvd~pdq~wa~kaie~~sgk~elqGkr~e~~~sv~kkq 78 (584)
T KOG2193|consen 2 NKLYIGNLSPQVTPSDLESVFGDAKIPGSGQFLV-KSGYAFVDCPDQQWANKAIETLSGKVELQGKRQEVEHSVPKKQ 78 (584)
T ss_pred CcccccccCCCCChHHHHHHhccccCCCCcceee-ecceeeccCCchhhhhhhHHhhchhhhhcCceeeccchhhHHH
Confidence 4799999999999999999997652211212222 3689999999999999999999875 58999999998876643
No 128
>KOG4307|consensus
Probab=97.08 E-value=0.00029 Score=65.46 Aligned_cols=75 Identities=16% Similarity=0.030 Sum_probs=60.9
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeE-EEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNS-VWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~-v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
.....|||..||..+++.++.++|+..-.|++ |.|... .+..|||+|..++++.+|+..-+.+.+..+.|+|.-.
T Consensus 432 ~ag~~lyv~~lP~~t~~~~~v~~f~~~~~Ved~I~lt~~P~~~~~~~afv~F~~~~a~~~a~~~~~k~y~G~r~irv~si 511 (944)
T KOG4307|consen 432 GAGGALYVFQLPVMTPIVPPVNKFMGAAAVEDFIELTRLPTDLLRPAAFVAFIHPTAPLTASSVKTKFYPGHRIIRVDSI 511 (944)
T ss_pred CccceEEeccCCccccccchhhhhhhhhhhhheeEeccCCcccccchhhheeccccccchhhhcccccccCceEEEeech
Confidence 34679999999999999999999998777777 555433 6789999999988888888766667777888888754
No 129
>KOG1365|consensus
Probab=97.02 E-value=0.00056 Score=59.67 Aligned_cols=70 Identities=23% Similarity=0.166 Sum_probs=51.9
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCC----eeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGK----LNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~----V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
-.|-+.+||+++|+.++.+||.+.-. ++.|.+++. ..+-|||.|..+++|++||.+ |...|.-+-|+|..
T Consensus 162 vivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~rpdgrpTGdAFvlfa~ee~aq~aL~k-hrq~iGqRYIElFR 239 (508)
T KOG1365|consen 162 VIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTRPDGRPTGDAFVLFACEEDAQFALRK-HRQNIGQRYIELFR 239 (508)
T ss_pred eEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEECCCCCcccceEEEecCHHHHHHHHHH-HHHHHhHHHHHHHH
Confidence 35667899999999999999964322 334444433 679999999999999999987 66666655555543
No 130
>PF07576 BRAP2: BRCA1-associated protein 2; InterPro: IPR011422 These proteins include BRCA1-associated protein 2 (BRAP2), which binds nuclear localisation signals (NLSs) in vitro and in yeast two-hybrid screening []. These proteins share a region of sequence similarity at their N terminus. They also have IPR001607 from INTERPRO at the C terminus.
Probab=96.99 E-value=0.0061 Score=44.71 Aligned_cols=68 Identities=12% Similarity=0.158 Sum_probs=49.3
Q ss_pred CCCCcEEEEcCCCc-CCcHHHHHHHHHhCCC-eeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCCceeCC
Q psy2347 6 MERGTRVYVGGLTE-TVKKEDLELEFEKFGK-LNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMNDQDLMG 73 (207)
Q Consensus 6 ~~~~~~lfV~nLp~-~~te~~L~~~F~~~G~-V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g~~i~g 73 (207)
....++|.|--+|+ .++.++|..+.+.+-+ |+.++|++| ++-.++|+|.+.++|++..+.+|++.++.
T Consensus 9 ~~~~~~~~~l~vp~~~~~~d~l~~f~~~~~~~i~~~riird~~pnrymVLikF~~~~~Ad~Fy~~fNGk~Fns 81 (110)
T PF07576_consen 9 DERRSTLCCLAVPPYMTPSDFLLFFGAPFREDIEHIRIIRDGTPNRYMVLIKFRDQESADEFYEEFNGKPFNS 81 (110)
T ss_pred CCCCceEEEEEeCcccccHHHHHHhhhcccccEEEEEEeeCCCCceEEEEEEECCHHHHHHHHHHhCCCccCC
Confidence 33445555544444 4555667666666544 778899998 55678999999999999999999988753
No 131
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=96.79 E-value=0.0013 Score=52.28 Aligned_cols=75 Identities=17% Similarity=0.221 Sum_probs=48.2
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHh-CCCe---eEEEEeeC-------CCcEEEEEEcCHHHHHHHHHhcCCceeCCc-
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEK-FGKL---NSVWVAFN-------PPGFAFVEFSNQIDAEAACDSMNDQDLMGS- 74 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~-~G~V---~~v~i~~d-------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~- 74 (207)
....+|.|.+||+.+||+++++.++. +... ..+.-... ...-|||.|.+.+++...+..++++.+.+.
T Consensus 5 ~~~~KvVIR~LPP~LteeeF~~~i~~~l~~~~~w~y~~g~~~~~~~~~~~~SRaYi~F~~~~~~~~F~~~~~g~~F~D~k 84 (176)
T PF03467_consen 5 KEGTKVVIRRLPPNLTEEEFWEQISPWLPDEWDWYYFQGKYGKKSFKPPTYSRAYINFKNPEDLLEFRDRFDGHVFVDSK 84 (176)
T ss_dssp ----EEEEEEE-TTS-HHHHCCCCSS--SSE---EEEEEEES-SSSTTS--EEEEEEESSCHHHHHHHHHCTTEEEE-TT
T ss_pred ccCceEEEeCCCCCCCHHHHHHHhhhhcccccceEEEecCCCCccCCCCcceEEEEEeCCHHHHHHHHHhcCCcEEECCC
Confidence 34569999999999999999986665 4444 23321122 235699999999999999999999887432
Q ss_pred ----EEEEEEe
Q psy2347 75 ----KLRVEIS 81 (207)
Q Consensus 75 ----~l~V~~a 81 (207)
...|++|
T Consensus 85 g~~~~~~VE~A 95 (176)
T PF03467_consen 85 GNEYPAVVEFA 95 (176)
T ss_dssp S-EEEEEEEE-
T ss_pred CCCcceeEEEc
Confidence 3455555
No 132
>PF15023 DUF4523: Protein of unknown function (DUF4523)
Probab=96.76 E-value=0.011 Score=45.18 Aligned_cols=74 Identities=16% Similarity=0.161 Sum_probs=54.5
Q ss_pred CCCCcEEEEcCCCcCC----cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 6 MERGTRVYVGGLTETV----KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~----te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
.++-.+|.|.=|..++ +.+.|...++.||+|++|.++- +--|.|+|+|..+|=+|+.+++. ..-+..+.+.|-
T Consensus 83 epPMsTIVVRWlkknm~~~edl~sV~~~Ls~fGpI~SVT~cG--rqsavVvF~d~~SAC~Av~Af~s-~~pgtm~qCsWq 159 (166)
T PF15023_consen 83 EPPMSTIVVRWLKKNMQPTEDLKSVIQRLSVFGPIQSVTLCG--RQSAVVVFKDITSACKAVSAFQS-RAPGTMFQCSWQ 159 (166)
T ss_pred CCCceeEEeehhhhcCChHHHHHHHHHHHHhcCCcceeeecC--CceEEEEehhhHHHHHHHHhhcC-CCCCceEEeecc
Confidence 3455677776555544 3344556678899999997763 56899999999999999998765 556777777765
Q ss_pred e
Q psy2347 82 R 82 (207)
Q Consensus 82 ~ 82 (207)
.
T Consensus 160 q 160 (166)
T PF15023_consen 160 Q 160 (166)
T ss_pred c
Confidence 3
No 133
>KOG2135|consensus
Probab=96.67 E-value=0.001 Score=59.39 Aligned_cols=75 Identities=16% Similarity=0.213 Sum_probs=59.7
Q ss_pred CCcEEEEcCCCcCC-cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 8 RGTRVYVGGLTETV-KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 8 ~~~~lfV~nLp~~~-te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
..+.|-|.-+|..+ |.++|..+|.+||+|+.|.|-.. .-.|.|+|.+..+|-+|.. .+...|+++.|+|.|-++.
T Consensus 371 dhs~l~lek~~~glnt~a~ln~hfA~fG~i~n~qv~~~-~~~a~vTF~t~aeag~a~~-s~~avlnnr~iKl~whnps 446 (526)
T KOG2135|consen 371 DHSPLALEKSPFGLNTIADLNPHFAQFGEIENIQVDYS-SLHAVVTFKTRAEAGEAYA-SHGAVLNNRFIKLFWHNPS 446 (526)
T ss_pred ccchhhhhccCCCCchHhhhhhhhhhcCccccccccCc-hhhheeeeeccccccchhc-cccceecCceeEEEEecCC
Confidence 34445555556554 56899999999999999988654 4589999999999977765 4889999999999998873
No 134
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=96.66 E-value=0.0084 Score=47.95 Aligned_cols=62 Identities=18% Similarity=0.133 Sum_probs=46.7
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC--CceeCCcEEEEEEeecC
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN--DQDLMGSKLRVEISRGR 84 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~--g~~i~g~~l~V~~a~~k 84 (207)
..+.|+++|.++..+..+.+++. -.-..|.|.+.++|++|...|+ ...+.+..|+|.++.+.
T Consensus 8 ~~~~l~~l~~~~~~~~~~~~L~s-FrRi~v~f~~~~~A~~~r~~l~~~~~~~~g~~l~~yf~~~~ 71 (184)
T PF04847_consen 8 NLAELEELFSTYDPPVQFSPLKS-FRRIRVVFESPESAQRARQLLHWDGTSFNGKRLRVYFGQPT 71 (184)
T ss_dssp -HHHHHHHHHTT-SS-EEEEETT-TTEEEEE-SSTTHHHHHHHTST--TSEETTEE-EEE----S
T ss_pred hHHHHHHHHHhcCCceEEEEcCC-CCEEEEEeCCHHHHHHHHHHhcccccccCCCceEEEEcccc
Confidence 45789999999999988877776 3468899999999999999999 88999999999988543
No 135
>KOG0115|consensus
Probab=96.61 E-value=0.002 Score=53.46 Aligned_cols=72 Identities=24% Similarity=0.367 Sum_probs=58.0
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCc----eeCCcEEEEEEe
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQ----DLMGSKLRVEIS 81 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~----~i~g~~l~V~~a 81 (207)
..|+|.||+.-++.+.|++.|+.|++|+...++.| ...-.+|+|..+..|.+|+..++-. ++..+.+-|...
T Consensus 32 a~l~V~nl~~~~sndll~~~f~~fg~~e~av~~vD~r~k~t~eg~v~~~~k~~a~~a~rr~~~~g~~~~~~~~p~~VeP~ 111 (275)
T KOG0115|consen 32 AELYVVNLMQGASNDLLEQAFRRFGPIERAVAKVDDRGKPTREGIVEFAKKPNARKAARRCREGGFGGTTGGRPVGVEPM 111 (275)
T ss_pred ceEEEEecchhhhhHHHHHhhhhcCccchheeeecccccccccchhhhhcchhHHHHHHHhccCccccCCCCCccCCChh
Confidence 68999999999999999999999999998777666 5678899999999999999876433 234455555433
No 136
>KOG4285|consensus
Probab=96.52 E-value=0.015 Score=49.44 Aligned_cols=71 Identities=17% Similarity=0.168 Sum_probs=52.3
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcE-EEEEEeec
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSK-LRVEISRG 83 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~-l~V~~a~~ 83 (207)
.=|-|-.+|+..+ ..|..+|++||+|++.... ..-.|-+|.|.+..+|+|||.+ |++.|++.. |-|+.+..
T Consensus 198 ~WVTVfGFppg~~-s~vL~~F~~cG~Vvkhv~~-~ngNwMhirYssr~~A~KALsk-ng~ii~g~vmiGVkpCtD 269 (350)
T KOG4285|consen 198 TWVTVFGFPPGQV-SIVLNLFSRCGEVVKHVTP-SNGNWMHIRYSSRTHAQKALSK-NGTIIDGDVMIGVKPCTD 269 (350)
T ss_pred ceEEEeccCccch-hHHHHHHHhhCeeeeeecC-CCCceEEEEecchhHHHHhhhh-cCeeeccceEEeeeecCC
Confidence 3455556766543 4566789999999887655 3467999999999999999987 888887654 45555443
No 137
>KOG0112|consensus
Probab=96.48 E-value=0.00056 Score=65.15 Aligned_cols=77 Identities=16% Similarity=0.271 Sum_probs=64.9
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
...+.+||++||+..+++.+|+..|..++.|++|.|.+. -.-++||.|.+.+.+-+|+..|....|..-.+++.+.
T Consensus 369 ~~atrTLf~Gnl~~kl~eseiR~af~e~gkve~VDiKtP~~~~esa~~f~~~~n~dmtp~ak~e~s~~~I~~g~~r~glG 448 (975)
T KOG0112|consen 369 FRATRTLFLGNLDSKLTESEIRPAFDESGKVEEVDIKTPHIKTESAYAFVSLLNTDMTPSAKFEESGPLIGNGTHRIGLG 448 (975)
T ss_pred hhhhhhhhhcCcccchhhhhhhhhhhhhccccccccccCCCCcccchhhhhhhccccCcccchhhcCCccccCccccccc
Confidence 345789999999999999999999999999999988655 2358999999999999999888888876656666555
Q ss_pred e
Q psy2347 82 R 82 (207)
Q Consensus 82 ~ 82 (207)
.
T Consensus 449 ~ 449 (975)
T KOG0112|consen 449 Q 449 (975)
T ss_pred c
Confidence 3
No 138
>KOG0128|consensus
Probab=96.34 E-value=0.00012 Score=69.19 Aligned_cols=62 Identities=27% Similarity=0.321 Sum_probs=51.5
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEe--eC---CCcEEEEEEcCHHHHHHHHHhcCCce
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVA--FN---PPGFAFVEFSNQIDAEAACDSMNDQD 70 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~--~d---~rG~afV~F~~~e~A~~Al~~l~g~~ 70 (207)
..++||+||+..+.+++|...|..++.|+.++|. .+ .++.|+|+|.+++.+.+||.....+.
T Consensus 667 ~~~~fvsnl~~~~~~~dl~~~~~~~~~~e~vqi~~h~n~~~~rG~~Y~~F~~~~~~~aaV~f~d~~~ 733 (881)
T KOG0128|consen 667 LIKIFVSNLSPKMSEEDLSERFSPSGTIEVVQIVIHKNEKRFRGKAYVEFLKPEHAGAAVAFRDSCF 733 (881)
T ss_pred HHHHHHhhcchhhcCchhhhhcCccchhhhHHHHHHhhccccccceeeEeecCCchhhhhhhhhhhh
Confidence 3689999999999999999999999988776554 11 78999999999999999998544333
No 139
>KOG2068|consensus
Probab=96.19 E-value=0.0014 Score=56.18 Aligned_cols=75 Identities=21% Similarity=0.357 Sum_probs=57.8
Q ss_pred cEEEEcCCCcCCcHHHH-H--HHHHhCCCeeEEEEeeC--------CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 10 TRVYVGGLTETVKKEDL-E--LEFEKFGKLNSVWVAFN--------PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L-~--~~F~~~G~V~~v~i~~d--------~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
..+||-.|+..+.++++ + +.|.+|+.|.+|.+..+ ...-++|+|+.+|+|..||...++..+.++.|++
T Consensus 78 nlvyvvgl~~~~ade~~l~~~eyfgqygki~ki~~~~~~S~~s~~~~~~s~yITy~~~eda~rci~~v~g~~~dg~~lka 157 (327)
T KOG2068|consen 78 NLVYVVGLPLDLADESVLERTEYFGQYGKINKIVKNKDPSSSSSSGGTCSVYITYEEEEDADRCIDDVDGFVDDGRALKA 157 (327)
T ss_pred hhhhhhCCCccccchhhhhCcccccccccceEEeecCCcccccCCCCCCcccccccchHhhhhHHHHhhhHHhhhhhhHH
Confidence 45677778776655444 3 47888999999988775 1234899999999999999999999988888777
Q ss_pred EEeecC
Q psy2347 79 EISRGR 84 (207)
Q Consensus 79 ~~a~~k 84 (207)
.+.+++
T Consensus 158 ~~gttk 163 (327)
T KOG2068|consen 158 SLGTTK 163 (327)
T ss_pred hhCCCc
Confidence 766544
No 140
>KOG4574|consensus
Probab=96.16 E-value=0.0045 Score=58.87 Aligned_cols=74 Identities=19% Similarity=0.310 Sum_probs=62.1
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCcee--CCcEEEEEEeecCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDL--MGSKLRVEISRGRG 85 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i--~g~~l~V~~a~~k~ 85 (207)
+..+.|.+-..+...|..+|.+|++|..+..+++ -..|.|+|.+.|.|..|+++|+++++ .+.+.+|.+|++-+
T Consensus 300 ~~~~~nn~v~~tSssL~~l~s~yg~v~s~wtlr~-~N~alvs~~s~~sai~a~dAl~gkevs~~g~Ps~V~~ak~~~ 375 (1007)
T KOG4574|consen 300 KQSLENNAVNLTSSSLATLCSDYGSVASAWTLRD-LNMALVSFSSVESAILALDALQGKEVSVTGAPSRVSFAKTLP 375 (1007)
T ss_pred hhhhhcccccchHHHHHHHHHhhcchhhheeccc-ccchhhhhHHHHHHHHhhhhhcCCcccccCCceeEEeccccc
Confidence 4444555566778889999999999999988877 56899999999999999999999885 57889999988654
No 141
>KOG0804|consensus
Probab=96.12 E-value=0.019 Score=51.28 Aligned_cols=66 Identities=14% Similarity=0.236 Sum_probs=57.6
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCC-CeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCCceeCC
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFG-KLNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMNDQDLMG 73 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G-~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g~~i~g 73 (207)
+++.|+|--+|..+|-.||..|...+- .|..|+|++| .+=.++|+|.+.++|....+.+|++.++.
T Consensus 73 ~~~mLcilaVP~~mt~~Dll~F~~~~~~~I~~irivRd~~pnrymvLIkFr~q~da~~Fy~efNGk~Fn~ 142 (493)
T KOG0804|consen 73 SSTMLCILAVPAYMTSHDLLRFCASFIKQISDIRIVRDGMPNRYMVLIKFRDQADADTFYEEFNGKQFNS 142 (493)
T ss_pred CCcEEEEEeccccccHHHHHHHHHHHhhhhheeEEeecCCCceEEEEEEeccchhHHHHHHHcCCCcCCC
Confidence 378899999999999999999988764 4889999998 56678999999999999999999988754
No 142
>KOG2253|consensus
Probab=95.99 E-value=0.0044 Score=57.45 Aligned_cols=74 Identities=22% Similarity=0.238 Sum_probs=65.0
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
+..++..+|||+||...+.++-++.++..+|.|..++.++ |.|.+|.++..+..|+..|+-..++++.|.+...
T Consensus 35 ~~~~~~~~vfv~~~~~~~s~~~~~~il~~~g~v~s~kr~~----fgf~~f~~~~~~~ra~r~~t~~~~~~~kl~~~~d 108 (668)
T KOG2253|consen 35 QPLPPRDTVFVGNISYLVSQEFWKSILAKSGFVPSWKRDK----FGFCEFLKHIGDLRASRLLTELNIDDQKLIENVD 108 (668)
T ss_pred cCCCCCceeEecchhhhhhHHHHHHHHhhCCcchhhhhhh----hcccchhhHHHHHHHHHHhcccCCCcchhhccch
Confidence 3556788999999999999999999999999998887654 9999999999999999999988898888777653
No 143
>KOG4660|consensus
Probab=95.70 E-value=0.015 Score=52.93 Aligned_cols=73 Identities=14% Similarity=0.232 Sum_probs=44.6
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHh-CCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeC----CcEEEEE
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEK-FGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLM----GSKLRVE 79 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~-~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~----g~~l~V~ 79 (207)
+++.|.|+|...|...|.+.-++ .+.-..+.++.| ..+||||.|.+++++..+.++.|++.++ .+.++|.
T Consensus 389 tt~~iknipNK~T~~ml~~~d~~~~gtYDFlYLPiDF~nkcNvGYAFINm~sp~ai~~F~kAFnGk~W~~FnS~Kia~it 468 (549)
T KOG4660|consen 389 TTLMIKNIPNKYTSKMLLAADEKNKGTYDFLYLPIDFKNKCNVGYAFINMTSPEAIIRFYKAFNGKKWEKFNSEKIASIT 468 (549)
T ss_pred hhhHhhccCchhhHHhhhhhhccccCccceEEeccccccccccceeEEeecCHHHHHHHHHHHcCCchhhhcceeeeeee
Confidence 34444454444443333333221 233334445545 5799999999999999999999998653 3455666
Q ss_pred Eee
Q psy2347 80 ISR 82 (207)
Q Consensus 80 ~a~ 82 (207)
+|+
T Consensus 469 YAr 471 (549)
T KOG4660|consen 469 YAR 471 (549)
T ss_pred hhh
Confidence 654
No 144
>KOG2318|consensus
Probab=95.28 E-value=0.11 Score=48.00 Aligned_cols=75 Identities=25% Similarity=0.298 Sum_probs=59.3
Q ss_pred CCCCcEEEEcCCCcC-CcHHHHHHHHHhC----CCeeEEEEeeC--------------C---------------------
Q psy2347 6 MERGTRVYVGGLTET-VKKEDLELEFEKF----GKLNSVWVAFN--------------P--------------------- 45 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~-~te~~L~~~F~~~----G~V~~v~i~~d--------------~--------------------- 45 (207)
-..+++|-|.||.|+ +..++|..+|..| |.|.+|.|... +
T Consensus 171 ~~~T~RLAVvNMDWd~v~AkDL~v~~nSFlP~gGsilSV~IYpSeFGkeRM~eEeV~GP~~el~~~~e~~~~s~sD~ee~ 250 (650)
T KOG2318|consen 171 GEETKRLAVVNMDWDRVKAKDLFVLFNSFLPKGGSILSVKIYPSEFGKERMKEEEVHGPPKELFKPVEEYKESESDDEEE 250 (650)
T ss_pred ccccceeeEeccccccccHHHHHHHHHhhcCCCCceeEEEechhhhhHHHhhhhcccCChhhhccccccCcccccchhhh
Confidence 456789999999987 6778999999887 35778877543 1
Q ss_pred -----------------CcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 46 -----------------PGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 46 -----------------rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
-=||.|+|++.+.|.++.+.+++.++....+.+.+
T Consensus 251 ~~~~~~kLR~Yq~~rLkYYyAVvecDsi~tA~~vYe~CDG~EfEsS~~~~DL 302 (650)
T KOG2318|consen 251 EDVDREKLRQYQLNRLKYYYAVVECDSIETAKAVYEECDGIEFESSANKLDL 302 (650)
T ss_pred hhHHHHHHHHHHhhhheeEEEEEEecCchHHHHHHHhcCcceeccccceeee
Confidence 13799999999999999999999998765544444
No 145
>PRK11634 ATP-dependent RNA helicase DeaD; Provisional
Probab=95.19 E-value=0.48 Score=44.92 Aligned_cols=69 Identities=4% Similarity=0.100 Sum_probs=52.0
Q ss_pred EEEEcCC--CcCCcHHHHHHHHHhCCCe-----eEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 11 RVYVGGL--TETVKKEDLELEFEKFGKL-----NSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 11 ~lfV~nL--p~~~te~~L~~~F~~~G~V-----~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
++|| ++ ...++..+|..++..-..| -.|+|..+ |.||+... +.|++.++.|+...+.+++|.|+.++.
T Consensus 488 ~~~~-~~g~~~~~~~~~~~~~i~~~~~~~~~~ig~i~i~~~---~s~v~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 (629)
T PRK11634 488 LYRI-EVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFAS---HSTIELPK-GMPGEVLQHFTRTRILNKPMNMQLLGD 562 (629)
T ss_pred EEEE-ecccccCCCHHHHHHHHHhhcCCChhhCCcEEEeCC---ceEEEcCh-hhHHHHHHHhccccccCCceEEEECCC
Confidence 3555 33 4568888888888776654 45666654 99999876 558888999999999999999998854
Q ss_pred C
Q psy2347 84 R 84 (207)
Q Consensus 84 k 84 (207)
.
T Consensus 563 ~ 563 (629)
T PRK11634 563 A 563 (629)
T ss_pred C
Confidence 3
No 146
>KOG3973|consensus
Probab=94.97 E-value=0.24 Score=43.11 Aligned_cols=6 Identities=0% Similarity=0.246 Sum_probs=2.3
Q ss_pred HHHHHH
Q psy2347 25 DLELEF 30 (207)
Q Consensus 25 ~L~~~F 30 (207)
+|.++|
T Consensus 244 ei~~~~ 249 (465)
T KOG3973|consen 244 EIQSIL 249 (465)
T ss_pred HHHHHH
Confidence 333333
No 147
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=94.70 E-value=0.22 Score=33.05 Aligned_cols=56 Identities=14% Similarity=0.299 Sum_probs=43.2
Q ss_pred cCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 19 ETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 19 ~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
..++-++|+..+.+|.- . +|..|..| -||.|.+.++|++|....++..+...+|.+
T Consensus 10 ~~~~v~d~K~~Lr~y~~-~--~I~~d~tG-fYIvF~~~~Ea~rC~~~~~~~~~f~y~m~M 65 (66)
T PF11767_consen 10 HGVTVEDFKKRLRKYRW-D--RIRDDRTG-FYIVFNDSKEAERCFRAEDGTLFFTYRMQM 65 (66)
T ss_pred CCccHHHHHHHHhcCCc-c--eEEecCCE-EEEEECChHHHHHHHHhcCCCEEEEEEEEe
Confidence 35788999999999853 3 34445344 489999999999999999988887776654
No 148
>KOG4210|consensus
Probab=94.54 E-value=0.021 Score=48.83 Aligned_cols=76 Identities=17% Similarity=0.155 Sum_probs=59.3
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
..+++||+++.+.+.+.++..++.+.+.+..+.+... +++++.|.|...+.+..||+....+.+....+...+.+
T Consensus 87 ~~~~~f~g~~s~~~e~~~~~~~~~~~g~~~~~~~S~~~~~~~sk~~~s~~f~~ks~~~~~l~~s~~~~~~~~~~~~dl~~ 166 (285)
T KOG4210|consen 87 SSSTFFVGELSENIEESEDDNFSSEAGLRVDARSSSLEDSLSSKGGLSVHFAGKSQFFAALEESGSKVLDGNKGEKDLNT 166 (285)
T ss_pred ccccccccccccchhhccccccchhhcCcccchhhhhccccccccceeeccccHHHHHHHHHhhhccccccccccCcccc
Confidence 4789999999999999999999999987766554432 78999999999999999998654456666555555544
Q ss_pred c
Q psy2347 83 G 83 (207)
Q Consensus 83 ~ 83 (207)
.
T Consensus 167 ~ 167 (285)
T KOG4210|consen 167 R 167 (285)
T ss_pred c
Confidence 3
No 149
>KOG2193|consensus
Probab=92.97 E-value=0.0033 Score=55.65 Aligned_cols=76 Identities=16% Similarity=0.252 Sum_probs=63.5
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEee-C-CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAF-N-PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~-d-~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
..+++|-|.|||+...++.|..++.+|+.|+.|.++. + ..-..-|+|.+.+.++.||.+|++..|....++|.+..
T Consensus 78 qrsrk~Qirnippql~wevld~Ll~qyg~ve~~eqvnt~~etavvnvty~~~~~~~~ai~kl~g~Q~en~~~k~~YiP 155 (584)
T KOG2193|consen 78 QRSRKIQIRNIPPQLQWEVLDSLLAQYGTVENCEQVNTDSETAVVNVTYSAQQQHRQAIHKLNGPQLENQHLKVGYIP 155 (584)
T ss_pred HHhhhhhHhcCCHHHHHHHHHHHHhccCCHhHhhhhccchHHHHHHHHHHHHHHHHHHHHhhcchHhhhhhhhcccCc
Confidence 3467788999999999999999999999999986643 3 23344578899999999999999999999999887653
No 150
>KOG4019|consensus
Probab=92.69 E-value=0.14 Score=40.59 Aligned_cols=75 Identities=16% Similarity=0.119 Sum_probs=54.2
Q ss_pred CcEEEEcCCCcCCcH-----HHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCc-EEEEEEee
Q psy2347 9 GTRVYVGGLTETVKK-----EDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGS-KLRVEISR 82 (207)
Q Consensus 9 ~~~lfV~nLp~~~te-----~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~-~l~V~~a~ 82 (207)
..+|++.+|+.++-. ...+.+|.++-+...+++++. ....-|.|.+++.|..|..+++...|.++ .|++-++.
T Consensus 10 p~~~~~c~i~~~VF~~~~~k~~~~~lFrq~n~~~~fq~lrs-frrvRi~f~~p~~a~~a~i~~~~~~f~~~~~~k~yfaQ 88 (193)
T KOG4019|consen 10 PTAIIACDIHEEVFVNREDKALFENLFRQINEDATFQLLRS-FRRVRINFSNPEAAADARIKLHSTSFNGKNELKLYFAQ 88 (193)
T ss_pred cceeeeecccHHhhccHHHHHHHHhHHhhhCcchHHHHHHh-hceeEEeccChhHHHHHHHHhhhcccCCCceEEEEEcc
Confidence 456788888765422 234456666665555555443 55778999999999999999999999998 78887776
Q ss_pred cC
Q psy2347 83 GR 84 (207)
Q Consensus 83 ~k 84 (207)
+-
T Consensus 89 ~~ 90 (193)
T KOG4019|consen 89 PG 90 (193)
T ss_pred CC
Confidence 53
No 151
>KOG3973|consensus
Probab=92.06 E-value=0.82 Score=39.93 Aligned_cols=12 Identities=17% Similarity=0.219 Sum_probs=5.9
Q ss_pred CcHHHHHHHHHh
Q psy2347 21 VKKEDLELEFEK 32 (207)
Q Consensus 21 ~te~~L~~~F~~ 32 (207)
.++++|+++.+.
T Consensus 198 a~w~~iE~~~~~ 209 (465)
T KOG3973|consen 198 ATWPEIEKQCES 209 (465)
T ss_pred hhHHHHHHHHHH
Confidence 445555554443
No 152
>KOG4676|consensus
Probab=91.24 E-value=0.028 Score=49.46 Aligned_cols=71 Identities=17% Similarity=0.122 Sum_probs=54.7
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
..+|+|.+|+..+...++.++|..+|+|...++... ..-+|.|+|....+...|+.. ++.++.-+...+.+
T Consensus 151 rRt~~v~sl~~~~~l~e~~e~f~r~Gev~ya~~ask~~s~~c~~sf~~qts~~halr~-~gre~k~qhsr~ai 222 (479)
T KOG4676|consen 151 RRTREVQSLISAAILPESGESFERKGEVSYAHTASKSRSSSCSHSFRKQTSSKHALRS-HGRERKRQHSRRAI 222 (479)
T ss_pred HhhhhhhcchhhhcchhhhhhhhhcchhhhhhhhccCCCcchhhhHhhhhhHHHHHHh-cchhhhhhhhhhhh
Confidence 367999999999999999999999999987766544 445777999998888888875 66666544333333
No 153
>KOG3262|consensus
Probab=90.58 E-value=0.8 Score=36.47 Aligned_cols=6 Identities=17% Similarity=0.119 Sum_probs=2.2
Q ss_pred EEcCHH
Q psy2347 52 EFSNQI 57 (207)
Q Consensus 52 ~F~~~e 57 (207)
-+++++
T Consensus 84 ylenk~ 89 (215)
T KOG3262|consen 84 YLENKE 89 (215)
T ss_pred eecchh
Confidence 333333
No 154
>KOG4410|consensus
Probab=90.46 E-value=0.93 Score=38.60 Aligned_cols=58 Identities=24% Similarity=0.363 Sum_probs=42.4
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcC-------HHHHHHHHHhcC
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSN-------QIDAEAACDSMN 67 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~-------~e~A~~Al~~l~ 67 (207)
.+-|+|.|||.++.-.||+..+.+.+.+ .+.|... ..+-||+.|-+ .+++.+++.++|
T Consensus 330 ~~di~~~nl~rd~rv~dlk~~lr~~~~~-pm~iswkg~~~k~flh~~~~~~~~~~~~~~~~~~~s~~ 395 (396)
T KOG4410|consen 330 KTDIKLTNLSRDIRVKDLKSELRKRECT-PMSISWKGHFGKCFLHFGNRKGVPSTQDDMDKVLKSLN 395 (396)
T ss_pred ccceeeccCccccchHHHHHHHHhcCCC-ceeEeeecCCcceeEecCCccCCCCCchHHHHHhccCC
Confidence 3569999999999999999999887643 3334433 67889999965 345566665554
No 155
>KOG4483|consensus
Probab=90.26 E-value=0.76 Score=40.82 Aligned_cols=55 Identities=20% Similarity=0.146 Sum_probs=44.9
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCe-eEEEEeeCCCcEEEEEEcCHHHHHHHHHh
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKL-NSVWVAFNPPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V-~~v~i~~d~rG~afV~F~~~e~A~~Al~~ 65 (207)
.+.|-|.++|.....+||..+|+.|..- -+|++|.| -+||..|++...|..||..
T Consensus 391 pHVlEIydfp~efkteDll~~f~~yq~kgfdIkWvDd--thalaVFss~~~AaeaLt~ 446 (528)
T KOG4483|consen 391 PHVLEIYDFPDEFKTEDLLKAFETYQNKGFDIKWVDD--THALAVFSSVNRAAEALTL 446 (528)
T ss_pred cceeEeccCchhhccHHHHHHHHHhhcCCceeEEeec--ceeEEeecchHHHHHHhhc
Confidence 4688899999999999999999999752 34555554 5899999999999999974
No 156
>KOG2295|consensus
Probab=89.54 E-value=0.06 Score=49.30 Aligned_cols=71 Identities=13% Similarity=0.130 Sum_probs=56.1
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
.+.++|||.||++.++-++|+.+++.+--+..+.+..+ ..-+.+|+|+-....+.|+.+||++.+....+.
T Consensus 229 hke~sll~rni~Pnis~aeIe~~ck~i~~~lrfals~~~aek~~~r~lwv~fk~~~ni~~a~~aLn~irl~s~~~s 304 (648)
T KOG2295|consen 229 HKECSLLVRNILPNISVAEIENLCKGIPGFLRFALSTINAEKNFERRLWVTFKRGTNIKEACWALNGIRLRSNFLS 304 (648)
T ss_pred hHHHHHHHhccCCcccHHHHHHHhccCchheeeeccCchHHHHHHHHhhHhhccccchHHHHHHhhhccccccccc
Confidence 35688999999999999999999998876666655444 345788999988888888888888877655443
No 157
>PF03468 XS: XS domain; InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important. The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=89.28 E-value=0.83 Score=33.73 Aligned_cols=52 Identities=23% Similarity=0.316 Sum_probs=27.1
Q ss_pred EEEEcCCCcC---------CcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcC-HHHHHHHH
Q psy2347 11 RVYVGGLTET---------VKKEDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSN-QIDAEAAC 63 (207)
Q Consensus 11 ~lfV~nLp~~---------~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~-~e~A~~Al 63 (207)
++.|-|++.+ ++.++|++.|+.|.+++ ++.+.+ ..++++|+|.+ -.-.+.|+
T Consensus 10 mgIi~N~~~~~~~~g~~~g~~~~~l~~~l~~f~p~k-v~~l~~~~gh~g~aiv~F~~~w~Gf~~A~ 74 (116)
T PF03468_consen 10 MGIIVNIPTEKDDDGRWVGMSNEELLDKLAEFNPLK-VKPLYGKQGHTGFAIVEFNKDWSGFKNAM 74 (116)
T ss_dssp EEEEE----EE-TTS-EE---SHHHHHHHHH---SE-EEEEEETTEEEEEEEEE--SSHHHHHHHH
T ss_pred EEEEEcCccccCCCCceeccCHHHHHHHHHhcCCce-eEECcCCCCCcEEEEEEECCChHHHHHHH
Confidence 5667787543 35678999999998875 444443 56899999985 33334443
No 158
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=88.49 E-value=1 Score=38.42 Aligned_cols=75 Identities=12% Similarity=0.229 Sum_probs=57.8
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCC------------CcEEEEEEcCHHHHHHHH----HhcC--Cc
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNP------------PGFAFVEFSNQIDAEAAC----DSMN--DQ 69 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~------------rG~afV~F~~~e~A~~Al----~~l~--g~ 69 (207)
.++.|.+.||..+++-.++...|.+|++|+.|.++.+. .....+.|-+.+.|.... ..|+ .+
T Consensus 14 rTRSLLfeNv~~sidLh~Fl~~fv~~~pIESiYL~~~~~~~~d~~~~d~~~~SilLSFlsr~~CLdFYNnvLQrLsEfK~ 93 (309)
T PF10567_consen 14 RTRSLLFENVNNSIDLHSFLTKFVKFGPIESIYLIKSNDKPSDDYNDDKNNQSILLSFLSREICLDFYNNVLQRLSEFKT 93 (309)
T ss_pred eeHHHHHhhccccccHHHHHHHhhccCceeEEEEecCCCcccccccccccceEEEEeeechHHHHHHHHHHHHHHHHHHH
Confidence 46788899999999999999999999999999998762 467889999988876433 2222 23
Q ss_pred eeCCcEEEEEEee
Q psy2347 70 DLMGSKLRVEISR 82 (207)
Q Consensus 70 ~i~g~~l~V~~a~ 82 (207)
.|+...|+|.+..
T Consensus 94 ~L~S~~L~lsFV~ 106 (309)
T PF10567_consen 94 KLKSESLTLSFVS 106 (309)
T ss_pred hcCCcceeEEEEE
Confidence 4677778877765
No 159
>PF00403 HMA: Heavy-metal-associated domain; InterPro: IPR006121 Proteins that transport heavy metals in micro-organisms and mammals share similarities in their sequences and structures. These proteins provide an important focus for research, some being involved in bacterial resistance to toxic metals, such as lead and cadmium, while others are involved in inherited human syndromes, such as Wilson's and Menke's diseases []. A conserved domain has been found in a number of these heavy metal transport or detoxification proteins []. The domain, which has been termed Heavy-Metal-Associated (HMA), contains two conserved cysteines that are probably involved in metal binding. Structure solution of the fourth HMA domain of the Menke's copper transporting ATPase shows a well-defined structure comprising a four-stranded antiparallel beta-sheet and two alpha helices packed in an alpha-beta sandwich fold []. This fold is common to other domains and is classified as "ferredoxin-like".; GO: 0046872 metal ion binding, 0030001 metal ion transport; PDB: 2VOY_A 1P6T_A 1KQK_A 2RML_A 1JWW_A 3K7R_F 1FES_A 1CC8_A 1FD8_A 2GGP_A ....
Probab=85.64 E-value=5.8 Score=25.04 Aligned_cols=54 Identities=19% Similarity=0.152 Sum_probs=41.6
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCH----HHHHHHHHh
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQ----IDAEAACDS 65 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~----e~A~~Al~~ 65 (207)
+|.|.+|.-..-...|++.+.+.-.|.++.+... ...+-|+|+.. ++..++|++
T Consensus 1 t~~v~~m~C~~C~~~v~~~l~~~~GV~~v~vd~~-~~~v~v~~~~~~~~~~~i~~~i~~ 58 (62)
T PF00403_consen 1 TFKVPGMTCEGCAKKVEKALSKLPGVKSVKVDLE-TKTVTVTYDPDKTSIEKIIEAIEK 58 (62)
T ss_dssp EEEEESTTSHHHHHHHHHHHHTSTTEEEEEEETT-TTEEEEEESTTTSCHHHHHHHHHH
T ss_pred CEEECCcccHHHHHHHHHHHhcCCCCcEEEEECC-CCEEEEEEecCCCCHHHHHHHHHH
Confidence 4677777776677889999999988999888765 56888898743 566666665
No 160
>cd06405 PB1_Mekk2_3 The PB1 domain is present in the two mitogen-activated protein kinase kinases MEKK2 and MEKK3 which are two members of the signaling kinase cascade involved in angiogenesis and early cardiovascular development. The PB1 domain of MEKK2 (and/or MEKK3) interacts with the PB1 domain of another member of the kinase cascade Map2k5. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, f
Probab=84.97 E-value=9.8 Score=25.86 Aligned_cols=62 Identities=23% Similarity=0.288 Sum_probs=44.8
Q ss_pred CCCcCCcHHHHHHHH-HhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 16 GLTETVKKEDLELEF-EKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 16 nLp~~~te~~L~~~F-~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
.+|..+.-+||+... ..||...++..+.+ .-.|-..+.++.++|++.++. ...-+.|+|-+.
T Consensus 15 ~f~RPvkf~dl~~kv~~afGq~mdl~ytn~---eL~iPl~~Q~DLDkAie~ld~-s~~~ksLRilL~ 77 (79)
T cd06405 15 QFPRPVKFKDLQQKVTTAFGQPMDLHYTNN---ELLIPLKNQEDLDRAIELLDR-SPHMKSLRILLS 77 (79)
T ss_pred ecCCCccHHHHHHHHHHHhCCeeeEEEecc---cEEEeccCHHHHHHHHHHHcc-CccccceeEeEe
Confidence 456677767766544 46998888877765 477888999999999998766 445555666554
No 161
>PF02714 DUF221: Domain of unknown function DUF221; InterPro: IPR003864 This domain is found in a family of hypothetical transmembrane proteins none of which have any known function, the aligned region is at 538 residues at maximum length.; GO: 0016020 membrane
Probab=83.06 E-value=1.2 Score=38.17 Aligned_cols=33 Identities=27% Similarity=0.314 Sum_probs=24.5
Q ss_pred EEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 49 AFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 49 afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
|||+|+++++|+.|++.+..++ .++++|+.|.+
T Consensus 1 aFVtF~~~~~a~~~~q~~~~~~--~~~~~v~~APe 33 (325)
T PF02714_consen 1 AFVTFNSQKSAQIALQLLLSKR--PNSWRVSPAPE 33 (325)
T ss_pred CEEEECCHHHHHHHHHHHhcCC--CCCceEeeCCC
Confidence 7999999999999998654443 35556766644
No 162
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=82.65 E-value=1.6 Score=34.95 Aligned_cols=40 Identities=23% Similarity=0.392 Sum_probs=34.3
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN 44 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d 44 (207)
.......+++.+++..+++.++..+|..++.+..+.+...
T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (306)
T COG0724 221 LLEKSDNLYVGNLPLKTAEEELADLFKSRGDIVRASLPPS 260 (306)
T ss_pred cccccceeeccccccccchhHHHHhccccccceeeeccCC
Confidence 4456789999999999999999999999999977766655
No 163
>PF14893 PNMA: PNMA
Probab=82.64 E-value=1.1 Score=39.08 Aligned_cols=52 Identities=19% Similarity=0.316 Sum_probs=35.5
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHh-CCCeeEEEEeeC------CCcEEEEEEcCHH
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEK-FGKLNSVWVAFN------PPGFAFVEFSNQI 57 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~-~G~V~~v~i~~d------~rG~afV~F~~~e 57 (207)
.++-+.|.|.+||+++++++|++.+.. .-++-..+|... ..-.|+|+|...-
T Consensus 15 ~~~~r~lLv~giP~dc~~~ei~e~l~~~l~plg~yrvl~~~f~~~~~~~aalve~~e~~ 73 (331)
T PF14893_consen 15 VDPQRALLVLGIPEDCEEAEIEEALQAALSPLGRYRVLGKMFRREENAKAALVEFAEDV 73 (331)
T ss_pred cChhhhheeecCCCCCCHHHHHHHHHHhhcccccceehhhHhhhhcccceeeeeccccc
Confidence 356678999999999999999998765 323333333222 4457888887533
No 164
>PRK10629 EnvZ/OmpR regulon moderator; Provisional
Probab=82.05 E-value=13 Score=27.83 Aligned_cols=73 Identities=8% Similarity=0.070 Sum_probs=49.7
Q ss_pred CCCcEEEEcCCCcC---CcHHHHHHHHHhCCC-eeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 7 ERGTRVYVGGLTET---VKKEDLELEFEKFGK-LNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 7 ~~~~~lfV~nLp~~---~te~~L~~~F~~~G~-V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
.+...|.|+..... .+.+.|++++++-+- ++.+..-. .-..|.|++.|+..+|.+.|+...-+.-.|.+.++.
T Consensus 33 gedpavQIs~~~~g~~~~~~~~v~~~L~~~gI~~ksi~~~~---~~~~irf~~~~~Ql~Ak~vL~~~L~~~y~VAlnl~p 109 (127)
T PRK10629 33 QQESTLAIRAVHQGASLPDGFYVYQHLDANGIHIKSITPEN---DSLLIRFDSPEQSAAAKEVLDRTLPHGYIIAQQDDN 109 (127)
T ss_pred CCCceEEEecCCCCCccchHHHHHHHHHHCCCCcceEEeeC---CEEEEEECCHHHHHHHHHHHHHHcCCCCEEEEecCC
Confidence 45567788766333 456778888887763 44554433 378899999999999988876555455566666554
No 165
>KOG2891|consensus
Probab=81.59 E-value=0.53 Score=39.98 Aligned_cols=37 Identities=24% Similarity=0.470 Sum_probs=28.8
Q ss_pred CCCcEEEEcCCCcC------------CcHHHHHHHHHhCCCeeEEEEee
Q psy2347 7 ERGTRVYVGGLTET------------VKKEDLELEFEKFGKLNSVWVAF 43 (207)
Q Consensus 7 ~~~~~lfV~nLp~~------------~te~~L~~~F~~~G~V~~v~i~~ 43 (207)
+...+|++..||-. .+|+.|+..|+.||+|..|.|+.
T Consensus 147 erpdti~la~ip~kwf~lkedg~~dlpse~rlr~a~eafg~ir~vdipi 195 (445)
T KOG2891|consen 147 ERPDTIHLAGIPCKWFALKEDGSEDLPSEDRLRKAFEAFGEIRNVDIPI 195 (445)
T ss_pred CCCCceeecCCcceeeeecccccccCChHHHHHHHHHHhccceecCCcc
Confidence 34568888888732 46788999999999999888753
No 166
>KOG1295|consensus
Probab=79.53 E-value=2.1 Score=37.71 Aligned_cols=64 Identities=20% Similarity=0.229 Sum_probs=46.7
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCC-eeEEEEeeC-------CCcEEEEEEcCHHHHHHHHHhcCCcee
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGK-LNSVWVAFN-------PPGFAFVEFSNQIDAEAACDSMNDQDL 71 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~-V~~v~i~~d-------~rG~afV~F~~~e~A~~Al~~l~g~~i 71 (207)
.-++|.|.+||+.+++++|.+.+..+-+ |....+.+. ....|+|.|..++++..-...++++++
T Consensus 6 ~~~Kvv~rrlpp~l~~~~~~eqi~p~~~~v~~~~F~~a~~s~~~~~ysrayinFk~~~dv~ef~~~f~g~if 77 (376)
T KOG1295|consen 6 AKVKVVVRRLPPKLTEEQLLEQINPFPEHVNWEFFAKADESLRNHKYSRAYINFKNPEDVEEFRRRFDGYIF 77 (376)
T ss_pred cceeeeeecCCCcccHHHHhhhcCCCccccchheeccccccchhhhhhhhhhccccHHHHHHHHhhCCceEE
Confidence 3468889999999999999887777543 222222211 346799999999998888887787764
No 167
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=77.86 E-value=7.7 Score=26.96 Aligned_cols=55 Identities=20% Similarity=0.257 Sum_probs=38.8
Q ss_pred EEEcCCCcCCcHHHHHHHHHh-CC-CeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 12 VYVGGLTETVKKEDLELEFEK-FG-KLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F~~-~G-~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l 66 (207)
.|+--++.+++..+|++.+++ |+ +|.+|..+.. ..--|||++...++|......|
T Consensus 23 ~y~F~V~~~anK~eIK~AvE~lf~VkV~~VnT~~~~~~~KKA~V~L~~g~~A~~va~ki 81 (84)
T PRK14548 23 KLTFIVDRRATKPDIKRAVEELFDVKVEKVNTLITPKGEKKAYVKLAEEYDAEEIASRL 81 (84)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcCCCcEEEEEEeCCCCcHHHHHHhh
Confidence 344456888999999999988 44 3556655443 2346999999888888776553
No 168
>COG5638 Uncharacterized conserved protein [Function unknown]
Probab=77.83 E-value=10 Score=34.11 Aligned_cols=27 Identities=15% Similarity=0.213 Sum_probs=23.0
Q ss_pred EEEEEEcCHHHHHHHHHhcCCceeCCc
Q psy2347 48 FAFVEFSNQIDAEAACDSMNDQDLMGS 74 (207)
Q Consensus 48 ~afV~F~~~e~A~~Al~~l~g~~i~g~ 74 (207)
||.|+|++.+.++.+..++++.++...
T Consensus 260 yAvvec~d~~tsK~iY~~CDG~Eye~s 286 (622)
T COG5638 260 YAVVECEDIETSKNIYSACDGVEYENS 286 (622)
T ss_pred EEEEEeccchhhHHHHhccCccccccc
Confidence 789999999999999999888877543
No 169
>PRK08559 nusG transcription antitermination protein NusG; Validated
Probab=76.61 E-value=10 Score=29.13 Aligned_cols=34 Identities=15% Similarity=0.212 Sum_probs=27.4
Q ss_pred eeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCc
Q psy2347 36 LNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 36 V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~ 69 (207)
|.+|.++...++|.||+.+..+++..++..+.+.
T Consensus 36 i~~i~vp~~fpGYVfVe~~~~~~~~~~i~~v~~v 69 (153)
T PRK08559 36 IYAILAPPELKGYVLVEAESKGAVEEAIRGIPHV 69 (153)
T ss_pred EEEEEccCCCCcEEEEEEEChHHHHHHHhcCCCE
Confidence 6666666668999999999889899998877653
No 170
>KOG4008|consensus
Probab=76.37 E-value=2.4 Score=35.15 Aligned_cols=37 Identities=8% Similarity=0.152 Sum_probs=30.6
Q ss_pred CCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEE
Q psy2347 4 GMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVW 40 (207)
Q Consensus 4 ~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~ 40 (207)
.+..+..+||+-|||..+|++.|+++.++++-+..+.
T Consensus 35 s~~~eKd~lfl~Nvp~~~tee~lkr~vsqlg~vq~~~ 71 (261)
T KOG4008|consen 35 SNSNEKDCLFLVNVPLLSTEEHLKRFVSQLGHVQELL 71 (261)
T ss_pred cccccccceeeecccccccHHHHHHHHHHhhhhhhee
Confidence 3456678999999999999999999999998665543
No 171
>PF03439 Spt5-NGN: Early transcription elongation factor of RNA pol II, NGN section; InterPro: IPR005100 Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4 []. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit []. ; PDB: 3H7H_B 3QQC_D 3P8B_B 2EXU_A.
Probab=75.52 E-value=8.5 Score=26.48 Aligned_cols=36 Identities=17% Similarity=0.321 Sum_probs=26.3
Q ss_pred CeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCce
Q psy2347 35 KLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQD 70 (207)
Q Consensus 35 ~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~ 70 (207)
.|.++..+.+.++|-|||=.+++++++|++.+....
T Consensus 33 ~I~Si~~~~~lkGyIyVEA~~~~~V~~ai~gi~~i~ 68 (84)
T PF03439_consen 33 NIYSIFAPDSLKGYIYVEAERESDVKEAIRGIRHIR 68 (84)
T ss_dssp ---EEEE-TTSTSEEEEEESSHHHHHHHHTT-TTEE
T ss_pred ceEEEEEeCCCceEEEEEeCCHHHHHHHHhccccee
Confidence 466666666689999999999999999998876544
No 172
>PRK11901 hypothetical protein; Reviewed
Probab=75.30 E-value=6.3 Score=34.24 Aligned_cols=50 Identities=16% Similarity=0.243 Sum_probs=34.0
Q ss_pred CCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEE--EEcCHHHHHHHHHhcCCce
Q psy2347 20 TVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFV--EFSNQIDAEAACDSMNDQD 70 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV--~F~~~e~A~~Al~~l~g~~ 70 (207)
..++++|++|..++. +..+.|... ...|..| +|.+.++|++|++.|....
T Consensus 253 as~~~~L~~f~~~~~-L~~~~VYqT~RnGkpWYVVvyG~Y~Sr~eAk~Ai~sLPa~l 308 (327)
T PRK11901 253 ASRSDTLNAYAKKQN-LSHYHVYETKRDGKPWYVLVSGNYASSAEAKRAIATLPAEV 308 (327)
T ss_pred CCCHHHHHHHHHHcC-cCceEEEEEEECCceEEEEEecCcCCHHHHHHHHHhCCHHH
Confidence 356788888888774 444544433 3345444 6899999999999886533
No 173
>KOG4365|consensus
Probab=74.15 E-value=0.34 Score=43.48 Aligned_cols=73 Identities=10% Similarity=-0.011 Sum_probs=54.5
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
.+.|+..||..+++++|.-+|+.++.|..+.+... .+-.+||+-.+ +++..+|..+.-+.+.+.+++|.++..
T Consensus 4 ~~~~l~d~~~~~~~~~~~~~~~d~~~i~~~d~~~~~~~~~~~v~~f~~~~~-~~~~~~i~~~k~q~~~~~~~r~~~~~~ 81 (572)
T KOG4365|consen 4 MKKSLKDSVASNNKDQNSMKHEDPSIISMEDGSPYVNGSLGEVTPFQHAKK-ANGPNYIQPQKRQTTFESQDRKAVSPS 81 (572)
T ss_pred hhhhHhhcccccccchhhhhccCCcceeeccCCccccCCcceeeeeeeeec-cCcccccCHHHHhhhhhhhhhhhcCch
Confidence 45678889999999999999999999887765433 34567877665 456667766666777777777777653
No 174
>PF07530 PRE_C2HC: Associated with zinc fingers; InterPro: IPR006579 This domain is present in proteins found exclusively in the arthropods, including a number of Drosophila species, the silk moth and the gypsy moth. These proteins are possibly involved in RNA binding or single strand DNA binding.
Probab=73.76 E-value=6.8 Score=26.00 Aligned_cols=58 Identities=22% Similarity=0.407 Sum_probs=37.5
Q ss_pred HHHHHHHHhCC-CeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 24 EDLELEFEKFG-KLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 24 ~~L~~~F~~~G-~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
++|++.|...| +|.+|.-+.. +....||+.+...+.++ .++=+.|.+..|+|+..+.+
T Consensus 2 ~~I~~~L~~~G~~v~~i~~~~~~~~k~pl~mf~veL~p~~~~k~---i~~Ik~l~~~~V~vE~~~k~ 65 (68)
T PF07530_consen 2 EEIKEELKDQGHPVRNIHNMHSRNTKKPLNMFFVELEPKPNNKE---IYKIKTLCGQRVKVERPRKR 65 (68)
T ss_pred HHHHHHHHHcCCceEEEEccccCCCCCCceEEEEeeccCccccc---eeehHhhCCeEEEEecCCCC
Confidence 46677777766 4666665554 34577888876655333 35567788888888765543
No 175
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=73.27 E-value=5.6 Score=27.91 Aligned_cols=32 Identities=22% Similarity=0.306 Sum_probs=22.6
Q ss_pred EEEEEcCHHHHHHHHHhcCCce--eCCcEEEEEEe
Q psy2347 49 AFVEFSNQIDAEAACDSMNDQD--LMGSKLRVEIS 81 (207)
Q Consensus 49 afV~F~~~e~A~~Al~~l~g~~--i~g~~l~V~~a 81 (207)
|+|+|++++.|++.++. ..+. ++...+.|+..
T Consensus 1 AlITF~e~~VA~~i~~~-~~~~v~l~~~~~~V~v~ 34 (88)
T PF07292_consen 1 ALITFEEEGVAQRILKK-KKHPVPLEDCCVRVKVS 34 (88)
T ss_pred CEEEeCcHHHHHHHHhC-CEEEEEECCEEEEEEEE
Confidence 68999999999999874 4433 45555555443
No 176
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=73.13 E-value=9.3 Score=26.07 Aligned_cols=56 Identities=20% Similarity=0.237 Sum_probs=38.7
Q ss_pred EEEEcCCCcCCcHHHHHHHHHh-CC-CeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 11 RVYVGGLTETVKKEDLELEFEK-FG-KLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~-~G-~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l 66 (207)
.-|+-.++.+++..+|++.+++ |+ +|..|..+.- ..--|||++..-++|..+...|
T Consensus 15 n~y~F~V~~~anK~eIK~avE~lf~VkV~~Vnt~~~~~~~KKA~VtL~~g~~a~~va~k~ 74 (77)
T TIGR03636 15 NKLTFIVDRKATKGDIKRAVEKLFDVKVEKVNTLITPRGEKKAYVKLAEEYAAEEIASRL 74 (77)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcCCCceEEEEEECCCCcHHHHHHhh
Confidence 3455567889999999999988 43 3555544433 2346999998888877766543
No 177
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=72.31 E-value=1.8 Score=32.73 Aligned_cols=73 Identities=15% Similarity=0.147 Sum_probs=46.7
Q ss_pred cEEEEcCC-C-cCCcHHHHHHHHHhC-CCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 10 TRVYVGGL-T-ETVKKEDLELEFEKF-GKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 10 ~~lfV~nL-p-~~~te~~L~~~F~~~-G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
....|+.+ . .+++.+.|.+.+.+. .....+.+..-...+..+.|+++++++++++. ....+.+..|.++.-+|
T Consensus 16 ~~~lVg~~l~~~~~~~~~l~~~l~~~W~~~~~~~i~~l~~~~fl~~F~~~~d~~~vl~~-~p~~~~~~~~~l~~W~~ 91 (153)
T PF14111_consen 16 QLCLVGRVLSPKPISLSALEQELAKIWKLKGGVKIRDLGDNLFLFQFESEEDRQRVLKG-GPWNFNGHFLILQRWSP 91 (153)
T ss_pred CeEEEEEECCCCCCCHHHHHHHHHHHhCCCCcEEEEEeCCCeEEEEEEeccceeEEEec-ccccccccchhhhhhcc
Confidence 34445544 2 346677777766653 33223444333467999999999999999874 55667777777765554
No 178
>PF14026 DUF4242: Protein of unknown function (DUF4242)
Probab=69.56 E-value=32 Score=23.28 Aligned_cols=54 Identities=11% Similarity=0.068 Sum_probs=36.4
Q ss_pred EEEcCCCcCCcHHHHHHHHHhC-------CCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHh
Q psy2347 12 VYVGGLTETVKKEDLELEFEKF-------GKLNSVWVAFN---PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F~~~-------G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~ 65 (207)
|-..+||..+|.++|.++..+. -.|..++...+ .+-||+.+=.|+|++.++.+.
T Consensus 3 mver~~p~~it~e~l~~~~~~~~~~~~~~~~V~w~~s~v~~d~~k~~Cly~Ap~~eaV~~~~~~ 66 (77)
T PF14026_consen 3 MVERDFPGGITAEDLAAAHAKSCAVQAEMPGVQWLRSYVSEDDGKIFCLYEAPDEEAVREHARR 66 (77)
T ss_pred EEEEeCCCCCCHHHHHHHHHHhHHHHhhcCCeEEEEEEEecCCCeEEEEEECCCHHHHHHHHHH
Confidence 4456788889999988766543 23444432221 567888888899998888775
No 179
>KOG4213|consensus
Probab=69.10 E-value=6.2 Score=31.46 Aligned_cols=45 Identities=22% Similarity=0.127 Sum_probs=29.3
Q ss_pred CcHHHHHHHHHhC-CCeeEEEEeeC------CCcEEEEEEcCHHHHHHHHHh
Q psy2347 21 VKKEDLELEFEKF-GKLNSVWVAFN------PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 21 ~te~~L~~~F~~~-G~V~~v~i~~d------~rG~afV~F~~~e~A~~Al~~ 65 (207)
+|++.|.++..-. +.+..|.+... .++-.||+|.+.+.|.++++.
T Consensus 118 ~td~ql~~l~qw~~~k~~nv~mr~~~~k~~~fkGsvkv~f~tk~qa~a~~~~ 169 (205)
T KOG4213|consen 118 ITDDQLDDLNQWASGKGHNVKMRRHGNKAHPFKGSVKVTFQTKEQAFANDDT 169 (205)
T ss_pred CCHHHHHHHHHHhcccceEeeccccCCCCCCCCCceEEEeecHHHHHhhhhh
Confidence 3444444433221 56777665433 568899999999999988765
No 180
>smart00596 PRE_C2HC PRE_C2HC domain.
Probab=68.75 E-value=8.9 Score=25.64 Aligned_cols=56 Identities=20% Similarity=0.296 Sum_probs=37.9
Q ss_pred HHHHHHHHhCC-CeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 24 EDLELEFEKFG-KLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 24 ~~L~~~F~~~G-~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
++|++.|++.| +|..|..|.. +...-+|+.....+... .|+-+.|.+++|+|+...
T Consensus 2 ~~I~~~L~~~G~~v~~i~~m~~~~~r~P~nmf~vel~~~~~~~~---Il~ik~Lg~~~V~VEr~~ 63 (69)
T smart00596 2 SQIEEALKDIGFPVLFIHNMLNRDTKNPQNMFEVELVPAANGKE---ILNIKTLGGQRVTVERPH 63 (69)
T ss_pred HHHHHHHHHcCCceeEEEcccccCCCCcceeEEEEeeecCCCcc---eEeehhhCCeeEEEecCc
Confidence 46777888877 4677777665 44567788776544333 356677889998887554
No 181
>COG2608 CopZ Copper chaperone [Inorganic ion transport and metabolism]
Probab=68.41 E-value=21 Score=23.54 Aligned_cols=55 Identities=25% Similarity=0.220 Sum_probs=39.8
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcC----HHHHHHHHHh
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSN----QIDAEAACDS 65 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~----~e~A~~Al~~ 65 (207)
.+|.|.++.-.-=...+++.++....|.++++-.+ ++.+.|+|++ .++...|++.
T Consensus 4 ~~l~v~~MtC~~C~~~V~~al~~v~gv~~v~v~l~-~~~~~V~~d~~~~~~~~i~~ai~~ 62 (71)
T COG2608 4 TTLKVEGMTCGHCVKTVEKALEEVDGVASVDVDLE-KGTATVTFDSNKVDIEAIIEAIED 62 (71)
T ss_pred EEEEECCcCcHHHHHHHHHHHhcCCCeeEEEEEcc-cCeEEEEEcCCcCCHHHHHHHHHH
Confidence 46777777665556788888888888888888776 5579999987 3444445443
No 182
>PF15513 DUF4651: Domain of unknown function (DUF4651)
Probab=66.91 E-value=13 Score=24.23 Aligned_cols=20 Identities=15% Similarity=0.493 Sum_probs=16.3
Q ss_pred HHHHHHHHHhCCCeeEEEEe
Q psy2347 23 KEDLELEFEKFGKLNSVWVA 42 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~ 42 (207)
.++|+++|++.|+|.-+.|.
T Consensus 8 ~~~iR~~fs~lG~I~vLYvn 27 (62)
T PF15513_consen 8 TAEIRQFFSQLGEIAVLYVN 27 (62)
T ss_pred HHHHHHHHHhcCcEEEEEEc
Confidence 36899999999998876553
No 183
>PF07237 DUF1428: Protein of unknown function (DUF1428); InterPro: IPR009874 This family consists of several hypothetical bacterial and one archaeal sequence of around 120 residues in length. The function of this family is unknown.; PDB: 2OKQ_A.
Probab=65.69 E-value=18 Score=26.12 Aligned_cols=42 Identities=21% Similarity=0.272 Sum_probs=30.6
Q ss_pred HHHHHHHhCCCeeEEEEeeC--------------------CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 25 DLELEFEKFGKLNSVWVAFN--------------------PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 25 ~L~~~F~~~G~V~~v~i~~d--------------------~rG~afV~F~~~e~A~~Al~~l 66 (207)
...++|..||.+..++..-+ .--|.+|+|.|++..+++.+++
T Consensus 24 ~a~~vf~e~GAl~~vE~wgdDvp~G~~TsF~~Av~a~~~E~VVFSWi~wpska~rD~~~~k~ 85 (103)
T PF07237_consen 24 KAAEVFKEHGALRVVECWGDDVPDGKVTSFPRAVKAKPDETVVFSWIEWPSKATRDAANAKM 85 (103)
T ss_dssp HHHHHHHHTT-SEEEEEEEEE----SS--HHHHTT--TTEEEEEEEEEESSHHHHHHHHHHH
T ss_pred HHHHHHHHhCCEEEEEeecCcCCcCccCCHHHHhcCCCCCEEEEEEEEcCCHHHHHHHHHHh
Confidence 34578999998777665433 2358999999999998888764
No 184
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=65.60 E-value=2.5 Score=29.66 Aligned_cols=22 Identities=23% Similarity=0.442 Sum_probs=19.3
Q ss_pred CCcEEEEcCCCcCCcHHHHHHH
Q psy2347 8 RGTRVYVGGLTETVKKEDLELE 29 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~ 29 (207)
..++|.|.+||..++|++|++.
T Consensus 51 s~rtVlvsgip~~l~ee~l~D~ 72 (88)
T PF07292_consen 51 SKRTVLVSGIPDVLDEEELRDK 72 (88)
T ss_pred cCCEEEEeCCCCCCChhhheee
Confidence 4678999999999999999874
No 185
>PF13037 DUF3898: Domain of unknown function (DUF3898)
Probab=64.97 E-value=6.7 Score=27.36 Aligned_cols=47 Identities=17% Similarity=0.273 Sum_probs=34.3
Q ss_pred CcHHHHHHHHHhCCCeeEEEEeeC-------------CCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 21 VKKEDLELEFEKFGKLNSVWVAFN-------------PPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~~v~i~~d-------------~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
+.+-+++.+++.||.-..+.-+.+ -|++.=|+|-.+++.+..++.+.
T Consensus 31 Ld~~~Vk~lLaDfG~~iHiAKv~~RYv~liEgd~~~FEKG~SPVEflkP~~l~~V~eri~ 90 (91)
T PF13037_consen 31 LDHTTVKGLLADFGETIHIAKVNDRYVLLIEGDSLQFEKGFSPVEFLKPEDLQEVIERIK 90 (91)
T ss_pred cCceehhHHHHhhccceeEEEECCEEEEEEEcceEEEccCCCceeeeCchhHHHHHHHhc
Confidence 455678899999997444322222 58888899999999999888653
No 186
>COG0079 HisC Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase [Amino acid transport and metabolism]
Probab=62.34 E-value=8.5 Score=33.89 Aligned_cols=49 Identities=18% Similarity=0.267 Sum_probs=35.0
Q ss_pred CCcEEEEcCCC----cCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHH
Q psy2347 8 RGTRVYVGGLT----ETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAA 62 (207)
Q Consensus 8 ~~~~lfV~nLp----~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~A 62 (207)
+.+.|||.|=. ..++.++|+++.+..-. .+.|+.| .||++|.. +++...
T Consensus 145 ~~~lv~i~nPNNPTG~~~~~~~l~~l~~~~~~--~~~vVvD---EAY~eF~~-~~~~~l 197 (356)
T COG0079 145 KTKLVFLCNPNNPTGTLLPREELRALLEALPE--GGLVVID---EAYIEFSP-ESSLEL 197 (356)
T ss_pred CCCEEEEeCCCCCCCCCCCHHHHHHHHHhCCC--CcEEEEe---CchhhcCC-chhhhh
Confidence 56789998642 24788999999998766 4445556 79999999 433333
No 187
>KOG1999|consensus
Probab=61.83 E-value=25 Score=34.92 Aligned_cols=33 Identities=18% Similarity=0.251 Sum_probs=25.2
Q ss_pred CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 45 PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 45 ~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
.++|-|||=..+..++.||+.|-+..+. +.|.|
T Consensus 209 lkGyIYIEA~KqshV~~Ai~gv~niy~~-~~~lV 241 (1024)
T KOG1999|consen 209 LKGYIYIEADKQSHVKEAIEGVRNIYAN-RILLV 241 (1024)
T ss_pred cceeEEEEechhHHHHHHHhhhhhheec-cEEEE
Confidence 7889999988888899998887766655 44333
No 188
>COG5470 Uncharacterized conserved protein [Function unknown]
Probab=61.73 E-value=18 Score=25.76 Aligned_cols=40 Identities=20% Similarity=0.332 Sum_probs=25.8
Q ss_pred HHHHHHHhCCC--------eeEEEEeeCCCcEEEEEEcCHHHHHHHHH
Q psy2347 25 DLELEFEKFGK--------LNSVWVAFNPPGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 25 ~L~~~F~~~G~--------V~~v~i~~d~rG~afV~F~~~e~A~~Al~ 64 (207)
.+...|++||- |+.++=..++.....|||+|.+.|..+..
T Consensus 24 ~~~~a~~~~Ggr~LvRGG~v~~lEG~w~ptr~vviEFps~~~ar~~y~ 71 (96)
T COG5470 24 KAKPAIEKFGGRYLVRGGEVETLEGEWRPTRNVVIEFPSLEAARDCYN 71 (96)
T ss_pred HhHHHHHHhCCeeEeeCCCeeeccCCCCcccEEEEEcCCHHHHHHHhc
Confidence 35567778864 33332222246689999999999887653
No 189
>COG3254 Uncharacterized conserved protein [Function unknown]
Probab=60.92 E-value=29 Score=25.14 Aligned_cols=40 Identities=13% Similarity=0.161 Sum_probs=27.8
Q ss_pred HHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHH
Q psy2347 24 EDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 24 ~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~ 64 (207)
.+|.+++++++ |.+-.|..| ..-|+++++.|.++..++|.
T Consensus 27 PE~~a~lk~ag-i~nYSIfLde~~n~lFgy~E~~d~~a~m~~~a 69 (105)
T COG3254 27 PELLALLKEAG-IRNYSIFLDEEENLLFGYWEYEDFEADMAKMA 69 (105)
T ss_pred HHHHHHHHHcC-CceeEEEecCCcccEEEEEEEcChHHHHHHHh
Confidence 46777888886 555555555 56799999996666555553
No 190
>PF15063 TC1: Thyroid cancer protein 1
Probab=60.25 E-value=2.5 Score=28.71 Aligned_cols=48 Identities=15% Similarity=0.174 Sum_probs=31.5
Q ss_pred EEEcCCCcCCcHHHHHHHHHhCCCee---EEEEeeCCCcEEEEEEcCHHHHHHHHHhc
Q psy2347 12 VYVGGLTETVKKEDLELEFEKFGKLN---SVWVAFNPPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F~~~G~V~---~v~i~~d~rG~afV~F~~~e~A~~Al~~l 66 (207)
--+.||=.+++.+.|+.+|.+-|+.+ .++|+.. .-.++++..+||..|
T Consensus 28 kasaNIFe~vn~~qlqrLF~~sGD~kAEeRA~iI~~-------~~~d~ee~a~AL~~L 78 (79)
T PF15063_consen 28 KASANIFENVNLDQLQRLFQKSGDKKAEERARIIWE-------CAQDPEEKARALMAL 78 (79)
T ss_pred hhhhhhhhccCHHHHHHHHHHccchhHHHHHHHHHh-------hCCCHHHHHHHHHhc
Confidence 33456667899999999999998743 3344332 224666666777654
No 191
>KOG4454|consensus
Probab=59.52 E-value=1.9 Score=35.47 Aligned_cols=66 Identities=23% Similarity=0.315 Sum_probs=48.6
Q ss_pred CcEEEEcC----CCcCCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCc
Q psy2347 9 GTRVYVGG----LTETVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGS 74 (207)
Q Consensus 9 ~~~lfV~n----Lp~~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~ 74 (207)
..+++.++ |...+|++.+.+.|++-++|+.+++.++ ++.+.||++....++-.++......++--+
T Consensus 80 q~~~r~G~shapld~r~~~ei~~~v~s~a~p~~~~R~~~~~d~rnrn~~~~~~qr~~~~P~~~~~y~~l~~~~~ 153 (267)
T KOG4454|consen 80 QRTLRCGNSHAPLDERVTEEILYEVFSQAGPIEGVRIPTDNDGRNRNFGFVTYQRLCAVPFALDLYQGLELFQK 153 (267)
T ss_pred hcccccCCCcchhhhhcchhhheeeecccCCCCCccccccccCCccCccchhhhhhhcCcHHhhhhcccCcCCC
Confidence 34566666 6677888999999999999998888776 677889988877777677665555444333
No 192
>CHL00123 rps6 ribosomal protein S6; Validated
Probab=59.34 E-value=60 Score=22.89 Aligned_cols=55 Identities=20% Similarity=0.236 Sum_probs=30.9
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhC-------C-CeeEEEEeeC----------CCc-EEEEEEcCHHHHHHHHHh
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKF-------G-KLNSVWVAFN----------PPG-FAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~-------G-~V~~v~i~~d----------~rG-~afV~F~~~e~A~~Al~~ 65 (207)
--++|| |.+++++++++++.+++ + +|.+++..-. .++ |.++.|+...++.+.++.
T Consensus 8 YE~~~I--l~p~l~e~~~~~~~~~~~~~i~~~gg~i~~~~~wG~r~LAY~I~k~~~G~Yv~~~f~~~~~~i~eler 81 (97)
T CHL00123 8 YETMYL--LKPDLNEEELLKWIENYKKLLRKRGAKNISVQNRGKRKLSYKINKYEDGIYIQMNYSGNGKLVNSLEK 81 (97)
T ss_pred eeEEEE--ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEeecCeeeeEEcCCCCEEEEEEEEEEECHHHHHHHHH
Confidence 345666 46677777766655443 3 4555432111 333 678889866666666653
No 193
>COG4010 Uncharacterized protein conserved in archaea [Function unknown]
Probab=59.19 E-value=29 Score=26.69 Aligned_cols=46 Identities=15% Similarity=0.187 Sum_probs=36.4
Q ss_pred CCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc
Q psy2347 16 GLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 16 nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l 66 (207)
.|+..+.++-|.++.+.++.|.+.+ -.| ..+.|-+.+.+++||+.+
T Consensus 118 ~L~epl~~eRlqDi~E~hgvIiE~~-E~D----~V~i~Gd~drVk~aLke~ 163 (170)
T COG4010 118 HLREPLAEERLQDIAETHGVIIEFE-EYD----LVAIYGDSDRVKKALKEI 163 (170)
T ss_pred ecCchhHHHHHHHHHHhhheeEEee-ecc----EEEEeccHHHHHHHHHHH
Confidence 4677888999999999999887765 222 356788999999999875
No 194
>KOG3262|consensus
Probab=58.97 E-value=94 Score=25.00 Aligned_cols=16 Identities=38% Similarity=0.480 Sum_probs=7.1
Q ss_pred HHHHHHHHhCCCeeEE
Q psy2347 24 EDLELEFEKFGKLNSV 39 (207)
Q Consensus 24 ~~L~~~F~~~G~V~~v 39 (207)
++|-++=+-|++|.++
T Consensus 89 ~qIGKVDEIfG~i~d~ 104 (215)
T KOG3262|consen 89 EQIGKVDEIFGPINDV 104 (215)
T ss_pred hhhcchhhhccccccc
Confidence 3333333445555444
No 195
>PRK11230 glycolate oxidase subunit GlcD; Provisional
Probab=57.92 E-value=27 Score=32.23 Aligned_cols=46 Identities=20% Similarity=0.144 Sum_probs=33.2
Q ss_pred cHHHHHHHHH----hCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 22 KKEDLELEFE----KFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 22 te~~L~~~F~----~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
+..+|..+|. .+|.|+++.+... .+...++.|.+.++|.+|+..+.
T Consensus 202 ~g~dl~~l~~Gs~GtlGIIt~atlkl~p~p~~~~~~~~~f~~~~~a~~~~~~~~ 255 (499)
T PRK11230 202 PGFDLLALFTGSEGMLGVVTEVTVKLLPKPPVARVLLASFDSVEKAGLAVGDII 255 (499)
T ss_pred CccchHhhhccCCCccEEEEEEEEEEEcCCcceEEEEEECCCHHHHHHHHHHHH
Confidence 3456777776 5777888766544 33566789999999999987754
No 196
>PF08544 GHMP_kinases_C: GHMP kinases C terminal ; InterPro: IPR013750 This domain is found in homoserine kinases (2.7.1.39 from EC), galactokinases (2.7.1.6 from EC) and mevalonate kinases (2.7.1.36 from EC). These kinases make up the GHMP kinase superfamily of ATP-dependent enzymes []. These enzymes are involved in the biosynthesis of isoprenes and amino acids as well as in carbohydrate metabolism. The C-terminal domain of homoserine kinase has a central alpha-beta plait fold and an insertion of four helices, which, together with the N-terminal fold, create a novel nucleotide binding fold [].; PDB: 2R3V_C 4EMD_A 4DXL_A 4ED4_A 2GS8_A 1K47_E 3GON_A 3K17_B 1PIE_A 2AJ4_A ....
Probab=57.87 E-value=52 Score=21.69 Aligned_cols=42 Identities=17% Similarity=0.209 Sum_probs=29.5
Q ss_pred HHHHHHHHhCCCeeEEEEeeCC--CcEEEEEEcCHHHHHHHHHhcC
Q psy2347 24 EDLELEFEKFGKLNSVWVAFNP--PGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 24 ~~L~~~F~~~G~V~~v~i~~d~--rG~afV~F~~~e~A~~Al~~l~ 67 (207)
++|++++.+++ +.-+.+. .+ -.+.|+.+++++.++++++.|.
T Consensus 37 ~~~~~~~~~~G-a~~~~~s-GsG~G~~v~~l~~~~~~~~~v~~~l~ 80 (85)
T PF08544_consen 37 DELKEAAEENG-ALGAKMS-GSGGGPTVFALCKDEDDAERVAEALR 80 (85)
T ss_dssp HHHHHHHHHTT-ESEEEEE-TTSSSSEEEEEESSHHHHHHHHHHHH
T ss_pred HHHHHHHHHCC-CCceecC-CCCCCCeEEEEECCHHHHHHHHHHHH
Confidence 45667777888 3344333 33 6788888989999988888763
No 197
>PF08503 DapH_N: Tetrahydrodipicolinate succinyltransferase N-terminal; InterPro: IPR013710 This domain is found at the N terminus of tetrahydrodipicolinate N-acetyltransferase (DapH) which catalyses the acylation of L-2-amino-6-oxopimelate to 2-N-acetyl-6-oxopimelate in the meso-diaminopimelate/lysine biosynthetic pathway of bacteria, blue-green algae, and plants []. The N-terminal domain as defined here contains three alpha-helices and two twisted hairpin loops []. ; GO: 0047200 tetrahydrodipicolinate N-acetyltransferase activity; PDB: 3CJ8_A 3BV8_A 3R8Y_F.
Probab=57.45 E-value=50 Score=22.90 Aligned_cols=57 Identities=14% Similarity=0.105 Sum_probs=33.7
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeC----------CCcE----EEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFN----------PPGF----AFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d----------~rG~----afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
+.++|.++++.--+-+.|++... .+-| ..|.|-+.++++..|++ |...|....|+..
T Consensus 2 ~a~eII~~I~~skKkTPVKvYv~G~l~~~~~~~~~~fg~~~~~vvfGd~~~i~~~Le~-~~~~I~~y~iE~d 72 (83)
T PF08503_consen 2 DAEEIIRYIKNSKKKTPVKVYVKGDLAGIDFEDVKVFGSGNFGVVFGDWDEIKPFLEA-NKDKIEDYRIEND 72 (83)
T ss_dssp HHHHHHHHHHHCTTB-EEEEEEEESCTC---TTSEEEEESSEEEEEEEHHHHHHHHHH-TTTTEEEEEEEE-
T ss_pred CHHHHHHHHHhCCCCCCEEEEEeeeecCCChhheEEEeCCCcEEEEecHHHHHHHHHh-chhhCcEEEEEec
Confidence 44566666666555444443322 1122 34789999999999997 5666666555543
No 198
>smart00666 PB1 PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
Probab=57.01 E-value=38 Score=22.50 Aligned_cols=54 Identities=22% Similarity=0.321 Sum_probs=34.2
Q ss_pred EEcCCCcCCcHHHHHHHHHh-CCCe-eEEEEee-CCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 13 YVGGLTETVKKEDLELEFEK-FGKL-NSVWVAF-NPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 13 fV~nLp~~~te~~L~~~F~~-~G~V-~~v~i~~-d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
++-.||..++-++|.+...+ |... ..+.|.. |..+ -+|+..+.++.+.|++.+.
T Consensus 13 ~~~~~~~~~s~~dL~~~i~~~~~~~~~~~~l~Y~Dedg-d~v~l~sd~Dl~~a~~~~~ 69 (81)
T smart00666 13 RRLSVPRDISFEDLRSKVAKRFGLDNQSFTLKYQDEDG-DLVSLTSDEDLEEAIEEYD 69 (81)
T ss_pred EEEEECCCCCHHHHHHHHHHHhCCCCCCeEEEEECCCC-CEEEecCHHHHHHHHHHHH
Confidence 33356778888888876654 4331 1333322 3222 3899999999999998653
No 199
>KOG0156|consensus
Probab=56.67 E-value=27 Score=32.21 Aligned_cols=59 Identities=14% Similarity=0.144 Sum_probs=44.3
Q ss_pred EEcCCCcCC---cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEE
Q psy2347 13 YVGGLTETV---KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKL 76 (207)
Q Consensus 13 fV~nLp~~~---te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l 76 (207)
+|+||+.-. ....|.++-++||+|-.+++-. .-.|...+.+.|+.|+.+ ++.++.+|+.
T Consensus 36 iIGnl~~l~~~~~h~~~~~ls~~yGpi~tl~lG~----~~~Vviss~~~akE~l~~-~d~~fa~Rp~ 97 (489)
T KOG0156|consen 36 IIGNLHQLGSLPPHRSFRKLSKKYGPVFTLRLGS----VPVVVISSYEAAKEVLVK-QDLEFADRPD 97 (489)
T ss_pred ccccHHHcCCCchhHHHHHHHHHhCCeEEEEecC----ceEEEECCHHHHHHHHHh-CCccccCCCC
Confidence 567775432 3456777777999999887743 247888999999999987 7888888875
No 200
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=55.65 E-value=91 Score=26.87 Aligned_cols=34 Identities=24% Similarity=0.286 Sum_probs=27.0
Q ss_pred eeEEEEeeC-------CCcEEEEEEcCHHHHHHHHHhcCCc
Q psy2347 36 LNSVWVAFN-------PPGFAFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 36 V~~v~i~~d-------~rG~afV~F~~~e~A~~Al~~l~g~ 69 (207)
|+.|.|+.. ++.||.++|-+...|...++-|..+
T Consensus 174 lEsIDlVna~~~~~~Fp~~YaILtFlnIsMAiEV~dYlk~~ 214 (309)
T PF10567_consen 174 LESIDLVNADEPSKHFPKNYAILTFLNISMAIEVLDYLKSN 214 (309)
T ss_pred EEEEEEeccCcccccCCcceEEEeehhHHhHHHHHHHHHhc
Confidence 667777754 7889999999999988888766543
No 201
>COG3102 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=55.60 E-value=16 Score=28.82 Aligned_cols=41 Identities=22% Similarity=0.286 Sum_probs=27.5
Q ss_pred cEEEEEE----cCHHHHHHHHHhcCCceeCCcEEEEEEeecCCCC
Q psy2347 47 GFAFVEF----SNQIDAEAACDSMNDQDLMGSKLRVEISRGRGRG 87 (207)
Q Consensus 47 G~afV~F----~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k~~~ 87 (207)
++-.||| .-++...+|+..++..-+..--|.|+.+.|+.+.
T Consensus 110 gWEHIEiVlP~~peel~~~~~~l~~~~~l~~~gi~vk~ssp~ge~ 154 (185)
T COG3102 110 GWEHIEIVLPGDPEELNARALALLNDEFLWELGISVKLSSPQGEH 154 (185)
T ss_pred CceeEEEEcCCChHHHHHHHHhhcchhhcccCceEEEecCCCCcc
Confidence 4444554 3456667777777766666666999999887553
No 202
>PRK05772 translation initiation factor IF-2B subunit alpha; Provisional
Probab=54.67 E-value=30 Score=30.67 Aligned_cols=47 Identities=11% Similarity=0.118 Sum_probs=32.8
Q ss_pred CcHHHHHHHHHhCC-Cee----EEEEeeC---CCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 21 VKKEDLELEFEKFG-KLN----SVWVAFN---PPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 21 ~te~~L~~~F~~~G-~V~----~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
+|-.+++++|..-- .|. .+.|+.. +.-+-||+..+.+++.+||+.|.
T Consensus 3 ~~~~~~~~~~~~~~~~i~~~~~~l~~lDq~~lP~~~~~~~~~~~~~v~~aI~~M~ 57 (363)
T PRK05772 3 LTVKEVKELFKPKLLPIIWKDNTLTLLDQSLLPFETVYVDLKTVEEVALAIRNMQ 57 (363)
T ss_pred chHHHHHHHhCCCCceEEecCCEEEEEecCCCCCeEEEEEeCCHHHHHHHHHhCc
Confidence 46788999997531 111 2333322 66789999999999999998764
No 203
>PRK10905 cell division protein DamX; Validated
Probab=53.41 E-value=24 Score=30.64 Aligned_cols=48 Identities=17% Similarity=0.199 Sum_probs=31.2
Q ss_pred CCcHHHHHHHHHhCCCeeEEEEeeC----CCcEEEE--EEcCHHHHHHHHHhcCC
Q psy2347 20 TVKKEDLELEFEKFGKLNSVWVAFN----PPGFAFV--EFSNQIDAEAACDSMND 68 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~v~i~~d----~rG~afV--~F~~~e~A~~Al~~l~g 68 (207)
-.+++.|++|..+++ +....+... ...|..| .|.+.++|++||+.|..
T Consensus 255 ~Ss~~~l~~fakKlg-L~~y~vy~TtRnGkpWYVV~yG~YaSraeAk~AiakLPa 308 (328)
T PRK10905 255 SSNYDNLNGWAKKEN-LKNYVVYETTRNGQPWYVLVSGVYASKEEAKRAVSTLPA 308 (328)
T ss_pred cCCHHHHHHHHHHcC-CCceEEEEeccCCceEEEEEecCCCCHHHHHHHHHHCCH
Confidence 356788888888874 333333222 2233333 68899999999998864
No 204
>PF09707 Cas_Cas2CT1978: CRISPR-associated protein (Cas_Cas2CT1978); InterPro: IPR010152 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799 from INTERPRO. Cas2 is one of four protein families (Cas1 to Cas4) that are associated with CRISPR elements and always occur near a repeat cluster, usually in the order cas3-cas4-cas1-cas2. The function of Cas2 (and Cas1) is unknown. Cas3 proteins appear to be helicases while Cas4 proteins resemble RecB-type exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression [].
Probab=52.33 E-value=33 Score=23.89 Aligned_cols=49 Identities=29% Similarity=0.388 Sum_probs=30.2
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--CCcEEEEEE
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--PPGFAFVEF 53 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--~rG~afV~F 53 (207)
+.+...-|||++++..+-|.-.+.+.+...+-.-+-+..+ ..+|+|-+.
T Consensus 21 l~Ei~~GVyVg~~s~rVRe~lW~~v~~~~~~G~a~m~~~~~neqG~~~~t~ 71 (86)
T PF09707_consen 21 LLEIRPGVYVGNVSARVRERLWERVTEWIGDGSAVMVWSDNNEQGFDFRTL 71 (86)
T ss_pred eEecCCCcEEcCCCHHHHHHHHHHHHhhCCCccEEEEEccCCCCCEEEEEe
Confidence 4566778999999888776666555554332222222222 678888776
No 205
>cd04908 ACT_Bt0572_1 N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains. Included in this CD is the N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains as seen in the uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related ACT domains. These tandem ACT domain proteins belong to the superfamily of ACT regulatory domains.
Probab=51.89 E-value=60 Score=20.59 Aligned_cols=44 Identities=14% Similarity=0.104 Sum_probs=28.8
Q ss_pred cHHHHHHHHHhCC-CeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHh
Q psy2347 22 KKEDLELEFEKFG-KLNSVWVAFN-PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 22 te~~L~~~F~~~G-~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~ 65 (207)
.-.+|-++|.+.+ .|+.+.+... .+...-|.+++.+.|.++|++
T Consensus 14 ~La~v~~~l~~~~inI~~i~~~~~~~~~~~rl~~~~~~~~~~~L~~ 59 (66)
T cd04908 14 RLAAVTEILSEAGINIRALSIADTSEFGILRLIVSDPDKAKEALKE 59 (66)
T ss_pred hHHHHHHHHHHCCCCEEEEEEEecCCCCEEEEEECCHHHHHHHHHH
Confidence 3467778887765 4777765443 345555667777788888776
No 206
>cd00027 BRCT Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Probab=51.69 E-value=52 Score=19.99 Aligned_cols=45 Identities=16% Similarity=0.152 Sum_probs=28.7
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHH
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQI 57 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e 57 (207)
.+++|.+.....+.++|++++..+|-.....+. ...-+|.+.+.+
T Consensus 2 ~~~~i~g~~~~~~~~~l~~~i~~~Gg~v~~~~~---~~~thvI~~~~~ 46 (72)
T cd00027 2 LTFVITGDLPSEERDELKELIEKLGGKVTSSVS---KKTTHVIVGSDA 46 (72)
T ss_pred CEEEEEecCCCcCHHHHHHHHHHcCCEEecccc---CCceEEEECCCC
Confidence 467777776678899999999999873332222 224445554443
No 207
>cd05992 PB1 The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as a noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants.
Probab=51.67 E-value=64 Score=21.21 Aligned_cols=50 Identities=24% Similarity=0.302 Sum_probs=32.8
Q ss_pred CC-cCCcHHHHHHHHHh-CCCe-eEEEEe-eCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 17 LT-ETVKKEDLELEFEK-FGKL-NSVWVA-FNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 17 Lp-~~~te~~L~~~F~~-~G~V-~~v~i~-~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
++ ..++-++|++...+ |... ..+.|. +|..+ -+|+..+.++.+.|++.+.
T Consensus 16 ~~~~~~s~~~L~~~i~~~~~~~~~~~~l~y~D~e~-d~v~l~sd~Dl~~a~~~~~ 69 (81)
T cd05992 16 VVSRSISFEDLRSKIAEKFGLDAVSFKLKYPDEDG-DLVTISSDEDLEEAIEEAR 69 (81)
T ss_pred EecCCCCHHHHHHHHHHHhCCCCCcEEEEeeCCCC-CEEEeCCHHHHHHHHHHHh
Confidence 44 77888888776654 4432 233332 22233 7899999999999998754
No 208
>COG0002 ArgC Acetylglutamate semialdehyde dehydrogenase [Amino acid transport and metabolism]
Probab=50.77 E-value=36 Score=30.01 Aligned_cols=39 Identities=23% Similarity=0.319 Sum_probs=25.1
Q ss_pred CCcCCcHHHHHHHHHh-CCCeeEEEEeeC----------CCcEEEEEEcC
Q psy2347 17 LTETVKKEDLELEFEK-FGKLNSVWVAFN----------PPGFAFVEFSN 55 (207)
Q Consensus 17 Lp~~~te~~L~~~F~~-~G~V~~v~i~~d----------~rG~afV~F~~ 55 (207)
|...+|++||+++|++ |..-..|+|+.. -..||.|-|..
T Consensus 253 l~~~~t~~~i~~~y~~~Y~~epfVrv~~~~~~P~~k~V~GsN~cdIgf~~ 302 (349)
T COG0002 253 LKDLVTLEELHAAYEEFYAGEPFVRVVPEGGYPDTKAVAGSNFCDIGFAV 302 (349)
T ss_pred cCCCCCHHHHHHHHHHHhCCCCeEEEecCCCCCChhhhcCCcceEEEEEE
Confidence 4456899999999987 444455565542 23466665554
No 209
>PF02829 3H: 3H domain; InterPro: IPR004173 The 3H domain is named after its three highly conserved histidine residues. The 3H domain appears to be a small molecule-binding domain, based on its occurrence with other domains []. Several proteins carrying this domain are transcriptional regulators from the biotin repressor family. The transcription regulator TM1602 from Thermotoga maritima is a DNA-binding protein thought to belong to a family of de novo NAD synthesis pathway regulators. TM1602 has an N-terminal DNA-binding domain and a C-terminal 3H regulatory domain. The N-terminal domain appears to bind to the NAD promoter region and repress the de novo NAD biosynthesis operon, while the C-terminal 3H domain may bind to nicotinamide, nicotinic acid, or other substrate/products []. The 3H domain has a 2-layer alpha/beta sandwich fold.; GO: 0005488 binding; PDB: 1J5Y_A.
Probab=50.75 E-value=50 Score=23.58 Aligned_cols=51 Identities=16% Similarity=0.188 Sum_probs=35.1
Q ss_pred CcCCcHHHHHHHHHhCCCeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 18 TETVKKEDLELEFEKFGKLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 18 p~~~te~~L~~~F~~~G~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
..+-++++|..+...=+.|++|.+... ..-.+.+...+.+++++.++.|+.
T Consensus 6 ~~~~~~~EL~~IVd~Gg~V~DV~veHp~YG~i~~~L~i~sr~Dv~~Fi~~l~~ 58 (98)
T PF02829_consen 6 TPDEIEDELEIIVDNGGRVLDVIVEHPVYGEITGNLNISSRRDVDKFIEKLEK 58 (98)
T ss_dssp -GGGHHHHHHHHHHTT-EEEEEEEEETTTEEEEEEEEE-SHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHCCCEEEEEEEeCCCCcEEEEEEecCCHHHHHHHHHHHhc
Confidence 445567888888776667888877665 233566788899999998887654
No 210
>PF10141 ssDNA-exonuc_C: Single-strand DNA-specific exonuclease, C terminal domain; InterPro: IPR018779 This entry represents a domain found at the C terminus of a set of single-stranded DNA-specific exonucleases, including RecJ. Its function has not, as yet, been determined.
Probab=50.09 E-value=38 Score=27.06 Aligned_cols=58 Identities=10% Similarity=0.047 Sum_probs=39.0
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHh
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~ 65 (207)
......+|.|-.||. +.++|++++ +...++.|.++.. ....-|..+.+.++-+++.+.
T Consensus 48 ~~~~~~~lVl~D~P~--~~~~l~~~l-~~~~~~~Iyl~f~~~~~~y~~~~P~Re~F~~~Y~~ 106 (195)
T PF10141_consen 48 EDESYDNLVLLDLPP--SLEQLKELL-QQQQPERIYLLFYQQDSAYFEGMPTREQFKKLYKF 106 (195)
T ss_pred ccccCCEEEEEeCCC--CHHHHHHHH-HhCCcceEEEEECCccchhhcCCCCHHHHHHHHHH
Confidence 345667888888887 478899998 4667788777664 223335667777776665543
No 211
>KOG2187|consensus
Probab=49.38 E-value=14 Score=34.26 Aligned_cols=39 Identities=28% Similarity=0.402 Sum_probs=32.8
Q ss_pred CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 45 PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 45 ~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
...+++++|++++.+.+|+..+++.......+.|..+..
T Consensus 62 ~~~~~~~~~et~~~~~ka~~~v~g~~~k~~~~~~~~~~~ 100 (534)
T KOG2187|consen 62 MPKYAYVTFETPSDAGKAINLVDGLLYKGFILRVQLGAT 100 (534)
T ss_pred CCCceEEEEeccchhhhHHHHHhhhhhhcchhhhhhccc
Confidence 356999999999999999999999888777777766654
No 212
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=49.29 E-value=19 Score=23.84 Aligned_cols=24 Identities=17% Similarity=0.338 Sum_probs=19.6
Q ss_pred EEEEEEcCHHHHHHHHHhcCCcee
Q psy2347 48 FAFVEFSNQIDAEAACDSMNDQDL 71 (207)
Q Consensus 48 ~afV~F~~~e~A~~Al~~l~g~~i 71 (207)
+.+|+|.+..+|.+|-+.|....|
T Consensus 3 ~~~i~F~st~~a~~~ek~lk~~gi 26 (73)
T PF11823_consen 3 YYLITFPSTHDAMKAEKLLKKNGI 26 (73)
T ss_pred eEEEEECCHHHHHHHHHHHHHCCC
Confidence 678999999999988887766544
No 213
>COG5227 SMT3 Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones]
Probab=49.17 E-value=53 Score=23.22 Aligned_cols=59 Identities=19% Similarity=0.329 Sum_probs=36.3
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHh-CCC-eeEEEEeeC------CCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEK-FGK-LNSVWVAFN------PPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~-~G~-V~~v~i~~d------~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.+.||+ +|...++-..|.+.|++ .|+ ..+++++.| -+-=+=+.|++-++.++..+++-+
T Consensus 34 ~telfF-kiKktT~f~klm~af~~rqGK~m~slRfL~dG~rI~~dqTP~dldmEdnd~iEav~eQvGG 100 (103)
T COG5227 34 GTELFF-KIKKTTTFKKLMDAFSRRQGKNMSSLRFLFDGKRIDLDQTPGDLDMEDNDEIEAVTEQVGG 100 (103)
T ss_pred CCEEEE-EEeccchHHHHHHHHHHHhCcCcceeEEEEcceecCCCCChhhcCCccchHHHHHHHHhcC
Confidence 344444 56667777777777765 443 456666665 122344577887887777776543
No 214
>PF07876 Dabb: Stress responsive A/B Barrel Domain; InterPro: IPR013097 The stress-response A/B barrel domain is found in a class of stress-response proteins in plants. It is also found in some bacterial fructose-bisphosphate aldolase such as at the C terminus of a fructose 1,6-bisphosphate aldolase from Hydrogenophilus thermoluteolus (Q9ZA13 from SWISSPROT) []. Q93NG5 from SWISSPROT is found in the pA01 plasmid, which encodes genes for molybdopterin uptake and degradation of plant alkaloid nicotine. The stress-response A/B barrel domain forms a very stable dimer. This dimer belongs to the superfamily of dimeric alpha+beta barrels in which the two beta-sheets form a beta-barrel. The two molecules in the dimer are related by a 2-fold axis parallel to helix H1 and beta-strands B3 and B4. C-terminal residues extending from the beta4 strand of each monomer wrap around and connect with the beta2 strand and alpha1 helix of the opposing monomer to form the dimer interface [, , ].The outer surface of the beta-sheets of the two molecules forms a beta-barrel-like structure defining a central pore. The function of the stress-response A/B barrel domain is unknown [, , ], but it is upregulated in response to salt stress in Populus balsamifera (balsam poplar) []. Some proteins known to contain a stress response A/B barrel domain are listed below: - Arabidopsis thaliana At3g17210 - Arabidopsis thaliana At5g22580 -Populus tremula stable protein 1 (SP-1)(Populus species), a thermostable stress-responsive protein. - Pseudomonas hydrogenothermophila fructose 1,6-bisphosphate aldolase (cbbA). The structure of one of these proteins has been solved (Q9LUV2 from SWISSPROT) and the domain forms an alpha-beta barrel dimer [].; PDB: 3BB5_E 3FMB_A 3BDE_B 2QYC_A 1Q53_B 2Q3P_A 1Q4R_A 3BN7_A 3BGU_B 1RJJ_B ....
Probab=48.85 E-value=83 Score=21.35 Aligned_cols=52 Identities=21% Similarity=0.205 Sum_probs=32.6
Q ss_pred EEEcCCCcCCcHHHHHHHH-------HhCCCeeEEEEeeC----C--Cc---EEEEEEcCHHHHHHHH
Q psy2347 12 VYVGGLTETVKKEDLELEF-------EKFGKLNSVWVAFN----P--PG---FAFVEFSNQIDAEAAC 63 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F-------~~~G~V~~v~i~~d----~--rG---~afV~F~~~e~A~~Al 63 (207)
|.+-.|.++++++++.+++ .+.-.|+.+.+-++ . ++ ..+++|+|+++.++-+
T Consensus 4 ivlfklk~~~~~~~~~~~~~~l~~l~~~ip~i~~~~~G~~~~~~~~~~~~~~~~~~~F~s~~~l~~Y~ 71 (97)
T PF07876_consen 4 IVLFKLKPDATEEEIEEVLEALRALKDKIPGIVSFEVGRNFSPEDLAKGYDHALVSTFESEEDLDAYQ 71 (97)
T ss_dssp EEEEEESTTTCHHHHHHHHHHHHHHHHHSTTECEEEEEEESSTSSTSTT-SEEEEEEESSHHHHHHHH
T ss_pred EEEEEECCCCCHHHHHHHHHHHHhcccCCCceEEEEEEcccCcccccCCCcEEEEEEECCHHHHHHHH
Confidence 3344677888887775544 34455777776554 2 23 3457999988876544
No 215
>PF00564 PB1: PB1 domain; InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=48.63 E-value=26 Score=23.38 Aligned_cols=62 Identities=23% Similarity=0.395 Sum_probs=37.7
Q ss_pred CCCcCCcHHHHHHHHHh-CCCe-eEEEEee-CCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 16 GLTETVKKEDLELEFEK-FGKL-NSVWVAF-NPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 16 nLp~~~te~~L~~~F~~-~G~V-~~v~i~~-d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
.++..++-++|++..++ |... ..+.|.. |..+ -.|+..+.++.+.|++.++.. ..+.|+|.+
T Consensus 17 ~~~~~~s~~~L~~~i~~~~~~~~~~~~l~Y~D~dg-D~V~i~sd~Dl~~a~~~~~~~--~~~~lrl~v 81 (84)
T PF00564_consen 17 SLPSDVSFDDLRSKIREKFGLLDEDFQLKYKDEDG-DLVTISSDEDLQEAIEQAKES--GSKTLRLFV 81 (84)
T ss_dssp EECSTSHHHHHHHHHHHHHTTSTSSEEEEEEETTS-SEEEESSHHHHHHHHHHHHHC--TTSCEEEEE
T ss_pred EcCCCCCHHHHHHHHHHHhCCCCccEEEEeeCCCC-CEEEeCCHHHHHHHHHHHHhc--CCCcEEEEE
Confidence 45677788777776654 4432 2333321 2233 689999999999999876433 334555543
No 216
>PF00398 RrnaAD: Ribosomal RNA adenine dimethylase; InterPro: IPR001737 This family of proteins include rRNA adenine dimethylases (e.g. KsgA) and the Erythromycin resistance methylases (Erm). The bacterial enzyme KsgA catalyses the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases in 16S rRNA. This enzyme and the resulting modified adenosine bases appear to be conserved in all species of eubacteria, eukaryotes, and archaea, and in eukaryotic organelles. Bacterial resistance to the aminoglycoside antibiotic kasugamycin involves inactivation of KsgA and resulting loss of the dimethylations, with modest consequences to the overall fitness of the organism. In contrast, the yeast ortholog, Dim1, is essential. In Saccharomyces cerevisiae (Baker's yeast), and presumably in other eukaryotes, the enzyme performs a vital role in pre-rRNA processing in addition to its methylating activity. The best conserved region in these enzymes is located in the N-terminal section and corresponds to a region that is probably involved in S-adenosyl methionine (SAM) binding domain. The crystal structure of KsgA from Escherichia coli has been solved to a resolution of 2.1A. It bears a strong similarity to the crystal structure of ErmC' from Bacillus stearothermophilus and a lesser similarity to the yeast mitochondrial transcription factor, sc-mtTFB []. The Erm family of RNA methyltransferases, which methylate a single adenosine base in 23S rRNA confer resistance to the MLS-B group of antibiotics. Despite their sequence similarity, the two enzyme families have strikingly different levels of regulation that remain to be elucidated. Other orthologs, of this family include the yeast and Homo sapiens (Human) mitochondrial transcription factors (MTF1 and h-mtTFB respectively), which are nuclear encoded []. Human-mtTFB is able to stimulate transcription in vitro independently of its S-adenosylmethionine binding and rRNA methyltransferase activity [].; GO: 0000179 rRNA (adenine-N6,N6-)-dimethyltransferase activity, 0008649 rRNA methyltransferase activity, 0000154 rRNA modification; PDB: 3FTF_A 3R9X_B 3FTE_A 3FTC_A 3FTD_A 3GRY_A 3FYC_A 3GRU_A 3FYD_A 3GRV_A ....
Probab=47.88 E-value=27 Score=29.08 Aligned_cols=24 Identities=21% Similarity=0.200 Sum_probs=20.9
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHH
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFE 31 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~ 31 (207)
....++|+|||..++++-|.+++.
T Consensus 96 ~~~~~vv~NlPy~is~~il~~ll~ 119 (262)
T PF00398_consen 96 NQPLLVVGNLPYNISSPILRKLLE 119 (262)
T ss_dssp SSEEEEEEEETGTGHHHHHHHHHH
T ss_pred CCceEEEEEecccchHHHHHHHhh
Confidence 356788999999999999999987
No 217
>PF09869 DUF2096: Uncharacterized protein conserved in archaea (DUF2096); InterPro: IPR017098 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=47.74 E-value=1.1e+02 Score=24.05 Aligned_cols=51 Identities=16% Similarity=0.097 Sum_probs=39.8
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
++-| +|+..+.++-|.++-+-++.|.+. .+ -.-.+.|-+.+.+++||+.+.
T Consensus 114 ~iRv-~l~~~i~~erl~ei~E~~gvI~Ef---ee--~~~V~I~Gdke~Ik~aLKe~s 164 (169)
T PF09869_consen 114 TIRV-KLKKPIQEERLQEISEWHGVIFEF---EE--DDKVVIEGDKERIKKALKEFS 164 (169)
T ss_pred eEEE-ecCccchHHHHHHHHHHhceeEEe---cC--CcEEEEeccHHHHHHHHHHHH
Confidence 3444 689999999999999999988776 11 134678889999999998753
No 218
>PF08156 NOP5NT: NOP5NT (NUC127) domain; InterPro: IPR012974 This N-terminal domain is found in RNA-binding proteins of the NOP5 family [].
Probab=47.58 E-value=6.1 Score=26.16 Aligned_cols=39 Identities=18% Similarity=0.272 Sum_probs=25.9
Q ss_pred HHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 24 EDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 24 ~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
++|++.|..+....++. +-.+|..|++.++|..++..+.
T Consensus 27 ~~v~~~~~~~~~f~k~v-----kL~aF~pF~s~~~ALe~~~ais 65 (67)
T PF08156_consen 27 EEVQKSFSDPEKFSKIV-----KLKAFSPFKSAEEALENANAIS 65 (67)
T ss_pred HHHHHHHcCHHHHhhhh-----hhhhccCCCCHHHHHHHHHHhh
Confidence 57777776544333321 2368999999999888877653
No 219
>PF08734 GYD: GYD domain; InterPro: IPR014845 These proteins of unknown function are usually less than 100 amino acids in length. They may belong to the dimeric alpha/beta barrel superfamily.
Probab=47.27 E-value=95 Score=21.57 Aligned_cols=44 Identities=25% Similarity=0.254 Sum_probs=30.5
Q ss_pred HHHHHHHHHhCC-CeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 23 KEDLELEFEKFG-KLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 23 e~~L~~~F~~~G-~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l 66 (207)
++.+++++++.| +|+++.+..- ---...+++.|.++|.++.-.+
T Consensus 22 ~~a~~~~~e~~Gg~l~~~y~t~G~yD~v~i~eaPD~~~a~~~~l~i 67 (91)
T PF08734_consen 22 AEAVRALIEALGGKLKSFYWTLGEYDFVVIVEAPDDETAAAASLAI 67 (91)
T ss_pred HHHHHHHHHHcCCEEEEEEEecCCCCEEEEEEcCCHHHHHHHHHHH
Confidence 456778888876 4666766654 3346788999999888776543
No 220
>KOG3432|consensus
Probab=46.67 E-value=34 Score=25.01 Aligned_cols=25 Identities=16% Similarity=0.315 Sum_probs=19.4
Q ss_pred CcCCcHHHHHHHHHhCCCeeEEEEe
Q psy2347 18 TETVKKEDLELEFEKFGKLNSVWVA 42 (207)
Q Consensus 18 p~~~te~~L~~~F~~~G~V~~v~i~ 42 (207)
...+|+++|++.|.+|-.-.+|.|+
T Consensus 42 ~~~Tt~~eiedaF~~f~~RdDIaIi 66 (121)
T KOG3432|consen 42 DSKTTVEEIEDAFKSFTARDDIAII 66 (121)
T ss_pred eccCCHHHHHHHHHhhccccCeEEE
Confidence 4589999999999999765555443
No 221
>PF09250 Prim-Pol: Bifunctional DNA primase/polymerase, N-terminal; InterPro: IPR015330 Members of this family adopt a structure consisting of a core of antiparallel beta sheets. They are found in various bacterial hypothetical proteins, and have been shown to harbour both primase and polymerase activities []. ; PDB: 1RO0_A 1RNI_A 1RO2_A 3M1M_A.
Probab=45.68 E-value=45 Score=25.15 Aligned_cols=36 Identities=14% Similarity=0.378 Sum_probs=23.8
Q ss_pred CcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHH
Q psy2347 21 VKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQI 57 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e 57 (207)
+++++|+++|.++ .-..|-|++...++..|++++++
T Consensus 32 ~~~~~i~~~~~~~-~~~~igl~~~~~gl~viDiD~~~ 67 (163)
T PF09250_consen 32 TDPEQIERWWRRY-PGANIGLVLGPSGLVVIDIDNKD 67 (163)
T ss_dssp THHHHHHHHHH---TT-EEEEESSGGGEEEEEES-HH
T ss_pred CCHHHHHHHHhhC-CCceEEEEecCCceEEEECCCcc
Confidence 5788999999877 33444455544789999999988
No 222
>PF12829 Mhr1: Transcriptional regulation of mitochondrial recombination; InterPro: IPR024629 These proteins are involved in regulation of RNA polymerase II-dependent transcription. They are also involved in regulation of mitochondrial DNA recombination, maintenance, repair, and generation of homoplasmic cells [, , , ].
Probab=45.43 E-value=33 Score=24.23 Aligned_cols=51 Identities=16% Similarity=0.109 Sum_probs=33.7
Q ss_pred CCcCCcHHHHHHHHHhCCC-eeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 17 LTETVKKEDLELEFEKFGK-LNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 17 Lp~~~te~~L~~~F~~~G~-V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
+.+.+++..|..-|..-+. -+...+.+| =+.+|.|+|.+.+.+.+|...|-
T Consensus 20 ~~p~l~~~~i~~Q~~~~gkk~~pp~lRkD~W~pm~vv~f~~~~~g~~~yq~Lr 72 (91)
T PF12829_consen 20 QTPNLDNNQILKQFPFPGKKNKPPSLRKDYWRPMCVVNFPNYEVGVSAYQKLR 72 (91)
T ss_pred cCcccChhHHHHhccCCCcccCCchhccccceEeEEEECCChHHHHHHHHHHH
Confidence 3456677777666654443 222344444 35699999999999999987653
No 223
>cd01201 Neurobeachin Neurobeachin Pleckstrin homology-like domain. Neurobeachin Pleckstrin homology-like domain. This domain is found in the large multi-domain eukaryotic protein Nerubeachin, N-terminal to the BEACH domain. This PH-like domain interacts with the BEACH domain in the same manner used by other PH-like domains to bind peptides.
Probab=44.58 E-value=1.2e+02 Score=22.07 Aligned_cols=52 Identities=12% Similarity=0.099 Sum_probs=33.9
Q ss_pred CCCcCCcHHHHHHHHHhCCCe--eEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 16 GLTETVKKEDLELEFEKFGKL--NSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 16 nLp~~~te~~L~~~F~~~G~V--~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
.+...+.-++|++++..-=-+ .-++|....+.-.|+.|.+.+++++.+.+|.
T Consensus 53 ~~~~~w~ls~Ir~v~~RRylLr~~alEiF~~d~~~~f~~F~~~~~~k~vv~~lp 106 (108)
T cd01201 53 ELHGKWPFSEIRAIFSRRYLLQNTALELFLASRTSIFFAFPDQNAVKKVVYALP 106 (108)
T ss_pred cccceeeHHHHHHHHHHhhhcccceEEEEEeCCceEEEEeCcHHHHHHHHhhcC
Confidence 344456667888888652122 1234433335678999999999999988764
No 224
>PHA01632 hypothetical protein
Probab=44.57 E-value=25 Score=22.47 Aligned_cols=19 Identities=16% Similarity=0.281 Sum_probs=15.1
Q ss_pred EcCCCcCCcHHHHHHHHHh
Q psy2347 14 VGGLTETVKKEDLELEFEK 32 (207)
Q Consensus 14 V~nLp~~~te~~L~~~F~~ 32 (207)
|-.+|..-||++|++.+.+
T Consensus 21 ieqvp~kpteeelrkvlpk 39 (64)
T PHA01632 21 IEQVPQKPTEEELRKVLPK 39 (64)
T ss_pred hhhcCCCCCHHHHHHHHHH
Confidence 3478999999999987654
No 225
>cd04904 ACT_AAAH ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH). ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH): Phenylalanine hydroxylases (PAH), tyrosine hydroxylases (TH) and tryptophan hydroxylases (TPH), both peripheral (TPH1) and neuronal (TPH2) enzymes. This family of enzymes shares a common catalytic mechanism, in which dioxygen is used by an active site containing a single, reduced iron atom to hydroxylate an unactivated aromatic substrate, concomitant with a two-electron oxidation of tetrahydropterin (BH4) cofactor to its quinonoid dihydropterin form. PAH catalyzes the hydroxylation of L-Phe to L-Tyr, the first step in the catabolic degradation of L-Phe; TH catalyses the hydroxylation of L-Tyr to 3,4-dihydroxyphenylalanine, the rate limiting step in the biosynthesis of catecholamines; and TPH catalyses the hydroxylation of L-Trp to 5-hydroxytryptophan, the rate limiting step in the biosynthesis of 5-hydroxy
Probab=44.36 E-value=90 Score=20.49 Aligned_cols=47 Identities=19% Similarity=0.152 Sum_probs=27.5
Q ss_pred cHHHHHHHHHhCCC-eeEEEEeeC----CCcEEEEEEc-CHHHHHHHHHhcCC
Q psy2347 22 KKEDLELEFEKFGK-LNSVWVAFN----PPGFAFVEFS-NQIDAEAACDSMND 68 (207)
Q Consensus 22 te~~L~~~F~~~G~-V~~v~i~~d----~rG~afV~F~-~~e~A~~Al~~l~g 68 (207)
+-.++.+.|+.++- +++|+-... ..=.-||+++ +.+++++|++.|+.
T Consensus 13 ~L~~vL~~f~~~~iNlt~IeSRP~~~~~~~y~Ffvd~~~~~~~~~~~l~~L~~ 65 (74)
T cd04904 13 ALARALKLFEEFGVNLTHIESRPSRRNGSEYEFFVDCEVDRGDLDQLISSLRR 65 (74)
T ss_pred HHHHHHHHHHHCCCcEEEEECCCCCCCCceEEEEEEEEcChHHHHHHHHHHHH
Confidence 34667777887752 444433222 1224567877 55567788887654
No 226
>cd06410 PB1_UP2 Uncharacterized protein 2. The PB1 domain is a modular domain mediating specific protein-protein interaction which play a role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions.
Probab=44.30 E-value=1.1e+02 Score=21.63 Aligned_cols=61 Identities=13% Similarity=0.120 Sum_probs=37.3
Q ss_pred CCcCCcHHHHHHHHHhCCCee---EEEE--eeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 17 LTETVKKEDLELEFEKFGKLN---SVWV--AFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 17 Lp~~~te~~L~~~F~~~G~V~---~v~i--~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
|+.+++-.+|+..+++.-.+. .++. +.+.. -+.|...+.|+.+..++..+...-.-.+|+|
T Consensus 29 V~r~~s~~el~~kl~~~~~~~~~~~lky~Lp~edl-d~Lisv~~DeDl~~M~~e~~~~~~~~~rirv 94 (97)
T cd06410 29 VDRSISFKELVSKLSELFGAGVVVTLKYQLPDEDL-DALISVSNDEDLKNMMEEYDRLSGGSARLRV 94 (97)
T ss_pred EcCCCCHHHHHHHHHHHhCCCCceEEEEEcCCCCc-ceeEEecCcHHHHHHHHhhccccCCCceEEE
Confidence 577888888887665532222 2222 22222 3899999999999999886544223334444
No 227
>PF11411 DNA_ligase_IV: DNA ligase IV; InterPro: IPR021536 DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface []. ; GO: 0003910 DNA ligase (ATP) activity; PDB: 3II6_Y 2E2W_A 1IK9_C.
Probab=43.33 E-value=19 Score=20.87 Aligned_cols=16 Identities=13% Similarity=0.302 Sum_probs=10.3
Q ss_pred cCCcHHHHHHHHHhCC
Q psy2347 19 ETVKKEDLELEFEKFG 34 (207)
Q Consensus 19 ~~~te~~L~~~F~~~G 34 (207)
.++++++|+++|.+..
T Consensus 19 ~Dtd~~~Lk~vF~~i~ 34 (36)
T PF11411_consen 19 VDTDEDQLKEVFNRIK 34 (36)
T ss_dssp S---HHHHHHHHHCS-
T ss_pred ccCCHHHHHHHHHHhc
Confidence 4689999999998753
No 228
>PTZ00191 60S ribosomal protein L23a; Provisional
Probab=42.89 E-value=54 Score=25.18 Aligned_cols=52 Identities=19% Similarity=0.231 Sum_probs=31.9
Q ss_pred EEcCCCcCCcHHHHHHHHHh-CC-CeeEEEEeeC--CCcEEEEEEcCHHHHHHHHH
Q psy2347 13 YVGGLTETVKKEDLELEFEK-FG-KLNSVWVAFN--PPGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 13 fV~nLp~~~te~~L~~~F~~-~G-~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~ 64 (207)
|+--++..++..+|++.+++ |. .|..|..+.. ..--|||++....+|.....
T Consensus 85 yvF~Vd~kAnK~qIK~AVEklf~VkV~kVNTli~p~g~KKA~V~L~~~~~aidva~ 140 (145)
T PTZ00191 85 LVFIVDQRANKTQIKKAVEKLYDVKVVKVNTLITPDGLKKAYIRLSPDVDALDVAN 140 (145)
T ss_pred EEEEEcCCCCHHHHHHHHHHHhCCeeEEEEeEEcCCCceEEEEEECCCCcHHHHHH
Confidence 33346778899999988887 43 2444443333 22369999976665544433
No 229
>cd04889 ACT_PDH-BS-like C-terminal ACT domain of the monofunctional, NAD dependent, prephenate dehydrogenase (PDH) enzyme that catalyzes the formation of 4-hydroxyphenylpyruvate from prephenate. Included in this CD is the C-terminal ACT domain of the monofunctional, NAD dependent, prephenate dehydrogenase (PDH) enzyme that catalyzes the formation of 4-hydroxyphenylpyruvate from prephenate, found in Bacillus subtilis (BS) and other Firmicutes, Deinococci, and Bacteroidetes. PDH is the first enzyme in the aromatic amino acid pathway specific for the biosynthesis of tyrosine. This enzyme is feedback inhibited by tyrosine in B. subtilis and other microorganisms. Both phenylalanine and tryptophan have been shown to be inhibitors of this activity in B. subtilis. Bifunctional chorismate mutase-PDH (TyrA) enzymes such as those seen in Escherichia coli do not contain an ACT domain. Also included in this CD is the N-terminal ACT domain of a novel protein composed almost entirely of two tandem A
Probab=42.65 E-value=76 Score=19.17 Aligned_cols=40 Identities=15% Similarity=0.187 Sum_probs=26.2
Q ss_pred HHHHHHHHhCC-CeeEEEEeeC--CCcEEEEEEcCHHHHHHHH
Q psy2347 24 EDLELEFEKFG-KLNSVWVAFN--PPGFAFVEFSNQIDAEAAC 63 (207)
Q Consensus 24 ~~L~~~F~~~G-~V~~v~i~~d--~rG~afV~F~~~e~A~~Al 63 (207)
.+|.++|.+.+ .|+.+.+... .+....+++++.+.|.+++
T Consensus 13 ~~i~~~l~~~~inI~~~~~~~~~~~~~~~~~~v~~~~~a~~~l 55 (56)
T cd04889 13 AEVTEILAEAGINIKAISIAETRGEFGILRLIFSDPERAKEVL 55 (56)
T ss_pred HHHHHHHHHcCCCEeeEEEEEccCCcEEEEEEECCHHHHHHHh
Confidence 45556676654 4766665543 3566677888888777765
No 230
>PF14581 SseB_C: SseB protein C-terminal domain
Probab=42.43 E-value=47 Score=23.56 Aligned_cols=49 Identities=20% Similarity=0.388 Sum_probs=29.2
Q ss_pred CCCcEEEEcCCCcCCc--HHHHHHHHHhCCCeeEEEEee----C--CCcEEEEEEcC
Q psy2347 7 ERGTRVYVGGLTETVK--KEDLELEFEKFGKLNSVWVAF----N--PPGFAFVEFSN 55 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~t--e~~L~~~F~~~G~V~~v~i~~----d--~rG~afV~F~~ 55 (207)
+...+|.|+-.....+ .+.|.++|++...|..+.+.. + +.-+-.|+++.
T Consensus 3 ~~g~~v~l~~P~~~p~~l~~aL~~~~~~~~~V~~Ayl~~~~~~~~~~~~li~vd~~~ 59 (108)
T PF14581_consen 3 PKGEKVLLGEPEEEPTDLLAALSEYFKQHKNVRAAYLALMQDEDEQPSLLIGVDFDG 59 (108)
T ss_pred CCCCEEEecCCccCHHHHHHHHHHHHhhCccHHHhHHHHhhccCCCceEEEEEeccC
Confidence 4556777765433333 367888999998887654322 2 23344466666
No 231
>COG5507 Uncharacterized conserved protein [Function unknown]
Probab=42.31 E-value=24 Score=25.36 Aligned_cols=20 Identities=25% Similarity=0.458 Sum_probs=16.7
Q ss_pred cEEEEEEcCHHHHHHHHHhc
Q psy2347 47 GFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 47 G~afV~F~~~e~A~~Al~~l 66 (207)
-|..++|.+.++.++|..++
T Consensus 67 vFsW~~Y~skq~rDA~~~km 86 (117)
T COG5507 67 VFSWIEYPSKQVRDAANAKM 86 (117)
T ss_pred EEEEEEcCchhHHHHHHHHh
Confidence 58899999999988887653
No 232
>KOG0633|consensus
Probab=42.30 E-value=34 Score=29.32 Aligned_cols=56 Identities=13% Similarity=0.164 Sum_probs=37.2
Q ss_pred CCcEEEEcCC--C--cCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCc
Q psy2347 8 RGTRVYVGGL--T--ETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 8 ~~~~lfV~nL--p--~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~ 69 (207)
.-+.+|+..- | ..+.+++|+++++..- +..|+.| .|||+|.-.+++.+.+++..+.
T Consensus 160 ~iK~~F~tSPgNPtg~~ik~~di~KiLe~p~---nglVVvD---EAYidFsg~~S~~~lV~kYpNL 219 (375)
T KOG0633|consen 160 KIKCIFLTSPGNPTGSIIKEDDILKILEMPD---NGLVVVD---EAYIDFSGVESRMKLVKKYPNL 219 (375)
T ss_pred cceEEEEcCCCCCCcccccHHHHHHHHhCCC---CcEEEEe---eeeEeeccccccchHhHhCCce
Confidence 3456666532 2 2367888888888763 2334444 7999999888888887776553
No 233
>PF05036 SPOR: Sporulation related domain; InterPro: IPR007730 This 70 residue domain is composed of two 35 residue repeats that are found in bacterial proteins involved in sporulation and cell division, such as FtsN, CwlM and RlpA. This repeat might be involved in binding peptidoglycan. FtsN is an essential cell division protein with a simple bitopic topology: a short N-terminal cytoplasmic segment fused to a large carboxy periplasmic domain through a single transmembrane domain. The repeats lie at the periplasmic C terminus, which has an RNP-like fold []. FtsN localises to the septum ring complex. The CwlM protein is a cell wall hydrolase, where the C-terminal region, including the repeats, determines substrate specificity []. RlpA is a rare lipoprotein A protein that may be important for cell division. Its N-terminal cysteine may be attached to thioglyceride and N-fatty acyl residues [].; PDB: 1X60_A 1UTA_A.
Probab=42.29 E-value=9.8 Score=24.61 Aligned_cols=16 Identities=38% Similarity=0.497 Sum_probs=14.3
Q ss_pred EEcCHHHHHHHHHhcC
Q psy2347 52 EFSNQIDAEAACDSMN 67 (207)
Q Consensus 52 ~F~~~e~A~~Al~~l~ 67 (207)
.|.+.++|+++++.|.
T Consensus 50 ~f~~~~~A~~~~~~l~ 65 (76)
T PF05036_consen 50 PFSSREEAEAALRKLK 65 (76)
T ss_dssp CECTCCHHHHHHHHHH
T ss_pred CCCCHHHHHHHHHHHh
Confidence 7899999999999886
No 234
>cd00412 pyrophosphatase Inorganic pyrophosphatase. These enzymes hydrolyze inorganic pyrophosphate (PPi) to two molecules of orthophosphates (Pi). The reaction requires bivalent cations. The enzymes in general exist as homooligomers.
Probab=42.19 E-value=25 Score=27.31 Aligned_cols=39 Identities=10% Similarity=0.186 Sum_probs=27.3
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l 66 (207)
.-++|+++|+.|-.++. .+-.-+..|.+.++|+++|++.
T Consensus 115 ~l~~I~~fF~~YK~le~------~k~~~~~g~~~~~~A~~~I~~~ 153 (155)
T cd00412 115 LLDEIKHFFEHYKDLEG------KKEVKVAGWKDKEEALKIIKES 153 (155)
T ss_pred HHHHHHHHHHHhcccCC------CCceEECcCcCHHHHHHHHHHH
Confidence 45688999999854432 1334456888999999988753
No 235
>COG0150 PurM Phosphoribosylaminoimidazole (AIR) synthetase [Nucleotide transport and metabolism]
Probab=42.09 E-value=23 Score=31.02 Aligned_cols=47 Identities=19% Similarity=0.128 Sum_probs=36.8
Q ss_pred HHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCCc
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g~ 69 (207)
...|.+++.+.+.|..-++.+. +-+..||..-.+++++++++.|+.+
T Consensus 275 ~p~iF~~i~~~G~v~~~EM~rtFNmGvG~v~iv~~e~~~~~~~~l~~~ 322 (345)
T COG0150 275 PPPIFKWLQKAGNVEREEMYRTFNMGVGMVLIVPEEDAEKALALLKEQ 322 (345)
T ss_pred CcHHHHHHHHhcCCCHHHHHHHhcCccceEEEEcHHHHHHHHHHHHhc
Confidence 3567778888888776555443 6788999999999999999988764
No 236
>PF13193 AMP-binding_C: AMP-binding enzyme C-terminal domain; PDB: 3L8C_B 2VSQ_A 3R44_A 3RG2_B 3A9U_A 3A9V_A 3NI2_A 1V26_B 1ULT_B 1V25_B ....
Probab=41.71 E-value=69 Score=20.68 Aligned_cols=47 Identities=11% Similarity=0.036 Sum_probs=29.3
Q ss_pred HHHHHHHhCCCeeEEEEeeC------CCcEEEEEEcCHHHHHHHHHh-cCCceeC
Q psy2347 25 DLELEFEKFGKLNSVWVAFN------PPGFAFVEFSNQIDAEAACDS-MNDQDLM 72 (207)
Q Consensus 25 ~L~~~F~~~G~V~~v~i~~d------~rG~afV~F~~~e~A~~Al~~-l~g~~i~ 72 (207)
||++.+.++-.|.++.++.. ..-+|||+. +.++..+.++. |....+-
T Consensus 1 EIE~~l~~~~~V~~~~V~~~~d~~~g~~l~a~vv~-~~~~i~~~~~~~l~~~~~P 54 (73)
T PF13193_consen 1 EIESVLRQHPGVAEAAVVGVPDEDWGERLVAFVVL-DEEEIRDHLRDKLPPYMVP 54 (73)
T ss_dssp HHHHHHHTSTTEEEEEEEEEEETTTEEEEEEEEEE-HHHHHHHHHHHHS-GGGS-
T ss_pred CHHHHHhcCCCccEEEEEEEEcccccccceeEEEe-eecccccchhhhCCCccee
Confidence 57788888888988766543 346888888 44555555443 5544444
No 237
>PF09162 Tap-RNA_bind: Tap, RNA-binding; InterPro: IPR015245 This domain adopts a structure consisting of an alpha+beta sandwich with an antiparallel beta-sheet, arranged in a 2(beta-alpha-beta) motif. It is mainly found in mRNA export factors, which mediate the sequence nonspecific nuclear export of cellular mRNAs as well as the sequence-specific export of retroviral mRNAs bearing the constitutive transport element []. ; GO: 0003723 RNA binding, 0006406 mRNA export from nucleus, 0005634 nucleus, 0005737 cytoplasm; PDB: 1FT8_A 1KOH_C 1KOO_C 3RW6_B 3RW7_C 1FO1_A.
Probab=41.65 E-value=76 Score=22.21 Aligned_cols=60 Identities=12% Similarity=0.080 Sum_probs=30.1
Q ss_pred CcHHHHHHHHHhCCCee--EEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCcee--CCcEEEEEEee
Q psy2347 21 VKKEDLELEFEKFGKLN--SVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDL--MGSKLRVEISR 82 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~--~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i--~g~~l~V~~a~ 82 (207)
..++.|...+...-+|. -+....+ +. .++-|-+-.++..||+.++.+.. ++.+|.|....
T Consensus 18 YdK~wLl~~iq~~c~v~F~Pv~fh~~-~~-~a~FfV~D~~tA~aLk~vsrkI~~~dg~Ki~I~V~p 81 (88)
T PF09162_consen 18 YDKDWLLNSIQSHCSVPFTPVDFHYE-KN-RAQFFVEDASTASALKDVSRKICDEDGFKISIFVNP 81 (88)
T ss_dssp S-HHHHHHHHHHHSSS----EEEEEE-TT-EEEEEESSHHHHHHHHTTTTTEEBTTSBEE--EEEE
T ss_pred cCHHHHHHHHHHHCCCCeeeeeeeee-CC-EEEEEeCCHHHHHHHHHCCCceECCCCCEEEEEEcC
Confidence 45666666666544332 2333332 22 33445455666677777776654 56777766543
No 238
>COG0858 RbfA Ribosome-binding factor A [Translation, ribosomal structure and biogenesis]
Probab=41.10 E-value=37 Score=25.03 Aligned_cols=33 Identities=12% Similarity=0.152 Sum_probs=19.9
Q ss_pred eeEEEEeeC-CCcEEEEEEc--CHHHHHHHHHhcCC
Q psy2347 36 LNSVWVAFN-PPGFAFVEFS--NQIDAEAACDSMND 68 (207)
Q Consensus 36 V~~v~i~~d-~rG~afV~F~--~~e~A~~Al~~l~g 68 (207)
|++|++..| ..-.+||++- +.++.+++++.|+.
T Consensus 36 Vt~V~vS~Dl~~A~Vyvt~l~~~~~~~~~~~~~L~~ 71 (118)
T COG0858 36 VTDVEVSKDLSHAKVYVTVLGDEESSKAEILAALNK 71 (118)
T ss_pred EEEEEEcCCCceEEEEEEecCCchhhHHHHHHHHHH
Confidence 778888888 4556777762 33345555555543
No 239
>COG0030 KsgA Dimethyladenosine transferase (rRNA methylation) [Translation, ribosomal structure and biogenesis]
Probab=41.08 E-value=42 Score=28.35 Aligned_cols=45 Identities=13% Similarity=-0.028 Sum_probs=31.3
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHh
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~ 65 (207)
....|+|||..+|..-|.++++..-.+..+.+| -..|-|++.+..
T Consensus 96 ~~~vVaNlPY~Isspii~kll~~~~~~~~~v~M-----------~QkEva~Rl~A~ 140 (259)
T COG0030 96 PYKVVANLPYNISSPILFKLLEEKFIIQDMVLM-----------VQKEVAERLVAK 140 (259)
T ss_pred CCEEEEcCCCcccHHHHHHHHhccCccceEEEE-----------eHHHHHHHHhCC
Confidence 456789999999999999998876555444443 344566665544
No 240
>PF06897 DUF1269: Protein of unknown function (DUF1269); InterPro: IPR009200 There are currently no experimental data for members of this group or their homologues. However, these proteins are predicted to contain two or more transmembrane segments.
Probab=40.78 E-value=29 Score=24.90 Aligned_cols=14 Identities=21% Similarity=0.646 Sum_probs=6.2
Q ss_pred CcHHHHHHHHHhCC
Q psy2347 21 VKKEDLELEFEKFG 34 (207)
Q Consensus 21 ~te~~L~~~F~~~G 34 (207)
.+++.|.+.|++|+
T Consensus 66 ~~~d~v~~~l~~~g 79 (102)
T PF06897_consen 66 ATEDKVDAALRKFG 79 (102)
T ss_pred CCHHHHHHHHHhcC
Confidence 34444444444444
No 241
>PRK12450 foldase protein PrsA; Reviewed
Probab=40.29 E-value=59 Score=27.96 Aligned_cols=39 Identities=21% Similarity=0.435 Sum_probs=29.8
Q ss_pred CCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 20 TVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
.+|+++|++++..+-+ .++ ...|.+++++.|+++++.|.
T Consensus 132 ~Vtd~evk~~y~~~~~--~~~-------~~~I~~~~~~~A~~i~~~l~ 170 (309)
T PRK12450 132 TISKKDYRQAYDAYTP--TMT-------AEIMQFEKEEDAKAALEAVK 170 (309)
T ss_pred CCCHHHHHHHHHHhCc--cce-------eEEEEeCCHHHHHHHHHHHH
Confidence 4799999999998743 222 13477889999999999885
No 242
>KOG0524|consensus
Probab=39.53 E-value=54 Score=28.14 Aligned_cols=64 Identities=14% Similarity=0.221 Sum_probs=35.9
Q ss_pred CCCCCCcEEEEcC-----CCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcC-HHHHHHHHHhcCCc
Q psy2347 4 GMMERGTRVYVGG-----LTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSN-QIDAEAACDSMNDQ 69 (207)
Q Consensus 4 ~~~~~~~~lfV~n-----Lp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~-~e~A~~Al~~l~g~ 69 (207)
...+++..||+-| +++.+.++.|.+-|-. ++.+.+|.+.-+..-+|+|+. ...+.+|.+.|...
T Consensus 192 AIRd~NPVV~lEnelLYg~~f~i~~E~ls~~fv~--p~gkAkier~G~~iTivt~Sr~v~~~leAA~~L~~~ 261 (359)
T KOG0524|consen 192 AIRDENPVVFLENELLYGLSFEIPEEALSKDFVL--PLGKAKIEREGTHITIVTYSRMVGHCLEAAETLVAK 261 (359)
T ss_pred hccCCCCeEEEechhhcCCCccCChhhcCcceee--eccceeeeecCCceEEEEechhHHHHHHHHHHHHhc
Confidence 3456777888764 5777888877654421 233344444444567777763 33444444444433
No 243
>PF02033 RBFA: Ribosome-binding factor A; InterPro: IPR000238 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosome-binding factor A [] (gene rbfA) is a bacterial protein that associates with free 30S ribosomal subunits. It does not associate with 30S subunits that are part of 70S ribosomes or polysomes. It is essential for efficient processing of 16S rRNA. Ribosome-binding factor A is a protein of from 13 to 15 Kd which is found in most bacteria. A putative chloroplastic form seems to exist in plants.; GO: 0006364 rRNA processing; PDB: 2R1C_A 2DYJ_B 2KZF_A 2E7G_A 1JOS_A 1KKG_A 1PA4_A.
Probab=39.32 E-value=40 Score=23.88 Aligned_cols=34 Identities=12% Similarity=0.243 Sum_probs=22.5
Q ss_pred CeeEEEEeeC-CCcEEEEEEc-CHHHHHHHHHhcCC
Q psy2347 35 KLNSVWVAFN-PPGFAFVEFS-NQIDAEAACDSMND 68 (207)
Q Consensus 35 ~V~~v~i~~d-~rG~afV~F~-~~e~A~~Al~~l~g 68 (207)
.|+.|++..| +....||++. ++++.+++++.|+.
T Consensus 30 tIt~V~ls~Dl~~a~Vy~~~~~~~~~~~~~~~~L~~ 65 (104)
T PF02033_consen 30 TITRVELSPDLSHAKVYVSILGDEEEQEEVLEALNK 65 (104)
T ss_dssp EEEEEEECTTSSEEEEEEEESSSHHHHHHHHHHHHH
T ss_pred EEEEEEECCCCCEEEEEEEEecCchhHHHHHHHHHH
Confidence 4778888877 4455667664 66666666666554
No 244
>COG5353 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.03 E-value=98 Score=23.98 Aligned_cols=51 Identities=18% Similarity=0.284 Sum_probs=36.1
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCC---CeeEEEEeeC--------------C-CcEEEEEEcCHHHH
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFG---KLNSVWVAFN--------------P-PGFAFVEFSNQIDA 59 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G---~V~~v~i~~d--------------~-rG~afV~F~~~e~A 59 (207)
..+|++.-++.-++|++.+++.++-. +++.|.+-+. - -.|-+|.|++-+..
T Consensus 87 ~~KI~~k~asqGISe~~a~~~i~kE~~~~e~~~V~Lg~e~~~PiWEV~y~dkeg~~s~~~vdFetG~~l 155 (161)
T COG5353 87 DGKIYSKKASQGISEEDARAIISKEKAVKEIKSVTLGREKEKPIWEVTYLDKEGRLSFYYVDFETGKEL 155 (161)
T ss_pred CCeEEEEehhcCCCHHHHHHHHhhhccccceeEEEEEeeCCceeEEEeecCccCcceEEEEEeccchhh
Confidence 36899999999999999998887753 4555554332 1 23888888875543
No 245
>KOG3424|consensus
Probab=38.40 E-value=71 Score=23.75 Aligned_cols=42 Identities=19% Similarity=0.385 Sum_probs=24.4
Q ss_pred cCCcHHHHHHHHHh-CCCeeEEEEeeC---------CCcEEEEEEcCHHHHHH
Q psy2347 19 ETVKKEDLELEFEK-FGKLNSVWVAFN---------PPGFAFVEFSNQIDAEA 61 (207)
Q Consensus 19 ~~~te~~L~~~F~~-~G~V~~v~i~~d---------~rG~afV~F~~~e~A~~ 61 (207)
..++.+||++-+.+ |-.-.++.++.+ +.+||.| |++.|.|++
T Consensus 33 a~vsK~EirEKla~mYkt~~d~V~vfgfrt~~GggkstgfalI-Ydsve~akk 84 (132)
T KOG3424|consen 33 ANVSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTGFALI-YDSVEYAKK 84 (132)
T ss_pred CCCCHHHHHHHHHHHhcCCcceEEEEEeeeccCCcccceeeee-eehHHHHHh
Confidence 35778888877765 433223222222 5677764 777776654
No 246
>COG5193 LHP1 La protein, small RNA-binding pol III transcript stabilizing protein and related La-motif-containing proteins involved in translation [Posttranslational modification, protein turnover, chaperones / Translation, ribosomal structure and biogenesis]
Probab=38.33 E-value=13 Score=33.32 Aligned_cols=55 Identities=25% Similarity=0.336 Sum_probs=40.4
Q ss_pred cEEEEcCCCcCCcH--------HHHHHHHHh--CCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHH
Q psy2347 10 TRVYVGGLTETVKK--------EDLELEFEK--FGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 10 ~~lfV~nLp~~~te--------~~L~~~F~~--~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~ 64 (207)
+.+|+..+..+.+. ++|..+|.. +..+..|.+..+ .++..||+|...+.|++++.
T Consensus 175 r~~y~n~fG~e~~~~a~~~e~~~d~~~~~p~h~h~~~~~i~~rrd~~nkn~~gSv~~efk~~~~~q~~nn 244 (438)
T COG5193 175 RDVYQNGFGKEDVNNASRPEQQEDLEIQFPPHYHAPPSQIRNRRDWLNKNFRGSVFVEFKYFREAQRFNN 244 (438)
T ss_pred hhHHhhcCCcccccccccchhhhhHHhhCCCcccCChhhccchhhhhhccccCcccccccChHHHHHHhc
Confidence 45666666554433 488888888 566677766555 67889999999999999984
No 247
>TIGR00387 glcD glycolate oxidase, subunit GlcD. This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
Probab=38.24 E-value=68 Score=28.65 Aligned_cols=48 Identities=19% Similarity=0.063 Sum_probs=33.0
Q ss_pred CCcHHHHHHHHHh----CCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 20 TVKKEDLELEFEK----FGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 20 ~~te~~L~~~F~~----~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
+.+..+|..+|.. ++.|+++.+... .+...++.|++.++|.+|+..+.
T Consensus 143 ~~~g~dl~~l~~Gs~GtlGiit~~~lkl~p~p~~~~~~~~~f~~~~~~~~~~~~~~ 198 (413)
T TIGR00387 143 DVAGYDLTGLFVGSEGTLGIVTEATLKLLPKPENIVVALAFFDSIEKAMQAVYDII 198 (413)
T ss_pred CCCCCChhhhcccCCccceEEEEEEEEeecCCCccEEEEEECCCHHHHHHHHHHHH
Confidence 3344567777753 677888766544 34567789999999999886543
No 248
>cd07990 LPLAT_LCLAT1-like Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: LCLAT1-like. Lysophospholipid acyltransferase (LPLAT) superfamily member: acyltransferases of de novo and remodeling pathways of glycerophospholipid biosynthesis which catalyze the incorporation of an acyl group from either acylCoAs or acyl-acyl carrier proteins (acylACPs) into acceptors such as glycerol 3-phosphate, dihydroxyacetone phosphate or lyso-phosphatidic acid. Included in this subgroup are such LPLATs as Lysocardiolipin acyltransferase 1 (LCLAT1) or 1-acyl-sn-glycerol-3-phosphate acyltransferase and similar proteins.
Probab=37.53 E-value=2e+02 Score=22.43 Aligned_cols=61 Identities=11% Similarity=0.179 Sum_probs=38.1
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------------CCcEEEEEEcC---HHHHHHHHHhcC
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--------------PPGFAFVEFSN---QIDAEAACDSMN 67 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------------~rG~afV~F~~---~e~A~~Al~~l~ 67 (207)
.+...|+|.|-...++.--|..++...+.+..++++.. ..++.||+-.. .+.++++++.+.
T Consensus 22 ~~~~~iiv~NH~s~~D~~~~~~~~~~~~~~~~~~~v~K~~l~~~p~~g~~~~~~~~i~v~R~~~~d~~~i~~~~~~l~ 99 (193)
T cd07990 22 PKERALIISNHRSEVDWLVLWMLADRFGRLGRLKIVLKDSLKYPPLGGWGWQLGEFIFLKRKWEKDEKTIKRQLKRLK 99 (193)
T ss_pred CCccEEEEEcCCcccCHHHHHHHHHHcCccceEEeeehhhhhcCChhhHHHhhCeeEEEECChHHhHHHHHHHHHHHh
Confidence 55679999998888888778888877764444433322 23477777653 333444444443
No 249
>TIGR00405 L26e_arch ribosomal protein L24p/L26e, archaeal. This protein contains a KOW domain, shared by bacterial NusG and the L24p/L26e family of ribosomal proteins. Although called archaeal NusG in several publications, it is the only close homolog of eukaryotic L26e in archaeal genomes, shares an operon with L11 in many genomes, and has been sequenced from purified ribosomes. It is here designated as a ribosomal protein for these reasons.
Probab=37.46 E-value=1.3e+02 Score=22.44 Aligned_cols=31 Identities=19% Similarity=0.259 Sum_probs=23.4
Q ss_pred EEEEeeCCCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 38 SVWVAFNPPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 38 ~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.+.++...++|.||+++..++...++..+.+
T Consensus 30 ~~~vp~~fpGYvFV~~~~~~~~~~~i~~~~g 60 (145)
T TIGR00405 30 SILAPESLKGYILVEAETKIDMRNPIIGVPH 60 (145)
T ss_pred EEEccCCCCcEEEEEEECcHHHHHHHhCCCC
Confidence 4444444789999999988888888877655
No 250
>PRK11700 hypothetical protein; Provisional
Probab=37.34 E-value=76 Score=25.45 Aligned_cols=20 Identities=20% Similarity=0.248 Sum_probs=14.7
Q ss_pred CcCCcHHHHHHHHHhCCCee
Q psy2347 18 TETVKKEDLELEFEKFGKLN 37 (207)
Q Consensus 18 p~~~te~~L~~~F~~~G~V~ 37 (207)
....+.+.+++.|.++|++.
T Consensus 48 n~~~tAe~w~~~l~~~G~ll 67 (187)
T PRK11700 48 NQNETAERWRQGFLQCGELL 67 (187)
T ss_pred CCHHHHHHHHHHHHHhchhh
Confidence 44456778889999998744
No 251
>TIGR00082 rbfA ribosome-binding factor A. Associates with free 30S ribosomal subunits (but not with 30S subunits that are part of 70S ribosomes or polysomes). Essential for efficient processing of 16S rRNA. May interact with the 5'terminal helix region of 16S rRNA. Mutants lacking rbfA have a cold-sensitive phenotype.
Probab=37.25 E-value=46 Score=24.28 Aligned_cols=34 Identities=15% Similarity=0.221 Sum_probs=20.4
Q ss_pred CeeEEEEeeC-CCcEEEEEEc-CHHHHHHHHHhcCC
Q psy2347 35 KLNSVWVAFN-PPGFAFVEFS-NQIDAEAACDSMND 68 (207)
Q Consensus 35 ~V~~v~i~~d-~rG~afV~F~-~~e~A~~Al~~l~g 68 (207)
.|+.|++..| +-..+||++- ++++.+++++.|+.
T Consensus 34 tVt~V~lS~Dl~~AkVyvs~~~~~~~~~~~l~~L~~ 69 (114)
T TIGR00082 34 TVTKVEVSKDLQHAKVFVDCYGDEEAIDRVVKALNK 69 (114)
T ss_pred EEeEEEECCCCCEEEEEEEECCChhhHHHHHHHHHH
Confidence 4677788777 4445666665 44444556555554
No 252
>TIGR00587 nfo apurinic endonuclease (APN1). All proteins in this family for which functions are known are 5' AP endonculeases that are used in base excision repair and the repair of abasic sites in DNA.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=37.20 E-value=59 Score=27.23 Aligned_cols=57 Identities=9% Similarity=0.054 Sum_probs=38.6
Q ss_pred CcEEEEcCCCcC-----CcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEE---cCHHHHHHHHHhc
Q psy2347 9 GTRVYVGGLTET-----VKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEF---SNQIDAEAACDSM 66 (207)
Q Consensus 9 ~~~lfV~nLp~~-----~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F---~~~e~A~~Al~~l 66 (207)
..+|.|-|+|.. .+.++|.++++.+.....+.++.| .+.+++.- .+++..+.+++.+
T Consensus 137 ~v~l~lEN~~~~~~~l~~~~~el~~ll~~~~~~~~lg~~lD-t~H~~~~g~~~~~~~~~~~~~~~~ 201 (274)
T TIGR00587 137 IVTILLENMAGQGSELGRSFEELAYIIKVIVDKRRIGVCLD-TCHFFAAGYDITTKAYFEVVKNEF 201 (274)
T ss_pred CCEEEEEeCCCCCCccCCCHHHHHHHHHhcCCCCceEEEEE-hhhHHhcCCCcCCHHHHHHHHHHH
Confidence 367888898632 478999999998876567888888 34444332 3456666666654
No 253
>COG0225 MsrA Peptide methionine sulfoxide reductase [Posttranslational modification, protein turnover, chaperones]
Probab=37.10 E-value=51 Score=26.13 Aligned_cols=72 Identities=17% Similarity=0.138 Sum_probs=36.9
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEE-EEeeC-CCcEEEEEE-cCH---HHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSV-WVAFN-PPGFAFVEF-SNQ---IDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v-~i~~d-~rG~afV~F-~~~---e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
.|.|.==|..++-++|.++|-+.-+.+.+ +--.| -.-|--+.| .++ +.|++.++++....+..++|.+++..
T Consensus 59 ~V~V~yDp~~isy~~LL~~ff~ihDPT~~nrQGnD~GtqYRs~Iy~~~~~q~~~a~~~~~~~q~~~~~~~~IvteI~p 136 (174)
T COG0225 59 AVEVTYDPKVISYEELLEVFFEIHDPTSLNRQGNDRGTQYRSAIYYTNEEQKAIAEASIEELQASGYFKKPIVTEIEP 136 (174)
T ss_pred EEEEEeCCccccHHHHHHHHheecCCCCCCccCCcccccceeEEEEcCHHHHHHHHHHHHHHHHhccCCCCeEEEeec
Confidence 45555457778888887777554222221 11112 112322333 333 34555556665555666677777654
No 254
>PF01071 GARS_A: Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain; InterPro: IPR020561 Phosphoribosylglycinamide synthetase (6.3.4.13 from EC) (GARS) (phosphoribosylamine glycine ligase) [] catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by phosphoribosylglycinamide synthetase is the ATP-dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide: ATP + 5-phosphoribosylamine + glycine = ADP + Pi + 5'-phosphoribosylglycinamide In bacteria, GARS is a monofunctional enzyme (encoded by the purD gene). In yeast, GARS is part of a bifunctional enzyme (encoded by the ADE5/7 gene) in conjunction with phosphoribosylformylglycinamidine cyclo-ligase (AIRS) (IPR000728 from INTERPRO). In higher eukaryotes, GARS is part of a trifunctional enzyme in conjunction with AIRS (IPR000728 from INTERPRO) and with phosphoribosylglycinamide formyltransferase (GART) (), forming GARS-AIRS-GART. This entry represents the A-domain of the enzyme, and is related to the ATP-grasp domain of biotin carboxylase/carbamoyl phosphate synthetase.; PDB: 1GSO_A 3LP8_A 2IP4_A 1VKZ_A 2QK4_A 2XD4_A 2XCL_A 3MJF_A 2YRW_A 2YS6_A ....
Probab=36.84 E-value=94 Score=25.02 Aligned_cols=44 Identities=18% Similarity=0.105 Sum_probs=28.6
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeC--CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFN--PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d--~rG~afV~F~~~e~A~~Al~~l 66 (207)
+.++.+++++++..-. +.|..| ..+-..+...+.++|.+|++.+
T Consensus 25 ~~~~A~~~l~~~~~p~-~ViKadGla~GKGV~i~~~~~eA~~~l~~~ 70 (194)
T PF01071_consen 25 DYEEALEYLEEQGYPY-VVIKADGLAAGKGVVIADDREEALEALREI 70 (194)
T ss_dssp SHHHHHHHHHHHSSSE-EEEEESSSCTTTSEEEESSHHHHHHHHHHH
T ss_pred CHHHHHHHHHhcCCCc-eEEccCCCCCCCEEEEeCCHHHHHHHHHHh
Confidence 5677777777765433 444445 3333455668999999998865
No 255
>cd04931 ACT_PAH ACT domain of the nonheme iron-dependent aromatic amino acid hydroxylase, phenylalanine hydroxylases (PAH). ACT domain of the nonheme iron-dependent aromatic amino acid hydroxylase, phenylalanine hydroxylases (PAH). PAH catalyzes the hydroxylation of L-Phe to L-Tyr, the first step in the catabolic degradation of L-Phe. In PAH, an autoregulatory sequence, N-terminal of the ACT domain, extends across the catalytic domain active site and regulates the enzyme by intrasteric regulation. It appears that the activation by L-Phe induces a conformational change that converts the enzyme to a high-affinity and high-activity state. Modulation of activity is achieved through inhibition by BH4 and activation by phosphorylation of serine residues of the autoregulatory region. The molecular basis for the cooperative activation process is not fully understood yet. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=36.82 E-value=1.4e+02 Score=20.67 Aligned_cols=52 Identities=15% Similarity=0.093 Sum_probs=27.8
Q ss_pred CCCcCC-cHHHHHHHHHhCCC-eeEEEEeeC---C-CcEEEEEEcC--HHHHHHHHHhcC
Q psy2347 16 GLTETV-KKEDLELEFEKFGK-LNSVWVAFN---P-PGFAFVEFSN--QIDAEAACDSMN 67 (207)
Q Consensus 16 nLp~~~-te~~L~~~F~~~G~-V~~v~i~~d---~-rG~afV~F~~--~e~A~~Al~~l~ 67 (207)
.|+... +-.++.+.|+.++. +++|+-... . .=.-||+|+- .+.+++||+.|.
T Consensus 20 ~l~~~pGsL~~vL~~Fa~~~INLt~IeSRP~~~~~~~Y~FfVDieg~~~~~~~~~l~~L~ 79 (90)
T cd04931 20 SLKEEVGALAKVLRLFEEKDINLTHIESRPSRLNKDEYEFFINLDKKSAPALDPIIKSLR 79 (90)
T ss_pred EcCCCCcHHHHHHHHHHHCCCCEEEEEeccCCCCCceEEEEEEEEcCCCHHHHHHHHHHH
Confidence 344432 34566777887752 444433222 1 1245678873 455667777654
No 256
>PF13820 Nucleic_acid_bd: Putative nucleic acid-binding region
Probab=36.69 E-value=63 Score=24.94 Aligned_cols=58 Identities=21% Similarity=0.239 Sum_probs=33.3
Q ss_pred EEEEcCCCc-CCcHHHHHHHHHhC-CC-eeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 11 RVYVGGLTE-TVKKEDLELEFEKF-GK-LNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 11 ~lfV~nLp~-~~te~~L~~~F~~~-G~-V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.+|+++|.. ..+-++|+..+... .+ +.++++.+. +-..+.|+|.-+.+|...|..|-.
T Consensus 6 la~~G~l~~~~~~ld~i~~~l~~L~~~~~~~l~~~~~~~~~sv~V~f~ipreaa~~Lr~LA~ 67 (149)
T PF13820_consen 6 LAYIGGLRMFQYKLDDIKNWLASLYKPRISDLKVRKVEPWNSVRVTFSIPREAATRLRQLAQ 67 (149)
T ss_pred EEEECChhhhHHHHHHHHHHHHHHHhcccccceeeccccCceEEEEEechHHHHHHHHHHhh
Confidence 455677752 12334555555442 22 233333333 456899999999888877776544
No 257
>PTZ00481 Membrane attack complex/ Perforin (MACPF) Superfamily; Provisional
Probab=36.56 E-value=34 Score=31.11 Aligned_cols=51 Identities=16% Similarity=0.169 Sum_probs=32.8
Q ss_pred EcCCCcCCcHHHHHHHHHhCCC--eeEEE---------EeeCCCcEEEEEEc---CHHHHHHHHHh
Q psy2347 14 VGGLTETVKKEDLELEFEKFGK--LNSVW---------VAFNPPGFAFVEFS---NQIDAEAACDS 65 (207)
Q Consensus 14 V~nLp~~~te~~L~~~F~~~G~--V~~v~---------i~~d~rG~afV~F~---~~e~A~~Al~~ 65 (207)
|..||.+.+.++-.++++.||+ |+++. .+++ .-++...|. +.++++.||+.
T Consensus 210 L~~LP~eYd~g~Y~~FIe~YGTHYITsv~LGG~VE~vTs~Rt-C~~slps~~~GlS~~qVk~cLn~ 274 (524)
T PTZ00481 210 IDALPEDYQPDVYNEFLNAWGTHISVDTFIGGMIEKQTVFKD-CVFFTPQFNGGLSPEQVEQALRN 274 (524)
T ss_pred HHhcchhcCHHHHHHHHHhcCceEEEEeeEcCEEEEEeehhh-hhhhchhhhcCCCHHHHHHHHHH
Confidence 4578999999999999999987 33322 2222 112333344 77788888863
No 258
>COG0360 RpsF Ribosomal protein S6 [Translation, ribosomal structure and biogenesis]
Probab=36.55 E-value=1.1e+02 Score=22.44 Aligned_cols=51 Identities=18% Similarity=0.290 Sum_probs=29.3
Q ss_pred EEEcCCCcCCcHHHHHHHHHhC--------CCeeEEEEe----------eCCC-cEEEEEEcCHHHHHHHHH
Q psy2347 12 VYVGGLTETVKKEDLELEFEKF--------GKLNSVWVA----------FNPP-GFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F~~~--------G~V~~v~i~----------~d~r-G~afV~F~~~e~A~~Al~ 64 (207)
+|| |.++++++++..+.++| ++|.+++.+ +..+ -|.++.|+...++.+-++
T Consensus 6 ~~i--v~p~~see~~~~~ve~~~~~l~~~gg~i~~~e~wG~R~LAY~IkK~~~g~Y~l~~f~~~~~~i~Ele 75 (112)
T COG0360 6 VFI--VRPDLSEEQVAALVEKYKGVLTNNGGEIHKVEDWGKRRLAYPIKKLREGHYVLMNFEAEPAAIAELE 75 (112)
T ss_pred EEE--ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEehhhhhhhhcceecccceEEEEEEEEEcCHHHHHHHH
Confidence 444 46677877777666655 334444321 1122 467888887766665554
No 259
>PLN02805 D-lactate dehydrogenase [cytochrome]
Probab=36.35 E-value=94 Score=29.20 Aligned_cols=47 Identities=17% Similarity=0.130 Sum_probs=33.6
Q ss_pred CcHHHHHHHHH----hCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 21 VKKEDLELEFE----KFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 21 ~te~~L~~~F~----~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
.+..+|..+|. .+|.|+++.+... .+..+++.|.+.++|.+|+..+.
T Consensus 278 ~~g~dL~~l~~GseGtLGIIT~~tlrl~p~P~~~~~~~~~f~~~~~a~~av~~i~ 332 (555)
T PLN02805 278 AAGYDLTRLVIGSEGTLGVITEVTLRLQKIPQHSVVAMCNFPTIKDAADVAIATM 332 (555)
T ss_pred CCCccHHHHhccCCCceEEEEEEEEEeecCCcceEEEEEEcCCHHHHHHHHHHHH
Confidence 34457777772 4777888877543 35678899999999988887643
No 260
>PF11491 DUF3213: Protein of unknown function (DUF3213) ; InterPro: IPR021583 The backbone structure of this family of proteins has been determined however the function remains unknown. The protein has an alpha and beta structure with a ferredoxin-like fold []. ; PDB: 2F40_A.
Probab=35.87 E-value=26 Score=24.28 Aligned_cols=67 Identities=16% Similarity=0.169 Sum_probs=23.8
Q ss_pred EEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEc----CHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 12 VYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFS----NQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 12 lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~----~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
|-+++|.++-.. +++--+++-..|-.+.|.-- ...+||.|+ +.+...++++.+....+.-+.|+|+.
T Consensus 3 lkfg~It~eeA~-~~QYeLsk~~~vyRvFiNgY-ar~g~VifDe~kl~~e~lL~~le~~kpEVi~ek~lTveE 73 (88)
T PF11491_consen 3 LKFGNITPEEAM-VKQYELSKNEAVYRVFINGY-ARNGFVIFDESKLSKEELLEMLEEFKPEVIEEKELTVEE 73 (88)
T ss_dssp EE--S-TTTTTH-HHHHTTTTTTTB------TT-SS--EEE--B-S-SHHHH---HHHTTT-SS-------SS
T ss_pred cccCCCCHHHHH-HHHHHhhcccceeeeeeccc-ccceEEEECcccCCHHHHHHHHHhcChhheeeccccHHH
Confidence 444666543221 12222344444555555433 337899997 46788888999999888888888753
No 261
>PRK06545 prephenate dehydrogenase; Validated
Probab=35.65 E-value=2.1e+02 Score=24.97 Aligned_cols=63 Identities=14% Similarity=0.182 Sum_probs=42.8
Q ss_pred CCCCCcEEEEcCCCcCC-cHHHHHHHHHhCC-CeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 5 MMERGTRVYVGGLTETV-KKEDLELEFEKFG-KLNSVWVAFN---PPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~-te~~L~~~F~~~G-~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
..++...|+| .||... .-.+|..++...+ .|++|+|... ..+..-++|.+.+++++|.+.|..
T Consensus 286 ~~~~~~~~~v-~v~d~pg~~~~~~~~~~~~~i~i~~~~i~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 353 (359)
T PRK06545 286 AIPSFYDLYV-DVPDEPGVIARVTAILGEEGISIENLRILEAREDIHGVLQISFKNEEDRERAKALLEE 353 (359)
T ss_pred CCCcceEEEE-eCCCCCCHHHHHHHHHHHcCCCeecceeeeccCCcCceEEEEeCCHHHHHHHHHHHHh
Confidence 3455667777 455433 2345555555554 4888888765 557888999999999999887654
No 262
>KOG3671|consensus
Probab=35.51 E-value=49 Score=30.61 Aligned_cols=50 Identities=14% Similarity=0.065 Sum_probs=30.9
Q ss_pred CCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCc
Q psy2347 20 TVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~ 69 (207)
-+-|+||.+-|+-+-.-..++-..-..+++=+.|.++++|++.++.+..+
T Consensus 89 liWdqELY~nf~y~q~r~ffhtFegddc~aGLnF~~E~EA~~F~k~V~~r 138 (569)
T KOG3671|consen 89 LIWDQELYQNFEYRQPRTFFHTFEGDDCQAGLNFASEEEAQKFRKKVQDR 138 (569)
T ss_pred eeehHHhhhhceeccCccceeeeccccceeeecccCHHHHHHHHHHHHHH
Confidence 35566777766544332222211114567778999999999988876543
No 263
>KOG0226|consensus
Probab=35.28 E-value=13 Score=31.32 Aligned_cols=70 Identities=17% Similarity=0.153 Sum_probs=47.3
Q ss_pred cEEEEcCCCcCCcHHH-H--HHHHHhCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 10 TRVYVGGLTETVKKED-L--ELEFEKFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~-L--~~~F~~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
..+++.++-..+..+- | ...|+.+-.+...+++.+ .+.++|+.|...++..++...-+.+.|.-..|++.
T Consensus 97 f~p~~~~~g~~v~pep~lp~~~~f~~~p~L~ktk~v~~~p~~~~~~~~~~~k~s~a~~k~~~~~~~Kki~~~~VR~a 173 (290)
T KOG0226|consen 97 FRPFQSNAGATVNPEPPLPLPVVFSEYPSLVKTKLVRDRPQPIRPEAFESFKASDALLKAETEKEKKKIGKPPVRLA 173 (290)
T ss_pred ccccccccccccCCCCCCcchhhhccchhhhhhhhhhcCCCccCcccccCcchhhhhhhhccccccccccCcceeec
Confidence 3455566544444433 3 567777777777777766 56899999998888888877666667766665554
No 264
>PTZ00146 fibrillarin; Provisional
Probab=34.96 E-value=1.3e+02 Score=25.86 Aligned_cols=8 Identities=25% Similarity=0.721 Sum_probs=3.1
Q ss_pred CCCCCCCC
Q psy2347 192 FRSRSPVQ 199 (207)
Q Consensus 192 ~~~~~~~~ 199 (207)
+.-|-|.|
T Consensus 107 yR~w~p~r 114 (293)
T PTZ00146 107 YRVWNPFR 114 (293)
T ss_pred eeeeCCcc
Confidence 33444433
No 265
>COG3444 Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB [Carbohydrate transport and metabolism]
Probab=34.85 E-value=1.8e+02 Score=22.78 Aligned_cols=23 Identities=17% Similarity=0.089 Sum_probs=19.4
Q ss_pred CCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 45 PPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 45 ~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
.+-.+||.|+++++|.++++.-.
T Consensus 75 ~~~~v~ll~~~p~d~~~lve~gv 97 (159)
T COG3444 75 DGQKVFLLFENPQDVLRLVEGGV 97 (159)
T ss_pred CCeEEEEEECCHHHHHHHHhcCC
Confidence 35689999999999999998644
No 266
>PRK01178 rps24e 30S ribosomal protein S24e; Reviewed
Probab=34.68 E-value=1.4e+02 Score=21.27 Aligned_cols=42 Identities=17% Similarity=0.267 Sum_probs=22.2
Q ss_pred CCcHHHHHHHHHh-CCCeeEEEEeeC---------CCcEEEEEEcCHHHHHHH
Q psy2347 20 TVKKEDLELEFEK-FGKLNSVWVAFN---------PPGFAFVEFSNQIDAEAA 62 (207)
Q Consensus 20 ~~te~~L~~~F~~-~G~V~~v~i~~d---------~rG~afV~F~~~e~A~~A 62 (207)
..+..+|++-+.+ +..-.++.|+.. +.+||.| |+|.+.|++.
T Consensus 30 tpsr~eirekLa~~~~~~~~~vvv~~~~t~fG~g~s~G~a~I-Yds~e~~kk~ 81 (99)
T PRK01178 30 TPSRKDVRKKLAAMLNADKELVVVRKIKTEYGMGKSKGYAKV-YDDKERARKI 81 (99)
T ss_pred CCCHHHHHHHHHHHHCcCCCEEEEEccCccCCCceEEEEEEE-ECCHHHHHhh
Confidence 5667777766554 553223333322 4455553 6666665543
No 267
>COG3797 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=34.50 E-value=2.1e+02 Score=22.76 Aligned_cols=78 Identities=27% Similarity=0.373 Sum_probs=43.3
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHh-CCCeeEEEEeeC---------------C---CcEEEEEEc----CHHHHHHHH
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEK-FGKLNSVWVAFN---------------P---PGFAFVEFS----NQIDAEAAC 63 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~-~G~V~~v~i~~d---------------~---rG~afV~F~----~~e~A~~Al 63 (207)
+....||-+.+...--++.|++.|++ |+-..+|.|..- . .--..|.|. +.|++.+..
T Consensus 41 aSGNlvf~s~~~~~el~~klE~afe~~fg~~~dilVrs~~~~~~i~a~nPf~~~~~~~~~~~~v~f~~ep~dvd~v~~l~ 120 (178)
T COG3797 41 ASGNLVFESEAGAAELEAKLEAAFEKRFGRHVDILVRSAEDWETIAAANPFPEEAAGEPARVCVRFYREPLDVDEVEALA 120 (178)
T ss_pred hcCCEEEEcCCChHHHHHHHHHHHHHHcCCCccEEEeeHHHHHHHHhcCCCchhhccCcceEEEEEEcCCCcHHHHHHHH
Confidence 34455666666533345678888875 888777655322 0 122334443 334455544
Q ss_pred HhcCCc---eeCCcEEEEEEeecC
Q psy2347 64 DSMNDQ---DLMGSKLRVEISRGR 84 (207)
Q Consensus 64 ~~l~g~---~i~g~~l~V~~a~~k 84 (207)
+++... ...++.|.|.++...
T Consensus 121 ~~~~~~eev~~~g~~L~v~f~~~~ 144 (178)
T COG3797 121 DKAGGDEEVLAVGDDLWVDFSDGP 144 (178)
T ss_pred HHhCCCceEEecCCeeEEEecCCC
Confidence 544443 357788888887643
No 268
>COG1058 CinA Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA [General function prediction only]
Probab=34.33 E-value=1.1e+02 Score=25.79 Aligned_cols=59 Identities=10% Similarity=0.039 Sum_probs=34.2
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEE----EEE--cCHHHHHHHHHhcCC
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAF----VEF--SNQIDAEAACDSMND 68 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~af----V~F--~~~e~A~~Al~~l~g 68 (207)
+++|.+-+++++.-.+.|+++...+-.|.--...++ ....+ |.. ++++.|+++++.+..
T Consensus 180 ~~~~~~~gi~ES~la~~L~~i~~~~~~~~i~s~p~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 244 (255)
T COG1058 180 SRVLRVFGIGESSLAPTLKDLQDEQPNVTIASYPKD-GEVRLRELVIRAEARDEEEADALLRWLEG 244 (255)
T ss_pred EEEEEEcCCChHHHHHHHHHHHhcCCCCEEEecCCC-CceeccceEEEEecCCHHHHHHHHHHHHH
Confidence 456667777777777777777776644332222222 11111 444 677788887776543
No 269
>PF13721 SecD-TM1: SecD export protein N-terminal TM region
Probab=34.21 E-value=1.7e+02 Score=20.77 Aligned_cols=59 Identities=12% Similarity=0.130 Sum_probs=38.4
Q ss_pred CCCcEEEEcCCCcC--C-cHHHHHHHHHhCCC-eeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 7 ERGTRVYVGGLTET--V-KKEDLELEFEKFGK-LNSVWVAFNPPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 7 ~~~~~lfV~nLp~~--~-te~~L~~~F~~~G~-V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.+...|-|+..... + +.++|++++++-+- ++.++.. ..-..|.|++.++-.+|.+.|+.
T Consensus 29 ge~pAvqIs~~~~~~~~~~~~~v~~~L~~~~I~~k~i~~~---~~~llirf~~~~~Ql~Ak~~L~~ 91 (101)
T PF13721_consen 29 GEDPAVQISASSAGVQLPDAFQVEQALKAAGIAVKSIEQE---GDSLLIRFDSTDQQLKAKDVLSK 91 (101)
T ss_pred CCCCcEEEecCCCCccCChHHHHHHHHHHCCCCcceEEee---CCEEEEEECCHHHHHHHHHHHHH
Confidence 34556777764322 2 23578888887763 4445443 34788999999988888877654
No 270
>PF14528 LAGLIDADG_3: LAGLIDADG-like domain; PDB: 2CW7_A 2CW8_A 2VS8_F 2VS7_G 1B24_A 1DQ3_A 2DCH_X.
Probab=33.94 E-value=1.3e+02 Score=19.47 Aligned_cols=41 Identities=15% Similarity=0.139 Sum_probs=25.8
Q ss_pred HHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHh
Q psy2347 24 EDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 24 ~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~ 65 (207)
++|++++.++|-...+...+..+..=-|++.+ +++.+.++.
T Consensus 35 ~~v~~lL~~lGi~~~i~~~~~~~~~y~l~i~~-~~~~~f~~~ 75 (77)
T PF14528_consen 35 EDVQKLLLRLGIKASIYEKKRKKGSYRLRISG-KSLKRFLEK 75 (77)
T ss_dssp HHHHHHHHHTT--EEEEEEECTTTEEEEEEEC-HHHHHHHHH
T ss_pred HHHHHHHHHCCCeeEEEEEcCCCceEEEEECc-hHHHHHHHH
Confidence 56778888999777776555534444466666 667776664
No 271
>PF08206 OB_RNB: Ribonuclease B OB domain; InterPro: IPR013223 This domain includes the N-terminal OB domain found in ribonuclease B proteins in one or two copies.; PDB: 2ID0_D 2IX1_A 2IX0_A.
Probab=33.94 E-value=16 Score=23.16 Aligned_cols=37 Identities=30% Similarity=0.401 Sum_probs=18.7
Q ss_pred CCcEEEEEEcC-HHHHHHHHHhcCCceeCCcEEEEEEee
Q psy2347 45 PPGFAFVEFSN-QIDAEAACDSMNDQDLMGSKLRVEISR 82 (207)
Q Consensus 45 ~rG~afV~F~~-~e~A~~Al~~l~g~~i~g~~l~V~~a~ 82 (207)
+++|+||..++ .++.--.-+.|+ .-+++-++.|++..
T Consensus 7 ~~GfGFv~~~~~~~DifIp~~~l~-~A~~gD~V~v~i~~ 44 (58)
T PF08206_consen 7 PKGFGFVIPDDGGEDIFIPPRNLN-GAMDGDKVLVRITP 44 (58)
T ss_dssp SSS-EEEEECT-TEEEEE-HHHHT-TS-TT-EEEEEEEE
T ss_pred cCCCEEEEECCCCCCEEECHHHHC-CCCCCCEEEEEEec
Confidence 58999999987 222111222232 23456667777766
No 272
>PF08502 LeuA_dimer: LeuA allosteric (dimerisation) domain; InterPro: IPR013709 This is the C-terminal regulatory (R) domain of alpha-isopropylmalate synthase, which catalyses the first committed step in the leucine biosynthetic pathway []. This domain, is an internally duplicated structure with a novel fold []. It comprises two similar units that are arranged such that the two -helices pack together in the centre, crossing at an angle of 34 degrees, sandwiched between the two three-stranded, antiparallel beta-sheets. The overall domain is thus constructed as a beta-alpha-beta three-layer sandwich []. ; GO: 0003852 2-isopropylmalate synthase activity, 0009098 leucine biosynthetic process; PDB: 3HQ1_A 3HPZ_B 1SR9_A 3FIG_B 3HPS_A 3F6G_A 3F6H_A.
Probab=33.64 E-value=1.4e+02 Score=22.03 Aligned_cols=50 Identities=22% Similarity=0.067 Sum_probs=26.4
Q ss_pred CCcHHHHHHHHHh-CCC------eeEEEEeeC--CCcEEEEEEcC---------------HHHHHHHHHhcCCc
Q psy2347 20 TVKKEDLELEFEK-FGK------LNSVWVAFN--PPGFAFVEFSN---------------QIDAEAACDSMNDQ 69 (207)
Q Consensus 20 ~~te~~L~~~F~~-~G~------V~~v~i~~d--~rG~afV~F~~---------------~e~A~~Al~~l~g~ 69 (207)
++++++|+++|.+ |.. ++++++... ....+.|++.. .+++.+||++.-+.
T Consensus 1 Ev~~~~i~~lf~~~y~~~~~~~~l~~~~v~~~~~~~~~a~v~l~~~~g~~~~~~~~GnGpv~A~~~Al~~~~g~ 74 (133)
T PF08502_consen 1 EVTDEDIWALFEEEYLEVEEPYRLKSFQVSSGSGGRPTATVTLEVTDGEEHTISGTGNGPVDAFFNALNKALGI 74 (133)
T ss_dssp ---HHHHHHHHHHHHTS--SSEEEEEEEEEEETTCCEEEEEEEEEETTEEEEEEEEESSHHHHHHHHHHCHCCC
T ss_pred CcCHHHHHHHHHHHhCcCCCcEEEEEEEEEECCCCCcEEEEEEEEcCCeEEEEEEECCChHHHHHHHHHHHcCC
Confidence 4789999999975 433 445555444 23344444432 55666677653333
No 273
>TIGR01638 Atha_cystat_rel Arabidopsis thaliana cystatin-related protein. This model represents a family similar in sequence and probably homologous to a large family of cysteine proteinase inhibitors, or cystatins, as described by pfam model pfam00031. Cystatins may help plants resist attack by insects.
Probab=33.60 E-value=65 Score=22.80 Aligned_cols=25 Identities=16% Similarity=0.215 Sum_probs=15.1
Q ss_pred eeEEEEeeC-----CCcEEEEEEcCHHHHH
Q psy2347 36 LNSVWVAFN-----PPGFAFVEFSNQIDAE 60 (207)
Q Consensus 36 V~~v~i~~d-----~rG~afV~F~~~e~A~ 60 (207)
++-|+|++- ..-..||+|.-.|...
T Consensus 34 lEfV~vVrAn~~~~~g~~~yITF~Ard~~d 63 (92)
T TIGR01638 34 LELVEVVRANYRGGAKSKSYITFEARDKPD 63 (92)
T ss_pred EEEEEEEEEEeeccceEEEEEEEEEecCCC
Confidence 555555543 4456789997555443
No 274
>PF13046 DUF3906: Protein of unknown function (DUF3906)
Probab=33.39 E-value=58 Score=21.41 Aligned_cols=31 Identities=19% Similarity=0.260 Sum_probs=23.1
Q ss_pred CcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEE
Q psy2347 21 VKKEDLELEFEKFGKLNSVWVAFN---PPGFAFV 51 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV 51 (207)
.-|.+|+++|-+--+|+++.|... .++-+||
T Consensus 30 ~~e~eler~fl~~P~v~e~~l~EKKri~~G~gyV 63 (64)
T PF13046_consen 30 LVEVELERHFLPLPEVKEVALYEKKRIRKGAGYV 63 (64)
T ss_pred HHHHHhhhhccCCCCceEEEEEEEEeeeCCceeE
Confidence 456778888888888999888765 4566666
No 275
>PF08002 DUF1697: Protein of unknown function (DUF1697); InterPro: IPR012545 This family contains many hypothetical bacterial proteins.; PDB: 2HIY_B.
Probab=33.11 E-value=2.1e+02 Score=21.44 Aligned_cols=32 Identities=19% Similarity=0.360 Sum_probs=19.3
Q ss_pred cEEEEcCCC----cCCcHHHHHHHHHhCCCeeEEEEe
Q psy2347 10 TRVYVGGLT----ETVKKEDLELEFEKFGKLNSVWVA 42 (207)
Q Consensus 10 ~~lfV~nLp----~~~te~~L~~~F~~~G~V~~v~i~ 42 (207)
..+|+..|+ ..+..++|+++|++.| ..+|+-.
T Consensus 4 yiaLLRGINVGG~nki~MaeLr~~l~~~G-f~~V~Ty 39 (137)
T PF08002_consen 4 YIALLRGINVGGKNKIKMAELREALEDLG-FTNVRTY 39 (137)
T ss_dssp EEEEESS-SBTTBS---HHHHHHHHHHCT--EEEEEE
T ss_pred EEEEEcceecCCCCcccHHHHHHHHHHcC-CCCceEE
Confidence 356777763 3488999999999987 4555433
No 276
>cd04917 ACT_AKiii-LysC-EC_2 ACT domains located C-terminal to the catalytic domain of the lysine-sensitive aspartokinase isoenzyme AKIII. This CD includes the second of two ACT domains located C-terminal to the catalytic domain of the lysine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in bacteria (Escherichia coli (EC) LysC). Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. The E. coli AKIII (LysC) binds two feedback allosteric inhibitor lysine molecules at the dimer interface located between the ACT1 domain of two subunits. The second ACT domain (ACT2), this CD, is not involved in the binding of heterotrophic effectors. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=32.64 E-value=1.1e+02 Score=19.17 Aligned_cols=16 Identities=6% Similarity=0.264 Sum_probs=10.4
Q ss_pred cCHHHHHHHHHhcCCc
Q psy2347 54 SNQIDAEAACDSMNDQ 69 (207)
Q Consensus 54 ~~~e~A~~Al~~l~g~ 69 (207)
-+.+++++|++.||..
T Consensus 46 V~~~~~~~a~~~Lh~~ 61 (64)
T cd04917 46 VKEEDKDEVVQRLHSR 61 (64)
T ss_pred EeHHHHHHHHHHHHHH
Confidence 3456677787777643
No 277
>PF13689 DUF4154: Domain of unknown function (DUF4154)
Probab=32.61 E-value=84 Score=23.59 Aligned_cols=35 Identities=14% Similarity=0.227 Sum_probs=24.3
Q ss_pred CcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 46 PGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
..+-+..+.+.+ ...++..|.++.+.+++|+|..-
T Consensus 26 ~~~~icv~g~~~-~~~~L~~l~~~~~~~~~i~v~~~ 60 (145)
T PF13689_consen 26 SPFRICVLGDDP-FAEALSTLAGKQVGGRPIRVRRL 60 (145)
T ss_pred CCeEEEEECChH-HHHHHHHhhhcccCCCcEEEEEC
Confidence 345555665544 45677788888899999888754
No 278
>PRK04405 prsA peptidylprolyl isomerase; Provisional
Probab=32.61 E-value=93 Score=26.60 Aligned_cols=39 Identities=15% Similarity=0.256 Sum_probs=27.8
Q ss_pred CCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 20 TVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
.+|+++|+++|..+.+ .++ ...|.+.+++.|+++++.|.
T Consensus 128 ~Vtd~ei~~~y~~~~~--~~~-------v~hIlv~~~~~A~~v~~~l~ 166 (298)
T PRK04405 128 KVTNSQLKKAWKSYQP--KVT-------VQHILVSKKSTAETVIKKLK 166 (298)
T ss_pred CCCHHHHHHHHHHhhh--hEE-------EEEEEecChHHHHHHHHHHH
Confidence 5799999999988633 222 23456678888888888764
No 279
>PLN02707 Soluble inorganic pyrophosphatase
Probab=32.49 E-value=22 Score=30.17 Aligned_cols=42 Identities=12% Similarity=0.037 Sum_probs=26.8
Q ss_pred HHHHHHHHHhCCCeeEEEEeeCCCcEEEE-EEcCHHHHHHHHHhcCC
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFNPPGFAFV-EFSNQIDAEAACDSMND 68 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d~rG~afV-~F~~~e~A~~Al~~l~g 68 (207)
-++|+++|+.|-..+. +.+ +-|+|+ +|.+.+.|.+.|+..+.
T Consensus 207 l~~I~~fF~~YK~~eG-K~~---n~~~~~~~~~~~~~A~~vI~e~~~ 249 (267)
T PLN02707 207 LTAIRDWFRDYKIPDG-KPA---NKFGLDNKPMDKDYALKVIEETNE 249 (267)
T ss_pred HHHHHHHHHHhcCCCC-Cce---eeccccCCcCCHHHHHHHHHHHHH
Confidence 3688889988843321 111 124554 78999999998876443
No 280
>PTZ00071 40S ribosomal protein S24; Provisional
Probab=32.33 E-value=1.5e+02 Score=22.42 Aligned_cols=41 Identities=24% Similarity=0.451 Sum_probs=21.6
Q ss_pred CCcHHHHHHHHHh-CC-CeeEEEEeeC---------CCcEEEEEEcCHHHHHH
Q psy2347 20 TVKKEDLELEFEK-FG-KLNSVWVAFN---------PPGFAFVEFSNQIDAEA 61 (207)
Q Consensus 20 ~~te~~L~~~F~~-~G-~V~~v~i~~d---------~rG~afV~F~~~e~A~~ 61 (207)
..+.+||++-+.+ |. .-.++.|+.. +.+||.| |+|.|+|++
T Consensus 35 TpSr~eirekLA~~~~v~d~~~Vvv~~~~T~fG~g~StG~a~I-Yds~e~~kk 86 (132)
T PTZ00071 35 TVSKKDIKEKLAKQYKVADARTIVLFGFKTKFGGGKTTGFGLI-YDNLAALKK 86 (132)
T ss_pred CCCHHHHHHHHHHHhCCCCCCEEEEEccEecCCCceEEEEEEE-ECCHHHHHh
Confidence 5677888877665 54 2222222222 4556653 666665543
No 281
>KOG3938|consensus
Probab=32.07 E-value=2.7e+02 Score=23.83 Aligned_cols=53 Identities=15% Similarity=0.313 Sum_probs=34.7
Q ss_pred CCeeEEEEeeC-----------CCcEEEEEEcCHHHH---------HHHHHhcCCcee----------------CCcEEE
Q psy2347 34 GKLNSVWVAFN-----------PPGFAFVEFSNQIDA---------EAACDSMNDQDL----------------MGSKLR 77 (207)
Q Consensus 34 G~V~~v~i~~d-----------~rG~afV~F~~~e~A---------~~Al~~l~g~~i----------------~g~~l~ 77 (207)
|..++|+|+++ -.+||||.--++.++ -..|+++|++.| .+..++
T Consensus 125 Gq~kEv~v~KsedalGlTITDNG~GyAFIKrIkegsvidri~~i~VGd~IEaiNge~ivG~RHYeVArmLKel~rge~ft 204 (334)
T KOG3938|consen 125 GQAKEVEVVKSEDALGLTITDNGAGYAFIKRIKEGSVIDRIEAICVGDHIEAINGESIVGKRHYEVARMLKELPRGETFT 204 (334)
T ss_pred CcceeEEEEecccccceEEeeCCcceeeeEeecCCchhhhhhheeHHhHHHhhcCccccchhHHHHHHHHHhcccCCeeE
Confidence 55677877776 468999976544333 345567787766 345677
Q ss_pred EEEeecCCC
Q psy2347 78 VEISRGRGR 86 (207)
Q Consensus 78 V~~a~~k~~ 86 (207)
+++..||..
T Consensus 205 lrLiePk~a 213 (334)
T KOG3938|consen 205 LRLIEPKSA 213 (334)
T ss_pred EEeeccccc
Confidence 788777643
No 282
>KOG2135|consensus
Probab=31.95 E-value=20 Score=32.78 Aligned_cols=53 Identities=21% Similarity=0.199 Sum_probs=37.1
Q ss_pred cCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCC
Q psy2347 19 ETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMG 73 (207)
Q Consensus 19 ~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g 73 (207)
....++.+..+|++++.++..+++....++..|.++ +.|.++|..++..+|.-
T Consensus 207 ks~~s~~r~k~fee~g~~~r~el~p~~hg~~~vv~~--enan~~m~s~da~ei~~ 259 (526)
T KOG2135|consen 207 KSRNSENRRKFFEEFGVLERGELCPTHHGCVPVVSK--ENANKTMKSEDAAEIMK 259 (526)
T ss_pred cccccHHhhhhhHhhceeeeccccccccccceeEee--ccccccccCCcchhhhh
Confidence 456678888999999998877766554555566665 77777777665555543
No 283
>PF02426 MIase: Muconolactone delta-isomerase; InterPro: IPR003464 This small enzyme forms a homodecameric complex, that catalyses the third step in the catabolism of catechol to succinate- and acetyl-coa in the beta-ketoadipate pathway (5.3.3.4 from EC). The protein has a ferredoxin-like fold according to SCOP.; GO: 0006725 cellular aromatic compound metabolic process
Probab=31.91 E-value=1.8e+02 Score=20.39 Aligned_cols=56 Identities=9% Similarity=0.205 Sum_probs=40.3
Q ss_pred CCCcCCcHHHHHHH----------HHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCCcee
Q psy2347 16 GLTETVKKEDLELE----------FEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMNDQDL 71 (207)
Q Consensus 16 nLp~~~te~~L~~~----------F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g~~i 71 (207)
+||.+++.+++.++ +.+-|.+..+--+.. ...++.++-+|.++..+.|..|.-...
T Consensus 10 ~~P~~~~~~~~~~~~a~E~~~a~eLq~~G~~~~lWr~~G~~~n~~Ifdv~d~~eLh~lL~sLPL~p~ 76 (91)
T PF02426_consen 10 NVPPDMPPEEVDRLKAREKARAQELQRQGKWRHLWRVVGRYANVSIFDVEDNDELHELLSSLPLFPY 76 (91)
T ss_pred eCCCCCCHHHHHHHHHHHHHHHHHHHHCCeeeEEEEecCCcceEEEEECCCHHHHHHHHHhCCCccc
Confidence 67888887765544 345677887754444 567888888999999999888765543
No 284
>PRK13014 methionine sulfoxide reductase A; Provisional
Probab=31.79 E-value=1.5e+02 Score=23.73 Aligned_cols=71 Identities=14% Similarity=0.091 Sum_probs=34.5
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEE-eeC-CCcEEEEEE-cCHHH---HHHHHHhcCCceeCCcEEEEEEe
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWV-AFN-PPGFAFVEF-SNQID---AEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i-~~d-~rG~afV~F-~~~e~---A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
.|-|-==|..++-++|.++|-+.-+-+.+.- -.| ..-|.-+.| .++++ |+++++.|+......++|.+++.
T Consensus 61 ~V~V~yDp~~iSy~~LL~~Ff~~hDPt~~~~Qg~D~G~QYRS~If~~~~eQ~~~a~~~~~~~~~~~~~~~~i~Tei~ 137 (186)
T PRK13014 61 AVQITYDPKQVSYENLLQIFFSTHDPTQLNRQGPDRGEQYRSAIFYHDEEQKKVAEAYIAQLDEAGIFKKPIVTPIK 137 (186)
T ss_pred EEEEEECCCcCCHHHHHHHHHHhcCCCccCCCCCCCCCCceEEEEeCCHHHHHHHHHHHHHHHhccccCCCcEEEEe
Confidence 3555444677888888887765433222211 112 122433333 34444 44445554433223456666654
No 285
>PF05336 DUF718: Domain of unknown function (DUF718); InterPro: IPR008000 Mutarotases are enzymes which interconvert the alpha and beta stereoisomers of monosaccharides, enhancing the rate of their metabolism. Proteins in this entry are homologues of the rhamnose mutarotase YiiL (P32156 from SWISSPROT) from Escherichia coli, and are often encoded in rhamnose utilisation operons. YiiL is an enzyme which interconverts the alpha and beta stereoisomers of the pyranose form of L-rhamnose []. It is not required for growth on rhamnose, but allows cells to utilise this carbon source more efficiently [].The structure of YiiL is distinct from other mutarotases, forming an asymmetric dimmer stabilised by an intermolecular beta-sheet, hydrophobic interactions and a salt bridge [].; GO: 0016857 racemase and epimerase activity, acting on carbohydrates and derivatives, 0019299 rhamnose metabolic process, 0005737 cytoplasm; PDB: 2QLX_A 2QLW_B 1X8D_B.
Probab=31.70 E-value=95 Score=22.21 Aligned_cols=40 Identities=13% Similarity=0.079 Sum_probs=26.1
Q ss_pred HHHHHHHHHhCCCeeEEEEeeC-C--CcEEEEEEcCHHHHHHHH
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFN-P--PGFAFVEFSNQIDAEAAC 63 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d-~--rG~afV~F~~~e~A~~Al 63 (207)
..+|.+.+...| |.+..|..+ . .-|++++.++.+.+.+++
T Consensus 25 WPEv~~~l~~~G-i~~ysIf~~g~~~~LF~~~E~~~~~~~~~~l 67 (106)
T PF05336_consen 25 WPEVLAALREAG-IRNYSIFRDGDTGRLFMYMETDDFDADMAAL 67 (106)
T ss_dssp -HHHHHHHHHCT-EEEEEEEEETTTTEEEEEEEECT-CHHHHHG
T ss_pred CHHHHHHHHHCC-CeEEEEEEeCCCCEEEEEEEecChhhHHHHc
Confidence 456777777776 677666655 3 678888888844444444
No 286
>cd04905 ACT_CM-PDT C-terminal ACT domain of the bifunctional chorismate mutase-prephenate dehydratase (CM-PDT) enzyme and the prephenate dehydratase (PDT) enzyme. The C-terminal ACT domain of the bifunctional chorismate mutase-prephenate dehydratase (CM-PDT) enzyme and the prephenate dehydratase (PDT) enzyme, found in plants, fungi, bacteria, and archaea. The P-protein of E. coli (CM-PDT, PheA) catalyzes the conversion of chorismate to prephenate and then the decarboxylation and dehydration to form phenylpyruvate. These are the first two steps in the biosynthesis of L-Phe and L-Tyr via the shikimate pathway in microorganisms and plants. The E. coli P-protein (CM-PDT) has three domains with an N-terminal domain with chorismate mutase activity, a middle domain with prephenate dehydratase activity, and an ACT regulatory C-terminal domain. The prephenate dehydratase enzyme has a PDT and ACT domain. The ACT domain is essential to bring about the negative allosteric regulation by L-Phe bindi
Probab=31.35 E-value=1.6e+02 Score=19.40 Aligned_cols=47 Identities=15% Similarity=0.162 Sum_probs=28.6
Q ss_pred cHHHHHHHHHhCC-CeeEEEEeeC---C-CcEEEEEEcC---HHHHHHHHHhcCC
Q psy2347 22 KKEDLELEFEKFG-KLNSVWVAFN---P-PGFAFVEFSN---QIDAEAACDSMND 68 (207)
Q Consensus 22 te~~L~~~F~~~G-~V~~v~i~~d---~-rG~afV~F~~---~e~A~~Al~~l~g 68 (207)
+-.+|.++|++++ .|.++..... . .-.-+|+++. .++.+++++.|+.
T Consensus 14 ~L~~il~~f~~~~ini~~i~s~p~~~~~~~~~f~vd~~~~~~~~~~~~~l~~l~~ 68 (80)
T cd04905 14 ALYDVLGVFAERGINLTKIESRPSKGGLWEYVFFIDFEGHIEDPNVAEALEELKR 68 (80)
T ss_pred HHHHHHHHHHHCCcCEEEEEEEEcCCCCceEEEEEEEECCCCCHHHHHHHHHHHH
Confidence 3467778888875 4666654433 1 1244566763 5677777777654
No 287
>KOG2671|consensus
Probab=30.92 E-value=66 Score=28.62 Aligned_cols=45 Identities=20% Similarity=0.188 Sum_probs=27.6
Q ss_pred cCCcHHHHHHHHHhCCC-eeEEEEeeCCCcEEEEEEcCHHHHHHHH
Q psy2347 19 ETVKKEDLELEFEKFGK-LNSVWVAFNPPGFAFVEFSNQIDAEAAC 63 (207)
Q Consensus 19 ~~~te~~L~~~F~~~G~-V~~v~i~~d~rG~afV~F~~~e~A~~Al 63 (207)
.+....||+.+.+-|+. |...+-......|-+|+|+++++|.+.+
T Consensus 12 l~Fr~pE~eala~lF~~~i~~sq~~~t~~PF~iv~~~s~~~Ar~~~ 57 (421)
T KOG2671|consen 12 LDFRLPELEALASLFGLQIDESQKLHTTSPFFIVEFESEEIARNLI 57 (421)
T ss_pred hccchHHHHHHHHHHhhhhhhhhhhcCCCceEEEEeCchHHHHHHH
Confidence 34445566666555553 3322211114679999999999998765
No 288
>PRK12378 hypothetical protein; Provisional
Probab=30.59 E-value=1.7e+02 Score=24.25 Aligned_cols=56 Identities=27% Similarity=0.277 Sum_probs=33.3
Q ss_pred CCCcEEEEcCCCcCC--cHHHHHHHHHhCCCe----eEEEEeeCCCcEEEEEEcCHHHHHHH
Q psy2347 7 ERGTRVYVGGLTETV--KKEDLELEFEKFGKL----NSVWVAFNPPGFAFVEFSNQIDAEAA 62 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~--te~~L~~~F~~~G~V----~~v~i~~d~rG~afV~F~~~e~A~~A 62 (207)
+....|.|--|..+. |..+|+.+|.+++-- -.|..+.+.+++..|.-.+++++..+
T Consensus 89 PgGvaiiVe~lTDN~nRt~~~vr~~f~K~gg~l~~~gsv~~~Fe~kG~i~i~~~~~d~~~e~ 150 (235)
T PRK12378 89 PNGVMVIVECLTDNVNRTVANVRSAFNKNGGNLGTSGSVAFMFDHKGVFVFEGDDEDELLEA 150 (235)
T ss_pred CCCcEEEEEECCCCHHHHHHHHHHHHhhcCCeECCCCceeeeeecceEEEeCCCCHHHHHHH
Confidence 344566676666553 568999999998652 23566665555555543454544443
No 289
>smart00457 MACPF membrane-attack complex / perforin.
Probab=30.51 E-value=1.1e+02 Score=24.04 Aligned_cols=26 Identities=15% Similarity=0.282 Sum_probs=20.8
Q ss_pred cCCCcCCcHHHHHHHHHhCCC--eeEEE
Q psy2347 15 GGLTETVKKEDLELEFEKFGK--LNSVW 40 (207)
Q Consensus 15 ~nLp~~~te~~L~~~F~~~G~--V~~v~ 40 (207)
..||...+.+++.++|++||. |+++.
T Consensus 31 ~~Lp~~~~~~~~~~fi~~yGTH~i~s~~ 58 (194)
T smart00457 31 RDLPDQYNRGAYARFIDKYGTHYITSAT 58 (194)
T ss_pred HhCccccCHHHHHHHHHHhCCeEEEeee
Confidence 478888888999999999997 44443
No 290
>CHL00128 psbW photosystem II protein W; Reviewed
Probab=30.49 E-value=1.5e+02 Score=21.82 Aligned_cols=27 Identities=19% Similarity=0.202 Sum_probs=20.3
Q ss_pred eeEEEEeeC---CCcEEEEEEcCHHHHHHH
Q psy2347 36 LNSVWVAFN---PPGFAFVEFSNQIDAEAA 62 (207)
Q Consensus 36 V~~v~i~~d---~rG~afV~F~~~e~A~~A 62 (207)
|.+|+|.+. ..+.|...|+++.+.++.
T Consensus 15 ip~VrLtRsrdg~~g~a~f~F~~p~al~~~ 44 (113)
T CHL00128 15 IPDVRLTRSRDGSTGTATFRFKNPNILDKS 44 (113)
T ss_pred CCceEEEEccCCCceEEEEEECCchhhhhc
Confidence 456666655 778999999998876653
No 291
>PRK02302 hypothetical protein; Provisional
Probab=30.35 E-value=2e+02 Score=20.24 Aligned_cols=54 Identities=19% Similarity=0.233 Sum_probs=32.5
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCce
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQD 70 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~ 70 (207)
....-.||+..+. +++ .+.+||.|..+.-. ..|+ |.|-+.++|++.+++|....
T Consensus 6 ~R~glIVyl~~~k------~~r-~LrkfG~I~Y~Skk---~kYv-vlYvn~~~~e~~~~kl~~l~ 59 (89)
T PRK02302 6 ERIGLIVYLYYNR------DAR-KLSKYGDIVYHSKR---SRYL-VLYVNKEDVEQKLEELSKLK 59 (89)
T ss_pred ceeEEEEEEeecH------hHH-HHhhcCcEEEEecc---ccEE-EEEECHHHHHHHHHHHhcCC
Confidence 3344456664432 232 35789998876321 2244 45778888888888876544
No 292
>PF01037 AsnC_trans_reg: AsnC family; InterPro: IPR019887 The many bacterial transcription regulation proteins which bind DNA through a 'helix-turn-helix' motif can be classified into subfamilies on the basis of sequence similarities. One such family is the AsnC/Lrp subfamily []. The Lrp family of transcriptional regulators appears to be widely distributed among bacteria and archaea, as an important regulatory system of the amino acid metabolism and related processes []. Members of the Lrp family are small DNA-binding proteins with molecular masses of around 15 kDa. Target promoters often contain a number of binding sites that typically lack obvious inverted repeat elements, and to which binding is usually co-operative. LrpA from Pyrococcus furiosus is the first Lrp-like protein to date of which a three-dimensional structure has been solved. In the crystal structure LrpA forms an octamer consisting of four dimers. The structure revealed that the N-terminal part of the protein consists of a helix-turn-helix (HTH) domain, a fold generally involved in DNA binding. The C terminus of Lrp-like proteins has a beta-fold, where the two alpha-helices are located at one side of the four-stranded antiparallel beta-sheet. LrpA forms a homodimer mainly through interactions between the beta-strands of this C-terminal domain, and an octamer through further interactions between the second alpha-helix and fourth beta-strand of the motif. Hence, the C-terminal domain of Lrp-like proteins appears to be involved in ligand-response and activation [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 2DJW_F 2GQQ_A 2ZNY_E 2ZNZ_G 1RI7_A 2CYY_A 2E1C_A 2CG4_B 2DBB_B 1I1G_A ....
Probab=30.28 E-value=1.5e+02 Score=18.77 Aligned_cols=44 Identities=5% Similarity=-0.022 Sum_probs=36.1
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHh
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~ 65 (207)
..+++.+.+.+.-.|.+|..+.- ..-.+.|.+.|.++.++.+..
T Consensus 11 ~~~~~~~~l~~~p~V~~~~~vtG~~d~~~~v~~~d~~~l~~~i~~ 55 (74)
T PF01037_consen 11 AYDEFAEALAEIPEVVECYSVTGEYDLILKVRARDMEELEEFIRE 55 (74)
T ss_dssp HHHHHHHHHHTSTTEEEEEEESSSSSEEEEEEESSHHHHHHHHHH
T ss_pred hHHHHHHHHHcCCCEEEEEEEeCCCCEEEEEEECCHHHHHHHHHH
Confidence 46788889999999999988776 445678999999999998654
No 293
>PRK14054 methionine sulfoxide reductase A; Provisional
Probab=30.09 E-value=2e+02 Score=22.72 Aligned_cols=71 Identities=10% Similarity=0.106 Sum_probs=34.4
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEE-eeC-CCcEE-EEEEcCHHHH---HHHHHhcCCceeCCcEEEEEEe
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWV-AFN-PPGFA-FVEFSNQIDA---EAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i-~~d-~rG~a-fV~F~~~e~A---~~Al~~l~g~~i~g~~l~V~~a 81 (207)
.|.|-==|..++-++|.++|-+.-+.+.+.- -.| ..-|. -|-+.++++. ++++++++...+..++|.+++.
T Consensus 56 ~V~V~yDp~~isy~~Ll~~f~~~hDPt~~~~Qg~D~G~qYRS~If~~~~~q~~~a~~~~~~~~~~~~~~~~i~Tei~ 132 (172)
T PRK14054 56 AVEITYDPAVISYRELLELFFQIHDPTTLNRQGNDRGTQYRSAIFYHDEEQKEIAEASIAELQASGLFDKPIVTEVE 132 (172)
T ss_pred EEEEEECCCcCCHHHHHHHHHHhCCCCccCCCCCCCCcCceeEEEeCCHHHHHHHHHHHHHHHHhcccCCCcEEEEe
Confidence 4555545777888888887765433322211 112 12233 3334455544 4444444433223455655553
No 294
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=30.02 E-value=1.2e+02 Score=26.95 Aligned_cols=51 Identities=14% Similarity=0.024 Sum_probs=35.2
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcE
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSK 75 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~ 75 (207)
+-++|.++|+.---+..+....+. -||.|.++.+.++-|...++..+..+.
T Consensus 263 ~Y~~Le~HF~~~hy~ct~qtc~~~---k~~vf~~~~el~~h~~~~h~~~~~~~~ 313 (493)
T COG5236 263 SYEDLEAHFRNAHYCCTFQTCRVG---KCYVFPYHTELLEHLTRFHKVNARLSE 313 (493)
T ss_pred CHHHHHHHhhcCceEEEEEEEecC---cEEEeccHHHHHHHHHHHhhcccccCc
Confidence 446788888776566666655553 378899988888877776766655443
No 295
>PLN02373 soluble inorganic pyrophosphatase
Probab=29.54 E-value=65 Score=25.87 Aligned_cols=37 Identities=19% Similarity=0.129 Sum_probs=26.9
Q ss_pred HHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l 66 (207)
-++|+++|+.|-.++ .|-+.+..|.+.+.|.++|++.
T Consensus 138 l~~I~~fF~~YK~le-------gK~v~v~g~~~~~~A~~~I~~~ 174 (188)
T PLN02373 138 LAEIRRFFEDYKKNE-------NKEVAVNDFLPAEAAIEAIQYS 174 (188)
T ss_pred HHHHHHHHHHhcccC-------CCeEEeCCccCHHHHHHHHHHH
Confidence 467889999885433 2335567889999998888753
No 296
>PF08004 DUF1699: Protein of unknown function (DUF1699); InterPro: IPR012546 This family contains many archaeal proteins which have very conserved sequences.
Probab=29.37 E-value=92 Score=23.38 Aligned_cols=36 Identities=11% Similarity=0.043 Sum_probs=26.9
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN 44 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d 44 (207)
.+....|++ .+..+..||..+.+.|-.|+-|+|+..
T Consensus 16 n~nE~~VHl---AFRPSN~Dif~Lv~~CP~lk~iqiP~S 51 (131)
T PF08004_consen 16 NPNEEIVHL---AFRPSNKDIFSLVERCPNLKAIQIPPS 51 (131)
T ss_pred CCCceEEEE---EecCcchHHHHHHHhCCCCeEEeCChH
Confidence 344555655 455567789999999999999998876
No 297
>KOG4388|consensus
Probab=29.33 E-value=87 Score=29.93 Aligned_cols=58 Identities=10% Similarity=0.204 Sum_probs=38.1
Q ss_pred CCcEEEEcCCCcCCcHH-HHHHHHHhCCCeeEEEEeeCCCcEEEEEE-----cCHHHHHHHHHhc
Q psy2347 8 RGTRVYVGGLTETVKKE-DLELEFEKFGKLNSVWVAFNPPGFAFVEF-----SNQIDAEAACDSM 66 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~-~L~~~F~~~G~V~~v~i~~d~rG~afV~F-----~~~e~A~~Al~~l 66 (207)
+.++|...++.+-+++. .+..-+...|.++.++|+.+ ...+|+.| +..+.+++||+.|
T Consensus 788 Pp~~i~ac~mDP~LDD~vmfA~kLr~lG~~v~l~vle~-lPHGFLnft~ls~E~~~~~~~CI~rl 851 (880)
T KOG4388|consen 788 PPVHIVACAMDPMLDDSVMFARKLRNLGQPVTLRVLED-LPHGFLNFTALSRETRQAAELCIERL 851 (880)
T ss_pred CCceEEEeccCcchhHHHHHHHHHHhcCCceeehhhhc-CCccceeHHhhCHHHHHHHHHHHHHH
Confidence 34555556666555543 34455677899999999887 44667766 4566777777765
No 298
>COG1839 Uncharacterized conserved protein [Function unknown]
Probab=29.09 E-value=2.7e+02 Score=21.47 Aligned_cols=59 Identities=19% Similarity=0.172 Sum_probs=33.2
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEee-CCCcEEEEEEc--CHHHHHHHHH
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAF-NPPGFAFVEFS--NQIDAEAACD 64 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~-d~rG~afV~F~--~~e~A~~Al~ 64 (207)
.++...|.|+.-.+-.|-+||.+.+...-+-.++-|.. ...+-.+|.++ |+|..+.|++
T Consensus 12 ~Pe~~nvIiGqshFIkTVeDL~ealvt~vP~~kfgiAf~EAsg~rLvR~~GND~eL~~lA~e 73 (162)
T COG1839 12 IPEGVNVIIGQSHFIKTVEDLYEALVTAVPGLKFGIAFNEASGPRLVRYTGNDEELVKLAIE 73 (162)
T ss_pred CcCCceEEEeechhheeHHHHHHHHHhcCCCceEEEEeecccCCeeEEecCCcHHHHHHHHH
Confidence 45567888887777778888888776543322222222 23344556554 4444444443
No 299
>PF07521 RMMBL: RNA-metabolising metallo-beta-lactamase; InterPro: IPR011108 The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in IPR001279 from INTERPRO and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in RNA metabolism [].; PDB: 3ZQ4_D 2I7T_A 2I7V_A 2YCB_B 3BK1_A 3T3N_A 3BK2_A 3T3O_A 3AF5_A 3AF6_A ....
Probab=29.08 E-value=88 Score=18.42 Aligned_cols=33 Identities=18% Similarity=0.178 Sum_probs=22.3
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEe
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVA 42 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~ 42 (207)
.++|..-.++-..+.++|+++++..-+ +.|.++
T Consensus 6 ~a~v~~~~fSgHad~~~L~~~i~~~~p-~~vilV 38 (43)
T PF07521_consen 6 RARVEQIDFSGHADREELLEFIEQLNP-RKVILV 38 (43)
T ss_dssp -SEEEESGCSSS-BHHHHHHHHHHHCS-SEEEEE
T ss_pred EEEEEEEeecCCCCHHHHHHHHHhcCC-CEEEEe
Confidence 445555568888999999999988854 555443
No 300
>PRK11558 putative ssRNA endonuclease; Provisional
Probab=29.08 E-value=1.1e+02 Score=21.83 Aligned_cols=51 Identities=20% Similarity=0.408 Sum_probs=30.5
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCC--CeeEEEEeeCCCcEEEEEEcC
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFG--KLNSVWVAFNPPGFAFVEFSN 55 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G--~V~~v~i~~d~rG~afV~F~~ 55 (207)
+.+...-|||++++..+-|...+.+-+.+. .++-+.-..+..+|.|.++-+
T Consensus 23 llEv~~GVyVg~~S~rVRd~lW~~v~~~~~~G~avmv~~~~~eqG~~~~t~G~ 75 (97)
T PRK11558 23 LLEVRAGVYVGDVSRRIREMIWQQVTQLAEEGNVVMAWATNTESGFEFQTFGE 75 (97)
T ss_pred eEecCCCcEEcCCCHHHHHHHHHHHHHhCCCCcEEEEEcCCCCCCcEEEecCC
Confidence 345667899999887776655444444343 333332112256888887765
No 301
>PF14350 Beta_protein: Beta protein
Probab=29.07 E-value=86 Score=27.20 Aligned_cols=37 Identities=11% Similarity=0.099 Sum_probs=24.1
Q ss_pred CcHHHHHHHHHhCC-CeeEEEEeeCCCcEEEEEEcCHHHHH
Q psy2347 21 VKKEDLELEFEKFG-KLNSVWVAFNPPGFAFVEFSNQIDAE 60 (207)
Q Consensus 21 ~te~~L~~~F~~~G-~V~~v~i~~d~rG~afV~F~~~e~A~ 60 (207)
..+++|.+++.... ...+|.|+.| +..|.-.+..++.
T Consensus 137 ~~~~~i~~i~~~l~~~~~~~~lilD---~~~i~~~~~~~~~ 174 (347)
T PF14350_consen 137 DFPSEISRILAALGLSPNEVDLILD---LGDIRDSDESAVA 174 (347)
T ss_pred hHHHHHHHHHHHcCCCccceEEEEE---CCccCCcchHHHH
Confidence 44577888888877 5677888888 5555554444333
No 302
>KOG3062|consensus
Probab=29.02 E-value=1.4e+02 Score=25.20 Aligned_cols=33 Identities=21% Similarity=0.282 Sum_probs=24.3
Q ss_pred EEEcCCCcC---CcHHHHHHHHHhCCCeeEEEEeeC
Q psy2347 12 VYVGGLTET---VKKEDLELEFEKFGKLNSVWVAFN 44 (207)
Q Consensus 12 lfV~nLp~~---~te~~L~~~F~~~G~V~~v~i~~d 44 (207)
|.|.++|.+ +...+|+++|..-+.-..|.|+.|
T Consensus 4 Vvi~G~P~SGKstrA~~L~~~l~~~~~K~~v~ii~d 39 (281)
T KOG3062|consen 4 VVICGLPCSGKSTRAVELREALKERGTKQSVRIIDD 39 (281)
T ss_pred EEEeCCCCCCchhHHHHHHHHHHhhcccceEEEech
Confidence 567788854 566899999998876446777766
No 303
>smart00195 DSPc Dual specificity phosphatase, catalytic domain.
Probab=28.86 E-value=1.4e+02 Score=21.66 Aligned_cols=18 Identities=17% Similarity=0.388 Sum_probs=13.2
Q ss_pred EEEEcCCCcCCcHHHHHH
Q psy2347 11 RVYVGGLTETVKKEDLEL 28 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~ 28 (207)
.|||+++|...+.+.|++
T Consensus 7 ~l~~G~~~~~~~~~~l~~ 24 (138)
T smart00195 7 HLYLGSYSSALNLALLKK 24 (138)
T ss_pred CeEECChhHcCCHHHHHH
Confidence 599999987776555544
No 304
>PF07045 DUF1330: Protein of unknown function (DUF1330); InterPro: IPR010753 This family consists of several hypothetical bacterial proteins of around 90 residues in length. The function of this family is unknown.; PDB: 2FIU_B 3HHL_A 3DCA_D 3LO3_I.
Probab=28.83 E-value=1.6e+02 Score=18.77 Aligned_cols=40 Identities=23% Similarity=0.212 Sum_probs=25.1
Q ss_pred HHHHHHHhCCCe-e----EEEEeeC---CCcEEEEEEcCHHHHHHHHH
Q psy2347 25 DLELEFEKFGKL-N----SVWVAFN---PPGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 25 ~L~~~F~~~G~V-~----~v~i~~d---~rG~afV~F~~~e~A~~Al~ 64 (207)
.+..++++|+-- . ..+++.. ..-..+|+|.|.++|++..+
T Consensus 10 ~~~~~l~~~GG~~l~~~~~~~~leG~~~~~~~viieFPs~~aa~~~~~ 57 (65)
T PF07045_consen 10 AVPPILEKYGGRVLARGGEPEVLEGDWDPDRVVIIEFPSMEAAKAWYN 57 (65)
T ss_dssp HHHHHHHHTT-EEEEECEEEEEEEST-SSSEEEEEEESSHHHHHHHHC
T ss_pred HHHHHHHHcCCEEEEECCceeEEecCCCCCeEEEEECCCHHHHHHHHC
Confidence 455677777742 1 1222222 45688999999999988764
No 305
>KOG0256|consensus
Probab=28.80 E-value=1e+02 Score=28.00 Aligned_cols=76 Identities=16% Similarity=0.188 Sum_probs=41.5
Q ss_pred CCcEEEEcC----CCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 8 RGTRVYVGG----LTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 8 ~~~~lfV~n----Lp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
+.+-|.|.| |-..++.++|.+++. |-.-++|+|+.| .-|+--.|+.. .-...++..+...+.-..|+|-++.+
T Consensus 226 kVkGvlitNPsNPLG~~~~~e~L~~ll~-Fa~~kniHvI~D-EIya~sVF~~~-~F~Sv~ev~~~~~~~~~rvHivyslS 302 (471)
T KOG0256|consen 226 KVKGVLITNPSNPLGTTLSPEELISLLN-FASRKNIHVISD-EIYAGSVFDKS-EFRSVLEVRKDPHLDPDRVHIVYSLS 302 (471)
T ss_pred ceeEEEEeCCCCCCCCccCHHHHHHHHH-HHhhcceEEEee-hhhcccccCcc-CceEHHHHhhccccCCCcEEEEEEec
Confidence 334455655 334567888887764 444577888887 22333333332 22222222233334566788888877
Q ss_pred CCC
Q psy2347 84 RGR 86 (207)
Q Consensus 84 k~~ 86 (207)
|.-
T Consensus 303 KD~ 305 (471)
T KOG0256|consen 303 KDF 305 (471)
T ss_pred ccc
Confidence 653
No 306
>KOG2854|consensus
Probab=28.34 E-value=47 Score=29.08 Aligned_cols=60 Identities=22% Similarity=0.235 Sum_probs=35.7
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC---------------CCcEEEEEEcCHHHHHHHHHh
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN---------------PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d---------------~rG~afV~F~~~e~A~~Al~~ 65 (207)
.++...+||..+-..+..+.|+.+-+..-+..++.++.- .-.|+.|.|.++++|++.++.
T Consensus 157 veka~v~yv~Gffltv~p~ai~~v~qh~~e~~r~~~lnlsapfI~q~~~~~l~~v~~y~DiifgNe~EA~af~~~ 231 (343)
T KOG2854|consen 157 VEKAKVFYVAGFFLTVSPDAIRKVAQHAAENNRVFTLNLSAPFISQFFKDALDKVLPYADIIFGNEDEAAAFARA 231 (343)
T ss_pred hhheeEEEEEEEEEEeChHHHHHHHHHHHHhcchhheeccchhHHHHHHHHHHhhcCcceEEEcCHHHHHHHHHh
Confidence 345566777777666665555544433322222211111 346899999999999887765
No 307
>smart00650 rADc Ribosomal RNA adenine dimethylases.
Probab=28.15 E-value=89 Score=23.84 Aligned_cols=24 Identities=21% Similarity=0.072 Sum_probs=19.4
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhC
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKF 33 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~ 33 (207)
..+.|.|+|..++.+.|..+++..
T Consensus 78 ~d~vi~n~Py~~~~~~i~~~l~~~ 101 (169)
T smart00650 78 PYKVVGNLPYNISTPILFKLLEEP 101 (169)
T ss_pred CCEEEECCCcccHHHHHHHHHhcC
Confidence 356789999999998898887653
No 308
>PRK08655 prephenate dehydrogenase; Provisional
Probab=28.13 E-value=1.1e+02 Score=27.73 Aligned_cols=26 Identities=4% Similarity=0.046 Sum_probs=14.9
Q ss_pred CCcCCcHHHHHHHHHhCCCeeEEEEe
Q psy2347 17 LTETVKKEDLELEFEKFGKLNSVWVA 42 (207)
Q Consensus 17 Lp~~~te~~L~~~F~~~G~V~~v~i~ 42 (207)
+|.+++.++|.+++.+.-.+++|.++
T Consensus 369 v~~~~~~~~i~~~i~~~~~l~~v~~f 394 (437)
T PRK08655 369 FPKGADPEVILDLLDNLKHVFDIEII 394 (437)
T ss_pred ECCCCCHHHHHHHHhcCCCcceEEEE
Confidence 35666666666666665334555443
No 309
>TIGR02542 B_forsyth_147 Bacteroides forsythus 147-residue repeat. The longest predicted protein in Bacteroides forsythus ATCC 43037 is over 3000 residues long and lacks homology to other known proteins. Immediately after the signal sequence are four tandem repeats, approximately 147 residues long. This model describes that repeat. This model describes that repeat.
Probab=28.11 E-value=31 Score=25.56 Aligned_cols=40 Identities=15% Similarity=0.180 Sum_probs=26.7
Q ss_pred CcCCcHHHHHHHHHh---CCCeeEEEEeeC-----CCcEEEEEEcCHH
Q psy2347 18 TETVKKEDLELEFEK---FGKLNSVWVAFN-----PPGFAFVEFSNQI 57 (207)
Q Consensus 18 p~~~te~~L~~~F~~---~G~V~~v~i~~d-----~rG~afV~F~~~e 57 (207)
|..+|..+|+++|++ |--|+.-.|.+| +-..||+.|....
T Consensus 83 PYTlT~~e~r~iF~Epm~YQGITReQV~rdGLP~GsYRiCFrL~~~~~ 130 (145)
T TIGR02542 83 PYTLTYNELRQIFREPMVYQGITREQVQRDGLPEGSYRICFRLFNATQ 130 (145)
T ss_pred ceeeeHHHHHHHHhhhhhhccccHHHHhhcCCCCCceEEEEEEeccch
Confidence 567899999999985 434444444444 3357888887553
No 310
>PF09122 DUF1930: Domain of unknown function (DUF1930); InterPro: IPR015206 This entry represents a domain found in 3-mercaptopyruvate sulphurtransferase which has no known function. This domain adopts a structure consisting of a four-stranded antiparallel beta-sheet and an alpha-helix, arranged in a beta(2)-alpha-beta(2) fashion, and bearing a remarkable structural similarity to the FK506-binding protein class of peptidylprolyl cis/trans-isomerase []. ; PDB: 1OKG_A.
Probab=28.07 E-value=1.8e+02 Score=19.10 Aligned_cols=26 Identities=12% Similarity=0.162 Sum_probs=12.8
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhC
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKF 33 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~ 33 (207)
.+.+|.|-..+-.-.++||+..|...
T Consensus 19 ~~~tl~vDg~~v~~PD~El~sA~~Hl 44 (68)
T PF09122_consen 19 DNATLIVDGEIVENPDAELKSALVHL 44 (68)
T ss_dssp TT--EEETTEEESS--HHHHHHHTT-
T ss_pred cceEEEEcCeEcCCCCHHHHHHHHHh
Confidence 34466666655556666676666543
No 311
>KOG3702|consensus
Probab=27.84 E-value=31 Score=32.81 Aligned_cols=66 Identities=18% Similarity=0.180 Sum_probs=46.1
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC-----CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEE
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN-----PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLR 77 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d-----~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~ 77 (207)
+||+.+-....+..-|..++..+.+++..+++.. ...-++++|..+.+++.|.. |....+..+.|+
T Consensus 513 ~i~~~~~~~~s~~~s~s~~s~~~~~ltk~k~l~~Cky~~~Ct~a~Ce~~HPtaa~~~~s-~p~k~fa~~~~k 583 (681)
T KOG3702|consen 513 TIFVANGHGGSNPDSLSRHSEKKNELTKAKILTRCKYGPACTSAECEFAHPTAAENAKS-LPNKKFASKCLK 583 (681)
T ss_pred ceecccccccCCCcchhhCcccccccccceeeccccCCCcCCchhhhhcCCcchhhhhc-ccccccccccee
Confidence 7888777777777778888888888887777665 33468999998887766543 445544444333
No 312
>PF12993 DUF3877: Domain of unknown function, E. rectale Gene description (DUF3877); InterPro: IPR024539 This entry represents proteins of unknown function found primarily in Firmicutes. The Eubacterium rectale gene appears to be upregulated in the presence of Bacteroides thetaiotaomicron compared to growth in pure culture [].
Probab=27.77 E-value=1.4e+02 Score=23.63 Aligned_cols=25 Identities=16% Similarity=-0.000 Sum_probs=19.0
Q ss_pred cCCcHHHHHHHHHhCCCeeEEEEee
Q psy2347 19 ETVKKEDLELEFEKFGKLNSVWVAF 43 (207)
Q Consensus 19 ~~~te~~L~~~F~~~G~V~~v~i~~ 43 (207)
..+|-++|+++|.+|.+-..++-+.
T Consensus 107 hgcT~e~I~~~F~~ys~~~~~e~~~ 131 (175)
T PF12993_consen 107 HGCTLEDILELFHKYSDNVHCEEMD 131 (175)
T ss_pred CCcCHHHHHHHHHHhcCCeEEEeec
Confidence 5789999999999998754444333
No 313
>PTZ00338 dimethyladenosine transferase-like protein; Provisional
Probab=27.76 E-value=66 Score=27.52 Aligned_cols=27 Identities=7% Similarity=-0.048 Sum_probs=20.9
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCee
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLN 37 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~ 37 (207)
.+.|.|||+.++...|.++++..-.+.
T Consensus 103 d~VvaNlPY~Istpil~~ll~~~~~~~ 129 (294)
T PTZ00338 103 DVCVANVPYQISSPLVFKLLAHRPLFR 129 (294)
T ss_pred CEEEecCCcccCcHHHHHHHhcCCCCc
Confidence 467899999999999998886533333
No 314
>PF03108 DBD_Tnp_Mut: MuDR family transposase; InterPro: IPR004332 The plant MuDR transposase domain is present in plant proteins that are presumed to be the transposases for Mutator transposable elements [, ]. The function of these proteins is unknown. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=27.60 E-value=68 Score=20.56 Aligned_cols=16 Identities=13% Similarity=0.194 Sum_probs=12.7
Q ss_pred EEEcCHHHHHHHHHhc
Q psy2347 51 VEFSNQIDAEAACDSM 66 (207)
Q Consensus 51 V~F~~~e~A~~Al~~l 66 (207)
.+|.+.++++.||..+
T Consensus 8 ~~F~~~~e~k~av~~y 23 (67)
T PF03108_consen 8 QTFPSKEEFKEAVREY 23 (67)
T ss_pred CEECCHHHHHHHHHHH
Confidence 3788999999988753
No 315
>KOG2040|consensus
Probab=27.56 E-value=23 Score=34.01 Aligned_cols=56 Identities=20% Similarity=0.243 Sum_probs=38.8
Q ss_pred CCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------------------CCcEEEEEEcCHHHHHHHHHhcCCcee
Q psy2347 16 GLTETVKKEDLELEFEKFGKLNSVWVAFN--------------------PPGFAFVEFSNQIDAEAACDSMNDQDL 71 (207)
Q Consensus 16 nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------------------~rG~afV~F~~~e~A~~Al~~l~g~~i 71 (207)
.|.+.++++||.++|.-|.+-..++++.. .+.-.|-.+.++.+..+-+++|.++.|
T Consensus 460 slDETv~~~DvddLl~vf~~~ss~~~~~E~~~~e~~~~~~s~f~Rts~~L~hpVFn~yhSEt~lvRYm~kLenKDl 535 (1001)
T KOG2040|consen 460 SLDETVTEKDVDDLLWVFNEESSVELVAESMGEECNGIPPSVFKRTSPYLTHPVFNSYHSETELVRYMKKLENKDL 535 (1001)
T ss_pred eecccccHHHHHHHHHHHccCChHHHHHHhhhhhccCCCCcccccCCccccchhhhhcccHHHHHHHHHHhhccch
Confidence 37888999999999988864222221111 223457778888999999988888765
No 316
>PF13549 ATP-grasp_5: ATP-grasp domain; PDB: 1WR2_A.
Probab=27.52 E-value=1.5e+02 Score=24.26 Aligned_cols=48 Identities=15% Similarity=0.180 Sum_probs=0.0
Q ss_pred CcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecC
Q psy2347 21 VKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGR 84 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k 84 (207)
++|.+.+++|+.||- +--..+...+.++|.++.+.+ +.++.++...|.
T Consensus 10 L~e~e~~~lL~~yGI----------~~~~~~~~~~~~ea~~~a~~i------g~PvvlKi~sp~ 57 (222)
T PF13549_consen 10 LTEAEAKELLAAYGI----------PVPPTRLVTSAEEAVAAAEEI------GFPVVLKIVSPD 57 (222)
T ss_dssp E-HHHHHHHHHTTT----------------EEESSHHHHHHHHHHH-------SSEEEEEE-TT
T ss_pred cCHHHHHHHHHHcCc----------CCCCeeEeCCHHHHHHHHHHh------CCCEEEEEecCC
No 317
>PF02222 ATP-grasp: ATP-grasp domain; InterPro: IPR003135 The ATP-grasp domain has an unusual nucleotide-binding fold, also referred to as palmate, and is found in a superfamily of enzymes including D-alanine-D-alanine ligase, glutathione synthetase, biotin carboxylase, and carbamoyl phosphate synthetase, the ribosomal protein S6 modification enzyme (RimK), urea amidolyase, tubulin-tyrosine ligase, and three enzymes of purine biosynthesis. This family does not contain all known ATP-grasp domain members. All the enzymes of this family possess ATP-dependent carboxylate-amine ligase activity, and their catalytic mechanisms are likely to include acylphosphate intermediates.; PDB: 3K5H_C 3K5I_C 3AX6_A 3Q2O_B 3QFF_B 3R5H_A 3ORQ_B 3ORR_B 4E4T_B 2Z04_A ....
Probab=27.29 E-value=1.8e+02 Score=22.69 Aligned_cols=63 Identities=19% Similarity=0.139 Sum_probs=37.7
Q ss_pred CcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCCcee-------CCcEEEEEEeec
Q psy2347 21 VKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMNDQDL-------MGSKLRVEISRG 83 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g~~i-------~g~~l~V~~a~~ 83 (207)
.+.++|++...+++--.-++.... ..+..-+...+++++++|+..+....+ ..+.|.|-.++.
T Consensus 15 ~~~~~l~~a~~~iG~P~vlK~~~~GYDGkGq~~i~~~~dl~~a~~~~~~~~~ilE~~v~f~~EiSvivaR~ 85 (172)
T PF02222_consen 15 DSLEDLEEAAESIGFPAVLKTRRGGYDGKGQFVIRSEEDLEKAWQELGGGPCILEEFVPFDREISVIVARD 85 (172)
T ss_dssp SSHHHHHHHHHHHTSSEEEEESSSSCTTTTEEEESSGGGHHHHHHHTTTSCEEEEE---ESEEEEEEEEEE
T ss_pred CCHHHHHHHHHHcCCCEEEEccCcCcCCCccEEECCHHHHHHHHHhcCCCcEEEEeccCCcEEEEEEEEEc
Confidence 467899999998775444443222 223334455788899999988743321 245555555553
No 318
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=27.21 E-value=89 Score=23.15 Aligned_cols=33 Identities=24% Similarity=0.013 Sum_probs=26.1
Q ss_pred EEEcCCCcC-CcHHHHHHHHHhCCCeeEEEEeeC
Q psy2347 12 VYVGGLTET-VKKEDLELEFEKFGKLNSVWVAFN 44 (207)
Q Consensus 12 lfV~nLp~~-~te~~L~~~F~~~G~V~~v~i~~d 44 (207)
|-|.+||.. .+++.|+++.+.+|++.+++..+.
T Consensus 107 Vri~glP~~~~~~~~~~~i~~~iG~~i~vD~~t~ 140 (153)
T PF14111_consen 107 VRIYGLPLHLWSEEILKAIGSKIGEPIEVDENTL 140 (153)
T ss_pred hhhccCCHHHhhhHHHHHHHHhcCCeEEEEcCCC
Confidence 445688887 677889999999999988876544
No 319
>PF04940 BLUF: Sensors of blue-light using FAD; InterPro: IPR007024 An FAD-binding domain, BLUF, exemplified by the N terminus of the AppA protein, (Q53119 from SWISSPROT), from Rhodobacter sphaeroides, is present in various proteins, primarily from Bacteria. The BLUF domain is involved in sensing blue-light (and possibly redox) using FAD and is similar to the flavin-binding PAS domains and cryptochromes. The predicted secondary structure reveals that the BLUF domain is a novel FAD-binding fold [].; PDB: 2IYG_A 2IYI_B 1X0P_A 2HFN_G 3MZI_A 2HFO_E 3GFZ_A 3GG1_B 2KB2_A 3GFY_A ....
Probab=27.11 E-value=2.2e+02 Score=19.77 Aligned_cols=54 Identities=17% Similarity=0.198 Sum_probs=32.6
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCC------CeeEEEEeeCCCcEEEEEEc-CHHHHHHHHHh
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFG------KLNSVWVAFNPPGFAFVEFS-NQIDAEAACDS 65 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G------~V~~v~i~~d~rG~afV~F~-~~e~A~~Al~~ 65 (207)
+-+||+.++..+++++|.+++.+.- .|+-+-+..+ ++.|=..| +++++++.++.
T Consensus 3 ~l~Y~S~~~~~~~~~~~~~Il~~s~~~N~~~~iTG~Ll~~~--~~F~Q~LEG~~~~v~~l~~r 63 (93)
T PF04940_consen 3 RLIYVSTASEDLSPEDLADILRSSRRNNRRHGITGFLLYDG--GHFFQVLEGPEEAVDALFER 63 (93)
T ss_dssp EEEEEEEE-TTS-HHHHHHHHHHHHHHHHHHTEEEEEEEET--TEEEEEEEEEHHHHHHHHHH
T ss_pred EEEEEEccCCCCCHHHHHHHHHHHHHhhhhcCCEEEEEEeC--CEEEEEEECCHHHHHHHHHH
Confidence 3578888888999999999886542 3777766654 33333344 44444444443
No 320
>KOG4840|consensus
Probab=27.07 E-value=1.1e+02 Score=25.78 Aligned_cols=72 Identities=14% Similarity=0.161 Sum_probs=35.8
Q ss_pred CCCcEEEEcCCCcCCc----HHHHHHHHHhCC-CeeEEEEeeCCCcEEEEEE-cCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 7 ERGTRVYVGGLTETVK----KEDLELEFEKFG-KLNSVWVAFNPPGFAFVEF-SNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~t----e~~L~~~F~~~G-~V~~v~i~~d~rG~afV~F-~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
.+...|||+.|...+- .++|...+.+-. .++.+.+.....+|+.-.. .+.|+.+++|+.+....+.-..|-|
T Consensus 35 ~~~~vvfiGGLgdgLl~~~y~~~L~~~lde~~wslVq~q~~Ssy~G~Gt~slk~D~edl~~l~~Hi~~~~fSt~vVL~ 112 (299)
T KOG4840|consen 35 ESVKVVFIGGLGDGLLICLYTTMLNRYLDENSWSLVQPQLRSSYNGYGTFSLKDDVEDLKCLLEHIQLCGFSTDVVLV 112 (299)
T ss_pred eEEEEEEEcccCCCccccccHHHHHHHHhhccceeeeeeccccccccccccccccHHHHHHHHHHhhccCcccceEEE
Confidence 3467899998865422 233433333221 2333332222334443333 3677778888766555544444433
No 321
>KOG1635|consensus
Probab=27.04 E-value=2.3e+02 Score=22.60 Aligned_cols=60 Identities=12% Similarity=-0.040 Sum_probs=33.9
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEE-EEeeC-CCcE-EEEEEcCHHHHHHHHHhcCCc
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSV-WVAFN-PPGF-AFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v-~i~~d-~rG~-afV~F~~~e~A~~Al~~l~g~ 69 (207)
-.|-|.-=|..++-++|.++|-+.-+.+.+ +.-.| -.-| .-|-|.+.|+++.|.+.|+..
T Consensus 76 EvvrV~ydpk~~sy~~Lld~Fw~~HdPtt~n~QG~D~GtQYRS~I~~~s~eq~k~A~~s~e~~ 138 (191)
T KOG1635|consen 76 EVVRVQYDPKVISYEELLDFFWSRHDPTTLNRQGNDVGTQYRSGIYTYSPEQEKLARESKERE 138 (191)
T ss_pred eEEEEEeCcccccHHHHHHHHHHcCCchhhhccCCcccceeeeeeeeCCHHHHHHHHHHHHHH
Confidence 355666568889999999998765443332 11112 0111 224556677777666665543
No 322
>PF15643 Tox-PL-2: Papain fold toxin 2
Probab=26.98 E-value=52 Score=23.63 Aligned_cols=29 Identities=17% Similarity=0.115 Sum_probs=24.2
Q ss_pred EEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 52 EFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 52 ~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
.|.-.+-|++.++.|....|.++.|+|+-
T Consensus 18 ~~qC~~cA~Al~~~L~~~gI~Gk~i~l~T 46 (100)
T PF15643_consen 18 IFQCVECASALKQFLKQAGIPGKIIRLYT 46 (100)
T ss_pred ceehHHHHHHHHHHHHHCCCCceEEEEEe
Confidence 56667788888888999999999999875
No 323
>cd04880 ACT_AAAH-PDT-like ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH). ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH): Phenylalanine hydroxylases (PAH), tyrosine hydroxylases (TH) and tryptophan hydroxylases (TPH), both peripheral (TPH1) and neuronal (TPH2) enzymes. This family of enzymes shares a common catalytic mechanism, in which dioxygen is used by an active site containing a single, reduced iron atom to hydroxylate an unactivated aromatic substrate, concomitant with a two-electron oxidation of tetrahydropterin (BH4) cofactor to its quinonoid dihydropterin form. Eukaryotic AAAHs have an N-terminal ACT (regulatory) domain, a middle catalytic domain and a C-terminal domain which is responsible for the oligomeric state of the enzyme forming a domain-swapped tetrameric coiled-coil. The PAH, TH, and TPH enzymes contain highly conserved catalytic domains but distinct N-terminal ACT domains and differ in their mech
Probab=26.95 E-value=1.8e+02 Score=18.72 Aligned_cols=47 Identities=17% Similarity=0.128 Sum_probs=29.1
Q ss_pred cHHHHHHHHHhCC-CeeEEEEeeC----CCcEEEEEEcC---HHHHHHHHHhcCC
Q psy2347 22 KKEDLELEFEKFG-KLNSVWVAFN----PPGFAFVEFSN---QIDAEAACDSMND 68 (207)
Q Consensus 22 te~~L~~~F~~~G-~V~~v~i~~d----~rG~afV~F~~---~e~A~~Al~~l~g 68 (207)
+-.+|.+.|++++ .|.+|+-... ..-.-||+++. .+..+++++.|..
T Consensus 12 ~L~~vL~~f~~~~vni~~I~Srp~~~~~~~~~f~id~~~~~~~~~~~~~l~~l~~ 66 (75)
T cd04880 12 ALAKALKVFAERGINLTKIESRPSRKGLWEYEFFVDFEGHIDDPDVKEALEELKR 66 (75)
T ss_pred HHHHHHHHHHHCCCCEEEEEeeecCCCCceEEEEEEEECCCCCHHHHHHHHHHHH
Confidence 4567778888875 3566543332 22356788874 5667777776643
No 324
>KOG2848|consensus
Probab=26.56 E-value=3.2e+02 Score=23.22 Aligned_cols=70 Identities=16% Similarity=0.099 Sum_probs=40.3
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------------CCcEEEEEEcCHHHHHHHHHhcCCce
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--------------PPGFAFVEFSNQIDAEAACDSMNDQD 70 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------------~rG~afV~F~~~e~A~~Al~~l~g~~ 70 (207)
..++...|+|.|- .+.-||..+++-+.+ +|.++.. ..+..||+=.+.++|.+.++.. .++
T Consensus 85 L~~~~p~ViVsNH---QS~LDil~m~~i~p~--~cvviaKr~L~yvp~~gl~m~L~gvvfIdR~r~~~Ai~~l~~~-~~~ 158 (276)
T KOG2848|consen 85 LPKSKPAVIVSNH---QSSLDILGMGSIWPK--NCVVIAKRSLFYVPIFGLAMYLSGVVFIDRSRREKAIDTLDKC-AER 158 (276)
T ss_pred CCccCCeEEEecc---hhHHHHHHHHhhcCC--ceEEEEeeeeeecchHHHHHHHcCceEEecCCHHHHHHHHHHH-HHH
Confidence 3344467888774 455566667666643 2222211 3567788888888888777653 234
Q ss_pred eCCcEEEEEE
Q psy2347 71 LMGSKLRVEI 80 (207)
Q Consensus 71 i~g~~l~V~~ 80 (207)
+.-++++|-+
T Consensus 159 mkk~~~kvWv 168 (276)
T KOG2848|consen 159 MKKENRKVWV 168 (276)
T ss_pred HHhCCeeEEE
Confidence 4444444443
No 325
>KOG1232|consensus
Probab=26.39 E-value=63 Score=29.08 Aligned_cols=50 Identities=18% Similarity=0.179 Sum_probs=36.5
Q ss_pred cCCCcCCcHHHHHHHHH----hCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHH
Q psy2347 15 GGLTETVKKEDLELEFE----KFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 15 ~nLp~~~te~~L~~~F~----~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~ 64 (207)
..|-.+-|--+|+++|- ..|.|+.|.|+.. +...||+-.++-+++++++.
T Consensus 230 ~slRKDNTgydlkhLFIGSEGtlGVvT~vSil~~~kpksvn~af~gi~sf~~v~k~fv 287 (511)
T KOG1232|consen 230 SSLRKDNTGYDLKHLFIGSEGTLGVVTKVSILAPPKPKSVNVAFIGIESFDDVQKVFV 287 (511)
T ss_pred hhhcccCccccchhheecCCceeeEEeeEEEeecCCCcceeEEEEccccHHHHHHHHH
Confidence 34556667778899884 4567888888766 34678998888888887653
No 326
>PRK01688 histidinol-phosphate aminotransferase; Provisional
Probab=26.34 E-value=1.1e+02 Score=26.35 Aligned_cols=51 Identities=22% Similarity=0.189 Sum_probs=30.3
Q ss_pred CCcEEEEcCCC----cCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHH
Q psy2347 8 RGTRVYVGGLT----ETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAAC 63 (207)
Q Consensus 8 ~~~~lfV~nLp----~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al 63 (207)
+.+.|+|.|-. ..++.++|+++.+.+-. .+.|+.| .||++|...+++...+
T Consensus 145 ~~~lv~l~nPnNPTG~~~~~~~l~~l~~~~~~--~~~vivD---Eay~~f~~~~s~~~~~ 199 (351)
T PRK01688 145 GVKVVYVCSPNNPTGNLINPQDLRTLLELTRG--KAIVVAD---EAYIEFCPQASLAGWL 199 (351)
T ss_pred CCcEEEEeCCCCCCCCCCCHHHHHHHHHhCCC--CcEEEEE---CchhhcCCCCChHHHH
Confidence 45667887643 23678899988876533 2345666 5677776433333333
No 327
>cd04489 ExoVII_LU_OBF ExoVII_LU_OBF: A subfamily of OB folds corresponding to the N-terminal OB-fold domain of Escherichia coli exodeoxyribonuclease VII (ExoVII) large subunit. E. coli ExoVII is composed of two non-identical subunits. E. coli ExoVII is a single-strand-specific exonuclease which degrades ssDNA from both 3-prime and 5-prime ends. ExoVII plays a role in methyl-directed mismatch repair in vivo. ExoVII may also guard the genome from mutagenesis by removing excess ssDNA, since the build up of ssDNA would lead to SOS induction and PolIV-dependent mutagenesis.
Probab=26.32 E-value=23 Score=23.18 Aligned_cols=21 Identities=5% Similarity=0.188 Sum_probs=11.2
Q ss_pred CeeEEEEeeCCCcEEEEEEcCHH
Q psy2347 35 KLNSVWVAFNPPGFAFVEFSNQI 57 (207)
Q Consensus 35 ~V~~v~i~~d~rG~afV~F~~~e 57 (207)
.|.++++ + ++++||++.+|..
T Consensus 7 ~v~~i~~-t-k~g~~~~~L~D~~ 27 (78)
T cd04489 7 EISNLKR-P-SSGHLYFTLKDED 27 (78)
T ss_pred EEecCEE-C-CCcEEEEEEEeCC
Confidence 3445544 2 3457777766544
No 328
>PRK00274 ksgA 16S ribosomal RNA methyltransferase KsgA/Dim1 family protein; Reviewed
Probab=26.15 E-value=85 Score=26.26 Aligned_cols=22 Identities=18% Similarity=0.036 Sum_probs=18.9
Q ss_pred EEEEcCCCcCCcHHHHHHHHHh
Q psy2347 11 RVYVGGLTETVKKEDLELEFEK 32 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~ 32 (207)
.+.|+|||+.++.+.|.+++..
T Consensus 107 ~~vv~NlPY~iss~ii~~~l~~ 128 (272)
T PRK00274 107 LKVVANLPYNITTPLLFHLLEE 128 (272)
T ss_pred ceEEEeCCccchHHHHHHHHhc
Confidence 5779999999999888888764
No 329
>PRK04199 rpl10e 50S ribosomal protein L10e; Reviewed
Probab=25.94 E-value=3.4e+02 Score=21.51 Aligned_cols=57 Identities=11% Similarity=0.112 Sum_probs=34.1
Q ss_pred CCcEEEEcCCCcC-CcHHHHHHHH------Hh----CCCeeEEEEeeCCCcEEEEEEc----CHHHHHHHHHh
Q psy2347 8 RGTRVYVGGLTET-VKKEDLELEF------EK----FGKLNSVWVAFNPPGFAFVEFS----NQIDAEAACDS 65 (207)
Q Consensus 8 ~~~~lfV~nLp~~-~te~~L~~~F------~~----~G~V~~v~i~~d~rG~afV~F~----~~e~A~~Al~~ 65 (207)
...+|+|.-.|.. +|+.-+.... +- +|.++....... .+-.++|+. +++.|++||..
T Consensus 80 ~~~k~wIrvfP~~~lt~kp~~~rmGAdRmg~GMr~akG~p~~wva~Vk-~G~ilfei~~~~~~~~~akeAlr~ 151 (172)
T PRK04199 80 SNYHLKIRVYPHHVLRENKMATGAGADRVSDGMRLAFGKPVGTAARVE-KGQKIFTVRVNPEHLEAAKEALRR 151 (172)
T ss_pred cccEEEEEeCCCcceecChhHhhcchhhhccCcccCCCCccEEEEEEC-cCCEEEEEEecCCCHHHHHHHHHH
Confidence 3457888777765 3444443211 12 777766544333 466666775 77788888875
No 330
>cd04859 Prim_Pol Prim_Pol: Primase-polymerase (primpol) domain of the type found in bifunctional replicases from archaeal plasmids, including ORF904 protein of the crenarchaeal plasmid pRN1 from Sulfolobus islandicus (pRN1 primpol). These primpol domains belong to the archaeal/eukaryal primase (AEP) superfamily. This group includes archaeal plasmids and bacteriophage AEPs. The ORF904 protein is a multifunctional protein having ATPase, primase and DNA polymerase activity, and may play a role in the replication of the archaeal plasmid. The pRN1 primpol domain exhibits DNA polymerase and primase activities; a cluster of active site residues (three acidic residues, and a histidine) is required for both these activities. For pRN1 primpol, the primase activity prefers dNTPs to rNTPs; incorporation of dNTPs requires rNTP as cofactor. The pRN1 primpol contains an unusual zinc-binding stem, which is not conserved in other members of this group.
Probab=25.94 E-value=2.4e+02 Score=20.97 Aligned_cols=42 Identities=17% Similarity=0.227 Sum_probs=28.6
Q ss_pred CCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCH-HHHHHH
Q psy2347 20 TVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQ-IDAEAA 62 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~-e~A~~A 62 (207)
.++.++|+++|.+.-. ..|-|++...++..|+++++ +.++..
T Consensus 24 ~~~~~~i~~w~~~~~~-~niGi~~g~s~lvvvDiD~~~~~~~~~ 66 (152)
T cd04859 24 TTDPEQIEAWWRDGPD-ANIGLRTGPSGLVVIDIDVKHDGAAAL 66 (152)
T ss_pred CCCHHHHHHHHHhCCC-ccEEEEcCCCCEEEEeCCCCCCcHHHH
Confidence 5788999999998753 44555554237888899887 444433
No 331
>PF01823 MACPF: MAC/Perforin domain; InterPro: IPR020864 The membrane attack complex/perforin (MACPF) domain is conserved in bacteria, fungi, mammals and plants. It was originally identified and named as being common to five complement components (C6, C7, C8-alpha, C8-beta, and C9) and perforin. These molecules perform critical functions in innate and adaptive immunity. The MAC family proteins and perforin are known to participate in lytic pore formation. In response to pathogen infection, a sequential and highly specific interaction between the constituent elements occurs to form transmembrane channels which are known as the membrane-attack complex (MAC).Only a few other MACPF proteins have been characterised and several are thought to form pores for invasion or protection [, , ]. Examples are proteins from malarial parasites [], the cytolytic toxins from sea anemones [], and proteins that provide plant immunity [, ]. Functionally uncharacterised MACPF proteins are also evident in pathogenic bacteria such as Chlamydia spp [] and Photorhabdus luminescens (Xenorhabdus luminescens) []. The MACPF domain is commonly found to be associated with other N- and C-terminal domains, such as TSP1 (see PDOC50092 from PROSITEDOC), LDLRA (see PDOC00929 from PROSITEDOC), EGF-like (see PDOC00021 from PROSITEDOC),Sushi/CCP/SCR (see PDOC50923 from PROSITEDOC), FIMAC or C2 (see PDOC00380 from PROSITEDOC). They probably control or target MACPF function [, ]. The MACPF domain oligomerizes, undergoes conformational change, and is required for lytic activity. The MACPF domain consists of a central kinked four-stranded antiparallel beta sheet surrounded by alpha helices and beta strands, forming two structural segments. Overall, the MACPF domain has a thin L-shaped appearance. MACPF domains exhibit limited sequence similarity but contain a signature [YW]-G-[TS]-H-[FY]-x(6)-G-G motif [, , ]. Some proteins known to contain a MACPF domain are listed below: Vertebrate complement proteins C6 to C9. Complement factors C6 to C9 assemble to form a scaffold, the membrane attack complex (MAC), that permits C9 polymerisation into pores that lyse Gram-negative pathogens [, ]. Vertebrate perforin. It is delivered by natural killer cells and cytotoxic T lymphocytes and forms oligomeric pores (12 to 18 monomers) in the plasma membrane of either virus-infected or transformed cells. Arabidopsis thaliana (Mouse-ear cress) constitutively activated cell death 1 (CAD1) protein. It is likely to act as a mediator that recognises plant signals for pathogen infection []. Arabidopsis thaliana (Mouse-ear cress) necrotic spotted lesions 1 (NSL1) protein []. Venomous sea anemone Phyllodiscus semoni (Night anemone) toxins PsTX-60A and PsTX-60B []. Venomous sea anemone Actineria villosa (Okinawan sea anemone) toxin AvTX-60A []. Plasmodium sporozoite microneme protein essential for cell traversal 2 (SPECT2). It is essential for the membrane-wounding activity of the sporozoite and is involved in its traversal of the sinusoidal cell layer prior to hepatocyte-infection []. P. luminescens Plu-MACPF. Although nonlytic, it was shown to bind to cell membranes []. Chlamydial putative uncharacterised protein CT153 []. ; PDB: 2QP2_A 3OJY_B 3NSJ_A 4E0S_B 3T5O_A 4A5W_B 2QQH_A 2RD7_A.
Probab=25.84 E-value=42 Score=26.39 Aligned_cols=26 Identities=15% Similarity=0.296 Sum_probs=16.9
Q ss_pred EcCCCcCCcHHH---HHHHHHhCCCeeEE
Q psy2347 14 VGGLTETVKKED---LELEFEKFGKLNSV 39 (207)
Q Consensus 14 V~nLp~~~te~~---L~~~F~~~G~V~~v 39 (207)
|..||...+.++ +.+||..||.-.-+
T Consensus 53 l~~L~~~~~~~~~~~y~~f~~~yGTH~v~ 81 (212)
T PF01823_consen 53 LNALPAEYNSDNTDEYYRFFDKYGTHYVT 81 (212)
T ss_dssp HHTSHSS--HHHHHHHHHHHHHH-SEEEE
T ss_pred HHhhCcccCccchHHHHHHHHHhCcEEEE
Confidence 456788877777 78899999984333
No 332
>COG1098 VacB Predicted RNA binding protein (contains ribosomal protein S1 domain) [Translation, ribosomal structure and biogenesis]
Probab=25.73 E-value=1.5e+02 Score=22.30 Aligned_cols=48 Identities=21% Similarity=0.221 Sum_probs=28.0
Q ss_pred CCCeeEEEEeeCCCcEEEEEEcC--------HHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 33 FGKLNSVWVAFNPPGFAFVEFSN--------QIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 33 ~G~V~~v~i~~d~rG~afV~F~~--------~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
.|.+..-+|..-..==|||+|+. .|.|..-++.+|.+.-.+..|.|++
T Consensus 5 vG~~l~GkItgI~~yGAFV~l~~g~tGLVHISEIa~~fVkdI~d~L~vG~eV~vKV 60 (129)
T COG1098 5 VGSKLKGKITGITPYGAFVELEGGKTGLVHISEIADGFVKDIHDHLKVGQEVKVKV 60 (129)
T ss_pred ccceEEEEEEeeEecceEEEecCCCcceEEehHhhhhhHHhHHHHhcCCCEEEEEE
Confidence 34455444432212248888887 4666777777776655566666654
No 333
>TIGR00854 pts-sorbose PTS system, mannose/fructose/sorbose family, IIB component. Bacterial PTS transporters transport and concomitantly phosphorylate their sugar substrates, and typically consist of multiple subunits or protein domains.The Man family is unique in several respects among PTS permease families.It is the only PTS family in which members possess a IID protein. It is the only PTS family in which the IIB constituent is phosphorylated on a histidyl rather than a cysteyl residue. Its permease members exhibit broad specificity for a range of sugars, rather than being specific for just one or a few sugars. The mannose permease of E. coli, for example, can transport and phosphorylate glucose, mannose, fructose, glucosamine, N-acetylglucosamine, and other sugars. Other members of this can transport sorbose, fructose and N-acetylglucosamine. This family is specific for the IIB components of this family of PTS transporters.
Probab=25.70 E-value=2.4e+02 Score=21.61 Aligned_cols=19 Identities=21% Similarity=0.246 Sum_probs=16.7
Q ss_pred CcEEEEEEcCHHHHHHHHH
Q psy2347 46 PGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~ 64 (207)
+--.||.|+++++|.++++
T Consensus 75 ~~~v~vl~k~~~da~~l~~ 93 (151)
T TIGR00854 75 DQTIFLLFRNPQDVLTLVE 93 (151)
T ss_pred CceEEEEECCHHHHHHHHH
Confidence 3468999999999999998
No 334
>PRK13022 secF preprotein translocase subunit SecF; Reviewed
Probab=25.49 E-value=2.2e+02 Score=24.23 Aligned_cols=51 Identities=14% Similarity=0.165 Sum_probs=30.1
Q ss_pred CCCcCCcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEc--CHHHHHHHHHhcC
Q psy2347 16 GLTETVKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFS--NQIDAEAACDSMN 67 (207)
Q Consensus 16 nLp~~~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~--~~e~A~~Al~~l~ 67 (207)
.++.+++.+++++.+++.+ +.++.+... ......|.+. +.+++++..+.|+
T Consensus 48 ~~~~~~~~~~v~~~~~~~~-~~~~~v~~~~~~~~~~i~~~~~~~~~~~~~~~~l~ 101 (289)
T PRK13022 48 RFEQPADLEQVREALEKAG-FEDAQVQNFGSSRDVLIRLPPASEELSEKVKKALN 101 (289)
T ss_pred EcCCCCCHHHHHHHHHhcC-CCCceEEEcCCCCEEEEEeCCCChHHHHHHHHHHH
Confidence 3456789999999999874 222222221 1224556666 5566666555554
No 335
>PF04127 DFP: DNA / pantothenate metabolism flavoprotein; InterPro: IPR007085 This entry represents the C-terminal domain found in DNA/pantothenate metabolism flavoproteins, which affects synthesis of DNA and pantothenate metabolism. These proteins contain ATP, phosphopantothenate, and cysteine binding sites. The structure of this domain has been determined in human phosphopantothenoylcysteine (PPC) synthetase [] and as the PPC synthase domain (CoaB) from the Escherichia coli coenzyme A bifunctional protein CoaBC []. This domain adopts a 3-layer alpha/beta/alpha fold with mixed beta-sheets, which topologically resembles a combination of Rossmann-like and ribokinase-like folds. The structure of these proteins predicts a ping pong mechanism with initial formation of an acyladenylate intermediate, followed by release of pyrophosphate and attack by cysteine to form the final products PPC and AMP. ; PDB: 1U7W_A 1U7U_A 1U80_C 1U7Z_A 1P9O_B 2GK4_A.
Probab=25.22 E-value=2.3e+02 Score=22.39 Aligned_cols=56 Identities=13% Similarity=0.163 Sum_probs=36.2
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCe-eEEEEeeC---CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKL-NSVWVAFN---PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V-~~v~i~~d---~rG~afV~F~~~e~A~~Al~~l 66 (207)
.=||.|.+.-.+-..|-+.|...|.- +-|.=... +..+-.|++++.++..+++..+
T Consensus 20 VR~ItN~SSG~~G~~lA~~~~~~Ga~V~li~g~~~~~~p~~~~~i~v~sa~em~~~~~~~ 79 (185)
T PF04127_consen 20 VRFITNRSSGKMGAALAEEAARRGAEVTLIHGPSSLPPPPGVKVIRVESAEEMLEAVKEL 79 (185)
T ss_dssp SEEEEES--SHHHHHHHHHHHHTT-EEEEEE-TTS----TTEEEEE-SSHHHHHHHHHHH
T ss_pred ceEecCCCcCHHHHHHHHHHHHCCCEEEEEecCccccccccceEEEecchhhhhhhhccc
Confidence 45788888888888888888887752 22221111 4578899999999988888753
No 336
>PF11080 DUF2622: Protein of unknown function (DUF2622); InterPro: IPR022597 This family is conserved in the Enterobacteriaceae family. The function is not known.
Probab=25.20 E-value=2.6e+02 Score=19.94 Aligned_cols=69 Identities=10% Similarity=0.142 Sum_probs=38.8
Q ss_pred EEEcCCCcCCcH-HHHHHHHHhCCCeeEEEEee----C--CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 12 VYVGGLTETVKK-EDLELEFEKFGKLNSVWVAF----N--PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 12 lfV~nLp~~~te-~~L~~~F~~~G~V~~v~i~~----d--~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
|-|.-.+...++ .+|...+..-+-...|.-.. . ..-|++|.--+++++++-++.| ....-++...|.+.
T Consensus 11 Vt~~~~e~~l~d~~~L~~~lt~~GF~~tl~D~~G~~HeLgtntfgl~S~l~~~eV~~la~~l-ae~algk~p~V~V~ 86 (96)
T PF11080_consen 11 VTFEYQEAGLTDINELNNHLTRAGFSTTLTDEDGNPHELGTNTFGLISALSAEEVAQLARGL-AESALGKTPEVEVT 86 (96)
T ss_pred EEEEeccCChHHHHHHHHHHHhcCceeEEecCCCCEeecCCCeEEEEecCCHHHHHHHHHHH-hhhhcCCCCceEEE
Confidence 333333333333 56777777665544332111 0 4579999999988888777664 33444555444443
No 337
>PF14268 YoaP: YoaP-like
Probab=25.14 E-value=51 Score=19.98 Aligned_cols=35 Identities=17% Similarity=0.177 Sum_probs=24.1
Q ss_pred EEEEEEcCHHHHHHHHHhcCCce--eCCcEEEEEEee
Q psy2347 48 FAFVEFSNQIDAEAACDSMNDQD--LMGSKLRVEISR 82 (207)
Q Consensus 48 ~afV~F~~~e~A~~Al~~l~g~~--i~g~~l~V~~a~ 82 (207)
+-+|..++.|+|+.|-.-.+... .+++-|++++-.
T Consensus 2 ~~~i~i~t~e~Aq~~P~pft~yalFYnGkfiT~eils 38 (44)
T PF14268_consen 2 FKLIKIDTLEKAQNAPCPFTTYALFYNGKFITNEILS 38 (44)
T ss_pred cEEEEeccHHHHhcCCCceeEEEEEECCEEEEeeccC
Confidence 34688889999888766555433 377888877643
No 338
>cd04882 ACT_Bt0572_2 C-terminal ACT domain of a novel protein composed of just two ACT domains. Included in this CD is the C-terminal ACT domain of a novel protein composed of just two ACT domains, as seen in the yet uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related proteins. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=25.07 E-value=1.7e+02 Score=17.80 Aligned_cols=42 Identities=10% Similarity=0.101 Sum_probs=23.7
Q ss_pred HHHHHHHHhCC-CeeEEEEeeCC-CcEE--EEEEcCHHHHHHHHHh
Q psy2347 24 EDLELEFEKFG-KLNSVWVAFNP-PGFA--FVEFSNQIDAEAACDS 65 (207)
Q Consensus 24 ~~L~~~F~~~G-~V~~v~i~~d~-rG~a--fV~F~~~e~A~~Al~~ 65 (207)
.+|.++|.+++ .|..+...... ...+ .|+.++.+.+.+++++
T Consensus 14 ~~i~~~l~~~~~nI~~i~~~~~~~~~~~~v~~~ve~~~~~~~~L~~ 59 (65)
T cd04882 14 HEILQILSEEGINIEYMYAFVEKKGGKALLIFRTEDIEKAIEVLQE 59 (65)
T ss_pred HHHHHHHHHCCCChhheEEEccCCCCeEEEEEEeCCHHHHHHHHHH
Confidence 45667777765 35555443331 2334 4555677777777765
No 339
>PRK13817 ribosome-binding factor A; Provisional
Probab=24.87 E-value=66 Score=23.69 Aligned_cols=21 Identities=14% Similarity=0.104 Sum_probs=12.4
Q ss_pred CCeeEEEEeeC-CCcEEEEEEc
Q psy2347 34 GKLNSVWVAFN-PPGFAFVEFS 54 (207)
Q Consensus 34 G~V~~v~i~~d-~rG~afV~F~ 54 (207)
-.|+.|+|..| .-..+||.+-
T Consensus 32 vtVt~V~vS~Dl~~AkVyvs~~ 53 (119)
T PRK13817 32 ISLTAVSISPDLKQAKVFYSLL 53 (119)
T ss_pred eEEeEEEECCCCCEEEEEEEEC
Confidence 34777888777 3334455553
No 340
>cd06404 PB1_aPKC PB1 domain is an essential modular domain of the atypical protein kinase C (aPKC) which in complex with Par6 and Par3 proteins is crucial for establishment of apical-basal polarity of animal cells. PB1 domain is a modular domain mediating specific protein-protein interaction which play roles in many critical cell processes. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi,
Probab=24.85 E-value=2.4e+02 Score=19.51 Aligned_cols=55 Identities=13% Similarity=0.061 Sum_probs=31.9
Q ss_pred EEEEcCCCcCCcHHHHHHHHHh-CC--C--eeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEK-FG--K--LNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~-~G--~--V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
-|+|..++..++-++|.+.... |- + .-.++.+.+ . -=-|+|++.++.+.|++.+.
T Consensus 10 di~it~~d~~~s~e~L~~~v~~~c~~~~~q~ft~kw~DE-E-GDp~tiSS~~EL~EA~rl~~ 69 (83)
T cd06404 10 DIMITSIDPSISLEELCNEVRDMCRFHNDQPFTLKWIDE-E-GDPCTISSQMELEEAFRLYE 69 (83)
T ss_pred cEEEEEcCCCcCHHHHHHHHHHHhCCCCCCcEEEEEECC-C-CCceeecCHHHHHHHHHHHH
Confidence 4677778888887666554432 11 1 112333322 1 12478899999999987543
No 341
>PF05711 TylF: Macrocin-O-methyltransferase (TylF); InterPro: IPR008884 This family consists of bacterial macrocin O-methyltransferase (TylF) proteins. TylF is responsible for the methylation of macrocin to produce tylosin. Tylosin is a macrolide antibiotic used in veterinary medicine to treat infections caused by Gram-positive bacteria and as an animal growth promoter in the Sus scrofa (Pig) industry. It is produced by several Streptomyces species. As with other macrolides, the antibiotic activity of tylosin is due to the inhibition of protein biosynthesis by a mechanism that involves the binding of tylosin to the ribosome, preventing the formation of the mRNA-aminoacyl-tRNA-ribosome complex [].; PDB: 3TOS_D 2WK1_A.
Probab=24.80 E-value=49 Score=27.73 Aligned_cols=58 Identities=16% Similarity=0.182 Sum_probs=35.3
Q ss_pred CcHHHHHHHHHhCCCee-EEEEeeC----------CCcEE--EEEEcCHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 21 VKKEDLELEFEKFGKLN-SVWVAFN----------PPGFA--FVEFSNQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~-~v~i~~d----------~rG~a--fV~F~~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
++++++++.|++|+.+. +|++++- ...+| .|..+-.++...||+.|-.+...+-.|.+
T Consensus 140 ~s~e~V~~n~~~~gl~~~~v~~vkG~F~dTLp~~p~~~IAll~lD~DlYesT~~aLe~lyprl~~GGiIi~ 210 (248)
T PF05711_consen 140 VSLEEVRENFARYGLLDDNVRFVKGWFPDTLPDAPIERIALLHLDCDLYESTKDALEFLYPRLSPGGIIIF 210 (248)
T ss_dssp HHHHHHHHCCCCTTTSSTTEEEEES-HHHHCCC-TT--EEEEEE---SHHHHHHHHHHHGGGEEEEEEEEE
T ss_pred cCHHHHHHHHHHcCCCcccEEEECCcchhhhccCCCccEEEEEEeccchHHHHHHHHHHHhhcCCCeEEEE
Confidence 46778888888887432 3555443 22344 45556788888999887777666655544
No 342
>PHA03008 hypothetical protein; Provisional
Probab=24.61 E-value=1.1e+02 Score=24.99 Aligned_cols=37 Identities=14% Similarity=0.296 Sum_probs=30.1
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEee
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAF 43 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~ 43 (207)
+....+||+|+..--+...|+-+|.+|-.+.+|-++.
T Consensus 19 ~~~d~~~~snit~~h~~n~i~~ff~~~d~~~~~ifvp 55 (234)
T PHA03008 19 EICDIAFISNITHIHDHNIIKIFFDKFDDFDEIIFVP 55 (234)
T ss_pred ccccEEEEecccccccccHHHHHHhhccccceEEEcc
Confidence 3445789999998888889999999999888876554
No 343
>KOG0862|consensus
Probab=24.39 E-value=61 Score=26.57 Aligned_cols=12 Identities=33% Similarity=0.681 Sum_probs=10.0
Q ss_pred CCcEEEEEEcCH
Q psy2347 45 PPGFAFVEFSNQ 56 (207)
Q Consensus 45 ~rG~afV~F~~~ 56 (207)
.|.||||+|++-
T Consensus 108 ~RPY~FieFD~~ 119 (216)
T KOG0862|consen 108 SRPYAFIEFDTF 119 (216)
T ss_pred CCCeeEEehhHH
Confidence 578999999863
No 344
>PRK10773 murF UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase; Reviewed
Probab=24.37 E-value=2.8e+02 Score=25.00 Aligned_cols=57 Identities=11% Similarity=0.190 Sum_probs=35.8
Q ss_pred CCcHHHHHHHHHhCCCee-EEEEeeCCCcEEEEE--Ec-CHHHHHHHHHhcCCceeCCcEEEE
Q psy2347 20 TVKKEDLELEFEKFGKLN-SVWVAFNPPGFAFVE--FS-NQIDAEAACDSMNDQDLMGSKLRV 78 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~-~v~i~~d~rG~afV~--F~-~~e~A~~Al~~l~g~~i~g~~l~V 78 (207)
.++.++|.+-++.|-.+. +++++....+..+|. |. +++++++||+.|... .+++|.|
T Consensus 298 Gi~~~~i~~~L~~~~~~~gR~e~v~~~~g~~iIDDsYn~nP~s~~aaL~~l~~~--~~r~i~V 358 (453)
T PRK10773 298 GATLDAVKAGLANLKAVPGRLFPIQLAEGQLLLDDSYNANVGSMTAAAQVLAEM--PGYRVMV 358 (453)
T ss_pred CCCHHHHHHHHHhCCCCCCceeEEECCCCeEEEEcCCCCCHHHHHHHHHHHHhC--CCCEEEE
Confidence 456778888888875443 344443223456665 55 588999999988653 4555544
No 345
>cd06301 PBP1_rhizopine_binding_like Periplasmic binding proteins specific to rhizopines. Periplasmic binding proteins specific to rhizopines, which are simple sugar-like compounds produced in the nodules induced by the symbiotic root nodule bacteria, such as Rhizobium and Sinorhizobium. Rhizopine-binding-like proteins from other bacteria are also included. Two inositol based rhizopine compounds are known to date: L-3-O-methly-scyllo-inosamine (3-O-MSI) and scyllo-inosamine. Bacterial strains that can metabolize rhizopine have a greater competitive advantage in nodulation and rhizopine synthesis is regulated by NifA/NtrA regulatory transcription activators which are maximally expressed at the onset of nitrogen fixation in bacteroids. The members of this group belong to the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily.
Probab=24.34 E-value=2e+02 Score=22.97 Aligned_cols=22 Identities=14% Similarity=0.277 Sum_probs=17.2
Q ss_pred CcEEEEEEcCHHHHHHHHHhcC
Q psy2347 46 PGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~~l~ 67 (207)
..+.+|.+++.+.+..|.+.|-
T Consensus 96 ~~~~~V~~d~~~~g~~~~~~l~ 117 (272)
T cd06301 96 KGVAYVGSDEVVAGRLQAEYVA 117 (272)
T ss_pred CeeEEEecChHHHHHHHHHHHH
Confidence 4678999999888887776654
No 346
>COG4733 Phage-related protein, tail component [Function unknown]
Probab=24.32 E-value=95 Score=30.71 Aligned_cols=71 Identities=8% Similarity=0.086 Sum_probs=47.6
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHH--HHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 6 MERGTRVYVGGLTETVKKEDLEL--EFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~--~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
.+...+|-|..|..+.+.+.|.+ +.+.+.+|.++++...+.-..+|+|+.++= .. +- .-...+.++.|+|-
T Consensus 172 ~~r~~~vR~~R~T~ds~s~~l~n~t~~~s~tEiID~k~~YPNtAlvgl~vdaeQF-gs-~~-t~~y~~rGrii~VP 244 (952)
T COG4733 172 PARPFNIRMRRETADSTSDQLQNKTLWSSYTEIIDVKQCYPNTALVGLQVDAEQF-GG-QP-TVNYHIRGRIIQVP 244 (952)
T ss_pred CCCcceEEEEEecCCCchhhhhcchhhhhhhheeeccccCCCceEEEEEEcHHHh-CC-cc-eeEEeecceEEeCC
Confidence 34446777777777777777754 567899999998877666677888875442 22 21 13456788888874
No 347
>PRK14173 bifunctional 5,10-methylene-tetrahydrofolate dehydrogenase/ 5,10-methylene-tetrahydrofolate cyclohydrolase; Provisional
Probab=24.31 E-value=2.5e+02 Score=24.12 Aligned_cols=24 Identities=25% Similarity=0.281 Sum_probs=17.8
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFG 34 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G 34 (207)
...+-.||+++|+++|.+++.+..
T Consensus 60 ~~~~~~l~~~~~~~el~~~I~~lN 83 (287)
T PRK14173 60 RSQVEVLPESTSQEELLELIARLN 83 (287)
T ss_pred EEEEEECCCCCCHHHHHHHHHHHh
Confidence 445557888888888888877653
No 348
>PF04865 Baseplate_J: Baseplate J-like protein; InterPro: IPR006949 The temperate bacteriophage P2 has four defined tail genes: V, J, W and I. Their order is the late gene promoter, VWJI, followed by the tail fibre genes H and G and then a transcription terminator. BAP V protein is the small spike at the tip of the tail and basal plate assembly protein J lies at the edge of the baseplate []. This family also includes a number of bacterial homologues, which are thought to have been horizontally transferred.
Probab=24.27 E-value=3.6e+02 Score=21.72 Aligned_cols=31 Identities=16% Similarity=0.003 Sum_probs=22.4
Q ss_pred EcCCCcCCcHHHHHHHHHhC-CCeeEEEEeeC
Q psy2347 14 VGGLTETVKKEDLELEFEKF-GKLNSVWVAFN 44 (207)
Q Consensus 14 V~nLp~~~te~~L~~~F~~~-G~V~~v~i~~d 44 (207)
+..++...+.++++.+..+. -.|..+++...
T Consensus 112 ~~~~~~~gt~~dy~~~a~~~~~gV~~v~v~~~ 143 (243)
T PF04865_consen 112 LRSPSRAGTADDYEYWALSVSPGVADVKVYPN 143 (243)
T ss_pred hcccccCCCHHHHHHHHHHhCCCceEEEEecC
Confidence 34455567788888777776 77888888776
No 349
>PF12419 DUF3670: SNF2 Helicase protein ; InterPro: IPR022138 This domain family is found in bacteria, archaea and eukaryotes, and is approximately 140 amino acids in length. The family is found in association with PF00271 from PFAM, PF00176 from PFAM. Most of the proteins in this family are annotated as SNF2 helicases but there is little accompanying literature to confirm this.
Probab=24.21 E-value=2.5e+02 Score=21.05 Aligned_cols=50 Identities=14% Similarity=0.143 Sum_probs=32.6
Q ss_pred CcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 9 GTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 9 ~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.-+|-|++ ..+|++|++++.++-.+++.++ --+..| |+++++++++.+..
T Consensus 74 ~W~lalGd--~~Ls~eEf~~L~~~~~~LV~~r-----g~WV~l---d~~~l~~~~~~~~~ 123 (141)
T PF12419_consen 74 DWELALGD--EELSEEEFEQLVEQKRPLVRFR-----GRWVEL---DPEELRRALAFLEK 123 (141)
T ss_pred eEEEEECC--EECCHHHHHHHHHcCCCeEEEC-----CEEEEE---CHHHHHHHHHHHHh
Confidence 34555554 5689999999999887777662 113333 56777777766554
No 350
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=24.21 E-value=78 Score=18.07 Aligned_cols=18 Identities=22% Similarity=0.139 Sum_probs=14.6
Q ss_pred CCcHHHHHHHHHhCCCee
Q psy2347 20 TVKKEDLELEFEKFGKLN 37 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~ 37 (207)
..++++|++.+.+++.+.
T Consensus 3 tWs~~~L~~wL~~~gi~~ 20 (38)
T PF10281_consen 3 TWSDSDLKSWLKSHGIPV 20 (38)
T ss_pred CCCHHHHHHHHHHcCCCC
Confidence 468899999999988543
No 351
>PF15407 Spo7_2_N: Sporulation protein family 7
Probab=24.17 E-value=15 Score=24.35 Aligned_cols=27 Identities=15% Similarity=0.289 Sum_probs=18.7
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHH
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFE 31 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~ 31 (207)
+...+++|||+.||..+-++.=+.++.
T Consensus 23 Ls~tSr~vflG~IP~~W~~~~~~~~~k 49 (67)
T PF15407_consen 23 LSLTSRRVFLGPIPEIWLQDHRKSWYK 49 (67)
T ss_pred HHHcCceEEECCCChHHHHcCcchHHH
Confidence 445688999999998766554444443
No 352
>PRK14176 bifunctional 5,10-methylene-tetrahydrofolate dehydrogenase/ 5,10-methylene-tetrahydrofolate cyclohydrolase; Provisional
Probab=23.99 E-value=2.5e+02 Score=24.06 Aligned_cols=24 Identities=17% Similarity=0.097 Sum_probs=17.2
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFG 34 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G 34 (207)
...+-.||.++++++|.+.+.+..
T Consensus 69 ~~~~~~l~~~~~~~el~~~I~~LN 92 (287)
T PRK14176 69 RAEDQFLPADTTQEELLELIDSLN 92 (287)
T ss_pred EEEEEECCCCCCHHHHHHHHHHHh
Confidence 444557788888888888777653
No 353
>PRK10792 bifunctional 5,10-methylene-tetrahydrofolate dehydrogenase/ 5,10-methylene-tetrahydrofolate cyclohydrolase; Provisional
Probab=23.96 E-value=2.4e+02 Score=24.12 Aligned_cols=24 Identities=17% Similarity=0.127 Sum_probs=16.7
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFG 34 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G 34 (207)
...+-.||.++|+++|.+.+.+..
T Consensus 64 ~~~~~~l~~~~s~~el~~~I~~lN 87 (285)
T PRK10792 64 VSRSYDLPETTSEAELLALIDELN 87 (285)
T ss_pred EEEEEECCCCCCHHHHHHHHHHHh
Confidence 344456788888888888777653
No 354
>PRK00059 prsA peptidylprolyl isomerase; Provisional
Probab=23.93 E-value=91 Score=26.76 Aligned_cols=46 Identities=11% Similarity=0.068 Sum_probs=28.6
Q ss_pred CCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 20 TVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
.+|+++|+++|...-.- ........-+..|.|.+++.|+++++.+.
T Consensus 173 ~vsd~ei~~~y~~~~~~--~~~~p~~~~v~~I~~~~~~~a~~~~~~l~ 218 (336)
T PRK00059 173 KVTDKDAQKYYNENKSK--FTEKPNTMHLAHILVKTEDEAKKVKKRLD 218 (336)
T ss_pred CCCHHHHHHHHHHhhhh--hcCCcceEEEEEEEecCHHHHHHHHHHHH
Confidence 57999999998764210 00000011245678888888888887764
No 355
>COG0217 Uncharacterized conserved protein [Function unknown]
Probab=23.80 E-value=2.8e+02 Score=23.18 Aligned_cols=54 Identities=28% Similarity=0.367 Sum_probs=32.4
Q ss_pred CcEEEEcCCCcC--CcHHHHHHHHHhCCC-e---eEEEEeeCCCcEEEEEEc--CHHHHHHH
Q psy2347 9 GTRVYVGGLTET--VKKEDLELEFEKFGK-L---NSVWVAFNPPGFAFVEFS--NQIDAEAA 62 (207)
Q Consensus 9 ~~~lfV~nLp~~--~te~~L~~~F~~~G~-V---~~v~i~~d~rG~afV~F~--~~e~A~~A 62 (207)
...|.|--|..+ -|-.+|+..|.+.|- + -+|.++.+.+++..|.=. +++++..+
T Consensus 94 GvaiiVe~LTDN~NRTas~vR~~F~K~GG~lg~~GSV~~mF~~kGvi~~~~~~~~ed~l~e~ 155 (241)
T COG0217 94 GVAIIVEALTDNRNRTASNVRSAFNKNGGNLGEPGSVSYMFDRKGVIVVEKNEIDEDELLEA 155 (241)
T ss_pred ceEEEEEeccCCcchhHHHHHHHHHhcCCccCCCceEEEEEeccEEEEECCCCCCHHHHHHH
Confidence 456777777554 466889999998764 2 246666665554444433 44444433
No 356
>PTZ00136 eukaryotic translation initiation factor 6-like protein; Provisional
Probab=23.79 E-value=2.2e+02 Score=23.88 Aligned_cols=49 Identities=10% Similarity=0.070 Sum_probs=27.4
Q ss_pred CCcCCcHHHHHHHHHhC---CCeeEEEEeeC--------CCcEEEEEEcCHHHHHHHHHh
Q psy2347 17 LTETVKKEDLELEFEKF---GKLNSVWVAFN--------PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 17 Lp~~~te~~L~~~F~~~---G~V~~v~i~~d--------~rG~afV~F~~~e~A~~Al~~ 65 (207)
||..++++||+.+=... -.|..+.-... .--+|+|.-+-.++..+.++.
T Consensus 72 vp~~~~d~El~~l~~~l~d~v~V~~l~~~~saiGn~i~~ND~~alV~p~l~~~~~e~I~d 131 (247)
T PTZ00136 72 VPSICTDQELQHLRNSLPDSVKVQRVEERLSALGNCIACNDYVALIHPDLDRETEEIIQD 131 (247)
T ss_pred cCCcCCHHHHHHHHHhCcCCccEEEeCCccccceeEEEEcCCEEEECCCCCHHHHHHHHH
Confidence 58889999998653333 12222211000 346889877755655555543
No 357
>PRK13883 conjugal transfer protein TrbH; Provisional
Probab=23.77 E-value=1.8e+02 Score=22.52 Aligned_cols=38 Identities=8% Similarity=0.133 Sum_probs=25.0
Q ss_pred CcHHHHHHHHHhCCCee-EEEEeeC--------------CCcEEEEEEcCHHH
Q psy2347 21 VKKEDLELEFEKFGKLN-SVWVAFN--------------PPGFAFVEFSNQID 58 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~-~v~i~~d--------------~rG~afV~F~~~e~ 58 (207)
+..|.++++-+.|.+-+ .++|..+ .++|+.||+.+...
T Consensus 36 iA~D~v~qL~~~ypPA~Tt~~l~q~~~D~Fg~aL~~aLR~~GYaV~e~~~~~~ 88 (151)
T PRK13883 36 LATDAVQQLATLYPPAQTRFELQQPTPDAFGQALVKALRDKGYALLEYNPAGA 88 (151)
T ss_pred HHHHHHHHHHHhCCCcceEEEEecCCCcHHHHHHHHHHHHcCeEEEecCCccc
Confidence 34556667777777644 3444443 57999999988653
No 358
>PHA00742 hypothetical protein
Probab=23.61 E-value=50 Score=26.18 Aligned_cols=26 Identities=12% Similarity=0.206 Sum_probs=20.3
Q ss_pred cEEEEEEcCHHHHHHHHHhcCCceeC
Q psy2347 47 GFAFVEFSNQIDAEAACDSMNDQDLM 72 (207)
Q Consensus 47 G~afV~F~~~e~A~~Al~~l~g~~i~ 72 (207)
..-||.+.|.|+-.+||.+|+-..+.
T Consensus 133 Rv~FihynD~EDKlk~isALqilNL~ 158 (211)
T PHA00742 133 RVWFIHYNDTEDKLKCISALQILNLE 158 (211)
T ss_pred eEEEEEecChhHhhhhhHHHHHhhhh
Confidence 36799999999999998887654443
No 359
>CHL00100 ilvH acetohydroxyacid synthase small subunit
Probab=23.55 E-value=3.7e+02 Score=21.21 Aligned_cols=59 Identities=12% Similarity=0.130 Sum_probs=39.9
Q ss_pred HHHHHHHHHhCCCeeEEEEeeC----CCcEEEEEEc----CHHHHHHHHHhcCCcee--CCcEEEEEEe
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFN----PPGFAFVEFS----NQIDAEAACDSMNDQDL--MGSKLRVEIS 81 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d----~rG~afV~F~----~~e~A~~Al~~l~g~~i--~g~~l~V~~a 81 (207)
-++|.+.|.+.-+|.+|..+.+ .+-.++|... +..++.+.++.++.+.+ .-..+.|++.
T Consensus 57 ieqL~kQL~KLidVl~V~~~~~~~~v~rEl~LiKv~~~~~~r~ei~~~~~~f~a~ivdv~~~~~~ie~t 125 (174)
T CHL00100 57 IEQLTKQLYKLVNILKVQDITNIPCVERELMLIKINVNSQTRPEILEIAQIFRAKVVDLSEESLILEVT 125 (174)
T ss_pred HHHHHHHHHHHhHhhEEEecCCccceeeEEEEEEEecCCcCHHHHHHHHHHhCCEEEEecCCEEEEEEc
Confidence 5678888888888888888766 5667887775 34556666666666543 3445556554
No 360
>TIGR00755 ksgA dimethyladenosine transferase. Alternate name: S-adenosylmethionine--6-N',N'-adenosyl (rRNA) dimethyltransferase
Probab=23.51 E-value=97 Score=25.48 Aligned_cols=23 Identities=17% Similarity=-0.013 Sum_probs=19.5
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKF 33 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~ 33 (207)
.+.|+|||..++.+.|.++++..
T Consensus 96 ~~vvsNlPy~i~~~il~~ll~~~ 118 (253)
T TIGR00755 96 LKVVSNLPYNISSPLIFKLLEKP 118 (253)
T ss_pred ceEEEcCChhhHHHHHHHHhccC
Confidence 37799999999999999998643
No 361
>cd06314 PBP1_tmGBP Periplasmic sugar-binding domain of Thermotoga maritima glucose-binding protein (tmGBP) and its close homologs. Periplasmic sugar-binding domain of Thermotoga maritima glucose-binding protein (tmGBP) and its close homologs from other bacteria. They are a member of the type I periplasmic binding protein superfamily which consists of two domains connected by a three-stranded hinge. TmGBP is specific for glucose and its binding pocket is buried at the interface of the two domains. TmGBP also exhibits high thermostability and the highest structural similarity to E. coli glucose binding protein (ecGBP).
Probab=23.45 E-value=2.8e+02 Score=22.21 Aligned_cols=21 Identities=14% Similarity=0.262 Sum_probs=16.9
Q ss_pred CcEEEEEEcCHHHHHHHHHhc
Q psy2347 46 PGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~~l 66 (207)
..+.+|.+++.+.+..|++.|
T Consensus 93 ~~~~~V~~D~~~~g~~a~~~l 113 (271)
T cd06314 93 GRYVYIGTDNYAAGRTAGEIM 113 (271)
T ss_pred ceeEEEccChHHHHHHHHHHH
Confidence 347899999998888888765
No 362
>PF00719 Pyrophosphatase: Inorganic pyrophosphatase; InterPro: IPR008162 Inorganic pyrophosphatase (3.6.1.1 from EC) (PPase) [, ] is the enzyme responsible for the hydrolysis of pyrophosphate (PPi) which is formed principally as the product of the many biosynthetic reactions that utilise ATP. All known PPases require the presence of divalent metal cations, with magnesium conferring the highest activity. Among other residues, a lysine has been postulated to be part of or close to the active site. PPases have been sequenced from bacteria such as Escherichia coli (homohexamer), Bacillus PS3 (Thermophilic bacterium PS-3) and Thermus thermophilus, from the archaebacteria Thermoplasma acidophilum, from fungi (homodimer), from a plant, and from bovine retina. In yeast, a mitochondrial isoform of PPase has been characterised which seems to be involved in energy production and whose activity is stimulated by uncouplers of ATP synthesis. The sequences of PPases share some regions of similarities, among which is a region that contains three conserved aspartates that are involved in the binding of cations.; GO: 0000287 magnesium ion binding, 0004427 inorganic diphosphatase activity, 0006796 phosphate-containing compound metabolic process, 0005737 cytoplasm; PDB: 2UXS_A 1WCF_A 1SXV_A 3I4Q_A 2PRD_A 2IHP_B 1PYP_A 2IK7_A 2IK4_A 1E9G_A ....
Probab=23.42 E-value=1.3e+02 Score=23.32 Aligned_cols=38 Identities=11% Similarity=0.151 Sum_probs=27.1
Q ss_pred HHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l 66 (207)
-++|+++|..|-.++ ..+-+..-.|.+.+.|.+.|+..
T Consensus 113 ~~~i~~fF~~YK~l~------~~k~~~~~~~~~~~~A~~~i~~~ 150 (156)
T PF00719_consen 113 LDEIEHFFRNYKDLE------ENKWVEVGGWEDAEEALKVIKEA 150 (156)
T ss_dssp HHHHHHHHHHTTTTS------TTEEEEEEEEEEHHHHHHHHHHH
T ss_pred HHHHHHHHHHhcCcC------CCCeEEeCCCcCHHHHHHHHHHH
Confidence 467889999997663 12334456788999998888753
No 363
>KOG0738|consensus
Probab=23.35 E-value=1.9e+02 Score=26.42 Aligned_cols=23 Identities=13% Similarity=0.004 Sum_probs=17.5
Q ss_pred CCCcEEEE---cCCCcCCcHHHHHHH
Q psy2347 7 ERGTRVYV---GGLTETVKKEDLELE 29 (207)
Q Consensus 7 ~~~~~lfV---~nLp~~~te~~L~~~ 29 (207)
+..+.||| .||||+++|+.++.|
T Consensus 348 e~~k~VmVLAATN~PWdiDEAlrRRl 373 (491)
T KOG0738|consen 348 ENSKVVMVLAATNFPWDIDEALRRRL 373 (491)
T ss_pred ccceeEEEEeccCCCcchHHHHHHHH
Confidence 34567777 699999998877764
No 364
>TIGR03380 agmatine_aguA agmatine deiminase. Members of this family are agmatine deiminase (3.5.3.12), as characterized in Pseudomonas aeruginosa and plants. Related deiminases include the peptidyl-arginine deiminase (3.5.3.15) as found in Porphyromonas gingivalis.
Probab=23.22 E-value=1.7e+02 Score=25.96 Aligned_cols=27 Identities=19% Similarity=0.278 Sum_probs=20.2
Q ss_pred CCCcCCcHHHHHHHHHhCCCeeEEEEe
Q psy2347 16 GLTETVKKEDLELEFEKFGKLNSVWVA 42 (207)
Q Consensus 16 nLp~~~te~~L~~~F~~~G~V~~v~i~ 42 (207)
|-.+.+|+++|++.|+++=-|++|.++
T Consensus 172 nRNP~ls~~eIe~~Lk~~LGv~kviWL 198 (357)
T TIGR03380 172 GRNPHLTKEQIEEKLKDYLGVEKVIWL 198 (357)
T ss_pred CCCCCCCHHHHHHHHHHHHCCCEEEEE
Confidence 456788999999999885556666554
No 365
>cd01676 RNR_II_monomer Class II ribonucleotide reductase, monomeric form. Ribonucleotide reductase (RNR) catalyzes the reductive synthesis of deoxyribonucleotides from their corresponding ribonucleotides. It provides the precursors necessary for DNA synthesis. RNRs are separated into three classes based on their metallocofactor usage. Class I RNRs, found in eukaryotes, bacteria, and bacteriophage, use a diiron-tyrosyl radical. Class II RNRs, found in bacteria, bacteriophage, algae and archaea, use coenzyme B12 (adenosylcobalamin, AdoCbl). Class III RNRs, found in anaerobic bacteria, bacteriophage, and archaea, use an FeS cluster and S-adenosylmethionine to generate a glycyl radical. Many organisms have more than one class of RNR present in their genomes. All three RNRs have a ten-stranded alpha-beta barrel domain that is structurally similar to the domain of PFL (pyruvate formate lyase). Class II RNRs are found in bacteria that can live under both aerobic and anaerobic conditions. Man
Probab=23.18 E-value=1.8e+02 Score=28.04 Aligned_cols=60 Identities=15% Similarity=0.173 Sum_probs=39.3
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCC-CeeEEEEeeC---CCcEEEEEEc--CHHHHHHHHHhcCCce
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFG-KLNSVWVAFN---PPGFAFVEFS--NQIDAEAACDSMNDQD 70 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G-~V~~v~i~~d---~rG~afV~F~--~~e~A~~Al~~l~g~~ 70 (207)
.++|| |+|++++.++|.+++..+- .++-|.|..+ .+.|..+-|+ ++|.-++++.+|....
T Consensus 569 vSkTI---n~p~~~~~~dV~~~l~~~~~~~KG~t~l~~~d~~~~~~q~P~E~I~ke~ye~~~~~~~~~~ 634 (658)
T cd01676 569 VSNTI---SFDPSPEGPALIQALQQFQYQYKSVSLLPRFDTTKNAAQMPFEPITKERYEQRIAKLKDVD 634 (658)
T ss_pred ccceE---eeCCCCCHHHHHHHHHHhhcCCeEEEEeecCCcccCcccCCccccCHHHHHHHHhhcCccc
Confidence 34455 5688888999988877653 3777777663 3334455565 6777777777766443
No 366
>COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane]
Probab=23.17 E-value=2.3e+02 Score=25.91 Aligned_cols=58 Identities=16% Similarity=0.087 Sum_probs=35.5
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCC----Cee--EEE---------EeeC--CCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFG----KLN--SVW---------VAFN--PPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G----~V~--~v~---------i~~d--~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.|.+..=.+-++.+.|++++..+- .+. .++ |+++ .+-.++||.+|..++++.++.+|.
T Consensus 99 vLVl~GD~PLit~~TL~~L~~~~~~~~~~~tvLt~~~~dP~GYGRIvr~~~g~V~~IVE~KDA~~eek~I~eiNt 173 (460)
T COG1207 99 VLVLYGDVPLITAETLEELLAAHPAHGAAATVLTAELDDPTGYGRIVRDGNGEVTAIVEEKDASEEEKQIKEINT 173 (460)
T ss_pred EEEEeCCcccCCHHHHHHHHHhhhhcCCceEEEEEEcCCCCCcceEEEcCCCcEEEEEEcCCCCHHHhcCcEEee
Confidence 555544445588888887777653 222 221 2222 234788999988888887776554
No 367
>PRK14168 bifunctional 5,10-methylene-tetrahydrofolate dehydrogenase/ 5,10-methylene-tetrahydrofolate cyclohydrolase; Provisional
Probab=23.00 E-value=3.1e+02 Score=23.66 Aligned_cols=24 Identities=13% Similarity=0.214 Sum_probs=18.4
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFG 34 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G 34 (207)
...+-.||.++|+++|.+.+.+..
T Consensus 64 ~~~~~~l~~~~t~~el~~~I~~lN 87 (297)
T PRK14168 64 HEIQDNQSVDITEEELLALIDKYN 87 (297)
T ss_pred EEEEEECCCCCCHHHHHHHHHHHh
Confidence 445567889999999888887764
No 368
>cd04883 ACT_AcuB C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. This CD includes the C-terminal ACT domain of the Bacillus subtilis acetoin utilization protein, AcuB. AcuB is putatively involved in the anaerobic catabolism of acetoin, and related proteins. Studies report the induction of AcuB by nitrate respiration and also by fermentation. Since acetoin can be secreted and later serve as a source of carbon, it has been proposed that, during anaerobic growth when other carbon sources are exhausted, the induction of the AcuB protein results in acetoin catabolism. AcuB-like proteins have two N-terminal tandem CBS domains and a single C-terminal ACT domain. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=22.98 E-value=2e+02 Score=17.98 Aligned_cols=44 Identities=16% Similarity=0.066 Sum_probs=24.9
Q ss_pred cHHHHHHHHHhCCC-eeEEEEeeC-CCcEE--EEEEc--CHHHHHHHHHh
Q psy2347 22 KKEDLELEFEKFGK-LNSVWVAFN-PPGFA--FVEFS--NQIDAEAACDS 65 (207)
Q Consensus 22 te~~L~~~F~~~G~-V~~v~i~~d-~rG~a--fV~F~--~~e~A~~Al~~ 65 (207)
.-.+|.++|.+++. |.++..... ....+ +|.++ +.+++.++|++
T Consensus 14 ~l~~i~~~l~~~~inI~~i~~~~~~~~~~~~v~i~v~~~~~~~~~~~L~~ 63 (72)
T cd04883 14 QLADIAAIFKDRGVNIVSVLVYPSKEEDNKILVFRVQTMNPRPIIEDLRR 63 (72)
T ss_pred HHHHHHHHHHHcCCCEEEEEEeccCCCCeEEEEEEEecCCHHHHHHHHHH
Confidence 44567788888753 666654333 22344 44444 55566666665
No 369
>PRK00033 clpS ATP-dependent Clp protease adaptor protein ClpS; Reviewed
Probab=22.96 E-value=71 Score=22.88 Aligned_cols=25 Identities=20% Similarity=0.236 Sum_probs=19.9
Q ss_pred CCcEEEEEEcCHHHHHHHHHhcCCc
Q psy2347 45 PPGFAFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 45 ~rG~afV~F~~~e~A~~Al~~l~g~ 69 (207)
.++-|.|--.+.|.|+.-+++|+..
T Consensus 67 ~~G~avv~~~~~e~AE~~~~~l~~~ 91 (100)
T PRK00033 67 NEGKAVVGVCTREVAETKVEQVHQH 91 (100)
T ss_pred cCCcEEEEEEcHHHHHHHHHHHHcC
Confidence 5788888888999999888887443
No 370
>PF04800 ETC_C1_NDUFA4: ETC complex I subunit conserved region; InterPro: IPR006885 This entry represents prokaryotic NADH-ubiquinone oxidoreductase subunits (1.6.5.3 from EC, 1.6.99.3 from EC) from complex I of the electron transport chain initially identified in Neurospora crassa as a 21 kDa protein [].; GO: 0016651 oxidoreductase activity, acting on NADH or NADPH, 0022900 electron transport chain, 0005743 mitochondrial inner membrane; PDB: 2JYA_A 2LJU_A.
Probab=22.89 E-value=1.5e+02 Score=21.25 Aligned_cols=20 Identities=25% Similarity=0.267 Sum_probs=15.8
Q ss_pred EEEEEcCHHHHHHHHHhcCCc
Q psy2347 49 AFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 49 afV~F~~~e~A~~Al~~l~g~ 69 (207)
.-+.|.+.|+|.+-++. ++.
T Consensus 51 v~l~F~skE~Ai~yaer-~G~ 70 (101)
T PF04800_consen 51 VRLKFDSKEDAIAYAER-NGW 70 (101)
T ss_dssp CEEEESSHHHHHHHHHH-CT-
T ss_pred eEeeeCCHHHHHHHHHH-cCC
Confidence 45799999999999987 543
No 371
>PF13145 Rotamase_2: PPIC-type PPIASE domain; PDB: 3NRK_A 2KGJ_A.
Probab=22.86 E-value=35 Score=23.78 Aligned_cols=13 Identities=31% Similarity=0.470 Sum_probs=8.8
Q ss_pred CcHHHHHHHHHhC
Q psy2347 21 VKKEDLELEFEKF 33 (207)
Q Consensus 21 ~te~~L~~~F~~~ 33 (207)
+|+++|+++++..
T Consensus 1 vte~el~~yY~~n 13 (121)
T PF13145_consen 1 VTEEELRAYYEEN 13 (121)
T ss_dssp --HHHHHHHHHH-
T ss_pred CCHHHHHHHHHHH
Confidence 5899999998754
No 372
>PF10041 DUF2277: Uncharacterized conserved protein (DUF2277); InterPro: IPR018735 Members of this family of hypothetical bacterial proteins have no known function.
Probab=22.83 E-value=41 Score=22.88 Aligned_cols=20 Identities=5% Similarity=0.222 Sum_probs=16.4
Q ss_pred EcCCCcCCcHHHHHHHHHhC
Q psy2347 14 VGGLTETVKKEDLELEFEKF 33 (207)
Q Consensus 14 V~nLp~~~te~~L~~~F~~~ 33 (207)
+.|+.+.+|++||+..--+|
T Consensus 8 L~~fePpaT~~EI~aAAlQy 27 (78)
T PF10041_consen 8 LRNFEPPATDEEIRAAALQY 27 (78)
T ss_pred hcCCCCCCCHHHHHHHHHHH
Confidence 34787889999999887777
No 373
>PF09341 Pcc1: Transcription factor Pcc1; InterPro: IPR015419 Pcc1 is a proposed transcription factor involved in the expression of genes regulated by alpha-factor and galactose; component of the EKC/KEOPS protein complex with Kae1, Gon7, Bud32, and Cgi121; related to human cancer-testis antigens [].; PDB: 2BNR_C 2P5W_C 3KLA_C 2F54_C 2P5E_C 2F53_C 3ENO_E 3ENC_B.
Probab=22.83 E-value=98 Score=20.45 Aligned_cols=20 Identities=20% Similarity=0.207 Sum_probs=14.9
Q ss_pred EEEEEEcCHHHHHHHHHhcC
Q psy2347 48 FAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 48 ~afV~F~~~e~A~~Al~~l~ 67 (207)
.+-|.|++++.|+.+++.|.
T Consensus 4 ~l~i~f~s~~~A~ii~~sL~ 23 (76)
T PF09341_consen 4 TLEIPFESEEKAEIIYRSLK 23 (76)
T ss_dssp EEEEE-SSHHHHHHHHHHHH
T ss_pred EEEEEeCCHHHHHHHHHHhC
Confidence 46689999999998887543
No 374
>PF07693 KAP_NTPase: KAP family P-loop domain; InterPro: IPR011646 The KAP (after Kidins220/ARMS and PifA) family of predicted NTPases are sporadically distributed across a wide phylogenetic range in bacteria and in animals. Many of the prokaryotic KAP NTPases are encoded in plasmids and tend to undergo disruption to form pseudogenes. A unique feature of all eukaryotic and certain bacterial KAP NTPases is the presence of two or four transmembrane helices inserted into the P-loop NTPase domain. These transmembrane helices anchor KAP NTPases in the membrane such that the P-loop domain is located on the intracellular side [].
Probab=22.81 E-value=87 Score=26.39 Aligned_cols=40 Identities=15% Similarity=0.142 Sum_probs=27.5
Q ss_pred CcEEEEEEcCHHHHHHHHHhcC--------CceeCCcEEEEEEeecCC
Q psy2347 46 PGFAFVEFSNQIDAEAACDSMN--------DQDLMGSKLRVEISRGRG 85 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~~l~--------g~~i~g~~l~V~~a~~k~ 85 (207)
+.+.||.--|++.+.+|++..- ..++-.+.|.+.+.-|..
T Consensus 203 ~~i~~Il~~D~~~l~~ai~~~~~~~~~~~~~~~yLeKiiq~~~~lP~~ 250 (325)
T PF07693_consen 203 PNIIFILAFDPEILEKAIEKNYGEGFDEIDGREYLEKIIQVPFSLPPP 250 (325)
T ss_pred CCeEEEEEecHHHHHHHHHhhcCcccccccHHHHHHhhcCeEEEeCCC
Confidence 5677888889999999988632 234456667777766644
No 375
>PRK14887 KEOPS complex Pcc1-like subunit; Provisional
Probab=22.80 E-value=2.6e+02 Score=19.17 Aligned_cols=23 Identities=17% Similarity=-0.034 Sum_probs=17.8
Q ss_pred CcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 46 PGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~~l~g 68 (207)
+..+-|+|++++.|+.+.+.|.-
T Consensus 5 ~~~lei~f~s~~~A~iiy~sl~~ 27 (84)
T PRK14887 5 KFTLEFEFETEERARIIYRSVLP 27 (84)
T ss_pred eEEEEEEECCHHHHHHHHHHhCc
Confidence 44667899999999888877653
No 376
>COG0318 CaiC Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [Lipid metabolism / Secondary metabolites biosynthesis, transport, and catabolism]
Probab=22.79 E-value=3.4e+02 Score=24.84 Aligned_cols=47 Identities=15% Similarity=0.065 Sum_probs=35.1
Q ss_pred cCCcHHHHHHHHHhCCCeeEEEEeeC------CCcEEEEEEc-------CHHHHHHHHHh
Q psy2347 19 ETVKKEDLELEFEKFGKLNSVWVAFN------PPGFAFVEFS-------NQIDAEAACDS 65 (207)
Q Consensus 19 ~~~te~~L~~~F~~~G~V~~v~i~~d------~rG~afV~F~-------~~e~A~~Al~~ 65 (207)
+++..++|++++.++-.|.++.++-- .+.+|+|+-. ++++..+.|+.
T Consensus 432 ~ni~p~eiE~~l~~~~~V~~aavvgvpd~~~Ge~~~a~v~~~~~~~~~~~~~~i~~~~~~ 491 (534)
T COG0318 432 ENIYPEEIEAVLAEHPAVAEAAVVGVPDERWGERVVAVVVLKPGGDAELTAEELRAFLRK 491 (534)
T ss_pred eEECHHHHHHHHHhCCCcceEEEEeCCCCccCceEEEEEEEcCCCCCCCCHHHHHHHHHh
Confidence 46788999999999988888766532 5678888886 56666666664
No 377
>PF08002 DUF1697: Protein of unknown function (DUF1697); InterPro: IPR012545 This family contains many hypothetical bacterial proteins.; PDB: 2HIY_B.
Probab=22.69 E-value=1.7e+02 Score=21.90 Aligned_cols=30 Identities=13% Similarity=0.131 Sum_probs=12.0
Q ss_pred cEEEEEEcCHHHHHHHHHhcCCceeCCcEE
Q psy2347 47 GFAFVEFSNQIDAEAACDSMNDQDLMGSKL 76 (207)
Q Consensus 47 G~afV~F~~~e~A~~Al~~l~g~~i~g~~l 76 (207)
.+-+|.|-+.+....+++.|........++
T Consensus 100 ~~~~v~fl~~~~~~~~~~~l~~~~~~~E~~ 129 (137)
T PF08002_consen 100 KRLYVTFLSGPPDAEALEELAAYDTGPERF 129 (137)
T ss_dssp SEEEEEEE-TT--HHHHHHHHTS---SEEE
T ss_pred ceEEEEEeCCCCCHHHHHHHhccCCCCcEE
Confidence 355666665555444555554443333333
No 378
>PF05727 UPF0228: Uncharacterised protein family (UPF0228); InterPro: IPR008887 This small family of proteins is currently restricted to Methanosarcina species. Members of this family are about 200 residues in length, except for Q8TMK1 from SWISSPROT that has two copies of this region. Although the function of this region is unknown the pattern of conservation suggests that this may be an enzyme, including multiple conserved aspartate and glutamate residues. The most conserved motif in these proteins is NEL/MEXNE/D, where X can be any amino acid, and is found at the C terminus of these proteins.
Probab=22.58 E-value=1.6e+02 Score=22.05 Aligned_cols=51 Identities=14% Similarity=0.280 Sum_probs=28.1
Q ss_pred CcHHHHHHHHHhCCC-eeEEEEeeCCCcEEEEEEcC--------HHHHHHHHHhcCCceeCCcEEEEEEe
Q psy2347 21 VKKEDLELEFEKFGK-LNSVWVAFNPPGFAFVEFSN--------QIDAEAACDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 21 ~te~~L~~~F~~~G~-V~~v~i~~d~rG~afV~F~~--------~e~A~~Al~~l~g~~i~g~~l~V~~a 81 (207)
+.++.+.++++++-- |++. -+|+|.|.+ +++|.+.... .+.+.+.|.|.+.
T Consensus 64 i~d~~FL~iL~K~nLqvKKf-------VwCyI~f~dgs~nywIpe~DAirIKnE---LE~NE~il~V~~d 123 (127)
T PF05727_consen 64 IEDKNFLEILEKYNLQVKKF-------VWCYIRFGDGSKNYWIPEKDAIRIKNE---LEMNEKILIVYLD 123 (127)
T ss_pred cCchhHHHHHHhcCcceeEE-------EEEEEEcCCCCcccccchHHHHHHHHH---HhcCCcEEEEEec
Confidence 445567777777642 2221 267777754 3444444443 3445667777653
No 379
>PF09902 DUF2129: Uncharacterized protein conserved in bacteria (DUF2129); InterPro: IPR016979 This is a group of uncharacterised conserved proteins.
Probab=22.51 E-value=2.3e+02 Score=18.92 Aligned_cols=38 Identities=21% Similarity=0.378 Sum_probs=26.0
Q ss_pred HHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCce
Q psy2347 29 EFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQD 70 (207)
Q Consensus 29 ~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~ 70 (207)
-+.+||.|..+.-. ..|+ |.|-+.+++++.++.|....
T Consensus 16 ~L~kfG~i~Y~Skk---~kYv-vlYvn~~~~e~~~~kl~~l~ 53 (71)
T PF09902_consen 16 QLRKFGDIHYVSKK---MKYV-VLYVNEEDVEEIIEKLKKLK 53 (71)
T ss_pred hHhhcccEEEEECC---ccEE-EEEECHHHHHHHHHHHhcCC
Confidence 45789998876321 2244 45778899999988876544
No 380
>COG5594 Uncharacterized integral membrane protein [Function unknown]
Probab=22.51 E-value=96 Score=30.47 Aligned_cols=22 Identities=32% Similarity=0.501 Sum_probs=18.3
Q ss_pred CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 45 PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 45 ~rG~afV~F~~~e~A~~Al~~l 66 (207)
+...+||+|++...|+.|.+..
T Consensus 356 ~~~~~FItFkSq~~Aq~~aQ~~ 377 (827)
T COG5594 356 PTKSGFITFKSQASAQIAAQSQ 377 (827)
T ss_pred ccccEEEEEehhHHHHHHHHhh
Confidence 3458999999999999888763
No 381
>cd00495 Ribosomal_L25_TL5_CTC Ribosomal_L25_TL5_CTC: Ribosomal L25/TL5/CTC N-terminal 5S rRNA binding domain. L25 is a single-domain protein, homologous to the N-terminal domain of TL5 and CTC, which each contain two domains. CTC is a known stress protein, and proteins of this family are believed to have two functions, acting as both ribosomal and stress proteins. In Escherichia coli, cells deleted for L25 were found to be viable; however, these cells grew slowly and had impaired protein synthesis capability. In Bacillus subtilis, CTC is induced under stress conditions and located in the ribosome; it has been proposed that CTC may be necessary for accurate translation under stress conditions. Ribosomal_L25_TL5_CTC is found only in bacteria and some plastids. Due to its limited taxonomic diversity and the viability of cells deleted for L25, this protein is not believed to be necessary for ribosomal assembly. Eukaryotes contain a protein called L25, which is not homologous to bacterial L
Probab=22.41 E-value=1.6e+02 Score=20.25 Aligned_cols=53 Identities=13% Similarity=0.240 Sum_probs=28.9
Q ss_pred CCCcCCcHHHHHHHHHhCCCeeEEEEeeC-CCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEE
Q psy2347 16 GLTETVKKEDLELEFEKFGKLNSVWVAFN-PPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEI 80 (207)
Q Consensus 16 nLp~~~te~~L~~~F~~~G~V~~v~i~~d-~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~ 80 (207)
++|-.+.+.+|.+++...+.-.-+.+..+ .+-.|+| +.++-+.+.+..+.|++
T Consensus 36 ~~~i~v~~~~l~k~l~~~~~~~~~~L~i~g~~~~~~i------------kevQ~~pv~~~i~HvDF 89 (91)
T cd00495 36 PISISVDEKELEKLLRKEGRSTLIELNIDGKKENVLI------------KDVQRHPVKDKILHVDF 89 (91)
T ss_pred CEEEEEcHHHHHHHHhhcCCceEEEEEECCEEEEEEE------------ehhhhccCCCCEEEEec
Confidence 34556788889888887753222222222 1112322 23455566677777765
No 382
>KOG1975|consensus
Probab=22.36 E-value=1.5e+02 Score=26.21 Aligned_cols=39 Identities=28% Similarity=0.178 Sum_probs=22.0
Q ss_pred cCCcHHHHHHHHHh--C-CCeeEEEEeeCCCcEEEEEEcCHHHHHHHH
Q psy2347 19 ETVKKEDLELEFEK--F-GKLNSVWVAFNPPGFAFVEFSNQIDAEAAC 63 (207)
Q Consensus 19 ~~~te~~L~~~F~~--~-G~V~~v~i~~d~rG~afV~F~~~e~A~~Al 63 (207)
-+++.+.|.++++. . =+|++|++... ||| ++++.|..||
T Consensus 179 ~Dc~~~~l~d~~e~~dp~fDivScQF~~H---YaF---etee~ar~~l 220 (389)
T KOG1975|consen 179 ADCFKERLMDLLEFKDPRFDIVSCQFAFH---YAF---ETEESARIAL 220 (389)
T ss_pred eccchhHHHHhccCCCCCcceeeeeeeEe---eee---ccHHHHHHHH
Confidence 46788888888731 1 24666665443 444 4555555444
No 383
>PLN02655 ent-kaurene oxidase
Probab=22.26 E-value=2e+02 Score=25.70 Aligned_cols=49 Identities=14% Similarity=0.161 Sum_probs=34.4
Q ss_pred EEcCCCcC---CcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHh
Q psy2347 13 YVGGLTET---VKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 13 fV~nLp~~---~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~ 65 (207)
+|+||..- -....+.+++++|++|-.+.+. ..-.|...+++.++.++.+
T Consensus 9 ~iG~l~~~~~~~~~~~~~~~~~~yG~i~~~~~g----~~~~vvv~~pe~~k~il~~ 60 (466)
T PLN02655 9 VIGNLLQLKEKKPHRTFTKWSEIYGPIYTIRTG----ASSVVVLNSTEVAKEAMVT 60 (466)
T ss_pred ccccHHHcCCCchhHHHHHHHHHhCCeEEEEEC----CEeEEEeCCHHHHHHHHHh
Confidence 45665321 1346788889999998777653 2356778899999998864
No 384
>PF04765 DUF616: Protein of unknown function (DUF616); InterPro: IPR006852 The entry represents a protein of unknown function. The function of is unknown although a number of the members are thought to be glycosyltransferases.
Probab=22.17 E-value=3.6e+02 Score=23.36 Aligned_cols=48 Identities=15% Similarity=0.138 Sum_probs=28.7
Q ss_pred EEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeC--------------CCcEEEEEEcCHHHHHH
Q psy2347 13 YVGGLTETVKKEDLELEFEKFGKLNSVWVAFN--------------PPGFAFVEFSNQIDAEA 61 (207)
Q Consensus 13 fV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d--------------~rG~afV~F~~~e~A~~ 61 (207)
||.+--.+++|.|++. .+++..|+..-|.-+ .+.+|||.|.|++..+.
T Consensus 44 f~~~~gf~i~~~d~~~-m~~c~vvV~saIFG~yD~l~qP~~i~~~s~~~vcf~mF~D~~t~~~ 105 (305)
T PF04765_consen 44 FVKNTGFDISESDRRY-MEKCRVVVYSAIFGNYDKLRQPKNISEYSKKNVCFFMFVDEETLKS 105 (305)
T ss_pred cccCCCCCCCHHHHHH-HhcCCEEEEEEecCCCccccCchhhCHHHhcCccEEEEEehhhHHH
Confidence 4444446788888765 456653333333222 34689999999766543
No 385
>PRK14188 bifunctional 5,10-methylene-tetrahydrofolate dehydrogenase/ 5,10-methylene-tetrahydrofolate cyclohydrolase; Provisional
Probab=22.11 E-value=2.9e+02 Score=23.71 Aligned_cols=24 Identities=8% Similarity=0.152 Sum_probs=18.6
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCC
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFG 34 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G 34 (207)
...+-.||.++++++|.+++.+..
T Consensus 63 ~~~~~~l~~~~~~~el~~~i~~lN 86 (296)
T PRK14188 63 ASFEHKLPADTSQAELLALIARLN 86 (296)
T ss_pred EEEEEECCCCCCHHHHHHHHHHHh
Confidence 455667889999999888887764
No 386
>TIGR00401 msrA methionine-S-sulfoxide reductase. This model describes peptide methionine sulfoxide reductase (MsrA), a repair enzyme for proteins that have been inactivated by oxidation. The enzyme from E. coli is coextensive with this model and has enzymatic activity. However, in all completed genomes in which this module is present, a second protein module, described in TIGR00357, is also found, and in several cases as part of the same polypeptide chain: N-terminal to this module in Helicobacter pylori and Haemophilus influenzae (as in PilB of Neisseria gonorrhoeae) but C-terminal to it in Treponema pallidum. PilB, containing both domains, has been shown to be important for the expression of adhesins in certain pathogens.
Probab=21.86 E-value=3.6e+02 Score=20.63 Aligned_cols=71 Identities=14% Similarity=0.084 Sum_probs=32.2
Q ss_pred EEEEcCCCcCCcHHHHHHHHHhCCCeeEEEE-eeC-CCcEEE-EEEcCHHHHHHH---HHhcCCceeCCcEEEEEEe
Q psy2347 11 RVYVGGLTETVKKEDLELEFEKFGKLNSVWV-AFN-PPGFAF-VEFSNQIDAEAA---CDSMNDQDLMGSKLRVEIS 81 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i-~~d-~rG~af-V~F~~~e~A~~A---l~~l~g~~i~g~~l~V~~a 81 (207)
.|-|-==|..++-++|.++|-+.-+.+.+.- -.| ..-|.- |-+.++++.+.| +++++......++|.+++.
T Consensus 53 ~V~V~yDp~~is~~~Ll~~f~~~hdPt~~~~Qg~d~G~qYRs~If~~~~~q~~~a~~~~~~~~~~~~~~~~i~tei~ 129 (149)
T TIGR00401 53 AVQVTYDPKVISYEELLDVFWEIHDPTQGNRQGNDIGTQYRSGIYYHSDEQEKAARASKERLQAAANYGDPIVTEIE 129 (149)
T ss_pred EEEEEECCCcCcHHHHHHHHHHhCCCCcCCCCCCCCCCCceEEEEeCCHHHHHHHHHHHHHHHHhcccCCCeEEEEe
Confidence 3444333666787888777655323222211 112 122333 333455554444 4444332122456666554
No 387
>PF14267 DUF4357: Domain of unknown function (DUF4357)
Probab=21.77 E-value=1.9e+02 Score=18.26 Aligned_cols=14 Identities=29% Similarity=0.116 Sum_probs=11.5
Q ss_pred EEEcCHHHHHHHHH
Q psy2347 51 VEFSNQIDAEAACD 64 (207)
Q Consensus 51 V~F~~~e~A~~Al~ 64 (207)
+.|.++..|..+|.
T Consensus 23 ~~F~SPS~AA~~v~ 36 (55)
T PF14267_consen 23 YLFSSPSAAAAVVL 36 (55)
T ss_pred eEcCChHHHHHHHc
Confidence 78999998888774
No 388
>PRK02230 inorganic pyrophosphatase; Provisional
Probab=21.71 E-value=1.7e+02 Score=23.48 Aligned_cols=37 Identities=14% Similarity=0.227 Sum_probs=25.1
Q ss_pred HHHHHHHHHhCCCeeEEEEeeCCCcE-EEEEEcCHHHHHHHHHhc
Q psy2347 23 KEDLELEFEKFGKLNSVWVAFNPPGF-AFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 23 e~~L~~~F~~~G~V~~v~i~~d~rG~-afV~F~~~e~A~~Al~~l 66 (207)
-++|+++|+.|-.++. +.+ .+..|.+.+.|.++|+..
T Consensus 118 l~~I~~fF~~YK~leg-------k~~~~v~g~~~~~~A~~~I~~~ 155 (184)
T PRK02230 118 LDEIEYFFSNYKNWKR-------KGITKVKGFEDEKWALKEYKEC 155 (184)
T ss_pred HHHHHHHHHHhcCCCC-------CCeEEeCCccCHHHHHHHHHHH
Confidence 4688899999855442 222 256788888888877653
No 389
>PF09623 Cas_NE0113: CRISPR-associated protein NE0113 (Cas_NE0113); InterPro: IPR019092 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a Cas protein family found in both bacteria and arachaea. The function of these proteins is unknown.
Probab=21.69 E-value=1.4e+02 Score=24.66 Aligned_cols=44 Identities=23% Similarity=0.299 Sum_probs=28.5
Q ss_pred EEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCcee----C-CcEEEEEEe
Q psy2347 38 SVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDL----M-GSKLRVEIS 81 (207)
Q Consensus 38 ~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i----~-g~~l~V~~a 81 (207)
.|+|+.+..+.-+-+..++++++.+++.+..... + +..|.|++|
T Consensus 72 ~i~vi~~~~g~~l~DI~t~~d~~~~~~~I~~~i~~l~~~~~~~lh~sIA 120 (224)
T PF09623_consen 72 HIHVIIDVNGLPLDDIRTEEDNEAFADFIYRLIRELKQDPGRRLHVSIA 120 (224)
T ss_pred ccEEEecCCCccccccCCHHHHHHHHHHHHHHHHHHhhCCCCeEEEEec
Confidence 3445555556667788888888887776544322 2 577888776
No 390
>PTZ00372 endonuclease 4-like protein; Provisional
Probab=21.64 E-value=1.9e+02 Score=26.21 Aligned_cols=58 Identities=12% Similarity=0.113 Sum_probs=38.8
Q ss_pred CcEEEEcCCCcC-----CcHHHHHHHHHhCCCeeEEEEeeCCCcEEE---EEEcCHHHHHHHHHhcC
Q psy2347 9 GTRVYVGGLTET-----VKKEDLELEFEKFGKLNSVWVAFNPPGFAF---VEFSNQIDAEAACDSMN 67 (207)
Q Consensus 9 ~~~lfV~nLp~~-----~te~~L~~~F~~~G~V~~v~i~~d~rG~af---V~F~~~e~A~~Al~~l~ 67 (207)
..+|.|-|++.. .+-++|.++++.......|.|+.| .+.+| -+..+++.+...++.++
T Consensus 268 gV~IlLENmag~g~~lG~~~eeL~~Iid~v~~~~rlGvCLD-TcHafaAGydl~t~e~~~~~l~~f~ 333 (413)
T PTZ00372 268 SVIIVLENTAGQKNSVGSKFEDLRDIIALVEDKSRVGVCLD-TCHLFAAGYDIRTKESFDKVMKEFD 333 (413)
T ss_pred CCEEEEecCCCCCCcccCCHHHHHHHHHhcCCcCCeEEEEE-HHHHHhcCCCCCcHHHHHHHHHHHH
Confidence 467888888752 467899999998765567778888 33333 23456666666666543
No 391
>PRK11370 YciI-like protein; Reviewed
Probab=21.52 E-value=2.2e+02 Score=19.80 Aligned_cols=26 Identities=23% Similarity=0.224 Sum_probs=18.6
Q ss_pred cEEEEEEcCHHHHHHHHHhcCCceeCC
Q psy2347 47 GFAFVEFSNQIDAEAACDSMNDQDLMG 73 (207)
Q Consensus 47 G~afV~F~~~e~A~~Al~~l~g~~i~g 73 (207)
.+..|+|+|.++|++.++. +.....+
T Consensus 59 ~~ii~ea~s~~~a~~~~~~-DPy~~aG 84 (99)
T PRK11370 59 STVIAEFESLEAAQAWADA-DPYVAAG 84 (99)
T ss_pred eEEEEEECCHHHHHHHHHC-CchhhcC
Confidence 3566699999999999875 4444334
No 392
>cd04879 ACT_3PGDH-like ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH). ACT_3PGDH-like: The ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH), with or without an extended C-terminal (xct) region found in various bacteria, archaea, fungi, and plants. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In bacteria, 3PGDH is feedback controlled by the end product L-serine in an allosteric manner. In the Escherichia coli homotetrameric enzyme, the interface at adjacent ACT (regulatory) domains couples to create an extended beta-sheet. Each regulatory interface forms two serine-binding sites. The mechanism by which serine transmits inhibition to the active
Probab=21.52 E-value=2e+02 Score=17.37 Aligned_cols=34 Identities=12% Similarity=0.183 Sum_probs=20.0
Q ss_pred CcHHHHHHHHHhCC-CeeEEEEeeCC-CcEEEEEEc
Q psy2347 21 VKKEDLELEFEKFG-KLNSVWVAFNP-PGFAFVEFS 54 (207)
Q Consensus 21 ~te~~L~~~F~~~G-~V~~v~i~~d~-rG~afV~F~ 54 (207)
-.-.+|.++|.+++ .|..+.+..+. ..++.++|.
T Consensus 11 g~l~~i~~~l~~~~~nI~~~~~~~~~~~~~~~~~~~ 46 (71)
T cd04879 11 GVIGKVGTILGEHGINIAAMQVGRKEKGGIAYMVLD 46 (71)
T ss_pred CHHHHHHHHHHhcCCCeeeEEEeccCCCCEEEEEEE
Confidence 34567778888775 36667665542 245555443
No 393
>PRK11611 enhanced serine sensitivity protein SseB; Provisional
Probab=21.50 E-value=54 Score=27.48 Aligned_cols=80 Identities=10% Similarity=0.147 Sum_probs=44.1
Q ss_pred CCCCCcEEEEcCCCcCCc--HHHHHHHHHhCCCeeEEEE--ee---CCCcEEEEEEcCHHHHHHHHHh---cC-CceeCC
Q psy2347 5 MMERGTRVYVGGLTETVK--KEDLELEFEKFGKLNSVWV--AF---NPPGFAFVEFSNQIDAEAACDS---MN-DQDLMG 73 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~t--e~~L~~~F~~~G~V~~v~i--~~---d~rG~afV~F~~~e~A~~Al~~---l~-g~~i~g 73 (207)
..+....|.|+.+...-. -+.|+++|+++-.|+...| +. |.+-.-.|.++...+.+..+.. +. ....++
T Consensus 125 ~i~~g~~v~lg~p~~~P~~lv~~L~~lf~~~k~V~rAyL~~~~~~~d~~p~LLI~le~~~d~e~ii~~ag~~a~~~l~~d 204 (246)
T PRK11611 125 VLEGGESLLLSEVAEPPAQMIDSLTTLFKTIKPVKRAFLASIKENADAQPNLLIGIEADGDIEEIIQAAGSVATDTLPGD 204 (246)
T ss_pred EeCCCCEEEecCCccchHHHHHHHHHHHhhcchHHHHHHHHHhccCCCCCceEEEEecCCCHHHHHHHHhHHHHHhCCCC
Confidence 345677888877644322 3678999999988775432 22 2233344444322233333332 22 233467
Q ss_pred cEEEEEEeecC
Q psy2347 74 SKLRVEISRGR 84 (207)
Q Consensus 74 ~~l~V~~a~~k 84 (207)
..|.|....+.
T Consensus 205 ~~IDi~~v~~~ 215 (246)
T PRK11611 205 EPIDICQVKEG 215 (246)
T ss_pred CceeEEEecCC
Confidence 78888777653
No 394
>PRK14093 UDP-N-acetylmuramoylalanyl-D-glutamyl-2,6-diaminopimelate--D-alanyl-D-alanine ligase; Provisional
Probab=21.30 E-value=3.9e+02 Score=24.25 Aligned_cols=60 Identities=10% Similarity=0.087 Sum_probs=35.6
Q ss_pred CCcHHHHHHHHHhCCCe----eEEEEeeCCCcEEEEE--Ec-CHHHHHHHHHhcCCce--eCCcEEEEE
Q psy2347 20 TVKKEDLELEFEKFGKL----NSVWVAFNPPGFAFVE--FS-NQIDAEAACDSMNDQD--LMGSKLRVE 79 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V----~~v~i~~d~rG~afV~--F~-~~e~A~~Al~~l~g~~--i~g~~l~V~ 79 (207)
.++.++|.+-++.|-.+ +.+++........+|. |. +++++++||+.|.... ..++.|.|-
T Consensus 307 Gi~~~~i~~~l~~~~~~~gR~~~~r~~~~~~~~~iIDDsYahnP~s~~aaL~~l~~~~~~~~~r~i~V~ 375 (479)
T PRK14093 307 GADLALAALALSQVQPAAGRGVRHTLEVGGGEATLIDESYNANPASMAAALGVLGRAPVGPQGRRIAVL 375 (479)
T ss_pred CCCHHHHHHHHHhCCCcCCcceEEEeecCCCCEEEEECCCCCCHHHHHHHHHHHHhhhccCCCCEEEEE
Confidence 46777888888887532 2222211123467777 64 7888999998876542 234555543
No 395
>PF11181 YflT: Heat induced stress protein YflT
Probab=21.25 E-value=86 Score=22.13 Aligned_cols=29 Identities=17% Similarity=0.211 Sum_probs=22.0
Q ss_pred EEEcCHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 51 VEFSNQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 51 V~F~~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
-+|++.++|..||+.|...=+....|.|-
T Consensus 4 gv~~~~~E~~~~I~~L~~~Gy~~ddI~Vv 32 (103)
T PF11181_consen 4 GVYDNEEEALSAIEELKAQGYSEDDIYVV 32 (103)
T ss_pred EEECCHHHHHHHHHHHHHcCCCcccEEEE
Confidence 46889999999999887766666666654
No 396
>PRK12864 YciI-like protein; Reviewed
Probab=21.20 E-value=2.8e+02 Score=18.94 Aligned_cols=28 Identities=11% Similarity=0.030 Sum_probs=19.6
Q ss_pred CcEEEEEEcCHHHHHHHHHhcCCceeCCc
Q psy2347 46 PGFAFVEFSNQIDAEAACDSMNDQDLMGS 74 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~~l~g~~i~g~ 74 (207)
..+..|+++|.++|++.++. +.....+-
T Consensus 49 g~~~i~~a~s~eea~~~~~~-DPy~~aGl 76 (89)
T PRK12864 49 YVFGIFEAEDEETVRQLIEA-DPYWQNGI 76 (89)
T ss_pred CEEEEEEeCCHHHHHHHHHc-CCchhcCC
Confidence 34456688999999988875 55555554
No 397
>PRK01250 inorganic pyrophosphatase; Provisional
Probab=21.14 E-value=1.6e+02 Score=23.35 Aligned_cols=39 Identities=15% Similarity=0.152 Sum_probs=25.8
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhc
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l 66 (207)
.-++|+++|+.|-.++. .+-+-...|.+.+.|+++|++.
T Consensus 131 ~l~eI~~fF~~YK~le~------gk~~~v~g~~~~~~A~~~I~~~ 169 (176)
T PRK01250 131 LKAQIKHFFEHYKDLEK------GKWVKVEGWGGAEEAKAEIVEA 169 (176)
T ss_pred HHHHHHHHHHHhcCCCC------CCCEEecCccCHHHHHHHHHHH
Confidence 34688899998865431 1223344678999998888753
No 398
>COG0769 MurE UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane]
Probab=21.13 E-value=4.3e+02 Score=24.32 Aligned_cols=60 Identities=17% Similarity=0.126 Sum_probs=37.9
Q ss_pred cCCcHHHHHHHHHhCCCee-EEEEeeCCCcEEEEEEc-CHHHHHHHHHhcCCceeCCcEEEEE
Q psy2347 19 ETVKKEDLELEFEKFGKLN-SVWVAFNPPGFAFVEFS-NQIDAEAACDSMNDQDLMGSKLRVE 79 (207)
Q Consensus 19 ~~~te~~L~~~F~~~G~V~-~v~i~~d~rG~afV~F~-~~e~A~~Al~~l~g~~i~g~~l~V~ 79 (207)
..++.++|++-++++-.|. ..+++.-..--++|.|. +++..+++|+.+. ....++.|.|.
T Consensus 304 lG~~~e~i~~~l~~~~~v~GRmE~v~~~~~~v~VDyAHnPd~le~~L~~~~-~~~~g~li~Vf 365 (475)
T COG0769 304 LGVDLEDILAGLETLKPVPGRMELVNIGGKLVIVDYAHNPDGLEKALRAVR-LHAAGRLIVVF 365 (475)
T ss_pred cCCCHHHHHHHHHhcCCCCCcceEecCCCCeEEEEeccChHHHHHHHHHHH-hhcCCcEEEEE
Confidence 3567888888888775432 23333222457899996 8888999998876 22345544443
No 399
>COG0379 NadA Quinolinate synthase [Coenzyme metabolism]
Probab=21.05 E-value=2.3e+02 Score=24.73 Aligned_cols=60 Identities=15% Similarity=0.090 Sum_probs=38.4
Q ss_pred CCCCCcEEEEcCC------CcCCcHHHHHHHHHhCCCeeEEEEeeC---CCcEEEEEEcCHHHHHHHHHh
Q psy2347 5 MMERGTRVYVGGL------TETVKKEDLELEFEKFGKLNSVWVAFN---PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 5 ~~~~~~~lfV~nL------p~~~te~~L~~~F~~~G~V~~v~i~~d---~rG~afV~F~~~e~A~~Al~~ 65 (207)
+.++.++|.|=.+ ...++.++|+++-++|..+.-|..+.. -|-++=|...| ..|.+.++.
T Consensus 87 iLnPeK~VL~Pd~~AgCsmA~~~~~~~~~~~~~~~P~~~vV~YvNtsA~vKA~~Di~cTS-sNAvkvVe~ 155 (324)
T COG0379 87 ILNPEKTVLLPDLEAGCSMADMITAEEVRAFKEKHPDAPVVTYVNTSAEVKAEADICCTS-SNAVKVVES 155 (324)
T ss_pred hcCCCCeEecCCCCCCCCcccCCCHHHHHHHHHHCCCCceEEEeeChHHHHhhcCeEEec-chHHHHHHh
Confidence 4567777777544 345788999999889987666555444 34455455544 445666665
No 400
>PRK00110 hypothetical protein; Validated
Probab=21.03 E-value=4.9e+02 Score=21.70 Aligned_cols=55 Identities=25% Similarity=0.368 Sum_probs=31.3
Q ss_pred CCcEEEEcCCCcC--CcHHHHHHHHHhCCC-e---eEEEEeeCCCcEEEEEEc--CHHHH-HHHHH
Q psy2347 8 RGTRVYVGGLTET--VKKEDLELEFEKFGK-L---NSVWVAFNPPGFAFVEFS--NQIDA-EAACD 64 (207)
Q Consensus 8 ~~~~lfV~nLp~~--~te~~L~~~F~~~G~-V---~~v~i~~d~rG~afV~F~--~~e~A-~~Al~ 64 (207)
....|.|--|..+ -|..+|+.+|.+++- + -.|..+.+.+ +.|+|+ +++++ +.||+
T Consensus 93 ~GvaiiVe~lTDN~nRt~~~vR~~f~K~gG~l~~~Gsv~~~Fe~k--G~i~~~~~~~d~~~e~aie 156 (245)
T PRK00110 93 GGVAIIVEALTDNRNRTAAEVRHAFSKNGGNLGETGSVSYMFDRK--GVIVIEPLDEDELMEAALE 156 (245)
T ss_pred CCeEEEEEEecCCHHHHHHHHHHHHHhcCceeCCCcceEEEeccc--eEEEeCCCCHHHHHHHHHh
Confidence 3455666666554 356899999999864 2 2355555533 455554 34444 34444
No 401
>PRK13818 ribosome-binding factor A; Provisional
Probab=20.91 E-value=1.4e+02 Score=22.01 Aligned_cols=21 Identities=5% Similarity=-0.001 Sum_probs=13.9
Q ss_pred CCeeEEEEeeC-CCcEEEEEEc
Q psy2347 34 GKLNSVWVAFN-PPGFAFVEFS 54 (207)
Q Consensus 34 G~V~~v~i~~d-~rG~afV~F~ 54 (207)
-.|+.|+|..| +-..+||++-
T Consensus 32 vtVt~V~lS~Dl~~AkVyvs~~ 53 (121)
T PRK13818 32 VTITAVECTNDLSYATVYYSLL 53 (121)
T ss_pred eEEeEEEECCCCCEEEEEEEeC
Confidence 45778888888 4345667665
No 402
>TIGR01873 cas_CT1978 CRISPR-associated endoribonuclease Cas2, E. coli subfamily. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This model represents a minor branch of the Cas2 family of CRISPR-associated endonuclease, whereas most Cas2 proteins are modeled instead by TIGR01573. This form of Cas2 is characteristic for the Ecoli subtype of CRISPR/Cas locus.
Probab=20.83 E-value=2e+02 Score=20.06 Aligned_cols=51 Identities=27% Similarity=0.580 Sum_probs=28.5
Q ss_pred CCCCCcEEEEcCCCcCCcHHHHHHHHHhC---CCeeEEEEeeCCCcEEEEEEcC
Q psy2347 5 MMERGTRVYVGGLTETVKKEDLELEFEKF---GKLNSVWVAFNPPGFAFVEFSN 55 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~~te~~L~~~F~~~---G~V~~v~i~~d~rG~afV~F~~ 55 (207)
+.+...-|||++++..+-|...+.+-+.| |.+.-+.-..+..+|.|-++-+
T Consensus 21 llEv~~GVyVg~~s~rVRe~lW~~v~~~~~~~G~avm~~~~~~e~G~~~~t~G~ 74 (87)
T TIGR01873 21 LLEPRAGVYVGGVSASVRERIWDYLAQHCPPKGSLVITWSSNTCPGFEFFTLGE 74 (87)
T ss_pred eeecCCCcEEcCCCHHHHHHHHHHHHHhCCCCccEEEEEeCCCCCCcEEEecCC
Confidence 34567789999988777654444333332 3333222222256777776654
No 403
>PF04008 Adenosine_kin: Adenosine specific kinase; InterPro: IPR007153 The structure of a member of this family from the hyperthermophilic archaeon Pyrobaculum aerophilum contains a modified histidine residue which is interpreted as stable phosphorylation. In vitro binding studies confirmed that adenosine and AMP but not ADP or ATP bind to the protein [].; PDB: 1VGG_A 1RLH_A 1WVQ_A 2GL0_F 2JB7_B 2EKM_C 2D16_D.
Probab=20.74 E-value=2.2e+02 Score=21.97 Aligned_cols=30 Identities=20% Similarity=0.059 Sum_probs=21.7
Q ss_pred CCCCcEEEEcCCCcCCcHHHHHHHHHhCCC
Q psy2347 6 MERGTRVYVGGLTETVKKEDLELEFEKFGK 35 (207)
Q Consensus 6 ~~~~~~lfV~nLp~~~te~~L~~~F~~~G~ 35 (207)
.++.+.|.|+.-.+--|-|||.+.+-.--+
T Consensus 5 ~Pe~~N~IlGqsHFIKTvEDl~Ealv~svp 34 (155)
T PF04008_consen 5 KPEGCNVILGQSHFIKTVEDLYEALVTSVP 34 (155)
T ss_dssp --TTEEEEEEEESSTTHHHHHHHHHHCCST
T ss_pred CCCCceEEEeccchhhhHHHHHHHHhhcCC
Confidence 456778888877777788999888766544
No 404
>TIGR01033 DNA-binding regulatory protein, YebC/PmpR family. This model describes a minimally characterized protein family, restricted to bacteria excepting for some eukaryotic sequences that have possible transit peptides. YebC from E. coli is crystallized, and PA0964 from Pseudomonas aeruginosa has been shown to be a sequence-specific DNA-binding regulatory protein.
Probab=20.72 E-value=2.9e+02 Score=22.97 Aligned_cols=46 Identities=24% Similarity=0.276 Sum_probs=28.2
Q ss_pred CCCcEEEEcCCCcC--CcHHHHHHHHHhCCCee----EEEEeeCCCcEEEEEEc
Q psy2347 7 ERGTRVYVGGLTET--VKKEDLELEFEKFGKLN----SVWVAFNPPGFAFVEFS 54 (207)
Q Consensus 7 ~~~~~lfV~nLp~~--~te~~L~~~F~~~G~V~----~v~i~~d~rG~afV~F~ 54 (207)
+....|+|--|..+ -|..+|+.+|.+++-.. .|..+.+. .+.|+|+
T Consensus 92 P~GvaiiVe~lTDN~nRt~~~ir~~f~K~gg~l~~~gsv~~~Fe~--kG~i~~~ 143 (238)
T TIGR01033 92 PGGVAIIVECLTDNKNRTASEVRSAFNKNGGSLGEPGSVSYLFSR--KGVIEVP 143 (238)
T ss_pred CCceEEEEEEecCCHHhHHHHHHHHHHHcCCeeCCCCceeeeeec--ceEEEEC
Confidence 34456666666554 35689999999987522 35555553 4555553
No 405
>PF08152 GUCT: GUCT (NUC152) domain; InterPro: IPR012562 This is the C-terminal domain found in the RNA helicase II / Gu protein family [].; GO: 0003723 RNA binding, 0004386 helicase activity, 0005524 ATP binding, 0005634 nucleus; PDB: 2E29_A.
Probab=20.72 E-value=36 Score=24.15 Aligned_cols=39 Identities=15% Similarity=0.042 Sum_probs=21.3
Q ss_pred HHHHHHHH--hCCCeeEEEEeeCCCcEEEEEEcC-HHHHHHHHHh
Q psy2347 24 EDLELEFE--KFGKLNSVWVAFNPPGFAFVEFSN-QIDAEAACDS 65 (207)
Q Consensus 24 ~~L~~~F~--~~G~V~~v~i~~d~rG~afV~F~~-~e~A~~Al~~ 65 (207)
..|++.+. ..-.|..+.+.+|+.+.+| += .+.++..++.
T Consensus 36 ~~L~~~L~e~~~~~v~~m~l~~d~~GavF---DvP~~~~~~~~~~ 77 (97)
T PF08152_consen 36 RILRRQLSEEIADKVKGMTLLKDKMGAVF---DVPSEIAEEFLAK 77 (97)
T ss_dssp HHHHHHS-HHHHTT-EEEEE-TTSSEEEE---EEEHHHHHHHHHH
T ss_pred HHHHHhcCHHHHHhhCcEEEecCCCEEEE---EChHHHHHHHHHh
Confidence 45666554 3557999999998766554 42 3344444443
No 406
>cd00001 PTS_IIB_man PTS_IIB, PTS system, Mannose/sorbose specific IIB subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIB PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. The active site histidine receives a phosphate group from the IIA subunit and transfers it to the substrate.
Probab=20.70 E-value=3.3e+02 Score=20.75 Aligned_cols=19 Identities=21% Similarity=0.230 Sum_probs=16.7
Q ss_pred CcEEEEEEcCHHHHHHHHH
Q psy2347 46 PGFAFVEFSNQIDAEAACD 64 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~ 64 (207)
+.-.||.|+++++|.++++
T Consensus 74 ~~~v~il~k~~~~~~~l~~ 92 (151)
T cd00001 74 KQRVFLLFKNPQDVLRLVE 92 (151)
T ss_pred CceEEEEECCHHHHHHHHH
Confidence 3468999999999999988
No 407
>PLN02968 Probable N-acetyl-gamma-glutamyl-phosphate reductase
Probab=20.70 E-value=2.1e+02 Score=25.43 Aligned_cols=31 Identities=29% Similarity=0.341 Sum_probs=21.9
Q ss_pred EEEEcCCCcCCcHHHHHHHHHh-CCCeeEEEEe
Q psy2347 11 RVYVGGLTETVKKEDLELEFEK-FGKLNSVWVA 42 (207)
Q Consensus 11 ~lfV~nLp~~~te~~L~~~F~~-~G~V~~v~i~ 42 (207)
+|+| .|...++.++|+++|++ |-.+..|++.
T Consensus 280 tv~v-~~~~~~~~~~v~~~~~~~y~~~~fV~~~ 311 (381)
T PLN02968 280 TVYV-HYAPGVTAEDLHQHLKERYEGEEFVKVL 311 (381)
T ss_pred EEEE-EeCCCCCHHHHHHHHHHhCCCCCEEEeC
Confidence 4444 24567899999999998 6666666663
No 408
>cd01611 GABARAP Ubiquitin domain of GABA-receptor-associated protein. GABARAP (GABA-receptor-associated protein) belongs ot a large family of proteins that mediate intracellular membrane trafficking and/or fusion. GABARAP binds not only to GABA, type A but also to tubulin, gephrin, and ULK1. Orthologues of GABARAP include Gate-16 (golgi-associated ATPase enhancer), LC3 (microtubule-associated protein light chain 3), and ATG8 (autophagy protein 8). ATG8 is a ubiquitin-like protein that is conjugated to the membrane phospholipid, phosphatidylethanolamine as part of a ubiquitin-like conjugation system essential for autophagosome-formation.
Probab=20.70 E-value=2.1e+02 Score=20.83 Aligned_cols=17 Identities=6% Similarity=0.026 Sum_probs=12.6
Q ss_pred CCcCCcHHHHHHHHHhC
Q psy2347 17 LTETVKKEDLELEFEKF 33 (207)
Q Consensus 17 Lp~~~te~~L~~~F~~~ 33 (207)
+|.++|-.++..++.+.
T Consensus 47 Vp~~~tv~~f~~~irk~ 63 (112)
T cd01611 47 VPSDLTVGQFVYIIRKR 63 (112)
T ss_pred ecCCCCHHHHHHHHHHH
Confidence 67888887777777654
No 409
>PRK00642 inorganic pyrophosphatase; Provisional
Probab=20.67 E-value=93 Score=25.32 Aligned_cols=42 Identities=12% Similarity=0.071 Sum_probs=27.6
Q ss_pred cHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 22 KKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 22 te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
.-++|+++|..|=..+. +..+.+.+..+.+.+.|++.|++.+
T Consensus 156 ~l~~I~~fF~~YK~leg----k~~k~~~~~g~~~~~~A~~vI~~~~ 197 (205)
T PRK00642 156 LLDRLQHYFLTYKATPG----ELIKGVEIVGIYGKEEAQKVIQLAH 197 (205)
T ss_pred HHHHHHHHHHHHcCccc----CCCCeEEECCCcCHHHHHHHHHHHH
Confidence 35678889998854321 0013366777888999988887643
No 410
>KOG4357|consensus
Probab=20.64 E-value=3.8e+02 Score=20.19 Aligned_cols=25 Identities=28% Similarity=0.178 Sum_probs=19.9
Q ss_pred EEEEEEcCHHHHHHHHHhcCCceeC
Q psy2347 48 FAFVEFSNQIDAEAACDSMNDQDLM 72 (207)
Q Consensus 48 ~afV~F~~~e~A~~Al~~l~g~~i~ 72 (207)
-|+..|.+-+.|-.|...|-+++++
T Consensus 115 raifm~kdge~a~e~k~fll~qd~~ 139 (164)
T KOG4357|consen 115 RAIFMFKDGEQAFEAKDFLLGQDFC 139 (164)
T ss_pred eEEEEEeChhHHHHHHHHhhccchh
Confidence 5788899999998888877776654
No 411
>COG1308 EGD2 Transcription factor homologous to NACalpha-BTF3 [Transcription]
Probab=20.63 E-value=2.1e+02 Score=21.32 Aligned_cols=14 Identities=14% Similarity=0.128 Sum_probs=10.6
Q ss_pred CHHHHHHHHHhcCC
Q psy2347 55 NQIDAEAACDSMND 68 (207)
Q Consensus 55 ~~e~A~~Al~~l~g 68 (207)
+.++|.+||+..++
T Consensus 98 sreeA~kAL~e~~G 111 (122)
T COG1308 98 SREEAIKALEEAGG 111 (122)
T ss_pred CHHHHHHHHHHcCC
Confidence 56888899887554
No 412
>cd01778 RASSF1_RA Ubiquitin-like domain of RASSF1 tumour supproessor protein. RASSF1 (also known as RASSF3 and NORE1) is a tumour suppressor protein with a C-terminal Ras-associating (RA) domain that binds Ras. RASSF1 also binds the proapoptotic protein kinase MST1 and is thus thought to regulate the proapoptotic signalling pathway. RASSF1 also associates with microtubule-associated proteins like MAP1B and regulates tubulin polymerization. RASSF1 also binds CDC20 and regulates mitosis by inhibiting the anaphase-promoting complex and preventing degradation of cyclin A and cyclin B until the spindle checkpoint becomes fully operational.
Probab=20.59 E-value=2.5e+02 Score=20.03 Aligned_cols=38 Identities=29% Similarity=0.427 Sum_probs=23.2
Q ss_pred CCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEE
Q psy2347 7 ERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVE 52 (207)
Q Consensus 7 ~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~ 52 (207)
...+.|.|. +..++.+.|+.++.+|- |..+++.||.-+
T Consensus 16 ~s~k~v~Is--S~tTt~eVI~~LL~KF~------v~~nP~kFALYe 53 (96)
T cd01778 16 DTAKHLHIS--SKTTVREVIEALLKKFL------VVDNPRKFALFE 53 (96)
T ss_pred CceeEEEEe--cCCcHHHHHHHHHHhhe------eccCCcceEEEE
Confidence 345566664 34456777888888884 233355677553
No 413
>PHA02898 virion envelope protein; Provisional
Probab=20.57 E-value=18 Score=25.32 Aligned_cols=19 Identities=32% Similarity=0.347 Sum_probs=11.6
Q ss_pred cEEEEEEcCHH-HHHHHHHh
Q psy2347 47 GFAFVEFSNQI-DAEAACDS 65 (207)
Q Consensus 47 G~afV~F~~~e-~A~~Al~~ 65 (207)
-||||+|...+ .++.++++
T Consensus 27 IfAfidfSK~~~~~~~~wRa 46 (92)
T PHA02898 27 ICAYIELSKSEKPADSALRS 46 (92)
T ss_pred HHheehhhcCCCcchhHHHH
Confidence 48999997544 22444443
No 414
>PF14844 PH_BEACH: PH domain associated with Beige/BEACH; PDB: 1MI1_B 1T77_C.
Probab=20.49 E-value=1.9e+02 Score=20.21 Aligned_cols=22 Identities=9% Similarity=0.255 Sum_probs=16.3
Q ss_pred CCcEEEEEEcCHHHHHHHHHhc
Q psy2347 45 PPGFAFVEFSNQIDAEAACDSM 66 (207)
Q Consensus 45 ~rG~afV~F~~~e~A~~Al~~l 66 (207)
..--.|+.|++.++.++++++|
T Consensus 84 dg~s~f~~F~~~~~R~~v~~~l 105 (106)
T PF14844_consen 84 DGRSYFFNFESKKERDEVYNKL 105 (106)
T ss_dssp TS-EEEEE-SSHHHHHHHHCCS
T ss_pred CCcEEEEEcCCHHHHHHHHHhh
Confidence 3457899999999999988765
No 415
>PF12623 Hen1_L: RNA repair, ligase-Pnkp-associating, region of Hen1; InterPro: IPR024740 This entry represents the N-terminal domain of the bacterial Hen1 protein, which is a 3' terminal RNA ribose 2'-O-methyltransferase that is involved in bacterial RNA repair []. Hen1 forms a heterotetramer with Pnkp. The heterotetramer has been shown to repair transfer RNAs cleaved by ribotoxins in vitro []. The N-terminal domain of Hen1 contributes to the ligase activity of the heterotetramer.
Probab=20.49 E-value=2.1e+02 Score=23.90 Aligned_cols=56 Identities=14% Similarity=0.128 Sum_probs=39.7
Q ss_pred cEEEEcCCCcCCcHHHHHHHHHhCCCeeEE-EEeeC-------CCcEEEEEEcCHHHHHHHHHh
Q psy2347 10 TRVYVGGLTETVKKEDLELEFEKFGKLNSV-WVAFN-------PPGFAFVEFSNQIDAEAACDS 65 (207)
Q Consensus 10 ~~lfV~nLp~~~te~~L~~~F~~~G~V~~v-~i~~d-------~rG~afV~F~~~e~A~~Al~~ 65 (207)
-+|.|.-||..-.++.++++|+..|--+.+ .+..| ...|..|+.+-....+.|+..
T Consensus 119 L~v~~p~lp~rGg~~l~~rLFePLGw~V~a~~~~Ld~~fP~WG~S~y~~l~L~g~~rl~daL~H 182 (245)
T PF12623_consen 119 LEVRLPALPCRGGEELVRRLFEPLGWTVTAEPVPLDEQFPEWGDSRYVDLTLTGTVRLADALNH 182 (245)
T ss_pred eEEEeeeeecCCcHHHHHHhhcCcCceEEeEeccCCccCccccCCcceEEEEeeeEEHHHHHhh
Confidence 467777888888999999999998864443 34444 245777777776666666654
No 416
>TIGR01499 folC folylpolyglutamate synthase/dihydrofolate synthase. A mutation study of the FolC gene of E. coli suggests that both activitities belong to the same active site. Because some examples are monofunctional (and these cannot be separated phylogenetically), the model is treated as subfamily, not equivalog.
Probab=20.47 E-value=5e+02 Score=22.74 Aligned_cols=47 Identities=11% Similarity=0.003 Sum_probs=28.8
Q ss_pred CcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEE-cCHHHHHHHHHhcC
Q psy2347 21 VKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEF-SNQIDAEAACDSMN 67 (207)
Q Consensus 21 ~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F-~~~e~A~~Al~~l~ 67 (207)
+++++|++-++.+...-+++++.......+|.+ .++++++++++.|.
T Consensus 252 i~~~~i~~~L~~~~~pGR~e~i~~~~~~viiD~AHNp~a~~~~l~~l~ 299 (397)
T TIGR01499 252 LSEEAIRKGLANTIWPGRLEILSEDNPNILLDGAHNPHSAEALAEWFK 299 (397)
T ss_pred CCHHHHHHHHHhCCCCceEEEEecCCCEEEEECCCCHHHHHHHHHHHH
Confidence 456777777777643334454432112345666 37888888888875
No 417
>COG0045 SucC Succinyl-CoA synthetase, beta subunit [Energy production and conversion]
Probab=20.44 E-value=5.5e+02 Score=23.09 Aligned_cols=63 Identities=17% Similarity=0.180 Sum_probs=39.9
Q ss_pred CcHHHHHHHHHhCC--C-eeEEEEeeCCCcEE-EEE-EcCHHHHHHHHHhcCC----ceeCCcEEEEEEeec
Q psy2347 21 VKKEDLELEFEKFG--K-LNSVWVAFNPPGFA-FVE-FSNQIDAEAACDSMND----QDLMGSKLRVEISRG 83 (207)
Q Consensus 21 ~te~~L~~~F~~~G--~-V~~v~i~~d~rG~a-fV~-F~~~e~A~~Al~~l~g----~~i~g~~l~V~~a~~ 83 (207)
++.++++++-.+++ + |.+.+|..--|+-| =|. -+|.++|.++.++|-+ ..+.++.++.-+..+
T Consensus 26 ~s~eea~~~a~~lg~~~~VvKaQV~aGGRGKaGGVk~~~s~~ea~~~a~~~lg~~~q~~~~G~~v~~vlvee 97 (387)
T COG0045 26 TSPEEAEEAAKELGGGPVVVKAQVHAGGRGKAGGVKLAKSPEEAKEAAEEILGKNYQTDIKGEPVNKVLVEE 97 (387)
T ss_pred eCHHHHHHHHHHhCCCcEEEEeeeeecCccccCceEEeCCHHHHHHHHHHHhCcccccCcCCceeeEEEEEe
Confidence 57788888888886 2 44555554433322 333 3578888888887777 667777665555443
No 418
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=20.34 E-value=2e+02 Score=22.56 Aligned_cols=19 Identities=21% Similarity=0.284 Sum_probs=16.5
Q ss_pred CCcHHHHHHHHHhCCCeeE
Q psy2347 20 TVKKEDLELEFEKFGKLNS 38 (207)
Q Consensus 20 ~~te~~L~~~F~~~G~V~~ 38 (207)
..+.+.|++++.+||+|.-
T Consensus 95 ~~t~e~~~~LL~~yGPLwv 113 (166)
T PF12385_consen 95 SYTAEGLANLLREYGPLWV 113 (166)
T ss_pred ccCHHHHHHHHHHcCCeEE
Confidence 6788999999999999753
No 419
>PRK00343 ipk 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase; Provisional
Probab=20.23 E-value=2.4e+02 Score=23.63 Aligned_cols=39 Identities=13% Similarity=0.149 Sum_probs=24.7
Q ss_pred HHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcC
Q psy2347 25 DLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMN 67 (207)
Q Consensus 25 ~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~ 67 (207)
++++.+.+++. + +|.-+=..+|..|+++++|+++.+.|.
T Consensus 222 ~~~~~l~~~ga---~-~mSGSG~tvF~l~~~~~~a~~~~~~l~ 260 (271)
T PRK00343 222 QALSWLLEYAP---S-RMTGTGACVFAEFDTEAEAEQVLAQLP 260 (271)
T ss_pred HHHHHHHhCCC---e-EEeccccceEEEcCCHHHHHHHHHHhh
Confidence 34444445554 2 233344567888999999998888775
No 420
>KOG2916|consensus
Probab=20.22 E-value=2.7e+02 Score=23.83 Aligned_cols=46 Identities=7% Similarity=0.057 Sum_probs=26.7
Q ss_pred eEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeec
Q psy2347 37 NSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRG 83 (207)
Q Consensus 37 ~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~ 83 (207)
..++|.-|-. +..-.|+-.++.++|+++-....-...+|+|++..|
T Consensus 179 q~vKIRADIE-v~Cf~yeGI~aIK~alk~a~~~~te~~piki~LIAp 224 (304)
T KOG2916|consen 179 QTVKIRADIE-VSCFGYEGIDAIKAALKAALNLSTEECPIKIKLIAP 224 (304)
T ss_pred cceEEEeeeE-EEeccccCHHHHHHHHHHHHhCCcccCceEEEEecC
Confidence 4445544411 222356777888888876444444566777777654
No 421
>COG2242 CobL Precorrin-6B methylase 2 [Coenzyme metabolism]
Probab=20.18 E-value=4.6e+02 Score=21.01 Aligned_cols=66 Identities=18% Similarity=0.160 Sum_probs=33.6
Q ss_pred CCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEeeCCCcEEEEEEcCHHHHHHHHHhcCCceeCCcEEEEEEeecCCC
Q psy2347 8 RGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAFNPPGFAFVEFSNQIDAEAACDSMNDQDLMGSKLRVEISRGRGR 86 (207)
Q Consensus 8 ~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~d~rG~afV~F~~~e~A~~Al~~l~g~~i~g~~l~V~~a~~k~~ 86 (207)
+...|||+.= .+.++|.+.+..+=. +-+--.+.+.+.|.+.+|++.|...-+. ..++|.+++.++-
T Consensus 102 ~~daiFIGGg---~~i~~ile~~~~~l~---------~ggrlV~naitlE~~~~a~~~~~~~g~~-ei~~v~is~~~~l 167 (187)
T COG2242 102 SPDAIFIGGG---GNIEEILEAAWERLK---------PGGRLVANAITLETLAKALEALEQLGGR-EIVQVQISRGKPL 167 (187)
T ss_pred CCCEEEECCC---CCHHHHHHHHHHHcC---------cCCeEEEEeecHHHHHHHHHHHHHcCCc-eEEEEEeecceec
Confidence 5678999874 333444444333210 1122234445556666666655544444 6667777665443
No 422
>KOG0677|consensus
Probab=20.17 E-value=4.2e+02 Score=22.81 Aligned_cols=30 Identities=17% Similarity=0.360 Sum_probs=21.5
Q ss_pred CCCCCcEEEEcCCCcC---CcHHHHHHHHHhCC
Q psy2347 5 MMERGTRVYVGGLTET---VKKEDLELEFEKFG 34 (207)
Q Consensus 5 ~~~~~~~lfV~nLp~~---~te~~L~~~F~~~G 34 (207)
..+..++|.+..-|.+ -.|..++-+|++|+
T Consensus 97 idp~~~KiLLTePPmNP~kNREKm~evMFEkY~ 129 (389)
T KOG0677|consen 97 IDPTNCKILLTEPPMNPTKNREKMIEVMFEKYG 129 (389)
T ss_pred CCCccCeEEeeCCCCCccccHHHHHHHHHHHcC
Confidence 3456788988876544 34677888999985
No 423
>COG1710 Uncharacterized protein conserved in archaea [Function unknown]
Probab=20.16 E-value=2.4e+02 Score=21.04 Aligned_cols=38 Identities=13% Similarity=0.146 Sum_probs=23.5
Q ss_pred CCCCCCCcEEEEcCCCcCCcHHHHHHHHHhCCCeeEEEEee
Q psy2347 3 DGMMERGTRVYVGGLTETVKKEDLELEFEKFGKLNSVWVAF 43 (207)
Q Consensus 3 ~~~~~~~~~lfV~nLp~~~te~~L~~~F~~~G~V~~v~i~~ 43 (207)
+...+..+.||| ....|.+-|..+++..-+|+.|-++.
T Consensus 22 e~lnE~~~eiyi---nlr~tr~v~vallens~~vK~Ig~P~ 59 (139)
T COG1710 22 EDLNEDVTEIYI---NLRPTREVIVALLENSPNVKVIGCPP 59 (139)
T ss_pred hhhccccceEEE---eecccHHHHHHHHhcCCCcceecCCc
Confidence 344556677776 23557777777777555666655544
No 424
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=20.10 E-value=96 Score=18.50 Aligned_cols=19 Identities=16% Similarity=0.408 Sum_probs=15.5
Q ss_pred cCCCcCCcHHHHHHHHHhC
Q psy2347 15 GGLTETVKKEDLELEFEKF 33 (207)
Q Consensus 15 ~nLp~~~te~~L~~~F~~~ 33 (207)
-+|+..++.++|++.|...
T Consensus 6 Lgl~~~~~~~~ik~~y~~l 24 (55)
T cd06257 6 LGVPPDASDEEIKKAYRKL 24 (55)
T ss_pred cCCCCCCCHHHHHHHHHHH
Confidence 4678899999999888754
No 425
>PTZ00380 microtubule-associated protein (MAP); Provisional
Probab=20.08 E-value=1.7e+02 Score=21.77 Aligned_cols=16 Identities=19% Similarity=0.295 Sum_probs=8.7
Q ss_pred CCcCCcHHHHHHHHHh
Q psy2347 17 LTETVKKEDLELEFEK 32 (207)
Q Consensus 17 Lp~~~te~~L~~~F~~ 32 (207)
+|.++|-.++..+..+
T Consensus 47 VP~d~tV~qF~~iIRk 62 (121)
T PTZ00380 47 LPRDATVAELEAAVRQ 62 (121)
T ss_pred cCCCCcHHHHHHHHHH
Confidence 3666666555555443
No 426
>PF01782 RimM: RimM N-terminal domain; InterPro: IPR002676 The RimM protein is essential for efficient processing of 16S rRNA []. The RimM protein was shown to have affinity for free ribosomal 30S subunits but not for 30S subunits in the 70S ribosomes [].; GO: 0006364 rRNA processing; PDB: 2QGG_A 3A1P_C 2DOG_A 2DYI_A 3H9N_A 2F1L_A.
Probab=20.06 E-value=1.8e+02 Score=19.40 Aligned_cols=23 Identities=26% Similarity=0.159 Sum_probs=15.1
Q ss_pred CcEEEEEEcCHHHHHHHHHhcCCc
Q psy2347 46 PGFAFVEFSNQIDAEAACDSMNDQ 69 (207)
Q Consensus 46 rG~afV~F~~~e~A~~Al~~l~g~ 69 (207)
+...+|.|+..++.++|.+ |.+.
T Consensus 54 ~~~~i~~~~gi~~r~~Ae~-l~g~ 76 (84)
T PF01782_consen 54 GKSLIVKFEGIDDREAAEA-LRGC 76 (84)
T ss_dssp TTEEEEEETT--SHHHHHT-TTT-
T ss_pred CCEEEEEEcCCCCHHHHHh-hCCC
Confidence 5689999998888877764 4443
No 427
>KOG3373|consensus
Probab=20.06 E-value=2e+02 Score=22.60 Aligned_cols=37 Identities=14% Similarity=0.104 Sum_probs=22.4
Q ss_pred hCCCeeEEEEeeC----CCcEEEEEEcCHHHHHHHHHhcCC
Q psy2347 32 KFGKLNSVWVAFN----PPGFAFVEFSNQIDAEAACDSMND 68 (207)
Q Consensus 32 ~~G~V~~v~i~~d----~rG~afV~F~~~e~A~~Al~~l~g 68 (207)
.+++|+.|+++.- +++-.|-..++.++|..+..-+++
T Consensus 78 ~LGdvv~veLPe~Gt~vskgds~gavESVKaaSeIysp~sG 118 (172)
T KOG3373|consen 78 HLGDVVYVELPEVGTEVSKGDSFGAVESVKAASEIYSPVSG 118 (172)
T ss_pred hcCceEEEEcCCCCCccccCcceeeeeehhhhhhhhCcCCc
Confidence 4788888887654 455555555555666655544343
Done!