Query psy7051
Match_columns 158
No_of_seqs 238 out of 2416
Neff 9.4
Searched_HMMs 46136
Date Fri Aug 16 16:53:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7051.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7051hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4207|consensus 100.0 1.8E-27 3.8E-32 166.0 12.8 95 1-95 1-95 (256)
2 KOG0107|consensus 99.9 9.6E-24 2.1E-28 143.8 14.1 76 13-93 10-85 (195)
3 PLN03134 glycine-rich RNA-bind 99.9 2.2E-22 4.9E-27 137.3 14.6 84 12-95 33-116 (144)
4 KOG0113|consensus 99.9 5.8E-21 1.3E-25 139.8 14.2 87 5-91 93-179 (335)
5 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.8 5.5E-19 1.2E-23 136.7 12.0 79 14-92 270-348 (352)
6 TIGR01659 sex-lethal sex-letha 99.8 4.8E-19 1.1E-23 136.4 11.2 82 10-91 104-185 (346)
7 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.8 4.4E-19 9.5E-24 137.2 10.4 81 12-92 2-82 (352)
8 TIGR01659 sex-lethal sex-letha 99.8 1.9E-18 4.2E-23 133.1 13.5 90 4-93 184-275 (346)
9 PF00076 RRM_1: RNA recognitio 99.8 9.5E-19 2.1E-23 104.9 8.8 70 16-86 1-70 (70)
10 KOG0130|consensus 99.8 1.4E-18 3.1E-23 114.0 9.0 84 11-94 70-153 (170)
11 KOG0105|consensus 99.7 1.5E-17 3.2E-22 114.8 10.3 83 13-98 6-88 (241)
12 KOG0122|consensus 99.7 8.3E-18 1.8E-22 120.3 9.3 82 11-92 187-268 (270)
13 KOG0149|consensus 99.7 6.1E-18 1.3E-22 120.5 8.6 80 12-92 11-90 (247)
14 KOG0121|consensus 99.7 1.2E-17 2.6E-22 108.8 7.6 80 12-91 35-114 (153)
15 PF14259 RRM_6: RNA recognitio 99.7 3.2E-17 7E-22 98.6 9.1 70 16-86 1-70 (70)
16 TIGR01642 U2AF_lg U2 snRNP aux 99.7 7.4E-17 1.6E-21 130.4 12.1 84 11-94 293-376 (509)
17 PLN03120 nucleic acid binding 99.7 7.9E-17 1.7E-21 117.9 10.8 76 13-92 4-79 (260)
18 TIGR01648 hnRNP-R-Q heterogene 99.7 8.4E-17 1.8E-21 130.3 10.3 87 3-90 43-135 (578)
19 TIGR01645 half-pint poly-U bin 99.7 1.3E-16 2.8E-21 129.6 10.6 80 13-92 204-283 (612)
20 TIGR01622 SF-CC1 splicing fact 99.7 1.4E-16 3E-21 127.3 10.4 79 13-91 186-264 (457)
21 TIGR01645 half-pint poly-U bin 99.7 1.6E-16 3.4E-21 129.1 9.5 79 12-90 106-184 (612)
22 smart00362 RRM_2 RNA recogniti 99.7 4E-16 8.6E-21 93.0 8.8 72 15-88 1-72 (72)
23 TIGR01628 PABP-1234 polyadenyl 99.7 2.4E-16 5.2E-21 129.0 10.3 77 15-91 2-78 (562)
24 KOG0117|consensus 99.7 3.8E-16 8.3E-21 120.0 10.2 89 3-91 68-162 (506)
25 KOG0111|consensus 99.7 5.2E-17 1.1E-21 114.9 4.6 82 13-94 10-91 (298)
26 PLN03121 nucleic acid binding 99.7 8.3E-16 1.8E-20 111.0 10.1 76 12-91 4-79 (243)
27 TIGR01622 SF-CC1 splicing fact 99.7 6.9E-16 1.5E-20 123.3 10.8 80 12-92 88-167 (457)
28 PLN03213 repressor of silencin 99.7 6.1E-16 1.3E-20 120.6 9.7 76 12-91 9-86 (759)
29 KOG0125|consensus 99.7 4.1E-16 8.8E-21 115.9 8.0 80 11-92 94-173 (376)
30 KOG0131|consensus 99.7 3.8E-16 8.2E-21 107.5 6.9 82 10-91 6-87 (203)
31 KOG0148|consensus 99.6 4.8E-16 1E-20 112.9 7.3 80 12-91 61-140 (321)
32 smart00360 RRM RNA recognition 99.6 1.5E-15 3.1E-20 90.2 8.2 71 18-88 1-71 (71)
33 TIGR01628 PABP-1234 polyadenyl 99.6 1.7E-15 3.7E-20 123.9 10.7 80 12-92 284-363 (562)
34 KOG0126|consensus 99.6 2.5E-17 5.4E-22 113.3 -0.3 79 13-91 35-113 (219)
35 KOG0415|consensus 99.6 4.8E-16 1E-20 116.7 6.3 81 11-91 237-317 (479)
36 KOG0114|consensus 99.6 3.2E-15 6.9E-20 94.1 9.0 77 12-91 17-93 (124)
37 KOG0148|consensus 99.6 1.7E-15 3.7E-20 110.1 8.9 78 11-94 162-239 (321)
38 TIGR01648 hnRNP-R-Q heterogene 99.6 3.9E-15 8.4E-20 120.8 11.7 74 12-93 232-307 (578)
39 KOG0145|consensus 99.6 1.7E-15 3.8E-20 109.7 8.3 84 10-93 38-121 (360)
40 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.6 3.5E-15 7.6E-20 120.0 11.0 78 11-93 273-351 (481)
41 cd00590 RRM RRM (RNA recogniti 99.6 6.6E-15 1.4E-19 88.1 9.6 74 15-89 1-74 (74)
42 COG0724 RNA-binding proteins ( 99.6 4.7E-15 1E-19 109.5 9.7 79 13-91 115-193 (306)
43 KOG0108|consensus 99.6 3.7E-15 8.1E-20 116.9 7.9 82 14-95 19-100 (435)
44 KOG0145|consensus 99.6 1.2E-14 2.6E-19 105.3 9.8 80 12-91 277-356 (360)
45 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.6 9.7E-15 2.1E-19 117.5 9.9 74 13-92 2-77 (481)
46 KOG0144|consensus 99.6 1.1E-14 2.4E-19 111.7 8.4 85 11-95 32-119 (510)
47 KOG0109|consensus 99.5 7.7E-15 1.7E-19 107.7 5.9 72 14-93 3-74 (346)
48 PF13893 RRM_5: RNA recognitio 99.5 5.6E-14 1.2E-18 80.8 8.0 56 30-90 1-56 (56)
49 KOG0127|consensus 99.5 4.7E-14 1E-18 111.1 9.2 82 10-91 289-376 (678)
50 KOG0117|consensus 99.5 3.6E-14 7.8E-19 109.3 7.7 72 13-92 259-330 (506)
51 KOG0127|consensus 99.5 7.3E-14 1.6E-18 110.0 9.1 78 13-91 117-194 (678)
52 smart00361 RRM_1 RNA recogniti 99.5 1E-13 2.2E-18 83.4 7.7 61 27-87 2-69 (70)
53 KOG0147|consensus 99.5 2.3E-14 4.9E-19 112.7 6.1 78 14-91 279-356 (549)
54 KOG4212|consensus 99.5 1.1E-13 2.4E-18 106.6 9.1 78 14-92 45-123 (608)
55 KOG0144|consensus 99.5 3.3E-14 7.3E-19 109.1 5.7 85 12-97 123-210 (510)
56 KOG0109|consensus 99.4 1.8E-13 3.9E-18 100.5 6.1 75 11-93 76-150 (346)
57 KOG0124|consensus 99.4 9.5E-14 2.1E-18 105.0 4.7 77 13-89 113-189 (544)
58 KOG0131|consensus 99.4 9.3E-13 2E-17 90.9 7.6 85 9-93 92-177 (203)
59 TIGR01642 U2AF_lg U2 snRNP aux 99.4 1.2E-12 2.5E-17 106.1 9.1 73 12-91 174-258 (509)
60 KOG0146|consensus 99.4 1E-12 2.2E-17 95.8 5.8 82 10-91 282-363 (371)
61 KOG0153|consensus 99.3 4.3E-12 9.2E-17 95.3 7.8 80 7-92 222-302 (377)
62 KOG4206|consensus 99.3 7.4E-12 1.6E-16 89.2 8.4 80 11-93 7-90 (221)
63 KOG4208|consensus 99.3 1.5E-11 3.3E-16 86.4 8.3 81 12-92 48-129 (214)
64 KOG0132|consensus 99.3 9.4E-12 2E-16 101.6 8.3 77 14-96 422-498 (894)
65 KOG4661|consensus 99.3 8.4E-12 1.8E-16 99.2 7.3 78 14-91 406-483 (940)
66 KOG0123|consensus 99.3 1.4E-11 3.1E-16 95.8 8.5 74 16-92 79-152 (369)
67 KOG0116|consensus 99.2 1.5E-10 3.3E-15 90.7 11.4 78 14-92 289-366 (419)
68 KOG4212|consensus 99.2 3.6E-11 7.9E-16 92.9 7.7 76 10-90 533-608 (608)
69 KOG0106|consensus 99.2 1.6E-11 3.5E-16 88.0 5.3 71 14-92 2-72 (216)
70 KOG0533|consensus 99.2 8.7E-11 1.9E-15 85.9 8.9 79 12-91 82-160 (243)
71 KOG4209|consensus 99.2 1.2E-10 2.6E-15 85.1 9.4 84 12-96 100-183 (231)
72 KOG0146|consensus 99.2 2.3E-11 5E-16 88.8 5.3 82 13-95 19-103 (371)
73 KOG0124|consensus 99.2 5.3E-11 1.2E-15 90.3 7.3 78 13-90 210-287 (544)
74 KOG0110|consensus 99.2 7.7E-11 1.7E-15 95.5 8.0 78 14-91 516-596 (725)
75 KOG4454|consensus 99.2 1.1E-11 2.4E-16 88.0 2.8 82 7-90 3-84 (267)
76 KOG0123|consensus 99.2 8.7E-11 1.9E-15 91.4 7.7 72 14-91 2-73 (369)
77 PF04059 RRM_2: RNA recognitio 99.2 4.6E-10 1E-14 71.1 9.0 78 14-91 2-85 (97)
78 KOG0110|consensus 99.1 5.2E-11 1.1E-15 96.5 5.3 82 12-93 612-693 (725)
79 KOG4660|consensus 99.1 5.8E-11 1.3E-15 94.0 5.0 72 10-86 72-143 (549)
80 KOG4205|consensus 99.1 8.7E-11 1.9E-15 89.0 5.3 81 12-93 5-85 (311)
81 KOG4205|consensus 99.1 1.7E-10 3.6E-15 87.4 5.5 79 13-92 97-175 (311)
82 KOG1457|consensus 99.1 2.2E-09 4.8E-14 76.6 10.4 83 13-95 34-120 (284)
83 KOG0106|consensus 99.0 8.8E-10 1.9E-14 79.1 7.6 67 13-87 99-165 (216)
84 KOG1548|consensus 99.0 1.4E-09 3E-14 82.0 8.4 81 13-94 134-222 (382)
85 KOG4211|consensus 99.0 3.4E-09 7.3E-14 83.1 9.5 78 11-92 8-85 (510)
86 KOG1995|consensus 98.9 2.6E-09 5.7E-14 80.9 5.2 87 5-91 58-152 (351)
87 KOG0120|consensus 98.9 2.8E-09 6.1E-14 84.9 4.7 83 12-94 288-370 (500)
88 KOG0151|consensus 98.8 8.8E-09 1.9E-13 83.9 6.8 79 13-91 174-255 (877)
89 KOG0226|consensus 98.8 5.1E-09 1.1E-13 76.0 4.9 81 11-91 188-268 (290)
90 KOG0105|consensus 98.8 2.3E-07 5.1E-12 64.7 11.9 63 12-81 114-176 (241)
91 PF11608 Limkain-b1: Limkain b 98.7 1E-07 2.2E-12 58.2 7.8 69 14-92 3-76 (90)
92 PF08777 RRM_3: RNA binding mo 98.6 1.3E-07 2.9E-12 61.1 6.7 70 14-89 2-76 (105)
93 KOG0147|consensus 98.6 1.2E-08 2.6E-13 81.0 2.0 79 13-92 179-257 (549)
94 KOG4211|consensus 98.6 1.5E-07 3.2E-12 74.1 7.7 78 11-90 101-179 (510)
95 KOG4206|consensus 98.6 2.8E-07 6E-12 66.0 8.1 77 10-91 143-220 (221)
96 KOG1190|consensus 98.6 2.2E-07 4.8E-12 71.7 8.1 76 13-93 297-373 (492)
97 KOG2202|consensus 98.5 6E-07 1.3E-11 65.5 7.5 62 29-91 84-146 (260)
98 KOG1457|consensus 98.4 2.3E-07 5E-12 66.5 4.0 64 13-80 210-273 (284)
99 KOG4849|consensus 98.4 2.3E-07 4.9E-12 70.5 4.0 75 14-88 81-157 (498)
100 KOG4210|consensus 98.4 3.5E-07 7.7E-12 69.0 4.7 80 13-93 184-264 (285)
101 COG5175 MOT2 Transcriptional r 98.4 1.5E-06 3.2E-11 65.9 6.9 78 14-91 115-201 (480)
102 KOG0129|consensus 98.4 2.9E-06 6.4E-11 67.3 8.7 68 7-74 364-432 (520)
103 KOG1365|consensus 98.3 7.5E-07 1.6E-11 68.4 4.9 77 13-90 280-359 (508)
104 KOG1456|consensus 98.3 5.4E-06 1.2E-10 63.6 9.3 77 12-93 286-363 (494)
105 KOG4307|consensus 98.3 2.5E-06 5.3E-11 70.0 7.7 79 11-89 865-943 (944)
106 KOG2416|consensus 98.2 1.2E-06 2.5E-11 70.6 4.2 81 5-91 436-520 (718)
107 KOG2314|consensus 98.2 1E-05 2.2E-10 65.0 9.0 76 13-89 58-140 (698)
108 KOG1190|consensus 98.2 4.4E-06 9.5E-11 64.7 6.7 77 11-92 412-490 (492)
109 PF14605 Nup35_RRM_2: Nup53/35 98.2 5.9E-06 1.3E-10 46.6 5.4 52 14-72 2-53 (53)
110 KOG1548|consensus 98.2 9.8E-06 2.1E-10 61.6 7.3 76 12-91 264-350 (382)
111 KOG3152|consensus 98.1 3.6E-06 7.9E-11 61.4 3.2 72 13-84 74-157 (278)
112 KOG0120|consensus 98.0 2.5E-05 5.3E-10 62.8 7.3 62 30-91 426-490 (500)
113 PF05172 Nup35_RRM: Nup53/35/4 98.0 5.1E-05 1.1E-09 48.5 7.3 75 14-90 7-89 (100)
114 KOG4676|consensus 98.0 5.9E-07 1.3E-11 69.1 -2.3 64 13-81 151-214 (479)
115 KOG1365|consensus 97.9 1.4E-05 3E-10 61.7 4.5 76 11-88 159-238 (508)
116 KOG1456|consensus 97.9 6.7E-05 1.4E-09 57.8 8.1 77 12-93 119-199 (494)
117 KOG4676|consensus 97.9 1.7E-05 3.7E-10 61.2 4.9 75 14-89 8-85 (479)
118 PF08675 RNA_bind: RNA binding 97.9 0.00012 2.6E-09 44.8 7.2 55 14-77 10-64 (87)
119 PF08952 DUF1866: Domain of un 97.8 0.00015 3.3E-09 49.1 7.5 54 29-91 52-105 (146)
120 KOG0112|consensus 97.8 4.2E-05 9.1E-10 64.5 5.5 77 9-91 451-529 (975)
121 KOG0129|consensus 97.8 6.5E-05 1.4E-09 59.9 6.3 62 14-76 260-327 (520)
122 KOG1996|consensus 97.8 8.6E-05 1.9E-09 55.5 6.5 65 27-91 300-365 (378)
123 PF10309 DUF2414: Protein of u 97.8 0.00025 5.4E-09 41.1 6.8 54 14-75 6-62 (62)
124 KOG1855|consensus 97.8 3.5E-05 7.6E-10 60.1 4.1 67 12-78 230-309 (484)
125 KOG0128|consensus 97.5 6.9E-05 1.5E-09 62.9 2.3 79 13-92 736-814 (881)
126 PF03467 Smg4_UPF3: Smg-4/UPF3 97.4 0.00038 8.2E-09 49.1 4.8 83 13-95 7-100 (176)
127 KOG0128|consensus 97.3 1.4E-05 3.1E-10 66.8 -3.4 68 14-81 668-735 (881)
128 PF07576 BRAP2: BRCA1-associat 97.3 0.0044 9.6E-08 40.3 8.7 68 13-82 13-81 (110)
129 KOG4307|consensus 97.3 0.00027 5.8E-09 58.5 3.5 77 12-89 433-510 (944)
130 PF15023 DUF4523: Protein of u 97.2 0.0032 6.9E-08 42.5 7.5 74 9-90 82-159 (166)
131 KOG4660|consensus 97.2 0.0007 1.5E-08 54.6 5.0 55 37-91 413-471 (549)
132 PF03880 DbpA: DbpA RNA bindin 97.2 0.0056 1.2E-07 36.8 7.8 66 15-90 2-74 (74)
133 KOG0115|consensus 97.1 0.0008 1.7E-08 49.5 4.1 75 14-89 32-110 (275)
134 KOG2193|consensus 97.0 0.0008 1.7E-08 52.8 4.0 69 14-90 2-73 (584)
135 KOG2135|consensus 97.0 0.00054 1.2E-08 54.3 2.6 72 14-92 373-445 (526)
136 KOG2253|consensus 96.9 0.00083 1.8E-08 55.2 3.3 70 11-89 38-107 (668)
137 KOG0112|consensus 96.8 0.00027 5.9E-09 59.8 -0.2 77 13-90 372-448 (975)
138 PF04847 Calcipressin: Calcipr 96.7 0.0068 1.5E-07 43.0 6.3 60 26-91 8-69 (184)
139 KOG4285|consensus 96.6 0.016 3.4E-07 43.8 7.8 61 16-84 200-260 (350)
140 KOG2591|consensus 96.6 0.005 1.1E-07 50.0 5.5 71 12-89 174-248 (684)
141 KOG2068|consensus 96.6 0.00078 1.7E-08 51.3 0.8 78 14-91 78-161 (327)
142 KOG0804|consensus 96.3 0.023 4.9E-07 45.2 7.3 68 13-82 74-142 (493)
143 KOG4574|consensus 96.0 0.0054 1.2E-07 52.0 2.6 70 17-92 302-373 (1007)
144 KOG4210|consensus 95.8 0.008 1.7E-07 45.6 2.7 80 12-91 87-166 (285)
145 PF11767 SET_assoc: Histone ly 95.5 0.11 2.5E-06 30.5 6.3 55 24-87 11-65 (66)
146 PRK11634 ATP-dependent RNA hel 95.0 1.7 3.7E-05 36.9 14.2 70 13-92 486-562 (629)
147 KOG2318|consensus 94.4 0.4 8.6E-06 39.6 8.6 79 10-88 171-301 (650)
148 KOG2193|consensus 94.4 0.0039 8.4E-08 49.1 -2.5 74 14-91 81-155 (584)
149 KOG4207|consensus 94.3 0.41 8.8E-06 34.6 7.6 55 26-80 29-85 (256)
150 KOG4483|consensus 94.2 0.16 3.5E-06 40.0 5.9 58 9-73 387-445 (528)
151 KOG0835|consensus 93.6 0.092 2E-06 40.3 3.5 10 24-33 212-221 (367)
152 smart00596 PRE_C2HC PRE_C2HC d 92.9 0.25 5.4E-06 29.2 3.9 61 28-91 2-63 (69)
153 KOG4019|consensus 92.5 0.18 3.8E-06 35.5 3.4 73 14-92 11-89 (193)
154 PF07530 PRE_C2HC: Associated 92.1 0.55 1.2E-05 27.7 4.8 61 28-91 2-63 (68)
155 KOG4410|consensus 90.5 0.87 1.9E-05 34.5 5.4 46 14-65 331-377 (396)
156 KOG0107|consensus 88.2 1.8 3.8E-05 30.6 5.3 12 13-24 37-48 (195)
157 KOG1295|consensus 86.9 0.97 2.1E-05 35.5 3.8 69 12-80 6-77 (376)
158 KOG2295|consensus 85.9 0.12 2.5E-06 42.4 -1.7 73 13-85 231-303 (648)
159 PRK14548 50S ribosomal protein 85.3 6.1 0.00013 24.3 6.1 57 16-75 23-81 (84)
160 PF03468 XS: XS domain; Inter 84.5 3.4 7.3E-05 27.1 5.0 55 15-72 10-74 (116)
161 KOG4365|consensus 84.2 0.17 3.7E-06 40.4 -1.4 78 14-92 4-81 (572)
162 PF10567 Nab6_mRNP_bdg: RNA-re 84.0 3 6.5E-05 31.8 5.0 78 14-91 16-106 (309)
163 TIGR03636 L23_arch archaeal ri 81.4 9.5 0.00021 23.1 6.0 57 16-75 16-74 (77)
164 KOG4454|consensus 80.8 0.43 9.3E-06 34.8 -0.3 76 11-87 78-157 (267)
165 COG0724 RNA-binding proteins ( 80.1 3.1 6.7E-05 30.0 4.0 62 11-72 223-284 (306)
166 COG5638 Uncharacterized conser 77.9 20 0.00043 28.9 7.8 28 56-83 259-286 (622)
167 KOG4840|consensus 77.2 5.6 0.00012 29.5 4.4 76 11-91 35-116 (299)
168 KOG2891|consensus 76.2 1.2 2.6E-05 33.7 0.8 34 13-46 149-194 (445)
169 PF15513 DUF4651: Domain of un 75.9 9 0.00019 22.2 4.2 18 28-45 9-26 (62)
170 PRK11901 hypothetical protein; 74.4 9.3 0.0002 29.6 5.1 62 13-79 245-308 (327)
171 PRK08559 nusG transcription an 73.1 16 0.00035 25.0 5.8 34 40-78 36-69 (153)
172 KOG3580|consensus 70.0 21 0.00045 30.4 6.5 33 14-46 62-95 (1027)
173 PF02714 DUF221: Domain of unk 68.5 7.1 0.00015 29.9 3.5 32 58-91 1-32 (325)
174 PRK10629 EnvZ/OmpR regulon mod 68.3 33 0.0007 22.9 8.2 70 14-91 36-109 (127)
175 PF00403 HMA: Heavy-metal-asso 67.8 19 0.00041 20.0 6.2 54 15-74 1-58 (62)
176 PF03439 Spt5-NGN: Early trans 67.5 14 0.0003 22.6 4.0 36 39-79 33-68 (84)
177 KOG4008|consensus 67.4 9 0.00019 28.4 3.6 33 11-43 38-70 (261)
178 KOG3702|consensus 66.2 5.4 0.00012 33.7 2.5 71 15-86 513-583 (681)
179 COG5507 Uncharacterized conser 66.1 8.1 0.00017 24.5 2.7 23 53-75 64-86 (117)
180 KOG0156|consensus 66.0 17 0.00037 30.0 5.4 59 17-85 36-97 (489)
181 PF07292 NID: Nmi/IFP 35 domai 65.8 3.8 8.2E-05 25.5 1.3 23 13-35 52-74 (88)
182 PTZ00191 60S ribosomal protein 64.6 39 0.00084 23.1 6.0 56 16-74 84-141 (145)
183 PF12623 Hen1_L: RNA repair, l 62.2 29 0.00063 25.7 5.4 62 13-75 118-183 (245)
184 COG5584 Predicted small secret 57.1 21 0.00047 22.5 3.4 33 20-52 29-61 (103)
185 PF04127 DFP: DNA / pantothena 55.8 46 0.001 23.6 5.5 57 16-75 21-79 (185)
186 PHA03008 hypothetical protein; 53.6 31 0.00068 24.9 4.2 39 10-48 18-56 (234)
187 PF09869 DUF2096: Uncharacteri 53.5 76 0.0016 22.3 7.1 53 12-75 111-163 (169)
188 PF09707 Cas_Cas2CT1978: CRISP 52.3 36 0.00077 21.1 3.9 50 11-63 23-72 (86)
189 COG2608 CopZ Copper chaperone 52.0 47 0.001 19.4 5.5 45 14-64 4-48 (71)
190 PF11823 DUF3343: Protein of u 51.7 22 0.00049 20.9 2.9 25 57-81 3-27 (73)
191 COG0150 PurM Phosphoribosylami 51.7 6.4 0.00014 30.7 0.6 49 26-78 274-322 (345)
192 PF12829 Mhr1: Transcriptional 51.1 47 0.001 20.8 4.3 54 20-77 19-73 (91)
193 PF11491 DUF3213: Protein of u 50.9 46 0.00099 20.5 4.1 66 16-88 3-72 (88)
194 KOG3424|consensus 49.4 75 0.0016 21.0 5.4 45 24-69 34-83 (132)
195 PF05036 SPOR: Sporulation rel 48.8 4.1 9E-05 23.6 -0.6 60 13-76 4-65 (76)
196 PF08734 GYD: GYD domain; Int 47.6 66 0.0014 19.9 5.9 45 27-75 22-67 (91)
197 smart00195 DSPc Dual specifici 47.4 78 0.0017 20.6 5.7 70 15-88 7-84 (138)
198 COG0002 ArgC Acetylglutamate s 47.2 52 0.0011 25.9 4.9 51 14-65 247-303 (349)
199 PF11411 DNA_ligase_IV: DNA li 46.5 16 0.00034 18.7 1.4 17 23-39 19-35 (36)
200 KOG1999|consensus 44.3 1.3E+02 0.0028 27.2 7.2 34 54-88 209-242 (1024)
201 COG5470 Uncharacterized conser 43.8 50 0.0011 20.9 3.6 41 30-72 25-70 (96)
202 TIGR02223 ftsN cell division p 43.8 1.5E+02 0.0032 22.9 6.9 69 12-85 226-296 (298)
203 KOG4213|consensus 43.6 84 0.0018 22.5 5.0 50 25-74 118-169 (205)
204 PRK10905 cell division protein 42.6 69 0.0015 25.0 4.9 61 13-78 247-309 (328)
205 COG3254 Uncharacterized conser 42.3 92 0.002 20.0 5.0 42 28-72 27-68 (105)
206 PF08156 NOP5NT: NOP5NT (NUC12 41.6 9.6 0.00021 22.3 0.2 39 28-76 27-65 (67)
207 COG5193 LHP1 La protein, small 41.5 13 0.00029 29.7 1.0 47 27-73 196-244 (438)
208 PF13689 DUF4154: Domain of un 40.9 87 0.0019 21.0 4.9 60 27-91 2-61 (145)
209 KOG4246|consensus 40.5 17 0.00037 31.9 1.5 10 14-23 146-155 (1194)
210 COG1098 VacB Predicted RNA bin 40.2 72 0.0016 21.3 4.1 34 56-89 19-60 (129)
211 PF00398 RrnaAD: Ribosomal RNA 39.9 37 0.0008 25.3 3.1 23 13-35 97-119 (262)
212 PRK11633 cell division protein 39.6 1.4E+02 0.0031 22.0 6.0 71 13-85 149-221 (226)
213 PRK02886 hypothetical protein; 39.5 94 0.002 19.3 4.4 40 32-80 19-58 (87)
214 COG0030 KsgA Dimethyladenosine 39.2 50 0.0011 24.9 3.7 27 14-40 96-122 (259)
215 PF09902 DUF2129: Uncharacteri 38.6 86 0.0019 18.6 4.4 40 32-80 15-54 (71)
216 PF14581 SseB_C: SseB protein 38.4 39 0.00084 21.4 2.7 52 13-64 5-59 (108)
217 PHA01782 hypothetical protein 38.4 40 0.00088 23.5 2.8 40 20-62 62-101 (177)
218 PRK11558 putative ssRNA endonu 38.2 69 0.0015 20.3 3.6 49 11-64 25-75 (97)
219 KOG2888|consensus 38.1 13 0.00028 29.1 0.4 8 29-36 173-180 (453)
220 PRK02302 hypothetical protein; 37.7 1E+02 0.0022 19.2 4.5 39 32-79 21-59 (89)
221 PF06014 DUF910: Bacterial pro 37.7 22 0.00047 20.6 1.2 18 26-43 3-20 (62)
222 cd00027 BRCT Breast Cancer Sup 37.1 63 0.0014 17.5 3.3 26 15-40 3-28 (72)
223 KOG0862|consensus 37.1 24 0.00053 25.7 1.7 12 54-65 108-119 (216)
224 PRK11230 glycolate oxidase sub 37.1 1.7E+02 0.0038 24.2 6.8 48 28-76 204-255 (499)
225 PF12687 DUF3801: Protein of u 36.2 1.1E+02 0.0024 22.1 5.0 57 25-83 39-98 (204)
226 PF04026 SpoVG: SpoVG; InterP 36.1 77 0.0017 19.5 3.6 26 39-64 2-27 (84)
227 KOG0113|consensus 36.1 53 0.0012 25.4 3.4 14 25-38 151-164 (335)
228 PF05189 RTC_insert: RNA 3'-te 36.1 78 0.0017 19.9 3.8 48 15-62 12-64 (103)
229 KOG3671|consensus 36.0 74 0.0016 26.4 4.3 51 23-78 88-138 (569)
230 COG4010 Uncharacterized protei 35.9 1.5E+02 0.0031 20.5 5.2 46 20-75 118-163 (170)
231 TIGR00405 L26e_arch ribosomal 35.9 1.3E+02 0.0029 20.0 5.7 25 54-78 37-61 (145)
232 PF14893 PNMA: PNMA 35.6 34 0.00073 26.8 2.4 26 11-36 16-41 (331)
233 PF05573 NosL: NosL; InterPro 35.3 13 0.00028 25.3 0.1 22 54-75 113-134 (149)
234 COG5236 Uncharacterized conser 34.9 1.2E+02 0.0027 24.0 5.2 52 26-85 263-314 (493)
235 PRK12450 foldase protein PrsA; 34.8 1.1E+02 0.0024 23.6 5.1 39 24-76 132-170 (309)
236 PF08544 GHMP_kinases_C: GHMP 34.6 98 0.0021 18.1 5.5 44 28-76 37-80 (85)
237 KOG2855|consensus 33.8 79 0.0017 24.8 4.1 47 12-62 60-107 (330)
238 cd04904 ACT_AAAH ACT domain of 33.5 1E+02 0.0022 18.0 7.8 51 26-77 13-65 (74)
239 cd04878 ACT_AHAS N-terminal AC 33.3 88 0.0019 17.2 7.3 60 15-76 2-63 (72)
240 PF01762 Galactosyl_T: Galacto 33.1 68 0.0015 22.5 3.5 37 11-47 19-58 (195)
241 TIGR01033 DNA-binding regulato 32.3 2.1E+02 0.0046 21.3 6.1 45 12-63 93-143 (238)
242 PRK13259 regulatory protein Sp 32.1 89 0.0019 19.7 3.4 26 39-64 2-27 (94)
243 PF15407 Spo7_2_N: Sporulation 31.8 5.9 0.00013 23.3 -1.8 25 11-35 25-49 (67)
244 PF00585 Thr_dehydrat_C: C-ter 31.8 1.3E+02 0.0028 18.6 4.4 59 15-78 11-74 (91)
245 PF13291 ACT_4: ACT domain; PD 31.3 1.1E+02 0.0024 17.8 6.2 65 14-78 7-72 (80)
246 cd04880 ACT_AAAH-PDT-like ACT 30.9 1.1E+02 0.0024 17.6 6.2 51 26-77 12-66 (75)
247 CHL00123 rps6 ribosomal protei 30.9 1.4E+02 0.003 18.7 6.4 57 15-73 10-80 (97)
248 PF10905 DUF2695: Protein of u 29.9 74 0.0016 17.8 2.5 24 24-47 29-52 (53)
249 PRK10672 rare lipoprotein A; P 29.7 2.6E+02 0.0057 22.3 6.4 64 12-82 288-354 (361)
250 TIGR00755 ksgA dimethyladenosi 29.1 74 0.0016 23.5 3.2 24 15-38 96-119 (253)
251 PF05727 UPF0228: Uncharacteri 28.5 1.8E+02 0.004 19.3 5.7 55 20-89 58-122 (127)
252 KOG1232|consensus 28.5 74 0.0016 25.7 3.2 50 22-72 233-286 (511)
253 PRK12280 rplW 50S ribosomal pr 28.3 2.1E+02 0.0045 19.9 5.5 32 15-46 23-56 (158)
254 KOG0829|consensus 28.2 2.1E+02 0.0045 19.9 5.0 58 11-68 15-83 (169)
255 COG4009 Uncharacterized protei 28.2 72 0.0016 19.5 2.4 23 16-38 51-73 (88)
256 KOG0984|consensus 27.8 1.2E+02 0.0025 22.7 3.8 41 5-45 23-67 (282)
257 PF08538 DUF1749: Protein of u 27.7 60 0.0013 25.1 2.6 62 11-77 32-99 (303)
258 PRK00274 ksgA 16S ribosomal RN 27.7 82 0.0018 23.6 3.3 22 15-36 107-128 (272)
259 PF10281 Ish1: Putative stress 27.5 69 0.0015 16.1 2.1 17 24-40 3-19 (38)
260 PF08002 DUF1697: Protein of u 27.3 1.9E+02 0.0041 19.4 4.7 31 14-45 4-38 (137)
261 PF14401 RLAN: RimK-like ATPgr 27.2 1.5E+02 0.0032 20.4 4.2 58 13-70 87-145 (153)
262 PRK00110 hypothetical protein; 26.7 2.2E+02 0.0049 21.3 5.4 44 13-63 94-143 (245)
263 CHL00030 rpl23 ribosomal prote 26.7 1.7E+02 0.0037 18.3 5.2 31 16-46 21-53 (93)
264 PRK04405 prsA peptidylprolyl i 26.7 1.8E+02 0.0039 22.3 5.0 40 24-77 128-167 (298)
265 PRK12378 hypothetical protein; 26.7 2.7E+02 0.0058 20.7 5.7 28 12-39 90-119 (235)
266 PHA01632 hypothetical protein 26.5 83 0.0018 17.7 2.3 21 16-36 19-39 (64)
267 PRK10927 essential cell divisi 26.3 2.4E+02 0.0052 22.0 5.5 67 12-85 246-314 (319)
268 TIGR01873 cas_CT1978 CRISPR-as 26.3 51 0.0011 20.4 1.6 48 11-64 23-74 (87)
269 KOG2854|consensus 26.3 98 0.0021 24.3 3.4 65 10-74 157-231 (343)
270 PF01037 AsnC_trans_reg: AsnC 25.6 1.4E+02 0.0029 16.9 6.5 45 26-74 11-55 (74)
271 PTZ00338 dimethyladenosine tra 25.5 87 0.0019 24.0 3.1 22 15-36 103-124 (294)
272 TIGR00387 glcD glycolate oxida 25.5 2.4E+02 0.0051 22.6 5.7 48 28-76 147-198 (413)
273 PF08206 OB_RNB: Ribonuclease 25.4 42 0.00092 18.7 1.1 37 54-91 7-44 (58)
274 PF08442 ATP-grasp_2: ATP-gras 25.4 2.1E+02 0.0046 20.6 4.9 54 25-81 25-81 (202)
275 KOG2187|consensus 25.3 41 0.0009 28.0 1.4 38 54-91 62-99 (534)
276 PF11061 DUF2862: Protein of u 24.5 1.5E+02 0.0032 17.3 3.2 31 28-64 18-51 (64)
277 PF00521 DNA_topoisoIV: DNA gy 24.5 1.9E+02 0.0042 23.4 5.0 56 14-73 217-276 (426)
278 PRK13011 formyltetrahydrofolat 24.4 3E+02 0.0066 21.0 5.9 61 20-81 56-116 (286)
279 PF14111 DUF4283: Domain of un 24.2 78 0.0017 21.0 2.4 32 16-47 107-139 (153)
280 COG0225 MsrA Peptide methionin 24.2 2.5E+02 0.0054 19.9 4.9 74 15-91 59-136 (174)
281 PF00276 Ribosomal_L23: Riboso 24.2 1.4E+02 0.003 18.5 3.3 30 16-45 22-53 (91)
282 PF15063 TC1: Thyroid cancer p 24.2 54 0.0012 19.8 1.4 24 17-40 29-52 (79)
283 smart00434 TOP4c DNA Topoisome 24.0 3E+02 0.0064 22.6 6.0 58 14-72 233-294 (445)
284 PF09341 Pcc1: Transcription f 23.9 1.3E+02 0.0028 17.6 3.1 20 57-76 4-23 (76)
285 PLN02655 ent-kaurene oxidase 23.7 2.1E+02 0.0046 23.0 5.2 49 17-74 9-60 (466)
286 KOG2287|consensus 23.6 98 0.0021 24.3 3.1 37 12-48 128-166 (349)
287 cd04879 ACT_3PGDH-like ACT_3PG 23.6 1.4E+02 0.003 16.2 5.8 24 24-47 10-34 (71)
288 PF12007 DUF3501: Protein of u 23.6 1.5E+02 0.0032 21.4 3.7 47 25-78 64-110 (192)
289 COG0217 Uncharacterized conser 23.5 3.2E+02 0.007 20.5 5.9 36 13-48 94-135 (241)
290 smart00457 MACPF membrane-atta 23.4 67 0.0015 22.7 2.1 28 18-45 30-59 (194)
291 KOG1719|consensus 23.1 2.6E+02 0.0057 19.6 4.7 26 63-88 90-115 (183)
292 COG2088 SpoVG Uncharacterized 23.0 1.3E+02 0.0028 18.8 2.8 27 39-65 2-28 (95)
293 COG5594 Uncharacterized integr 22.9 1.1E+02 0.0024 27.0 3.5 30 54-83 356-385 (827)
294 PF02946 GTF2I: GTF2I-like rep 22.8 1.9E+02 0.0041 17.5 3.6 30 14-43 35-72 (76)
295 KOG2836|consensus 22.7 2.1E+02 0.0046 19.5 4.1 32 14-45 17-48 (173)
296 PLN00110 flavonoid 3',5'-hydro 22.6 2.9E+02 0.0062 22.7 5.8 49 17-74 41-91 (504)
297 smart00650 rADc Ribosomal RNA 22.5 1.4E+02 0.003 20.3 3.5 23 14-36 78-100 (169)
298 PF06919 Phage_T4_Gp30_7: Phag 22.4 1.5E+02 0.0032 19.1 3.1 42 36-91 28-69 (121)
299 PF09702 Cas_Csa5: CRISPR-asso 22.3 1.2E+02 0.0027 19.4 2.8 21 13-36 64-84 (105)
300 KOG0226|consensus 21.7 32 0.0007 25.9 0.1 74 14-88 97-173 (290)
301 PF11181 YflT: Heat induced st 21.3 1.2E+02 0.0026 19.1 2.7 29 60-88 4-32 (103)
302 KOG3580|consensus 21.1 4.5E+02 0.0097 22.9 6.4 15 58-72 64-78 (1027)
303 PLN02971 tryptophan N-hydroxyl 21.0 2.6E+02 0.0056 23.2 5.3 58 17-84 67-129 (543)
304 PF13046 DUF3906: Protein of u 21.0 1E+02 0.0022 17.9 2.0 33 26-60 31-63 (64)
305 smart00738 NGN In Spt5p, this 20.7 82 0.0018 19.5 1.9 23 56-78 60-82 (106)
306 PLN02968 Probable N-acetyl-gam 20.7 1.7E+02 0.0037 23.3 4.0 32 14-46 279-311 (381)
307 TIGR02118 conserved hypothetic 20.6 1.3E+02 0.0028 18.6 2.8 22 55-76 53-74 (100)
308 PF01984 dsDNA_bind: Double-st 20.6 78 0.0017 20.4 1.7 21 19-39 75-95 (107)
309 PF00067 p450: Cytochrome P450 20.6 3.4E+02 0.0074 20.9 5.7 37 28-73 24-60 (463)
310 PF07237 DUF1428: Protein of u 20.5 2.5E+02 0.0054 18.0 6.0 48 29-76 24-86 (103)
311 PRK09213 pur operon repressor; 20.4 3.2E+02 0.0069 20.8 5.2 41 29-76 46-88 (271)
312 KOG2943|consensus 20.2 3E+02 0.0064 20.8 4.8 29 51-79 205-237 (299)
313 PRK00464 nrdR transcriptional 20.2 1.9E+02 0.004 20.0 3.6 40 24-75 102-147 (154)
314 PLN02936 epsilon-ring hydroxyl 20.1 1.1E+02 0.0025 24.8 3.0 40 26-74 38-77 (489)
315 TIGR01743 purR_Bsub pur operon 20.1 3.6E+02 0.0077 20.5 5.4 42 29-77 44-87 (268)
No 1
>KOG4207|consensus
Probab=99.95 E-value=1.8e-27 Score=166.00 Aligned_cols=95 Identities=86% Similarity=1.431 Sum_probs=93.2
Q ss_pred CCCCCCCCCCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc
Q psy7051 1 MSYGRPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML 80 (158)
Q Consensus 1 ~~~~~p~~~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i 80 (158)
|+||.+||+++++++|.|-||..-++.++|..+|++||.|..|.|+.|..|....|||||.|....+|+.|++.|+|..|
T Consensus 1 MS~g~~PPdv~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr~Tr~sRgFaFVrf~~k~daedA~damDG~~l 80 (256)
T KOG4207|consen 1 MSYGRPPPDVEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDRYTRQSRGFAFVRFHDKRDAEDALDAMDGAVL 80 (256)
T ss_pred CCCCCCCCCcccceeEEecceeccCCHHHHHHHHHHhCcccceecccccccccccceeEEEeeecchHHHHHHhhcceee
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCceEEEEeeecCCC
Q psy7051 81 DGRELRVQMARYGRP 95 (158)
Q Consensus 81 ~g~~i~v~~a~~~~~ 95 (158)
+|+.|.|++|+++++
T Consensus 81 dgRelrVq~arygr~ 95 (256)
T KOG4207|consen 81 DGRELRVQMARYGRP 95 (256)
T ss_pred ccceeeehhhhcCCC
Confidence 999999999998876
No 2
>KOG0107|consensus
Probab=99.92 E-value=9.6e-24 Score=143.82 Aligned_cols=76 Identities=37% Similarity=0.640 Sum_probs=71.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
.++||||||+..+++.||+.+|..||.|..|+|-.+ +.|||||||+++.+|+.|+..|+|..|+|..|.|++++.
T Consensus 10 ~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvArn-----PPGfAFVEFed~RDA~DAvr~LDG~~~cG~r~rVE~S~G 84 (195)
T KOG0107|consen 10 NTKVYVGNLGSRATKRELERAFSKYGPLRSVWVARN-----PPGFAFVEFEDPRDAEDAVRYLDGKDICGSRIRVELSTG 84 (195)
T ss_pred CceEEeccCCCCcchHHHHHHHHhcCcceeEEEeec-----CCCceEEeccCcccHHHHHhhcCCccccCceEEEEeecC
Confidence 478999999999999999999999999999999776 479999999999999999999999999999999999984
Q ss_pred C
Q psy7051 93 G 93 (158)
Q Consensus 93 ~ 93 (158)
.
T Consensus 85 ~ 85 (195)
T KOG0107|consen 85 R 85 (195)
T ss_pred C
Confidence 3
No 3
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.90 E-value=2.2e-22 Score=137.30 Aligned_cols=84 Identities=37% Similarity=0.525 Sum_probs=78.9
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..++|||+|||+.+++++|+++|++||.|+.|.|+.+..+++++|||||+|.+.++|+.||+.||+..|+|+.|+|+++.
T Consensus 33 ~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV~F~~~e~A~~Al~~lng~~i~Gr~l~V~~a~ 112 (144)
T PLN03134 33 MSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKELNGRHIRVNPAN 112 (144)
T ss_pred CCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHcCCCEECCEEEEEEeCC
Confidence 35789999999999999999999999999999999999999999999999999999999999999999999999999997
Q ss_pred cCCC
Q psy7051 92 YGRP 95 (158)
Q Consensus 92 ~~~~ 95 (158)
....
T Consensus 113 ~~~~ 116 (144)
T PLN03134 113 DRPS 116 (144)
T ss_pred cCCC
Confidence 5433
No 4
>KOG0113|consensus
Probab=99.87 E-value=5.8e-21 Score=139.80 Aligned_cols=87 Identities=34% Similarity=0.587 Sum_probs=81.5
Q ss_pred CCCCCCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCce
Q psy7051 5 RPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRE 84 (158)
Q Consensus 5 ~p~~~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~ 84 (158)
.|....++-.||||+-|+..++|..|+..|+.||+|+.|.|+.++.||+++|||||+|+.+-++..|++..+|..|+|+.
T Consensus 93 dp~a~gDPy~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~vTgkskGYAFIeye~erdm~~AYK~adG~~Idgrr 172 (335)
T KOG0113|consen 93 DPNAIGDPYKTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDKVTGKSKGYAFIEYEHERDMKAAYKDADGIKIDGRR 172 (335)
T ss_pred CCcccCCccceeeeeeccccccHHHHHHHHHhcCcceeEEEeeecccCCccceEEEEeccHHHHHHHHHhccCceecCcE
Confidence 44445577799999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEeee
Q psy7051 85 LRVQMAR 91 (158)
Q Consensus 85 i~v~~a~ 91 (158)
|.|.+..
T Consensus 173 i~VDvER 179 (335)
T KOG0113|consen 173 ILVDVER 179 (335)
T ss_pred EEEEecc
Confidence 9999865
No 5
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.80 E-value=5.5e-19 Score=136.68 Aligned_cols=79 Identities=28% Similarity=0.387 Sum_probs=76.2
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
.+|||+|||+.+++++|.++|++||.|+.|.|+.+..++.++|||||+|.+.++|..|++.|||..|.|+.|+|.|...
T Consensus 270 ~~lfV~NL~~~~~e~~L~~~F~~fG~v~~v~i~~d~~t~~skG~aFV~F~~~~~A~~Ai~~lnG~~~~gr~i~V~~~~~ 348 (352)
T TIGR01661 270 YCIFVYNLSPDTDETVLWQLFGPFGAVQNVKIIRDLTTNQCKGYGFVSMTNYDEAAMAILSLNGYTLGNRVLQVSFKTN 348 (352)
T ss_pred cEEEEeCCCCCCCHHHHHHHHHhCCCeEEEEEeEcCCCCCccceEEEEECCHHHHHHHHHHhCCCEECCeEEEEEEccC
Confidence 4799999999999999999999999999999999998999999999999999999999999999999999999999873
No 6
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.80 E-value=4.8e-19 Score=136.38 Aligned_cols=82 Identities=29% Similarity=0.400 Sum_probs=77.6
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
....++|||+|||+++++++|+++|+.||+|+.|+|+.+..+++.+|||||+|.++++|+.||+.||+..|.++.|+|.+
T Consensus 104 ~~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~~tg~srGyaFVeF~~~e~A~~Ai~~LnG~~l~gr~i~V~~ 183 (346)
T TIGR01659 104 NNSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDYKTGYSFGYAFVDFGSEADSQRAIKNLNGITVRNKRLKVSY 183 (346)
T ss_pred CCCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCccCcEEEEEEccHHHHHHHHHHcCCCccCCceeeeec
Confidence 34568999999999999999999999999999999999988999999999999999999999999999999999999998
Q ss_pred ee
Q psy7051 90 AR 91 (158)
Q Consensus 90 a~ 91 (158)
++
T Consensus 184 a~ 185 (346)
T TIGR01659 184 AR 185 (346)
T ss_pred cc
Confidence 75
No 7
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.80 E-value=4.4e-19 Score=137.24 Aligned_cols=81 Identities=30% Similarity=0.540 Sum_probs=77.4
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
+..+|||+|||..+++++|+++|+.||+|..|.|+.++.+++++|||||+|.+.++|+.||+.|||..|.|+.|.|.+++
T Consensus 2 ~~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~~~g~s~g~afV~f~~~~~A~~Ai~~l~g~~l~g~~i~v~~a~ 81 (352)
T TIGR01661 2 SKTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDKVTGQSLGYGFVNYVRPEDAEKAVNSLNGLRLQNKTIKVSYAR 81 (352)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcCCCCccceEEEEEECcHHHHHHHHhhcccEEECCeeEEEEeec
Confidence 45789999999999999999999999999999999998899999999999999999999999999999999999999986
Q ss_pred c
Q psy7051 92 Y 92 (158)
Q Consensus 92 ~ 92 (158)
.
T Consensus 82 ~ 82 (352)
T TIGR01661 82 P 82 (352)
T ss_pred c
Confidence 3
No 8
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.79 E-value=1.9e-18 Score=133.08 Aligned_cols=90 Identities=32% Similarity=0.506 Sum_probs=79.5
Q ss_pred CCCCCCCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCC-
Q psy7051 4 GRPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDG- 82 (158)
Q Consensus 4 ~~p~~~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g- 82 (158)
..|.......++|||+|||..+++++|+++|++||+|+.|.|+.++.++++++||||+|.+.++|++||+.||+..|.+
T Consensus 184 a~p~~~~~~~~~lfV~nLp~~vtee~L~~~F~~fG~V~~v~i~~d~~tg~~kG~aFV~F~~~e~A~~Ai~~lng~~~~g~ 263 (346)
T TIGR01659 184 ARPGGESIKDTNLYVTNLPRTITDDQLDTIFGKYGQIVQKNILRDKLTGTPRGVAFVRFNKREEAQEAISALNNVIPEGG 263 (346)
T ss_pred ccccccccccceeEEeCCCCcccHHHHHHHHHhcCCEEEEEEeecCCCCccceEEEEEECCHHHHHHHHHHhCCCccCCC
Confidence 3343333446789999999999999999999999999999999998899999999999999999999999999998866
Q ss_pred -ceEEEEeeecC
Q psy7051 83 -RELRVQMARYG 93 (158)
Q Consensus 83 -~~i~v~~a~~~ 93 (158)
..|+|.+++..
T Consensus 264 ~~~l~V~~a~~~ 275 (346)
T TIGR01659 264 SQPLTVRLAEEH 275 (346)
T ss_pred ceeEEEEECCcc
Confidence 68999998754
No 9
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.79 E-value=9.5e-19 Score=104.91 Aligned_cols=70 Identities=37% Similarity=0.695 Sum_probs=66.3
Q ss_pred EEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
|||+|||+.+++++|.++|++||.|..+.+..+ .++...++|||+|.+.++|+.|++.|+|..|+|+.|+
T Consensus 1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~-~~~~~~~~a~V~F~~~~~a~~a~~~l~g~~~~~~~ir 70 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRN-SSGKSKGYAFVEFESEEDAEKALEELNGKKINGRKIR 70 (70)
T ss_dssp EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEE-TTSSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred cEEcCCCCcCCHHHHHHHHHHhhhccccccccc-ccccccceEEEEEcCHHHHHHHHHHcCCCEECccCcC
Confidence 799999999999999999999999999999887 5788899999999999999999999999999999875
No 10
>KOG0130|consensus
Probab=99.78 E-value=1.4e-18 Score=114.04 Aligned_cols=84 Identities=25% Similarity=0.357 Sum_probs=78.8
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
-....|||+++...+++++|.+.|..||+|+++.|..+..||..+|||+|+|++.+.|++|+..|||..|.|+.|.|.|+
T Consensus 70 VEGwIi~VtgvHeEatEedi~d~F~dyGeiKNihLNLDRRtGy~KGYaLvEYet~keAq~A~~~~Ng~~ll~q~v~VDw~ 149 (170)
T KOG0130|consen 70 VEGWIIFVTGVHEEATEEDIHDKFADYGEIKNIHLNLDRRTGYVKGYALVEYETLKEAQAAIDALNGAELLGQNVSVDWC 149 (170)
T ss_pred eeeEEEEEeccCcchhHHHHHHHHhhcccccceeeccccccccccceeeeehHhHHHHHHHHHhccchhhhCCceeEEEE
Confidence 34567999999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred ecCC
Q psy7051 91 RYGR 94 (158)
Q Consensus 91 ~~~~ 94 (158)
-...
T Consensus 150 Fv~g 153 (170)
T KOG0130|consen 150 FVKG 153 (170)
T ss_pred EecC
Confidence 7443
No 11
>KOG0105|consensus
Probab=99.75 E-value=1.5e-17 Score=114.83 Aligned_cols=83 Identities=35% Similarity=0.502 Sum_probs=73.4
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..+|||+|||..|.+.+|+++|.+||.|.+|.|... ..+..||||+|+++.+|+.||..-+|..++|+.|.|+++..
T Consensus 6 ~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r---~g~ppfafVeFEd~RDAeDAiygRdGYdydg~rLRVEfprg 82 (241)
T KOG0105|consen 6 SRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNR---PGPPPFAFVEFEDPRDAEDAIYGRDGYDYDGCRLRVEFPRG 82 (241)
T ss_pred cceEEecCCCcchhhccHHHHHhhhcceEEEEeccC---CCCCCeeEEEecCccchhhhhhcccccccCcceEEEEeccC
Confidence 468999999999999999999999999999988443 22467999999999999999999999999999999999997
Q ss_pred CCCCCC
Q psy7051 93 GRPPSS 98 (158)
Q Consensus 93 ~~~~~~ 98 (158)
+++...
T Consensus 83 gr~s~~ 88 (241)
T KOG0105|consen 83 GRSSSD 88 (241)
T ss_pred CCcccc
Confidence 764433
No 12
>KOG0122|consensus
Probab=99.75 E-value=8.3e-18 Score=120.34 Aligned_cols=82 Identities=33% Similarity=0.546 Sum_probs=78.5
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
....+|-|.||+.++++.+|+++|.+||.|..|.|..++.||.++|||||.|.+.++|+.||+.|||.-++.-.|.|+|+
T Consensus 187 ~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK~TG~~kGFAFVtF~sRddA~rAI~~LnG~gyd~LILrvEws 266 (270)
T KOG0122|consen 187 DDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDKETGLSKGFAFVTFESRDDAARAIADLNGYGYDNLILRVEWS 266 (270)
T ss_pred CccceeEEecCccccChhHHHHHhhccCccceeEEEEccccCcccceEEEEEecHHHHHHHHHHccCcccceEEEEEEec
Confidence 34578999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ec
Q psy7051 91 RY 92 (158)
Q Consensus 91 ~~ 92 (158)
++
T Consensus 267 kP 268 (270)
T KOG0122|consen 267 KP 268 (270)
T ss_pred CC
Confidence 84
No 13
>KOG0149|consensus
Probab=99.75 E-value=6.1e-18 Score=120.46 Aligned_cols=80 Identities=30% Similarity=0.513 Sum_probs=74.7
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..++||||||+|.+..++|.++|++||+|.+..|+.|+.+++++||+||+|.+.+.|+.||+. ....|+|++..|.+|-
T Consensus 11 ~~TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~~t~rskGyGfVTf~d~~aa~rAc~d-p~piIdGR~aNcnlA~ 89 (247)
T KOG0149|consen 11 TFTKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDKNTGRSKGYGFVTFRDAEAATRACKD-PNPIIDGRKANCNLAS 89 (247)
T ss_pred eEEEEEEcCcccccchHHHHHHHHHhCceEEEEEEeccCCccccceeeEEeecHHHHHHHhcC-CCCcccccccccchhh
Confidence 347899999999999999999999999999999999999999999999999999999999964 4668999999999987
Q ss_pred c
Q psy7051 92 Y 92 (158)
Q Consensus 92 ~ 92 (158)
.
T Consensus 90 l 90 (247)
T KOG0149|consen 90 L 90 (247)
T ss_pred h
Confidence 6
No 14
>KOG0121|consensus
Probab=99.73 E-value=1.2e-17 Score=108.77 Aligned_cols=80 Identities=34% Similarity=0.690 Sum_probs=76.1
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...||||+||+..++|++|.++|+++|+|..|.|-.|..+..+-|||||+|.+.++|+.|++.++|..|+.+.|.|.|..
T Consensus 35 ~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpCGFCFVeyy~~~dA~~AlryisgtrLddr~ir~D~D~ 114 (153)
T KOG0121|consen 35 KSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPCGFCFVEYYSRDDAEDALRYISGTRLDDRPIRIDWDA 114 (153)
T ss_pred hcceEEEeeeeeeecHHHHHHHHHhccchheeEeccccCCcCccceEEEEEecchhHHHHHHHhccCcccccceeeeccc
Confidence 35799999999999999999999999999999999998888889999999999999999999999999999999999875
No 15
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.73 E-value=3.2e-17 Score=98.55 Aligned_cols=70 Identities=41% Similarity=0.721 Sum_probs=64.3
Q ss_pred EEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
|||+|||+.+++++|.++|+.||.|..+.+..++. +..+++|||+|.++++|..|++.+++..|+|+.|+
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~~~~~-~~~~~~a~v~f~~~~~a~~al~~~~~~~~~g~~l~ 70 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLIKNKD-GQSRGFAFVEFSSEEDAKRALELLNGKEIDGRKLR 70 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTSSBEEEEEEEESTT-SSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred CEEeCCCCCCCHHHHHHHHHhcCCcceEEEEeeec-cccCCEEEEEeCCHHHHHHHHHHCCCcEECCEEcC
Confidence 79999999999999999999999999999999876 88899999999999999999999999999999874
No 16
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.72 E-value=7.4e-17 Score=130.39 Aligned_cols=84 Identities=24% Similarity=0.387 Sum_probs=78.5
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
+...+|||+|||..+++++|+++|+.||.|..+.|+.+..++..+|||||+|.+.++|+.||+.|||..|.|..|.|.++
T Consensus 293 ~~~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~~~g~~~g~afv~f~~~~~a~~A~~~l~g~~~~~~~l~v~~a 372 (509)
T TIGR01642 293 DSKDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDIATGLSKGYAFCEYKDPSVTDVAIAALNGKDTGDNKLHVQRA 372 (509)
T ss_pred CCCCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecCCCCCcCeEEEEEECCHHHHHHHHHHcCCCEECCeEEEEEEC
Confidence 34578999999999999999999999999999999999889999999999999999999999999999999999999998
Q ss_pred ecCC
Q psy7051 91 RYGR 94 (158)
Q Consensus 91 ~~~~ 94 (158)
....
T Consensus 373 ~~~~ 376 (509)
T TIGR01642 373 CVGA 376 (509)
T ss_pred ccCC
Confidence 7543
No 17
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.72 E-value=7.9e-17 Score=117.90 Aligned_cols=76 Identities=22% Similarity=0.400 Sum_probs=70.1
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..+|||+||++.+++++|+++|+.||+|+.|.|+.+.. ..|||||+|.++++|+.|| .|||..|.|+.|.|.++..
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~~---~~GfAFVtF~d~eaAe~Al-lLnG~~l~gr~V~Vt~a~~ 79 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSENE---RSQIAYVTFKDPQGAETAL-LLSGATIVDQSVTITPAED 79 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecCC---CCCEEEEEeCcHHHHHHHH-HhcCCeeCCceEEEEeccC
Confidence 47899999999999999999999999999999988753 4689999999999999999 5999999999999999863
No 18
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.70 E-value=8.4e-17 Score=130.35 Aligned_cols=87 Identities=30% Similarity=0.558 Sum_probs=76.1
Q ss_pred CCCCCCC-----CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCC
Q psy7051 3 YGRPPPR-----IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDG 77 (158)
Q Consensus 3 ~~~p~~~-----~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g 77 (158)
||.||+. .+..++|||+|||.+++|++|.++|++||.|..|.|+.+ .++.++|||||+|.+.++|+.||+.||+
T Consensus 43 ~g~Pp~~~~~~~p~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D-~sG~sRGfaFV~F~~~e~A~~Ai~~lng 121 (578)
T TIGR01648 43 YGGPPPGWSGVQPGRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMD-FSGQNRGYAFVTFCGKEEAKEAVKLLNN 121 (578)
T ss_pred CCCCCCcccCCCCCCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEEC-CCCCccceEEEEeCCHHHHHHHHHHcCC
Confidence 7766653 233489999999999999999999999999999999999 6899999999999999999999999999
Q ss_pred cccC-CceEEEEee
Q psy7051 78 RMLD-GRELRVQMA 90 (158)
Q Consensus 78 ~~i~-g~~i~v~~a 90 (158)
..|. ++.|.|..+
T Consensus 122 ~~i~~Gr~l~V~~S 135 (578)
T TIGR01648 122 YEIRPGRLLGVCIS 135 (578)
T ss_pred CeecCCcccccccc
Confidence 9884 677766654
No 19
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.70 E-value=1.3e-16 Score=129.64 Aligned_cols=80 Identities=23% Similarity=0.372 Sum_probs=76.7
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..+|||+||+..+++++|+++|+.||.|..|.|+.+..++..+|||||+|.+.++|..||+.||+..|+|+.|.|.++..
T Consensus 204 ~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~~tgksKGfGFVeFe~~e~A~kAI~amNg~elgGr~LrV~kAi~ 283 (612)
T TIGR01645 204 FNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDLGGQYLRVGKCVT 283 (612)
T ss_pred cceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecCCCCCcCCeEEEEECCHHHHHHHHHHhCCCeeCCeEEEEEecCC
Confidence 46899999999999999999999999999999999988899999999999999999999999999999999999998874
No 20
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.69 E-value=1.4e-16 Score=127.34 Aligned_cols=79 Identities=38% Similarity=0.714 Sum_probs=76.2
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..+|||+|||..+++++|+++|++||.|..|.|+.+..++..+|||||+|.+.++|..|++.|||..|.|+.|.|.++.
T Consensus 186 ~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~~~g~~~g~afV~f~~~e~A~~A~~~l~g~~i~g~~i~v~~a~ 264 (457)
T TIGR01622 186 FLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDPETGRSKGFGFIQFHDAEEAKEALEVMNGFELAGRPIKVGYAQ 264 (457)
T ss_pred CCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcCCCCccceEEEEEECCHHHHHHHHHhcCCcEECCEEEEEEEcc
Confidence 5789999999999999999999999999999999998888999999999999999999999999999999999999976
No 21
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.68 E-value=1.6e-16 Score=129.12 Aligned_cols=79 Identities=32% Similarity=0.553 Sum_probs=75.3
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
...+|||+||++.+++++|+++|.+||.|..|.|+.++.+++++|||||+|.+.++|+.|++.|||..|.|+.|+|...
T Consensus 106 ~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~~TgkskGfAFVeF~s~e~A~~Ai~~lnG~~i~GR~IkV~rp 184 (612)
T TIGR01645 106 IMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQMLGGRNIKVGRP 184 (612)
T ss_pred CCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecCCCCCcCCeEEEEeCcHHHHHHHHHhcCCeEEecceeeeccc
Confidence 4578999999999999999999999999999999999999999999999999999999999999999999999999854
No 22
>smart00362 RRM_2 RNA recognition motif.
Probab=99.68 E-value=4e-16 Score=93.01 Aligned_cols=72 Identities=43% Similarity=0.757 Sum_probs=66.5
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
+|+|+|||..+++++|+++|.+||.|..+.+..+. +...++|||+|.+.++|+.|++.|++..|.|..|.|.
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~~--~~~~~~~~v~f~~~~~a~~a~~~~~~~~~~~~~i~v~ 72 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKDT--GKSKGFAFVEFESEEDAEKAIEALNGTKLGGRPLRVE 72 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecCC--CCCCceEEEEeCCHHHHHHHHHHhCCcEECCEEEeeC
Confidence 58999999999999999999999999999888775 6678999999999999999999999999999998873
No 23
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.68 E-value=2.4e-16 Score=128.96 Aligned_cols=77 Identities=34% Similarity=0.518 Sum_probs=74.6
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
+|||+|||.++++++|.++|.+||.|..|.|+.+..+++++|||||+|.+.++|+.||+.||+..|.|+.|.|.|+.
T Consensus 2 sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~~t~~s~G~afV~F~~~~~A~~Al~~ln~~~i~gk~i~i~~s~ 78 (562)
T TIGR01628 2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDSVTRRSLGYGYVNFQNPADAERALETMNFKRLGGKPIRIMWSQ 78 (562)
T ss_pred eEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHhCCCEECCeeEEeeccc
Confidence 69999999999999999999999999999999999889999999999999999999999999999999999999875
No 24
>KOG0117|consensus
Probab=99.68 E-value=3.8e-16 Score=120.03 Aligned_cols=89 Identities=30% Similarity=0.588 Sum_probs=80.0
Q ss_pred CCCCCCCCCC-----CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCC
Q psy7051 3 YGRPPPRIEG-----MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDG 77 (158)
Q Consensus 3 ~~~p~~~~~~-----~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g 77 (158)
||.|||.-+. .+-||||.||.++.|++|..+|++.|+|-++.|++++.+|.++|||||+|.+.++|+.||+.||+
T Consensus 68 ~ggPpP~weg~~p~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~~sG~nRGYAFVtf~~Ke~Aq~Aik~lnn 147 (506)
T KOG0117|consen 68 YGGPPPGWEGPPPPRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDPFSGDNRGYAFVTFCTKEEAQEAIKELNN 147 (506)
T ss_pred cCCCCCcccCCCCCCCceEEecCCCccccchhhHHHHHhccceeeEEEeecccCCCCcceEEEEeecHHHHHHHHHHhhC
Confidence 5666664332 46899999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccc-CCceEEEEeee
Q psy7051 78 RML-DGRELRVQMAR 91 (158)
Q Consensus 78 ~~i-~g~~i~v~~a~ 91 (158)
.+| .|+.|.|+.+-
T Consensus 148 ~Eir~GK~igvc~Sv 162 (506)
T KOG0117|consen 148 YEIRPGKLLGVCVSV 162 (506)
T ss_pred ccccCCCEeEEEEee
Confidence 998 68888888764
No 25
>KOG0111|consensus
Probab=99.67 E-value=5.2e-17 Score=114.92 Aligned_cols=82 Identities=35% Similarity=0.519 Sum_probs=78.8
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..+||||+|...+++.-|...|-+||.|+.|.++.|.++++.+||+||+|...|+|.+||..||+.+|.|+.|.|.+|++
T Consensus 10 KrtlYVGGladeVtekvLhaAFIPFGDI~dIqiPlDyesqkHRgFgFVefe~aEDAaaAiDNMnesEL~GrtirVN~AkP 89 (298)
T KOG0111|consen 10 KRTLYVGGLADEVTEKVLHAAFIPFGDIKDIQIPLDYESQKHRGFGFVEFEEAEDAAAAIDNMNESELFGRTIRVNLAKP 89 (298)
T ss_pred ceeEEeccchHHHHHHHHHhccccccchhhcccccchhcccccceeEEEeeccchhHHHhhcCchhhhcceeEEEeecCC
Confidence 47899999999999999999999999999999999999999999999999999999999999999999999999999986
Q ss_pred CC
Q psy7051 93 GR 94 (158)
Q Consensus 93 ~~ 94 (158)
.+
T Consensus 90 ~k 91 (298)
T KOG0111|consen 90 EK 91 (298)
T ss_pred cc
Confidence 54
No 26
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.66 E-value=8.3e-16 Score=111.01 Aligned_cols=76 Identities=24% Similarity=0.336 Sum_probs=69.1
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...+|||+||++.+++++|++||+.||+|..|.|+.+.. ..++|||+|.++++|+.|+ .|+|..|.++.|.|....
T Consensus 4 ~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D~e---t~gfAfVtF~d~~aaetAl-lLnGa~l~d~~I~It~~~ 79 (243)
T PLN03121 4 GGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRSGE---YACTAYVTFKDAYALETAV-LLSGATIVDQRVCITRWG 79 (243)
T ss_pred CceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecCCC---cceEEEEEECCHHHHHHHH-hcCCCeeCCceEEEEeCc
Confidence 358999999999999999999999999999999998743 3579999999999999999 699999999999998754
No 27
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.66 E-value=6.9e-16 Score=123.32 Aligned_cols=80 Identities=33% Similarity=0.509 Sum_probs=75.2
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...+|||+|||..+++++|+++|++||.|..|.|+.++.++..+|||||+|.+.++|+.||. |+|..|.|..|.|.++.
T Consensus 88 ~~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~~~~~skg~afVeF~~~e~A~~Al~-l~g~~~~g~~i~v~~~~ 166 (457)
T TIGR01622 88 DDRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDRNSRRSKGVAYVEFYDVESVIKALA-LTGQMLLGRPIIVQSSQ 166 (457)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecCCCCCcceEEEEEECCHHHHHHHHH-hCCCEECCeeeEEeecc
Confidence 35789999999999999999999999999999999999899999999999999999999995 89999999999998875
Q ss_pred c
Q psy7051 92 Y 92 (158)
Q Consensus 92 ~ 92 (158)
.
T Consensus 167 ~ 167 (457)
T TIGR01622 167 A 167 (457)
T ss_pred h
Confidence 3
No 28
>PLN03213 repressor of silencing 3; Provisional
Probab=99.66 E-value=6.1e-16 Score=120.59 Aligned_cols=76 Identities=28% Similarity=0.456 Sum_probs=70.4
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCH--HHHHHHHHHhCCcccCCceEEEEe
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDK--RDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~--~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
...+||||||++.+++++|..+|..||.|..|.|+. .+| +|||||+|.+. .++..||..|||..|.|+.|+|..
T Consensus 9 ~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIpR--ETG--RGFAFVEMssdddaEeeKAISaLNGAEWKGR~LKVNK 84 (759)
T PLN03213 9 GGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFVR--TKG--RSFAYIDFSPSSTNSLTKLFSTYNGCVWKGGRLRLEK 84 (759)
T ss_pred cceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEec--ccC--CceEEEEecCCcHHHHHHHHHHhcCCeecCceeEEee
Confidence 347899999999999999999999999999999984 466 89999999987 689999999999999999999999
Q ss_pred ee
Q psy7051 90 AR 91 (158)
Q Consensus 90 a~ 91 (158)
|+
T Consensus 85 AK 86 (759)
T PLN03213 85 AK 86 (759)
T ss_pred cc
Confidence 86
No 29
>KOG0125|consensus
Probab=99.65 E-value=4.1e-16 Score=115.88 Aligned_cols=80 Identities=34% Similarity=0.553 Sum_probs=73.8
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
...+.|+|.|||...-+.||..+|++||.|..|+|+.+. .-++||+||+|++.+||+.|-++|||..|.|++|.|..|
T Consensus 94 ~~pkRLhVSNIPFrFRdpDL~aMF~kfG~VldVEIIfNE--RGSKGFGFVTmen~~dadRARa~LHgt~VEGRkIEVn~A 171 (376)
T KOG0125|consen 94 DTPKRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEIIFNE--RGSKGFGFVTMENPADADRARAELHGTVVEGRKIEVNNA 171 (376)
T ss_pred CCCceeEeecCCccccCccHHHHHHhhCceeeEEEEecc--CCCCccceEEecChhhHHHHHHHhhcceeeceEEEEecc
Confidence 345789999999999999999999999999999999873 447999999999999999999999999999999999998
Q ss_pred ec
Q psy7051 91 RY 92 (158)
Q Consensus 91 ~~ 92 (158)
+.
T Consensus 172 Ta 173 (376)
T KOG0125|consen 172 TA 173 (376)
T ss_pred ch
Confidence 73
No 30
>KOG0131|consensus
Probab=99.65 E-value=3.8e-16 Score=107.47 Aligned_cols=82 Identities=30% Similarity=0.483 Sum_probs=77.6
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
.....||||+||+..++++.|.++|-+.|+|..+.++.+..++..+|||||+|.++|+|+-||+.||...|.|++|+|..
T Consensus 6 rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~Ef~~eedadYAikiln~VkLYgrpIrv~k 85 (203)
T KOG0131|consen 6 RNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFAEFRTEEDADYAIKILNMVKLYGRPIRVNK 85 (203)
T ss_pred cCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhhhcccccceeEEEEechhhhHHHHHHHHHHHhcCceeEEEe
Confidence 34568999999999999999999999999999999999999999999999999999999999999998899999999988
Q ss_pred ee
Q psy7051 90 AR 91 (158)
Q Consensus 90 a~ 91 (158)
+.
T Consensus 86 as 87 (203)
T KOG0131|consen 86 AS 87 (203)
T ss_pred cc
Confidence 75
No 31
>KOG0148|consensus
Probab=99.64 E-value=4.8e-16 Score=112.89 Aligned_cols=80 Identities=35% Similarity=0.543 Sum_probs=77.4
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
....|||+.|...|+-++|++.|.+||+|.+++|+.|..|++.+||+||.|.+.++|+.||..|||.+|+++.|...||.
T Consensus 61 ~hfhvfvgdls~eI~~e~lr~aF~pFGevS~akvirD~~T~KsKGYgFVSf~~k~dAEnAI~~MnGqWlG~R~IRTNWAT 140 (321)
T KOG0148|consen 61 QHFHVFVGDLSPEIDNEKLREAFAPFGEVSDAKVIRDMNTGKSKGYGFVSFPNKEDAENAIQQMNGQWLGRRTIRTNWAT 140 (321)
T ss_pred cceeEEehhcchhcchHHHHHHhccccccccceEeecccCCcccceeEEeccchHHHHHHHHHhCCeeeccceeeccccc
Confidence 35679999999999999999999999999999999999999999999999999999999999999999999999999997
No 32
>smart00360 RRM RNA recognition motif.
Probab=99.64 E-value=1.5e-15 Score=90.19 Aligned_cols=71 Identities=44% Similarity=0.759 Sum_probs=66.1
Q ss_pred EeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 18 VDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 18 V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
|+|||..+++++|+++|++||.|..+.+..++.++.+.++|||+|.+.++|..|++.|++..+.|..|.|.
T Consensus 1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~~~~~~~~~a~v~f~~~~~a~~a~~~~~~~~~~~~~~~v~ 71 (71)
T smart00360 1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDKDTGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKVK 71 (71)
T ss_pred CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCCCCCCCCceEEEEeCCHHHHHHHHHHcCCCeeCCcEEEeC
Confidence 57999999999999999999999999998887778889999999999999999999999999999998873
No 33
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.64 E-value=1.7e-15 Score=123.94 Aligned_cols=80 Identities=39% Similarity=0.586 Sum_probs=75.3
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...+|||+||+..+++++|+++|++||.|+.|.++.+ .++..+|||||+|.+.++|++|+..|||..|+|+.|.|.++.
T Consensus 284 ~~~~l~V~nl~~~~~~~~L~~~F~~~G~i~~~~i~~d-~~g~~~g~gfV~f~~~~~A~~A~~~~~g~~~~gk~l~V~~a~ 362 (562)
T TIGR01628 284 QGVNLYVKNLDDTVTDEKLRELFSECGEITSAKVMLD-EKGVSRGFGFVCFSNPEEANRAVTEMHGRMLGGKPLYVALAQ 362 (562)
T ss_pred CCCEEEEeCCCCccCHHHHHHHHHhcCCeEEEEEEEC-CCCCcCCeEEEEeCCHHHHHHHHHHhcCCeeCCceeEEEecc
Confidence 3468999999999999999999999999999999988 588899999999999999999999999999999999999987
Q ss_pred c
Q psy7051 92 Y 92 (158)
Q Consensus 92 ~ 92 (158)
.
T Consensus 363 ~ 363 (562)
T TIGR01628 363 R 363 (562)
T ss_pred C
Confidence 3
No 34
>KOG0126|consensus
Probab=99.63 E-value=2.5e-17 Score=113.30 Aligned_cols=79 Identities=34% Similarity=0.544 Sum_probs=75.3
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..-|||||||...||.+|..+|++||+|+.|.++.|+.||+++||||+.|++..+..-|+..|||..|.|+.|+|...-
T Consensus 35 sA~Iyiggl~~~LtEgDil~VFSqyGe~vdinLiRDk~TGKSKGFaFLcYEDQRSTILAVDN~NGiki~gRtirVDHv~ 113 (219)
T KOG0126|consen 35 SAYIYIGGLPYELTEGDILCVFSQYGEIVDINLIRDKKTGKSKGFAFLCYEDQRSTILAVDNLNGIKILGRTIRVDHVS 113 (219)
T ss_pred ceEEEECCCcccccCCcEEEEeeccCceEEEEEEecCCCCcccceEEEEecCccceEEEEeccCCceecceeEEeeecc
Confidence 3569999999999999999999999999999999999999999999999999999999999999999999999999753
No 35
>KOG0415|consensus
Probab=99.63 E-value=4.8e-16 Score=116.70 Aligned_cols=81 Identities=32% Similarity=0.434 Sum_probs=77.0
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
++...|||..|++-++.++|+-+|+.||.|..|.|+.+..||....||||+|++.++++.|+-+|++..|..+.|+|.|+
T Consensus 237 PPeNVLFVCKLNPVTtDeDLeiIFSrFG~i~sceVIRD~ktgdsLqyaFiEFen~escE~AyFKMdNvLIDDrRIHVDFS 316 (479)
T KOG0415|consen 237 PPENVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDRKTGDSLQYAFIEFENKESCEQAYFKMDNVLIDDRRIHVDFS 316 (479)
T ss_pred CCcceEEEEecCCcccccchhhHHhhcccceeeeEEecccccchhheeeeeecchhhHHHHHhhhcceeeccceEEeehh
Confidence 44467999999999999999999999999999999999999999999999999999999999999999999999999996
Q ss_pred e
Q psy7051 91 R 91 (158)
Q Consensus 91 ~ 91 (158)
.
T Consensus 317 Q 317 (479)
T KOG0415|consen 317 Q 317 (479)
T ss_pred h
Confidence 4
No 36
>KOG0114|consensus
Probab=99.63 E-value=3.2e-15 Score=94.10 Aligned_cols=77 Identities=31% Similarity=0.422 Sum_probs=70.4
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
....|||.|||..+|.+++.++|.+||.|.+|.+-..+.| +|.|||.|++..+|..|+.+|+|..+.+..+.|-+-.
T Consensus 17 vnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k~T---rGTAFVVYedi~dAk~A~dhlsg~n~~~ryl~vlyyq 93 (124)
T KOG0114|consen 17 VNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTKET---RGTAFVVYEDIFDAKKACDHLSGYNVDNRYLVVLYYQ 93 (124)
T ss_pred hheeEEEecCCccccHHHHHHHhhcccceEEEEecCccCc---CceEEEEehHhhhHHHHHHHhcccccCCceEEEEecC
Confidence 3478999999999999999999999999999999766544 6999999999999999999999999999999998754
No 37
>KOG0148|consensus
Probab=99.63 E-value=1.7e-15 Score=110.07 Aligned_cols=78 Identities=28% Similarity=0.468 Sum_probs=72.4
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
+..++|||||++..+++++|++.|++||+|.+|.+.++ +|||||.|.+.|.|..||..+|+.+|.|..|++.|-
T Consensus 162 p~NtsVY~G~I~~~lte~~mr~~Fs~fG~I~EVRvFk~------qGYaFVrF~tkEaAahAIv~mNntei~G~~VkCsWG 235 (321)
T KOG0148|consen 162 PDNTSVYVGNIASGLTEDLMRQTFSPFGPIQEVRVFKD------QGYAFVRFETKEAAAHAIVQMNNTEIGGQLVRCSWG 235 (321)
T ss_pred CCCceEEeCCcCccccHHHHHHhcccCCcceEEEEecc------cceEEEEecchhhHHHHHHHhcCceeCceEEEEecc
Confidence 44589999999999999999999999999999999887 589999999999999999999999999999999998
Q ss_pred ecCC
Q psy7051 91 RYGR 94 (158)
Q Consensus 91 ~~~~ 94 (158)
+...
T Consensus 236 Ke~~ 239 (321)
T KOG0148|consen 236 KEGD 239 (321)
T ss_pred ccCC
Confidence 7543
No 38
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.63 E-value=3.9e-15 Score=120.77 Aligned_cols=74 Identities=34% Similarity=0.521 Sum_probs=68.5
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhc--CceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERC--GEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~--G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
..++|||+||+..+++++|+++|++| |+|+.|.++ ++||||+|.+.++|++|++.||+.+|.|+.|+|.+
T Consensus 232 ~~k~LfVgNL~~~~tee~L~~~F~~f~~G~I~rV~~~--------rgfAFVeF~s~e~A~kAi~~lnG~~i~Gr~I~V~~ 303 (578)
T TIGR01648 232 KVKILYVRNLMTTTTEEIIEKSFSEFKPGKVERVKKI--------RDYAFVHFEDREDAVKAMDELNGKELEGSEIEVTL 303 (578)
T ss_pred cccEEEEeCCCCCCCHHHHHHHHHhcCCCceEEEEee--------cCeEEEEeCCHHHHHHHHHHhCCCEECCEEEEEEE
Confidence 35789999999999999999999999 999999875 35999999999999999999999999999999999
Q ss_pred eecC
Q psy7051 90 ARYG 93 (158)
Q Consensus 90 a~~~ 93 (158)
++..
T Consensus 304 Akp~ 307 (578)
T TIGR01648 304 AKPV 307 (578)
T ss_pred ccCC
Confidence 9854
No 39
>KOG0145|consensus
Probab=99.62 E-value=1.7e-15 Score=109.68 Aligned_cols=84 Identities=32% Similarity=0.548 Sum_probs=79.3
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
.+..+.|.|.-||..+|++||+.+|...|+|+.|+++.|+.+|.+.||+||.|..++||+.|+..|||..|..+.|+|.|
T Consensus 38 ~~skTNLIvNYLPQ~MTqdE~rSLF~SiGeiEScKLvRDKitGqSLGYGFVNYv~p~DAe~AintlNGLrLQ~KTIKVSy 117 (360)
T KOG0145|consen 38 DESKTNLIVNYLPQNMTQDELRSLFGSIGEIESCKLVRDKITGQSLGYGFVNYVRPKDAEKAINTLNGLRLQNKTIKVSY 117 (360)
T ss_pred CcccceeeeeecccccCHHHHHHHhhcccceeeeeeeeccccccccccceeeecChHHHHHHHhhhcceeeccceEEEEe
Confidence 34457899999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eecC
Q psy7051 90 ARYG 93 (158)
Q Consensus 90 a~~~ 93 (158)
|++.
T Consensus 118 ARPS 121 (360)
T KOG0145|consen 118 ARPS 121 (360)
T ss_pred ccCC
Confidence 9844
No 40
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.62 E-value=3.5e-15 Score=120.04 Aligned_cols=78 Identities=24% Similarity=0.290 Sum_probs=71.6
Q ss_pred CCCcEEEEeCCCC-CCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 11 EGMVSLKVDNLTY-RTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 11 ~~~~~l~V~nl~~-~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
.+..+|||+||++ .+++++|.++|+.||.|..|+++.++ +|+|||+|.+.++|+.|++.|||..|.|+.|.|.+
T Consensus 273 ~~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~~~-----~g~afV~f~~~~~A~~Ai~~lng~~l~g~~l~v~~ 347 (481)
T TIGR01649 273 GPGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMKNK-----KETALIEMADPYQAQLALTHLNGVKLFGKPLRVCP 347 (481)
T ss_pred CCCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEECCceEEEEE
Confidence 3557999999998 69999999999999999999998763 58999999999999999999999999999999999
Q ss_pred eecC
Q psy7051 90 ARYG 93 (158)
Q Consensus 90 a~~~ 93 (158)
++..
T Consensus 348 s~~~ 351 (481)
T TIGR01649 348 SKQQ 351 (481)
T ss_pred cccc
Confidence 8754
No 41
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.62 E-value=6.6e-15 Score=88.11 Aligned_cols=74 Identities=49% Similarity=0.773 Sum_probs=67.8
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
+|+|+|||..+++++|.++|..||.|..+.+..+..+ ...++|||+|.+.++|..|++.+++..+.|..|.|.+
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~~~~~-~~~~~~~v~f~s~~~a~~a~~~~~~~~~~~~~~~v~~ 74 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVRDKDT-KSKGFAFVEFEDEEDAEKALEALNGKELGGRPLRVEF 74 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEeeCCCC-CcceEEEEEECCHHHHHHHHHHhCCCeECCeEEEEeC
Confidence 4899999999999999999999999999999877644 5688999999999999999999999999999998864
No 42
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.61 E-value=4.7e-15 Score=109.49 Aligned_cols=79 Identities=46% Similarity=0.723 Sum_probs=76.2
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..+|||+|||..+++++|.++|.+||.|..+.+..++.++..+|||||+|.+.++|..|++.+++..|.|+.|.|.++.
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~f~~~~~~~~a~~~~~~~~~~~~~~~v~~~~ 193 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVEFESEESAEKAIEELNGKELEGRPLRVQKAQ 193 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeeccccCccCceEEEEecCHHHHHHHHHHcCCCeECCceeEeeccc
Confidence 5899999999999999999999999999999999998899999999999999999999999999999999999999975
No 43
>KOG0108|consensus
Probab=99.59 E-value=3.7e-15 Score=116.95 Aligned_cols=82 Identities=32% Similarity=0.532 Sum_probs=78.8
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeecC
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARYG 93 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~~ 93 (158)
..|||||+|+++++++|.++|+..|.|..++++.|..+|+.+|||||+|.+.++|+.|++.|||.++.|++|+|.|+...
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~~~~~~~~~~a~~~lNg~~~~gr~l~v~~~~~~ 98 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCEFTDEETAERAIRNLNGAEFNGRKLRVNYASNR 98 (435)
T ss_pred cceEecCCCCcccHHHHHHHHhccCccceeeecccccCCCcCceeeEecCchhhHHHHHHhcCCcccCCceEEeeccccc
Confidence 78999999999999999999999999999999999999999999999999999999999999999999999999998755
Q ss_pred CC
Q psy7051 94 RP 95 (158)
Q Consensus 94 ~~ 95 (158)
..
T Consensus 99 ~~ 100 (435)
T KOG0108|consen 99 KN 100 (435)
T ss_pred ch
Confidence 43
No 44
>KOG0145|consensus
Probab=99.59 E-value=1.2e-14 Score=105.32 Aligned_cols=80 Identities=28% Similarity=0.392 Sum_probs=76.6
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
....|||-||.++++|..|+++|.+||.|..|+++.|..+.+++||+||.+.+.++|..||..|||..+.++.|.|.|..
T Consensus 277 ~g~ciFvYNLspd~de~~LWQlFgpFGAv~nVKvirD~ttnkCKGfgFVtMtNYdEAamAi~sLNGy~lg~rvLQVsFKt 356 (360)
T KOG0145|consen 277 GGWCIFVYNLSPDADESILWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVTMTNYDEAAMAIASLNGYRLGDRVLQVSFKT 356 (360)
T ss_pred CeeEEEEEecCCCchHhHHHHHhCcccceeeEEEEecCCcccccceeEEEecchHHHHHHHHHhcCccccceEEEEEEec
Confidence 45789999999999999999999999999999999999999999999999999999999999999999999999999865
No 45
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.58 E-value=9.7e-15 Score=117.52 Aligned_cols=74 Identities=18% Similarity=0.186 Sum_probs=67.7
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh--CCcccCCceEEEEee
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM--DGRMLDGRELRVQMA 90 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l--~g~~i~g~~i~v~~a 90 (158)
..+|||+|||+.+++++|.++|++||.|..|.++.+ ++||||+|.+.++|+.|++.| ++..|.|+.|.|.++
T Consensus 2 s~vv~V~nLp~~~te~~L~~~f~~fG~V~~v~i~~~------k~~afVef~~~e~A~~Ai~~~~~~~~~l~g~~l~v~~s 75 (481)
T TIGR01649 2 SPVVHVRNLPQDVVEADLVEALIPFGPVSYVMMLPG------KRQALVEFEDEESAKACVNFATSVPIYIRGQPAFFNYS 75 (481)
T ss_pred ccEEEEcCCCCCCCHHHHHHHHHhcCCeeEEEEECC------CCEEEEEeCchHHHHHHHHHhhcCCceEcCeEEEEEec
Confidence 478999999999999999999999999999998764 479999999999999999864 678999999999998
Q ss_pred ec
Q psy7051 91 RY 92 (158)
Q Consensus 91 ~~ 92 (158)
..
T Consensus 76 ~~ 77 (481)
T TIGR01649 76 TS 77 (481)
T ss_pred CC
Confidence 64
No 46
>KOG0144|consensus
Probab=99.57 E-value=1.1e-14 Score=111.69 Aligned_cols=85 Identities=29% Similarity=0.569 Sum_probs=76.2
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcc-cCC--ceEEE
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRM-LDG--RELRV 87 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~-i~g--~~i~v 87 (158)
....++|||.+|..++|.||+++|++||.|.+|.|++|+.++..+|||||.|.+.++|.+|+..|++.. |-| ..|.|
T Consensus 32 ~~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk~t~~s~gcCFv~~~trk~a~~a~~Alhn~ktlpG~~~pvqv 111 (510)
T KOG0144|consen 32 GSAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDKSTGQSKGCCFVKYYTRKEADEAINALHNQKTLPGMHHPVQV 111 (510)
T ss_pred chhhhheeccCCccccHHHHHHHHHHhCceeEEEeecccccCcccceEEEEeccHHHHHHHHHHhhcccccCCCCcceee
Confidence 445689999999999999999999999999999999999999999999999999999999999999875 433 67899
Q ss_pred EeeecCCC
Q psy7051 88 QMARYGRP 95 (158)
Q Consensus 88 ~~a~~~~~ 95 (158)
.+|...+.
T Consensus 112 k~Ad~E~e 119 (510)
T KOG0144|consen 112 KYADGERE 119 (510)
T ss_pred cccchhhh
Confidence 99875443
No 47
>KOG0109|consensus
Probab=99.55 E-value=7.7e-15 Score=107.66 Aligned_cols=72 Identities=33% Similarity=0.541 Sum_probs=68.1
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeecC
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARYG 93 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~~ 93 (158)
.+|||||||..+++++|+.+|++||+|.+|.|+++ |+||..++...++.||..||+..|+|..|+|+-++..
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlECDIvKN--------YgFVHiEdktaaedairNLhgYtLhg~nInVeaSksK 74 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFEQYGKVLECDIVKN--------YGFVHIEDKTAAEDAIRNLHGYTLHGVNINVEASKSK 74 (346)
T ss_pred cchhccCCCcccchHHHHHHHHhhCceEeeeeecc--------cceEEeecccccHHHHhhcccceecceEEEEEecccc
Confidence 57999999999999999999999999999999866 9999999999999999999999999999999988754
No 48
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=99.54 E-value=5.6e-14 Score=80.82 Aligned_cols=56 Identities=43% Similarity=0.786 Sum_probs=50.6
Q ss_pred HHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 30 LRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 30 l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
|.++|++||+|..+.+..+. .++|||+|.+.++|+.|++.|||..|.|+.|+|+||
T Consensus 1 L~~~f~~fG~V~~i~~~~~~-----~~~a~V~f~~~~~A~~a~~~l~~~~~~g~~l~V~~a 56 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKKK-----RGFAFVEFASVEDAQKAIEQLNGRQFNGRPLKVSYA 56 (56)
T ss_dssp HHHHHTTTS-EEEEEEETTS-----TTEEEEEESSHHHHHHHHHHHTTSEETTEEEEEEEE
T ss_pred ChHHhCCcccEEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEECCcEEEEEEC
Confidence 67899999999999987653 579999999999999999999999999999999986
No 49
>KOG0127|consensus
Probab=99.53 E-value=4.7e-14 Score=111.06 Aligned_cols=82 Identities=38% Similarity=0.568 Sum_probs=74.4
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh-----CC-cccCCc
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM-----DG-RMLDGR 83 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l-----~g-~~i~g~ 83 (158)
+....||||.|||+.+++++|.+.|.+||+|.++.++.++.|++++|.|||.|.+..+|++||... .| ..|.|+
T Consensus 289 ~~~~~tVFvRNL~fD~tEEel~~~fskFG~v~ya~iV~~k~T~~skGtAFv~Fkt~~~~~~ci~~Aspa~e~g~~ll~GR 368 (678)
T KOG0127|consen 289 ITEGKTVFVRNLPFDTTEEELKEHFSKFGEVKYAIIVKDKDTGHSKGTAFVKFKTQIAAQNCIEAASPASEDGSVLLDGR 368 (678)
T ss_pred ccccceEEEecCCccccHHHHHHHHHhhccceeEEEEeccCCCCcccceEEEeccHHHHHHHHHhcCccCCCceEEEecc
Confidence 444589999999999999999999999999999999999999999999999999999999999876 23 467999
Q ss_pred eEEEEeee
Q psy7051 84 ELRVQMAR 91 (158)
Q Consensus 84 ~i~v~~a~ 91 (158)
.|+|..|-
T Consensus 369 ~Lkv~~Av 376 (678)
T KOG0127|consen 369 LLKVTLAV 376 (678)
T ss_pred EEeeeecc
Confidence 99999874
No 50
>KOG0117|consensus
Probab=99.52 E-value=3.6e-14 Score=109.32 Aligned_cols=72 Identities=36% Similarity=0.627 Sum_probs=68.1
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
.+.|||.||+.++|++.|+++|.+||.|+.|+.+.| ||||.|.+.++|.+|++.|||.+|+|..|.|.+|+.
T Consensus 259 VKvLYVRNL~~~tTeE~lk~~F~~~G~veRVkk~rD--------YaFVHf~eR~davkAm~~~ngkeldG~~iEvtLAKP 330 (506)
T KOG0117|consen 259 VKVLYVRNLMESTTEETLKKLFNEFGKVERVKKPRD--------YAFVHFAEREDAVKAMKETNGKELDGSPIEVTLAKP 330 (506)
T ss_pred eeeeeeeccchhhhHHHHHHHHHhccceEEeecccc--------eeEEeecchHHHHHHHHHhcCceecCceEEEEecCC
Confidence 378999999999999999999999999999988755 999999999999999999999999999999999984
No 51
>KOG0127|consensus
Probab=99.51 E-value=7.3e-14 Score=110.02 Aligned_cols=78 Identities=40% Similarity=0.666 Sum_probs=72.2
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...|+|.|||+.+...+|+.+|+.||.|.+|.|+... .++..|||||+|....+|..|++.||+..|.|++|-|.||-
T Consensus 117 k~rLIIRNLPf~~k~~dLk~vFs~~G~V~Ei~IP~k~-dgklcGFaFV~fk~~~dA~~Al~~~N~~~i~gR~VAVDWAV 194 (678)
T KOG0127|consen 117 KWRLIIRNLPFKCKKPDLKNVFSNFGKVVEIVIPRKK-DGKLCGFAFVQFKEKKDAEKALEFFNGNKIDGRPVAVDWAV 194 (678)
T ss_pred cceEEeecCCcccCcHHHHHHHhhcceEEEEEcccCC-CCCccceEEEEEeeHHHHHHHHHhccCceecCceeEEeeec
Confidence 5689999999999999999999999999999999776 44555999999999999999999999999999999999974
No 52
>smart00361 RRM_1 RNA recognition motif.
Probab=99.51 E-value=1e-13 Score=83.38 Aligned_cols=61 Identities=34% Similarity=0.602 Sum_probs=54.4
Q ss_pred HHHHHHHhh----hcCceEEEE-EccCCCC--CCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEE
Q psy7051 27 PDDLRRVFE----RCGEVGDIY-IPRDRFT--RESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRV 87 (158)
Q Consensus 27 ~~~l~~~f~----~~G~i~~v~-~~~~~~~--~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v 87 (158)
+++|.++|. +||.|..|. ++.++.+ +..+|||||+|.+.++|..|++.|||..|.|+.|.+
T Consensus 2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~f~~~~dA~~A~~~l~g~~~~gr~l~~ 69 (70)
T smart00361 2 DEDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYITFERSEDAARAIVDLNGRYFDGRTVKA 69 (70)
T ss_pred chhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEEECCHHHHHHHHHHhCCCEECCEEEEe
Confidence 578888888 999999985 6666555 889999999999999999999999999999999986
No 53
>KOG0147|consensus
Probab=99.51 E-value=2.3e-14 Score=112.73 Aligned_cols=78 Identities=36% Similarity=0.635 Sum_probs=74.5
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..|||+||..++++++|..+|++||.|..|.+.++..+|.++||+||+|.+.++|..|++.|||.+|.|+.|+|..-.
T Consensus 279 ~rl~vgnLHfNite~~lr~ifepfg~Ie~v~l~~d~~tG~skgfGfi~f~~~~~ar~a~e~lngfelAGr~ikV~~v~ 356 (549)
T KOG0147|consen 279 RRLYVGNLHFNITEDMLRGIFEPFGKIENVQLTKDSETGRSKGFGFITFVNKEDARKALEQLNGFELAGRLIKVSVVT 356 (549)
T ss_pred hhhhhcccccCchHHHHhhhccCcccceeeeeccccccccccCcceEEEecHHHHHHHHHHhccceecCceEEEEEee
Confidence 349999999999999999999999999999999999999999999999999999999999999999999999998755
No 54
>KOG4212|consensus
Probab=99.50 E-value=1.1e-13 Score=106.60 Aligned_cols=78 Identities=28% Similarity=0.433 Sum_probs=72.9
Q ss_pred cEEEEeCCCCCCCHHHHHHHhh-hcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFE-RCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~-~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..+||+|+|+++.|++|+++|. +.|+|++|.+..|. +++.+|||.|+|+++|.+++|+++||..+|.|+.|+|+....
T Consensus 45 R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D~-~GK~rGcavVEFk~~E~~qKa~E~lnk~~~~GR~l~vKEd~d 123 (608)
T KOG4212|consen 45 RSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFDE-SGKARGCAVVEFKDPENVQKALEKLNKYEVNGRELVVKEDHD 123 (608)
T ss_pred ceEEEecCcchhhhHhHHHHHHHhcCceEeeeeeccc-CCCcCCceEEEeeCHHHHHHHHHHhhhccccCceEEEeccCc
Confidence 4599999999999999999996 57999999999985 899999999999999999999999999999999999998764
No 55
>KOG0144|consensus
Probab=99.50 E-value=3.3e-14 Score=109.11 Aligned_cols=85 Identities=34% Similarity=0.515 Sum_probs=76.0
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcc-cCC--ceEEEE
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRM-LDG--RELRVQ 88 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~-i~g--~~i~v~ 88 (158)
...+|||+-|+..++|.+|+++|.+||.|++|.|..+. .+.++|||||.|.+.+.|..||+.|||.. +.| .+|.|.
T Consensus 123 ~e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd~-~~~sRGcaFV~fstke~A~~Aika~ng~~tmeGcs~PLVVk 201 (510)
T KOG0144|consen 123 EERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRDP-DGLSRGCAFVKFSTKEMAVAAIKALNGTQTMEGCSQPLVVK 201 (510)
T ss_pred cchhhhhhhccccccHHHHHHHHHhhCccchhhheecc-cccccceeEEEEehHHHHHHHHHhhccceeeccCCCceEEE
Confidence 35789999999999999999999999999999999986 77899999999999999999999999984 555 579999
Q ss_pred eeecCCCCC
Q psy7051 89 MARYGRPPS 97 (158)
Q Consensus 89 ~a~~~~~~~ 97 (158)
||...+++.
T Consensus 202 FADtqkdk~ 210 (510)
T KOG0144|consen 202 FADTQKDKD 210 (510)
T ss_pred ecccCCCch
Confidence 998766543
No 56
>KOG0109|consensus
Probab=99.44 E-value=1.8e-13 Score=100.52 Aligned_cols=75 Identities=31% Similarity=0.504 Sum_probs=69.8
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
...++|+|+||.+.++.+||++.|++||.|.+|+|+++ |+||.|+-.++|..|++.||+.+|.|+.++|+++
T Consensus 76 k~stkl~vgNis~tctn~ElRa~fe~ygpviecdivkd--------y~fvh~d~~eda~~air~l~~~~~~gk~m~vq~s 147 (346)
T KOG0109|consen 76 KASTKLHVGNISPTCTNQELRAKFEKYGPVIECDIVKD--------YAFVHFDRAEDAVEAIRGLDNTEFQGKRMHVQLS 147 (346)
T ss_pred CCccccccCCCCccccCHHHhhhhcccCCceeeeeecc--------eeEEEEeeccchHHHHhcccccccccceeeeeee
Confidence 34578999999999999999999999999999999765 9999999999999999999999999999999998
Q ss_pred ecC
Q psy7051 91 RYG 93 (158)
Q Consensus 91 ~~~ 93 (158)
+..
T Consensus 148 tsr 150 (346)
T KOG0109|consen 148 TSR 150 (346)
T ss_pred ccc
Confidence 743
No 57
>KOG0124|consensus
Probab=99.44 E-value=9.5e-14 Score=104.96 Aligned_cols=77 Identities=32% Similarity=0.586 Sum_probs=74.3
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
+..||||.+.+.+.|+.|...|.+||+|+.|.+..|+.|++.+|||||+|+-+|.|+-|++.|||..++|+.|+|..
T Consensus 113 McRvYVGSIsfEl~EDtiR~AF~PFGPIKSInMSWDp~T~kHKgFAFVEYEvPEaAqLAlEqMNg~mlGGRNiKVgr 189 (544)
T KOG0124|consen 113 MCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQMLGGRNIKVGR 189 (544)
T ss_pred hHheeeeeeEEEechHHHHhhccCCCCcceeecccccccccccceEEEEEeCcHHHHHHHHHhccccccCccccccC
Confidence 46799999999999999999999999999999999999999999999999999999999999999999999999974
No 58
>KOG0131|consensus
Probab=99.41 E-value=9.3e-13 Score=90.86 Aligned_cols=85 Identities=25% Similarity=0.308 Sum_probs=76.4
Q ss_pred CCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEE-EEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEE
Q psy7051 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI-YIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRV 87 (158)
Q Consensus 9 ~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v-~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v 87 (158)
+......|||+||.+.+++..|.+.|+.||.|... +++.+..+|.+++|+||.|.+.+.+.+|+..|||..+++++|.|
T Consensus 92 nl~vganlfvgNLd~~vDe~~L~dtFsafG~l~~~P~i~rd~~tg~~~~~g~i~~~sfeasd~ai~s~ngq~l~nr~itv 171 (203)
T KOG0131|consen 92 NLDVGANLFVGNLDPEVDEKLLYDTFSAFGVLISPPKIMRDPDTGNPKGFGFINYASFEASDAAIGSMNGQYLCNRPITV 171 (203)
T ss_pred cccccccccccccCcchhHHHHHHHHHhccccccCCcccccccCCCCCCCeEEechhHHHHHHHHHHhccchhcCCceEE
Confidence 34445789999999999999999999999988653 77888889999999999999999999999999999999999999
Q ss_pred EeeecC
Q psy7051 88 QMARYG 93 (158)
Q Consensus 88 ~~a~~~ 93 (158)
.++...
T Consensus 172 ~ya~k~ 177 (203)
T KOG0131|consen 172 SYAFKK 177 (203)
T ss_pred EEEEec
Confidence 999743
No 59
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.41 E-value=1.2e-12 Score=106.08 Aligned_cols=73 Identities=19% Similarity=0.290 Sum_probs=60.2
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhc------------CceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcc
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERC------------GEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRM 79 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~------------G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~ 79 (158)
...+|||+|||+.+++++|.++|.++ +.|..+.+. ..+|||||+|.+.++|+.|| .|+|..
T Consensus 174 ~~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~~------~~kg~afVeF~~~e~A~~Al-~l~g~~ 246 (509)
T TIGR01642 174 QARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNIN------KEKNFAFLEFRTVEEATFAM-ALDSII 246 (509)
T ss_pred cccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEEC------CCCCEEEEEeCCHHHHhhhh-cCCCeE
Confidence 35789999999999999999999975 233444432 24689999999999999999 599999
Q ss_pred cCCceEEEEeee
Q psy7051 80 LDGRELRVQMAR 91 (158)
Q Consensus 80 i~g~~i~v~~a~ 91 (158)
|.|..|+|....
T Consensus 247 ~~g~~l~v~r~~ 258 (509)
T TIGR01642 247 YSNVFLKIRRPH 258 (509)
T ss_pred eeCceeEecCcc
Confidence 999999986543
No 60
>KOG0146|consensus
Probab=99.37 E-value=1e-12 Score=95.80 Aligned_cols=82 Identities=27% Similarity=0.342 Sum_probs=77.5
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
.++...|||-.||.+..+.||..+|-+||.|+..++..|..|..++.|+||.|+++.++++||..|||..|+.+.|+|.+
T Consensus 282 GPeGCNlFIYHLPQEFgDaEliQmF~PFGhivSaKVFvDRATNQSKCFGFVSfDNp~SaQaAIqAMNGFQIGMKRLKVQL 361 (371)
T KOG0146|consen 282 GPEGCNLFIYHLPQEFGDAELIQMFLPFGHIVSAKVFVDRATNQSKCFGFVSFDNPASAQAAIQAMNGFQIGMKRLKVQL 361 (371)
T ss_pred CCCcceEEEEeCchhhccHHHHHHhccccceeeeeeeehhccccccceeeEecCCchhHHHHHHHhcchhhhhhhhhhhh
Confidence 34568899999999999999999999999999999999999999999999999999999999999999999999999998
Q ss_pred ee
Q psy7051 90 AR 91 (158)
Q Consensus 90 a~ 91 (158)
.+
T Consensus 362 KR 363 (371)
T KOG0146|consen 362 KR 363 (371)
T ss_pred cC
Confidence 76
No 61
>KOG0153|consensus
Probab=99.34 E-value=4.3e-12 Score=95.28 Aligned_cols=80 Identities=28% Similarity=0.475 Sum_probs=68.8
Q ss_pred CCCCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHH-hCCcccCCceE
Q psy7051 7 PPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDA-MDGRMLDGREL 85 (158)
Q Consensus 7 ~~~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~-l~g~~i~g~~i 85 (158)
|++.....+|||++|...+++.+|.++|.+||+|..+.++.. .++|||+|.+.+.|+.|.++ ++...|+|..|
T Consensus 222 pPeD~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~~~------~~CAFv~ftTR~aAE~Aae~~~n~lvI~G~Rl 295 (377)
T KOG0153|consen 222 PPEDTSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRILPR------KGCAFVTFTTREAAEKAAEKSFNKLVINGFRL 295 (377)
T ss_pred CCcccceeEEEecccccchhHHHHHHHHhhcCCeeeEEeecc------cccceeeehhhHHHHHHHHhhcceeeecceEE
Confidence 445556689999999999999999999999999999988765 46999999999999988775 55556899999
Q ss_pred EEEeeec
Q psy7051 86 RVQMARY 92 (158)
Q Consensus 86 ~v~~a~~ 92 (158)
+|.|...
T Consensus 296 ~i~Wg~~ 302 (377)
T KOG0153|consen 296 KIKWGRP 302 (377)
T ss_pred EEEeCCC
Confidence 9998775
No 62
>KOG4206|consensus
Probab=99.34 E-value=7.4e-12 Score=89.17 Aligned_cols=80 Identities=33% Similarity=0.488 Sum_probs=71.1
Q ss_pred CCCcEEEEeCCCCCCCHHHHHH----HhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRR----VFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~----~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
.+..||||.||+..+..++|+. +|++||.|..|.... +.+.+|-|||.|.+.+.|-.|+..|+|..+.|+.+.
T Consensus 7 ~pn~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~k---t~KmRGQA~VvFk~~~~As~A~r~l~gfpFygK~mr 83 (221)
T KOG4206|consen 7 NPNGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAFK---TPKMRGQAFVVFKETEAASAALRALQGFPFYGKPMR 83 (221)
T ss_pred CCCceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEecC---CCCccCceEEEecChhHHHHHHHHhcCCcccCchhh
Confidence 3445999999999999999887 999999998887643 566789999999999999999999999999999999
Q ss_pred EEeeecC
Q psy7051 87 VQMARYG 93 (158)
Q Consensus 87 v~~a~~~ 93 (158)
|+||+..
T Consensus 84 iqyA~s~ 90 (221)
T KOG4206|consen 84 IQYAKSD 90 (221)
T ss_pred eecccCc
Confidence 9999843
No 63
>KOG4208|consensus
Probab=99.30 E-value=1.5e-11 Score=86.44 Aligned_cols=81 Identities=22% Similarity=0.282 Sum_probs=73.6
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhc-CceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERC-GEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~-G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
....++|..+|..+.+.+|..+|.+| |.|..+.+..++.||.++|||||+|++++.|+-|.+.||+..|.++-|.|.+-
T Consensus 48 ~~g~~~~~~~p~g~~e~~~~~~~~q~~g~v~r~rlsRnkrTGNSKgYAFVEFEs~eVA~IaAETMNNYLl~e~lL~c~vm 127 (214)
T KOG4208|consen 48 IEGVVYVDHIPHGFFETEILNYFRQFGGTVTRFRLSRNKRTGNSKGYAFVEFESEEVAKIAAETMNNYLLMEHLLECHVM 127 (214)
T ss_pred CccceeecccccchhHHHHhhhhhhcCCeeEEEEeecccccCCcCceEEEEeccHHHHHHHHHHhhhhhhhhheeeeEEe
Confidence 34568999999999999999999998 67888888899999999999999999999999999999999999999998886
Q ss_pred ec
Q psy7051 91 RY 92 (158)
Q Consensus 91 ~~ 92 (158)
..
T Consensus 128 pp 129 (214)
T KOG4208|consen 128 PP 129 (214)
T ss_pred Cc
Confidence 53
No 64
>KOG0132|consensus
Probab=99.30 E-value=9.4e-12 Score=101.59 Aligned_cols=77 Identities=27% Similarity=0.458 Sum_probs=70.9
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeecC
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARYG 93 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~~ 93 (158)
+|||||+|+..++|++|..+|+.||+|..|.++.. .+||||.+....+|++|+.+|.+..|.+..|+|.|+-..
T Consensus 422 rTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~~------R~cAfI~M~~RqdA~kalqkl~n~kv~~k~Iki~Wa~g~ 495 (894)
T KOG0132|consen 422 RTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIPP------RGCAFIKMVRRQDAEKALQKLSNVKVADKTIKIAWAVGK 495 (894)
T ss_pred eeeeeccccchhhHHHHHHHHHhcccceeEeeccC------CceeEEEEeehhHHHHHHHHHhcccccceeeEEeeeccC
Confidence 79999999999999999999999999999988654 689999999999999999999999999999999999754
Q ss_pred CCC
Q psy7051 94 RPP 96 (158)
Q Consensus 94 ~~~ 96 (158)
..+
T Consensus 496 G~k 498 (894)
T KOG0132|consen 496 GPK 498 (894)
T ss_pred Ccc
Confidence 443
No 65
>KOG4661|consensus
Probab=99.29 E-value=8.4e-12 Score=99.22 Aligned_cols=78 Identities=23% Similarity=0.389 Sum_probs=74.2
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..|||.+|...+...+|+.||++||+|+..+|+.+..+.-...|+||++.+.++|..||+.|+..+|.|+.|.|+-++
T Consensus 406 RNlWVSGLSstTRAtDLKnlFSKyGKVvGAKVVTNaRsPGaRCYGfVTMSts~eAtkCI~hLHrTELHGrmISVEkaK 483 (940)
T KOG4661|consen 406 RNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNARSPGARCYGFVTMSTSAEATKCIEHLHRTELHGRMISVEKAK 483 (940)
T ss_pred cceeeeccccchhhhHHHHHHHHhcceeceeeeecCCCCCcceeEEEEecchHHHHHHHHHhhhhhhcceeeeeeecc
Confidence 579999999999999999999999999999999998777788999999999999999999999999999999999876
No 66
>KOG0123|consensus
Probab=99.29 E-value=1.4e-11 Score=95.79 Aligned_cols=74 Identities=23% Similarity=0.470 Sum_probs=69.6
Q ss_pred EEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
|||.||++.++..+|.++|+.||+|..|++..+. .| .+|| ||+|++++.|.+|++.|||..+.+++|.|.....
T Consensus 79 ~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~~-~g-~kg~-FV~f~~e~~a~~ai~~~ng~ll~~kki~vg~~~~ 152 (369)
T KOG0123|consen 79 VFIKNLDESIDNKSLYDTFSEFGNILSCKVATDE-NG-SKGY-FVQFESEESAKKAIEKLNGMLLNGKKIYVGLFER 152 (369)
T ss_pred eeecCCCcccCcHHHHHHHHhhcCeeEEEEEEcC-CC-ceee-EEEeCCHHHHHHHHHHhcCcccCCCeeEEeeccc
Confidence 9999999999999999999999999999999986 44 8899 9999999999999999999999999999987653
No 67
>KOG0116|consensus
Probab=99.23 E-value=1.5e-10 Score=90.73 Aligned_cols=78 Identities=27% Similarity=0.396 Sum_probs=66.5
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
.+|||.|||.+++..+|+++|..||.|+...|......++..+||||+|.+.++++.||++ +-..|++++|.|+.-+.
T Consensus 289 ~~i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~~~~~~~~fgFV~f~~~~~~~~~i~A-sp~~ig~~kl~Veek~~ 366 (419)
T KOG0116|consen 289 LGIFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRSPGGKNPCFGFVEFENAAAVQNAIEA-SPLEIGGRKLNVEEKRP 366 (419)
T ss_pred cceEeecCCCCCCHHHHHHHHhhcccccccceEEeccCCCcCceEEEEEeecchhhhhhhc-CccccCCeeEEEEeccc
Confidence 4599999999999999999999999998876654433344458999999999999999975 57789999999998764
No 68
>KOG4212|consensus
Probab=99.23 E-value=3.6e-11 Score=92.93 Aligned_cols=76 Identities=34% Similarity=0.463 Sum_probs=67.4
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
....++|||.|||...||+.|++-|..||.|.++.|+. .++.+| .|.|.++++|+.||..|+|..|.|+.|+|.|
T Consensus 533 arKa~qIiirNlP~dfTWqmlrDKfre~G~v~yadime---~GkskG--VVrF~s~edAEra~a~Mngs~l~Gr~I~V~y 607 (608)
T KOG4212|consen 533 ARKACQIIIRNLPFDFTWQMLRDKFREIGHVLYADIME---NGKSKG--VVRFFSPEDAERACALMNGSRLDGRNIKVTY 607 (608)
T ss_pred cccccEEEEecCCccccHHHHHHHHHhccceehhhhhc---cCCccc--eEEecCHHHHHHHHHHhccCcccCceeeeee
Confidence 34457899999999999999999999999999998843 455555 8999999999999999999999999999987
Q ss_pred e
Q psy7051 90 A 90 (158)
Q Consensus 90 a 90 (158)
+
T Consensus 608 ~ 608 (608)
T KOG4212|consen 608 F 608 (608)
T ss_pred C
Confidence 4
No 69
>KOG0106|consensus
Probab=99.22 E-value=1.6e-11 Score=87.99 Aligned_cols=71 Identities=38% Similarity=0.638 Sum_probs=65.7
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..|||++||+.+.+.+|++||..||+|..|.|. .||+||+|++..+|..|+..||+.+|++..+.|++++.
T Consensus 2 ~rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~mk--------~gf~fv~fed~rda~Dav~~l~~~~l~~e~~vve~~r~ 72 (216)
T KOG0106|consen 2 PRVYIGRLPYRARERDVERFFKGYGKIPDADMK--------NGFGFVEFEDPRDADDAVHDLDGKELCGERLVVEHARG 72 (216)
T ss_pred CceeecccCCccchhHHHHHHhhccccccceee--------cccceeccCchhhhhcccchhcCceecceeeeeecccc
Confidence 469999999999999999999999999988873 46899999999999999999999999999999999884
No 70
>KOG0533|consensus
Probab=99.22 E-value=8.7e-11 Score=85.87 Aligned_cols=79 Identities=32% Similarity=0.516 Sum_probs=73.0
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..++|+|.||+..|++++|+++|..||.++.+.|..++ .|.+.|.|-|.|...++|+.|++.+|+..++|..|++....
T Consensus 82 ~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vhy~~-~G~s~Gta~v~~~r~~DA~~avk~~~gv~ldG~~mk~~~i~ 160 (243)
T KOG0533|consen 82 RSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVHYDR-AGRSLGTADVSFNRRDDAERAVKKYNGVALDGRPMKIEIIS 160 (243)
T ss_pred CcceeeeecCCcCcchHHHHHHHHHhccceEEeeccCC-CCCCCccceeeecchHhHHHHHHHhcCcccCCceeeeEEec
Confidence 34789999999999999999999999999988888875 88899999999999999999999999999999999888764
No 71
>KOG4209|consensus
Probab=99.21 E-value=1.2e-10 Score=85.08 Aligned_cols=84 Identities=29% Similarity=0.561 Sum_probs=77.5
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
+...+||+|+...++.++++..|+.||.|..+.++++...++++||+||+|.+.+.++.||. ||+..|.+..|.|.+.+
T Consensus 100 d~~sv~v~nvd~~~t~~~~e~hf~~Cg~i~~~ti~~d~~~~~~k~~~yvef~~~~~~~~ay~-l~gs~i~~~~i~vt~~r 178 (231)
T KOG4209|consen 100 DAPSVWVGNVDFLVTLTKIELHFESCGGINRVTVPKDKFRGHPKGFAYVEFSSYELVEEAYK-LDGSEIPGPAIEVTLKR 178 (231)
T ss_pred CCceEEEeccccccccchhhheeeccCCccceeeeccccCCCcceeEEEecccHhhhHHHhh-cCCcccccccceeeeee
Confidence 45789999999999999999999999999999999999898999999999999999999997 99999999999999988
Q ss_pred cCCCC
Q psy7051 92 YGRPP 96 (158)
Q Consensus 92 ~~~~~ 96 (158)
...+.
T Consensus 179 ~~~pg 183 (231)
T KOG4209|consen 179 TNVPG 183 (231)
T ss_pred eecCC
Confidence 66433
No 72
>KOG0146|consensus
Probab=99.20 E-value=2.3e-11 Score=88.77 Aligned_cols=82 Identities=29% Similarity=0.493 Sum_probs=72.6
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcc-cCC--ceEEEEe
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRM-LDG--RELRVQM 89 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~-i~g--~~i~v~~ 89 (158)
..+||||-|...-.|+|+..+|..||.|++|.+...+ .|..+|||||.|.+..+|+.||..|+|.. +-| ..|.|++
T Consensus 19 drklfvgml~kqq~e~dvrrlf~pfG~~~e~tvlrg~-dg~sKGCAFVKf~s~~eAqaAI~aLHgSqTmpGASSSLVVK~ 97 (371)
T KOG0146|consen 19 DRKLFVGMLNKQQSEDDVRRLFQPFGNIEECTVLRGP-DGNSKGCAFVKFSSHAEAQAAINALHGSQTMPGASSSLVVKF 97 (371)
T ss_pred chhhhhhhhcccccHHHHHHHhcccCCcceeEEecCC-CCCCCCceEEEeccchHHHHHHHHhcccccCCCCccceEEEe
Confidence 4789999999999999999999999999999998876 67799999999999999999999999985 433 5689999
Q ss_pred eecCCC
Q psy7051 90 ARYGRP 95 (158)
Q Consensus 90 a~~~~~ 95 (158)
+...+.
T Consensus 98 ADTdkE 103 (371)
T KOG0146|consen 98 ADTDKE 103 (371)
T ss_pred ccchHH
Confidence 875543
No 73
>KOG0124|consensus
Probab=99.20 E-value=5.3e-11 Score=90.28 Aligned_cols=78 Identities=23% Similarity=0.398 Sum_probs=73.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
...|||..+.++.+++||+..|+.||+|..|.+-..+..+..+||+||+|.+......||..||-..|+|..|.|.-+
T Consensus 210 fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~pt~~~HkGyGfiEy~n~qs~~eAiasMNlFDLGGQyLRVGk~ 287 (544)
T KOG0124|consen 210 FNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDLGGQYLRVGKC 287 (544)
T ss_pred hheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccCCCCCccceeeEEeccccchHHHhhhcchhhcccceEecccc
Confidence 368999999999999999999999999999999999988899999999999999999999999999999999998754
No 74
>KOG0110|consensus
Probab=99.18 E-value=7.7e-11 Score=95.52 Aligned_cols=78 Identities=36% Similarity=0.575 Sum_probs=69.1
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCC---CCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFT---RESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~---~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
++|||.||++.++.++|..+|.++|.|..+.|...+.. -.+.|||||+|.+.++|+.|++.|+|..|+|+.|.|.++
T Consensus 516 t~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFVEF~~~e~A~~a~k~lqgtvldGH~l~lk~S 595 (725)
T KOG0110|consen 516 TKLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFVEFAKPESAQAALKALQGTVLDGHKLELKIS 595 (725)
T ss_pred hhhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEEEecCHHHHHHHHHHhcCceecCceEEEEec
Confidence 34999999999999999999999999998887655422 124499999999999999999999999999999999999
Q ss_pred e
Q psy7051 91 R 91 (158)
Q Consensus 91 ~ 91 (158)
.
T Consensus 596 ~ 596 (725)
T KOG0110|consen 596 E 596 (725)
T ss_pred c
Confidence 8
No 75
>KOG4454|consensus
Probab=99.18 E-value=1.1e-11 Score=87.95 Aligned_cols=82 Identities=23% Similarity=0.291 Sum_probs=72.1
Q ss_pred CCCCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 7 PPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 7 ~~~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
++..+...||||+||...++|+.|.++|-+.|+|..|.|+.++ .+..+ ||||.|.++..+.-|++.+||..+.+..|+
T Consensus 3 aaaae~drtl~v~n~~~~v~eelL~ElfiqaGPV~kv~ip~~~-d~~~k-Fa~v~f~~E~sv~~a~~L~ng~~l~~~e~q 80 (267)
T KOG4454|consen 3 AAAAEMDRTLLVQNMYSGVSEELLSELFIQAGPVYKVGIPSGQ-DQEQK-FAYVFFPNENSVQLAGQLENGDDLEEDEEQ 80 (267)
T ss_pred CCCcchhhHHHHHhhhhhhhHHHHHHHhhccCceEEEeCCCCc-cCCCc-eeeeecccccchhhhhhhcccchhccchhh
Confidence 3444555889999999999999999999999999999998876 44455 999999999999999999999999999988
Q ss_pred EEee
Q psy7051 87 VQMA 90 (158)
Q Consensus 87 v~~a 90 (158)
|.+-
T Consensus 81 ~~~r 84 (267)
T KOG4454|consen 81 RTLR 84 (267)
T ss_pred cccc
Confidence 8874
No 76
>KOG0123|consensus
Probab=99.18 E-value=8.7e-11 Score=91.42 Aligned_cols=72 Identities=38% Similarity=0.482 Sum_probs=68.4
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..|||| +.+|+..|.++|+++|+|..+.+..+. + +.|||||.|.++++|+.|++.||...|.|+.|.|-|+.
T Consensus 2 ~sl~vg---~~v~e~~l~~~f~~~~~v~s~rvc~d~-t--slgy~yvnf~~~~da~~A~~~~n~~~~~~~~~rim~s~ 73 (369)
T KOG0123|consen 2 ASLYVG---PDVTEAMLFDKFSPAGPVLSIRVCRDA-T--SLGYAYVNFQQPADAERALDTMNFDVLKGKPIRIMWSQ 73 (369)
T ss_pred CceecC---CcCChHHHHHHhcccCCceeEEEeecC-C--ccceEEEecCCHHHHHHHHHHcCCcccCCcEEEeehhc
Confidence 468998 899999999999999999999999998 6 89999999999999999999999999999999999886
No 77
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=99.16 E-value=4.6e-10 Score=71.13 Aligned_cols=78 Identities=22% Similarity=0.381 Sum_probs=69.3
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhc--CceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC----CceEEE
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERC--GEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD----GRELRV 87 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~--G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~----g~~i~v 87 (158)
+||.|.|+|...+.++|.+++... |....+-++.|..+..+.|||||.|.+++.|....+.++|..|. .+.+.|
T Consensus 2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~~N~GYAFVNf~~~~~~~~F~~~f~g~~w~~~~s~Kvc~i 81 (97)
T PF04059_consen 2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNKCNLGYAFVNFTSPQAAIRFYKAFNGKKWPNFNSKKVCEI 81 (97)
T ss_pred eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCCCceEEEEEEcCCHHHHHHHHHHHcCCccccCCCCcEEEE
Confidence 789999999999999999998663 67788899999989999999999999999999999999999875 456778
Q ss_pred Eeee
Q psy7051 88 QMAR 91 (158)
Q Consensus 88 ~~a~ 91 (158)
.+|+
T Consensus 82 ~yAr 85 (97)
T PF04059_consen 82 SYAR 85 (97)
T ss_pred ehhH
Confidence 7776
No 78
>KOG0110|consensus
Probab=99.14 E-value=5.2e-11 Score=96.49 Aligned_cols=82 Identities=28% Similarity=0.542 Sum_probs=75.2
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
.++.|+|.|||..++..+|+.+|..||+|..|.|+.....+...|||||+|-++.+|.+|+..|..+.|.|+.|.++||+
T Consensus 612 ~~tKIlVRNipFeAt~rEVr~LF~aFGqlksvRlPKK~~k~a~rGF~Fv~f~t~~ea~nA~~al~STHlyGRrLVLEwA~ 691 (725)
T KOG0110|consen 612 KGTKILVRNIPFEATKREVRKLFTAFGQLKSVRLPKKIGKGAHRGFGFVDFLTPREAKNAFDALGSTHLYGRRLVLEWAK 691 (725)
T ss_pred ccceeeeeccchHHHHHHHHHHHhcccceeeeccchhhcchhhccceeeeccCcHHHHHHHHhhcccceechhhheehhc
Confidence 36789999999999999999999999999999998875556678999999999999999999999999999999999987
Q ss_pred cC
Q psy7051 92 YG 93 (158)
Q Consensus 92 ~~ 93 (158)
..
T Consensus 692 ~d 693 (725)
T KOG0110|consen 692 SD 693 (725)
T ss_pred cc
Confidence 43
No 79
>KOG4660|consensus
Probab=99.13 E-value=5.8e-11 Score=93.98 Aligned_cols=72 Identities=36% Similarity=0.565 Sum_probs=64.9
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
.....+|+|.|||..|++++|..+|+.||+|.+|..... ..+.+||+|.+.-+|+.|++.|++.+|.|+.|+
T Consensus 72 ~~~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir~t~~-----~~~~~~v~FyDvR~A~~Alk~l~~~~~~~~~~k 143 (549)
T KOG4660|consen 72 DMNQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIRETPN-----KRGIVFVEFYDVRDAERALKALNRREIAGKRIK 143 (549)
T ss_pred cCccceEEEEecCCcCCHHHHHHHHHhhcchhhhhcccc-----cCceEEEEEeehHhHHHHHHHHHHHHhhhhhhc
Confidence 344578999999999999999999999999999876443 478999999999999999999999999999888
No 80
>KOG4205|consensus
Probab=99.12 E-value=8.7e-11 Score=88.97 Aligned_cols=81 Identities=33% Similarity=0.476 Sum_probs=72.8
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...+|||++|++.++++.|++.|.+||+|..|.++.++.++...||+||+|++.+.+.+++. ..-+.|.|+.|.+..|.
T Consensus 5 ~~~KlfiGgisw~ttee~Lr~yf~~~Gev~d~~vm~d~~t~rsrgFgfv~f~~~~~v~~vl~-~~~h~~dgr~ve~k~av 83 (311)
T KOG4205|consen 5 ESGKLFIGGLSWETTEESLREYFSQFGEVTDCVVMRDPSTGRSRGFGFVTFATPEGVDAVLN-ARTHKLDGRSVEPKRAV 83 (311)
T ss_pred CCcceeecCcCccccHHHHHHHhcccCceeeEEEeccCCCCCcccccceecCCCcchheeec-ccccccCCccccceecc
Confidence 45789999999999999999999999999999999999999999999999999999888884 45678999999888776
Q ss_pred cC
Q psy7051 92 YG 93 (158)
Q Consensus 92 ~~ 93 (158)
..
T Consensus 84 ~r 85 (311)
T KOG4205|consen 84 SR 85 (311)
T ss_pred Cc
Confidence 43
No 81
>KOG4205|consensus
Probab=99.09 E-value=1.7e-10 Score=87.45 Aligned_cols=79 Identities=28% Similarity=0.403 Sum_probs=73.2
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..+|||++||..+++++|+++|++||.|..+.++.+..+...++|+||.|.+++.+..++. ..-+.|+++.|.|..|..
T Consensus 97 tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~~~~~~d~~~~~~rgFgfv~~~~e~sVdkv~~-~~f~~~~gk~vevkrA~p 175 (311)
T KOG4205|consen 97 TKKIFVGGLPPDTTEEDFKDYFEQFGKVADVVIMYDKTTSRPRGFGFVTFDSEDSVDKVTL-QKFHDFNGKKVEVKRAIP 175 (311)
T ss_pred eeEEEecCcCCCCchHHHhhhhhccceeEeeEEeecccccccccceeeEeccccccceecc-cceeeecCceeeEeeccc
Confidence 4689999999999999999999999999999999999999999999999999999999884 578899999999998763
No 82
>KOG1457|consensus
Probab=99.08 E-value=2.2e-09 Score=76.64 Aligned_cols=83 Identities=19% Similarity=0.319 Sum_probs=67.2
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEcc-CCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc---CCceEEEE
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPR-DRFTRESRGFAFVRFFDKRDAEDALDAMDGRML---DGRELRVQ 88 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~-~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i---~g~~i~v~ 88 (158)
..||||.+||.++..-+|..+|..|-.-+.+.+.. ++....++-+|||+|.+..+|++|.+.|||..| .+..|+++
T Consensus 34 VRTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~~~~~~pvaFatF~s~q~A~aamnaLNGvrFDpE~~stLhiE 113 (284)
T KOG1457|consen 34 VRTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKGDQVCKPVAFATFTSHQFALAAMNALNGVRFDPETGSTLHIE 113 (284)
T ss_pred cceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCCCccccceEEEEecchHHHHHHHHHhcCeeeccccCceeEee
Confidence 57999999999999999999999986555554432 222233457999999999999999999999988 47889999
Q ss_pred eeecCCC
Q psy7051 89 MARYGRP 95 (158)
Q Consensus 89 ~a~~~~~ 95 (158)
+|+....
T Consensus 114 lAKSNtK 120 (284)
T KOG1457|consen 114 LAKSNTK 120 (284)
T ss_pred ehhcCcc
Confidence 9986543
No 83
>KOG0106|consensus
Probab=99.05 E-value=8.8e-10 Score=79.10 Aligned_cols=67 Identities=40% Similarity=0.648 Sum_probs=60.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEE
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRV 87 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v 87 (158)
.+.|+|.+++..+.|++|.++|.++|++.+..+ ..+++||+|...++|..|+..|++..|.++.|.+
T Consensus 99 ~~r~~~~~~~~r~~~qdl~d~~~~~g~~~~~~~--------~~~~~~v~Fs~~~da~ra~~~l~~~~~~~~~l~~ 165 (216)
T KOG0106|consen 99 HFRLIVRNLSLRVSWQDLKDHFRPAGEVTYVDA--------RRNFAFVEFSEQEDAKRALEKLDGKKLNGRRISV 165 (216)
T ss_pred cceeeeccchhhhhHHHHhhhhcccCCCchhhh--------hccccceeehhhhhhhhcchhccchhhcCceeee
Confidence 467899999999999999999999999955544 2468999999999999999999999999999999
No 84
>KOG1548|consensus
Probab=99.04 E-value=1.4e-09 Score=81.98 Aligned_cols=81 Identities=26% Similarity=0.450 Sum_probs=72.4
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceE--------EEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCce
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVG--------DIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRE 84 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~--------~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~ 84 (158)
.+.|||.|||..+|.+++.++|+++|-|. .|+|..+. .|+.+|-|+|.|-..++++-||+.|++..|.|+.
T Consensus 134 Nt~VYVsgLP~DiT~dE~~~~~sKcGiI~~d~~t~epk~KlYrd~-~G~lKGDaLc~y~K~ESVeLA~~ilDe~~~rg~~ 212 (382)
T KOG1548|consen 134 NTSVYVSGLPLDITVDEFAEVMSKCGIIMRDPQTGEPKVKLYRDN-QGKLKGDALCCYIKRESVELAIKILDEDELRGKK 212 (382)
T ss_pred CceEEecCCCCcccHHHHHHHHHhcceEeccCCCCCeeEEEEecC-CCCccCceEEEeecccHHHHHHHHhCcccccCcE
Confidence 35699999999999999999999999774 36777775 5899999999999999999999999999999999
Q ss_pred EEEEeeecCC
Q psy7051 85 LRVQMARYGR 94 (158)
Q Consensus 85 i~v~~a~~~~ 94 (158)
|.|+.|+...
T Consensus 213 ~rVerAkfq~ 222 (382)
T KOG1548|consen 213 LRVERAKFQM 222 (382)
T ss_pred EEEehhhhhh
Confidence 9999997543
No 85
>KOG4211|consensus
Probab=99.00 E-value=3.4e-09 Score=83.09 Aligned_cols=78 Identities=24% Similarity=0.431 Sum_probs=67.6
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
+...-|.+.+|||++|++||.+||+.+ .|+.+.+.+. +|++.|-|||+|.++++++.|++ .+...+..+.|.|-.+
T Consensus 8 ~~~~~vr~rGLPwsat~~ei~~Ff~~~-~I~~~~~~r~--~Gr~sGeA~Ve~~seedv~~Alk-kdR~~mg~RYIEVf~~ 83 (510)
T KOG4211|consen 8 STAFEVRLRGLPWSATEKEILDFFSNC-GIENLEIPRR--NGRPSGEAYVEFTSEEDVEKALK-KDRESMGHRYIEVFTA 83 (510)
T ss_pred CcceEEEecCCCccccHHHHHHHHhcC-ceeEEEEecc--CCCcCcceEEEeechHHHHHHHH-hhHHHhCCceEEEEcc
Confidence 344678899999999999999999999 5877766554 79999999999999999999996 5888899999999877
Q ss_pred ec
Q psy7051 91 RY 92 (158)
Q Consensus 91 ~~ 92 (158)
..
T Consensus 84 ~~ 85 (510)
T KOG4211|consen 84 GG 85 (510)
T ss_pred CC
Confidence 53
No 86
>KOG1995|consensus
Probab=98.89 E-value=2.6e-09 Score=80.85 Aligned_cols=87 Identities=22% Similarity=0.270 Sum_probs=77.4
Q ss_pred CCCCCCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceE--------EEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 5 RPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVG--------DIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 5 ~p~~~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~--------~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
.+........+|||-+|+..+++++|.++|.++|.|. .|+|-++++|+.+++-|.|.|.+...|++|+..++
T Consensus 58 ~~~~~~s~~~ti~v~g~~d~~~~~~~~~~f~qcg~ikrnK~t~kPki~~y~dkeT~~~KGeatvS~~D~~~akaai~~~a 137 (351)
T KOG1995|consen 58 SSMADKSDNETIFVWGCPDSVCENDNADFFLQCGVIKRNKRTGKPKIKIYTDKETGAPKGEATVSYEDPPAAKAAIEWFA 137 (351)
T ss_pred CccccccccccceeeccCccchHHHHHHHHhhcceeccCCCCCCcchhccccccccCcCCceeeeecChhhhhhhhhhhc
Confidence 3444556668999999999999999999999999773 46778888999999999999999999999999999
Q ss_pred CcccCCceEEEEeee
Q psy7051 77 GRMLDGRELRVQMAR 91 (158)
Q Consensus 77 g~~i~g~~i~v~~a~ 91 (158)
+..+.+..|+|.+|.
T Consensus 138 gkdf~gn~ikvs~a~ 152 (351)
T KOG1995|consen 138 GKDFCGNTIKVSLAE 152 (351)
T ss_pred cccccCCCchhhhhh
Confidence 999999999998875
No 87
>KOG0120|consensus
Probab=98.86 E-value=2.8e-09 Score=84.86 Aligned_cols=83 Identities=25% Similarity=0.418 Sum_probs=77.2
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
....|||++||..+++.++.+++..||.+..+.++.+..++-++||||.+|.+......|+..|||..+.+.+|.|+.|-
T Consensus 288 ~~~ki~v~~lp~~l~~~q~~Ell~~fg~lk~f~lv~d~~~g~skg~af~ey~dpsvtd~A~agLnGm~lgd~~lvvq~A~ 367 (500)
T KOG0120|consen 288 SPNKIFVGGLPLYLTEDQVKELLDSFGPLKAFRLVKDSATGNSKGFAFCEYCDPSVTDQAIAGLNGMQLGDKKLVVQRAI 367 (500)
T ss_pred ccchhhhccCcCccCHHHHHHHHHhcccchhheeecccccccccceeeeeeeCCcchhhhhcccchhhhcCceeEeehhh
Confidence 34689999999999999999999999999999999999889999999999999999999999999999999999999886
Q ss_pred cCC
Q psy7051 92 YGR 94 (158)
Q Consensus 92 ~~~ 94 (158)
.+.
T Consensus 368 ~g~ 370 (500)
T KOG0120|consen 368 VGA 370 (500)
T ss_pred ccc
Confidence 443
No 88
>KOG0151|consensus
Probab=98.83 E-value=8.8e-09 Score=83.94 Aligned_cols=79 Identities=29% Similarity=0.437 Sum_probs=69.9
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCC---CCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR---FTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~---~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
.+.|||+||++.++++.|...|..||+|..|+++... +......|+||.|-+..+|++|++.|+|..|.+..+++.|
T Consensus 174 TTNlyv~Nlnpsv~E~~ll~tfGrfgPlasvKimwpRtEeEk~r~r~cgfvafmnR~D~era~k~lqg~iv~~~e~K~gW 253 (877)
T KOG0151|consen 174 TTNLYVGNLNPSVDENFLLRTFGRFGPLASVKIMWPRTEEEKRRERNCGFVAFMNRADAERALKELQGIIVMEYEMKLGW 253 (877)
T ss_pred ccceeeecCCccccHHHHHHHhcccCcccceeeecccchhhhccccccceeeehhhhhHHHHHHHhcceeeeeeeeeecc
Confidence 4789999999999999999999999999999886542 2234567999999999999999999999999999999999
Q ss_pred ee
Q psy7051 90 AR 91 (158)
Q Consensus 90 a~ 91 (158)
++
T Consensus 254 gk 255 (877)
T KOG0151|consen 254 GK 255 (877)
T ss_pred cc
Confidence 85
No 89
>KOG0226|consensus
Probab=98.83 E-value=5.1e-09 Score=76.04 Aligned_cols=81 Identities=25% Similarity=0.422 Sum_probs=74.2
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
+....||+|.|..+++.+.|-..|.+|-.....+++.++.+++++||+||.|.+.+++..|+..|+|..++.+.|++.-+
T Consensus 188 ~~DfRIfcgdlgNevnd~vl~raf~Kfpsf~~akviRdkRTgKSkgygfVSf~~pad~~rAmrem~gkyVgsrpiklRkS 267 (290)
T KOG0226|consen 188 EDDFRIFCGDLGNEVNDDVLARAFKKFPSFQKAKVIRDKRTGKSKGYGFVSFRDPADYVRAMREMNGKYVGSRPIKLRKS 267 (290)
T ss_pred cccceeecccccccccHHHHHHHHHhccchhhccccccccccccccceeeeecCHHHHHHHHHhhcccccccchhHhhhh
Confidence 33478999999999999999999999988888889999999999999999999999999999999999999999987654
Q ss_pred e
Q psy7051 91 R 91 (158)
Q Consensus 91 ~ 91 (158)
.
T Consensus 268 ~ 268 (290)
T KOG0226|consen 268 E 268 (290)
T ss_pred h
Confidence 4
No 90
>KOG0105|consensus
Probab=98.80 E-value=2.3e-07 Score=64.69 Aligned_cols=63 Identities=22% Similarity=0.331 Sum_probs=57.3
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD 81 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~ 81 (158)
....|.|.+||++.+|++|++++.+.|+|++..+..+ |++.|+|...|+++-|+.+|+...+.
T Consensus 114 Se~RVvVsGLp~SgSWQDLKDHmReaGdvCfadv~rD-------g~GvV~~~r~eDMkYAvr~ld~~~~~ 176 (241)
T KOG0105|consen 114 SEYRVVVSGLPPSGSWQDLKDHMREAGDVCFADVQRD-------GVGVVEYLRKEDMKYAVRKLDDQKFR 176 (241)
T ss_pred cceeEEEecCCCCCchHHHHHHHHhhCCeeeeeeecc-------cceeeeeeehhhHHHHHHhhcccccc
Confidence 3477999999999999999999999999999998766 58999999999999999999988764
No 91
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=98.75 E-value=1e-07 Score=58.19 Aligned_cols=69 Identities=29% Similarity=0.453 Sum_probs=47.2
Q ss_pred cEEEEeCCCCCCCHHHH----HHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 14 VSLKVDNLTYRTTPDDL----RRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l----~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
..|+|.|||...+...| ++++..+| .|..|. .+.|+|.|.+++.|..|.+.|+|..+.|.+|.|.
T Consensus 3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v~----------~~tAilrF~~~~~A~RA~KRmegEdVfG~kI~v~ 72 (90)
T PF11608_consen 3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSVS----------GGTAILRFPNQEFAERAQKRMEGEDVFGNKISVS 72 (90)
T ss_dssp EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE------------TT-EEEEESSHHHHHHHHHHHTT--SSSS--EEE
T ss_pred cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEEe----------CCEEEEEeCCHHHHHHHHHhhcccccccceEEEE
Confidence 46999999999887665 45556676 665551 3579999999999999999999999999999999
Q ss_pred eeec
Q psy7051 89 MARY 92 (158)
Q Consensus 89 ~a~~ 92 (158)
+...
T Consensus 73 ~~~~ 76 (90)
T PF11608_consen 73 FSPK 76 (90)
T ss_dssp SS--
T ss_pred EcCC
Confidence 9853
No 92
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=98.65 E-value=1.3e-07 Score=61.07 Aligned_cols=70 Identities=26% Similarity=0.459 Sum_probs=45.5
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc-----ccCCceEEEE
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR-----MLDGRELRVQ 88 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~-----~i~g~~i~v~ 88 (158)
..|+|.+++..++.++|+++|.+||.|.+|.+.... ..|||-|.+.+.|+.|++.+... .|.+..+.+.
T Consensus 2 ~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G~------~~g~VRf~~~~~A~~a~~~~~~~~~~~~~i~~~~~~~~ 75 (105)
T PF08777_consen 2 CILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRGD------TEGYVRFKTPEAAQKALEKLKEANDGKLKIKGKEVTLE 75 (105)
T ss_dssp -EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT-------SEEEEEESS---HHHHHHHHHHTTTS-B-TTSSSEEEE
T ss_pred eEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCCC------CEEEEEECCcchHHHHHHHHHhccCCceEEcCceEEEE
Confidence 468899999999999999999999999999986652 37999999999999999876544 5677777666
Q ss_pred e
Q psy7051 89 M 89 (158)
Q Consensus 89 ~ 89 (158)
.
T Consensus 76 v 76 (105)
T PF08777_consen 76 V 76 (105)
T ss_dssp -
T ss_pred E
Confidence 5
No 93
>KOG0147|consensus
Probab=98.64 E-value=1.2e-08 Score=80.97 Aligned_cols=79 Identities=33% Similarity=0.492 Sum_probs=73.9
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
+.|||+-.|+..++..+|.+||+.+|.|..|.++.+..++..+|.|||+|.+.+.+..|| .|.|..+.|.+|.|.....
T Consensus 179 ~Rtvf~~qla~r~~pRdL~efFs~~gkVrdVriI~Dr~s~rskgi~Yvef~D~~sVp~ai-aLsGqrllg~pv~vq~sEa 257 (549)
T KOG0147|consen 179 QRTVFCMQLARRNPPRDLEEFFSIVGKVRDVRIIGDRNSRRSKGIAYVEFCDEQSVPLAI-ALSGQRLLGVPVIVQLSEA 257 (549)
T ss_pred HHHHHHHHHhhcCCchhHHHHHHhhcCcceeEeeccccchhhcceeEEEEecccchhhHh-hhcCCcccCceeEecccHH
Confidence 468899899999999999999999999999999999999999999999999999999999 7999999999999998653
No 94
>KOG4211|consensus
Probab=98.63 E-value=1.5e-07 Score=74.08 Aligned_cols=78 Identities=26% Similarity=0.344 Sum_probs=64.2
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEE-EEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGD-IYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~-v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
.....|-+.+||+.|+++||.+||+..--|.. |.|+.++ .+++.|-|||+|++.+.|+.||.. |...|..+.|.|..
T Consensus 101 ~~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d~-rgR~tGEAfVqF~sqe~ae~Al~r-hre~iGhRYIEvF~ 178 (510)
T KOG4211|consen 101 ANDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMDQ-RGRPTGEAFVQFESQESAEIALGR-HRENIGHRYIEVFR 178 (510)
T ss_pred CCCceEEecCCCccCcHHHHHHHhcCCcccccceeeeccC-CCCcccceEEEecCHHHHHHHHHH-HHHhhccceEEeeh
Confidence 45578999999999999999999998743333 4556665 677899999999999999999964 77788888898876
Q ss_pred e
Q psy7051 90 A 90 (158)
Q Consensus 90 a 90 (158)
+
T Consensus 179 S 179 (510)
T KOG4211|consen 179 S 179 (510)
T ss_pred h
Confidence 5
No 95
>KOG4206|consensus
Probab=98.61 E-value=2.8e-07 Score=66.04 Aligned_cols=77 Identities=23% Similarity=0.373 Sum_probs=67.2
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC-CceEEEE
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD-GRELRVQ 88 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~-g~~i~v~ 88 (158)
..+..+||+.|||..++.+.|..+|.+|....+|.++... .+.|||+|.+...|..|...|++..|- ...+.|.
T Consensus 143 ~ppn~ilf~~niP~es~~e~l~~lf~qf~g~keir~i~~~-----~~iAfve~~~d~~a~~a~~~lq~~~it~~~~m~i~ 217 (221)
T KOG4206|consen 143 APPNNILFLTNIPSESESEMLSDLFEQFPGFKEIRLIPPR-----SGIAFVEFLSDRQASAAQQALQGFKITKKNTMQIT 217 (221)
T ss_pred CCCceEEEEecCCcchhHHHHHHHHhhCcccceeEeccCC-----CceeEEecchhhhhHHHhhhhccceeccCceEEec
Confidence 4456789999999999999999999999999999887653 579999999999999999999999885 7788887
Q ss_pred eee
Q psy7051 89 MAR 91 (158)
Q Consensus 89 ~a~ 91 (158)
+++
T Consensus 218 ~a~ 220 (221)
T KOG4206|consen 218 FAK 220 (221)
T ss_pred ccC
Confidence 764
No 96
>KOG1190|consensus
Probab=98.61 E-value=2.2e-07 Score=71.70 Aligned_cols=76 Identities=30% Similarity=0.463 Sum_probs=68.4
Q ss_pred CcEEEEeCCCC-CCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 13 MVSLKVDNLTY-RTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 13 ~~~l~V~nl~~-~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...|.|.||.. .+|.+-|.-+|.-||+|..|+|..++ +..|+|+|.+...|+-|++.|+|..|.|+.|.|.+++
T Consensus 297 n~vllvsnln~~~VT~d~LftlFgvYGdVqRVkil~nk-----kd~ALIQmsd~~qAqLA~~hL~g~~l~gk~lrvt~SK 371 (492)
T KOG1190|consen 297 NVVLLVSNLNEEAVTPDVLFTLFGVYGDVQRVKILYNK-----KDNALIQMSDGQQAQLAMEHLEGHKLYGKKLRVTLSK 371 (492)
T ss_pred ceEEEEecCchhccchhHHHHHHhhhcceEEEEeeecC-----CcceeeeecchhHHHHHHHHhhcceecCceEEEeecc
Confidence 46788999965 57999999999999999999998875 3579999999999999999999999999999999998
Q ss_pred cC
Q psy7051 92 YG 93 (158)
Q Consensus 92 ~~ 93 (158)
..
T Consensus 372 H~ 373 (492)
T KOG1190|consen 372 HT 373 (492)
T ss_pred Cc
Confidence 54
No 97
>KOG2202|consensus
Probab=98.50 E-value=6e-07 Score=65.47 Aligned_cols=62 Identities=24% Similarity=0.509 Sum_probs=51.6
Q ss_pred HHHHHhh-hcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 29 DLRRVFE-RCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 29 ~l~~~f~-~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
+|...|+ +||+|+++.|..+ ...+..|.+||.|...++|++|++.||+.+|.|++|..++..
T Consensus 84 d~f~E~~~kygEiee~~Vc~N-l~~hl~GNVYV~f~~Ee~ae~a~~~lnnRw~~G~pi~ae~~p 146 (260)
T KOG2202|consen 84 DVFTELEDKYGEIEELNVCDN-LGDHLVGNVYVKFRSEEDAEAALEDLNNRWYNGRPIHAELSP 146 (260)
T ss_pred HHHHHHHHHhhhhhhhhhhcc-cchhhhhhhhhhcccHHHHHHHHHHHcCccccCCcceeeecC
Confidence 3444444 8999999866554 356678999999999999999999999999999999999864
No 98
>KOG1457|consensus
Probab=98.44 E-value=2.3e-07 Score=66.51 Aligned_cols=64 Identities=25% Similarity=0.353 Sum_probs=53.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML 80 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i 80 (158)
+.||||.||..+++|++|+.+|+.|-...-++|... .| ...||++|+..+.|..|+..|+|..|
T Consensus 210 cstlfianl~~~~~ed~l~~~~~~~~gf~~l~~~~~--~g--~~vaf~~~~~~~~at~am~~lqg~~~ 273 (284)
T KOG1457|consen 210 CSTLFIANLGPNCTEDELKQLLSRYPGFHILKIRAR--GG--MPVAFADFEEIEQATDAMNHLQGNLL 273 (284)
T ss_pred hhhHhhhccCCCCCHHHHHHHHHhCCCceEEEEecC--CC--cceEeecHHHHHHHHHHHHHhhccee
Confidence 578999999999999999999999977665655332 23 34799999999999999999998765
No 99
>KOG4849|consensus
Probab=98.43 E-value=2.3e-07 Score=70.47 Aligned_cols=75 Identities=20% Similarity=0.369 Sum_probs=66.6
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcC--ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCG--EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G--~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
..+|||||-|++|.++|.+.+...| ++.++++..+..+|.++|||+|...+....+..++.|...+|.|+.-.|.
T Consensus 81 ~~~YvGNL~W~TTD~DL~~A~~S~G~~~~~dmKFFENR~NGQSKG~AL~~~~SdAa~Kq~MeiLP~k~iHGQ~P~V~ 157 (498)
T KOG4849|consen 81 YCCYVGNLLWYTTDADLLKALQSTGLAQFADMKFFENRTNGQSKGYALLVLNSDAAVKQTMEILPTKTIHGQSPTVL 157 (498)
T ss_pred EEEEecceeEEeccHHHHHHHHhhhHHHHhhhhhhhcccCCcccceEEEEecchHHHHHHHHhcccceecCCCCeee
Confidence 5799999999999999999998877 67888888899999999999999999999999999999999988764443
No 100
>KOG4210|consensus
Probab=98.41 E-value=3.5e-07 Score=69.03 Aligned_cols=80 Identities=30% Similarity=0.498 Sum_probs=72.5
Q ss_pred CcEEE-EeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 13 MVSLK-VDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 13 ~~~l~-V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..++| |++|+..+++++|+.+|..+|.|..+.++....++...++|||.|.+...+..++.. +...+.+..+.|.+..
T Consensus 184 s~~~~~~~~~~f~~~~d~~~~~~~~~~~i~~~r~~~~~~s~~~kg~a~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 262 (285)
T KOG4210|consen 184 SDTIFFVGELDFSLTRDDLKEHFVSSGEITSVRLPTDEESGDSKGFAYVDFSAGNSKKLALND-QTRSIGGRPLRLEEDE 262 (285)
T ss_pred cccceeecccccccchHHHhhhccCcCcceeeccCCCCCccchhhhhhhhhhhchhHHHHhhc-ccCcccCcccccccCC
Confidence 45566 999999999999999999999999999999999999999999999999999999876 7888999999998877
Q ss_pred cC
Q psy7051 92 YG 93 (158)
Q Consensus 92 ~~ 93 (158)
..
T Consensus 263 ~~ 264 (285)
T KOG4210|consen 263 PR 264 (285)
T ss_pred CC
Confidence 43
No 101
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=98.36 E-value=1.5e-06 Score=65.90 Aligned_cols=78 Identities=19% Similarity=0.417 Sum_probs=60.9
Q ss_pred cEEEEeCCCCCCCHHH----H--HHHhhhcCceEEEEEccCCCC-CCcccE--EEEEEcCHHHHHHHHHHhCCcccCCce
Q psy7051 14 VSLKVDNLTYRTTPDD----L--RRVFERCGEVGDIYIPRDRFT-RESRGF--AFVRFFDKRDAEDALDAMDGRMLDGRE 84 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~----l--~~~f~~~G~i~~v~~~~~~~~-~~~~g~--afV~f~~~~~a~~a~~~l~g~~i~g~~ 84 (158)
.-|||-+|++.+..++ | .++|.+||.|..|.+-..... .-..+. .||+|.+.++|..||...+|..++|+.
T Consensus 115 NLvYVigi~pkva~Ee~~~vLk~~eyFGQyGkI~KIvvNkkt~s~nst~~h~gvYITy~~kedAarcIa~vDgs~~DGr~ 194 (480)
T COG5175 115 NLVYVIGIPPKVADEEVAPVLKRHEYFGQYGKIKKIVVNKKTSSLNSTASHAGVYITYSTKEDAARCIAEVDGSLLDGRV 194 (480)
T ss_pred ceeEEecCCCCCCcccccccccchhhhhhccceeEEEecccccccccccccceEEEEecchHHHHHHHHHhccccccCce
Confidence 4589999998876655 2 578999999998877543211 111233 499999999999999999999999999
Q ss_pred EEEEeee
Q psy7051 85 LRVQMAR 91 (158)
Q Consensus 85 i~v~~a~ 91 (158)
|+..|..
T Consensus 195 lkatYGT 201 (480)
T COG5175 195 LKATYGT 201 (480)
T ss_pred EeeecCc
Confidence 9999854
No 102
>KOG0129|consensus
Probab=98.36 E-value=2.9e-06 Score=67.31 Aligned_cols=68 Identities=22% Similarity=0.211 Sum_probs=62.1
Q ss_pred CCCCCCCcEEEEeCCCCCCCHHHHHHHhh-hcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHH
Q psy7051 7 PPRIEGMVSLKVDNLTYRTTPDDLRRVFE-RCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 7 ~~~~~~~~~l~V~nl~~~~~~~~l~~~f~-~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
...+++..|||||+||--++.++|..+|+ -||.|+.+-|-.|++-..++|-|=|+|.+..+-.+||.+
T Consensus 364 sq~lDprrTVFVGgvprpl~A~eLA~imd~lyGgV~yaGIDtD~k~KYPkGaGRVtFsnqqsYi~AIsa 432 (520)
T KOG0129|consen 364 NQPIDPRRTVFVGGLPRPLTAEELAMIMEDLFGGVLYVGIDTDPKLKYPKGAGRVTFSNQQAYIKAISA 432 (520)
T ss_pred CcccCccceEEecCCCCcchHHHHHHHHHHhcCceEEEEeccCcccCCCCCcceeeecccHHHHHHHhh
Confidence 34567889999999999999999999999 699999999999988899999999999999999999864
No 103
>KOG1365|consensus
Probab=98.34 E-value=7.5e-07 Score=68.43 Aligned_cols=77 Identities=14% Similarity=0.251 Sum_probs=67.2
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcC-ceEE--EEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCG-EVGD--IYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G-~i~~--v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
..+|-+.+||..++.++|.+||..|. .|.. |.|+.+. .|++.|-|||+|.+.++|.+|..+.+.+..+.+.|.|-.
T Consensus 280 kdcvRLRGLPy~AtvEdIL~FlgdFa~~i~f~gVHmv~N~-qGrPSGeAFIqm~nae~a~aaaqk~hk~~mk~RYiEvfp 358 (508)
T KOG1365|consen 280 KDCVRLRGLPYEATVEDILDFLGDFATDIRFQGVHMVLNG-QGRPSGEAFIQMRNAERARAAAQKCHKKLMKSRYIEVFP 358 (508)
T ss_pred CCeeEecCCChhhhHHHHHHHHHHHhhhcccceeEEEEcC-CCCcChhhhhhhhhhHHHHHHHHHHHHhhcccceEEEee
Confidence 46799999999999999999999986 4444 7777774 788999999999999999999999998888899998876
Q ss_pred e
Q psy7051 90 A 90 (158)
Q Consensus 90 a 90 (158)
+
T Consensus 359 ~ 359 (508)
T KOG1365|consen 359 C 359 (508)
T ss_pred c
Confidence 4
No 104
>KOG1456|consensus
Probab=98.33 E-value=5.4e-06 Score=63.65 Aligned_cols=77 Identities=22% Similarity=0.219 Sum_probs=68.3
Q ss_pred CCcEEEEeCCCCC-CCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 12 GMVSLKVDNLTYR-TTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 12 ~~~~l~V~nl~~~-~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
+...+.|-+|... ++.+.|..+|..||.|+.|++++.+ .|.|.|++.+..+.+.|+..||+..+-|.+|.|.++
T Consensus 286 ~g~VmMVyGLdh~k~N~drlFNl~ClYGNV~rvkFmkTk-----~gtamVemgd~~aver~v~hLnn~~lfG~kl~v~~S 360 (494)
T KOG1456|consen 286 PGCVMMVYGLDHGKMNCDRLFNLFCLYGNVERVKFMKTK-----PGTAMVEMGDAYAVERAVTHLNNIPLFGGKLNVCVS 360 (494)
T ss_pred CCcEEEEEeccccccchhhhhhhhhhcCceeeEEEeecc-----cceeEEEcCcHHHHHHHHHHhccCccccceEEEeec
Confidence 3467889999764 6788899999999999999998875 578999999999999999999999999999999998
Q ss_pred ecC
Q psy7051 91 RYG 93 (158)
Q Consensus 91 ~~~ 93 (158)
+..
T Consensus 361 kQ~ 363 (494)
T KOG1456|consen 361 KQN 363 (494)
T ss_pred ccc
Confidence 843
No 105
>KOG4307|consensus
Probab=98.32 E-value=2.5e-06 Score=70.01 Aligned_cols=79 Identities=19% Similarity=0.266 Sum_probs=66.3
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
.+++.|-|.|+|..++-+||.+||..|-.+-.-.++.-...|...|-|.|.|++.++|..|...|++..|..+.|.+.+
T Consensus 865 pGp~V~~~~n~Pf~v~l~dI~~FF~dY~~~p~sI~~r~nd~G~pTGe~mvAfes~~eAr~A~~dl~~~~i~nr~V~l~i 943 (944)
T KOG4307|consen 865 PGPRVLSCNNFPFDVTLEDIVEFFNDYEPDPNSIRIRRNDDGVPTGECMVAFESQEEARRASMDLDGQKIRNRVVSLRI 943 (944)
T ss_pred CCCeEEEecCCCccccHHHHHHHhcccccCCCceeEeecCCCCcccceeEeecCHHHHHhhhhccccCcccceeEEEEe
Confidence 3446899999999999999999999996543332333345789999999999999999999999999999999988764
No 106
>KOG2416|consensus
Probab=98.24 E-value=1.2e-06 Score=70.61 Aligned_cols=81 Identities=22% Similarity=0.394 Sum_probs=65.9
Q ss_pred CCCCCCCCCcEEEEeCCCCCCCHHHHHHHhhh-cCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc---
Q psy7051 5 RPPPRIEGMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML--- 80 (158)
Q Consensus 5 ~p~~~~~~~~~l~V~nl~~~~~~~~l~~~f~~-~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i--- 80 (158)
.+|........|||.||---.|..+|+++|.. .|.|+.+|| | +.+..|||.|.+.++|.+...+|||..|
T Consensus 436 pSPsR~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~Wm--D----kIKShCyV~yss~eEA~atr~AlhnV~WP~s 509 (718)
T KOG2416|consen 436 PSPSRKEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEFWM--D----KIKSHCYVSYSSVEEAAATREALHNVQWPPS 509 (718)
T ss_pred CCCCCCCccceEeeecccccchHHHHHHHHhhccCchHHHHH--H----HhhcceeEecccHHHHHHHHHHHhccccCCC
Confidence 34446677889999999999999999999995 556666633 2 2467899999999999999999999987
Q ss_pred CCceEEEEeee
Q psy7051 81 DGRELRVQMAR 91 (158)
Q Consensus 81 ~g~~i~v~~a~ 91 (158)
+.+.|.|.|..
T Consensus 510 NPK~L~adf~~ 520 (718)
T KOG2416|consen 510 NPKHLIADFVR 520 (718)
T ss_pred CCceeEeeecc
Confidence 56778888864
No 107
>KOG2314|consensus
Probab=98.22 E-value=1e-05 Score=65.01 Aligned_cols=76 Identities=26% Similarity=0.508 Sum_probs=61.1
Q ss_pred CcEEEEeCCCCCCCH------HHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC-CceE
Q psy7051 13 MVSLKVDNLTYRTTP------DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD-GREL 85 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~------~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~-g~~i 85 (158)
...|+|.|+|.--.. .-|..+|+++|+|..+.++.+..+| .+||.|++|++..+|+.|++.|||..|+ ++..
T Consensus 58 D~vVvv~g~PvV~~~rl~klk~vl~kvfsk~gk~vn~~~P~~e~gg-tkG~lf~E~~~~~~A~~aVK~l~G~~ldknHtf 136 (698)
T KOG2314|consen 58 DSVVVVDGAPVVGPARLEKLKKVLTKVFSKAGKIVNMYYPIDEEGG-TKGYLFVEYASMRDAKKAVKSLNGKRLDKNHTF 136 (698)
T ss_pred ceEEEECCCcccChhHHHHHHHHHHHHHHhhccccceeeccCccCC-eeeEEEEEecChhhHHHHHHhcccceecccceE
Confidence 357888888863221 3466789999999999999887555 9999999999999999999999999874 5666
Q ss_pred EEEe
Q psy7051 86 RVQM 89 (158)
Q Consensus 86 ~v~~ 89 (158)
.|..
T Consensus 137 ~v~~ 140 (698)
T KOG2314|consen 137 FVRL 140 (698)
T ss_pred Eeeh
Confidence 6654
No 108
>KOG1190|consensus
Probab=98.22 E-value=4.4e-06 Score=64.73 Aligned_cols=77 Identities=21% Similarity=0.325 Sum_probs=64.0
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCc-eEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCc-eEEEE
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGE-VGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGR-ELRVQ 88 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~-i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~-~i~v~ 88 (158)
++..+|++.|+|.+++|++|+++|..-|. |...++.. +-..+|++++.+.|+|..|+-.++.+.+++. .|.|.
T Consensus 412 PpsatlHlsnip~svsee~lk~~f~~~g~~vkafkff~-----kd~kmal~q~~sveeA~~ali~~hnh~lgen~hlRvS 486 (492)
T KOG1190|consen 412 PPSATLHLSNIPPSVSEEDLKNLFQEPGGQVKAFKFFQ-----KDRKMALPQLESVEEAIQALIDLHNHYLGENHHLRVS 486 (492)
T ss_pred CchhheeeccCCcccchhHHHHhhhcCCceEEeeeecC-----CCcceeecccCChhHhhhhccccccccCCCCceEEEE
Confidence 55678999999999999999999998874 45555532 2346999999999999999999999999765 89999
Q ss_pred eeec
Q psy7051 89 MARY 92 (158)
Q Consensus 89 ~a~~ 92 (158)
|++.
T Consensus 487 FSks 490 (492)
T KOG1190|consen 487 FSKS 490 (492)
T ss_pred eecc
Confidence 9874
No 109
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=98.20 E-value=5.9e-06 Score=46.64 Aligned_cols=52 Identities=25% Similarity=0.540 Sum_probs=41.8
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHH
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDAL 72 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~ 72 (158)
+.|-|.++++...+ .|..+|..||+|+.+.+... ..+.||.|.+..+|+.||
T Consensus 2 ~wI~V~Gf~~~~~~-~vl~~F~~fGeI~~~~~~~~------~~~~~l~y~~~~~ae~Al 53 (53)
T PF14605_consen 2 TWISVSGFPPDLAE-EVLEHFASFGEIVDIYVPES------TNWMYLKYKSRKDAEKAL 53 (53)
T ss_pred cEEEEEeECchHHH-HHHHHHHhcCCEEEEEcCCC------CcEEEEEECCHHHHHhhC
Confidence 46889999987654 45558889999999988622 458999999999999885
No 110
>KOG1548|consensus
Probab=98.15 E-value=9.8e-06 Score=61.56 Aligned_cols=76 Identities=24% Similarity=0.438 Sum_probs=60.7
Q ss_pred CCcEEEEeCCCC----CCC-------HHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc
Q psy7051 12 GMVSLKVDNLTY----RTT-------PDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML 80 (158)
Q Consensus 12 ~~~~l~V~nl~~----~~~-------~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i 80 (158)
...+|.|.||=. ..+ .++|.+-.++||.|..|.|... ++.|.+-|.|.+.++|..||+.|+|.+|
T Consensus 264 ~~~tVi~kn~Ftp~~~~~~~~l~~dlkedl~eec~K~G~v~~vvv~d~----hPdGvvtV~f~n~eeA~~ciq~m~GR~f 339 (382)
T KOG1548|consen 264 ADRTVILKNMFTPEDFEKNPDLLNDLKEDLTEECEKFGQVRKVVVYDR----HPDGVVTVSFRNNEEADQCIQTMDGRWF 339 (382)
T ss_pred CCcEEEeeecCCHHHhccCHHHHHHHHHHHHHHHHHhCCcceEEEecc----CCCceeEEEeCChHHHHHHHHHhcCeee
Confidence 346788888721 223 3456667889999999987543 3678999999999999999999999999
Q ss_pred CCceEEEEeee
Q psy7051 81 DGRELRVQMAR 91 (158)
Q Consensus 81 ~g~~i~v~~a~ 91 (158)
.|++|...+..
T Consensus 340 dgRql~A~i~D 350 (382)
T KOG1548|consen 340 DGRQLTASIWD 350 (382)
T ss_pred cceEEEEEEeC
Confidence 99999988754
No 111
>KOG3152|consensus
Probab=98.06 E-value=3.6e-06 Score=61.43 Aligned_cols=72 Identities=21% Similarity=0.391 Sum_probs=59.7
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCC--------CCcc----cEEEEEEcCHHHHHHHHHHhCCccc
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFT--------RESR----GFAFVRFFDKRDAEDALDAMDGRML 80 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~--------~~~~----g~afV~f~~~~~a~~a~~~l~g~~i 80 (158)
.-.||++++|+.++..-|.++|..||.|-.|.|-....+ +.+. --++|+|.+...|..+...||+..|
T Consensus 74 ~GVvylS~IPp~m~~~rlReil~~yGeVGRvylqpE~~s~~~~r~~~~~n~~~~y~EGWvEF~~KrvAK~iAe~Lnn~~I 153 (278)
T KOG3152|consen 74 TGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQPEDDSKRAARKRKGGNYKKLYSEGWVEFISKRVAKRIAELLNNTPI 153 (278)
T ss_pred ceEEEeccCCCccCHHHHHHHHHhccccceEEecchhhHHHHHHhhcCCCccccchhHHHHHHHHHHHHHHHHHhCCCcc
Confidence 457999999999999999999999999988877544333 2222 2378999999999999999999999
Q ss_pred CCce
Q psy7051 81 DGRE 84 (158)
Q Consensus 81 ~g~~ 84 (158)
+|..
T Consensus 154 ggkk 157 (278)
T KOG3152|consen 154 GGKK 157 (278)
T ss_pred CCCC
Confidence 9875
No 112
>KOG0120|consensus
Probab=98.01 E-value=2.5e-05 Score=62.78 Aligned_cols=62 Identities=26% Similarity=0.426 Sum_probs=49.5
Q ss_pred HHHHhhhcCceEEEEEccCCCC---CCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 30 LRRVFERCGEVGDIYIPRDRFT---RESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 30 l~~~f~~~G~i~~v~~~~~~~~---~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
|+..+.+||.|..|.++.+... .--.|..||+|.+.++++.|.+.|+|..|+++.|...|-.
T Consensus 426 vr~ec~k~g~v~~v~ipr~~~~~~~~~G~GkVFVefas~ed~qrA~~~L~GrKF~nRtVvtsYyd 490 (500)
T KOG0120|consen 426 VRTECAKFGAVRSVEIPRPYPDENPVPGTGKVFVEFADTEDSQRAMEELTGRKFANRTVVASYYD 490 (500)
T ss_pred HHHHhcccCceeEEecCCCCCCCCcCCCcccEEEEecChHHHHHHHHHccCceeCCcEEEEEecC
Confidence 3344568999999988766222 2235789999999999999999999999999999887743
No 113
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=97.99 E-value=5.1e-05 Score=48.46 Aligned_cols=75 Identities=13% Similarity=0.199 Sum_probs=49.9
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCC-------CCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCce-E
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR-------FTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRE-L 85 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~-------~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~-i 85 (158)
..|.|-++|+. ....|.++|++||+|.+..-.... .......+..|+|+++.+|..||. .||..|.|.. |
T Consensus 7 ~wVtVFGfp~~-~~~~Vl~~F~~~G~Ile~~~~~~~~~~~~~~~~~~~~NWi~I~Y~~~~~A~rAL~-~NG~i~~g~~mv 84 (100)
T PF05172_consen 7 TWVTVFGFPPS-ASNQVLRHFSSFGTILEHFEVLRSSSGINPYPIPSGGNWIHITYDNPLSAQRALQ-KNGTIFSGSLMV 84 (100)
T ss_dssp CEEEEE---GG-GHHHHHHHHHCCS-EECEEGGG----------E-CCTTEEEEEESSHHHHHHHHT-TTTEEETTCEEE
T ss_pred eEEEEEccCHH-HHHHHHHHHHhcceEEEeecccccccccccccCCCCCCEEEEECCCHHHHHHHHH-hCCeEEcCcEEE
Confidence 45778899988 667788999999999776411000 001124699999999999999995 5999998864 4
Q ss_pred EEEee
Q psy7051 86 RVQMA 90 (158)
Q Consensus 86 ~v~~a 90 (158)
-|.+.
T Consensus 85 GV~~~ 89 (100)
T PF05172_consen 85 GVKPC 89 (100)
T ss_dssp EEEE-
T ss_pred EEEEc
Confidence 46655
No 114
>KOG4676|consensus
Probab=97.96 E-value=5.9e-07 Score=69.08 Aligned_cols=64 Identities=20% Similarity=0.179 Sum_probs=52.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD 81 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~ 81 (158)
..||+|++|+..+...++.++|..+|+|.+..|... ...-+|.|+|........|+. ++|.++.
T Consensus 151 rRt~~v~sl~~~~~l~e~~e~f~r~Gev~ya~~ask----~~s~~c~~sf~~qts~~halr-~~gre~k 214 (479)
T KOG4676|consen 151 RRTREVQSLISAAILPESGESFERKGEVSYAHTASK----SRSSSCSHSFRKQTSSKHALR-SHGRERK 214 (479)
T ss_pred Hhhhhhhcchhhhcchhhhhhhhhcchhhhhhhhcc----CCCcchhhhHhhhhhHHHHHH-hcchhhh
Confidence 367999999999999999999999999988877433 234578899999889888985 5777665
No 115
>KOG1365|consensus
Probab=97.93 E-value=1.4e-05 Score=61.65 Aligned_cols=76 Identities=22% Similarity=0.277 Sum_probs=57.8
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhc----CceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERC----GEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~----G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
+....|-+.+||.++++.++.+||.+- |..+.|.++..+ .|+..|-|||.|..+++|+.||.+ |...|+.+.|.
T Consensus 159 ~~qvivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~rp-dgrpTGdAFvlfa~ee~aq~aL~k-hrq~iGqRYIE 236 (508)
T KOG1365|consen 159 ENQVIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTRP-DGRPTGDAFVLFACEEDAQFALRK-HRQNIGQRYIE 236 (508)
T ss_pred ccceEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEECC-CCCcccceEEEecCHHHHHHHHHH-HHHHHhHHHHH
Confidence 345667888999999999999999632 244566565554 788999999999999999999975 55556655554
Q ss_pred EE
Q psy7051 87 VQ 88 (158)
Q Consensus 87 v~ 88 (158)
+-
T Consensus 237 lF 238 (508)
T KOG1365|consen 237 LF 238 (508)
T ss_pred HH
Confidence 43
No 116
>KOG1456|consensus
Probab=97.93 E-value=6.7e-05 Score=57.80 Aligned_cols=77 Identities=26% Similarity=0.239 Sum_probs=61.5
Q ss_pred CCcEEEEeCCC--CCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc-CC-ceEEE
Q psy7051 12 GMVSLKVDNLT--YRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML-DG-RELRV 87 (158)
Q Consensus 12 ~~~~l~V~nl~--~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i-~g-~~i~v 87 (158)
+.+.|.++=|+ ..+|.+-|..+....|+|..|.|... ++ --|.|||++.+.|++|...|||..| .| +.|+|
T Consensus 119 pN~VLl~TIlNp~YpItvDVly~Icnp~GkVlRIvIfkk--ng---VQAmVEFdsv~~AqrAk~alNGADIYsGCCTLKI 193 (494)
T KOG1456|consen 119 PNKVLLFTILNPQYPITVDVLYTICNPQGKVLRIVIFKK--NG---VQAMVEFDSVEVAQRAKAALNGADIYSGCCTLKI 193 (494)
T ss_pred CCeEEEEEeecCccccchhhhhhhcCCCCceEEEEEEec--cc---eeeEEeechhHHHHHHHhhcccccccccceeEEE
Confidence 33455555444 46899999999999999999987655 32 3599999999999999999999987 34 68999
Q ss_pred EeeecC
Q psy7051 88 QMARYG 93 (158)
Q Consensus 88 ~~a~~~ 93 (158)
+||+..
T Consensus 194 eyAkP~ 199 (494)
T KOG1456|consen 194 EYAKPT 199 (494)
T ss_pred EecCcc
Confidence 999844
No 117
>KOG4676|consensus
Probab=97.92 E-value=1.7e-05 Score=61.19 Aligned_cols=75 Identities=17% Similarity=0.282 Sum_probs=60.0
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCC---CCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRF---TRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~---~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
..|.|.||.++++.++++.||...|+|.++.|..+.. .....-.|||.|.+...+..|- +|.+..|-++.|.|-.
T Consensus 8 ~vIqvanispsat~dqm~tlFg~lGkI~elrlyp~~~d~~~pv~sRtcyVkf~d~~sv~vaQ-hLtntvfvdraliv~p 85 (479)
T KOG4676|consen 8 GVIQVANISPSATKDQMQTLFGNLGKIPELRLYPNVDDSKIPVISRTCYVKFLDSQSVTVAQ-HLTNTVFVDRALIVRP 85 (479)
T ss_pred ceeeecccCchhhHHHHHHHHhhccccccccccCCCCCccCcceeeeEEEeccCCcceeHHh-hhccceeeeeeEEEEe
Confidence 3799999999999999999999999999998765321 1234568999999999888886 6788877777666654
No 118
>PF08675 RNA_bind: RNA binding domain; InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=97.90 E-value=0.00012 Score=44.80 Aligned_cols=55 Identities=15% Similarity=0.292 Sum_probs=41.8
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCC
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDG 77 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g 77 (158)
...+|+ +|..+...||.++|+.||.|. |.++.+. -|||...+.+.|..|+..+.-
T Consensus 10 HVFhlt-FPkeWK~~DI~qlFspfG~I~-VsWi~dT-------SAfV~l~~r~~~~~v~~~~~~ 64 (87)
T PF08675_consen 10 HVFHLT-FPKEWKTSDIYQLFSPFGQIY-VSWINDT-------SAFVALHNRDQAKVVMNTLKK 64 (87)
T ss_dssp CEEEEE---TT--HHHHHHHCCCCCCEE-EEEECTT-------EEEEEECCCHHHHHHHHHHTT
T ss_pred eEEEEe-CchHhhhhhHHHHhccCCcEE-EEEEcCC-------cEEEEeecHHHHHHHHHHhcc
Confidence 556665 999999999999999999985 5565552 699999999999999887753
No 119
>PF08952 DUF1866: Domain of unknown function (DUF1866) ; InterPro: IPR015047 This domain, found in synaptojanin, has no known function. ; PDB: 1UFW_A 2DNR_A.
Probab=97.81 E-value=0.00015 Score=49.09 Aligned_cols=54 Identities=31% Similarity=0.509 Sum_probs=44.9
Q ss_pred HHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 29 DLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 29 ~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
+|.+.|..||++.-++++.+ .-+|+|.+-+.|.+|+ .|+|.+|+|+.|+|.+-.
T Consensus 52 ~ll~~~~~~GevvLvRfv~~--------~mwVTF~dg~sALaal-s~dg~~v~g~~l~i~LKt 105 (146)
T PF08952_consen 52 ELLQKFAQYGEVVLVRFVGD--------TMWVTFRDGQSALAAL-SLDGIQVNGRTLKIRLKT 105 (146)
T ss_dssp HHHHHHHCCS-ECEEEEETT--------CEEEEESSCHHHHHHH-HGCCSEETTEEEEEEE--
T ss_pred HHHHHHHhCCceEEEEEeCC--------eEEEEECccHHHHHHH-ccCCcEECCEEEEEEeCC
Confidence 67778889999988888654 4899999999999999 589999999999999865
No 120
>KOG0112|consensus
Probab=97.79 E-value=4.2e-05 Score=64.47 Aligned_cols=77 Identities=26% Similarity=0.393 Sum_probs=67.3
Q ss_pred CCCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCC--ceEE
Q psy7051 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDG--RELR 86 (158)
Q Consensus 9 ~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g--~~i~ 86 (158)
.....+.++|++|..++....|...|..||.|..|.+-.. .-||||.|++...+++|+..|-|..|++ +.|.
T Consensus 451 kst~ttr~~sgglg~w~p~~~l~r~fd~fGpir~Idy~hg------q~yayi~yes~~~aq~a~~~~rgap~G~P~~r~r 524 (975)
T KOG0112|consen 451 KSTPTTRLQSGGLGPWSPVSRLNREFDRFGPIRIIDYRHG------QPYAYIQYESPPAAQAATHDMRGAPLGGPPRRLR 524 (975)
T ss_pred ccccceeeccCCCCCCChHHHHHHHhhccCcceeeecccC------CcceeeecccCccchhhHHHHhcCcCCCCCcccc
Confidence 4456688999999999999999999999999998877332 4699999999999999999999999976 5689
Q ss_pred EEeee
Q psy7051 87 VQMAR 91 (158)
Q Consensus 87 v~~a~ 91 (158)
|.|+.
T Consensus 525 vdla~ 529 (975)
T KOG0112|consen 525 VDLAS 529 (975)
T ss_pred ccccc
Confidence 99876
No 121
>KOG0129|consensus
Probab=97.79 E-value=6.5e-05 Score=59.88 Aligned_cols=62 Identities=23% Similarity=0.377 Sum_probs=48.0
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCC---CCCCccc---EEEEEEcCHHHHHHHHHHhC
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR---FTRESRG---FAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~---~~~~~~g---~afV~f~~~~~a~~a~~~l~ 76 (158)
.+||||+||+.++|++|...|..||.+. |.++... .-..++| |+|+.|+++..+...+..+.
T Consensus 260 ~KVFvGGlp~dise~~i~~~F~~FGs~~-VdWP~k~~~~~~~ppkGs~~YvflvFe~E~sV~~Ll~aC~ 327 (520)
T KOG0129|consen 260 RKVFVGGLPWDITEAQINASFGQFGSVK-VDWPGKANSRGRAPPKGSYGYVFLVFEDERSVQSLLSACS 327 (520)
T ss_pred cceeecCCCccccHHHHHhhcccccceE-eecCCCccccccCCCCCcccEEEEEecchHHHHHHHHHHh
Confidence 6899999999999999999999999874 5555211 1122456 99999999988887766543
No 122
>KOG1996|consensus
Probab=97.79 E-value=8.6e-05 Score=55.50 Aligned_cols=65 Identities=23% Similarity=0.319 Sum_probs=51.4
Q ss_pred HHHHHHHhhhcCceEEEEEccCCCCC-CcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 27 PDDLRRVFERCGEVGDIYIPRDRFTR-ESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 27 ~~~l~~~f~~~G~i~~v~~~~~~~~~-~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
++++++.+++||+|..|.|...+..- .-.--.||+|...+.|..|+-.|||..|+|+.+...|-+
T Consensus 300 ede~keEceKyg~V~~viifeip~~p~deavRiFveF~r~e~aiKA~VdlnGRyFGGr~v~A~Fyn 365 (378)
T KOG1996|consen 300 EDETKEECEKYGKVGNVIIFEIPSQPEDEAVRIFVEFERVESAIKAVVDLNGRYFGGRVVSACFYN 365 (378)
T ss_pred HHHHHHHHHhhcceeeEEEEecCCCccchhheeeeeeccHHHHHHHHHhcCCceecceeeeheecc
Confidence 45678889999999988775543221 123458999999999999999999999999999887754
No 123
>PF10309 DUF2414: Protein of unknown function (DUF2414); InterPro: IPR019416 This entry contains proteins that have no known function.
Probab=97.76 E-value=0.00025 Score=41.13 Aligned_cols=54 Identities=22% Similarity=0.228 Sum_probs=45.1
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhc---CceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERC---GEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~---G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
.+|+|.|+. .++.++|+.+|..| .....|.++.|. -|-|.|.+.+.|..||.+|
T Consensus 6 eavhirGvd-~lsT~dI~~y~~~y~~~~~~~~IEWIdDt-------ScNvvf~d~~~A~~AL~~L 62 (62)
T PF10309_consen 6 EAVHIRGVD-ELSTDDIKAYFSEYFDEEGPFRIEWIDDT-------SCNVVFKDEETAARALVAL 62 (62)
T ss_pred ceEEEEcCC-CCCHHHHHHHHHHhcccCCCceEEEecCC-------cEEEEECCHHHHHHHHHcC
Confidence 569999986 57889999999998 235678888874 5999999999999999754
No 124
>KOG1855|consensus
Probab=97.75 E-value=3.5e-05 Score=60.15 Aligned_cols=67 Identities=16% Similarity=0.261 Sum_probs=55.3
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccC---CCCC--C--------cccEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD---RFTR--E--------SRGFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~---~~~~--~--------~~g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
+..+|.+.|||.+-.-+.|.++|..+|.|+.|.|+.. +... . .+-+|+|+|+..+.|.+|.+.|+..
T Consensus 230 ~srtivaenLP~Dh~~enl~kiFg~~G~IksIRIckPgaip~d~r~~~~~~~~~~tk~~AlvEye~~~~A~KA~e~~~~e 309 (484)
T KOG1855|consen 230 PSRTIVAENLPLDHSYENLSKIFGTVGSIKSIRICKPGAIPEDVRGFPKKYFELQTKECALVEYEEVEAARKARELLNPE 309 (484)
T ss_pred ccceEEEecCCcchHHHHHHHHhhcccceeeeeecCCCCCCcccccCCccchhhhhhhhhhhhhhhhHHHHHHHHhhchh
Confidence 5689999999999988999999999999999988655 2221 1 2458999999999999999887543
No 125
>KOG0128|consensus
Probab=97.46 E-value=6.9e-05 Score=62.90 Aligned_cols=79 Identities=18% Similarity=0.265 Sum_probs=70.4
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
...|+|.|+|+..|.++|+.++.++|.++.+.++... .|+++|.|+|.|.++.++..++...+...+....+.|..++.
T Consensus 736 K~~v~i~g~pf~gt~e~~k~l~~~~gn~~~~~~vt~r-~gkpkg~a~v~y~~ea~~s~~~~s~d~~~~rE~~~~v~vsnp 814 (881)
T KOG0128|consen 736 KISVAISGPPFQGTKEELKSLASKTGNVTSLRLVTVR-AGKPKGKARVDYNTEADASRKVASVDVAGKRENNGEVQVSNP 814 (881)
T ss_pred hhhhheeCCCCCCchHHHHhhccccCCccccchhhhh-ccccccceeccCCCcchhhhhcccchhhhhhhcCccccccCC
Confidence 3579999999999999999999999999999887775 789999999999999999999988888888878788877654
No 126
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=97.36 E-value=0.00038 Score=49.11 Aligned_cols=83 Identities=14% Similarity=0.181 Sum_probs=52.6
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhh-cCce---EEEE--EccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCC----
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFER-CGEV---GDIY--IPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDG---- 82 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~-~G~i---~~v~--~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g---- 82 (158)
..+|.|.+||+.+|++++.+.+.. ++.. ..+. ............-|||.|.+.+++...+..++|..|.+
T Consensus 7 ~~KvVIR~LPP~LteeeF~~~i~~~l~~~~~w~y~~g~~~~~~~~~~~~SRaYi~F~~~~~~~~F~~~~~g~~F~D~kg~ 86 (176)
T PF03467_consen 7 GTKVVIRRLPPNLTEEEFWEQISPWLPDEWDWYYFQGKYGKKSFKPPTYSRAYINFKNPEDLLEFRDRFDGHVFVDSKGN 86 (176)
T ss_dssp --EEEEEEE-TTS-HHHHCCCCSS--SSE---EEEEEEES-SSSTTS--EEEEEEESSCHHHHHHHHHCTTEEEE-TTS-
T ss_pred CceEEEeCCCCCCCHHHHHHHhhhhcccccceEEEecCCCCccCCCCcceEEEEEeCCHHHHHHHHHhcCCcEEECCCCC
Confidence 468999999999999999997777 6654 3443 22221112234579999999999999999999987633
Q ss_pred -ceEEEEeeecCCC
Q psy7051 83 -RELRVQMARYGRP 95 (158)
Q Consensus 83 -~~i~v~~a~~~~~ 95 (158)
....|++|-..+-
T Consensus 87 ~~~~~VE~Apyqk~ 100 (176)
T PF03467_consen 87 EYPAVVEFAPYQKV 100 (176)
T ss_dssp EEEEEEEE-SS---
T ss_pred CcceeEEEcchhcc
Confidence 3467788776443
No 127
>KOG0128|consensus
Probab=97.30 E-value=1.4e-05 Score=66.84 Aligned_cols=68 Identities=24% Similarity=0.374 Sum_probs=58.4
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD 81 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~ 81 (158)
.++||.||+..+.+.+|...|..+|.|..+.+......++.+|+|||+|..++.+.+||.......++
T Consensus 668 ~~~fvsnl~~~~~~~dl~~~~~~~~~~e~vqi~~h~n~~~~rG~~Y~~F~~~~~~~aaV~f~d~~~~g 735 (881)
T KOG0128|consen 668 IKIFVSNLSPKMSEEDLSERFSPSGTIEVVQIVIHKNEKRFRGKAYVEFLKPEHAGAAVAFRDSCFFG 735 (881)
T ss_pred HHHHHhhcchhhcCchhhhhcCccchhhhHHHHHHhhccccccceeeEeecCCchhhhhhhhhhhhhh
Confidence 47899999999999999999999998888777656668889999999999999999999765555444
No 128
>PF07576 BRAP2: BRCA1-associated protein 2; InterPro: IPR011422 These proteins include BRCA1-associated protein 2 (BRAP2), which binds nuclear localisation signals (NLSs) in vitro and in yeast two-hybrid screening []. These proteins share a region of sequence similarity at their N terminus. They also have IPR001607 from INTERPRO at the C terminus.
Probab=97.28 E-value=0.0044 Score=40.29 Aligned_cols=68 Identities=19% Similarity=0.255 Sum_probs=49.0
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCC
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDG 82 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g 82 (158)
...+.+...|..++.++|..+.+.+- .|..+.|+.+. ..++-.++|.|.+.++|.+.++.+||..++.
T Consensus 13 ~~~~~l~vp~~~~~~d~l~~f~~~~~~~i~~~riird~--~pnrymVLikF~~~~~Ad~Fy~~fNGk~Fns 81 (110)
T PF07576_consen 13 STLCCLAVPPYMTPSDFLLFFGAPFREDIEHIRIIRDG--TPNRYMVLIKFRDQESADEFYEEFNGKPFNS 81 (110)
T ss_pred ceEEEEEeCcccccHHHHHHhhhcccccEEEEEEeeCC--CCceEEEEEEECCHHHHHHHHHHhCCCccCC
Confidence 34455545555556667776666654 67788888763 2245579999999999999999999997653
No 129
>KOG4307|consensus
Probab=97.26 E-value=0.00027 Score=58.50 Aligned_cols=77 Identities=25% Similarity=0.091 Sum_probs=63.2
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEE-EEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGD-IYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~-v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
....|||-+||..+++.++.++|...-.|+. |.|... .+++..+.|||+|..++++..|+..-+...++.+.|.|.-
T Consensus 433 ag~~lyv~~lP~~t~~~~~v~~f~~~~~Ved~I~lt~~-P~~~~~~~afv~F~~~~a~~~a~~~~~k~y~G~r~irv~s 510 (944)
T KOG4307|consen 433 AGGALYVFQLPVMTPIVPPVNKFMGAAAVEDFIELTRL-PTDLLRPAAFVAFIHPTAPLTASSVKTKFYPGHRIIRVDS 510 (944)
T ss_pred ccceEEeccCCccccccchhhhhhhhhhhhheeEeccC-CcccccchhhheeccccccchhhhcccccccCceEEEeec
Confidence 3478999999999999999999998777766 555444 3777889999999998888888776677778888888875
No 130
>PF15023 DUF4523: Protein of unknown function (DUF4523)
Probab=97.20 E-value=0.0032 Score=42.52 Aligned_cols=74 Identities=18% Similarity=0.161 Sum_probs=53.5
Q ss_pred CCCCCcEEEEeCCCCCC----CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCce
Q psy7051 9 RIEGMVSLKVDNLTYRT----TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRE 84 (158)
Q Consensus 9 ~~~~~~~l~V~nl~~~~----~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~ 84 (158)
..++..||.|.=|..++ +...|...++.||+|..|.+. | +.-|.|+|.+..+|..|+.+++. ...|..
T Consensus 82 kepPMsTIVVRWlkknm~~~edl~sV~~~Ls~fGpI~SVT~c-----G--rqsavVvF~d~~SAC~Av~Af~s-~~pgtm 153 (166)
T PF15023_consen 82 KEPPMSTIVVRWLKKNMQPTEDLKSVIQRLSVFGPIQSVTLC-----G--RQSAVVVFKDITSACKAVSAFQS-RAPGTM 153 (166)
T ss_pred CCCCceeEEeehhhhcCChHHHHHHHHHHHHhcCCcceeeec-----C--CceEEEEehhhHHHHHHHHhhcC-CCCCce
Confidence 34567889888666555 334456677899999999763 2 34699999999999999988765 455666
Q ss_pred EEEEee
Q psy7051 85 LRVQMA 90 (158)
Q Consensus 85 i~v~~a 90 (158)
+.+.|-
T Consensus 154 ~qCsWq 159 (166)
T PF15023_consen 154 FQCSWQ 159 (166)
T ss_pred EEeecc
Confidence 666553
No 131
>KOG4660|consensus
Probab=97.18 E-value=0.0007 Score=54.58 Aligned_cols=55 Identities=20% Similarity=0.340 Sum_probs=44.5
Q ss_pred cCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc---C-CceEEEEeee
Q psy7051 37 CGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML---D-GRELRVQMAR 91 (158)
Q Consensus 37 ~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i---~-g~~i~v~~a~ 91 (158)
.|.-..+.++.|..+.++.|||||.|.+++++..+.+++||+.| + .+.+.+.||+
T Consensus 413 ~gtYDFlYLPiDF~nkcNvGYAFINm~sp~ai~~F~kAFnGk~W~~FnS~Kia~itYAr 471 (549)
T KOG4660|consen 413 KGTYDFLYLPIDFKNKCNVGYAFINMTSPEAIIRFYKAFNGKKWEKFNSEKIASITYAR 471 (549)
T ss_pred cCccceEEeccccccccccceeEEeecCHHHHHHHHHHHcCCchhhhcceeeeeeehhh
Confidence 45556777888888888999999999999999999999999965 3 3445777765
No 132
>PF03880 DbpA: DbpA RNA binding domain ; InterPro: IPR005580 This RNA binding domain is found at the C terminus of a number of DEAD helicase proteins [].; PDB: 2G0C_A 3MOJ_B.
Probab=97.15 E-value=0.0056 Score=36.82 Aligned_cols=66 Identities=30% Similarity=0.505 Sum_probs=39.3
Q ss_pred EEEEeCC--CCCCCHHHHHHHhhhcC-----ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEE
Q psy7051 15 SLKVDNL--TYRTTPDDLRRVFERCG-----EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRV 87 (158)
Q Consensus 15 ~l~V~nl--~~~~~~~~l~~~f~~~G-----~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v 87 (158)
+||| |+ -..++..+|..+|...+ .|-.|.+..+ |+||+... +.|+.+++.|++..+.|+.|.|
T Consensus 2 rl~i-n~Gr~dg~~~~~iv~~i~~~~gi~~~~IG~I~I~~~--------~S~vev~~-~~a~~v~~~l~~~~~~gk~v~v 71 (74)
T PF03880_consen 2 RLFI-NVGRKDGLTPRDIVGAICNEAGIPGRDIGRIDIFDN--------FSFVEVPE-EVAEKVLEALNGKKIKGKKVRV 71 (74)
T ss_dssp EEEE-S-SGGGT--HHHHHHHHHTCTTB-GGGEEEEEE-SS---------EEEEE-T-T-HHHHHHHHTT--SSS----E
T ss_pred EEEE-EcccccCCCHHHHHHHHHhccCCCHHhEEEEEEeee--------EEEEEECH-HHHHHHHHHhcCCCCCCeeEEE
Confidence 4555 33 34678899999988764 4567777443 89999865 4788889999999999999999
Q ss_pred Eee
Q psy7051 88 QMA 90 (158)
Q Consensus 88 ~~a 90 (158)
+.|
T Consensus 72 e~A 74 (74)
T PF03880_consen 72 ERA 74 (74)
T ss_dssp EE-
T ss_pred EEC
Confidence 865
No 133
>KOG0115|consensus
Probab=97.09 E-value=0.0008 Score=49.46 Aligned_cols=75 Identities=24% Similarity=0.255 Sum_probs=58.8
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc----ccCCceEEEEe
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR----MLDGRELRVQM 89 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~----~i~g~~i~v~~ 89 (158)
..|+|.||+..+..+.|...|+.||+|....++.|. .++..+-++|+|...-.+.+|+..+.-. ...+..+-|..
T Consensus 32 a~l~V~nl~~~~sndll~~~f~~fg~~e~av~~vD~-r~k~t~eg~v~~~~k~~a~~a~rr~~~~g~~~~~~~~p~~VeP 110 (275)
T KOG0115|consen 32 AELYVVNLMQGASNDLLEQAFRRFGPIERAVAKVDD-RGKPTREGIVEFAKKPNARKAARRCREGGFGGTTGGRPVGVEP 110 (275)
T ss_pred ceEEEEecchhhhhHHHHHhhhhcCccchheeeecc-cccccccchhhhhcchhHHHHHHHhccCccccCCCCCccCCCh
Confidence 569999999999999999999999999766555553 6778889999999999999998876422 33455555543
No 134
>KOG2193|consensus
Probab=97.03 E-value=0.0008 Score=52.81 Aligned_cols=69 Identities=30% Similarity=0.529 Sum_probs=55.7
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhc--CceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc-ccCCceEEEEee
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERC--GEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR-MLDGRELRVQMA 90 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~--G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~-~i~g~~i~v~~a 90 (158)
..||++||.+.++..+|+.+|... +.-..+ |+ ..||+||.+.+..-|..|++.++|. ++.|..+.|+++
T Consensus 2 nklyignL~p~~~psdl~svfg~ak~~~~g~f-l~-------k~gyafvd~pdq~wa~kaie~~sgk~elqGkr~e~~~s 73 (584)
T KOG2193|consen 2 NKLYIGNLSPQVTPSDLESVFGDAKIPGSGQF-LV-------KSGYAFVDCPDQQWANKAIETLSGKVELQGKRQEVEHS 73 (584)
T ss_pred CcccccccCCCCChHHHHHHhccccCCCCcce-ee-------ecceeeccCCchhhhhhhHHhhchhhhhcCceeeccch
Confidence 468999999999999999999754 111122 22 2479999999999999999999987 689999999875
No 135
>KOG2135|consensus
Probab=96.97 E-value=0.00054 Score=54.31 Aligned_cols=72 Identities=24% Similarity=0.341 Sum_probs=57.9
Q ss_pred cEEEEeCCCCCC-CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 14 VSLKVDNLTYRT-TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 14 ~~l~V~nl~~~~-~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
..|-+.-.+... +-++|...|.+||.|..|.+-.. ...|.|+|.+..+|..|. ..++..|+++.|+|.|-+.
T Consensus 373 s~l~lek~~~glnt~a~ln~hfA~fG~i~n~qv~~~------~~~a~vTF~t~aeag~a~-~s~~avlnnr~iKl~whnp 445 (526)
T KOG2135|consen 373 SPLALEKSPFGLNTIADLNPHFAQFGEIENIQVDYS------SLHAVVTFKTRAEAGEAY-ASHGAVLNNRFIKLFWHNP 445 (526)
T ss_pred chhhhhccCCCCchHhhhhhhhhhcCccccccccCc------hhhheeeeeccccccchh-ccccceecCceeEEEEecC
Confidence 445555555555 56889999999999999987543 347999999999998887 4799999999999999764
No 136
>KOG2253|consensus
Probab=96.92 E-value=0.00083 Score=55.23 Aligned_cols=70 Identities=16% Similarity=0.214 Sum_probs=59.7
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
++..+|||+|+...+..+-+..++..+|.|..+... -|+|++|..+.....|+..|+...++|..+.+..
T Consensus 38 ~~~~~vfv~~~~~~~s~~~~~~il~~~g~v~s~kr~---------~fgf~~f~~~~~~~ra~r~~t~~~~~~~kl~~~~ 107 (668)
T KOG2253|consen 38 PPRDTVFVGNISYLVSQEFWKSILAKSGFVPSWKRD---------KFGFCEFLKHIGDLRASRLLTELNIDDQKLIENV 107 (668)
T ss_pred CCCceeEecchhhhhhHHHHHHHHhhCCcchhhhhh---------hhcccchhhHHHHHHHHHHhcccCCCcchhhccc
Confidence 445789999999999999999999999988766543 2899999999999999988998889888776665
No 137
>KOG0112|consensus
Probab=96.82 E-value=0.00027 Score=59.78 Aligned_cols=77 Identities=19% Similarity=0.367 Sum_probs=61.9
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEee
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
..|||++||...+++.+|...|..+|.|..|.|-..+ -+.-.-|+||.|.+...+..|+..+.+..|..-.+++.+.
T Consensus 372 trTLf~Gnl~~kl~eseiR~af~e~gkve~VDiKtP~-~~~esa~~f~~~~n~dmtp~ak~e~s~~~I~~g~~r~glG 448 (975)
T KOG0112|consen 372 TRTLFLGNLDSKLTESEIRPAFDESGKVEEVDIKTPH-IKTESAYAFVSLLNTDMTPSAKFEESGPLIGNGTHRIGLG 448 (975)
T ss_pred hhhhhhcCcccchhhhhhhhhhhhhccccccccccCC-CCcccchhhhhhhccccCcccchhhcCCccccCccccccc
Confidence 4689999999999999999999999999999885543 2223459999999999999998888888775544444443
No 138
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=96.72 E-value=0.0068 Score=43.04 Aligned_cols=60 Identities=18% Similarity=0.225 Sum_probs=45.3
Q ss_pred CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC--CcccCCceEEEEeee
Q psy7051 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD--GRMLDGRELRVQMAR 91 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~--g~~i~g~~i~v~~a~ 91 (158)
..+.|+++|..|+.+..+.+... .+-..|.|.+.++|..|...|+ +..+.|..|+|.++.
T Consensus 8 ~~~~l~~l~~~~~~~~~~~~L~s------FrRi~v~f~~~~~A~~~r~~l~~~~~~~~g~~l~~yf~~ 69 (184)
T PF04847_consen 8 NLAELEELFSTYDPPVQFSPLKS------FRRIRVVFESPESAQRARQLLHWDGTSFNGKRLRVYFGQ 69 (184)
T ss_dssp -HHHHHHHHHTT-SS-EEEEETT------TTEEEEE-SSTTHHHHHHHTST--TSEETTEE-EEE---
T ss_pred hHHHHHHHHHhcCCceEEEEcCC------CCEEEEEeCCHHHHHHHHHHhcccccccCCCceEEEEcc
Confidence 45789999999998887766554 3568999999999999999999 889999999999885
No 139
>KOG4285|consensus
Probab=96.61 E-value=0.016 Score=43.84 Aligned_cols=61 Identities=25% Similarity=0.456 Sum_probs=47.3
Q ss_pred EEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCce
Q psy7051 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRE 84 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~ 84 (158)
|-|-++++. ....|..+|++||+|....... +-.|-+|.|.+.-+|++||. .||..|+|..
T Consensus 200 VTVfGFppg-~~s~vL~~F~~cG~Vvkhv~~~------ngNwMhirYssr~~A~KALs-kng~ii~g~v 260 (350)
T KOG4285|consen 200 VTVFGFPPG-QVSIVLNLFSRCGEVVKHVTPS------NGNWMHIRYSSRTHAQKALS-KNGTIIDGDV 260 (350)
T ss_pred EEEeccCcc-chhHHHHHHHhhCeeeeeecCC------CCceEEEEecchhHHHHhhh-hcCeeeccce
Confidence 445578876 4566888999999997765531 23589999999999999996 5999998764
No 140
>KOG2591|consensus
Probab=96.61 E-value=0.005 Score=49.99 Aligned_cols=71 Identities=13% Similarity=0.258 Sum_probs=55.8
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhh--cCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc--ccCCceEEE
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFER--CGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR--MLDGRELRV 87 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~--~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~--~i~g~~i~v 87 (158)
..+.|+|.-||..+-.++|+.||.. +-.++.|.+-.+. .=||+|++..||+.|++.|... +|.|+.|..
T Consensus 174 kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa~N~-------nWyITfesd~DAQqAykylreevk~fqgKpImA 246 (684)
T KOG2591|consen 174 KRCIVILREIPETTPIEVVKALFKGENCPKVISCEFAHND-------NWYITFESDTDAQQAYKYLREEVKTFQGKPIMA 246 (684)
T ss_pred ceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeeeecC-------ceEEEeecchhHHHHHHHHHHHHHhhcCcchhh
Confidence 3467889999999999999999976 5677888775542 3599999999999999877643 577777654
Q ss_pred Ee
Q psy7051 88 QM 89 (158)
Q Consensus 88 ~~ 89 (158)
.+
T Consensus 247 RI 248 (684)
T KOG2591|consen 247 RI 248 (684)
T ss_pred hh
Confidence 43
No 141
>KOG2068|consensus
Probab=96.58 E-value=0.00078 Score=51.28 Aligned_cols=78 Identities=21% Similarity=0.368 Sum_probs=57.6
Q ss_pred cEEEEeCCCCCCCHH-HHH--HHhhhcCceEEEEEccCCC--CCC-cccEEEEEEcCHHHHHHHHHHhCCcccCCceEEE
Q psy7051 14 VSLKVDNLTYRTTPD-DLR--RVFERCGEVGDIYIPRDRF--TRE-SRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRV 87 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~-~l~--~~f~~~G~i~~v~~~~~~~--~~~-~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v 87 (158)
.-+||-+|+..+..+ .|+ +.|.+||.|..|.+..+.. ... ...-+||+|...++|..||...+|..+.|+.|+.
T Consensus 78 nlvyvvgl~~~~ade~~l~~~eyfgqygki~ki~~~~~~S~~s~~~~~~s~yITy~~~eda~rci~~v~g~~~dg~~lka 157 (327)
T KOG2068|consen 78 NLVYVVGLPLDLADESVLERTEYFGQYGKINKIVKNKDPSSSSSSGGTCSVYITYEEEEDADRCIDDVDGFVDDGRALKA 157 (327)
T ss_pred hhhhhhCCCccccchhhhhCcccccccccceEEeecCCcccccCCCCCCcccccccchHhhhhHHHHhhhHHhhhhhhHH
Confidence 347788888776444 443 5788999999888766541 111 1224899999999999999999999999988777
Q ss_pred Eeee
Q psy7051 88 QMAR 91 (158)
Q Consensus 88 ~~a~ 91 (158)
.+..
T Consensus 158 ~~gt 161 (327)
T KOG2068|consen 158 SLGT 161 (327)
T ss_pred hhCC
Confidence 7643
No 142
>KOG0804|consensus
Probab=96.30 E-value=0.023 Score=45.20 Aligned_cols=68 Identities=24% Similarity=0.348 Sum_probs=57.3
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCC
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDG 82 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g 82 (158)
.+.|+|-.+|..++..||..|+..+- .|..+.++.+. -.+.-.++|.|.+.++|...++.+||..|+.
T Consensus 74 ~~mLcilaVP~~mt~~Dll~F~~~~~~~I~~irivRd~--~pnrymvLIkFr~q~da~~Fy~efNGk~Fn~ 142 (493)
T KOG0804|consen 74 STMLCILAVPAYMTSHDLLRFCASFIKQISDIRIVRDG--MPNRYMVLIKFRDQADADTFYEEFNGKQFNS 142 (493)
T ss_pred CcEEEEEeccccccHHHHHHHHHHHhhhhheeEEeecC--CCceEEEEEEeccchhHHHHHHHcCCCcCCC
Confidence 67899999999999999999998764 68888888863 2234568999999999999999999998754
No 143
>KOG4574|consensus
Probab=95.96 E-value=0.0054 Score=52.01 Aligned_cols=70 Identities=29% Similarity=0.393 Sum_probs=57.8
Q ss_pred EEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc--CCceEEEEeeec
Q psy7051 17 KVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML--DGRELRVQMARY 92 (158)
Q Consensus 17 ~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i--~g~~i~v~~a~~ 92 (158)
++-|.+-..+-.-|..+|.+||.|..++...+ ...|.|+|...+.|..|++.|+|.++ .|-+.+|.+|+.
T Consensus 302 ~~~nn~v~~tSssL~~l~s~yg~v~s~wtlr~------~N~alvs~~s~~sai~a~dAl~gkevs~~g~Ps~V~~ak~ 373 (1007)
T KOG4574|consen 302 SLENNAVNLTSSSLATLCSDYGSVASAWTLRD------LNMALVSFSSVESAILALDALQGKEVSVTGAPSRVSFAKT 373 (1007)
T ss_pred hhhcccccchHHHHHHHHHhhcchhhheeccc------ccchhhhhHHHHHHHHhhhhhcCCcccccCCceeEEeccc
Confidence 34444555667788999999999999988666 35899999999999999999999974 678899999873
No 144
>KOG4210|consensus
Probab=95.76 E-value=0.008 Score=45.63 Aligned_cols=80 Identities=18% Similarity=0.118 Sum_probs=63.3
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...++|++++...+.+.++..++..+|.+..+.+.........+++++|.|...+.+..|+.......+.+..+...+.+
T Consensus 87 ~~~~~f~g~~s~~~e~~~~~~~~~~~g~~~~~~~S~~~~~~~sk~~~s~~f~~ks~~~~~l~~s~~~~~~~~~~~~dl~~ 166 (285)
T KOG4210|consen 87 SSSTFFVGELSENIEESEDDNFSSEAGLRVDARSSSLEDSLSSKGGLSVHFAGKSQFFAALEESGSKVLDGNKGEKDLNT 166 (285)
T ss_pred ccccccccccccchhhccccccchhhcCcccchhhhhccccccccceeeccccHHHHHHHHHhhhccccccccccCcccc
Confidence 46789999999999999999999999988777666655677789999999999999999996433345555555555444
No 145
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=95.51 E-value=0.11 Score=30.49 Aligned_cols=55 Identities=24% Similarity=0.404 Sum_probs=43.2
Q ss_pred CCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEE
Q psy7051 24 RTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRV 87 (158)
Q Consensus 24 ~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v 87 (158)
.++.++|+..|..|+- ..| ..++ +| -||.|.+..+|+.|+...++..+.+..|.+
T Consensus 11 ~~~v~d~K~~Lr~y~~-~~I--~~d~-tG-----fYIvF~~~~Ea~rC~~~~~~~~~f~y~m~M 65 (66)
T PF11767_consen 11 GVTVEDFKKRLRKYRW-DRI--RDDR-TG-----FYIVFNDSKEAERCFRAEDGTLFFTYRMQM 65 (66)
T ss_pred CccHHHHHHHHhcCCc-ceE--EecC-CE-----EEEEECChHHHHHHHHhcCCCEEEEEEEEe
Confidence 5688999999999963 233 3332 33 589999999999999999999888877765
No 146
>PRK11634 ATP-dependent RNA helicase DeaD; Provisional
Probab=94.99 E-value=1.7 Score=36.88 Aligned_cols=70 Identities=9% Similarity=0.156 Sum_probs=50.7
Q ss_pred CcEEEEeCC--CCCCCHHHHHHHhhhcCce-----EEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceE
Q psy7051 13 MVSLKVDNL--TYRTTPDDLRRVFERCGEV-----GDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGREL 85 (158)
Q Consensus 13 ~~~l~V~nl--~~~~~~~~l~~~f~~~G~i-----~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i 85 (158)
+.++|| |+ ...++..+|..++..-+.| -.|.|.. .|.||+... +.+...+..|++..+.|+.|
T Consensus 486 ~~~~~~-~~g~~~~~~~~~~~~~i~~~~~~~~~~ig~i~i~~--------~~s~v~~~~-~~~~~~~~~~~~~~~~~~~~ 555 (629)
T PRK11634 486 MQLYRI-EVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFA--------SHSTIELPK-GMPGEVLQHFTRTRILNKPM 555 (629)
T ss_pred CEEEEE-ecccccCCCHHHHHHHHHhhcCCChhhCCcEEEeC--------CceEEEcCh-hhHHHHHHHhccccccCCce
Confidence 455655 44 4568888888888766544 4455533 378998864 55778888999999999999
Q ss_pred EEEeeec
Q psy7051 86 RVQMARY 92 (158)
Q Consensus 86 ~v~~a~~ 92 (158)
.|+.++.
T Consensus 556 ~~~~~~~ 562 (629)
T PRK11634 556 NMQLLGD 562 (629)
T ss_pred EEEECCC
Confidence 9998753
No 147
>KOG2318|consensus
Probab=94.40 E-value=0.4 Score=39.57 Aligned_cols=79 Identities=18% Similarity=0.205 Sum_probs=57.8
Q ss_pred CCCCcEEEEeCCCCC-CCHHHHHHHhhhc----CceEEEEEccCC----------CCCC---------------------
Q psy7051 10 IEGMVSLKVDNLTYR-TTPDDLRRVFERC----GEVGDIYIPRDR----------FTRE--------------------- 53 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~-~~~~~l~~~f~~~----G~i~~v~~~~~~----------~~~~--------------------- 53 (158)
...+..|-|.||.|. +...+|.-+|..| |.|..|.|...- ..|.
T Consensus 171 ~~~T~RLAVvNMDWd~v~AkDL~v~~nSFlP~gGsilSV~IYpSeFGkeRM~eEeV~GP~~el~~~~e~~~~s~sD~ee~ 250 (650)
T KOG2318|consen 171 GEETKRLAVVNMDWDRVKAKDLFVLFNSFLPKGGSILSVKIYPSEFGKERMKEEEVHGPPKELFKPVEEYKESESDDEEE 250 (650)
T ss_pred ccccceeeEeccccccccHHHHHHHHHhhcCCCCceeEEEechhhhhHHHhhhhcccCChhhhccccccCcccccchhhh
Confidence 456788999999985 6888999998876 477777653210 0111
Q ss_pred ----------------cccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 54 ----------------SRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 54 ----------------~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
.--||.|+|.+.+.|.++++.|+|.+|...-..+.
T Consensus 251 ~~~~~~kLR~Yq~~rLkYYyAVvecDsi~tA~~vYe~CDG~EfEsS~~~~D 301 (650)
T KOG2318|consen 251 EDVDREKLRQYQLNRLKYYYAVVECDSIETAKAVYEECDGIEFESSANKLD 301 (650)
T ss_pred hhHHHHHHHHHHhhhheeEEEEEEecCchHHHHHHHhcCcceeccccceee
Confidence 12489999999999999999999999865444433
No 148
>KOG2193|consensus
Probab=94.37 E-value=0.0039 Score=49.11 Aligned_cols=74 Identities=15% Similarity=0.265 Sum_probs=62.0
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEc-cCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIP-RDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~-~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..+.|.|+|+...|+.|..++..||.++.|..+ ++. ..-..-|+|.+.+.+..||.+|+|..|....++|.|-.
T Consensus 81 rk~Qirnippql~wevld~Ll~qyg~ve~~eqvnt~~----etavvnvty~~~~~~~~ai~kl~g~Q~en~~~k~~YiP 155 (584)
T KOG2193|consen 81 RKIQIRNIPPQLQWEVLDSLLAQYGTVENCEQVNTDS----ETAVVNVTYSAQQQHRQAIHKLNGPQLENQHLKVGYIP 155 (584)
T ss_pred hhhhHhcCCHHHHHHHHHHHHhccCCHhHhhhhccch----HHHHHHHHHHHHHHHHHHHHhhcchHhhhhhhhcccCc
Confidence 458899999999999999999999999888653 232 23345678999999999999999999999999998854
No 149
>KOG4207|consensus
Probab=94.29 E-value=0.41 Score=34.61 Aligned_cols=55 Identities=29% Similarity=0.235 Sum_probs=29.5
Q ss_pred CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHH--HhCCccc
Q psy7051 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALD--AMDGRML 80 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~--~l~g~~i 80 (158)
+---|.+-+...|+|---.-....+.....-.-|-.=.+.++|.+|+. .|+|.+|
T Consensus 29 ~LrrvFekYG~vgDVyIPrdr~Tr~sRgFaFVrf~~k~daedA~damDG~~ldgRel 85 (256)
T KOG4207|consen 29 DLRRVFEKYGRVGDVYIPRDRYTRQSRGFAFVRFHDKRDAEDALDAMDGAVLDGREL 85 (256)
T ss_pred HHHHHHHHhCcccceecccccccccccceeEEEeeecchHHHHHHhhcceeecccee
Confidence 444455666666654221111111111122245666677888888887 4888877
No 150
>KOG4483|consensus
Probab=94.21 E-value=0.16 Score=39.99 Aligned_cols=58 Identities=22% Similarity=0.199 Sum_probs=48.8
Q ss_pred CCCCCcEEEEeCCCCCCCHHHHHHHhhhcCc-eEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHH
Q psy7051 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGE-VGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALD 73 (158)
Q Consensus 9 ~~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~-i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~ 73 (158)
+.+-.+.|-|-++|.....+||...|+.|++ -..|.|+.+. +||-.|.+...|..||.
T Consensus 387 e~dlpHVlEIydfp~efkteDll~~f~~yq~kgfdIkWvDdt-------halaVFss~~~AaeaLt 445 (528)
T KOG4483|consen 387 ESDLPHVLEIYDFPDEFKTEDLLKAFETYQNKGFDIKWVDDT-------HALAVFSSVNRAAEALT 445 (528)
T ss_pred cccccceeEeccCchhhccHHHHHHHHHhhcCCceeEEeecc-------eeEEeecchHHHHHHhh
Confidence 3445678999999999999999999999974 4667787774 79999999999999985
No 151
>KOG0835|consensus
Probab=93.58 E-value=0.092 Score=40.30 Aligned_cols=10 Identities=20% Similarity=0.418 Sum_probs=4.8
Q ss_pred CCCHHHHHHH
Q psy7051 24 RTTPDDLRRV 33 (158)
Q Consensus 24 ~~~~~~l~~~ 33 (158)
.+++.+|.++
T Consensus 212 d~~k~eid~i 221 (367)
T KOG0835|consen 212 DTTKREIDEI 221 (367)
T ss_pred CCcHHHHHHH
Confidence 3455555444
No 152
>smart00596 PRE_C2HC PRE_C2HC domain.
Probab=92.92 E-value=0.25 Score=29.18 Aligned_cols=61 Identities=15% Similarity=0.216 Sum_probs=46.3
Q ss_pred HHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 28 DDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 28 ~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
.+|++.|...| ++.++..+....+..+...-||+.....+... .|+=..|++..|.|+-..
T Consensus 2 ~~I~~~L~~~G~~v~~i~~m~~~~~r~P~nmf~vel~~~~~~~~---Il~ik~Lg~~~V~VEr~~ 63 (69)
T smart00596 2 SQIEEALKDIGFPVLFIHNMLNRDTKNPQNMFEVELVPAANGKE---ILNIKTLGGQRVTVERPH 63 (69)
T ss_pred HHHHHHHHHcCCceeEEEcccccCCCCcceeEEEEeeecCCCcc---eEeehhhCCeeEEEecCc
Confidence 46888899998 88888888877777777888999887655444 345567889998887543
No 153
>KOG4019|consensus
Probab=92.49 E-value=0.18 Score=35.55 Aligned_cols=73 Identities=15% Similarity=0.193 Sum_probs=53.9
Q ss_pred cEEEEeCCCCCCCH-----HHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCc-eEEE
Q psy7051 14 VSLKVDNLTYRTTP-----DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGR-ELRV 87 (158)
Q Consensus 14 ~~l~V~nl~~~~~~-----~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~-~i~v 87 (158)
+++++.+++..+.. ...+.+|.+|-+...+.+... .+..-|.|.+++.|..|...++...|.|. .++.
T Consensus 11 ~~~~~c~i~~~VF~~~~~k~~~~~lFrq~n~~~~fq~lrs------frrvRi~f~~p~~a~~a~i~~~~~~f~~~~~~k~ 84 (193)
T KOG4019|consen 11 TAIIACDIHEEVFVNREDKALFENLFRQINEDATFQLLRS------FRRVRINFSNPEAAADARIKLHSTSFNGKNELKL 84 (193)
T ss_pred ceeeeecccHHhhccHHHHHHHHhHHhhhCcchHHHHHHh------hceeEEeccChhHHHHHHHHhhhcccCCCceEEE
Confidence 55888888776532 234566777766665555443 45677899999999999999999999988 7887
Q ss_pred Eeeec
Q psy7051 88 QMARY 92 (158)
Q Consensus 88 ~~a~~ 92 (158)
-++..
T Consensus 85 yfaQ~ 89 (193)
T KOG4019|consen 85 YFAQP 89 (193)
T ss_pred EEccC
Confidence 77763
No 154
>PF07530 PRE_C2HC: Associated with zinc fingers; InterPro: IPR006579 This domain is present in proteins found exclusively in the arthropods, including a number of Drosophila species, the silk moth and the gypsy moth. These proteins are possibly involved in RNA binding or single strand DNA binding.
Probab=92.14 E-value=0.55 Score=27.72 Aligned_cols=61 Identities=20% Similarity=0.328 Sum_probs=45.9
Q ss_pred HHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 28 DDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 28 ~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
++|.+.|...| +|.+|.-+....+..+....||+.+...+...+ ++=..|++..|+|+...
T Consensus 2 ~~I~~~L~~~G~~v~~i~~~~~~~~k~pl~mf~veL~p~~~~k~i---~~Ik~l~~~~V~vE~~~ 63 (68)
T PF07530_consen 2 EEIKEELKDQGHPVRNIHNMHSRNTKKPLNMFFVELEPKPNNKEI---YKIKTLCGQRVKVERPR 63 (68)
T ss_pred HHHHHHHHHcCCceEEEEccccCCCCCCceEEEEeeccCccccce---eehHhhCCeEEEEecCC
Confidence 46888888888 788888777776777888999999876664433 44567888889988654
No 155
>KOG4410|consensus
Probab=90.49 E-value=0.87 Score=34.53 Aligned_cols=46 Identities=20% Similarity=0.248 Sum_probs=35.8
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCce-EEEEEccCCCCCCcccEEEEEEcCH
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEV-GDIYIPRDRFTRESRGFAFVRFFDK 65 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i-~~v~~~~~~~~~~~~g~afV~f~~~ 65 (158)
+-|+|+||+..+...+|+..+.+.+.+ ..+.|. | +.+-||+.|.+.
T Consensus 331 ~di~~~nl~rd~rv~dlk~~lr~~~~~pm~iswk-----g-~~~k~flh~~~~ 377 (396)
T KOG4410|consen 331 TDIKLTNLSRDIRVKDLKSELRKRECTPMSISWK-----G-HFGKCFLHFGNR 377 (396)
T ss_pred cceeeccCccccchHHHHHHHHhcCCCceeEeee-----c-CCcceeEecCCc
Confidence 569999999999999999999998744 444442 2 357899999754
No 156
>KOG0107|consensus
Probab=88.22 E-value=1.8 Score=30.58 Aligned_cols=12 Identities=17% Similarity=-0.061 Sum_probs=7.4
Q ss_pred CcEEEEeCCCCC
Q psy7051 13 MVSLKVDNLTYR 24 (158)
Q Consensus 13 ~~~l~V~nl~~~ 24 (158)
...|||.-.|+.
T Consensus 37 lrsvWvArnPPG 48 (195)
T KOG0107|consen 37 LRSVWVARNPPG 48 (195)
T ss_pred ceeEEEeecCCC
Confidence 456777665554
No 157
>KOG1295|consensus
Probab=86.88 E-value=0.97 Score=35.50 Aligned_cols=69 Identities=19% Similarity=0.258 Sum_probs=48.9
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCc-eEEEEEccCCCC--CCcccEEEEEEcCHHHHHHHHHHhCCccc
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGE-VGDIYIPRDRFT--RESRGFAFVRFFDKRDAEDALDAMDGRML 80 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~-i~~v~~~~~~~~--~~~~g~afV~f~~~~~a~~a~~~l~g~~i 80 (158)
....|.|.+||+..++.+|.+.+.+|-. +....+...... ....+.|||.|..+++.......++|..|
T Consensus 6 ~~~Kvv~rrlpp~l~~~~~~eqi~p~~~~v~~~~F~~a~~s~~~~~ysrayinFk~~~dv~ef~~~f~g~if 77 (376)
T KOG1295|consen 6 AKVKVVVRRLPPKLTEEQLLEQINPFPEHVNWEFFAKADESLRNHKYSRAYINFKNPEDVEEFRRRFDGYIF 77 (376)
T ss_pred cceeeeeecCCCcccHHHHhhhcCCCccccchheeccccccchhhhhhhhhhccccHHHHHHHHhhCCceEE
Confidence 3467889999999999999988887642 222223211111 12356799999999999988888898865
No 158
>KOG2295|consensus
Probab=85.94 E-value=0.12 Score=42.35 Aligned_cols=73 Identities=15% Similarity=0.102 Sum_probs=54.8
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceE
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGREL 85 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i 85 (158)
..+|||.|++++++-.+|+.++..+--+..+.+..........-+.+|+|.---....|+.+||+..+....+
T Consensus 231 e~sll~rni~Pnis~aeIe~~ck~i~~~lrfals~~~aek~~~r~lwv~fk~~~ni~~a~~aLn~irl~s~~~ 303 (648)
T KOG2295|consen 231 ECSLLVRNILPNISVAEIENLCKGIPGFLRFALSTINAEKNFERRLWVTFKRGTNIKEACWALNGIRLRSNFL 303 (648)
T ss_pred HHHHHHhccCCcccHHHHHHHhccCchheeeeccCchHHHHHHHHhhHhhccccchHHHHHHhhhcccccccc
Confidence 3578999999999999999999998766666554443333345678999997777888888888876654443
No 159
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=85.32 E-value=6.1 Score=24.34 Aligned_cols=57 Identities=19% Similarity=0.223 Sum_probs=40.4
Q ss_pred EEEeCCCCCCCHHHHHHHhhh-cC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 16 LKVDNLTYRTTPDDLRRVFER-CG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~-~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
.|+--++..++..+|++.++. || .|..|....-+ .+ .-=|||.+...++|......+
T Consensus 23 ~y~F~V~~~anK~eIK~AvE~lf~VkV~~VnT~~~~-~~--~KKA~V~L~~g~~A~~va~ki 81 (84)
T PRK14548 23 KLTFIVDRRATKPDIKRAVEELFDVKVEKVNTLITP-KG--EKKAYVKLAEEYDAEEIASRL 81 (84)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcC-CC--cEEEEEEeCCCCcHHHHHHhh
Confidence 344456888999999999988 66 67777554433 11 235999999888888776543
No 160
>PF03468 XS: XS domain; InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important. The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=84.51 E-value=3.4 Score=27.11 Aligned_cols=55 Identities=15% Similarity=0.184 Sum_probs=28.1
Q ss_pred EEEEeCCCCC---------CCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCH-HHHHHHH
Q psy7051 15 SLKVDNLTYR---------TTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDK-RDAEDAL 72 (158)
Q Consensus 15 ~l~V~nl~~~---------~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~-~~a~~a~ 72 (158)
++.|-|++.. ++.++|.+.|..|..++ +..+.+.. ...|+++|+|..- .-...|+
T Consensus 10 mgIi~N~~~~~~~~g~~~g~~~~~l~~~l~~f~p~k-v~~l~~~~--gh~g~aiv~F~~~w~Gf~~A~ 74 (116)
T PF03468_consen 10 MGIIVNIPTEKDDDGRWVGMSNEELLDKLAEFNPLK-VKPLYGKQ--GHTGFAIVEFNKDWSGFKNAM 74 (116)
T ss_dssp EEEEE----EE-TTS-EE---SHHHHHHHHH---SE-EEEEEETT--EEEEEEEEE--SSHHHHHHHH
T ss_pred EEEEEcCccccCCCCceeccCHHHHHHHHHhcCCce-eEECcCCC--CCcEEEEEEECCChHHHHHHH
Confidence 4667788553 35678999999998775 33334432 3678999999854 3344444
No 161
>KOG4365|consensus
Probab=84.23 E-value=0.17 Score=40.36 Aligned_cols=78 Identities=5% Similarity=-0.125 Sum_probs=58.7
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeeec
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMARY 92 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~~ 92 (158)
+..|+..|+..+++.++.-+|..||-|..+.+...-..+.....+||+..+ .++..+|.-+....+.|.++.|.++..
T Consensus 4 ~~~~l~d~~~~~~~~~~~~~~~d~~~i~~~d~~~~~~~~~~~v~~f~~~~~-~~~~~~i~~~k~q~~~~~~~r~~~~~~ 81 (572)
T KOG4365|consen 4 MKKSLKDSVASNNKDQNSMKHEDPSIISMEDGSPYVNGSLGEVTPFQHAKK-ANGPNYIQPQKRQTTFESQDRKAVSPS 81 (572)
T ss_pred hhhhHhhcccccccchhhhhccCCcceeeccCCccccCCcceeeeeeeeec-cCcccccCHHHHhhhhhhhhhhhcCch
Confidence 456778899999999999999999999888775544444445567776654 566777777777778888888888763
No 162
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=83.95 E-value=3 Score=31.77 Aligned_cols=78 Identities=14% Similarity=0.255 Sum_probs=56.3
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCC-------CCCCcccEEEEEEcCHHHHHHHHHH----hCC--ccc
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR-------FTRESRGFAFVRFFDKRDAEDALDA----MDG--RML 80 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~-------~~~~~~g~afV~f~~~~~a~~a~~~----l~g--~~i 80 (158)
..|.+.||..+++-..+...|.+||+|+.|.++.+. ..........+-|-+.+.|...+.. |.. ..+
T Consensus 16 RSLLfeNv~~sidLh~Fl~~fv~~~pIESiYL~~~~~~~~d~~~~d~~~~SilLSFlsr~~CLdFYNnvLQrLsEfK~~L 95 (309)
T PF10567_consen 16 RSLLFENVNNSIDLHSFLTKFVKFGPIESIYLIKSNDKPSDDYNDDKNNQSILLSFLSREICLDFYNNVLQRLSEFKTKL 95 (309)
T ss_pred HHHHHhhccccccHHHHHHHhhccCceeEEEEecCCCcccccccccccceEEEEeeechHHHHHHHHHHHHHHHHHHHhc
Confidence 457788999999988899999999999999988764 1223345788899988887665432 221 246
Q ss_pred CCceEEEEeee
Q psy7051 81 DGRELRVQMAR 91 (158)
Q Consensus 81 ~g~~i~v~~a~ 91 (158)
....|.|.+..
T Consensus 96 ~S~~L~lsFV~ 106 (309)
T PF10567_consen 96 KSESLTLSFVS 106 (309)
T ss_pred CCcceeEEEEE
Confidence 66777777754
No 163
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=81.44 E-value=9.5 Score=23.08 Aligned_cols=57 Identities=16% Similarity=0.207 Sum_probs=39.9
Q ss_pred EEEeCCCCCCCHHHHHHHhhh-cC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 16 LKVDNLTYRTTPDDLRRVFER-CG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~-~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
-|+-.++..++..+|+..++. |+ .|..|....-+ . ..-=|||.+..-+.|......+
T Consensus 16 ~y~F~V~~~anK~eIK~avE~lf~VkV~~Vnt~~~~-~--~~KKA~VtL~~g~~a~~va~k~ 74 (77)
T TIGR03636 16 KLTFIVDRKATKGDIKRAVEKLFDVKVEKVNTLITP-R--GEKKAYVKLAEEYAAEEIASRL 74 (77)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcC-C--CceEEEEEECCCCcHHHHHHhh
Confidence 455567889999999999988 66 66666544332 1 1235999998888877765543
No 164
>KOG4454|consensus
Probab=80.78 E-value=0.43 Score=34.78 Aligned_cols=76 Identities=29% Similarity=0.423 Sum_probs=58.7
Q ss_pred CCCcEEEEeC----CCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 11 EGMVSLKVDN----LTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 11 ~~~~~l~V~n----l~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
+...+++.|+ |...++++.+...|+..|.+..+.+..+. .+.+..++|+.+.-....-.++...++..+.-+++.
T Consensus 78 e~q~~~r~G~shapld~r~~~ei~~~v~s~a~p~~~~R~~~~~-d~rnrn~~~~~~qr~~~~P~~~~~y~~l~~~~~~~~ 156 (267)
T KOG4454|consen 78 EEQRTLRCGNSHAPLDERVTEEILYEVFSQAGPIEGVRIPTDN-DGRNRNFGFVTYQRLCAVPFALDLYQGLELFQKKVT 156 (267)
T ss_pred hhhcccccCCCcchhhhhcchhhheeeecccCCCCCccccccc-cCCccCccchhhhhhhcCcHHhhhhcccCcCCCCcc
Confidence 4446777888 77788999999999999999999888775 477888999999877777777777677665555444
Q ss_pred E
Q psy7051 87 V 87 (158)
Q Consensus 87 v 87 (158)
+
T Consensus 157 ~ 157 (267)
T KOG4454|consen 157 I 157 (267)
T ss_pred c
Confidence 4
No 165
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=80.12 E-value=3.1 Score=30.01 Aligned_cols=62 Identities=23% Similarity=0.315 Sum_probs=42.3
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHH
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDAL 72 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~ 72 (158)
.....+++.+++..++..++..+|..+|.+..+.+.............++.+.....+..++
T Consensus 223 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 284 (306)
T COG0724 223 EKSDNLYVGNLPLKTAEEELADLFKSRGDIVRASLPPSKDGKIPKSRSFVGNEASKDALESN 284 (306)
T ss_pred cccceeeccccccccchhHHHHhccccccceeeeccCCCCCcccccccccchhHHHhhhhhh
Confidence 44578999999999999999999999999977776555433333334444444444444333
No 166
>COG5638 Uncharacterized conserved protein [Function unknown]
Probab=77.93 E-value=20 Score=28.92 Aligned_cols=28 Identities=21% Similarity=0.259 Sum_probs=24.2
Q ss_pred cEEEEEEcCHHHHHHHHHHhCCcccCCc
Q psy7051 56 GFAFVRFFDKRDAEDALDAMDGRMLDGR 83 (158)
Q Consensus 56 g~afV~f~~~~~a~~a~~~l~g~~i~g~ 83 (158)
-||.|++.+.+.+..++..++|.++...
T Consensus 259 YyAvvec~d~~tsK~iY~~CDG~Eye~s 286 (622)
T COG5638 259 YYAVVECEDIETSKNIYSACDGVEYENS 286 (622)
T ss_pred EEEEEEeccchhhHHHHhccCccccccc
Confidence 3789999999999999999999987653
No 167
>KOG4840|consensus
Probab=77.17 E-value=5.6 Score=29.51 Aligned_cols=76 Identities=12% Similarity=0.071 Sum_probs=44.2
Q ss_pred CCCcEEEEeCCCCCCCH----HHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEc-CHHHHHHHHHHhCCcccCCce
Q psy7051 11 EGMVSLKVDNLTYRTTP----DDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFF-DKRDAEDALDAMDGRMLDGRE 84 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~----~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~-~~~~a~~a~~~l~g~~i~g~~ 84 (158)
+....||||+|....-. ++|-..+-+.+ .+..+.+... ..||+.-..+ +.++...+++++.+..+....
T Consensus 35 ~~~~vvfiGGLgdgLl~~~y~~~L~~~lde~~wslVq~q~~Ss-----y~G~Gt~slk~D~edl~~l~~Hi~~~~fSt~v 109 (299)
T KOG4840|consen 35 ESVKVVFIGGLGDGLLICLYTTMLNRYLDENSWSLVQPQLRSS-----YNGYGTFSLKDDVEDLKCLLEHIQLCGFSTDV 109 (299)
T ss_pred eEEEEEEEcccCCCccccccHHHHHHHHhhccceeeeeecccc-----ccccccccccccHHHHHHHHHHhhccCcccce
Confidence 34568999999776432 34444443333 4544444222 2345544443 677888888877777766666
Q ss_pred EEEEeee
Q psy7051 85 LRVQMAR 91 (158)
Q Consensus 85 i~v~~a~ 91 (158)
+-|..++
T Consensus 110 VL~GhST 116 (299)
T KOG4840|consen 110 VLVGHST 116 (299)
T ss_pred EEEecCc
Confidence 6665544
No 168
>KOG2891|consensus
Probab=76.21 E-value=1.2 Score=33.72 Aligned_cols=34 Identities=26% Similarity=0.581 Sum_probs=27.0
Q ss_pred CcEEEEeCCCCC------------CCHHHHHHHhhhcCceEEEEEc
Q psy7051 13 MVSLKVDNLTYR------------TTPDDLRRVFERCGEVGDIYIP 46 (158)
Q Consensus 13 ~~~l~V~nl~~~------------~~~~~l~~~f~~~G~i~~v~~~ 46 (158)
..|||+.+||-. -++.-|...|+.||.|..|.++
T Consensus 149 pdti~la~ip~kwf~lkedg~~dlpse~rlr~a~eafg~ir~vdip 194 (445)
T KOG2891|consen 149 PDTIHLAGIPCKWFALKEDGSEDLPSEDRLRKAFEAFGEIRNVDIP 194 (445)
T ss_pred CCceeecCCcceeeeecccccccCChHHHHHHHHHHhccceecCCc
Confidence 467888888753 2566789999999999988875
No 169
>PF15513 DUF4651: Domain of unknown function (DUF4651)
Probab=75.90 E-value=9 Score=22.15 Aligned_cols=18 Identities=28% Similarity=0.802 Sum_probs=14.8
Q ss_pred HHHHHHhhhcCceEEEEE
Q psy7051 28 DDLRRVFERCGEVGDIYI 45 (158)
Q Consensus 28 ~~l~~~f~~~G~i~~v~~ 45 (158)
++|+++|+.+|+|.-+-+
T Consensus 9 ~~iR~~fs~lG~I~vLYv 26 (62)
T PF15513_consen 9 AEIRQFFSQLGEIAVLYV 26 (62)
T ss_pred HHHHHHHHhcCcEEEEEE
Confidence 679999999999876544
No 170
>PRK11901 hypothetical protein; Reviewed
Probab=74.38 E-value=9.3 Score=29.64 Aligned_cols=62 Identities=10% Similarity=0.210 Sum_probs=40.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEE--EEcCHHHHHHHHHHhCCcc
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV--RFFDKRDAEDALDAMDGRM 79 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV--~f~~~~~a~~a~~~l~g~~ 79 (158)
.++|.|..+ ..++.|..|..+++ +..+++......| ..+|..| .|.+.++|..|+..|....
T Consensus 245 ~YTLQL~Aa---s~~~~L~~f~~~~~-L~~~~VYqT~RnG-kpWYVVvyG~Y~Sr~eAk~Ai~sLPa~l 308 (327)
T PRK11901 245 HYTLQLSSA---SRSDTLNAYAKKQN-LSHYHVYETKRDG-KPWYVLVSGNYASSAEAKRAIATLPAEV 308 (327)
T ss_pred CeEEEeecC---CCHHHHHHHHHHcC-cCceEEEEEEECC-ceEEEEEecCcCCHHHHHHHHHhCCHHH
Confidence 356666554 46888888888875 4445444433233 2456555 5899999999999886543
No 171
>PRK08559 nusG transcription antitermination protein NusG; Validated
Probab=73.10 E-value=16 Score=24.99 Aligned_cols=34 Identities=21% Similarity=0.354 Sum_probs=25.8
Q ss_pred eEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 40 VGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 40 i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
|..|.++.. .+||.||+....+++..++..+.+.
T Consensus 36 i~~i~vp~~-----fpGYVfVe~~~~~~~~~~i~~v~~v 69 (153)
T PRK08559 36 IYAILAPPE-----LKGYVLVEAESKGAVEEAIRGIPHV 69 (153)
T ss_pred EEEEEccCC-----CCcEEEEEEEChHHHHHHHhcCCCE
Confidence 555555443 5899999999888888888877664
No 172
>KOG3580|consensus
Probab=70.04 E-value=21 Score=30.36 Aligned_cols=33 Identities=12% Similarity=0.180 Sum_probs=16.0
Q ss_pred cEEEEeCCCC-CCCHHHHHHHhhhcCceEEEEEc
Q psy7051 14 VSLKVDNLTY-RTTPDDLRRVFERCGEVGDIYIP 46 (158)
Q Consensus 14 ~~l~V~nl~~-~~~~~~l~~~f~~~G~i~~v~~~ 46 (158)
..|+|.+++- +++.....+.+.+.|++..|.+.
T Consensus 62 rvvMVNGvsMenv~haFAvQqLrksgK~A~Itvk 95 (1027)
T KOG3580|consen 62 RVVMVNGVSMENVLHAFAVQQLRKSGKVAAITVK 95 (1027)
T ss_pred eEEEEcCcchhhhHHHHHHHHHHhhccceeEEec
Confidence 3455544432 23333334445566666665553
No 173
>PF02714 DUF221: Domain of unknown function DUF221; InterPro: IPR003864 This domain is found in a family of hypothetical transmembrane proteins none of which have any known function, the aligned region is at 538 residues at maximum length.; GO: 0016020 membrane
Probab=68.50 E-value=7.1 Score=29.93 Aligned_cols=32 Identities=31% Similarity=0.347 Sum_probs=23.7
Q ss_pred EEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 58 AFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 58 afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
|||+|++..+|+.|++.+....- ..+.|..|.
T Consensus 1 aFVtF~~~~~a~~~~q~~~~~~~--~~~~v~~AP 32 (325)
T PF02714_consen 1 AFVTFNSQKSAQIALQLLLSKRP--NSWRVSPAP 32 (325)
T ss_pred CEEEECCHHHHHHHHHHHhcCCC--CCceEeeCC
Confidence 79999999999999986554432 445666654
No 174
>PRK10629 EnvZ/OmpR regulon moderator; Provisional
Probab=68.28 E-value=33 Score=22.89 Aligned_cols=70 Identities=13% Similarity=0.147 Sum_probs=48.3
Q ss_pred cEEEEeCCCCC---CCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 14 VSLKVDNLTYR---TTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 14 ~~l~V~nl~~~---~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
..|.|.+.... .+...+.+.+..-| .++.+.... +-..|.|.+.++...|.+.|....=.+..|.+..
T Consensus 36 pavQIs~~~~g~~~~~~~~v~~~L~~~gI~~ksi~~~~--------~~~~irf~~~~~Ql~Ak~vL~~~L~~~y~VAlnl 107 (127)
T PRK10629 36 STLAIRAVHQGASLPDGFYVYQHLDANGIHIKSITPEN--------DSLLIRFDSPEQSAAAKEVLDRTLPHGYIIAQQD 107 (127)
T ss_pred ceEEEecCCCCCccchHHHHHHHHHHCCCCcceEEeeC--------CEEEEEECCHHHHHHHHHHHHHHcCCCCEEEEec
Confidence 45778776443 56678888888887 556665532 3689999999998888887766554555666555
Q ss_pred ee
Q psy7051 90 AR 91 (158)
Q Consensus 90 a~ 91 (158)
+.
T Consensus 108 ~p 109 (127)
T PRK10629 108 DN 109 (127)
T ss_pred CC
Confidence 54
No 175
>PF00403 HMA: Heavy-metal-associated domain; InterPro: IPR006121 Proteins that transport heavy metals in micro-organisms and mammals share similarities in their sequences and structures. These proteins provide an important focus for research, some being involved in bacterial resistance to toxic metals, such as lead and cadmium, while others are involved in inherited human syndromes, such as Wilson's and Menke's diseases []. A conserved domain has been found in a number of these heavy metal transport or detoxification proteins []. The domain, which has been termed Heavy-Metal-Associated (HMA), contains two conserved cysteines that are probably involved in metal binding. Structure solution of the fourth HMA domain of the Menke's copper transporting ATPase shows a well-defined structure comprising a four-stranded antiparallel beta-sheet and two alpha helices packed in an alpha-beta sandwich fold []. This fold is common to other domains and is classified as "ferredoxin-like".; GO: 0046872 metal ion binding, 0030001 metal ion transport; PDB: 2VOY_A 1P6T_A 1KQK_A 2RML_A 1JWW_A 3K7R_F 1FES_A 1CC8_A 1FD8_A 2GGP_A ....
Probab=67.82 E-value=19 Score=20.04 Aligned_cols=54 Identities=11% Similarity=0.206 Sum_probs=39.7
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCH----HHHHHHHHH
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDK----RDAEDALDA 74 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~----~~a~~a~~~ 74 (158)
||.|.||.-..-...|+..+...-.|..+.+-.. .+-+-|.|... ++..++|+.
T Consensus 1 t~~v~~m~C~~C~~~v~~~l~~~~GV~~v~vd~~------~~~v~v~~~~~~~~~~~i~~~i~~ 58 (62)
T PF00403_consen 1 TFKVPGMTCEGCAKKVEKALSKLPGVKSVKVDLE------TKTVTVTYDPDKTSIEKIIEAIEK 58 (62)
T ss_dssp EEEEESTTSHHHHHHHHHHHHTSTTEEEEEEETT------TTEEEEEESTTTSCHHHHHHHHHH
T ss_pred CEEECCcccHHHHHHHHHHHhcCCCCcEEEEECC------CCEEEEEEecCCCCHHHHHHHHHH
Confidence 5778888877778889999999988888877543 35788888744 455555544
No 176
>PF03439 Spt5-NGN: Early transcription elongation factor of RNA pol II, NGN section; InterPro: IPR005100 Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4 []. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit []. ; PDB: 3H7H_B 3QQC_D 3P8B_B 2EXU_A.
Probab=67.54 E-value=14 Score=22.57 Aligned_cols=36 Identities=17% Similarity=0.393 Sum_probs=24.2
Q ss_pred ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcc
Q psy7051 39 EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRM 79 (158)
Q Consensus 39 ~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~ 79 (158)
.|..+-.+.+ .+||-|||=.+..++..|+..+.+..
T Consensus 33 ~I~Si~~~~~-----lkGyIyVEA~~~~~V~~ai~gi~~i~ 68 (84)
T PF03439_consen 33 NIYSIFAPDS-----LKGYIYVEAERESDVKEAIRGIRHIR 68 (84)
T ss_dssp ---EEEE-TT-----STSEEEEEESSHHHHHHHHTT-TTEE
T ss_pred ceEEEEEeCC-----CceEEEEEeCCHHHHHHHHhccccee
Confidence 4555544333 58999999999999999998776543
No 177
>KOG4008|consensus
Probab=67.44 E-value=9 Score=28.37 Aligned_cols=33 Identities=21% Similarity=0.328 Sum_probs=27.3
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEE
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI 43 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v 43 (158)
....+||+-|+|..++++.|..+.+++|-+..+
T Consensus 38 ~eKd~lfl~Nvp~~~tee~lkr~vsqlg~vq~~ 70 (261)
T KOG4008|consen 38 NEKDCLFLVNVPLLSTEEHLKRFVSQLGHVQEL 70 (261)
T ss_pred ccccceeeecccccccHHHHHHHHHHhhhhhhe
Confidence 344689999999999999999999998854433
No 178
>KOG3702|consensus
Probab=66.25 E-value=5.4 Score=33.73 Aligned_cols=71 Identities=21% Similarity=0.164 Sum_probs=52.9
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEE
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~ 86 (158)
+||+-|-....+..-+..++..++.++...++..+..+..-.-++++|..+..++.|. .|.+..+....++
T Consensus 513 ~i~~~~~~~~s~~~s~s~~s~~~~~ltk~k~l~~Cky~~~Ct~a~Ce~~HPtaa~~~~-s~p~k~fa~~~~k 583 (681)
T KOG3702|consen 513 TIFVANGHGGSNPDSLSRHSEKKNELTKAKILTRCKYGPACTSAECEFAHPTAAENAK-SLPNKKFASKCLK 583 (681)
T ss_pred ceecccccccCCCcchhhCcccccccccceeeccccCCCcCCchhhhhcCCcchhhhh-cccccccccccee
Confidence 7888888777777888888888988887777766656655668999999988887765 4566655444433
No 179
>COG5507 Uncharacterized conserved protein [Function unknown]
Probab=66.05 E-value=8.1 Score=24.53 Aligned_cols=23 Identities=22% Similarity=0.322 Sum_probs=18.0
Q ss_pred CcccEEEEEEcCHHHHHHHHHHh
Q psy7051 53 ESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 53 ~~~g~afV~f~~~~~a~~a~~~l 75 (158)
...-|++++|.+.+...+|..++
T Consensus 64 E~VvFsW~~Y~skq~rDA~~~km 86 (117)
T COG5507 64 EEVVFSWIEYPSKQVRDAANAKM 86 (117)
T ss_pred cEEEEEEEEcCchhHHHHHHHHh
Confidence 34568999999999888877654
No 180
>KOG0156|consensus
Probab=65.99 E-value=17 Score=29.99 Aligned_cols=59 Identities=15% Similarity=0.148 Sum_probs=43.4
Q ss_pred EEeCCCCCCC---HHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceE
Q psy7051 17 KVDNLTYRTT---PDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGREL 85 (158)
Q Consensus 17 ~V~nl~~~~~---~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i 85 (158)
+||||+.-.. ...|..+-++||+|-.+++-. .-.|...+.+.|++|+. -++..+.++..
T Consensus 36 iIGnl~~l~~~~~h~~~~~ls~~yGpi~tl~lG~---------~~~Vviss~~~akE~l~-~~d~~fa~Rp~ 97 (489)
T KOG0156|consen 36 IIGNLHQLGSLPPHRSFRKLSKKYGPVFTLRLGS---------VPVVVISSYEAAKEVLV-KQDLEFADRPD 97 (489)
T ss_pred ccccHHHcCCCchhHHHHHHHHHhCCeEEEEecC---------ceEEEECCHHHHHHHHH-hCCccccCCCC
Confidence 4788865433 345566667899999887732 24788889999999996 48888888875
No 181
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=65.82 E-value=3.8 Score=25.50 Aligned_cols=23 Identities=17% Similarity=0.235 Sum_probs=19.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhh
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFE 35 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~ 35 (158)
..+|.|.|||..+++++|++.++
T Consensus 52 ~rtVlvsgip~~l~ee~l~D~Le 74 (88)
T PF07292_consen 52 KRTVLVSGIPDVLDEEELRDKLE 74 (88)
T ss_pred CCEEEEeCCCCCCChhhheeeEE
Confidence 46899999999999999997653
No 182
>PTZ00191 60S ribosomal protein L23a; Provisional
Probab=64.58 E-value=39 Score=23.13 Aligned_cols=56 Identities=16% Similarity=0.124 Sum_probs=36.4
Q ss_pred EEEeCCCCCCCHHHHHHHhhh-cC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHH
Q psy7051 16 LKVDNLTYRTTPDDLRRVFER-CG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~-~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
.||--+...++..+|++.++. |+ .|..|..+.-+. | .-=|||.+....+|......
T Consensus 84 ~yvF~Vd~kAnK~qIK~AVEklf~VkV~kVNTli~p~-g--~KKA~V~L~~~~~aidva~k 141 (145)
T PTZ00191 84 TLVFIVDQRANKTQIKKAVEKLYDVKVVKVNTLITPD-G--LKKAYIRLSPDVDALDVANK 141 (145)
T ss_pred EEEEEEcCCCCHHHHHHHHHHHhCCeeEEEEeEEcCC-C--ceEEEEEECCCCcHHHHHHh
Confidence 344457788899999999987 65 566665443321 2 23599999877765554433
No 183
>PF12623 Hen1_L: RNA repair, ligase-Pnkp-associating, region of Hen1; InterPro: IPR024740 This entry represents the N-terminal domain of the bacterial Hen1 protein, which is a 3' terminal RNA ribose 2'-O-methyltransferase that is involved in bacterial RNA repair []. Hen1 forms a heterotetramer with Pnkp. The heterotetramer has been shown to repair transfer RNAs cleaved by ribotoxins in vitro []. The N-terminal domain of Hen1 contributes to the ligase activity of the heterotetramer.
Probab=62.17 E-value=29 Score=25.68 Aligned_cols=62 Identities=21% Similarity=0.291 Sum_probs=43.1
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCC---CCCCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDR---FTRESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~---~~~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
.-+|.|.-||-.-.++-++.+|++.| +|.--.+..+. +.|. .-|..|+.....-...|+..|
T Consensus 118 pL~v~~p~lp~rGg~~l~~rLFePLGw~V~a~~~~Ld~~fP~WG~-S~y~~l~L~g~~rl~daL~HL 183 (245)
T PF12623_consen 118 PLEVRLPALPCRGGEELVRRLFEPLGWTVTAEPVPLDEQFPEWGD-SRYVDLTLTGTVRLADALNHL 183 (245)
T ss_pred ceEEEeeeeecCCcHHHHHHhhcCcCceEEeEeccCCccCccccC-CcceEEEEeeeEEHHHHHhhh
Confidence 35788889998889999999999999 44433444443 2233 337778877766677777654
No 184
>COG5584 Predicted small secreted protein [Function unknown]
Probab=57.14 E-value=21 Score=22.53 Aligned_cols=33 Identities=9% Similarity=0.156 Sum_probs=24.2
Q ss_pred CCCCCCCHHHHHHHhhhcCceEEEEEccCCCCC
Q psy7051 20 NLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTR 52 (158)
Q Consensus 20 nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~ 52 (158)
|+.....-.-+++.|+++|+|+.-++...++.-
T Consensus 29 ~is~e~alk~vk~afk~~mnI~GSwI~~~pe~~ 61 (103)
T COG5584 29 NISRENALKVVKEAFKQFMNIKGSWIVYEPEVY 61 (103)
T ss_pred ccChhHHHHHHHHHhcccCCcceeEEEEecccc
Confidence 345555556788999999999988887766443
No 185
>PF04127 DFP: DNA / pantothenate metabolism flavoprotein; InterPro: IPR007085 This entry represents the C-terminal domain found in DNA/pantothenate metabolism flavoproteins, which affects synthesis of DNA and pantothenate metabolism. These proteins contain ATP, phosphopantothenate, and cysteine binding sites. The structure of this domain has been determined in human phosphopantothenoylcysteine (PPC) synthetase [] and as the PPC synthase domain (CoaB) from the Escherichia coli coenzyme A bifunctional protein CoaBC []. This domain adopts a 3-layer alpha/beta/alpha fold with mixed beta-sheets, which topologically resembles a combination of Rossmann-like and ribokinase-like folds. The structure of these proteins predicts a ping pong mechanism with initial formation of an acyladenylate intermediate, followed by release of pyrophosphate and attack by cysteine to form the final products PPC and AMP. ; PDB: 1U7W_A 1U7U_A 1U80_C 1U7Z_A 1P9O_B 2GK4_A.
Probab=55.83 E-value=46 Score=23.63 Aligned_cols=57 Identities=18% Similarity=0.141 Sum_probs=37.9
Q ss_pred EEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCCCC-CCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 16 LKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDRFT-RESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~~~-~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
=||+|.+...+-..|-+.|...| .|+.|. .+.. ..+.+.-.|.+.+.++...++..+
T Consensus 21 R~ItN~SSG~~G~~lA~~~~~~Ga~V~li~---g~~~~~~p~~~~~i~v~sa~em~~~~~~~ 79 (185)
T PF04127_consen 21 RFITNRSSGKMGAALAEEAARRGAEVTLIH---GPSSLPPPPGVKVIRVESAEEMLEAVKEL 79 (185)
T ss_dssp EEEEES--SHHHHHHHHHHHHTT-EEEEEE----TTS----TTEEEEE-SSHHHHHHHHHHH
T ss_pred eEecCCCcCHHHHHHHHHHHHCCCEEEEEe---cCccccccccceEEEecchhhhhhhhccc
Confidence 57899999989999998888887 444333 2212 224578999999999988888754
No 186
>PHA03008 hypothetical protein; Provisional
Probab=53.60 E-value=31 Score=24.95 Aligned_cols=39 Identities=13% Similarity=0.109 Sum_probs=32.6
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccC
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~ 48 (158)
.+.+..+||.|+..--+..-|+-+|.+|..+.++-++..
T Consensus 18 ~~~~d~~~~snit~~h~~n~i~~ff~~~d~~~~~ifvpg 56 (234)
T PHA03008 18 DEICDIAFISNITHIHDHNIIKIFFDKFDDFDEIIFVPG 56 (234)
T ss_pred cccccEEEEecccccccccHHHHHHhhccccceEEEccC
Confidence 455678999999998899999999999999888876543
No 187
>PF09869 DUF2096: Uncharacterized protein conserved in archaea (DUF2096); InterPro: IPR017098 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=53.49 E-value=76 Score=22.26 Aligned_cols=53 Identities=21% Similarity=0.258 Sum_probs=41.7
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
+..++-| +|+..+.++-|.++.+-+|-|.+. .+ ..-.+.|.+.+..+.||+.+
T Consensus 111 ~~~~iRv-~l~~~i~~erl~ei~E~~gvI~Ef---ee-------~~~V~I~Gdke~Ik~aLKe~ 163 (169)
T PF09869_consen 111 GFETIRV-KLKKPIQEERLQEISEWHGVIFEF---EE-------DDKVVIEGDKERIKKALKEF 163 (169)
T ss_pred CceeEEE-ecCccchHHHHHHHHHHhceeEEe---cC-------CcEEEEeccHHHHHHHHHHH
Confidence 3455666 899999999999999999988666 11 13577888999999999865
No 188
>PF09707 Cas_Cas2CT1978: CRISPR-associated protein (Cas_Cas2CT1978); InterPro: IPR010152 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799 from INTERPRO. Cas2 is one of four protein families (Cas1 to Cas4) that are associated with CRISPR elements and always occur near a repeat cluster, usually in the order cas3-cas4-cas1-cas2. The function of Cas2 (and Cas1) is unknown. Cas3 proteins appear to be helicases while Cas4 proteins resemble RecB-type exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression [].
Probab=52.31 E-value=36 Score=21.08 Aligned_cols=50 Identities=24% Similarity=0.264 Sum_probs=29.2
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEc
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFF 63 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~ 63 (158)
|...-|||++++..+-+.-.+.+.+..++-..+-+..+. + ..||+|-.+-
T Consensus 23 Ei~~GVyVg~~s~rVRe~lW~~v~~~~~~G~a~m~~~~~-n--eqG~~~~t~G 72 (86)
T PF09707_consen 23 EIRPGVYVGNVSARVRERLWERVTEWIGDGSAVMVWSDN-N--EQGFDFRTLG 72 (86)
T ss_pred ecCCCcEEcCCCHHHHHHHHHHHHhhCCCccEEEEEccC-C--CCCEEEEEeC
Confidence 344569999999887766555555544333222222221 1 5789988773
No 189
>COG2608 CopZ Copper chaperone [Inorganic ion transport and metabolism]
Probab=52.04 E-value=47 Score=19.41 Aligned_cols=45 Identities=22% Similarity=0.359 Sum_probs=35.0
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcC
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFD 64 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~ 64 (158)
.+|+|-++.-.-=...+...+.....+..+.+-.. .+-++|+|++
T Consensus 4 ~~l~v~~MtC~~C~~~V~~al~~v~gv~~v~v~l~------~~~~~V~~d~ 48 (71)
T COG2608 4 TTLKVEGMTCGHCVKTVEKALEEVDGVASVDVDLE------KGTATVTFDS 48 (71)
T ss_pred EEEEECCcCcHHHHHHHHHHHhcCCCeeEEEEEcc------cCeEEEEEcC
Confidence 56778777776667889999999888888877554 3459999987
No 190
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=51.73 E-value=22 Score=20.86 Aligned_cols=25 Identities=16% Similarity=0.239 Sum_probs=20.4
Q ss_pred EEEEEEcCHHHHHHHHHHhCCcccC
Q psy7051 57 FAFVRFFDKRDAEDALDAMDGRMLD 81 (158)
Q Consensus 57 ~afV~f~~~~~a~~a~~~l~g~~i~ 81 (158)
+.+|.|.+..+|..|-+.|....|.
T Consensus 3 ~~~i~F~st~~a~~~ek~lk~~gi~ 27 (73)
T PF11823_consen 3 YYLITFPSTHDAMKAEKLLKKNGIP 27 (73)
T ss_pred eEEEEECCHHHHHHHHHHHHHCCCc
Confidence 6899999999999988877766543
No 191
>COG0150 PurM Phosphoribosylaminoimidazole (AIR) synthetase [Nucleotide transport and metabolism]
Probab=51.65 E-value=6.4 Score=30.74 Aligned_cols=49 Identities=22% Similarity=0.338 Sum_probs=37.6
Q ss_pred CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
+...|.+++.+.|+|..-.+..-. +-|.+||..-.+++++.+++.|.+.
T Consensus 274 ~~p~iF~~i~~~G~v~~~EM~rtF----NmGvG~v~iv~~e~~~~~~~~l~~~ 322 (345)
T COG0150 274 PPPPIFKWLQKAGNVEREEMYRTF----NMGVGMVLIVPEEDAEKALALLKEQ 322 (345)
T ss_pred CCcHHHHHHHHhcCCCHHHHHHHh----cCccceEEEEcHHHHHHHHHHHHhc
Confidence 456788888899987655554332 3578999999999999999988875
No 192
>PF12829 Mhr1: Transcriptional regulation of mitochondrial recombination; InterPro: IPR024629 These proteins are involved in regulation of RNA polymerase II-dependent transcription. They are also involved in regulation of mitochondrial DNA recombination, maintenance, repair, and generation of homoplasmic cells [, , , ].
Probab=51.05 E-value=47 Score=20.80 Aligned_cols=54 Identities=15% Similarity=0.161 Sum_probs=35.1
Q ss_pred CCCCCCCHHHHHHHhhhcCc-eEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCC
Q psy7051 20 NLTYRTTPDDLRRVFERCGE-VGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDG 77 (158)
Q Consensus 20 nl~~~~~~~~l~~~f~~~G~-i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g 77 (158)
.+.+.++...|...|---|. -+-..+-.|-+ +-+|.|+|.+.+.+..|+..|-.
T Consensus 19 S~~p~l~~~~i~~Q~~~~gkk~~pp~lRkD~W----~pm~vv~f~~~~~g~~~yq~Lre 73 (91)
T PF12829_consen 19 SQTPNLDNNQILKQFPFPGKKNKPPSLRKDYW----RPMCVVNFPNYEVGVSAYQKLRE 73 (91)
T ss_pred ecCcccChhHHHHhccCCCcccCCchhccccc----eEeEEEECCChHHHHHHHHHHHH
Confidence 44556777777766655552 22233444432 34899999999999999887643
No 193
>PF11491 DUF3213: Protein of unknown function (DUF3213) ; InterPro: IPR021583 The backbone structure of this family of proteins has been determined however the function remains unknown. The protein has an alpha and beta structure with a ferredoxin-like fold []. ; PDB: 2F40_A.
Probab=50.92 E-value=46 Score=20.47 Aligned_cols=66 Identities=21% Similarity=0.297 Sum_probs=24.5
Q ss_pred EEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcC----HHHHHHHHHHhCCcccCCceEEEE
Q psy7051 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFD----KRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~----~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
|-++++.+.-. .+++-.+++-..|-.+-+ +|.. ..|||.|+. .+....+++.|....+..+.|.|+
T Consensus 3 lkfg~It~eeA-~~~QYeLsk~~~vyRvFi-----NgYa-r~g~VifDe~kl~~e~lL~~le~~kpEVi~ek~lTve 72 (88)
T PF11491_consen 3 LKFGNITPEEA-MVKQYELSKNEAVYRVFI-----NGYA-RNGFVIFDESKLSKEELLEMLEEFKPEVIEEKELTVE 72 (88)
T ss_dssp EE--S-TTTTT-HHHHHTTTTTTTB-----------TTS-S--EEE--B-S-SHHHH---HHHTTT-SS-------S
T ss_pred cccCCCCHHHH-HHHHHHhhcccceeeeee-----cccc-cceEEEECcccCCHHHHHHHHHhcChhheeeccccHH
Confidence 44566655422 234444566666655544 2222 268999974 467777888888877777777764
No 194
>KOG3424|consensus
Probab=49.43 E-value=75 Score=21.00 Aligned_cols=45 Identities=16% Similarity=0.152 Sum_probs=28.5
Q ss_pred CCCHHHHHHHhhh-cC----ceEEEEEccCCCCCCcccEEEEEEcCHHHHH
Q psy7051 24 RTTPDDLRRVFER-CG----EVGDIYIPRDRFTRESRGFAFVRFFDKRDAE 69 (158)
Q Consensus 24 ~~~~~~l~~~f~~-~G----~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~ 69 (158)
.++.+||.+-+.+ |- .|.-+.+-.....|+..|||.| |++.+.|.
T Consensus 34 ~vsK~EirEKla~mYkt~~d~V~vfgfrt~~GggkstgfalI-Ydsve~ak 83 (132)
T KOG3424|consen 34 NVSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTGFALI-YDSVEYAK 83 (132)
T ss_pred CCCHHHHHHHHHHHhcCCcceEEEEEeeeccCCcccceeeee-eehHHHHH
Confidence 4688888887766 32 2333345555567788889887 55554443
No 195
>PF05036 SPOR: Sporulation related domain; InterPro: IPR007730 This 70 residue domain is composed of two 35 residue repeats that are found in bacterial proteins involved in sporulation and cell division, such as FtsN, CwlM and RlpA. This repeat might be involved in binding peptidoglycan. FtsN is an essential cell division protein with a simple bitopic topology: a short N-terminal cytoplasmic segment fused to a large carboxy periplasmic domain through a single transmembrane domain. The repeats lie at the periplasmic C terminus, which has an RNP-like fold []. FtsN localises to the septum ring complex. The CwlM protein is a cell wall hydrolase, where the C-terminal region, including the repeats, determines substrate specificity []. RlpA is a rare lipoprotein A protein that may be important for cell division. Its N-terminal cysteine may be attached to thioglyceride and N-fatty acyl residues [].; PDB: 1X60_A 1UTA_A.
Probab=48.80 E-value=4.1 Score=23.62 Aligned_cols=60 Identities=13% Similarity=0.146 Sum_probs=32.2
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEE-EEEccCCCCCCcccEEE-EEEcCHHHHHHHHHHhC
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGD-IYIPRDRFTRESRGFAF-VRFFDKRDAEDALDAMD 76 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~-v~~~~~~~~~~~~g~af-V~f~~~~~a~~a~~~l~ 76 (158)
...|.|+.+...-..+.+..-+...|.-.. +.+... + ..-..+ -.|.+.++|+.++..|.
T Consensus 4 ~y~vQv~s~~~~~~A~~~~~~l~~~g~~~~~~~~~~~---~-~~yrV~~G~f~~~~~A~~~~~~l~ 65 (76)
T PF05036_consen 4 GYYVQVGSFSSEENAERLLAKLKKKGPDAYVVQVSKG---G-PWYRVRVGPFSSREEAEAALRKLK 65 (76)
T ss_dssp EEEEEEEEES-HHHHHHHHHHHHHHT-----EEEEEE---T-TCEEEEECCECTCCHHHHHHHHHH
T ss_pred cEEEEEEEcCCHHHHHHHHHHHHhcCCCcceEEEecC---C-ceEEEEECCCCCHHHHHHHHHHHh
Confidence 456788777655444555555555553322 222211 1 111233 36899999999998887
No 196
>PF08734 GYD: GYD domain; InterPro: IPR014845 These proteins of unknown function are usually less than 100 amino acids in length. They may belong to the dimeric alpha/beta barrel superfamily.
Probab=47.62 E-value=66 Score=19.88 Aligned_cols=45 Identities=20% Similarity=0.207 Sum_probs=30.3
Q ss_pred HHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 27 PDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 27 ~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
.+.+.++++.+| ++..+.+.... --....+++.+.+.|.++.-.+
T Consensus 22 ~~a~~~~~e~~Gg~l~~~y~t~G~----yD~v~i~eaPD~~~a~~~~l~i 67 (91)
T PF08734_consen 22 AEAVRALIEALGGKLKSFYWTLGE----YDFVVIVEAPDDETAAAASLAI 67 (91)
T ss_pred HHHHHHHHHHcCCEEEEEEEecCC----CCEEEEEEcCCHHHHHHHHHHH
Confidence 345777888876 78888776553 2246777888888777665443
No 197
>smart00195 DSPc Dual specificity phosphatase, catalytic domain.
Probab=47.37 E-value=78 Score=20.63 Aligned_cols=70 Identities=13% Similarity=0.183 Sum_probs=33.3
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEc--C--H----HHHHHHHHHhCCcccCCceEE
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFF--D--K----RDAEDALDAMDGRMLDGRELR 86 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~--~--~----~~a~~a~~~l~g~~i~g~~i~ 86 (158)
.||||++|...+.+.|++. .+..|..+.-... .....++-++.+. + . +....+++.++...-.|..|-
T Consensus 7 ~l~~G~~~~~~~~~~l~~~--gi~~Vi~l~~~~~--~~~~~~~~~~~ipi~D~~~~~~~~~~~~~~~~i~~~~~~~~~Vl 82 (138)
T smart00195 7 HLYLGSYSSALNLALLKKL--GITHVINVTNEVP--NLNKKGFTYLGVPILDNTETKISPYFPEAVEFIEDAEKKGGKVL 82 (138)
T ss_pred CeEECChhHcCCHHHHHHc--CCCEEEEccCCCC--CCCCCCCEEEEEECCCCCCCChHHHHHHHHHHHHHHhcCCCeEE
Confidence 4999999977654444432 2334443322111 1123444555443 2 1 223445555544444555655
Q ss_pred EE
Q psy7051 87 VQ 88 (158)
Q Consensus 87 v~ 88 (158)
|.
T Consensus 83 VH 84 (138)
T smart00195 83 VH 84 (138)
T ss_pred EE
Confidence 54
No 198
>COG0002 ArgC Acetylglutamate semialdehyde dehydrogenase [Amino acid transport and metabolism]
Probab=47.19 E-value=52 Score=25.94 Aligned_cols=51 Identities=14% Similarity=0.173 Sum_probs=29.5
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhh-cCceEEEEEccC---CCCC--CcccEEEEEEcCH
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRD---RFTR--ESRGFAFVRFFDK 65 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~-~G~i~~v~~~~~---~~~~--~~~g~afV~f~~~ 65 (158)
.|+++ .|...++.++|.++|.+ |..-..|.+... +.+. .-..||.|-|...
T Consensus 247 ~Ti~~-~l~~~~t~~~i~~~y~~~Y~~epfVrv~~~~~~P~~k~V~GsN~cdIgf~~d 303 (349)
T COG0002 247 ATIYL-KLKDLVTLEELHAAYEEFYAGEPFVRVVPEGGYPDTKAVAGSNFCDIGFAVD 303 (349)
T ss_pred EEEEE-ecCCCCCHHHHHHHHHHHhCCCCeEEEecCCCCCChhhhcCCcceEEEEEEc
Confidence 34554 45566899999999987 555555655442 1111 1234666666543
No 199
>PF11411 DNA_ligase_IV: DNA ligase IV; InterPro: IPR021536 DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface []. ; GO: 0003910 DNA ligase (ATP) activity; PDB: 3II6_Y 2E2W_A 1IK9_C.
Probab=46.51 E-value=16 Score=18.69 Aligned_cols=17 Identities=35% Similarity=0.556 Sum_probs=10.2
Q ss_pred CCCCHHHHHHHhhhcCc
Q psy7051 23 YRTTPDDLRRVFERCGE 39 (158)
Q Consensus 23 ~~~~~~~l~~~f~~~G~ 39 (158)
..+++++|++.|.+...
T Consensus 19 ~Dtd~~~Lk~vF~~i~~ 35 (36)
T PF11411_consen 19 VDTDEDQLKEVFNRIKK 35 (36)
T ss_dssp S---HHHHHHHHHCS--
T ss_pred ccCCHHHHHHHHHHhcc
Confidence 46789999999987643
No 200
>KOG1999|consensus
Probab=44.33 E-value=1.3e+02 Score=27.20 Aligned_cols=34 Identities=15% Similarity=0.344 Sum_probs=27.0
Q ss_pred cccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 54 SRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 54 ~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
.+||.|||-..+..+..||+.|-+.... +.+.|-
T Consensus 209 lkGyIYIEA~KqshV~~Ai~gv~niy~~-~~~lVP 242 (1024)
T KOG1999|consen 209 LKGYIYIEADKQSHVKEAIEGVRNIYAN-RILLVP 242 (1024)
T ss_pred cceeEEEEechhHHHHHHHhhhhhheec-cEEEEe
Confidence 5789999999999999999988776655 444443
No 201
>COG5470 Uncharacterized conserved protein [Function unknown]
Probab=43.85 E-value=50 Score=20.86 Aligned_cols=41 Identities=15% Similarity=0.217 Sum_probs=24.7
Q ss_pred HHHHhhhcCce-----EEEEEccCCCCCCcccEEEEEEcCHHHHHHHH
Q psy7051 30 LRRVFERCGEV-----GDIYIPRDRFTRESRGFAFVRFFDKRDAEDAL 72 (158)
Q Consensus 30 l~~~f~~~G~i-----~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~ 72 (158)
+...|++||.- -++..+.. ...+.....|+|.+.+.|..++
T Consensus 25 ~~~a~~~~Ggr~LvRGG~v~~lEG--~w~ptr~vviEFps~~~ar~~y 70 (96)
T COG5470 25 AKPAIEKFGGRYLVRGGEVETLEG--EWRPTRNVVIEFPSLEAARDCY 70 (96)
T ss_pred hHHHHHHhCCeeEeeCCCeeeccC--CCCcccEEEEEcCCHHHHHHHh
Confidence 45667788631 11222222 2334568999999998887765
No 202
>TIGR02223 ftsN cell division protein FtsN. FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by Pfam model pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis.
Probab=43.85 E-value=1.5e+02 Score=22.89 Aligned_cols=69 Identities=16% Similarity=0.228 Sum_probs=47.5
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEE--EEcCHHHHHHHHHHhCCcccCCceE
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV--RFFDKRDAEDALDAMDGRMLDGREL 85 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV--~f~~~~~a~~a~~~l~g~~i~g~~i 85 (158)
....|.||.+......+.|..-|...|--..|. .. .+. ..|-.+ -|.+.++|..++..|...-|.+..|
T Consensus 226 ~~~~lQvGAF~~~~~Ae~l~akL~~~G~~~~i~--~~--~g~-~~yRV~vGPf~sr~~A~~~~~~Lk~~Gi~~~iv 296 (298)
T TIGR02223 226 RAAALQCGAYANKEQAESVRAKLAFLGISSKIT--TT--DGG-KWYRVVSGPYKNKDDAEKDLNKLKVAGVAGCII 296 (298)
T ss_pred ccEEEEEeecCCHHHHHHHHHHHHhcCCceEEE--ec--CCc-eEEEEEeCCCCCHHHHHHHHHHHHHcCCCCeEe
Confidence 347788999988888888998888887333332 11 221 233222 3778899999999998877777665
No 203
>KOG4213|consensus
Probab=43.56 E-value=84 Score=22.46 Aligned_cols=50 Identities=22% Similarity=0.205 Sum_probs=32.6
Q ss_pred CCHHHHHHHhhhc-CceEEEEEccCCCC-CCcccEEEEEEcCHHHHHHHHHH
Q psy7051 25 TTPDDLRRVFERC-GEVGDIYIPRDRFT-RESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 25 ~~~~~l~~~f~~~-G~i~~v~~~~~~~~-~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
+++++|.++..=. |++..|.+-..... ...+|-.||+|.+.+.|.++++.
T Consensus 118 ~td~ql~~l~qw~~~k~~nv~mr~~~~k~~~fkGsvkv~f~tk~qa~a~~~~ 169 (205)
T KOG4213|consen 118 ITDDQLDDLNQWASGKGHNVKMRRHGNKAHPFKGSVKVTFQTKEQAFANDDT 169 (205)
T ss_pred CCHHHHHHHHHHhcccceEeeccccCCCCCCCCCceEEEeecHHHHHhhhhh
Confidence 4555555544322 68888876443211 24578899999999999987753
No 204
>PRK10905 cell division protein DamX; Validated
Probab=42.62 E-value=69 Score=24.96 Aligned_cols=61 Identities=11% Similarity=0.191 Sum_probs=37.5
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEE--EEcCHHHHHHHHHHhCCc
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV--RFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV--~f~~~~~a~~a~~~l~g~ 78 (158)
.++|.|+.+. +++.|.+|..+.| +....+.....+|+ .+|..+ .|.+.++|+.|+..|-..
T Consensus 247 ~YTLQL~A~S---s~~~l~~fakKlg-L~~y~vy~TtRnGk-pWYVV~yG~YaSraeAk~AiakLPa~ 309 (328)
T PRK10905 247 HYTLQLSSSS---NYDNLNGWAKKEN-LKNYVVYETTRNGQ-PWYVLVSGVYASKEEAKRAVSTLPAD 309 (328)
T ss_pred ceEEEEEecC---CHHHHHHHHHHcC-CCceEEEEeccCCc-eEEEEEecCCCCHHHHHHHHHHCCHH
Confidence 3667666555 5677888877775 33222322222332 245444 488999999999988644
No 205
>COG3254 Uncharacterized conserved protein [Function unknown]
Probab=42.32 E-value=92 Score=20.04 Aligned_cols=42 Identities=14% Similarity=0.190 Sum_probs=26.2
Q ss_pred HHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHH
Q psy7051 28 DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDAL 72 (158)
Q Consensus 28 ~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~ 72 (158)
.+|.+++...| |.+-.|..+..+ +.-||++++.+.+...+++
T Consensus 27 PE~~a~lk~ag-i~nYSIfLde~~--n~lFgy~E~~d~~a~m~~~ 68 (105)
T COG3254 27 PELLALLKEAG-IRNYSIFLDEEE--NLLFGYWEYEDFEADMAKM 68 (105)
T ss_pred HHHHHHHHHcC-CceeEEEecCCc--ccEEEEEEEcChHHHHHHH
Confidence 56788888886 444445444322 3569999999555444443
No 206
>PF08156 NOP5NT: NOP5NT (NUC127) domain; InterPro: IPR012974 This N-terminal domain is found in RNA-binding proteins of the NOP5 family [].
Probab=41.59 E-value=9.6 Score=22.31 Aligned_cols=39 Identities=18% Similarity=0.306 Sum_probs=25.3
Q ss_pred HHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 28 DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 28 ~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
++|++.|..+.....+ + +-.+|..|.+.++|..++..+.
T Consensus 27 ~~v~~~~~~~~~f~k~--v--------kL~aF~pF~s~~~ALe~~~ais 65 (67)
T PF08156_consen 27 EEVQKSFSDPEKFSKI--V--------KLKAFSPFKSAEEALENANAIS 65 (67)
T ss_pred HHHHHHHcCHHHHhhh--h--------hhhhccCCCCHHHHHHHHHHhh
Confidence 5777777665433222 1 1158999999998888776553
No 207
>COG5193 LHP1 La protein, small RNA-binding pol III transcript stabilizing protein and related La-motif-containing proteins involved in translation [Posttranslational modification, protein turnover, chaperones / Translation, ribosomal structure and biogenesis]
Probab=41.52 E-value=13 Score=29.66 Aligned_cols=47 Identities=28% Similarity=0.329 Sum_probs=37.1
Q ss_pred HHHHHHHhhh--cCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHH
Q psy7051 27 PDDLRRVFER--CGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALD 73 (158)
Q Consensus 27 ~~~l~~~f~~--~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~ 73 (158)
.+++..+|.. .+.+..+.+..+.......|-.|++|.....+++++.
T Consensus 196 ~~d~~~~~p~h~h~~~~~i~~rrd~~nkn~~gSv~~efk~~~~~q~~nn 244 (438)
T COG5193 196 QEDLEIQFPPHYHAPPSQIRNRRDWLNKNFRGSVFVEFKYFREAQRFNN 244 (438)
T ss_pred hhhHHhhCCCcccCChhhccchhhhhhccccCcccccccChHHHHHHhc
Confidence 4588999988 5677777776665455677889999999999998874
No 208
>PF13689 DUF4154: Domain of unknown function (DUF4154)
Probab=40.90 E-value=87 Score=20.98 Aligned_cols=60 Identities=22% Similarity=0.282 Sum_probs=36.1
Q ss_pred HHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 27 PDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 27 ~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
|.+|+..|- |.=+.++.|+... ....+-++.+.+.. ...++..|.+..+.++.|.|..-.
T Consensus 2 e~~lkAa~l-~nf~~f~~WP~~~---~~~~~~icv~g~~~-~~~~L~~l~~~~~~~~~i~v~~~~ 61 (145)
T PF13689_consen 2 EYQLKAAYL-YNFAKFIEWPDSA---PSSPFRICVLGDDP-FAEALSTLAGKQVGGRPIRVRRLS 61 (145)
T ss_pred HHHHHHHHH-HHhHhhccCCCCC---CCCCeEEEEECChH-HHHHHHHhhhcccCCCcEEEEECC
Confidence 445554431 1113445565542 12346677776655 455777888899999999987643
No 209
>KOG4246|consensus
Probab=40.45 E-value=17 Score=31.94 Aligned_cols=10 Identities=10% Similarity=-0.230 Sum_probs=5.2
Q ss_pred cEEEEeCCCC
Q psy7051 14 VSLKVDNLTY 23 (158)
Q Consensus 14 ~~l~V~nl~~ 23 (158)
..+|+|++..
T Consensus 146 qR~f~gvvtk 155 (1194)
T KOG4246|consen 146 QRRFAGVVTK 155 (1194)
T ss_pred ceeeehhhhh
Confidence 3456665543
No 210
>COG1098 VacB Predicted RNA binding protein (contains ribosomal protein S1 domain) [Translation, ribosomal structure and biogenesis]
Probab=40.19 E-value=72 Score=21.27 Aligned_cols=34 Identities=21% Similarity=0.275 Sum_probs=21.6
Q ss_pred cEEEEEEcC--------HHHHHHHHHHhCCcccCCceEEEEe
Q psy7051 56 GFAFVRFFD--------KRDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 56 g~afV~f~~--------~~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
--|||+|+. .+-|...++.+|...--|..|.|..
T Consensus 19 yGAFV~l~~g~tGLVHISEIa~~fVkdI~d~L~vG~eV~vKV 60 (129)
T COG1098 19 YGAFVELEGGKTGLVHISEIADGFVKDIHDHLKVGQEVKVKV 60 (129)
T ss_pred cceEEEecCCCcceEEehHhhhhhHHhHHHHhcCCCEEEEEE
Confidence 347888876 2455666666666655666676665
No 211
>PF00398 RrnaAD: Ribosomal RNA adenine dimethylase; InterPro: IPR001737 This family of proteins include rRNA adenine dimethylases (e.g. KsgA) and the Erythromycin resistance methylases (Erm). The bacterial enzyme KsgA catalyses the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases in 16S rRNA. This enzyme and the resulting modified adenosine bases appear to be conserved in all species of eubacteria, eukaryotes, and archaea, and in eukaryotic organelles. Bacterial resistance to the aminoglycoside antibiotic kasugamycin involves inactivation of KsgA and resulting loss of the dimethylations, with modest consequences to the overall fitness of the organism. In contrast, the yeast ortholog, Dim1, is essential. In Saccharomyces cerevisiae (Baker's yeast), and presumably in other eukaryotes, the enzyme performs a vital role in pre-rRNA processing in addition to its methylating activity. The best conserved region in these enzymes is located in the N-terminal section and corresponds to a region that is probably involved in S-adenosyl methionine (SAM) binding domain. The crystal structure of KsgA from Escherichia coli has been solved to a resolution of 2.1A. It bears a strong similarity to the crystal structure of ErmC' from Bacillus stearothermophilus and a lesser similarity to the yeast mitochondrial transcription factor, sc-mtTFB []. The Erm family of RNA methyltransferases, which methylate a single adenosine base in 23S rRNA confer resistance to the MLS-B group of antibiotics. Despite their sequence similarity, the two enzyme families have strikingly different levels of regulation that remain to be elucidated. Other orthologs, of this family include the yeast and Homo sapiens (Human) mitochondrial transcription factors (MTF1 and h-mtTFB respectively), which are nuclear encoded []. Human-mtTFB is able to stimulate transcription in vitro independently of its S-adenosylmethionine binding and rRNA methyltransferase activity [].; GO: 0000179 rRNA (adenine-N6,N6-)-dimethyltransferase activity, 0008649 rRNA methyltransferase activity, 0000154 rRNA modification; PDB: 3FTF_A 3R9X_B 3FTE_A 3FTC_A 3FTD_A 3GRY_A 3FYC_A 3GRU_A 3FYD_A 3GRV_A ....
Probab=39.88 E-value=37 Score=25.32 Aligned_cols=23 Identities=35% Similarity=0.472 Sum_probs=20.1
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhh
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFE 35 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~ 35 (158)
...++|+|||..++..-|..++.
T Consensus 97 ~~~~vv~NlPy~is~~il~~ll~ 119 (262)
T PF00398_consen 97 QPLLVVGNLPYNISSPILRKLLE 119 (262)
T ss_dssp SEEEEEEEETGTGHHHHHHHHHH
T ss_pred CceEEEEEecccchHHHHHHHhh
Confidence 35688999999999999998887
No 212
>PRK11633 cell division protein DedD; Provisional
Probab=39.63 E-value=1.4e+02 Score=22.01 Aligned_cols=71 Identities=10% Similarity=0.033 Sum_probs=47.6
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEE-EcCHHHHHHHHHHhCCc-ccCCceE
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR-FFDKRDAEDALDAMDGR-MLDGREL 85 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~-f~~~~~a~~a~~~l~g~-~i~g~~i 85 (158)
.+.|.||.|....+.+.|..-+..-|--.++.-... ..|. ..-.||- |.+.+.|+.++..|+.. -|.|..+
T Consensus 149 ~~vVQlgaf~n~~~A~~l~~kL~~~G~~Ay~~~~~~-~~G~-~tRV~VGP~~sk~~ae~~~~~Lk~~~Gl~g~Vv 221 (226)
T PRK11633 149 AYVVQLGALKNADKVNEIVAKLRLSGYRVYTVPSTP-VQGK-ITRIYVGPDASKDKLKGSLGELKQLSGLSGVVM 221 (226)
T ss_pred cEEEEecccCCHHHHHHHHHHHHHCCCeeEEEeeec-CCCc-EEEEEeCCCCCHHHHHHHHHHHHHhcCCCceEE
Confidence 478889988888888888888888886555543221 1221 1123343 67899999999998874 6666544
No 213
>PRK02886 hypothetical protein; Provisional
Probab=39.52 E-value=94 Score=19.30 Aligned_cols=40 Identities=18% Similarity=0.318 Sum_probs=27.6
Q ss_pred HHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc
Q psy7051 32 RVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML 80 (158)
Q Consensus 32 ~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i 80 (158)
..+.+||.|..+.--. .|+ |-|-+.++++..++.|....+
T Consensus 19 r~LrkyG~I~Y~Skr~--------kYv-vlYvn~~~~e~~~~kl~~l~f 58 (87)
T PRK02886 19 KQLRKFGNVHYVSKRL--------KYA-VLYCDMEQVEDIMNKLSSLPF 58 (87)
T ss_pred HHHhhcCcEEEEeccc--------cEE-EEEECHHHHHHHHHHHhcCCC
Confidence 4467999998874311 243 446788999999988876643
No 214
>COG0030 KsgA Dimethyladenosine transferase (rRNA methylation) [Translation, ribosomal structure and biogenesis]
Probab=39.19 E-value=50 Score=24.91 Aligned_cols=27 Identities=22% Similarity=0.151 Sum_probs=21.8
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCce
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEV 40 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i 40 (158)
....|+|||++++..-|..+++..-.+
T Consensus 96 ~~~vVaNlPY~Isspii~kll~~~~~~ 122 (259)
T COG0030 96 PYKVVANLPYNISSPILFKLLEEKFII 122 (259)
T ss_pred CCEEEEcCCCcccHHHHHHHHhccCcc
Confidence 356799999999999999888765444
No 215
>PF09902 DUF2129: Uncharacterized protein conserved in bacteria (DUF2129); InterPro: IPR016979 This is a group of uncharacterised conserved proteins.
Probab=38.64 E-value=86 Score=18.63 Aligned_cols=40 Identities=13% Similarity=0.358 Sum_probs=27.5
Q ss_pred HHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCccc
Q psy7051 32 RVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRML 80 (158)
Q Consensus 32 ~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i 80 (158)
..+.+||.|..+.=-. .|+ |-|-+.++++..++.|....+
T Consensus 15 r~L~kfG~i~Y~Skk~--------kYv-vlYvn~~~~e~~~~kl~~l~f 54 (71)
T PF09902_consen 15 RQLRKFGDIHYVSKKM--------KYV-VLYVNEEDVEEIIEKLKKLKF 54 (71)
T ss_pred HhHhhcccEEEEECCc--------cEE-EEEECHHHHHHHHHHHhcCCC
Confidence 3578999998874311 243 446788999999888876543
No 216
>PF14581 SseB_C: SseB protein C-terminal domain
Probab=38.41 E-value=39 Score=21.39 Aligned_cols=52 Identities=13% Similarity=0.048 Sum_probs=28.3
Q ss_pred CcEEEEeCCCCCCC--HHHHHHHhhhcCceEEEEEccCCC-CCCcccEEEEEEcC
Q psy7051 13 MVSLKVDNLTYRTT--PDDLRRVFERCGEVGDIYIPRDRF-TRESRGFAFVRFFD 64 (158)
Q Consensus 13 ~~~l~V~nl~~~~~--~~~l~~~f~~~G~i~~v~~~~~~~-~~~~~g~afV~f~~ 64 (158)
...|.++-.....+ .+.|.++|++.+.|..+.+..-.. .+...-+-.|+|..
T Consensus 5 g~~v~l~~P~~~p~~l~~aL~~~~~~~~~V~~Ayl~~~~~~~~~~~~li~vd~~~ 59 (108)
T PF14581_consen 5 GEKVLLGEPEEEPTDLLAALSEYFKQHKNVRAAYLALMQDEDEQPSLLIGVDFDG 59 (108)
T ss_pred CCEEEecCCccCHHHHHHHHHHHHhhCccHHHhHHHHhhccCCCceEEEEEeccC
Confidence 34566653322222 356788888888887665443332 33333344456665
No 217
>PHA01782 hypothetical protein
Probab=38.38 E-value=40 Score=23.49 Aligned_cols=40 Identities=15% Similarity=0.322 Sum_probs=26.3
Q ss_pred CCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEE
Q psy7051 20 NLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRF 62 (158)
Q Consensus 20 nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f 62 (158)
.+|......-|.+.|..||+|. +..++.+.+...|.|-.|
T Consensus 62 aMPKGsRrnAL~~wlv~~Gkv~---vntDkk~aKefpf~~nK~ 101 (177)
T PHA01782 62 AMPKGSRRNALAEWLVKFGKVQ---VNTDKKSAKEFPFVYNKF 101 (177)
T ss_pred HccccchhhHHHHHHHHhCCcc---ccccccccccCceeeccc
Confidence 5688888889999999999874 344443333333444444
No 218
>PRK11558 putative ssRNA endonuclease; Provisional
Probab=38.17 E-value=69 Score=20.34 Aligned_cols=49 Identities=20% Similarity=0.255 Sum_probs=27.7
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhc--CceEEEEEccCCCCCCcccEEEEEEcC
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERC--GEVGDIYIPRDRFTRESRGFAFVRFFD 64 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~--G~i~~v~~~~~~~~~~~~g~afV~f~~ 64 (158)
|...-||||+++..+-+.--+.+-+.+ |.+.-+. .+ + ...||+|-++..
T Consensus 25 Ev~~GVyVg~~S~rVRd~lW~~v~~~~~~G~avmv~--~~--~-~eqG~~~~t~G~ 75 (97)
T PRK11558 25 EVRAGVYVGDVSRRIREMIWQQVTQLAEEGNVVMAW--AT--N-TESGFEFQTFGE 75 (97)
T ss_pred ecCCCcEEcCCCHHHHHHHHHHHHHhCCCCcEEEEE--cC--C-CCCCcEEEecCC
Confidence 344569999988877655433433434 3333222 22 1 234899988764
No 219
>KOG2888|consensus
Probab=38.13 E-value=13 Score=29.06 Aligned_cols=8 Identities=38% Similarity=0.501 Sum_probs=3.1
Q ss_pred HHHHHhhh
Q psy7051 29 DLRRVFER 36 (158)
Q Consensus 29 ~l~~~f~~ 36 (158)
+|..-|++
T Consensus 173 dLw~WyEp 180 (453)
T KOG2888|consen 173 DLWDWYEP 180 (453)
T ss_pred HHHHHhhh
Confidence 34433333
No 220
>PRK02302 hypothetical protein; Provisional
Probab=37.70 E-value=1e+02 Score=19.22 Aligned_cols=39 Identities=18% Similarity=0.307 Sum_probs=27.2
Q ss_pred HHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcc
Q psy7051 32 RVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRM 79 (158)
Q Consensus 32 ~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~ 79 (158)
..+.+||.|..+.-. ..|+ |-|-+.++++..++.|....
T Consensus 21 r~LrkfG~I~Y~Skk--------~kYv-vlYvn~~~~e~~~~kl~~l~ 59 (89)
T PRK02302 21 RKLSKYGDIVYHSKR--------SRYL-VLYVNKEDVEQKLEELSKLK 59 (89)
T ss_pred HHHhhcCcEEEEecc--------ccEE-EEEECHHHHHHHHHHHhcCC
Confidence 346799999887431 1243 44678899999998887654
No 221
>PF06014 DUF910: Bacterial protein of unknown function (DUF910); InterPro: IPR009256 This family consists of several short bacterial proteins of unknown function.; PDB: 2NN4_A.
Probab=37.66 E-value=22 Score=20.57 Aligned_cols=18 Identities=17% Similarity=0.309 Sum_probs=10.7
Q ss_pred CHHHHHHHhhhcCceEEE
Q psy7051 26 TPDDLRRVFERCGEVGDI 43 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v 43 (158)
|--|+.+++.+||-+.++
T Consensus 3 tlyDVqQLLK~fG~~IY~ 20 (62)
T PF06014_consen 3 TLYDVQQLLKKFGIIIYV 20 (62)
T ss_dssp SHHHHHHHHHTTS-----
T ss_pred cHHHHHHHHHHCCEEEEe
Confidence 345889999999987654
No 222
>cd00027 BRCT Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Probab=37.13 E-value=63 Score=17.48 Aligned_cols=26 Identities=15% Similarity=0.281 Sum_probs=21.4
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCce
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEV 40 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i 40 (158)
.++|.+.....+.++|.+++..+|..
T Consensus 3 ~~~i~g~~~~~~~~~l~~~i~~~Gg~ 28 (72)
T cd00027 3 TFVITGDLPSEERDELKELIEKLGGK 28 (72)
T ss_pred EEEEEecCCCcCHHHHHHHHHHcCCE
Confidence 57777777678899999999999863
No 223
>KOG0862|consensus
Probab=37.11 E-value=24 Score=25.72 Aligned_cols=12 Identities=42% Similarity=0.625 Sum_probs=9.4
Q ss_pred cccEEEEEEcCH
Q psy7051 54 SRGFAFVRFFDK 65 (158)
Q Consensus 54 ~~g~afV~f~~~ 65 (158)
.+-|+||+|++.
T Consensus 108 ~RPY~FieFD~~ 119 (216)
T KOG0862|consen 108 SRPYAFIEFDTF 119 (216)
T ss_pred CCCeeEEehhHH
Confidence 456999999863
No 224
>PRK11230 glycolate oxidase subunit GlcD; Provisional
Probab=37.10 E-value=1.7e+02 Score=24.22 Aligned_cols=48 Identities=17% Similarity=0.073 Sum_probs=32.5
Q ss_pred HHHHHHhh----hcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 28 DDLRRVFE----RCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 28 ~~l~~~f~----~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
-+|..+|. .+|-|+.+.+...+.. ......++.|.+.++|..++..+-
T Consensus 204 ~dl~~l~~Gs~GtlGIIt~atlkl~p~p-~~~~~~~~~f~~~~~a~~~~~~~~ 255 (499)
T PRK11230 204 FDLLALFTGSEGMLGVVTEVTVKLLPKP-PVARVLLASFDSVEKAGLAVGDII 255 (499)
T ss_pred cchHhhhccCCCccEEEEEEEEEEEcCC-cceEEEEEECCCHHHHHHHHHHHH
Confidence 45666664 5677888776554422 234567888999999988887654
No 225
>PF12687 DUF3801: Protein of unknown function (DUF3801); InterPro: IPR024234 This functionally uncharacterised protein family is found in bacteria. Proteins found in this family are typically between 158 and 187 amino acids in length and include the PcfB protein.
Probab=36.18 E-value=1.1e+02 Score=22.09 Aligned_cols=57 Identities=19% Similarity=0.276 Sum_probs=33.6
Q ss_pred CCHHHHH---HHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCc
Q psy7051 25 TTPDDLR---RVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGR 83 (158)
Q Consensus 25 ~~~~~l~---~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~ 83 (158)
+++.+|. ..+..|| |.++ ++.|..++...-..|+.=.+.+.+..|++.+....+...
T Consensus 39 i~~~~lk~F~k~AkKyG-V~ya-v~kdk~~~~~~~~V~FkA~Da~~i~~af~~~~~~~~~~~ 98 (204)
T PF12687_consen 39 ITDEDLKEFKKEAKKYG-VDYA-VKKDKSTGPGKYDVFFKAKDADVINRAFKEFSAKKLKKE 98 (204)
T ss_pred cCHhhHHHHHHHHHHcC-CceE-EeeccCCCCCcEEEEEEcCcHHHHHHHHHHHHHHhhhhh
Confidence 4445554 4466786 4444 445544443333455555677888889988877655433
No 226
>PF04026 SpoVG: SpoVG; InterPro: IPR007170 This is a stage V sporulation protein G. It is essential for sporulation and specific to stage V sporulation in Bacillus megaterium and Bacillus subtilis []. In B. subtilis, expression decreases after 30-60 minutes of cold shock [].; GO: 0030435 sporulation resulting in formation of a cellular spore; PDB: 2IA9_F 2I9X_B 2I9Z_A.
Probab=36.08 E-value=77 Score=19.46 Aligned_cols=26 Identities=27% Similarity=0.335 Sum_probs=18.5
Q ss_pred ceEEEEEccCCCCCCcccEEEEEEcC
Q psy7051 39 EVGDIYIPRDRFTRESRGFAFVRFFD 64 (158)
Q Consensus 39 ~i~~v~~~~~~~~~~~~g~afV~f~~ 64 (158)
+|+.|.+..-...+..+++|-|+|.+
T Consensus 2 ~itdVri~~~~~~~~lka~asV~~dd 27 (84)
T PF04026_consen 2 KITDVRIRKIEPEGKLKAFASVTFDD 27 (84)
T ss_dssp -EEEEEEEETTSSSSEEEEEEEEETT
T ss_pred ccEEEEEEEecCCCCEEEEEEEEECC
Confidence 36677665554458888999999976
No 227
>KOG0113|consensus
Probab=36.08 E-value=53 Score=25.36 Aligned_cols=14 Identities=7% Similarity=0.361 Sum_probs=9.6
Q ss_pred CCHHHHHHHhhhcC
Q psy7051 25 TTPDDLRRVFERCG 38 (158)
Q Consensus 25 ~~~~~l~~~f~~~G 38 (158)
..+.+++..|...+
T Consensus 151 e~erdm~~AYK~ad 164 (335)
T KOG0113|consen 151 EHERDMKAAYKDAD 164 (335)
T ss_pred ccHHHHHHHHHhcc
Confidence 36677777777665
No 228
>PF05189 RTC_insert: RNA 3'-terminal phosphate cyclase (RTC), insert domain; InterPro: IPR013796 RNA cyclases are a family of RNA-modifying enzymes that are conserved in eukaryotes, bacteria and archaea. RNA 3'-terminal phosphate cyclase (6.5.1.4 from EC) [, ] catalyses the conversion of 3'-phosphate to a 2',3'-cyclic phosphodiester at the end of RNA. ATP + RNA 3'-terminal-phosphate = AMP + diphosphate + RNA terminal-2',3'-cyclic-phosphate These enzymes might be responsible for production of the cyclic phosphate RNA ends that are known to be required by many RNA ligases in both prokaryotes and eukaryotes. RNA cyclase is a protein of from 36 to 42 kDa. The best conserved region is a glycine-rich stretch of residues located in the central part of the sequence and which is reminiscent of various ATP, GTP or AMP glycine-rich loops. The crystal structure of RNA 3'-terminal phosphate cyclase shows that each molecule consists of two domains. The larger domain contains three repeats of a folding unit comprising two parallel alpha helices and a four-stranded beta sheet; this fold was previously identified in translation initiation factor 3 (IF3). The large domain is similar to one of the two domains of 5-enolpyruvylshikimate-3-phosphate synthase and UDP-N-acetylglucosamine enolpyruvyl transferase. The smaller domain uses a similar secondary structure element with different topology, observed in many other proteins such as thioredoxin []. Although the active site of this enzyme could not be unambiguously assigned, it can be mapped to a region surrounding His309, an adenylate acceptor, in which a number of amino acids are highly conserved in the enzyme from different sources []. This entry contains the insert-domain of approximately 100 amino acids.; PDB: 3TV1_B 3KGD_D 1QMI_D 1QMH_B 3TUX_A 3TUT_A 3TW3_A 3PQV_A.
Probab=36.07 E-value=78 Score=19.86 Aligned_cols=48 Identities=10% Similarity=-0.040 Sum_probs=26.2
Q ss_pred EEEEeCCCCCCCHHHHH---HHhhhcCceEEEEE--ccCCCCCCcccEEEEEE
Q psy7051 15 SLKVDNLTYRTTPDDLR---RVFERCGEVGDIYI--PRDRFTRESRGFAFVRF 62 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~---~~f~~~G~i~~v~~--~~~~~~~~~~g~afV~f 62 (158)
..|+.+||..+.+.++. ..|..+..-..|.. ..........|++.+.+
T Consensus 12 ~a~~a~lp~~va~R~~~~a~~~L~~~~~~v~i~~d~~~~~~~~~~~G~gi~l~ 64 (103)
T PF05189_consen 12 IAFVAGLPSSVAERMANAARKRLNWYGPDVEIETDYRESDDSAFGPGSGISLV 64 (103)
T ss_dssp EEEEESS-CHHHHHHHHHHHHHHCTTCSEEEEEEEEE-CCCCGCSSEEEEEEE
T ss_pred EEEEcCCCHHHHHHHHHHHHHHhhhhCCCeEEEEecccCccCCCCCceEEEEE
Confidence 46899999998877655 44555553334433 12233444566655544
No 229
>KOG3671|consensus
Probab=36.03 E-value=74 Score=26.43 Aligned_cols=51 Identities=14% Similarity=0.006 Sum_probs=34.3
Q ss_pred CCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 23 YRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 23 ~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
.-+.+++|..-|+-+-.-.++.-... -.+++=+.|.++++|+..++.+...
T Consensus 88 rliWdqELY~nf~y~q~r~ffhtFeg-----ddc~aGLnF~~E~EA~~F~k~V~~r 138 (569)
T KOG3671|consen 88 RLIWDQELYQNFEYRQPRTFFHTFEG-----DDCQAGLNFASEEEAQKFRKKVQDR 138 (569)
T ss_pred eeeehHHhhhhceeccCccceeeecc-----ccceeeecccCHHHHHHHHHHHHHH
Confidence 35677888888876654444432221 1346778899999999988876654
No 230
>COG4010 Uncharacterized protein conserved in archaea [Function unknown]
Probab=35.94 E-value=1.5e+02 Score=20.45 Aligned_cols=46 Identities=17% Similarity=0.214 Sum_probs=36.2
Q ss_pred CCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHh
Q psy7051 20 NLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 20 nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 75 (158)
.|+..+.++-|+++.+-.|-|.++. -.| -.+.|.+.+.+..|++.+
T Consensus 118 ~L~epl~~eRlqDi~E~hgvIiE~~-E~D---------~V~i~Gd~drVk~aLke~ 163 (170)
T COG4010 118 HLREPLAEERLQDIAETHGVIIEFE-EYD---------LVAIYGDSDRVKKALKEI 163 (170)
T ss_pred ecCchhHHHHHHHHHHhhheeEEee-ecc---------EEEEeccHHHHHHHHHHH
Confidence 4677888899999999999887765 222 456788999999999865
No 231
>TIGR00405 L26e_arch ribosomal protein L24p/L26e, archaeal. This protein contains a KOW domain, shared by bacterial NusG and the L24p/L26e family of ribosomal proteins. Although called archaeal NusG in several publications, it is the only close homolog of eukaryotic L26e in archaeal genomes, shares an operon with L11 in many genomes, and has been sequenced from purified ribosomes. It is here designated as a ribosomal protein for these reasons.
Probab=35.88 E-value=1.3e+02 Score=20.05 Aligned_cols=25 Identities=16% Similarity=0.212 Sum_probs=20.3
Q ss_pred cccEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 54 SRGFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 54 ~~g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
.+||-||++....+...++..+.|.
T Consensus 37 fpGYvFV~~~~~~~~~~~i~~~~gv 61 (145)
T TIGR00405 37 LKGYILVEAETKIDMRNPIIGVPHV 61 (145)
T ss_pred CCcEEEEEEECcHHHHHHHhCCCCE
Confidence 6899999999877778888777664
No 232
>PF14893 PNMA: PNMA
Probab=35.63 E-value=34 Score=26.80 Aligned_cols=26 Identities=8% Similarity=0.116 Sum_probs=21.6
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhh
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFER 36 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~ 36 (158)
+....|.|.++|..+++++|++.+..
T Consensus 16 ~~~r~lLv~giP~dc~~~ei~e~l~~ 41 (331)
T PF14893_consen 16 DPQRALLVLGIPEDCEEAEIEEALQA 41 (331)
T ss_pred ChhhhheeecCCCCCCHHHHHHHHHH
Confidence 34467899999999999999988754
No 233
>PF05573 NosL: NosL; InterPro: IPR008719 NosL is one of the accessory proteins of the nos (nitrous oxide reductase) gene cluster. NosL is a monomeric protein of 18,540 MW that specifically and stoichiometrically binds Cu(I). The copper ion in NosL is ligated by a Cys residue, and one Met and one His are thought to serve as the other ligands. It is possible that NosL is a copper chaperone involved in metallocentre assembly []. This entry also contains HTH-type transcriptional repressors, including YcnK. YcnK may act as a negative transcriptional regulator of YcnJ in the presence of copper and may use copper as a corepressor. The gene, ycnK, is significantly induced under copper-limiting conditions.; PDB: 2HQ3_A 2HPU_A.
Probab=35.32 E-value=13 Score=25.35 Aligned_cols=22 Identities=27% Similarity=0.314 Sum_probs=16.2
Q ss_pred cccEEEEEEcCHHHHHHHHHHh
Q psy7051 54 SRGFAFVRFFDKRDAEDALDAM 75 (158)
Q Consensus 54 ~~g~afV~f~~~~~a~~a~~~l 75 (158)
+-|..+|-|.+.++|++..+..
T Consensus 113 pMg~~~~aF~~~~~A~~F~~~~ 134 (149)
T PF05573_consen 113 PMGPDLIAFASKEDAEAFAKEH 134 (149)
T ss_dssp TTS--EEEES-HHHHHHHHHHT
T ss_pred CCCCcccccCCHHHHHHHHHHc
Confidence 4578899999999999998764
No 234
>COG5236 Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only]
Probab=34.86 E-value=1.2e+02 Score=24.02 Aligned_cols=52 Identities=13% Similarity=-0.001 Sum_probs=37.8
Q ss_pred CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceE
Q psy7051 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGREL 85 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i 85 (158)
+-++|++.|..--.++.+....+ .-||+|...-+.++-|...++..+.+..+
T Consensus 263 ~Y~~Le~HF~~~hy~ct~qtc~~--------~k~~vf~~~~el~~h~~~~h~~~~~~~~~ 314 (493)
T COG5236 263 SYEDLEAHFRNAHYCCTFQTCRV--------GKCYVFPYHTELLEHLTRFHKVNARLSEI 314 (493)
T ss_pred CHHHHHHHhhcCceEEEEEEEec--------CcEEEeccHHHHHHHHHHHhhcccccCcC
Confidence 55778888887666666655433 35889999999888888888877666543
No 235
>PRK12450 foldase protein PrsA; Reviewed
Probab=34.81 E-value=1.1e+02 Score=23.58 Aligned_cols=39 Identities=21% Similarity=0.394 Sum_probs=29.5
Q ss_pred CCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 24 RTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 24 ~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
.+|+++|+.++..|.+ .+. ...|.+.+.+.|+.+++.|.
T Consensus 132 ~Vtd~evk~~y~~~~~--~~~------------~~~I~~~~~~~A~~i~~~l~ 170 (309)
T PRK12450 132 TISKKDYRQAYDAYTP--TMT------------AEIMQFEKEEDAKAALEAVK 170 (309)
T ss_pred CCCHHHHHHHHHHhCc--cce------------eEEEEeCCHHHHHHHHHHHH
Confidence 4799999999998732 111 23577889999999999885
No 236
>PF08544 GHMP_kinases_C: GHMP kinases C terminal ; InterPro: IPR013750 This domain is found in homoserine kinases (2.7.1.39 from EC), galactokinases (2.7.1.6 from EC) and mevalonate kinases (2.7.1.36 from EC). These kinases make up the GHMP kinase superfamily of ATP-dependent enzymes []. These enzymes are involved in the biosynthesis of isoprenes and amino acids as well as in carbohydrate metabolism. The C-terminal domain of homoserine kinase has a central alpha-beta plait fold and an insertion of four helices, which, together with the N-terminal fold, create a novel nucleotide binding fold [].; PDB: 2R3V_C 4EMD_A 4DXL_A 4ED4_A 2GS8_A 1K47_E 3GON_A 3K17_B 1PIE_A 2AJ4_A ....
Probab=34.60 E-value=98 Score=18.07 Aligned_cols=44 Identities=23% Similarity=0.278 Sum_probs=27.6
Q ss_pred HHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 28 DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 28 ~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
.+|.+.+..+| +..+.+. . .| .-++.|+-+.+.+.++.+++.|.
T Consensus 37 ~~~~~~~~~~G-a~~~~~s-G--sG-~G~~v~~l~~~~~~~~~v~~~l~ 80 (85)
T PF08544_consen 37 DELKEAAEENG-ALGAKMS-G--SG-GGPTVFALCKDEDDAERVAEALR 80 (85)
T ss_dssp HHHHHHHHHTT-ESEEEEE-T--TS-SSSEEEEEESSHHHHHHHHHHHH
T ss_pred HHHHHHHHHCC-CCceecC-C--CC-CCCeEEEEECCHHHHHHHHHHHH
Confidence 45677778888 4344442 1 10 12477888888888888877663
No 237
>KOG2855|consensus
Probab=33.82 E-value=79 Score=24.79 Aligned_cols=47 Identities=13% Similarity=0.185 Sum_probs=33.0
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEE
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRF 62 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f 62 (158)
+..+.|||+|-.+.--++|.+.+.++| .+..|....+. ..++++|..
T Consensus 60 G~~~afiGkvGdD~fG~~l~~~L~~~~V~~~~v~~~~~~----~T~~a~i~v 107 (330)
T KOG2855|consen 60 GGRVAFIGKVGDDEFGDDLLDILKQNGVDTSGVKFDENA----RTACATITV 107 (330)
T ss_pred CcceeeeecccchhhHHHHHHHHhhCCcccccceecCCC----ceEEEEEEE
Confidence 457899999999888899999999986 44455544332 345555543
No 238
>cd04904 ACT_AAAH ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH). ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH): Phenylalanine hydroxylases (PAH), tyrosine hydroxylases (TH) and tryptophan hydroxylases (TPH), both peripheral (TPH1) and neuronal (TPH2) enzymes. This family of enzymes shares a common catalytic mechanism, in which dioxygen is used by an active site containing a single, reduced iron atom to hydroxylate an unactivated aromatic substrate, concomitant with a two-electron oxidation of tetrahydropterin (BH4) cofactor to its quinonoid dihydropterin form. PAH catalyzes the hydroxylation of L-Phe to L-Tyr, the first step in the catabolic degradation of L-Phe; TH catalyses the hydroxylation of L-Tyr to 3,4-dihydroxyphenylalanine, the rate limiting step in the biosynthesis of catecholamines; and TPH catalyses the hydroxylation of L-Trp to 5-hydroxytryptophan, the rate limiting step in the biosynthesis of 5-hydroxy
Probab=33.53 E-value=1e+02 Score=17.96 Aligned_cols=51 Identities=20% Similarity=0.236 Sum_probs=29.0
Q ss_pred CHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEc-CHHHHHHHHHHhCC
Q psy7051 26 TPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFF-DKRDAEDALDAMDG 77 (158)
Q Consensus 26 ~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~-~~~~a~~a~~~l~g 77 (158)
.-.++...|..+| +++.+.-...+ ...-.-+-||+++ ..+..+.|++.|..
T Consensus 13 ~L~~vL~~f~~~~iNlt~IeSRP~~-~~~~~y~Ffvd~~~~~~~~~~~l~~L~~ 65 (74)
T cd04904 13 ALARALKLFEEFGVNLTHIESRPSR-RNGSEYEFFVDCEVDRGDLDQLISSLRR 65 (74)
T ss_pred HHHHHHHHHHHCCCcEEEEECCCCC-CCCceEEEEEEEEcChHHHHHHHHHHHH
Confidence 3456777788886 66666433222 1111234567777 44566777777654
No 239
>cd04878 ACT_AHAS N-terminal ACT domain of the Escherichia coli IlvH-like regulatory subunit of acetohydroxyacid synthase (AHAS). ACT_AHAS: N-terminal ACT domain of the Escherichia coli IlvH-like regulatory subunit of acetohydroxyacid synthase (AHAS). AHAS catalyses the first common step in the biosynthesis of the three branched-chain amino acids. The first step involves the condensation of either pyruvate or 2-ketobutyrate with the two-carbon hydroxyethyl fragment derived from another pyruvate molecule, covalently bound to the coenzyme thiamine diphosphate. Bacterial AHASs generally consist of regulatory and catalytic subunits. The effector (valine) binding sites are proposed to be located in two symmetrically related positions in the interface between a pair of N-terminal ACT domains with the C-terminal domain of IlvH contacting the catalytic dimer. Plants Arabidopsis and Oryza have tandem IlvH subunits; both the first and second ACT domain sequences are present in this CD. Members of
Probab=33.28 E-value=88 Score=17.16 Aligned_cols=60 Identities=10% Similarity=0.067 Sum_probs=29.6
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCH-HHHHHHHHHhC
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDK-RDAEDALDAMD 76 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~-~~a~~a~~~l~ 76 (158)
+|.|..-...-.-.+|..+|..++ .|..+........+ ....++++... +....+++.|.
T Consensus 2 ~l~i~~~d~~g~l~~i~~~l~~~~~~i~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~l~~~l~ 63 (72)
T cd04878 2 TLSVLVENEPGVLNRISGLFARRGFNIESLTVGPTEDPG--ISRITIVVEGDDDVIEQIVKQLN 63 (72)
T ss_pred EEEEEEcCCCcHHHHHHHHHHhCCCCEEEEEeeecCCCC--eEEEEEEEECCHHHHHHHHHHHh
Confidence 344433333334567888888886 67767653321111 12233333322 55555665554
No 240
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=33.15 E-value=68 Score=22.53 Aligned_cols=37 Identities=11% Similarity=0.012 Sum_probs=25.8
Q ss_pred CCCcEEEEeCCCC--CC-CHHHHHHHhhhcCceEEEEEcc
Q psy7051 11 EGMVSLKVDNLTY--RT-TPDDLRRVFERCGEVGDIYIPR 47 (158)
Q Consensus 11 ~~~~~l~V~nl~~--~~-~~~~l~~~f~~~G~i~~v~~~~ 47 (158)
.....+||-+.+. .. ..+.|.+....||.|..+.+..
T Consensus 19 ~~~~~~FvvG~~~~~~~~~~~~l~~E~~~y~Dil~~d~~D 58 (195)
T PF01762_consen 19 VRVKVVFVVGESPNSDSDLQEALQEEAEKYGDILQGDFVD 58 (195)
T ss_pred CcEEEEEEEecCCCCcHHHHHHhhhhhhhcCceEeeeccc
Confidence 3446788888877 22 2344778888999998877643
No 241
>TIGR01033 DNA-binding regulatory protein, YebC/PmpR family. This model describes a minimally characterized protein family, restricted to bacteria excepting for some eukaryotic sequences that have possible transit peptides. YebC from E. coli is crystallized, and PA0964 from Pseudomonas aeruginosa has been shown to be a sequence-specific DNA-binding regulatory protein.
Probab=32.27 E-value=2.1e+02 Score=21.25 Aligned_cols=45 Identities=24% Similarity=0.371 Sum_probs=29.0
Q ss_pred CCcEEEEeCCCCCC--CHHHHHHHhhhcCc-e---EEEEEccCCCCCCcccEEEEEEc
Q psy7051 12 GMVSLKVDNLTYRT--TPDDLRRVFERCGE-V---GDIYIPRDRFTRESRGFAFVRFF 63 (158)
Q Consensus 12 ~~~~l~V~nl~~~~--~~~~l~~~f~~~G~-i---~~v~~~~~~~~~~~~g~afV~f~ 63 (158)
+...|+|--|..+. +..+|+.+|.++|. + -.|.+..+. .++|+|.
T Consensus 93 ~GvaiiVe~lTDN~nRt~~~ir~~f~K~gg~l~~~gsv~~~Fe~-------kG~i~~~ 143 (238)
T TIGR01033 93 GGVAIIVECLTDNKNRTASEVRSAFNKNGGSLGEPGSVSYLFSR-------KGVIEVP 143 (238)
T ss_pred CceEEEEEEecCCHHhHHHHHHHHHHHcCCeeCCCCceeeeeec-------ceEEEEC
Confidence 34567777776665 67899999999874 2 124444442 4677774
No 242
>PRK13259 regulatory protein SpoVG; Reviewed
Probab=32.08 E-value=89 Score=19.71 Aligned_cols=26 Identities=15% Similarity=0.241 Sum_probs=18.0
Q ss_pred ceEEEEEccCCCCCCcccEEEEEEcC
Q psy7051 39 EVGDIYIPRDRFTRESRGFAFVRFFD 64 (158)
Q Consensus 39 ~i~~v~~~~~~~~~~~~g~afV~f~~ 64 (158)
+|+.|.+..-...|+.+++|-|+|.+
T Consensus 2 ~ITdVri~~~~~~g~lka~asit~dd 27 (94)
T PRK13259 2 EVTDVRLRKVNTEGRMKAIVSITFDN 27 (94)
T ss_pred eEEEEEEEEeCCCCcEEEEEEEEECC
Confidence 36666664444457788899999876
No 243
>PF15407 Spo7_2_N: Sporulation protein family 7
Probab=31.81 E-value=5.9 Score=23.30 Aligned_cols=25 Identities=4% Similarity=-0.078 Sum_probs=17.0
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhh
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFE 35 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~ 35 (158)
.-..+||||++|..+-++.=..++.
T Consensus 25 ~tSr~vflG~IP~~W~~~~~~~~~k 49 (67)
T PF15407_consen 25 LTSRRVFLGPIPEIWLQDHRKSWYK 49 (67)
T ss_pred HcCceEEECCCChHHHHcCcchHHH
Confidence 3457899999998876655444443
No 244
>PF00585 Thr_dehydrat_C: C-terminal regulatory domain of Threonine dehydratase; InterPro: IPR001721 Threonine dehydratases including Serine/threonine dehydratase (see IPR001926 from INTERPRO) contain a common C-terminal region that may have a regulatory role. Some members contain two copies of this region [].; GO: 0004794 L-threonine ammonia-lyase activity, 0009097 isoleucine biosynthetic process; PDB: 1TDJ_A 3IAU_A.
Probab=31.81 E-value=1.3e+02 Score=18.56 Aligned_cols=59 Identities=17% Similarity=0.328 Sum_probs=32.9
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCc---eEEEEEccCCCCCCcccEEEEEEc--CHHHHHHHHHHhCCc
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGE---VGDIYIPRDRFTRESRGFAFVRFF--DKRDAEDALDAMDGR 78 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~---i~~v~~~~~~~~~~~~g~afV~f~--~~~~a~~a~~~l~g~ 78 (158)
.+|.-.+|.. ...|+.|+...++ |+.+... .++...+.++|-|. +.++.+..++.|+..
T Consensus 11 ~~~~v~~PE~--pGal~~F~~~l~~~~nITeF~YR---~~~~~~a~vlvgi~v~~~~~~~~l~~~L~~~ 74 (91)
T PF00585_consen 11 ALFAVEFPER--PGALKRFLDALGPRNNITEFHYR---YSGDDFARVLVGIEVPDAEDLEELIERLKAL 74 (91)
T ss_dssp EEEEEE--BS--TTHCHHHHHCCSSSE-EEEEEEE----TTTSCSEEEEEEE-SSTHHHHHHHHHHTSS
T ss_pred EEEEEECCCC--ccHHHHHHHHhCCCceEEEEEEc---CCCCCeeeEEEEEEeCCHHHHHHHHHHHHHc
Confidence 3444456653 3346667766663 5555543 24445677887776 445556677777654
No 245
>PF13291 ACT_4: ACT domain; PDB: 2KO1_B 3IBW_A.
Probab=31.29 E-value=1.1e+02 Score=17.78 Aligned_cols=65 Identities=14% Similarity=0.135 Sum_probs=38.9
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
.+|.|......---.+|-..+...| .|..+.+......+......-|+..+.+.....+..|...
T Consensus 7 ~~l~i~~~dr~GlL~dI~~~i~~~~~nI~~i~~~~~~~~~~~~~~l~v~V~d~~~L~~ii~~L~~i 72 (80)
T PF13291_consen 7 VRLRIEAEDRPGLLADITSVISENGVNIRSINARTNKDDGTARITLTVEVKDLEHLNQIIRKLRQI 72 (80)
T ss_dssp EEEEEEEE--TTHHHHHHHHHHCSSSEEEEEEEEE--ETTEEEEEEEEEESSHHHHHHHHHHHCTS
T ss_pred EEEEEEEEcCCCHHHHHHHHHHHCCCCeEEEEeEEeccCCEEEEEEEEEECCHHHHHHHHHHHHCC
Confidence 3455544444445677888888776 7888877654223323334445567888888888877653
No 246
>cd04880 ACT_AAAH-PDT-like ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH). ACT domain of the nonheme iron-dependent, aromatic amino acid hydroxylases (AAAH): Phenylalanine hydroxylases (PAH), tyrosine hydroxylases (TH) and tryptophan hydroxylases (TPH), both peripheral (TPH1) and neuronal (TPH2) enzymes. This family of enzymes shares a common catalytic mechanism, in which dioxygen is used by an active site containing a single, reduced iron atom to hydroxylate an unactivated aromatic substrate, concomitant with a two-electron oxidation of tetrahydropterin (BH4) cofactor to its quinonoid dihydropterin form. Eukaryotic AAAHs have an N-terminal ACT (regulatory) domain, a middle catalytic domain and a C-terminal domain which is responsible for the oligomeric state of the enzyme forming a domain-swapped tetrameric coiled-coil. The PAH, TH, and TPH enzymes contain highly conserved catalytic domains but distinct N-terminal ACT domains and differ in their mech
Probab=30.92 E-value=1.1e+02 Score=17.58 Aligned_cols=51 Identities=22% Similarity=0.225 Sum_probs=30.2
Q ss_pred CHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcC---HHHHHHHHHHhCC
Q psy7051 26 TPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFD---KRDAEDALDAMDG 77 (158)
Q Consensus 26 ~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~---~~~a~~a~~~l~g 77 (158)
.-.++.+.|..+| .|..+.-...+ .....-.-||++.. ....+.+++.|..
T Consensus 12 ~L~~vL~~f~~~~vni~~I~Srp~~-~~~~~~~f~id~~~~~~~~~~~~~l~~l~~ 66 (75)
T cd04880 12 ALAKALKVFAERGINLTKIESRPSR-KGLWEYEFFVDFEGHIDDPDVKEALEELKR 66 (75)
T ss_pred HHHHHHHHHHHCCCCEEEEEeeecC-CCCceEEEEEEEECCCCCHHHHHHHHHHHH
Confidence 4567788888886 77777443222 11223356778774 4556667766643
No 247
>CHL00123 rps6 ribosomal protein S6; Validated
Probab=30.87 E-value=1.4e+02 Score=18.69 Aligned_cols=57 Identities=9% Similarity=0.168 Sum_probs=31.1
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhc--------CceEEEEEcc-----CCCCCCccc-EEEEEEcCHHHHHHHHH
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERC--------GEVGDIYIPR-----DRFTRESRG-FAFVRFFDKRDAEDALD 73 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~--------G~i~~v~~~~-----~~~~~~~~g-~afV~f~~~~~a~~a~~ 73 (158)
++|| |.+.++++++..+.+.+ |.|..+...- -+-.....| |.++.|....++.+.++
T Consensus 10 ~~~I--l~p~l~e~~~~~~~~~~~~~i~~~gg~i~~~~~wG~r~LAY~I~k~~~G~Yv~~~f~~~~~~i~ele 80 (97)
T CHL00123 10 TMYL--LKPDLNEEELLKWIENYKKLLRKRGAKNISVQNRGKRKLSYKINKYEDGIYIQMNYSGNGKLVNSLE 80 (97)
T ss_pred EEEE--ECCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEeecCeeeeEEcCCCCEEEEEEEEEEECHHHHHHHH
Confidence 3454 56677777766655443 3665554210 011223345 68888886666666554
No 248
>PF10905 DUF2695: Protein of unknown function (DUF2695); InterPro: IPR024248 This bacterial family of proteins has no known function.
Probab=29.87 E-value=74 Score=17.76 Aligned_cols=24 Identities=13% Similarity=0.052 Sum_probs=18.5
Q ss_pred CCCHHHHHHHhhhcCceEEEEEcc
Q psy7051 24 RTTPDDLRRVFERCGEVGDIYIPR 47 (158)
Q Consensus 24 ~~~~~~l~~~f~~~G~i~~v~~~~ 47 (158)
.++.++|.+.|.+.|.-..+.|+.
T Consensus 29 ~~~~~~vl~~l~~nGg~CDCEVl~ 52 (53)
T PF10905_consen 29 QLDWEDVLEWLRENGGYCDCEVLY 52 (53)
T ss_pred CCCHHHHHHHHHHcCCCcceeeec
Confidence 345688999999999887777653
No 249
>PRK10672 rare lipoprotein A; Provisional
Probab=29.71 E-value=2.6e+02 Score=22.28 Aligned_cols=64 Identities=9% Similarity=0.200 Sum_probs=42.4
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhh-cCceEEEEEccCCCCCCcccE-EEE-EEcCHHHHHHHHHHhCCcccCC
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGF-AFV-RFFDKRDAEDALDAMDGRMLDG 82 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~-~G~i~~v~~~~~~~~~~~~g~-afV-~f~~~~~a~~a~~~l~g~~i~g 82 (158)
....|.|+-+......+.|.+-|.. ||....+.. . + .-| ..| -|.+.++|+++++.|....+..
T Consensus 288 g~y~VQVGAFsn~a~Ae~L~akL~~~~gvpa~V~~--~---g--~lYRVrLGPF~sr~eA~kl~~rLk~~g~~q 354 (361)
T PRK10672 288 GNFVVQVGAVSDQQRAQQWQQSLSQRFGVPGRVTQ--N---G--AVYRVQLGPFASRQQASALQQRLQTEAQQQ 354 (361)
T ss_pred CCEEEEEEeCCCHHHHHHHHHHHHHHhCCCeEEee--C---C--CEEEEEeCCCCCHHHHHHHHHHHHhccCCC
Confidence 3468999999988888888888875 774333321 1 1 112 222 4889999999999987654433
No 250
>TIGR00755 ksgA dimethyladenosine transferase. Alternate name: S-adenosylmethionine--6-N',N'-adenosyl (rRNA) dimethyltransferase
Probab=29.14 E-value=74 Score=23.49 Aligned_cols=24 Identities=21% Similarity=0.263 Sum_probs=20.0
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcC
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCG 38 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G 38 (158)
-+.|+|||..++...|..++..+|
T Consensus 96 ~~vvsNlPy~i~~~il~~ll~~~~ 119 (253)
T TIGR00755 96 LKVVSNLPYNISSPLIFKLLEKPK 119 (253)
T ss_pred ceEEEcCChhhHHHHHHHHhccCC
Confidence 377999999999999999986443
No 251
>PF05727 UPF0228: Uncharacterised protein family (UPF0228); InterPro: IPR008887 This small family of proteins is currently restricted to Methanosarcina species. Members of this family are about 200 residues in length, except for Q8TMK1 from SWISSPROT that has two copies of this region. Although the function of this region is unknown the pattern of conservation suggests that this may be an enzyme, including multiple conserved aspartate and glutamate residues. The most conserved motif in these proteins is NEL/MEXNE/D, where X can be any amino acid, and is found at the C terminus of these proteins.
Probab=28.50 E-value=1.8e+02 Score=19.34 Aligned_cols=55 Identities=16% Similarity=0.297 Sum_probs=30.0
Q ss_pred CCCC-CCCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCH--------HHHHHHHHHhCCcccCCceEEEEe
Q psy7051 20 NLTY-RTTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDK--------RDAEDALDAMDGRMLDGRELRVQM 89 (158)
Q Consensus 20 nl~~-~~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~--------~~a~~a~~~l~g~~i~g~~i~v~~ 89 (158)
-|+. .+..+.+.++++++- +|+.+. +|+|.|.+. ++|.... |..+-+...+.|.+
T Consensus 58 ~vSEqai~d~~FL~iL~K~nLqvKKfV------------wCyI~f~dgs~nywIpe~DAirIK---nELE~NE~il~V~~ 122 (127)
T PF05727_consen 58 MVSEQAIEDKNFLEILEKYNLQVKKFV------------WCYIRFGDGSKNYWIPEKDAIRIK---NELEMNEKILIVYL 122 (127)
T ss_pred EeehhhcCchhHHHHHHhcCcceeEEE------------EEEEEcCCCCcccccchHHHHHHH---HHHhcCCcEEEEEe
Confidence 3443 344557788888875 444332 477777532 3444444 33444555666654
No 252
>KOG1232|consensus
Probab=28.46 E-value=74 Score=25.70 Aligned_cols=50 Identities=22% Similarity=0.244 Sum_probs=34.1
Q ss_pred CCCCCHHHHHHHhh----hcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHH
Q psy7051 22 TYRTTPDDLRRVFE----RCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDAL 72 (158)
Q Consensus 22 ~~~~~~~~l~~~f~----~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~ 72 (158)
-.+-|--+|+++|- ..|-|+.+.|...+ ..+....||+-.++.+++++++
T Consensus 233 RKDNTgydlkhLFIGSEGtlGVvT~vSil~~~-kpksvn~af~gi~sf~~v~k~f 286 (511)
T KOG1232|consen 233 RKDNTGYDLKHLFIGSEGTLGVVTKVSILAPP-KPKSVNVAFIGIESFDDVQKVF 286 (511)
T ss_pred cccCccccchhheecCCceeeEEeeEEEeecC-CCcceeEEEEccccHHHHHHHH
Confidence 33445567888883 45778888887664 3445678999888777766543
No 253
>PRK12280 rplW 50S ribosomal protein L23; Reviewed
Probab=28.28 E-value=2.1e+02 Score=19.92 Aligned_cols=32 Identities=13% Similarity=0.111 Sum_probs=24.3
Q ss_pred EEEEeCCCCCCCHHHHHHHhhh-cC-ceEEEEEc
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFER-CG-EVGDIYIP 46 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~-~G-~i~~v~~~ 46 (158)
..|+-.++..++-.+|+..|+. || .|..|..+
T Consensus 23 N~ytF~V~~~anK~eIK~AVE~iF~VkV~~VNT~ 56 (158)
T PRK12280 23 NVYTFKVDRRANKIEIKKAVEFIFKVKVLKVNIF 56 (158)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHhCCceEEEEeE
Confidence 3566678999999999999987 66 56666543
No 254
>KOG0829|consensus
Probab=28.17 E-value=2.1e+02 Score=19.88 Aligned_cols=58 Identities=12% Similarity=0.157 Sum_probs=30.1
Q ss_pred CCCcEEEEeCC--C-CCCCHHHHHHHhh-------hcCceEEEEEccCCCCCCcccEE-EEEEcCHHHH
Q psy7051 11 EGMVSLKVDNL--T-YRTTPDDLRRVFE-------RCGEVGDIYIPRDRFTRESRGFA-FVRFFDKRDA 68 (158)
Q Consensus 11 ~~~~~l~V~nl--~-~~~~~~~l~~~f~-------~~G~i~~v~~~~~~~~~~~~g~a-fV~f~~~~~a 68 (158)
++...||...| + .-+...-+.-+++ .-|+|..|.-+..+.-..-+.|+ ++.|++....
T Consensus 15 ~p~p~l~~m~ifa~N~V~AKsrfwyfl~~l~KvKks~Geiv~i~qi~E~~p~~vkNfGIwlrYdSRsG~ 83 (169)
T KOG0829|consen 15 EPTPKLYRMRIFAPNHVVAKSRFWYFLSKLKKVKKSSGEIVAINQIFEKSPLKVKNFGIWLRYDSRSGT 83 (169)
T ss_pred CCCCceEEEEEeccceeehhHHHHHHHHHHHHHhhcCceEEEeceecCCCCceeeeeEEEEEEccCCcc
Confidence 44455666655 2 2233333344433 34888888655554444445554 3666655333
No 255
>COG4009 Uncharacterized protein conserved in archaea [Function unknown]
Probab=28.16 E-value=72 Score=19.47 Aligned_cols=23 Identities=22% Similarity=0.245 Sum_probs=15.4
Q ss_pred EEEeCCCCCCCHHHHHHHhhhcC
Q psy7051 16 LKVDNLTYRTTPDDLRRVFERCG 38 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~~G 38 (158)
-+|-=|....++++|+..|+..|
T Consensus 51 y~V~Fl~~~~s~eev~~ele~mg 73 (88)
T COG4009 51 YYVVFLEEVESEEEVERELEDMG 73 (88)
T ss_pred EEEEEEeccCCHHHHHHHHHHhC
Confidence 33334556667888888887776
No 256
>KOG0984|consensus
Probab=27.75 E-value=1.2e+02 Score=22.71 Aligned_cols=41 Identities=20% Similarity=0.370 Sum_probs=29.4
Q ss_pred CCCCCCCCCcEEEEeCCCCCCCHHHHHHHhh----hcCceEEEEE
Q psy7051 5 RPPPRIEGMVSLKVDNLTYRTTPDDLRRVFE----RCGEVGDIYI 45 (158)
Q Consensus 5 ~p~~~~~~~~~l~V~nl~~~~~~~~l~~~f~----~~G~i~~v~~ 45 (158)
.||...+....++|++-..++.+++|.-+.+ .||.++.+..
T Consensus 23 tpp~~Ld~~a~i~I~~~~~eV~ad~L~~i~elGrGayG~vekmrh 67 (282)
T KOG0984|consen 23 TPPRNLDSRALIYIGDRNFEVPADDLVGIEELGRGAYGVVEKMRH 67 (282)
T ss_pred CCCcccCCCceEEEecCccccchhhhhhhhhhcCCccchhhheee
Confidence 4555666678899999899999999877653 3666655443
No 257
>PF08538 DUF1749: Protein of unknown function (DUF1749); InterPro: IPR013744 This is a plant and fungal family of unknown function. This family contains many hypothetical proteins. ; PDB: 2Q0X_B.
Probab=27.72 E-value=60 Score=25.11 Aligned_cols=62 Identities=15% Similarity=0.173 Sum_probs=31.2
Q ss_pred CCCcEEEEeCCCCCCC----HHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEc-CHHHHHHHHHHhCC
Q psy7051 11 EGMVSLKVDNLTYRTT----PDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFF-DKRDAEDALDAMDG 77 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~----~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~-~~~~a~~a~~~l~g 77 (158)
.+...||||+|....- -.+|.+.+..-+ .+.++.+... -.||++-..+ +.++..++|+.|-.
T Consensus 32 ~~~~llfIGGLtDGl~tvpY~~~La~aL~~~~wsl~q~~LsSS-----y~G~G~~SL~~D~~eI~~~v~ylr~ 99 (303)
T PF08538_consen 32 APNALLFIGGLTDGLLTVPYLPDLAEALEETGWSLFQVQLSSS-----YSGWGTSSLDRDVEEIAQLVEYLRS 99 (303)
T ss_dssp SSSEEEEE--TT--TT-STCHHHHHHHHT-TT-EEEEE--GGG-----BTTS-S--HHHHHHHHHHHHHHHHH
T ss_pred CCcEEEEECCCCCCCCCCchHHHHHHHhccCCeEEEEEEecCc-----cCCcCcchhhhHHHHHHHHHHHHHH
Confidence 4456899999977532 456777776544 5555555332 2467766655 46667777765443
No 258
>PRK00274 ksgA 16S ribosomal RNA methyltransferase KsgA/Dim1 family protein; Reviewed
Probab=27.68 E-value=82 Score=23.62 Aligned_cols=22 Identities=32% Similarity=0.300 Sum_probs=18.4
Q ss_pred EEEEeCCCCCCCHHHHHHHhhh
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFER 36 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~ 36 (158)
.+.|+|+|..++..-|..++..
T Consensus 107 ~~vv~NlPY~iss~ii~~~l~~ 128 (272)
T PRK00274 107 LKVVANLPYNITTPLLFHLLEE 128 (272)
T ss_pred ceEEEeCCccchHHHHHHHHhc
Confidence 5779999999998888888754
No 259
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=27.53 E-value=69 Score=16.14 Aligned_cols=17 Identities=18% Similarity=0.198 Sum_probs=14.1
Q ss_pred CCCHHHHHHHhhhcCce
Q psy7051 24 RTTPDDLRRVFERCGEV 40 (158)
Q Consensus 24 ~~~~~~l~~~f~~~G~i 40 (158)
.++.++|++.+..+|-+
T Consensus 3 tWs~~~L~~wL~~~gi~ 19 (38)
T PF10281_consen 3 TWSDSDLKSWLKSHGIP 19 (38)
T ss_pred CCCHHHHHHHHHHcCCC
Confidence 56889999999999843
No 260
>PF08002 DUF1697: Protein of unknown function (DUF1697); InterPro: IPR012545 This family contains many hypothetical bacterial proteins.; PDB: 2HIY_B.
Probab=27.26 E-value=1.9e+02 Score=19.36 Aligned_cols=31 Identities=13% Similarity=0.234 Sum_probs=18.6
Q ss_pred cEEEEeCCCC----CCCHHHHHHHhhhcCceEEEEE
Q psy7051 14 VSLKVDNLTY----RTTPDDLRRVFERCGEVGDIYI 45 (158)
Q Consensus 14 ~~l~V~nl~~----~~~~~~l~~~f~~~G~i~~v~~ 45 (158)
..+++.+++- .+.-.+|+++|...| ...|.-
T Consensus 4 yiaLLRGINVGG~nki~MaeLr~~l~~~G-f~~V~T 38 (137)
T PF08002_consen 4 YIALLRGINVGGKNKIKMAELREALEDLG-FTNVRT 38 (137)
T ss_dssp EEEEESS-SBTTBS---HHHHHHHHHHCT--EEEEE
T ss_pred EEEEEcceecCCCCcccHHHHHHHHHHcC-CCCceE
Confidence 3466676643 368899999999987 445543
No 261
>PF14401 RLAN: RimK-like ATPgrasp N-terminal domain
Probab=27.22 E-value=1.5e+02 Score=20.42 Aligned_cols=58 Identities=14% Similarity=0.104 Sum_probs=34.4
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHH
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAED 70 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~ 70 (158)
..+||.|.-+..--+.--+.+|+.|- +|..|.+..............+.+....+.+.
T Consensus 87 ~l~iyFG~~~~~~~~~lAr~lFe~F~~PlL~v~~~~~~~~w~i~~i~~~~~~~l~~~e~ 145 (153)
T PF14401_consen 87 ELSIYFGQTPDPRLERLARQLFERFPCPLLEVEFVRDDGKWRISSIKPLSLSELSEEEQ 145 (153)
T ss_pred EEEEEECCCCCHHHHHHHHHHHHhCCCceEEEEEEecCCcEEEeeEeecChhhCCHHHH
Confidence 35788877655545555678899885 77778776664323334445555554444333
No 262
>PRK00110 hypothetical protein; Validated
Probab=26.74 E-value=2.2e+02 Score=21.25 Aligned_cols=44 Identities=25% Similarity=0.352 Sum_probs=28.2
Q ss_pred CcEEEEeCCCCCC--CHHHHHHHhhhcCc-e---EEEEEccCCCCCCcccEEEEEEc
Q psy7051 13 MVSLKVDNLTYRT--TPDDLRRVFERCGE-V---GDIYIPRDRFTRESRGFAFVRFF 63 (158)
Q Consensus 13 ~~~l~V~nl~~~~--~~~~l~~~f~~~G~-i---~~v~~~~~~~~~~~~g~afV~f~ 63 (158)
...|.|--|..+. |..+|+.+|.++|. + -.|.++.+. .+.|+|.
T Consensus 94 GvaiiVe~lTDN~nRt~~~vR~~f~K~gG~l~~~Gsv~~~Fe~-------kG~i~~~ 143 (245)
T PRK00110 94 GVAIIVEALTDNRNRTAAEVRHAFSKNGGNLGETGSVSYMFDR-------KGVIVIE 143 (245)
T ss_pred CeEEEEEEecCCHHHHHHHHHHHHHhcCceeCCCcceEEEecc-------ceEEEeC
Confidence 3556777776655 67899999999863 3 224444442 4666665
No 263
>CHL00030 rpl23 ribosomal protein L23
Probab=26.73 E-value=1.7e+02 Score=18.34 Aligned_cols=31 Identities=13% Similarity=0.102 Sum_probs=23.1
Q ss_pred EEEeCCCCCCCHHHHHHHhhh-cC-ceEEEEEc
Q psy7051 16 LKVDNLTYRTTPDDLRRVFER-CG-EVGDIYIP 46 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~-~G-~i~~v~~~ 46 (158)
.|+--++..++..+|++.++. || .|..|..+
T Consensus 21 ~y~F~V~~~anK~eIK~avE~lf~VkV~~VNt~ 53 (93)
T CHL00030 21 QYTFDVDSGSTKTEIKHWIELFFGVKVIAVNSH 53 (93)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCeEEEEEEE
Confidence 455567889999999999988 66 56666543
No 264
>PRK04405 prsA peptidylprolyl isomerase; Provisional
Probab=26.71 E-value=1.8e+02 Score=22.27 Aligned_cols=40 Identities=13% Similarity=0.280 Sum_probs=27.9
Q ss_pred CCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCC
Q psy7051 24 RTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDG 77 (158)
Q Consensus 24 ~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g 77 (158)
.+|+++|+.+|..+.+ .+. ...|.+.+.+.|+.+++.+..
T Consensus 128 ~Vtd~ei~~~y~~~~~--~~~------------v~hIlv~~~~~A~~v~~~l~~ 167 (298)
T PRK04405 128 KVTNSQLKKAWKSYQP--KVT------------VQHILVSKKSTAETVIKKLKD 167 (298)
T ss_pred CCCHHHHHHHHHHhhh--hEE------------EEEEEecChHHHHHHHHHHHC
Confidence 4799999999987642 121 245666677888888887653
No 265
>PRK12378 hypothetical protein; Provisional
Probab=26.67 E-value=2.7e+02 Score=20.73 Aligned_cols=28 Identities=29% Similarity=0.364 Sum_probs=21.2
Q ss_pred CCcEEEEeCCCCCC--CHHHHHHHhhhcCc
Q psy7051 12 GMVSLKVDNLTYRT--TPDDLRRVFERCGE 39 (158)
Q Consensus 12 ~~~~l~V~nl~~~~--~~~~l~~~f~~~G~ 39 (158)
+...|.|--|..+. +..+|+.+|.++|.
T Consensus 90 gGvaiiVe~lTDN~nRt~~~vr~~f~K~gg 119 (235)
T PRK12378 90 NGVMVIVECLTDNVNRTVANVRSAFNKNGG 119 (235)
T ss_pred CCcEEEEEECCCCHHHHHHHHHHHHhhcCC
Confidence 34567777787665 67899999999864
No 266
>PHA01632 hypothetical protein
Probab=26.50 E-value=83 Score=17.75 Aligned_cols=21 Identities=19% Similarity=0.581 Sum_probs=16.2
Q ss_pred EEEeCCCCCCCHHHHHHHhhh
Q psy7051 16 LKVDNLTYRTTPDDLRRVFER 36 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~ 36 (158)
|.|..+|..-|+++|+..+.+
T Consensus 19 ilieqvp~kpteeelrkvlpk 39 (64)
T PHA01632 19 ILIEQVPQKPTEEELRKVLPK 39 (64)
T ss_pred EehhhcCCCCCHHHHHHHHHH
Confidence 345688999999999887654
No 267
>PRK10927 essential cell division protein FtsN; Provisional
Probab=26.32 E-value=2.4e+02 Score=22.03 Aligned_cols=67 Identities=9% Similarity=-0.015 Sum_probs=48.1
Q ss_pred CCcEEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEE-EE-EEcCHHHHHHHHHHhCCcccCCceE
Q psy7051 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFA-FV-RFFDKRDAEDALDAMDGRMLDGREL 85 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~a-fV-~f~~~~~a~~a~~~l~g~~i~g~~i 85 (158)
....|.||.+....+.+.|..-+...|--.+|. . .+ ..|- +| -|.+.++|+.++..|.+.-|.++.+
T Consensus 246 ~~~~VQvGSF~n~~nAE~LrAkLa~~G~~A~I~--~---~g--~~~RVrVGPf~sr~eAe~a~~rLk~aGis~ci~ 314 (319)
T PRK10927 246 RRWMVQCGSFRGAEQAETVRAQLAFEGFDSKIT--T---NN--GWNRVVIGPVKGKENADSTLNRLKMAGHTNCIR 314 (319)
T ss_pred CcEEEEeCccCCHHHHHHHHHHHHHcCCeeEEc--c---CC--cEEEEEeCCCCCHHHHHHHHHHHHHCCCCceee
Confidence 457889999988888888998888888555552 1 11 1222 22 3889999999999998887776654
No 268
>TIGR01873 cas_CT1978 CRISPR-associated endoribonuclease Cas2, E. coli subfamily. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This model represents a minor branch of the Cas2 family of CRISPR-associated endonuclease, whereas most Cas2 proteins are modeled instead by TIGR01573. This form of Cas2 is characteristic for the Ecoli subtype of CRISPR/Cas locus.
Probab=26.31 E-value=51 Score=20.44 Aligned_cols=48 Identities=13% Similarity=0.213 Sum_probs=26.2
Q ss_pred CCCcEEEEeCCCCCCCHHHHHHHhhhc----CceEEEEEccCCCCCCcccEEEEEEcC
Q psy7051 11 EGMVSLKVDNLTYRTTPDDLRRVFERC----GEVGDIYIPRDRFTRESRGFAFVRFFD 64 (158)
Q Consensus 11 ~~~~~l~V~nl~~~~~~~~l~~~f~~~----G~i~~v~~~~~~~~~~~~g~afV~f~~ 64 (158)
|...-|||++++..+-+. |.+...+. |.+.-+.- +....||.|-++..
T Consensus 23 Ev~~GVyVg~~s~rVRe~-lW~~v~~~~~~~G~avm~~~-----~~~e~G~~~~t~G~ 74 (87)
T TIGR01873 23 EPRAGVYVGGVSASVRER-IWDYLAQHCPPKGSLVITWS-----SNTCPGFEFFTLGE 74 (87)
T ss_pred ecCCCcEEcCCCHHHHHH-HHHHHHHhCCCCccEEEEEe-----CCCCCCcEEEecCC
Confidence 344569999998876554 33333332 44432221 12245788877754
No 269
>KOG2854|consensus
Probab=26.29 E-value=98 Score=24.34 Aligned_cols=65 Identities=17% Similarity=0.177 Sum_probs=35.4
Q ss_pred CCCCcEEEEeCCCCCCCHHHHHHHhhh---cCc--eEEEEEccCC-----CCCCcccEEEEEEcCHHHHHHHHHH
Q psy7051 10 IEGMVSLKVDNLTYRTTPDDLRRVFER---CGE--VGDIYIPRDR-----FTRESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 10 ~~~~~~l~V~nl~~~~~~~~l~~~f~~---~G~--i~~v~~~~~~-----~~~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
++....+||.++-..+..+.|..+-+. .-. +..+..+... .-.+...|+.|.|.|+++|.+.++.
T Consensus 157 veka~v~yv~Gffltv~p~ai~~v~qh~~e~~r~~~lnlsapfI~q~~~~~l~~v~~y~DiifgNe~EA~af~~~ 231 (343)
T KOG2854|consen 157 VEKAKVFYVAGFFLTVSPDAIRKVAQHAAENNRVFTLNLSAPFISQFFKDALDKVLPYADIIFGNEDEAAAFARA 231 (343)
T ss_pred hhheeEEEEEEEEEEeChHHHHHHHHHHHHhcchhheeccchhHHHHHHHHHHhhcCcceEEEcCHHHHHHHHHh
Confidence 344567888888777765554433222 111 1111111000 0112345899999999999887754
No 270
>PF01037 AsnC_trans_reg: AsnC family; InterPro: IPR019887 The many bacterial transcription regulation proteins which bind DNA through a 'helix-turn-helix' motif can be classified into subfamilies on the basis of sequence similarities. One such family is the AsnC/Lrp subfamily []. The Lrp family of transcriptional regulators appears to be widely distributed among bacteria and archaea, as an important regulatory system of the amino acid metabolism and related processes []. Members of the Lrp family are small DNA-binding proteins with molecular masses of around 15 kDa. Target promoters often contain a number of binding sites that typically lack obvious inverted repeat elements, and to which binding is usually co-operative. LrpA from Pyrococcus furiosus is the first Lrp-like protein to date of which a three-dimensional structure has been solved. In the crystal structure LrpA forms an octamer consisting of four dimers. The structure revealed that the N-terminal part of the protein consists of a helix-turn-helix (HTH) domain, a fold generally involved in DNA binding. The C terminus of Lrp-like proteins has a beta-fold, where the two alpha-helices are located at one side of the four-stranded antiparallel beta-sheet. LrpA forms a homodimer mainly through interactions between the beta-strands of this C-terminal domain, and an octamer through further interactions between the second alpha-helix and fourth beta-strand of the motif. Hence, the C-terminal domain of Lrp-like proteins appears to be involved in ligand-response and activation [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 2DJW_F 2GQQ_A 2ZNY_E 2ZNZ_G 1RI7_A 2CYY_A 2E1C_A 2CG4_B 2DBB_B 1I1G_A ....
Probab=25.59 E-value=1.4e+02 Score=16.85 Aligned_cols=45 Identities=20% Similarity=0.177 Sum_probs=35.2
Q ss_pred CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHH
Q psy7051 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
..+++.+.+...-.|..|..+.. ...-...|.+.+.++....+..
T Consensus 11 ~~~~~~~~l~~~p~V~~~~~vtG----~~d~~~~v~~~d~~~l~~~i~~ 55 (74)
T PF01037_consen 11 AYDEFAEALAEIPEVVECYSVTG----EYDLILKVRARDMEELEEFIRE 55 (74)
T ss_dssp HHHHHHHHHHTSTTEEEEEEESS----SSSEEEEEEESSHHHHHHHHHH
T ss_pred hHHHHHHHHHcCCCEEEEEEEeC----CCCEEEEEEECCHHHHHHHHHH
Confidence 46778888899999999987644 3445788899999999998654
No 271
>PTZ00338 dimethyladenosine transferase-like protein; Provisional
Probab=25.54 E-value=87 Score=23.99 Aligned_cols=22 Identities=14% Similarity=0.303 Sum_probs=18.4
Q ss_pred EEEEeCCCCCCCHHHHHHHhhh
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFER 36 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~ 36 (158)
.+.|+|||+.++...|..++..
T Consensus 103 d~VvaNlPY~Istpil~~ll~~ 124 (294)
T PTZ00338 103 DVCVANVPYQISSPLVFKLLAH 124 (294)
T ss_pred CEEEecCCcccCcHHHHHHHhc
Confidence 3678999999999888888854
No 272
>TIGR00387 glcD glycolate oxidase, subunit GlcD. This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
Probab=25.45 E-value=2.4e+02 Score=22.64 Aligned_cols=48 Identities=21% Similarity=0.074 Sum_probs=31.8
Q ss_pred HHHHHHhhh----cCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 28 DDLRRVFER----CGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 28 ~~l~~~f~~----~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
-+|..+|.. +|-|+.+.+...+.. ....+.++.|.+.+++..|+..+.
T Consensus 147 ~dl~~l~~Gs~GtlGiit~~~lkl~p~p-~~~~~~~~~f~~~~~~~~~~~~~~ 198 (413)
T TIGR00387 147 YDLTGLFVGSEGTLGIVTEATLKLLPKP-ENIVVALAFFDSIEKAMQAVYDII 198 (413)
T ss_pred CChhhhcccCCccceEEEEEEEEeecCC-CccEEEEEECCCHHHHHHHHHHHH
Confidence 357777643 677888876554422 234566788999999888875543
No 273
>PF08206 OB_RNB: Ribonuclease B OB domain; InterPro: IPR013223 This domain includes the N-terminal OB domain found in ribonuclease B proteins in one or two copies.; PDB: 2ID0_D 2IX1_A 2IX0_A.
Probab=25.41 E-value=42 Score=18.74 Aligned_cols=37 Identities=27% Similarity=0.425 Sum_probs=19.3
Q ss_pred cccEEEEEEcC-HHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 54 SRGFAFVRFFD-KRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 54 ~~g~afV~f~~-~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
..|||||...+ .++.--.-..|++ -++|-.+.|....
T Consensus 7 ~~GfGFv~~~~~~~DifIp~~~l~~-A~~gD~V~v~i~~ 44 (58)
T PF08206_consen 7 PKGFGFVIPDDGGEDIFIPPRNLNG-AMDGDKVLVRITP 44 (58)
T ss_dssp SSS-EEEEECT-TEEEEE-HHHHTT-S-TT-EEEEEEEE
T ss_pred cCCCEEEEECCCCCCEEECHHHHCC-CCCCCEEEEEEec
Confidence 46899999886 3333222333433 3456667776665
No 274
>PF08442 ATP-grasp_2: ATP-grasp domain; InterPro: IPR013650 The ATP-grasp superfamily currently includes 17 groups of enzymes, catalyzing ATP-dependent ligation of a carboxylate containing molecule to an amino or thiol group-containing molecule []. They contribute predominantly to macromolecular synthesis. ATP-hydrolysis is used to activate a substrate. For example, DD-ligase transfers phosphate from ATP to D-alanine on the first step of catalysis. On the second step the resulting acylphosphate is attacked by a second D-alanine to produce a DD dipeptide following phosphate elimination []. The ATP-grasp domain contains three conserved motifs, corresponding to the phosphate binding loop and the Mg(2+) binding site []. The fold is characterised by two alpha-beta subdomains that grasp the ATP molecule between them. Each subdomain provides a variable loop that forms a part of the active site, completed by region of other domains not conserved between the various ATP-grasp enzymes []. The ATP-grasp domain represented by this entry is found primarily in succinyl-CoA synthetases (6.2.1.5 from EC).; PDB: 3PFF_A 3MWD_A 3MWE_A 1CQI_E 1SCU_B 2NU9_G 2NU6_E 1CQJ_E 2NU7_B 1JLL_E ....
Probab=25.38 E-value=2.1e+02 Score=20.62 Aligned_cols=54 Identities=19% Similarity=0.218 Sum_probs=34.5
Q ss_pred CCHHHHHHHhhhcCc---eEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC
Q psy7051 25 TTPDDLRRVFERCGE---VGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD 81 (158)
Q Consensus 25 ~~~~~l~~~f~~~G~---i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~ 81 (158)
.+.+++.++...+|. |...++.. .|+-++-+...-.+.++|..+.+.|=|..|.
T Consensus 25 ~s~eea~~~~~~l~~~~~VvKaQvl~---GgRGK~GgVk~~~s~~ea~~~a~~mlg~~l~ 81 (202)
T PF08442_consen 25 TSPEEAREAAKELGGKPLVVKAQVLA---GGRGKAGGVKIAKSPEEAKEAAKEMLGKTLK 81 (202)
T ss_dssp SSHHHHHHHHHHHTTSSEEEEE-SSS---STTTTTTCEEEESSHHHHHHHHHTTTTSEEE
T ss_pred CCHHHHHHHHHHhCCCcEEEEEeEee---cCcccCCceeecCCHHHHHHHHHHHhCCceE
Confidence 367888888887764 44444433 3333443333345789999999888888776
No 275
>KOG2187|consensus
Probab=25.32 E-value=41 Score=27.98 Aligned_cols=38 Identities=34% Similarity=0.526 Sum_probs=33.0
Q ss_pred cccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 54 SRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 54 ~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
...++++.|++.+.+..|+..++|..+.+..+.|..+.
T Consensus 62 ~~~~~~~~~et~~~~~ka~~~v~g~~~k~~~~~~~~~~ 99 (534)
T KOG2187|consen 62 MPKYAYVTFETPSDAGKAINLVDGLLYKGFILRVQLGA 99 (534)
T ss_pred CCCceEEEEeccchhhhHHHHHhhhhhhcchhhhhhcc
Confidence 34699999999999999999999998888887777665
No 276
>PF11061 DUF2862: Protein of unknown function (DUF2862); InterPro: IPR021291 This family of proteins has no known function.
Probab=24.52 E-value=1.5e+02 Score=17.28 Aligned_cols=31 Identities=16% Similarity=0.364 Sum_probs=19.9
Q ss_pred HHHHHHhhh--cCceEEEEEccCCCCCCcccE-EEEEEcC
Q psy7051 28 DDLRRVFER--CGEVGDIYIPRDRFTRESRGF-AFVRFFD 64 (158)
Q Consensus 28 ~~l~~~f~~--~G~i~~v~~~~~~~~~~~~g~-afV~f~~ 64 (158)
++|.+.+.+ .|.|...++... .|. .+|+|.+
T Consensus 18 ~~l~~~l~~~~~g~I~~fKmtDG------~giG~vv~~~n 51 (64)
T PF11061_consen 18 KELVDKLGKNPIGTIKGFKMTDG------SGIGVVVEFSN 51 (64)
T ss_pred HHHHHHhccCCcEEEEEEEEecC------CcEEEEEEecC
Confidence 345555665 889999998654 344 4566764
No 277
>PF00521 DNA_topoisoIV: DNA gyrase/topoisomerase IV, subunit A; InterPro: IPR002205 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks []. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis [, ]. DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA []. Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils []. Type IIA topoisomerases together manage chromosome integrity and topology in cells. Topoisomerase II (called gyrase in bacteria) primarily introduces negative supercoils into DNA. In bacteria, topoisomerase II consists of two polypeptide subunits, gyrA and gyrB, which form a heterotetramer: (BA)2. In most eukaryotes, topoisomerase II consists of a single polypeptide, where the N- and C-terminal regions correspond to gyrB and gyrA, respectively; this topoisomerase II forms a homodimer that is equivalent to the bacterial heterotetramer. There are four functional domains in topoisomerase II: domain 1 (N-terminal of gyrB) is an ATPase, domain 2 (C-terminal of gyrB) is responsible for subunit interactions (differs between eukaryotic and bacterial enzymes), domain 3 (N-terminal of gyrA) is responsible for the breaking-rejoining function through its capacity to form protein-DNA bridges, and domain 4 (C-terminal of gyrA) is able to non-specifically bind DNA []. Topoisomerase IV primarily decatenates DNA and relaxes positive supercoils, which is important in bacteria, where the circular chromosome becomes catenated, or linked, during replication []. Topoisomerase IV consists of two polypeptide subunits, parE and parC, where parC is homologous to gyrA and parE is homologous to gyrB. This entry represents subunit A (gyrA and parC) of bacterial gyrase and topoisomerase IV, and the equivalent C-terminal region in eukaryotic topoisomerase II composed of a single polypeptide. This subunit has DNA-binding capacity. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0005694 chromosome; PDB: 1ZVU_A 1AB4_A 1X75_A 3NUH_A 1BJT_A 1BGW_A 2RGR_A 3KSB_B 3FOE_B 2NOV_C ....
Probab=24.51 E-value=1.9e+02 Score=23.38 Aligned_cols=56 Identities=18% Similarity=0.305 Sum_probs=30.9
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhc---CceEEEEEccCCCCCCcccEEE-EEEcCHHHHHHHHH
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERC---GEVGDIYIPRDRFTRESRGFAF-VRFFDKRDAEDALD 73 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~---G~i~~v~~~~~~~~~~~~g~af-V~f~~~~~a~~a~~ 73 (158)
.+|.|.-||..++.+++.+.+... +.|. |.=+.+. +.. +..| |++....+++..++
T Consensus 217 ~~i~ITELP~~~~t~~~~e~i~~l~~~~~i~-I~d~~D~-Sd~--~v~ivI~lk~~~~~~~~~~ 276 (426)
T PF00521_consen 217 NTIVITELPYGVWTEKYKEKIEELVEDKKIK-ISDYRDE-SDR--GVRIVIELKRGADPEKILE 276 (426)
T ss_dssp EEEEEEE--TT--HHHHHHHHHHHHHTTSST-EEEEEE--BBT--BS-EEEEESTTSHHHHHHH
T ss_pred cEEEEEeCCccccHHHHHHHHHHHhccCcch-hhHHHhc-CCC--ceeEEEEecCCccHHHHHH
Confidence 689999999999988887776653 5666 5444443 443 5555 45555444444444
No 278
>PRK13011 formyltetrahydrofolate deformylase; Reviewed
Probab=24.42 E-value=3e+02 Score=20.98 Aligned_cols=61 Identities=11% Similarity=-0.125 Sum_probs=34.9
Q ss_pred CCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccC
Q psy7051 20 NLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLD 81 (158)
Q Consensus 20 nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~ 81 (158)
.+|..++.++|++.|...++-..+.+....... ....|+..-.+-..+++.+.......++
T Consensus 56 ~~p~~~~~~~L~~~L~~l~~~l~l~i~i~~~~~-~~ri~vl~Sg~g~nl~al~~~~~~~~~~ 116 (286)
T PRK13011 56 HSEEGLDEDALRAGFAPIAARFGMQWELHDPAA-RPKVLIMVSKFDHCLNDLLYRWRIGELP 116 (286)
T ss_pred ecCCCCCHHHHHHHHHHHHHHhCcEEEEeeccc-CceEEEEEcCCcccHHHHHHHHHcCCCC
Confidence 478888899999999887753333322221111 2335555555555566666555555554
No 279
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=24.22 E-value=78 Score=21.03 Aligned_cols=32 Identities=13% Similarity=0.160 Sum_probs=26.1
Q ss_pred EEEeCCCCC-CCHHHHHHHhhhcCceEEEEEcc
Q psy7051 16 LKVDNLTYR-TTPDDLRRVFERCGEVGDIYIPR 47 (158)
Q Consensus 16 l~V~nl~~~-~~~~~l~~~f~~~G~i~~v~~~~ 47 (158)
|.|-|||.. .+++.|..+.+.+|++..+....
T Consensus 107 Vri~glP~~~~~~~~~~~i~~~iG~~i~vD~~t 139 (153)
T PF14111_consen 107 VRIYGLPLHLWSEEILKAIGSKIGEPIEVDENT 139 (153)
T ss_pred hhhccCCHHHhhhHHHHHHHHhcCCeEEEEcCC
Confidence 567799987 57788999999999998887643
No 280
>COG0225 MsrA Peptide methionine sulfoxide reductase [Posttranslational modification, protein turnover, chaperones]
Probab=24.20 E-value=2.5e+02 Score=19.88 Aligned_cols=74 Identities=8% Similarity=0.119 Sum_probs=38.0
Q ss_pred EEEEeCCCCCCCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEE-EcCHHH---HHHHHHHhCCcccCCceEEEEee
Q psy7051 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR-FFDKRD---AEDALDAMDGRMLDGRELRVQMA 90 (158)
Q Consensus 15 ~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~-f~~~~~---a~~a~~~l~g~~i~g~~i~v~~a 90 (158)
.|.|.==|..++-++|.++|-..-+.+.+. .+.+..-..|--+- +.+.++ |++.++.+....+.+.+|.+++.
T Consensus 59 ~V~V~yDp~~isy~~LL~~ff~ihDPT~~n---rQGnD~GtqYRs~Iy~~~~~q~~~a~~~~~~~q~~~~~~~~IvteI~ 135 (174)
T COG0225 59 AVEVTYDPKVISYEELLEVFFEIHDPTSLN---RQGNDRGTQYRSAIYYTNEEQKAIAEASIEELQASGYFKKPIVTEIE 135 (174)
T ss_pred EEEEEeCCccccHHHHHHHHheecCCCCCC---ccCCcccccceeEEEEcCHHHHHHHHHHHHHHHHhccCCCCeEEEee
Confidence 566666677788888877765432222110 01111112233333 334444 45555556555566667777765
Q ss_pred e
Q psy7051 91 R 91 (158)
Q Consensus 91 ~ 91 (158)
.
T Consensus 136 p 136 (174)
T COG0225 136 P 136 (174)
T ss_pred c
Confidence 4
No 281
>PF00276 Ribosomal_L23: Ribosomal protein L23; InterPro: IPR013025 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This domain is found in both eukaryotic L25 and prokaryotic and eukaryotic L23 proteins.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3DF4_T 3DF2_T 4A1E_R 4A1A_R 4A1C_R 4A17_R 2HGU_W 2HGQ_W 1N88_A 2HGJ_W ....
Probab=24.16 E-value=1.4e+02 Score=18.47 Aligned_cols=30 Identities=17% Similarity=0.288 Sum_probs=22.1
Q ss_pred EEEeCCCCCCCHHHHHHHhhh-cC-ceEEEEE
Q psy7051 16 LKVDNLTYRTTPDDLRRVFER-CG-EVGDIYI 45 (158)
Q Consensus 16 l~V~nl~~~~~~~~l~~~f~~-~G-~i~~v~~ 45 (158)
.++-.++..++..+|++.|+. || .|..|..
T Consensus 22 ~~tF~V~~~atK~~Ik~aie~iy~V~V~~Vnt 53 (91)
T PF00276_consen 22 QYTFEVDPRATKTEIKEAIEKIYGVKVKKVNT 53 (91)
T ss_dssp EEEEEETTTSTHHHHHHHHHHHHTSEEEEEEE
T ss_pred EEEEEEeCCCCHHHHHHHHHhhcCCCeeEEEE
Confidence 344467889999999999987 66 5555543
No 282
>PF15063 TC1: Thyroid cancer protein 1
Probab=24.15 E-value=54 Score=19.76 Aligned_cols=24 Identities=25% Similarity=0.490 Sum_probs=20.3
Q ss_pred EEeCCCCCCCHHHHHHHhhhcCce
Q psy7051 17 KVDNLTYRTTPDDLRRVFERCGEV 40 (158)
Q Consensus 17 ~V~nl~~~~~~~~l~~~f~~~G~i 40 (158)
-+.|+=.+++...|+.+|..-|+.
T Consensus 29 asaNIFe~vn~~qlqrLF~~sGD~ 52 (79)
T PF15063_consen 29 ASANIFENVNLDQLQRLFQKSGDK 52 (79)
T ss_pred hhhhhhhccCHHHHHHHHHHccch
Confidence 366777789999999999999965
No 283
>smart00434 TOP4c DNA Topoisomerase IV. Bacterial DNA topoisomerase IV, GyrA, ParC
Probab=23.97 E-value=3e+02 Score=22.60 Aligned_cols=58 Identities=19% Similarity=0.247 Sum_probs=34.0
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhh---cCceEEEEEccCCCCCCcccEEEE-EEcCHHHHHHHH
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFER---CGEVGDIYIPRDRFTRESRGFAFV-RFFDKRDAEDAL 72 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~---~G~i~~v~~~~~~~~~~~~g~afV-~f~~~~~a~~a~ 72 (158)
++|.|+-||..++.+.+.+.+.. -+.|..+.-..+. +....+..|| ++.....++..+
T Consensus 233 ~~ivItElP~~~~~~~~~e~I~~lv~~~ki~~i~~~~de-s~~~~~vrivI~lk~~~~~~~~~ 294 (445)
T smart00434 233 NTIVITELPYQVNKAKLIEKIAELVKDKKIEGIIDVRDE-SHDRTGVRIVIELKRGAMAEVVL 294 (445)
T ss_pred ceEEEEeCCCcccHHHHHHHHHHHHhcCCCCcceehhhc-cCCCCceEEEEEECCCcCHHHHH
Confidence 67999999999999888877554 3445444433332 2112445554 455443444433
No 284
>PF09341 Pcc1: Transcription factor Pcc1; InterPro: IPR015419 Pcc1 is a proposed transcription factor involved in the expression of genes regulated by alpha-factor and galactose; component of the EKC/KEOPS protein complex with Kae1, Gon7, Bud32, and Cgi121; related to human cancer-testis antigens [].; PDB: 2BNR_C 2P5W_C 3KLA_C 2F54_C 2P5E_C 2F53_C 3ENO_E 3ENC_B.
Probab=23.87 E-value=1.3e+02 Score=17.64 Aligned_cols=20 Identities=15% Similarity=0.104 Sum_probs=15.4
Q ss_pred EEEEEEcCHHHHHHHHHHhC
Q psy7051 57 FAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 57 ~afV~f~~~~~a~~a~~~l~ 76 (158)
..-|.|.+++.|+.+++.|.
T Consensus 4 ~l~i~f~s~~~A~ii~~sL~ 23 (76)
T PF09341_consen 4 TLEIPFESEEKAEIIYRSLK 23 (76)
T ss_dssp EEEEE-SSHHHHHHHHHHHH
T ss_pred EEEEEeCCHHHHHHHHHHhC
Confidence 46789999999998887654
No 285
>PLN02655 ent-kaurene oxidase
Probab=23.70 E-value=2.1e+02 Score=23.03 Aligned_cols=49 Identities=16% Similarity=0.100 Sum_probs=33.6
Q ss_pred EEeCCCCCC---CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHH
Q psy7051 17 KVDNLTYRT---TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 17 ~V~nl~~~~---~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
+||||..-. ....+.+.+.+||+|..+.+. +.-.|...+++.++.++.+
T Consensus 9 ~iG~l~~~~~~~~~~~~~~~~~~yG~i~~~~~g---------~~~~vvv~~pe~~k~il~~ 60 (466)
T PLN02655 9 VIGNLLQLKEKKPHRTFTKWSEIYGPIYTIRTG---------ASSVVVLNSTEVAKEAMVT 60 (466)
T ss_pred ccccHHHcCCCchhHHHHHHHHHhCCeEEEEEC---------CEeEEEeCCHHHHHHHHHh
Confidence 456663321 245788888999998777652 2357777888999888753
No 286
>KOG2287|consensus
Probab=23.58 E-value=98 Score=24.31 Aligned_cols=37 Identities=11% Similarity=0.016 Sum_probs=28.5
Q ss_pred CCcEEEEeCCCCCCC--HHHHHHHhhhcCceEEEEEccC
Q psy7051 12 GMVSLKVDNLTYRTT--PDDLRRVFERCGEVGDIYIPRD 48 (158)
Q Consensus 12 ~~~~l~V~nl~~~~~--~~~l~~~f~~~G~i~~v~~~~~ 48 (158)
...++|+-+++.... ..+|.+.-..||+|..+.+...
T Consensus 128 ~v~~~FLvG~~~~~~~~~~~l~~Ea~~ygDIi~~df~Dt 166 (349)
T KOG2287|consen 128 RVRVLFLVGLPSNEDKLNKLLADEARLYGDIIQVDFEDT 166 (349)
T ss_pred cEEEEEEecCCCcHHHHHHHHHHHHHHhCCEEEEecccc
Confidence 357888888887655 6778888889999988876543
No 287
>cd04879 ACT_3PGDH-like ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH). ACT_3PGDH-like: The ACT_3PGDH-like CD includes the C-terminal ACT (regulatory) domain of D-3-phosphoglycerate dehydrogenase (3PGDH), with or without an extended C-terminal (xct) region found in various bacteria, archaea, fungi, and plants. 3PGDH is an enzyme that belongs to the D-isomer specific, 2-hydroxyacid dehydrogenase family and catalyzes the oxidation of D-3-phosphoglycerate to 3- phosphohydroxypyruvate, which is the first step in the biosynthesis of L-serine, using NAD+ as the oxidizing agent. In bacteria, 3PGDH is feedback controlled by the end product L-serine in an allosteric manner. In the Escherichia coli homotetrameric enzyme, the interface at adjacent ACT (regulatory) domains couples to create an extended beta-sheet. Each regulatory interface forms two serine-binding sites. The mechanism by which serine transmits inhibition to the active
Probab=23.57 E-value=1.4e+02 Score=16.21 Aligned_cols=24 Identities=8% Similarity=0.249 Sum_probs=16.7
Q ss_pred CCCHHHHHHHhhhcC-ceEEEEEcc
Q psy7051 24 RTTPDDLRRVFERCG-EVGDIYIPR 47 (158)
Q Consensus 24 ~~~~~~l~~~f~~~G-~i~~v~~~~ 47 (158)
.-.-.+|-.+|.++| .|..+.+..
T Consensus 10 ~g~l~~i~~~l~~~~~nI~~~~~~~ 34 (71)
T cd04879 10 PGVIGKVGTILGEHGINIAAMQVGR 34 (71)
T ss_pred CCHHHHHHHHHHhcCCCeeeEEEec
Confidence 335667888888886 777776643
No 288
>PF12007 DUF3501: Protein of unknown function (DUF3501); InterPro: IPR021890 This family of proteins is functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are about 200 amino acids in length. The structure of protein of unknown function (YP_111841.1) from B. pseudomallei has been solved. ; PDB: 3FJV_B.
Probab=23.56 E-value=1.5e+02 Score=21.38 Aligned_cols=47 Identities=9% Similarity=0.141 Sum_probs=30.8
Q ss_pred CCHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 25 TTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 25 ~~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
..+++|.+.+..|.++. + ..+.-+-.-||++.+.++....|..|.|.
T Consensus 64 ~~~~~I~~Ei~aYnpLi----P---~~~~l~ATl~IE~~d~~~r~~~L~~L~Gi 110 (192)
T PF12007_consen 64 FDEEGIQEEIDAYNPLI----P---DGGNLKATLMIEIPDEDERRRELARLVGI 110 (192)
T ss_dssp -SHHHHHHHHHHHGGGS---------SSEEEEEEEE--SSHHHHHHHHHHCTTG
T ss_pred CCHHHHHHHHHHhcccC----C---CCCcEEEEEEEEcCCHHHHHHHHHHhcCc
Confidence 35566777777776642 1 12223446799999999999999999887
No 289
>COG0217 Uncharacterized conserved protein [Function unknown]
Probab=23.51 E-value=3.2e+02 Score=20.45 Aligned_cols=36 Identities=28% Similarity=0.445 Sum_probs=25.3
Q ss_pred CcEEEEeCCCCCC--CHHHHHHHhhhcC-ceE---EEEEccC
Q psy7051 13 MVSLKVDNLTYRT--TPDDLRRVFERCG-EVG---DIYIPRD 48 (158)
Q Consensus 13 ~~~l~V~nl~~~~--~~~~l~~~f~~~G-~i~---~v~~~~~ 48 (158)
..-|+|--|..+. |..+|+..|.+.| .+- .|.++.+
T Consensus 94 GvaiiVe~LTDN~NRTas~vR~~F~K~GG~lg~~GSV~~mF~ 135 (241)
T COG0217 94 GVAIIVEALTDNRNRTASNVRSAFNKNGGNLGEPGSVSYMFD 135 (241)
T ss_pred ceEEEEEeccCCcchhHHHHHHHHHhcCCccCCCceEEEEEe
Confidence 4568888887655 6789999999986 332 3555554
No 290
>smart00457 MACPF membrane-attack complex / perforin.
Probab=23.44 E-value=67 Score=22.69 Aligned_cols=28 Identities=11% Similarity=0.278 Sum_probs=22.3
Q ss_pred EeCCCCCCCHHHHHHHhhhcCc--eEEEEE
Q psy7051 18 VDNLTYRTTPDDLRRVFERCGE--VGDIYI 45 (158)
Q Consensus 18 V~nl~~~~~~~~l~~~f~~~G~--i~~v~~ 45 (158)
+.+||...+..+...||..||. |..+.+
T Consensus 30 l~~Lp~~~~~~~~~~fi~~yGTH~i~s~~~ 59 (194)
T smart00457 30 LRDLPDQYNRGAYARFIDKYGTHYITSATL 59 (194)
T ss_pred HHhCccccCHHHHHHHHHHhCCeEEEeeee
Confidence 4578888889999999999994 555554
No 291
>KOG1719|consensus
Probab=23.08 E-value=2.6e+02 Score=19.64 Aligned_cols=26 Identities=12% Similarity=0.160 Sum_probs=21.3
Q ss_pred cCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 63 FDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 63 ~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
.+.++.+.|++.++.....|+.+.|.
T Consensus 90 Ps~~~i~~aVeFi~k~asLGktvYVH 115 (183)
T KOG1719|consen 90 PSLENIQKAVEFIHKNASLGKTVYVH 115 (183)
T ss_pred CCHHHHHHHHHHHHhccccCCeEEEE
Confidence 46788999999988888888877774
No 292
>COG2088 SpoVG Uncharacterized protein, involved in the regulation of septum location [Cell envelope biogenesis, outer membrane]
Probab=22.98 E-value=1.3e+02 Score=18.78 Aligned_cols=27 Identities=15% Similarity=0.245 Sum_probs=19.9
Q ss_pred ceEEEEEccCCCCCCcccEEEEEEcCH
Q psy7051 39 EVGDIYIPRDRFTRESRGFAFVRFFDK 65 (158)
Q Consensus 39 ~i~~v~~~~~~~~~~~~g~afV~f~~~ 65 (158)
+|+.|.+.+-...|+.+.|+-|+|++.
T Consensus 2 ~iTdVRirkv~~dgrmkA~vsvT~D~e 28 (95)
T COG2088 2 EITDVRIRKVDTDGRMKAYVSVTLDNE 28 (95)
T ss_pred cceeEEEEEecCCCcEEEEEEEEecce
Confidence 467777766555688888899998763
No 293
>COG5594 Uncharacterized integral membrane protein [Function unknown]
Probab=22.93 E-value=1.1e+02 Score=27.02 Aligned_cols=30 Identities=17% Similarity=0.138 Sum_probs=22.3
Q ss_pred cccEEEEEEcCHHHHHHHHHHhCCcccCCc
Q psy7051 54 SRGFAFVRFFDKRDAEDALDAMDGRMLDGR 83 (158)
Q Consensus 54 ~~g~afV~f~~~~~a~~a~~~l~g~~i~g~ 83 (158)
..+.|||+|++...|+.|.+..-...+.+.
T Consensus 356 ~~~~~FItFkSq~~Aq~~aQ~~~~sr~~~~ 385 (827)
T COG5594 356 PTKSGFITFKSQASAQIAAQSQIYSRVLGK 385 (827)
T ss_pred ccccEEEEEehhHHHHHHHHhhhhhhhhcc
Confidence 356899999999999999876544444433
No 294
>PF02946 GTF2I: GTF2I-like repeat; InterPro: IPR004212 This region of sequence similarity is found up to six times in a variety of proteins including general transcription factor II-I (GTF2I). It has been suggested that this may be a DNA binding domain [, ].; PDB: 2E3L_A 2D99_A 2DN4_A 2D9B_A 2EJE_A 1Q60_A 2DZR_A 2DN5_A 2DZQ_A 2ED2_A.
Probab=22.80 E-value=1.9e+02 Score=17.49 Aligned_cols=30 Identities=13% Similarity=0.187 Sum_probs=17.6
Q ss_pred cEEEEeCCCCCCC--------HHHHHHHhhhcCceEEE
Q psy7051 14 VSLKVDNLTYRTT--------PDDLRRVFERCGEVGDI 43 (158)
Q Consensus 14 ~~l~V~nl~~~~~--------~~~l~~~f~~~G~i~~v 43 (158)
..|+|.+||..+. -..|+.+++.-..|..+
T Consensus 35 ~~v~V~GLPegi~fr~P~~Y~i~~L~~IL~~~~~I~Fv 72 (76)
T PF02946_consen 35 EAVYVQGLPEGIPFRRPSNYGIPRLEKILEASSRIRFV 72 (76)
T ss_dssp TTEEEES--TT--SS-TTTS-HHHHHHHHHTTTT-EEE
T ss_pred CcEEEEeCCCCCcCCCCCcCCHHHHHHHHHccCCcEEE
Confidence 3589999988763 45677777776666544
No 295
>KOG2836|consensus
Probab=22.71 E-value=2.1e+02 Score=19.53 Aligned_cols=32 Identities=6% Similarity=0.139 Sum_probs=25.1
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhhcCceEEEEE
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYI 45 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~~G~i~~v~~ 45 (158)
....|+.-|.++|-+...+.+.+||.-+-|.+
T Consensus 17 MrFLIThnPtnaTln~fieELkKygvttvVRV 48 (173)
T KOG2836|consen 17 MRFLITHNPTNATLNKFIEELKKYGVTTVVRV 48 (173)
T ss_pred eEEEEecCCCchhHHHHHHHHHhcCCeEEEEe
Confidence 45678888999999999999999995444443
No 296
>PLN00110 flavonoid 3',5'-hydroxylase (F3'5'H); Provisional
Probab=22.65 E-value=2.9e+02 Score=22.68 Aligned_cols=49 Identities=16% Similarity=0.074 Sum_probs=32.9
Q ss_pred EEeCCCCCC--CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHH
Q psy7051 17 KVDNLTYRT--TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 17 ~V~nl~~~~--~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
++|||..-. ....+.++..+||+|..+.+.. .-+|...+++.++.++..
T Consensus 41 l~G~l~~~~~~~~~~~~~~~~~yG~i~~~~~g~---------~~~vvv~dpe~~~~vl~~ 91 (504)
T PLN00110 41 LLGALPLLGNMPHVALAKMAKRYGPVMFLKMGT---------NSMVVASTPEAARAFLKT 91 (504)
T ss_pred eeechhhcCCchHHHHHHHHHHhCCeEEEEcCC---------ccEEEECCHHHHHHHHHh
Confidence 357764322 2456677778899987766521 246788889999988864
No 297
>smart00650 rADc Ribosomal RNA adenine dimethylases.
Probab=22.49 E-value=1.4e+02 Score=20.35 Aligned_cols=23 Identities=26% Similarity=0.288 Sum_probs=18.3
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhh
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFER 36 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~ 36 (158)
..+.|+|+|..++.+.|..++..
T Consensus 78 ~d~vi~n~Py~~~~~~i~~~l~~ 100 (169)
T smart00650 78 PYKVVGNLPYNISTPILFKLLEE 100 (169)
T ss_pred CCEEEECCCcccHHHHHHHHHhc
Confidence 34678999999988888888764
No 298
>PF06919 Phage_T4_Gp30_7: Phage Gp30.7 protein; InterPro: IPR009690 This family consists of several phage Gp30.7 proteins of 121 residues in length. Family members seem to be exclusively from the T4-like viruses. The function of this family is unknown.
Probab=22.35 E-value=1.5e+02 Score=19.14 Aligned_cols=42 Identities=17% Similarity=0.314 Sum_probs=25.2
Q ss_pred hcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEEeee
Q psy7051 36 RCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQMAR 91 (158)
Q Consensus 36 ~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~~a~ 91 (158)
+-|.+++|..... ..|+|+.|++--.+ ...+.|..+++.+..
T Consensus 28 ~NGtv~qI~~Y~~------pNYvf~~FEnG~tv--------sv~~~gs~~kI~~~D 69 (121)
T PF06919_consen 28 KNGTVAQIEQYMT------PNYVFMRFENGITV--------SVTYNGSIFKIGLDD 69 (121)
T ss_pred CCCcEEEEeeecC------CCEEEEEecCCCEE--------EEEecCcEEEEEecC
Confidence 4577777766554 35999999752110 123456667776654
No 299
>PF09702 Cas_Csa5: CRISPR-associated protein (Cas_Csa5); InterPro: IPR010157 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a minor family of Cas protein found in various species of Sulfolobus and Pyrococcus (all archaeal). It is found with two different CRISPR loci in Sulfolobus solfataricus.
Probab=22.35 E-value=1.2e+02 Score=19.44 Aligned_cols=21 Identities=10% Similarity=0.137 Sum_probs=14.4
Q ss_pred CcEEEEeCCCCCCCHHHHHHHhhh
Q psy7051 13 MVSLKVDNLTYRTTPDDLRRVFER 36 (158)
Q Consensus 13 ~~~l~V~nl~~~~~~~~l~~~f~~ 36 (158)
...++++.||. .+|+++|+..
T Consensus 64 g~~i~~g~lPt---~~eVe~Fl~~ 84 (105)
T PF09702_consen 64 GNYIIVGYLPT---DEEVEDFLDD 84 (105)
T ss_pred CCEEecCCCCC---hHHHHHHHHH
Confidence 36788999984 5666666643
No 300
>KOG0226|consensus
Probab=21.66 E-value=32 Score=25.90 Aligned_cols=74 Identities=19% Similarity=0.177 Sum_probs=48.0
Q ss_pred cEEEEeCCCCCCCHHH---HHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 14 VSLKVDNLTYRTTPDD---LRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~---l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
..++++++-..+..+- +...|+.|-.+...+++.+. .+...+++|+.|........+-..-++..+.-..|++.
T Consensus 97 f~p~~~~~g~~v~pep~lp~~~~f~~~p~L~ktk~v~~~-p~~~~~~~~~~~k~s~a~~k~~~~~~~Kki~~~~VR~a 173 (290)
T KOG0226|consen 97 FRPFQSNAGATVNPEPPLPLPVVFSEYPSLVKTKLVRDR-PQPIRPEAFESFKASDALLKAETEKEKKKIGKPPVRLA 173 (290)
T ss_pred ccccccccccccCCCCCCcchhhhccchhhhhhhhhhcC-CCccCcccccCcchhhhhhhhccccccccccCcceeec
Confidence 4566666655554443 26667777666666666553 45567899999987777777665556666666655543
No 301
>PF11181 YflT: Heat induced stress protein YflT
Probab=21.31 E-value=1.2e+02 Score=19.08 Aligned_cols=29 Identities=10% Similarity=0.193 Sum_probs=22.4
Q ss_pred EEEcCHHHHHHHHHHhCCcccCCceEEEE
Q psy7051 60 VRFFDKRDAEDALDAMDGRMLDGRELRVQ 88 (158)
Q Consensus 60 V~f~~~~~a~~a~~~l~g~~i~g~~i~v~ 88 (158)
-.|.+.++|..+|+.|...-+....|.|-
T Consensus 4 gv~~~~~E~~~~I~~L~~~Gy~~ddI~Vv 32 (103)
T PF11181_consen 4 GVYDNEEEALSAIEELKAQGYSEDDIYVV 32 (103)
T ss_pred EEECCHHHHHHHHHHHHHcCCCcccEEEE
Confidence 46889999999999888776666666654
No 302
>KOG3580|consensus
Probab=21.11 E-value=4.5e+02 Score=22.88 Aligned_cols=15 Identities=13% Similarity=0.093 Sum_probs=6.0
Q ss_pred EEEEEcCHHHHHHHH
Q psy7051 58 AFVRFFDKRDAEDAL 72 (158)
Q Consensus 58 afV~f~~~~~a~~a~ 72 (158)
+.|.=.+.+.+..|+
T Consensus 64 vMVNGvsMenv~haF 78 (1027)
T KOG3580|consen 64 VMVNGVSMENVLHAF 78 (1027)
T ss_pred EEEcCcchhhhHHHH
Confidence 333333344444443
No 303
>PLN02971 tryptophan N-hydroxylase
Probab=21.04 E-value=2.6e+02 Score=23.18 Aligned_cols=58 Identities=9% Similarity=0.070 Sum_probs=36.5
Q ss_pred EEeCCCCCC---C-HHHHHHHhhhcC-ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCCcccCCce
Q psy7051 17 KVDNLTYRT---T-PDDLRRVFERCG-EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDGRMLDGRE 84 (158)
Q Consensus 17 ~V~nl~~~~---~-~~~l~~~f~~~G-~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~i~g~~ 84 (158)
+||||..-. . ...+.+++.+|| .|..+.+- +.-+|...+++.++.++.. ++..+.++.
T Consensus 67 iiGnl~~l~~~~~~~~~l~~~~~~yg~~i~~~~~G---------~~~~vvv~dpe~ikevl~~-~~~~f~~rp 129 (543)
T PLN02971 67 IVGMIPAMLKNRPVFRWLHSLMKELNTEIACVRLG---------NTHVIPVTCPKIAREIFKQ-QDALFASRP 129 (543)
T ss_pred cccchHHhccCCcHhHHHHHHHHHhCCceEEEEcC---------CcceEEECCHHHHHHHHHh-cchhhcCCC
Confidence 468875321 1 345778889999 67666541 1247777889999998864 444444443
No 304
>PF13046 DUF3906: Protein of unknown function (DUF3906)
Probab=21.01 E-value=1e+02 Score=17.94 Aligned_cols=33 Identities=24% Similarity=0.368 Sum_probs=21.6
Q ss_pred CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEE
Q psy7051 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV 60 (158)
-+.+|+.+|-+--+|+.+.+...+.-. +|-|||
T Consensus 31 ~e~eler~fl~~P~v~e~~l~EKKri~--~G~gyV 63 (64)
T PF13046_consen 31 VEVELERHFLPLPEVKEVALYEKKRIR--KGAGYV 63 (64)
T ss_pred HHHHhhhhccCCCCceEEEEEEEEeee--CCceeE
Confidence 345677777777788888887664333 455665
No 305
>smart00738 NGN In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold. In Spt5p, this domain may confer affinity for Spt4p.Spt4p
Probab=20.72 E-value=82 Score=19.50 Aligned_cols=23 Identities=22% Similarity=0.277 Sum_probs=17.9
Q ss_pred cEEEEEEcCHHHHHHHHHHhCCc
Q psy7051 56 GFAFVRFFDKRDAEDALDAMDGR 78 (158)
Q Consensus 56 g~afV~f~~~~~a~~a~~~l~g~ 78 (158)
||.||++...+++..++..+.+.
T Consensus 60 GYvFv~~~~~~~~~~~i~~~~~v 82 (106)
T smart00738 60 GYIFVEADLEDEVWTAIRGTPGV 82 (106)
T ss_pred CEEEEEEEeCCcHHHHHhcCCCc
Confidence 89999998766667777766664
No 306
>PLN02968 Probable N-acetyl-gamma-glutamyl-phosphate reductase
Probab=20.66 E-value=1.7e+02 Score=23.34 Aligned_cols=32 Identities=13% Similarity=0.124 Sum_probs=23.8
Q ss_pred cEEEEeCCCCCCCHHHHHHHhhh-cCceEEEEEc
Q psy7051 14 VSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIP 46 (158)
Q Consensus 14 ~~l~V~nl~~~~~~~~l~~~f~~-~G~i~~v~~~ 46 (158)
.+|+| .|...++.++|+++|.. |..+..|.+.
T Consensus 279 ~tv~v-~~~~~~~~~~v~~~~~~~y~~~~fV~~~ 311 (381)
T PLN02968 279 STVYV-HYAPGVTAEDLHQHLKERYEGEEFVKVL 311 (381)
T ss_pred EEEEE-EeCCCCCHHHHHHHHHHhCCCCCEEEeC
Confidence 35665 45667899999999998 6777777663
No 307
>TIGR02118 conserved hypothetical protein. This model represents a small family of proteins of unknown function, each about 105 amino acids in length. Conserved sites in the multiple alignment include a pair of aromatic residues, a histidine, and an aspartate.
Probab=20.61 E-value=1.3e+02 Score=18.59 Aligned_cols=22 Identities=23% Similarity=0.283 Sum_probs=17.3
Q ss_pred ccEEEEEEcCHHHHHHHHHHhC
Q psy7051 55 RGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 55 ~g~afV~f~~~~~a~~a~~~l~ 76 (158)
-+++.+.|++.+++++|+..-.
T Consensus 53 d~i~el~Fds~e~~~~a~~sp~ 74 (100)
T TIGR02118 53 YGMCELYFDSIEDFQAAFDSPE 74 (100)
T ss_pred eEEEEEEECCHHHHHHHHcCHH
Confidence 4678889999999999885433
No 308
>PF01984 dsDNA_bind: Double-stranded DNA-binding domain; InterPro: IPR002836 This protein family is found in archaea and eukaryota. The human TFAR19 (TF-1 cell apoptosis-related protein 19) encodes a protein which shares significant homology to the corresponding proteins of species ranging from yeast to mice. TFAR19 exhibits a ubiquitous expression pattern and its expression is up-regulated in the tumour cells undergoing apoptosis. TFAR19 may play a general role in the apoptotic process []. Also included in this family is a DNA-binding protein from the archaea, Methanobacterium thermoautotrophicum.; GO: 0003677 DNA binding; PDB: 1EIJ_A 2K6B_A 2CRU_A 1YYB_A 2JXN_A 2FH0_A.
Probab=20.58 E-value=78 Score=20.38 Aligned_cols=21 Identities=19% Similarity=0.411 Sum_probs=16.0
Q ss_pred eCCCCCCCHHHHHHHhhhcCc
Q psy7051 19 DNLTYRTTPDDLRRVFERCGE 39 (158)
Q Consensus 19 ~nl~~~~~~~~l~~~f~~~G~ 39 (158)
|.|...+++++|..+|.....
T Consensus 75 G~l~~kI~d~~L~~iL~~i~~ 95 (107)
T PF01984_consen 75 GQLRGKIDDEQLKEILEQISE 95 (107)
T ss_dssp TSSSS-B-HHHHHHHHHHHCC
T ss_pred CCCCCCcCHHHHHHHHHHHhh
Confidence 678888999999999988754
No 309
>PF00067 p450: Cytochrome P450 p450 superfamily signature b-class p450 signature mitochondrial p450 signature E-class p450 group I signature E-class p450 group II signature E-class p450 group IV signature; InterPro: IPR001128 Cytochrome P450 enzymes are a superfamily of haem-containing mono-oxygenases that are found in all kingdoms of life, and which show extraordinary diversity in their reaction chemistry. In mammals, these proteins are found primarily in microsomes of hepatocytes and other cell types, where they oxidise steroids, fatty acids and xenobiotics, and are important for the detoxification and clearance of various compounds, as well as for hormone synthesis and breakdown, cholesterol synthesis and vitamin D metabolism. In plants, these proteins are important for the biosynthesis of several compounds such as hormones, defensive compounds and fatty acids. In bacteria, they are important for several metabolic processes, such as the biosynthesis of antibiotic erythromycin in Saccharopolyspora erythraea (Streptomyces erythraeus). Cytochrome P450 enzymes use haem to oxidise their substrates, using protons derived from NADH or NADPH to split the oxygen so a single atom can be added to a substrate. They also require electrons, which they receive from a variety of redox partners. In certain cases, cytochrome P450 can be fused to its redox partner to produce a bi-functional protein, such as with P450BM-3 from Bacillus megaterium [], which has haem and flavin domains. Organisms produce many different cytochrome P450 enzymes (at least 58 in humans), which together with alternative splicing can provide a wide array of enzymes with different substrate and tissue specificities. Individual cytochrome P450 proteins follow the nomenclature: CYP, followed by a number (family), then a letter (subfamily), and another number (protein); e.g. CYP3A4 is the fourth protein in family 3, subfamily A. In general, family members should share >40% identity, while subfamily members should share >55% identity. Cytochrome P450 proteins can also be grouped by two different schemes. One scheme was based on a taxonomic split: class I (prokaryotic/mitochondrial) and class II (eukaryotic microsomes). The other scheme was based on the number of components in the system: class B (3-components) and class E (2-components). These classes merge to a certain degree. Most prokaryotes and mitochondria (and fungal CYP55) have 3-component systems (class I/class B) - a FAD-containing flavoprotein (NAD(P)H-dependent reductase), an iron-sulphur protein and P450. Most eukaryotic microsomes have 2-component systems (class II/class E) - NADPH:P450 reductase (FAD and FMN-containing flavoprotein) and P450. There are exceptions to this scheme, such as 1-component systems that resemble class E enzymes [, , ]. The class E enzymes can be further subdivided into five sequence clusters, groups I-V, each of which may contain more than one cytochrome P450 family (eg, CYP1 and CYP2 are both found in group I). The divergence of the cytochrome P450 superfamily into B- and E-classes, and further divergence into stable clusters within the E-class, appears to be very ancient, occurring before the appearance of eukaryotes. More information about these proteins can be found at Protein of the Month: Cytochrome P450 [].; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 0020037 heme binding, 0055114 oxidation-reduction process; PDB: 2RFC_B 2RFB_A 3EJB_H 3EJE_H 3EJD_H 1N6B_A 1NR6_A 1DT6_A 3EL3_A 3DBG_B ....
Probab=20.56 E-value=3.4e+02 Score=20.90 Aligned_cols=37 Identities=19% Similarity=0.258 Sum_probs=28.3
Q ss_pred HHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHH
Q psy7051 28 DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALD 73 (158)
Q Consensus 28 ~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~ 73 (158)
+.+.++..+||+|..+.+. +.-+|...+++.++.++.
T Consensus 24 ~~~~~~~~kyG~i~~~~~~---------~~~~vvv~~pe~~~~il~ 60 (463)
T PF00067_consen 24 EFFRELHKKYGPIFRIWPG---------GQPIVVVSDPELIKEILR 60 (463)
T ss_dssp HHHHHHHHHHTSEEEEEET---------TEEEEEEESHHHHHHHHT
T ss_pred HHHHHHHHHhCCEEEEeEe---------cccccccccchhhccccc
Confidence 3477788899999888763 346777788888888875
No 310
>PF07237 DUF1428: Protein of unknown function (DUF1428); InterPro: IPR009874 This family consists of several hypothetical bacterial and one archaeal sequence of around 120 residues in length. The function of this family is unknown.; PDB: 2OKQ_A.
Probab=20.46 E-value=2.5e+02 Score=18.03 Aligned_cols=48 Identities=19% Similarity=0.279 Sum_probs=29.4
Q ss_pred HHHHHhhhcCceEEEEE-----ccCC----------CCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 29 DLRRVFERCGEVGDIYI-----PRDR----------FTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 29 ~l~~~f~~~G~i~~v~~-----~~~~----------~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
....+|..||.+..+.. +..+ ..+...-|.+|+|.+.+...++.+++.
T Consensus 24 ~a~~vf~e~GAl~~vE~wgdDvp~G~~TsF~~Av~a~~~E~VVFSWi~wpska~rD~~~~k~m 86 (103)
T PF07237_consen 24 KAAEVFKEHGALRVVECWGDDVPDGKVTSFPRAVKAKPDETVVFSWIEWPSKATRDAANAKMM 86 (103)
T ss_dssp HHHHHHHHTT-SEEEEEEEEE----SS--HHHHTT--TTEEEEEEEEEESSHHHHHHHHHHHH
T ss_pred HHHHHHHHhCCEEEEEeecCcCCcCccCCHHHHhcCCCCCEEEEEEEEcCCHHHHHHHHHHhh
Confidence 45678899996544432 1111 223345689999999998888776543
No 311
>PRK09213 pur operon repressor; Provisional
Probab=20.41 E-value=3.2e+02 Score=20.79 Aligned_cols=41 Identities=12% Similarity=0.381 Sum_probs=29.5
Q ss_pred HHHHHhhhcC--ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhC
Q psy7051 29 DLRRVFERCG--EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMD 76 (158)
Q Consensus 29 ~l~~~f~~~G--~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 76 (158)
.|++.|++.| .|+.+ .|..-|.-||-+-+.++|+..++.|.
T Consensus 46 i~~~~~~~~~~g~~~t~-------~ga~ggv~~~p~~~~~~a~~~~~~L~ 88 (271)
T PRK09213 46 IIKETFEKQGIGTLETV-------PGAAGGVKYIPSISEEEAREFVEELC 88 (271)
T ss_pred HHHHHHHhcCCceEEEe-------CCCCCCeEEEcCCCHHHHHHHHHHHH
Confidence 4688898865 44333 45566788999988888888777665
No 312
>KOG2943|consensus
Probab=20.22 E-value=3e+02 Score=20.82 Aligned_cols=29 Identities=17% Similarity=0.409 Sum_probs=21.0
Q ss_pred CCCcccEEEEEEcC----HHHHHHHHHHhCCcc
Q psy7051 51 TRESRGFAFVRFFD----KRDAEDALDAMDGRM 79 (158)
Q Consensus 51 ~~~~~g~afV~f~~----~~~a~~a~~~l~g~~ 79 (158)
+...++||+|.|.- ....+.||+..|+..
T Consensus 205 id~~kg~griafaip~d~~~~l~e~iK~~n~~i 237 (299)
T KOG2943|consen 205 IDRAKGFGRIAFAIPTDDLPKLQEAIKSANGTI 237 (299)
T ss_pred ccccccceeEEEeccccccccHHHHHHHhcccc
Confidence 45568999999973 446788887776654
No 313
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=20.22 E-value=1.9e+02 Score=20.01 Aligned_cols=40 Identities=20% Similarity=0.327 Sum_probs=25.8
Q ss_pred CCCHHHHHHHhh----hcCceEEEEEccCCCCCCcccEEEEE--EcCHHHHHHHHHHh
Q psy7051 24 RTTPDDLRRVFE----RCGEVGDIYIPRDRFTRESRGFAFVR--FFDKRDAEDALDAM 75 (158)
Q Consensus 24 ~~~~~~l~~~f~----~~G~i~~v~~~~~~~~~~~~g~afV~--f~~~~~a~~a~~~l 75 (158)
.++.++|++++. ..|.+.++.+ |-|. |.+.++....|..+
T Consensus 102 ~IsveEIqDiVE~~L~~~~~~ay~rf------------a~~~~~f~~~~~~~~~~~~~ 147 (154)
T PRK00464 102 EVPSKEIGELVMEELKKLDEVAYVRF------------ASVYRSFKDVDDFEEEIEEL 147 (154)
T ss_pred CCCHHHHHHHHHHHHHhcCCEEEEEh------------hhhcCCCCCHHHHHHHHHHH
Confidence 466666665543 4566666654 4444 88888888888654
No 314
>PLN02936 epsilon-ring hydroxylase
Probab=20.07 E-value=1.1e+02 Score=24.84 Aligned_cols=40 Identities=20% Similarity=0.042 Sum_probs=28.5
Q ss_pred CHHHHHHHhhhcCceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHH
Q psy7051 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDA 74 (158)
Q Consensus 26 ~~~~l~~~f~~~G~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 74 (158)
....|.+++.+||+|..+.+. +.-.|.+.+++.++.++..
T Consensus 38 ~~~~~~~~~~~yG~i~~~~~g---------~~~~vvv~~pe~~~~il~~ 77 (489)
T PLN02936 38 LFLPLFKWMNEYGPVYRLAAG---------PRNFVVVSDPAIAKHVLRN 77 (489)
T ss_pred HHHHHHHHHHHcCCEEEEccC---------CccEEEEcCHHHHHHHHHh
Confidence 345677888899998766541 1346777888998888753
No 315
>TIGR01743 purR_Bsub pur operon repressor, Bacillus subtilis type. This model represents the puring operon repressor PurR of low-GC Gram-positive bacteria. This homodimeric repressor contains a large region homologous to phosphoribosyltransferases and is inhibited by 5-phosphoribosyl 1-pyrophosphate.
Probab=20.06 E-value=3.6e+02 Score=20.52 Aligned_cols=42 Identities=14% Similarity=0.379 Sum_probs=29.3
Q ss_pred HHHHHhhhcC--ceEEEEEccCCCCCCcccEEEEEEcCHHHHHHHHHHhCC
Q psy7051 29 DLRRVFERCG--EVGDIYIPRDRFTRESRGFAFVRFFDKRDAEDALDAMDG 77 (158)
Q Consensus 29 ~l~~~f~~~G--~i~~v~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g 77 (158)
.|++.|+++| .|+.+ .|..-|.-||-+-+.++|+..++.|..
T Consensus 44 i~~~~~~~~~~g~~~t~-------~ga~ggv~~~p~~~~~~~~~~~~~l~~ 87 (268)
T TIGR01743 44 IIKETFEKFGIGKLLTV-------PGAAGGVKYIPKMSQAEAEEFVEELCQ 87 (268)
T ss_pred HHHHHHHhcCCceEEEe-------CCCCCCeEEEeCCCHHHHHHHHHHHHH
Confidence 4688898865 44333 455667889988888888877766543
Done!