Query 021958
Match_columns 305
No_of_seqs 298 out of 2441
Neff 9.8
Searched_HMMs 46136
Date Fri Mar 29 06:56:38 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/021958.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/021958hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 TIGR01659 sex-lethal sex-letha 100.0 6.7E-36 1.5E-40 261.8 22.5 175 77-258 103-279 (346)
2 KOG0148 Apoptosis-promoting RN 100.0 6.3E-35 1.4E-39 234.7 18.2 172 79-259 60-243 (321)
3 TIGR01645 half-pint poly-U bin 100.0 1.2E-33 2.6E-38 259.9 22.1 176 79-257 105-287 (612)
4 TIGR01661 ELAV_HUD_SF ELAV/HuD 100.0 1.3E-32 2.8E-37 245.9 23.2 169 80-255 2-172 (352)
5 TIGR01661 ELAV_HUD_SF ELAV/HuD 100.0 1.8E-32 3.9E-37 245.0 23.5 176 80-256 88-351 (352)
6 TIGR01628 PABP-1234 polyadenyl 100.0 3.2E-32 6.8E-37 257.2 22.3 199 83-288 2-204 (562)
7 KOG0144 RNA-binding protein CU 100.0 4.7E-32 1E-36 231.1 15.9 178 76-258 29-210 (510)
8 TIGR01622 SF-CC1 splicing fact 100.0 2.1E-30 4.5E-35 239.4 21.6 172 78-253 86-265 (457)
9 KOG0117 Heterogeneous nuclear 100.0 9E-30 2E-34 218.1 21.0 195 79-286 81-283 (506)
10 TIGR01648 hnRNP-R-Q heterogene 100.0 8.1E-30 1.7E-34 234.4 22.2 195 79-287 56-254 (578)
11 KOG0131 Splicing factor 3b, su 100.0 1.2E-30 2.5E-35 198.6 11.8 176 77-258 5-181 (203)
12 TIGR01628 PABP-1234 polyadenyl 100.0 2.4E-29 5.2E-34 237.6 22.3 173 80-257 87-264 (562)
13 KOG0145 RNA-binding protein EL 100.0 5.9E-30 1.3E-34 205.1 15.2 173 78-257 38-212 (360)
14 TIGR01642 U2AF_lg U2 snRNP aux 100.0 2.9E-28 6.2E-33 228.1 25.5 170 76-255 170-376 (509)
15 KOG0145 RNA-binding protein EL 100.0 1.2E-28 2.7E-33 197.5 16.4 175 79-254 125-358 (360)
16 TIGR01649 hnRNP-L_PTB hnRNP-L/ 100.0 3.4E-27 7.3E-32 218.0 22.8 167 78-254 272-480 (481)
17 TIGR01648 hnRNP-R-Q heterogene 100.0 1.5E-27 3.2E-32 219.5 19.0 170 79-257 136-310 (578)
18 TIGR01649 hnRNP-L_PTB hnRNP-L/ 100.0 4.2E-27 9.1E-32 217.4 20.3 162 81-255 2-175 (481)
19 KOG0127 Nucleolar protein fibr 100.0 2E-27 4.4E-32 208.2 16.8 220 81-304 117-446 (678)
20 TIGR01642 U2AF_lg U2 snRNP aux 99.9 1.9E-26 4.1E-31 215.8 21.4 172 79-253 293-501 (509)
21 KOG0117 Heterogeneous nuclear 99.9 4.6E-27 1E-31 201.5 15.5 173 78-259 161-336 (506)
22 KOG0127 Nucleolar protein fibr 99.9 1.5E-26 3.3E-31 202.7 16.4 175 81-259 5-201 (678)
23 KOG0148 Apoptosis-promoting RN 99.9 3.6E-26 7.7E-31 184.4 14.8 178 78-293 3-191 (321)
24 KOG0146 RNA-binding protein ET 99.9 3E-26 6.4E-31 184.5 13.1 176 80-257 18-368 (371)
25 KOG0109 RNA-binding protein LA 99.9 1.2E-26 2.6E-31 188.9 10.7 152 82-258 3-154 (346)
26 KOG0124 Polypyrimidine tract-b 99.9 4.3E-26 9.3E-31 191.0 10.9 171 81-254 113-290 (544)
27 KOG0123 Polyadenylate-binding 99.9 9.2E-25 2E-29 192.6 18.2 198 82-300 2-201 (369)
28 KOG0147 Transcriptional coacti 99.9 4.5E-26 9.7E-31 200.6 8.9 209 77-289 175-399 (549)
29 TIGR01622 SF-CC1 splicing fact 99.9 3.1E-24 6.7E-29 198.3 20.1 166 81-253 186-447 (457)
30 KOG0144 RNA-binding protein CU 99.9 3.5E-25 7.6E-30 189.2 12.6 176 80-257 123-507 (510)
31 KOG4205 RNA-binding protein mu 99.9 3.1E-23 6.7E-28 176.8 11.4 176 79-259 4-181 (311)
32 TIGR01645 half-pint poly-U bin 99.9 1E-21 2.2E-26 181.4 19.7 82 79-162 202-283 (612)
33 KOG0110 RNA-binding protein (R 99.9 1.6E-22 3.6E-27 182.8 12.8 172 82-256 516-695 (725)
34 KOG0123 Polyadenylate-binding 99.9 4.8E-21 1E-25 169.1 14.8 168 83-257 78-249 (369)
35 KOG4206 Spliceosomal protein s 99.8 4.3E-19 9.3E-24 141.0 16.3 161 81-252 9-220 (221)
36 KOG1457 RNA binding protein (c 99.8 8.5E-19 1.8E-23 138.1 15.2 161 76-241 29-273 (284)
37 KOG0147 Transcriptional coacti 99.8 1.6E-19 3.4E-24 159.6 11.4 165 80-252 277-526 (549)
38 PLN03134 glycine-rich RNA-bind 99.8 6.6E-19 1.4E-23 135.9 12.8 87 170-257 31-117 (144)
39 TIGR01659 sex-lethal sex-letha 99.8 2E-18 4.2E-23 151.8 13.2 113 169-288 103-219 (346)
40 KOG1548 Transcription elongati 99.8 3.7E-17 8.1E-22 136.8 18.2 172 78-257 131-355 (382)
41 KOG0105 Alternative splicing f 99.8 2E-17 4.2E-22 126.8 14.4 151 79-242 4-176 (241)
42 KOG4211 Splicing factor hnRNP- 99.8 1.7E-17 3.8E-22 144.9 15.9 166 79-253 8-181 (510)
43 KOG4212 RNA-binding protein hn 99.8 5.1E-17 1.1E-21 139.7 16.6 168 79-251 42-291 (608)
44 PLN03134 glycine-rich RNA-bind 99.7 2.2E-17 4.9E-22 127.4 10.9 84 78-163 31-114 (144)
45 KOG0110 RNA-binding protein (R 99.7 3.4E-16 7.4E-21 142.3 16.0 165 79-252 383-596 (725)
46 KOG0106 Alternative splicing f 99.7 3.1E-17 6.8E-22 131.8 6.9 152 82-252 2-169 (216)
47 PF00076 RRM_1: RNA recognitio 99.7 2E-16 4.3E-21 107.2 9.8 70 176-247 1-70 (70)
48 KOG0122 Translation initiation 99.7 2.6E-16 5.7E-21 125.9 9.6 83 171-254 187-269 (270)
49 KOG0124 Polypyrimidine tract-b 99.7 1.5E-15 3.2E-20 128.1 13.5 172 77-251 206-532 (544)
50 KOG1190 Polypyrimidine tract-b 99.7 2.9E-15 6.3E-20 128.0 14.5 161 81-253 297-490 (492)
51 KOG0126 Predicted RNA-binding 99.6 2.2E-17 4.8E-22 126.2 0.8 109 140-257 10-118 (219)
52 KOG0149 Predicted RNA-binding 99.6 3.5E-16 7.6E-21 124.8 7.3 81 171-253 10-90 (247)
53 KOG0125 Ataxin 2-binding prote 99.6 1E-15 2.2E-20 127.4 10.2 89 168-259 91-179 (376)
54 PF14259 RRM_6: RNA recognitio 99.6 3.2E-15 6.9E-20 101.3 9.7 70 176-247 1-70 (70)
55 KOG0121 Nuclear cap-binding pr 99.6 9.8E-16 2.1E-20 110.5 7.2 81 172-253 35-115 (153)
56 KOG0122 Translation initiation 99.6 1.6E-15 3.4E-20 121.5 9.1 85 77-163 185-269 (270)
57 KOG1190 Polypyrimidine tract-b 99.6 2E-14 4.3E-19 123.0 16.2 166 79-255 148-374 (492)
58 KOG0121 Nuclear cap-binding pr 99.6 1.1E-15 2.4E-20 110.3 7.2 82 78-161 33-114 (153)
59 KOG4212 RNA-binding protein hn 99.6 1.9E-14 4.2E-19 124.0 15.5 74 172-251 535-608 (608)
60 PF00076 RRM_1: RNA recognitio 99.6 3E-15 6.6E-20 101.3 8.4 70 84-156 1-70 (70)
61 KOG0107 Alternative splicing f 99.6 2.3E-15 5E-20 114.7 7.7 79 172-256 9-87 (195)
62 KOG0131 Splicing factor 3b, su 99.6 2.6E-15 5.7E-20 115.1 7.7 118 171-294 7-124 (203)
63 COG0724 RNA-binding proteins ( 99.6 1.7E-14 3.7E-19 124.3 13.8 151 81-234 115-285 (306)
64 KOG0113 U1 small nuclear ribon 99.6 6.1E-15 1.3E-19 121.4 10.0 89 168-257 96-184 (335)
65 KOG0120 Splicing factor U2AF, 99.6 1.2E-14 2.7E-19 130.3 12.8 174 77-253 285-491 (500)
66 KOG0125 Ataxin 2-binding prote 99.6 1.5E-14 3.2E-19 120.6 11.8 86 77-166 92-177 (376)
67 KOG0149 Predicted RNA-binding 99.6 3.7E-15 8E-20 119.0 7.5 82 78-162 9-90 (247)
68 PLN03120 nucleic acid binding 99.6 1.2E-14 2.7E-19 119.9 10.2 78 173-255 4-81 (260)
69 KOG0114 Predicted RNA-binding 99.6 3.4E-14 7.3E-19 98.8 9.0 85 168-256 13-97 (124)
70 KOG0130 RNA-binding protein RB 99.5 9E-15 2E-19 106.5 5.9 87 169-256 68-154 (170)
71 KOG0129 Predicted RNA-binding 99.5 2.5E-13 5.3E-18 120.0 15.5 198 77-279 255-483 (520)
72 KOG4207 Predicted splicing fac 99.5 1.2E-14 2.6E-19 113.6 6.4 84 172-256 12-95 (256)
73 PF14259 RRM_6: RNA recognitio 99.5 3.7E-14 7.9E-19 96.1 8.0 69 84-155 1-69 (70)
74 PLN03213 repressor of silencin 99.5 3.7E-14 8.1E-19 124.1 9.5 82 169-255 6-89 (759)
75 KOG0107 Alternative splicing f 99.5 2.8E-14 6E-19 108.8 7.4 78 79-163 8-85 (195)
76 smart00362 RRM_2 RNA recogniti 99.5 1.4E-13 2.9E-18 93.1 9.9 72 175-249 1-72 (72)
77 KOG0120 Splicing factor U2AF, 99.5 5.5E-14 1.2E-18 126.1 9.3 171 77-257 171-372 (500)
78 KOG0108 mRNA cleavage and poly 99.5 3.7E-14 8.1E-19 126.5 8.1 83 174-257 19-101 (435)
79 smart00360 RRM RNA recognition 99.5 1.3E-13 2.8E-18 92.9 9.0 71 178-249 1-71 (71)
80 KOG0226 RNA-binding proteins [ 99.5 3.3E-13 7.1E-18 108.9 12.5 171 81-255 96-271 (290)
81 KOG0111 Cyclophilin-type pepti 99.5 1.4E-14 3.1E-19 114.2 4.5 86 172-258 9-94 (298)
82 PLN03120 nucleic acid binding 99.5 1.2E-13 2.6E-18 114.1 9.6 76 81-162 4-79 (260)
83 KOG0108 mRNA cleavage and poly 99.5 3.5E-14 7.5E-19 126.7 6.1 80 82-163 19-98 (435)
84 KOG1365 RNA-binding protein Fu 99.5 2E-13 4.4E-18 116.0 10.2 168 79-252 159-360 (508)
85 KOG0113 U1 small nuclear ribon 99.5 2.1E-13 4.6E-18 112.3 9.6 85 77-161 97-181 (335)
86 KOG1456 Heterogeneous nuclear 99.5 9.3E-13 2E-17 111.6 13.4 167 79-257 29-202 (494)
87 KOG0126 Predicted RNA-binding 99.5 7.3E-15 1.6E-19 112.5 0.7 82 79-162 33-114 (219)
88 KOG1456 Heterogeneous nuclear 99.5 8.1E-12 1.8E-16 106.0 18.9 202 87-299 128-414 (494)
89 PLN03121 nucleic acid binding 99.5 4.5E-13 9.8E-18 108.9 10.1 77 172-253 4-80 (243)
90 KOG4207 Predicted splicing fac 99.4 1.8E-13 3.8E-18 107.1 6.5 85 77-163 9-93 (256)
91 cd00590 RRM RRM (RNA recogniti 99.4 1.4E-12 2.9E-17 88.7 10.2 74 175-250 1-74 (74)
92 PF13893 RRM_5: RNA recognitio 99.4 1.1E-12 2.3E-17 84.6 7.7 56 190-251 1-56 (56)
93 PLN03213 repressor of silencin 99.4 8.6E-13 1.9E-17 115.7 9.0 79 79-163 8-88 (759)
94 KOG0105 Alternative splicing f 99.4 1.4E-12 3.1E-17 100.3 9.1 80 172-255 5-84 (241)
95 COG0724 RNA-binding proteins ( 99.4 1.3E-12 2.9E-17 112.4 10.2 79 173-252 115-193 (306)
96 PLN03121 nucleic acid binding 99.4 1.7E-12 3.6E-17 105.6 9.6 76 80-161 4-79 (243)
97 smart00362 RRM_2 RNA recogniti 99.4 2.6E-12 5.6E-17 86.7 8.6 71 83-157 1-71 (72)
98 KOG0114 Predicted RNA-binding 99.4 2.8E-12 6E-17 89.3 8.5 81 76-161 13-93 (124)
99 KOG0111 Cyclophilin-type pepti 99.4 5.4E-13 1.2E-17 105.3 5.1 86 79-166 8-93 (298)
100 KOG0130 RNA-binding protein RB 99.4 1.2E-12 2.7E-17 95.4 6.5 83 79-163 70-152 (170)
101 smart00361 RRM_1 RNA recogniti 99.4 4E-12 8.7E-17 85.8 8.5 63 187-249 2-70 (70)
102 KOG0109 RNA-binding protein LA 99.4 1.2E-12 2.5E-17 107.7 6.4 75 174-257 3-77 (346)
103 smart00360 RRM RNA recognition 99.4 4.2E-12 9.2E-17 85.4 8.3 70 86-157 1-70 (71)
104 KOG0146 RNA-binding protein ET 99.3 2.8E-12 6.2E-17 104.2 6.9 99 153-257 3-104 (371)
105 cd00590 RRM RRM (RNA recogniti 99.3 2.6E-11 5.5E-16 82.3 9.3 73 83-158 1-73 (74)
106 KOG0415 Predicted peptidyl pro 99.3 5.3E-12 1.1E-16 106.4 6.7 84 171-255 237-320 (479)
107 KOG4454 RNA binding protein (R 99.3 8.7E-13 1.9E-17 104.2 1.3 141 78-241 6-150 (267)
108 KOG4205 RNA-binding protein mu 99.3 1.1E-11 2.5E-16 106.2 8.1 122 172-295 5-126 (311)
109 KOG4210 Nuclear localization s 99.2 1.1E-11 2.4E-16 105.9 6.1 175 79-257 86-267 (285)
110 KOG4206 Spliceosomal protein s 99.2 4.2E-11 9E-16 95.7 8.5 85 171-259 7-95 (221)
111 KOG4208 Nucleolar RNA-binding 99.2 4E-11 8.7E-16 94.3 8.2 86 168-254 44-130 (214)
112 smart00361 RRM_1 RNA recogniti 99.2 6.6E-11 1.4E-15 79.9 7.4 61 95-157 2-69 (70)
113 KOG4660 Protein Mei2, essentia 99.2 8.3E-11 1.8E-15 105.1 9.7 70 76-150 70-139 (549)
114 KOG0112 Large RNA-binding prot 99.2 1.5E-11 3.2E-16 115.0 5.0 167 77-259 368-536 (975)
115 KOG0132 RNA polymerase II C-te 99.2 6.2E-11 1.4E-15 109.3 7.4 81 171-258 419-499 (894)
116 KOG4211 Splicing factor hnRNP- 99.2 6.6E-10 1.4E-14 98.0 13.0 164 79-250 101-354 (510)
117 PF13893 RRM_5: RNA recognitio 99.1 1.2E-10 2.7E-15 74.9 6.3 56 98-160 1-56 (56)
118 KOG4307 RNA binding protein RB 99.1 1.2E-09 2.7E-14 99.7 14.8 169 80-254 310-514 (944)
119 KOG0153 Predicted RNA-binding 99.1 2.3E-10 5.1E-15 96.5 9.1 80 167-253 222-302 (377)
120 KOG0128 RNA-binding protein SA 99.1 1.1E-11 2.4E-16 115.4 -1.1 150 79-253 665-814 (881)
121 KOG4307 RNA binding protein RB 99.1 1.6E-09 3.4E-14 99.1 12.4 76 174-250 868-943 (944)
122 KOG4208 Nucleolar RNA-binding 99.1 4.2E-10 9.2E-15 88.6 7.6 81 79-161 47-128 (214)
123 KOG0132 RNA polymerase II C-te 99.1 2.9E-10 6.4E-15 105.0 7.6 107 81-195 421-528 (894)
124 KOG4661 Hsp27-ERE-TATA-binding 99.0 7.8E-10 1.7E-14 98.9 8.1 82 171-253 403-484 (940)
125 KOG0415 Predicted peptidyl pro 99.0 7E-10 1.5E-14 93.8 6.3 82 79-162 237-318 (479)
126 KOG1365 RNA-binding protein Fu 99.0 3.7E-09 7.9E-14 90.4 9.9 170 78-252 57-241 (508)
127 KOG0153 Predicted RNA-binding 98.9 6.9E-09 1.5E-13 87.8 8.3 82 74-162 221-302 (377)
128 KOG1548 Transcription elongati 98.9 2.4E-08 5.3E-13 84.5 10.5 85 171-257 132-224 (382)
129 KOG0533 RRM motif-containing p 98.8 1.6E-08 3.5E-13 83.6 8.3 85 169-255 79-163 (243)
130 PF04059 RRM_2: RNA recognitio 98.8 4.7E-08 1E-12 69.3 9.3 81 174-254 2-87 (97)
131 KOG1457 RNA binding protein (c 98.8 7E-08 1.5E-12 77.0 9.9 86 171-257 32-121 (284)
132 KOG4661 Hsp27-ERE-TATA-binding 98.7 3E-08 6.4E-13 89.0 8.0 83 79-163 403-485 (940)
133 KOG0226 RNA-binding proteins [ 98.7 1.1E-08 2.4E-13 83.1 4.8 82 78-161 187-268 (290)
134 KOG0116 RasGAP SH3 binding pro 98.7 3.5E-08 7.7E-13 88.1 8.0 81 173-255 288-368 (419)
135 KOG4660 Protein Mei2, essentia 98.7 9.2E-09 2E-13 92.2 4.3 72 170-247 72-143 (549)
136 KOG4209 Splicing factor RNPS1, 98.7 1.6E-08 3.4E-13 83.8 5.3 85 168-254 96-180 (231)
137 KOG2193 IGF-II mRNA-binding pr 98.7 1.6E-09 3.6E-14 93.6 -1.2 155 82-254 2-157 (584)
138 PF04059 RRM_2: RNA recognitio 98.7 1.9E-07 4.2E-12 66.2 9.2 81 82-162 2-86 (97)
139 KOG0533 RRM motif-containing p 98.6 1.3E-07 2.8E-12 78.3 8.2 82 77-161 79-160 (243)
140 KOG0106 Alternative splicing f 98.6 4E-08 8.6E-13 79.5 4.5 72 174-254 2-73 (216)
141 KOG4209 Splicing factor RNPS1, 98.5 1.4E-07 3E-12 78.3 5.7 83 77-162 97-179 (231)
142 KOG4454 RNA binding protein (R 98.5 3.3E-08 7.1E-13 78.6 1.9 80 171-253 7-86 (267)
143 KOG0116 RasGAP SH3 binding pro 98.5 6.1E-07 1.3E-11 80.3 9.6 79 79-160 286-364 (419)
144 KOG0151 Predicted splicing reg 98.5 2.7E-07 5.9E-12 85.0 7.1 83 170-253 171-256 (877)
145 KOG0151 Predicted splicing reg 98.5 2.3E-07 5.1E-12 85.4 6.4 83 78-162 171-256 (877)
146 KOG4676 Splicing factor, argin 98.4 6.9E-08 1.5E-12 83.0 1.8 164 82-253 8-225 (479)
147 PF11608 Limkain-b1: Limkain b 98.4 1.5E-06 3.4E-11 58.7 7.7 72 174-255 3-78 (90)
148 KOG1924 RhoA GTPase effector D 98.2 3.4E-06 7.4E-11 78.8 7.8 7 230-236 805-811 (1102)
149 KOG0115 RNA-binding protein p5 98.2 4.3E-06 9.2E-11 68.4 6.6 103 134-251 5-111 (275)
150 KOG1995 Conserved Zn-finger pr 98.2 2.2E-06 4.9E-11 73.4 4.8 86 170-256 63-156 (351)
151 PF08777 RRM_3: RNA binding mo 98.1 5E-06 1.1E-10 60.4 5.4 59 174-239 2-60 (105)
152 KOG1855 Predicted RNA-binding 98.1 3.9E-06 8.4E-11 73.3 5.4 80 168-248 226-318 (484)
153 COG5175 MOT2 Transcriptional r 98.1 9.1E-06 2E-10 68.9 6.4 83 173-256 114-205 (480)
154 KOG4849 mRNA cleavage factor I 97.9 1.3E-05 2.9E-10 68.2 5.1 75 77-151 76-152 (498)
155 PF11608 Limkain-b1: Limkain b 97.9 8E-05 1.7E-09 50.5 7.5 69 82-162 3-76 (90)
156 COG5175 MOT2 Transcriptional r 97.9 2.1E-05 4.5E-10 66.8 4.9 112 81-194 114-241 (480)
157 KOG4849 mRNA cleavage factor I 97.9 1.5E-05 3.2E-10 67.9 4.0 80 171-251 78-159 (498)
158 KOG0128 RNA-binding protein SA 97.9 1.3E-06 2.8E-11 82.3 -2.7 158 81-242 571-735 (881)
159 KOG1996 mRNA splicing factor [ 97.8 6.1E-05 1.3E-09 62.8 6.9 66 187-253 300-366 (378)
160 KOG1995 Conserved Zn-finger pr 97.8 2.8E-05 6.1E-10 66.8 4.3 84 78-163 63-154 (351)
161 KOG2314 Translation initiation 97.7 0.00011 2.5E-09 66.5 6.9 79 171-251 56-141 (698)
162 PF08777 RRM_3: RNA binding mo 97.7 0.00011 2.5E-09 53.3 5.7 58 82-145 2-59 (105)
163 PF05172 Nup35_RRM: Nup53/35/4 97.6 0.00036 7.8E-09 49.9 7.6 78 173-253 6-91 (100)
164 PF14605 Nup35_RRM_2: Nup53/35 97.6 0.0002 4.3E-09 45.0 5.1 52 174-233 2-53 (53)
165 KOG4210 Nuclear localization s 97.6 7.2E-05 1.6E-09 64.3 3.9 82 79-163 182-264 (285)
166 KOG2202 U2 snRNP splicing fact 97.5 5.4E-05 1.2E-09 62.2 1.7 66 188-254 83-148 (260)
167 KOG3152 TBP-binding protein, a 97.4 0.00014 3E-09 59.7 3.7 73 172-245 73-157 (278)
168 KOG2416 Acinus (induces apopto 97.3 0.00014 3E-09 66.3 3.1 80 170-255 441-523 (718)
169 KOG0129 Predicted RNA-binding 97.3 0.0022 4.8E-08 57.9 10.5 65 171-237 257-327 (520)
170 PF14605 Nup35_RRM_2: Nup53/35 97.3 0.00055 1.2E-08 43.0 4.8 52 82-140 2-53 (53)
171 KOG2314 Translation initiation 97.3 0.00094 2E-08 60.8 7.7 78 78-157 55-138 (698)
172 KOG3152 TBP-binding protein, a 97.3 0.00026 5.7E-09 58.1 3.8 71 80-150 73-155 (278)
173 PF08952 DUF1866: Domain of un 97.1 0.004 8.6E-08 47.4 8.0 79 170-258 24-111 (146)
174 KOG1855 Predicted RNA-binding 97.1 0.00069 1.5E-08 59.6 4.4 69 78-146 228-309 (484)
175 KOG2253 U1 snRNP complex, subu 97.0 0.00018 3.8E-09 66.6 0.1 144 81-235 40-195 (668)
176 PF08675 RNA_bind: RNA binding 97.0 0.0055 1.2E-07 41.7 7.0 56 173-238 9-64 (87)
177 KOG0112 Large RNA-binding prot 96.8 0.00049 1.1E-08 65.8 1.4 81 171-253 370-450 (975)
178 PF05172 Nup35_RRM: Nup53/35/4 96.8 0.0044 9.4E-08 44.4 5.7 79 79-160 4-89 (100)
179 PF08675 RNA_bind: RNA binding 96.8 0.0069 1.5E-07 41.2 6.2 54 83-144 10-63 (87)
180 PF10309 DUF2414: Protein of u 96.7 0.014 2.9E-07 37.7 6.7 53 82-143 6-62 (62)
181 PF07576 BRAP2: BRCA1-associat 96.6 0.029 6.4E-07 40.9 9.1 67 81-149 13-80 (110)
182 PF10309 DUF2414: Protein of u 96.6 0.016 3.4E-07 37.4 6.7 55 173-236 5-62 (62)
183 PF07576 BRAP2: BRCA1-associat 96.4 0.05 1.1E-06 39.7 9.5 69 173-243 13-81 (110)
184 KOG2193 IGF-II mRNA-binding pr 96.4 0.0021 4.6E-08 56.5 2.7 78 174-258 2-80 (584)
185 KOG2416 Acinus (induces apopto 96.4 0.003 6.5E-08 58.0 3.6 80 77-162 440-521 (718)
186 KOG2202 U2 snRNP splicing fact 96.4 0.0019 4.1E-08 53.3 2.1 55 104-161 92-146 (260)
187 KOG1996 mRNA splicing factor [ 96.3 0.012 2.7E-07 49.4 6.3 65 95-161 300-365 (378)
188 PF15023 DUF4523: Protein of u 96.2 0.016 3.5E-07 43.5 5.9 75 170-253 83-161 (166)
189 KOG0115 RNA-binding protein p5 96.1 0.024 5.3E-07 46.9 7.1 61 82-143 32-92 (275)
190 PF04847 Calcipressin: Calcipr 96.0 0.019 4.1E-07 46.1 5.9 63 186-255 8-72 (184)
191 KOG2591 c-Mpl binding protein, 95.8 0.035 7.6E-07 50.8 7.2 97 132-247 146-245 (684)
192 KOG2068 MOT2 transcription fac 95.8 0.0051 1.1E-07 52.9 1.7 83 174-257 78-166 (327)
193 PF08952 DUF1866: Domain of un 95.7 0.026 5.7E-07 43.0 5.3 54 97-161 52-105 (146)
194 KOG4676 Splicing factor, argin 95.7 0.015 3.3E-07 50.9 4.4 76 174-251 8-86 (479)
195 KOG3671 Actin regulatory prote 95.6 0.15 3.2E-06 46.4 10.4 31 81-111 481-511 (569)
196 KOG2135 Proteins containing th 95.5 0.0063 1.4E-07 54.5 1.6 73 175-255 374-447 (526)
197 PF03467 Smg4_UPF3: Smg-4/UPF3 95.4 0.022 4.7E-07 45.6 4.2 83 172-255 6-99 (176)
198 PF11767 SET_assoc: Histone ly 95.2 0.17 3.7E-06 33.2 7.1 55 184-248 11-65 (66)
199 PF03467 Smg4_UPF3: Smg-4/UPF3 95.1 0.028 6.1E-07 44.9 3.8 84 79-162 5-97 (176)
200 PF10567 Nab6_mRNP_bdg: RNA-re 94.9 0.87 1.9E-05 38.8 12.2 157 80-238 14-213 (309)
201 KOG4285 Mitotic phosphoprotein 94.7 0.14 3.1E-06 43.4 7.1 75 172-255 196-271 (350)
202 PF15023 DUF4523: Protein of u 94.6 0.13 2.9E-06 38.8 6.1 73 79-161 84-160 (166)
203 KOG0804 Cytoplasmic Zn-finger 94.4 0.25 5.3E-06 44.4 8.3 70 172-243 73-142 (493)
204 PF07292 NID: Nmi/IFP 35 domai 94.4 0.092 2E-06 36.5 4.5 69 126-195 1-74 (88)
205 KOG4574 RNA-binding protein (c 94.2 0.17 3.6E-06 48.9 7.2 76 82-163 299-374 (1007)
206 KOG4574 RNA-binding protein (c 94.2 0.027 5.9E-07 54.1 2.0 73 177-256 302-376 (1007)
207 KOG2068 MOT2 transcription fac 94.0 0.035 7.6E-07 47.9 2.1 80 82-163 78-163 (327)
208 KOG2253 U1 snRNP complex, subu 93.9 0.068 1.5E-06 50.1 4.0 72 169-250 36-107 (668)
209 KOG0804 Cytoplasmic Zn-finger 93.0 0.4 8.7E-06 43.1 7.1 67 81-149 74-141 (493)
210 PF03880 DbpA: DbpA RNA bindin 92.8 0.52 1.1E-05 31.7 6.0 59 183-251 11-74 (74)
211 KOG2591 c-Mpl binding protein, 90.6 0.51 1.1E-05 43.6 5.2 57 79-142 173-231 (684)
212 KOG1923 Rac1 GTPase effector F 90.4 1.1 2.3E-05 43.3 7.2 15 216-230 604-618 (830)
213 KOG4285 Mitotic phosphoprotein 90.3 0.66 1.4E-05 39.6 5.3 60 83-150 199-258 (350)
214 KOG1923 Rac1 GTPase effector F 90.3 1.5 3.3E-05 42.2 8.2 10 82-91 386-395 (830)
215 KOG2318 Uncharacterized conser 89.3 1.4 3E-05 41.1 6.9 84 171-255 172-309 (650)
216 PF04847 Calcipressin: Calcipr 89.1 0.94 2E-05 36.4 5.1 62 94-163 8-71 (184)
217 PF03880 DbpA: DbpA RNA bindin 89.0 1.9 4.1E-05 28.9 5.9 58 91-159 11-73 (74)
218 KOG2135 Proteins containing th 88.0 0.26 5.6E-06 44.6 1.4 75 80-163 371-446 (526)
219 PF11767 SET_assoc: Histone ly 87.0 3 6.5E-05 27.3 5.6 49 91-148 10-58 (66)
220 PF13689 DUF4154: Domain of un 86.1 5.2 0.00011 30.7 7.6 89 186-286 1-89 (145)
221 PF10567 Nab6_mRNP_bdg: RNA-re 85.9 1.8 4E-05 36.9 5.2 84 169-253 11-107 (309)
222 KOG4410 5-formyltetrahydrofola 83.9 5.8 0.00013 33.7 7.3 48 173-227 330-378 (396)
223 COG5178 PRP8 U5 snRNP spliceos 81.7 1.5 3.2E-05 44.4 3.4 28 80-107 71-98 (2365)
224 PF14111 DUF4283: Domain of un 81.4 2.6 5.5E-05 32.5 4.2 116 83-207 17-138 (153)
225 PRK14548 50S ribosomal protein 81.0 7.1 0.00015 26.9 5.7 58 176-236 23-81 (84)
226 KOG4483 Uncharacterized conser 80.7 6.2 0.00013 35.2 6.5 62 73-141 383-445 (528)
227 KOG4483 Uncharacterized conser 80.5 10 0.00022 34.0 7.7 57 171-235 389-446 (528)
228 KOG4672 Uncharacterized conser 79.5 6.4 0.00014 35.2 6.3 10 186-195 471-480 (487)
229 KOG4019 Calcineurin-mediated s 79.1 2.2 4.7E-05 33.8 3.0 43 216-258 51-94 (193)
230 TIGR03636 L23_arch archaeal ri 78.6 10 0.00022 25.7 5.8 58 176-236 16-74 (77)
231 PF07530 PRE_C2HC: Associated 78.4 4.3 9.3E-05 26.8 3.8 65 188-255 2-66 (68)
232 KOG2318 Uncharacterized conser 78.3 11 0.00024 35.4 7.6 84 78-161 171-306 (650)
233 KOG2891 Surface glycoprotein [ 72.5 13 0.00027 31.6 6.0 129 125-254 77-268 (445)
234 KOG3424 40S ribosomal protein 66.1 25 0.00054 25.8 5.5 47 184-231 34-84 (132)
235 TIGR02542 B_forsyth_147 Bacter 62.3 24 0.00052 25.8 4.9 115 89-226 11-129 (145)
236 PRK14548 50S ribosomal protein 61.7 35 0.00076 23.6 5.5 58 83-143 22-81 (84)
237 KOG4410 5-formyltetrahydrofola 60.8 19 0.00042 30.7 4.9 48 81-134 330-378 (396)
238 COG5178 PRP8 U5 snRNP spliceos 58.8 10 0.00023 38.8 3.4 8 82-89 108-115 (2365)
239 PTZ00191 60S ribosomal protein 57.9 40 0.00086 25.9 5.7 59 175-236 83-142 (145)
240 PHA03378 EBNA-3B; Provisional 57.5 65 0.0014 31.3 8.1 23 81-103 806-830 (991)
241 TIGR03636 L23_arch archaeal ri 56.8 51 0.0011 22.3 5.6 58 83-143 15-74 (77)
242 PF03468 XS: XS domain; Inter 56.1 23 0.00051 26.1 4.2 40 93-135 29-68 (116)
243 PF03468 XS: XS domain; Inter 53.5 19 0.00041 26.6 3.3 50 175-228 10-68 (116)
244 KOG2295 C2H2 Zn-finger protein 52.4 2.8 6E-05 39.0 -1.4 70 80-149 230-299 (648)
245 KOG2295 C2H2 Zn-finger protein 49.0 2.7 5.8E-05 39.1 -2.1 72 172-244 230-301 (648)
246 PRK11901 hypothetical protein; 48.8 38 0.00082 29.7 4.9 61 172-238 244-306 (327)
247 KOG1830 Wiskott Aldrich syndro 48.4 1.2E+02 0.0025 27.8 7.8 11 241-251 492-502 (518)
248 PRK01178 rps24e 30S ribosomal 47.9 58 0.0013 23.3 5.0 47 184-231 30-80 (99)
249 PF11823 DUF3343: Protein of u 47.3 27 0.00059 23.1 3.1 30 217-246 2-31 (73)
250 PF07292 NID: Nmi/IFP 35 domai 44.2 13 0.00028 25.9 1.2 25 78-102 49-73 (88)
251 smart00596 PRE_C2HC PRE_C2HC d 44.2 47 0.001 21.9 3.7 64 188-255 2-66 (69)
252 PF02714 DUF221: Domain of unk 44.1 54 0.0012 28.8 5.4 56 126-195 1-56 (325)
253 PF15513 DUF4651: Domain of un 43.9 62 0.0013 20.9 4.1 19 188-207 9-27 (62)
254 PF02714 DUF221: Domain of unk 42.5 29 0.00062 30.5 3.4 37 219-257 1-37 (325)
255 PF12091 DUF3567: Protein of u 41.6 24 0.00051 24.3 2.1 56 123-200 8-76 (85)
256 KOG4213 RNA-binding protein La 41.3 39 0.00086 26.9 3.5 58 80-142 110-169 (205)
257 COG5193 LHP1 La protein, small 40.9 13 0.00028 33.4 0.9 63 79-141 172-244 (438)
258 KOG1985 Vesicle coat complex C 40.0 4.1E+02 0.0089 26.8 11.6 19 170-188 292-310 (887)
259 PTZ00071 40S ribosomal protein 39.2 1E+02 0.0022 23.3 5.3 47 184-231 35-86 (132)
260 KOG2893 Zn finger protein [Gen 38.9 2.3E+02 0.0051 23.7 8.6 14 181-194 318-331 (341)
261 KOG4365 Uncharacterized conser 37.7 6 0.00013 35.8 -1.5 78 174-253 4-81 (572)
262 PRK10905 cell division protein 37.5 84 0.0018 27.6 5.2 60 172-237 246-307 (328)
263 COG5638 Uncharacterized conser 35.8 1.9E+02 0.0042 26.3 7.3 142 77-251 142-295 (622)
264 COG5193 LHP1 La protein, small 35.5 26 0.00056 31.6 1.9 63 171-234 172-244 (438)
265 PF14111 DUF4283: Domain of un 35.0 41 0.00089 25.6 2.9 68 184-257 28-95 (153)
266 KOG1925 Rac1 GTPase effector F 34.5 40 0.00086 31.4 3.0 24 81-104 306-329 (817)
267 PF08734 GYD: GYD domain; Int 34.0 1.6E+02 0.0035 20.4 6.5 47 187-237 22-68 (91)
268 PF14893 PNMA: PNMA 33.9 25 0.00055 31.1 1.7 77 79-162 16-96 (331)
269 KOG0156 Cytochrome P450 CYP2 s 33.5 1.4E+02 0.003 28.2 6.5 66 170-246 29-97 (489)
270 PRK11901 hypothetical protein; 32.9 3.5E+02 0.0076 23.9 12.3 63 78-145 242-306 (327)
271 KOG4008 rRNA processing protei 32.8 41 0.0009 28.0 2.6 35 77-111 36-70 (261)
272 PF00403 HMA: Heavy-metal-asso 32.8 1.3E+02 0.0027 18.7 6.4 54 175-235 1-58 (62)
273 PF03439 Spt5-NGN: Early trans 32.8 1.1E+02 0.0024 20.9 4.5 27 215-241 43-69 (84)
274 KOG2675 Adenylate cyclase-asso 32.8 25 0.00054 32.0 1.4 14 219-232 453-466 (480)
275 cd00874 RNA_Cyclase_Class_II R 31.0 3.8E+02 0.0082 23.8 9.1 118 82-224 110-238 (326)
276 PF00564 PB1: PB1 domain; Int 29.8 1.6E+02 0.0036 19.6 5.0 63 179-248 16-79 (84)
277 PF01282 Ribosomal_S24e: Ribos 28.9 2E+02 0.0042 19.8 5.5 48 183-231 11-62 (84)
278 KOG1295 Nonsense-mediated deca 28.0 58 0.0012 29.2 2.8 70 80-149 6-78 (376)
279 PTZ00191 60S ribosomal protein 27.7 2.2E+02 0.0048 21.9 5.6 57 83-142 83-141 (145)
280 cd06405 PB1_Mekk2_3 The PB1 do 27.6 2E+02 0.0043 19.4 7.3 63 178-249 13-75 (79)
281 PF14893 PNMA: PNMA 27.5 62 0.0014 28.7 3.0 53 172-226 17-72 (331)
282 CHL00030 rpl23 ribosomal prote 26.8 2.3E+02 0.005 19.9 5.4 35 175-209 20-55 (93)
283 PF01690 PLRV_ORF5: Potato lea 26.3 64 0.0014 29.8 2.9 10 125-134 125-134 (465)
284 PLN02805 D-lactate dehydrogena 25.7 1.3E+02 0.0029 28.8 5.1 50 187-237 280-332 (555)
285 KOG1984 Vesicle coat complex C 25.3 3.7E+02 0.0079 27.4 7.8 12 242-253 357-368 (1007)
286 KOG1295 Nonsense-mediated deca 24.6 97 0.0021 27.8 3.6 68 173-241 7-77 (376)
287 KOG4008 rRNA processing protei 24.4 74 0.0016 26.6 2.6 35 170-205 37-71 (261)
288 PF04026 SpoVG: SpoVG; InterP 24.3 1.5E+02 0.0033 20.4 3.9 47 200-253 2-50 (84)
289 PRK10905 cell division protein 23.6 1.7E+02 0.0036 25.8 4.8 60 80-144 246-307 (328)
290 PRK12280 rplW 50S ribosomal pr 23.4 3E+02 0.0064 21.6 5.7 35 175-209 23-58 (158)
291 PF00276 Ribosomal_L23: Riboso 23.2 1.3E+02 0.0028 21.0 3.5 34 176-209 22-56 (91)
292 cd02568 PseudoU_synth_PUS1_PUS 22.9 2E+02 0.0043 24.3 5.1 115 185-302 22-154 (245)
293 COG5353 Uncharacterized protei 22.5 3.7E+02 0.008 20.8 6.6 56 81-136 87-155 (161)
294 PRK10629 EnvZ/OmpR regulon mod 22.5 3.4E+02 0.0073 20.3 7.7 59 185-252 50-109 (127)
295 smart00666 PB1 PB1 domain. Pho 22.4 2.4E+02 0.0052 18.6 7.1 57 176-237 12-69 (81)
296 PF11411 DNA_ligase_IV: DNA li 21.8 65 0.0014 18.2 1.3 16 91-106 19-34 (36)
297 TIGR03399 RNA_3prim_cycl RNA 3 21.8 5.7E+02 0.012 22.7 11.4 119 82-224 112-240 (326)
298 COG5470 Uncharacterized conser 21.5 2.4E+02 0.0051 20.0 4.3 21 120-140 50-70 (96)
299 COG5353 Uncharacterized protei 21.4 3.9E+02 0.0085 20.7 6.1 55 174-228 88-154 (161)
300 PHA01632 hypothetical protein 21.4 1E+02 0.0022 19.3 2.2 20 176-195 19-38 (64)
301 PRK11558 putative ssRNA endonu 21.2 2.2E+02 0.0048 20.2 4.2 49 173-225 27-75 (97)
302 PF14657 Integrase_AP2: AP2-li 21.0 1.9E+02 0.0042 17.0 3.6 30 208-237 9-38 (46)
303 PF09707 Cas_Cas2CT1978: CRISP 20.9 2.9E+02 0.0063 19.1 4.7 48 173-224 25-72 (86)
304 COG2004 RPS24A Ribosomal prote 20.8 3.4E+02 0.0073 19.7 5.1 48 183-231 30-81 (107)
305 KOG0862 Synaptobrevin/VAMP-lik 20.8 61 0.0013 26.6 1.5 31 188-226 89-119 (216)
306 PRK13259 regulatory protein Sp 20.7 1.9E+02 0.004 20.5 3.7 47 200-253 2-50 (94)
307 PF10281 Ish1: Putative stress 20.6 98 0.0021 17.4 2.0 18 184-202 3-20 (38)
308 PRK14950 DNA polymerase III su 20.4 5E+02 0.011 25.2 7.9 12 189-200 524-535 (585)
309 cd06408 PB1_NoxR The PB1 domai 20.1 3.1E+02 0.0068 19.0 5.1 55 176-236 13-67 (86)
No 1
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=100.00 E-value=6.7e-36 Score=261.79 Aligned_cols=175 Identities=25% Similarity=0.462 Sum_probs=159.2
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
.....++|||+|||.++|+++|+++|+.||.|++|+|++|+.+++++|||||+|.++++|++|++.|++..+ .++.|+
T Consensus 103 ~~~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~~tg~srGyaFVeF~~~e~A~~Ai~~LnG~~l--~gr~i~ 180 (346)
T TIGR01659 103 TNNSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDYKTGYSFGYAFVDFGSEADSQRAIKNLNGITV--RNKRLK 180 (346)
T ss_pred CCCCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCccCcEEEEEEccHHHHHHHHHHcCCCcc--CCceee
Confidence 445678999999999999999999999999999999999999999999999999999999999999999999 678999
Q ss_pred eeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 157 LNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 157 v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
|.++.... ......+|||+||+.++++++|+++|+ +||.|++++|+.|..+++++|||||+|.+.++|++|++.|
T Consensus 181 V~~a~p~~----~~~~~~~lfV~nLp~~vtee~L~~~F~-~fG~V~~v~i~~d~~tg~~kG~aFV~F~~~e~A~~Ai~~l 255 (346)
T TIGR01659 181 VSYARPGG----ESIKDTNLYVTNLPRTITDDQLDTIFG-KYGQIVQKNILRDKLTGTPRGVAFVRFNKREEAQEAISAL 255 (346)
T ss_pred eecccccc----cccccceeEEeCCCCcccHHHHHHHHH-hcCCEEEEEEeecCCCCccceEEEEEECCHHHHHHHHHHh
Confidence 99886432 223457899999999999999999999 9999999999999989999999999999999999999999
Q ss_pred CCceecc--eeeEEccCCCCCCCC
Q 021958 237 NGVFCST--RPMRIGPATNKKTVS 258 (305)
Q Consensus 237 ~~~~~~g--~~l~v~~~~~~~~~~ 258 (305)
|+..+.+ +.|+|.+++.+....
T Consensus 256 ng~~~~g~~~~l~V~~a~~~~~~~ 279 (346)
T TIGR01659 256 NNVIPEGGSQPLTVRLAEEHGKAK 279 (346)
T ss_pred CCCccCCCceeEEEEECCcccccc
Confidence 9998865 789999998765543
No 2
>KOG0148 consensus Apoptosis-promoting RNA-binding protein TIA-1/TIAR (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=100.00 E-value=6.3e-35 Score=234.74 Aligned_cols=172 Identities=29% Similarity=0.532 Sum_probs=159.8
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
.....|||+.|...++.++|++.|..||+|.++++++|..|+++|||+||.|.+.++|++||..|+|+.| ++|.||.+
T Consensus 60 ~~hfhvfvgdls~eI~~e~lr~aF~pFGevS~akvirD~~T~KsKGYgFVSf~~k~dAEnAI~~MnGqWl--G~R~IRTN 137 (321)
T KOG0148|consen 60 NQHFHVFVGDLSPEIDNEKLREAFAPFGEVSDAKVIRDMNTGKSKGYGFVSFPNKEDAENAIQQMNGQWL--GRRTIRTN 137 (321)
T ss_pred ccceeEEehhcchhcchHHHHHHhccccccccceEeecccCCcccceeEEeccchHHHHHHHHHhCCeee--ccceeecc
Confidence 3367899999999999999999999999999999999999999999999999999999999999999999 89999999
Q ss_pred ccCCCCCCC------------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCH
Q 021958 159 WASFGAGEK------------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDE 226 (305)
Q Consensus 159 ~~~~~~~~~------------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~ 226 (305)
|+.++..+. .....+++|||||+...++|++|++.|+ .||.|.+|+|..+ +|||||+|++.
T Consensus 138 WATRKp~e~n~~~ltfdeV~NQssp~NtsVY~G~I~~~lte~~mr~~Fs-~fG~I~EVRvFk~------qGYaFVrF~tk 210 (321)
T KOG0148|consen 138 WATRKPSEMNGKPLTFDEVYNQSSPDNTSVYVGNIASGLTEDLMRQTFS-PFGPIQEVRVFKD------QGYAFVRFETK 210 (321)
T ss_pred ccccCccccCCCCccHHHHhccCCCCCceEEeCCcCccccHHHHHHhcc-cCCcceEEEEecc------cceEEEEecch
Confidence 999876432 2335688999999999999999999999 9999999999988 89999999999
Q ss_pred HHHHHHHHHhCCceecceeeEEccCCCCCCCCC
Q 021958 227 SEQLRAMTEMNGVFCSTRPMRIGPATNKKTVSG 259 (305)
Q Consensus 227 ~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~~~ 259 (305)
|+|.+||..+|+.+|+|..++|.|.+.......
T Consensus 211 EaAahAIv~mNntei~G~~VkCsWGKe~~~~~~ 243 (321)
T KOG0148|consen 211 EAAAHAIVQMNNTEIGGQLVRCSWGKEGDDGIN 243 (321)
T ss_pred hhHHHHHHHhcCceeCceEEEEeccccCCCCCC
Confidence 999999999999999999999999998776644
No 3
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=100.00 E-value=1.2e-33 Score=259.93 Aligned_cols=176 Identities=18% Similarity=0.370 Sum_probs=157.8
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...++|||+||+.++++++|+++|+.||.|.+|+++.|+.+++++|||||+|.+.++|.+|++.|||..+ .|+.|++.
T Consensus 105 ~~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~~TgkskGfAFVeF~s~e~A~~Ai~~lnG~~i--~GR~IkV~ 182 (612)
T TIGR01645 105 AIMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQML--GGRNIKVG 182 (612)
T ss_pred cCCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecCCCCCcCCeEEEEeCcHHHHHHHHHhcCCeEE--ecceeeec
Confidence 4568999999999999999999999999999999999999999999999999999999999999999999 67888887
Q ss_pred ccCCCCCCC-------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHH
Q 021958 159 WASFGAGEK-------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 159 ~~~~~~~~~-------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~ 231 (305)
+........ ......++|||+||+.++++++|+++|+ .||.|.+++|..|..+++++|||||+|++.++|.+
T Consensus 183 rp~~~p~a~~~~~~~~~~~~~~~rLfVgnLp~~vteedLk~lFs-~FG~I~svrl~~D~~tgksKGfGFVeFe~~e~A~k 261 (612)
T TIGR01645 183 RPSNMPQAQPIIDMVQEEAKKFNRIYVASVHPDLSETDIKSVFE-AFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSE 261 (612)
T ss_pred ccccccccccccccccccccccceEEeecCCCCCCHHHHHHHHh-hcCCeeEEEEEecCCCCCcCCeEEEEECCHHHHHH
Confidence 654321111 1223457999999999999999999999 99999999999999899999999999999999999
Q ss_pred HHHHhCCceecceeeEEccCCCCCCC
Q 021958 232 AMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 232 a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
|++.||+..|+|+.|+|.++......
T Consensus 262 AI~amNg~elgGr~LrV~kAi~pP~~ 287 (612)
T TIGR01645 262 AIASMNLFDLGGQYLRVGKCVTPPDA 287 (612)
T ss_pred HHHHhCCCeeCCeEEEEEecCCCccc
Confidence 99999999999999999998865444
No 4
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=100.00 E-value=1.3e-32 Score=245.93 Aligned_cols=169 Identities=27% Similarity=0.494 Sum_probs=154.3
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
...+|||+|||.++++++|+++|+.||.|.+|++++|+.+|+++|||||+|.+.++|.+|++.|+|..+ .++.+++.|
T Consensus 2 ~~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~~~g~s~g~afV~f~~~~~A~~Ai~~l~g~~l--~g~~i~v~~ 79 (352)
T TIGR01661 2 SKTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDKVTGQSLGYGFVNYVRPEDAEKAVNSLNGLRL--QNKTIKVSY 79 (352)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcCCCCccceEEEEEECcHHHHHHHHhhcccEEE--CCeeEEEEe
Confidence 467999999999999999999999999999999999999999999999999999999999999999999 678899988
Q ss_pred cCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCc
Q 021958 160 ASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGV 239 (305)
Q Consensus 160 ~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~ 239 (305)
+... ......++|||+||+.++++++|+++|+ .||.|..+++..+..++.++|||||+|++.++|.+|++.|||.
T Consensus 80 a~~~----~~~~~~~~l~v~~l~~~~~~~~l~~~f~-~~G~i~~~~~~~~~~~~~~~g~~fv~f~~~~~A~~ai~~l~g~ 154 (352)
T TIGR01661 80 ARPS----SDSIKGANLYVSGLPKTMTQHELESIFS-PFGQIITSRILSDNVTGLSKGVGFIRFDKRDEADRAIKTLNGT 154 (352)
T ss_pred eccc----ccccccceEEECCccccCCHHHHHHHHh-ccCCEEEEEEEecCCCCCcCcEEEEEECCHHHHHHHHHHhCCC
Confidence 8643 2233467899999999999999999999 9999999999999888899999999999999999999999999
Q ss_pred eecc--eeeEEccCCCCC
Q 021958 240 FCST--RPMRIGPATNKK 255 (305)
Q Consensus 240 ~~~g--~~l~v~~~~~~~ 255 (305)
.+.| +.|+|.|++...
T Consensus 155 ~~~g~~~~i~v~~a~~~~ 172 (352)
T TIGR01661 155 TPSGCTEPITVKFANNPS 172 (352)
T ss_pred ccCCCceeEEEEECCCCC
Confidence 9876 678898886544
No 5
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=100.00 E-value=1.8e-32 Score=244.99 Aligned_cols=176 Identities=26% Similarity=0.440 Sum_probs=156.8
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
..++|||+||+..+++++|+++|+.||.|..+++..+..++.++|||||+|.+.++|.+|++.|+|..+.+....+++.|
T Consensus 88 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~~~~~~g~~fv~f~~~~~A~~ai~~l~g~~~~g~~~~i~v~~ 167 (352)
T TIGR01661 88 KGANLYVSGLPKTMTQHELESIFSPFGQIITSRILSDNVTGLSKGVGFIRFDKRDEADRAIKTLNGTTPSGCTEPITVKF 167 (352)
T ss_pred ccceEEECCccccCCHHHHHHHHhccCCEEEEEEEecCCCCCcCcEEEEEECCHHHHHHHHHHhCCCccCCCceeEEEEE
Confidence 45689999999999999999999999999999999998889999999999999999999999999999977778888888
Q ss_pred cCCCCCCC------------------------------------------------------------------------
Q 021958 160 ASFGAGEK------------------------------------------------------------------------ 167 (305)
Q Consensus 160 ~~~~~~~~------------------------------------------------------------------------ 167 (305)
+.......
T Consensus 168 a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (352)
T TIGR01661 168 ANNPSSSNSKGLLSQLEAVQNPQTTRVPLSTILTAAGIGPMHHAAARFRPSAGDFTAVLAHQQQQHAVAQQHAAQRASPP 247 (352)
T ss_pred CCCCCcCCchhcCchhhcccCcccCCCCccccccccCCCCccCcccccccCcchhhhhhhhhhhhcccccccccccCCCc
Confidence 75332000
Q ss_pred ----------------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHH
Q 021958 168 ----------------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 168 ----------------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~ 231 (305)
.....+.+|||+||+.++++++|+++|+ +||.|.+++|+.|..+|.++|||||+|.+.++|.+
T Consensus 248 ~~~~~~~~~~~~~~~~~~~~~~~~lfV~NL~~~~~e~~L~~~F~-~fG~v~~v~i~~d~~t~~skG~aFV~F~~~~~A~~ 326 (352)
T TIGR01661 248 ATDGQTAGLAAGAQIAASDGAGYCIFVYNLSPDTDETVLWQLFG-PFGAVQNVKIIRDLTTNQCKGYGFVSMTNYDEAAM 326 (352)
T ss_pred cccccccccccCCCCCCCCCCCcEEEEeCCCCCCCHHHHHHHHH-hCCCeEEEEEeEcCCCCCccceEEEEECCHHHHHH
Confidence 0011233699999999999999999999 99999999999999899999999999999999999
Q ss_pred HHHHhCCceecceeeEEccCCCCCC
Q 021958 232 AMTEMNGVFCSTRPMRIGPATNKKT 256 (305)
Q Consensus 232 a~~~l~~~~~~g~~l~v~~~~~~~~ 256 (305)
|+..|||..|+||.|+|+|+..+..
T Consensus 327 Ai~~lnG~~~~gr~i~V~~~~~~~~ 351 (352)
T TIGR01661 327 AILSLNGYTLGNRVLQVSFKTNKAY 351 (352)
T ss_pred HHHHhCCCEECCeEEEEEEccCCCC
Confidence 9999999999999999999988753
No 6
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=100.00 E-value=3.2e-32 Score=257.16 Aligned_cols=199 Identities=29% Similarity=0.440 Sum_probs=165.7
Q ss_pred eEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccCC
Q 021958 83 TLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWASF 162 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~~ 162 (305)
+|||+||+.++|+++|+++|++||.|.+|++.+|+.+++++|||||+|.+.++|++|++.+++..+ .++.|+|.|+..
T Consensus 2 sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~~t~~s~G~afV~F~~~~~A~~Al~~ln~~~i--~gk~i~i~~s~~ 79 (562)
T TIGR01628 2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDSVTRRSLGYGYVNFQNPADAERALETMNFKRL--GGKPIRIMWSQR 79 (562)
T ss_pred eEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHhCCCEE--CCeeEEeecccc
Confidence 799999999999999999999999999999999999999999999999999999999999999988 679999999865
Q ss_pred CCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceec
Q 021958 163 GAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCS 242 (305)
Q Consensus 163 ~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~ 242 (305)
.... ......+|||+||+.++++++|+++|+ +||.|.+|++..+. +|+++|||||+|.+.++|.+|++.|||..++
T Consensus 80 ~~~~--~~~~~~~vfV~nLp~~~~~~~L~~~F~-~~G~i~~~~i~~~~-~g~skg~afV~F~~~e~A~~Ai~~lng~~~~ 155 (562)
T TIGR01628 80 DPSL--RRSGVGNIFVKNLDKSVDNKALFDTFS-KFGNILSCKVATDE-NGKSRGYGFVHFEKEESAKAAIQKVNGMLLN 155 (562)
T ss_pred cccc--cccCCCceEEcCCCccCCHHHHHHHHH-hcCCcceeEeeecC-CCCcccEEEEEECCHHHHHHHHHHhcccEec
Confidence 4322 223456899999999999999999999 99999999999885 7889999999999999999999999999999
Q ss_pred ceeeEEccCCCCCCCCCCCCCCCcceEEeCCcc----hhhhHHHHhhccc
Q 021958 243 TRPMRIGPATNKKTVSGQQQYPKGTFLVVGTGF----CLQIKFKFKYFIG 288 (305)
Q Consensus 243 g~~l~v~~~~~~~~~~~~~~~~~~~~~~~g~~~----~~~~~~~~~~~~g 288 (305)
|+.|.|.....+...... .....+.+++++.. ...+.+.|..|+.
T Consensus 156 ~~~i~v~~~~~~~~~~~~-~~~~~~~l~V~nl~~~~tee~L~~~F~~fG~ 204 (562)
T TIGR01628 156 DKEVYVGRFIKKHEREAA-PLKKFTNLYVKNLDPSVNEDKLRELFAKFGE 204 (562)
T ss_pred CceEEEeccccccccccc-cccCCCeEEEeCCCCcCCHHHHHHHHHhcCC
Confidence 999999877665554321 22233445555422 2233455555543
No 7
>KOG0144 consensus RNA-binding protein CUGBP1/BRUNO (RRM superfamily) [RNA processing and modification]
Probab=100.00 E-value=4.7e-32 Score=231.06 Aligned_cols=178 Identities=22% Similarity=0.447 Sum_probs=159.7
Q ss_pred CCCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCC-CCCCCCCcc
Q 021958 76 GQPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNG-TPMPNGEQN 154 (305)
Q Consensus 76 ~~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g-~~~~~~~~~ 154 (305)
..+.+.-++|||-||..++|.||+++|++||.|.+|.+++|+.++.++|||||.|.+.++|.+|+..|+. +.+.|....
T Consensus 29 ~~d~~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk~t~~s~gcCFv~~~trk~a~~a~~Alhn~ktlpG~~~p 108 (510)
T KOG0144|consen 29 NPDGSAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDKSTGQSKGCCFVKYYTRKEADEAINALHNQKTLPGMHHP 108 (510)
T ss_pred CCCchhhhheeccCCccccHHHHHHHHHHhCceeEEEeecccccCcccceEEEEeccHHHHHHHHHHhhcccccCCCCcc
Confidence 3456778999999999999999999999999999999999999999999999999999999999999855 567788888
Q ss_pred eeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHH
Q 021958 155 FRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMT 234 (305)
Q Consensus 155 i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~ 234 (305)
|++.+++.+ +..-...++|||+-|+..++|.|++++|+ +||.|++|.|++|. .+.+||||||+|.+.|.|..|++
T Consensus 109 vqvk~Ad~E---~er~~~e~KLFvg~lsK~~te~evr~iFs-~fG~Ied~~ilrd~-~~~sRGcaFV~fstke~A~~Aik 183 (510)
T KOG0144|consen 109 VQVKYADGE---RERIVEERKLFVGMLSKQCTENEVREIFS-RFGHIEDCYILRDP-DGLSRGCAFVKFSTKEMAVAAIK 183 (510)
T ss_pred eeecccchh---hhccccchhhhhhhccccccHHHHHHHHH-hhCccchhhheecc-cccccceeEEEEehHHHHHHHHH
Confidence 999998743 22224578999999999999999999999 99999999999996 89999999999999999999999
Q ss_pred HhCCce-e--cceeeEEccCCCCCCCC
Q 021958 235 EMNGVF-C--STRPMRIGPATNKKTVS 258 (305)
Q Consensus 235 ~l~~~~-~--~g~~l~v~~~~~~~~~~ 258 (305)
.|||.. + +...|.|+|+..++.+.
T Consensus 184 a~ng~~tmeGcs~PLVVkFADtqkdk~ 210 (510)
T KOG0144|consen 184 ALNGTQTMEGCSQPLVVKFADTQKDKD 210 (510)
T ss_pred hhccceeeccCCCceEEEecccCCCch
Confidence 999966 5 44689999999887664
No 8
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.97 E-value=2.1e-30 Score=239.37 Aligned_cols=172 Identities=24% Similarity=0.420 Sum_probs=152.8
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
....++|||+|||..+++++|+++|+.||.|.+|+++.++.+++++|||||+|.+.++|.+|+ .|+|..+ .++.|.+
T Consensus 86 ~~~~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~~~~~skg~afVeF~~~e~A~~Al-~l~g~~~--~g~~i~v 162 (457)
T TIGR01622 86 ERDDRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDRNSRRSKGVAYVEFYDVESVIKAL-ALTGQML--LGRPIIV 162 (457)
T ss_pred ccCCcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecCCCCCcceEEEEEECCHHHHHHHH-HhCCCEE--CCeeeEE
Confidence 345789999999999999999999999999999999999999999999999999999999999 4899998 5677777
Q ss_pred eccCCCCCC--------CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHH
Q 021958 158 NWASFGAGE--------KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQ 229 (305)
Q Consensus 158 ~~~~~~~~~--------~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A 229 (305)
.++...... ........+|||+||+..+++++|+++|+ .||.|..|.+..+..+|+++|||||+|.+.++|
T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~p~~~~l~v~nl~~~~te~~l~~~f~-~~G~i~~v~~~~d~~~g~~~g~afV~f~~~e~A 241 (457)
T TIGR01622 163 QSSQAEKNRAAKAATHQPGDIPNFLKLYVGNLHFNITEQELRQIFE-PFGDIEDVQLHRDPETGRSKGFGFIQFHDAEEA 241 (457)
T ss_pred eecchhhhhhhhcccccCCCCCCCCEEEEcCCCCCCCHHHHHHHHH-hcCCeEEEEEEEcCCCCccceEEEEEECCHHHH
Confidence 665432111 01112368999999999999999999999 999999999999988899999999999999999
Q ss_pred HHHHHHhCCceecceeeEEccCCC
Q 021958 230 LRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 230 ~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
.+|+..|+|..|.|+.|+|.|++.
T Consensus 242 ~~A~~~l~g~~i~g~~i~v~~a~~ 265 (457)
T TIGR01622 242 KEALEVMNGFELAGRPIKVGYAQD 265 (457)
T ss_pred HHHHHhcCCcEECCEEEEEEEccC
Confidence 999999999999999999999764
No 9
>KOG0117 consensus Heterogeneous nuclear ribonucleoprotein R (RRM superfamily) [RNA processing and modification]
Probab=99.97 E-value=9e-30 Score=218.10 Aligned_cols=195 Identities=14% Similarity=0.228 Sum_probs=163.2
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...+-||||.||.++.|++|..+|++.|.|-+++++.|+.+|.+||||||.|.+.++|++|++.||+.+|. .|+.|.|+
T Consensus 81 ~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~~sG~nRGYAFVtf~~Ke~Aq~Aik~lnn~Eir-~GK~igvc 159 (506)
T KOG0117|consen 81 PRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDPFSGDNRGYAFVTFCTKEEAQEAIKELNNYEIR-PGKLLGVC 159 (506)
T ss_pred CCCceEEecCCCccccchhhHHHHHhccceeeEEEeecccCCCCcceEEEEeecHHHHHHHHHHhhCcccc-CCCEeEEE
Confidence 34578999999999999999999999999999999999999999999999999999999999999999997 48888887
Q ss_pred ccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEecC-CCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 159 WASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPS-TKGAKVVIDR-LTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 159 ~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~~-~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
.+.. +++|||||+|..+++++|.+.|+ +.++ |.+|.+..+. +..++||||||+|.|...|..|.++|
T Consensus 160 ~Sva----------n~RLFiG~IPK~k~keeIlee~~-kVteGVvdVivy~~p~dk~KNRGFaFveYe~H~~Aa~aRrKl 228 (506)
T KOG0117|consen 160 VSVA----------NCRLFIGNIPKTKKKEEILEEMK-KVTEGVVDVIVYPSPDDKTKNRGFAFVEYESHRAAAMARRKL 228 (506)
T ss_pred Eeee----------cceeEeccCCccccHHHHHHHHH-hhCCCeeEEEEecCccccccccceEEEEeecchhHHHHHhhc
Confidence 6653 58999999999999999999999 6665 6666665543 35688999999999999999999888
Q ss_pred CC--ceecceeeEEccCCCCCCCCCCCCCCCcceEEeCCcchh----hhHHHHhhc
Q 021958 237 NG--VFCSTRPMRIGPATNKKTVSGQQQYPKGTFLVVGTGFCL----QIKFKFKYF 286 (305)
Q Consensus 237 ~~--~~~~g~~l~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~----~~~~~~~~~ 286 (305)
-. ..++|..+.|+|+.+....... ...+-..|++.+.... .+++.|.+|
T Consensus 229 ~~g~~klwgn~~tVdWAep~~e~ded-~ms~VKvLYVRNL~~~tTeE~lk~~F~~~ 283 (506)
T KOG0117|consen 229 MPGKIKLWGNAITVDWAEPEEEPDED-TMSKVKVLYVRNLMESTTEETLKKLFNEF 283 (506)
T ss_pred cCCceeecCCcceeeccCcccCCChh-hhhheeeeeeeccchhhhHHHHHHHHHhc
Confidence 43 4589999999999998887543 4444457777764433 334555555
No 10
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.97 E-value=8.1e-30 Score=234.39 Aligned_cols=195 Identities=15% Similarity=0.230 Sum_probs=154.0
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...++|||+|||.++++++|+++|++||.|.+++|++| .+|+++|||||+|.+.++|++|++.|++..+.. ++.+.+.
T Consensus 56 ~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D-~sG~sRGfaFV~F~~~e~A~~Ai~~lng~~i~~-Gr~l~V~ 133 (578)
T TIGR01648 56 GRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMD-FSGQNRGYAFVTFCGKEEAKEAVKLLNNYEIRP-GRLLGVC 133 (578)
T ss_pred CCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEEC-CCCCccceEEEEeCCHHHHHHHHHHcCCCeecC-Ccccccc
Confidence 45689999999999999999999999999999999999 689999999999999999999999999998852 5566665
Q ss_pred ccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEE-ecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 159 WASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPS-TKGAKVV-IDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 159 ~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~-~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
++. ..++|||+|||.++++++|.+.|+ +++. +.++.+. .+...++++|||||+|++.++|.+|++.|
T Consensus 134 ~S~----------~~~rLFVgNLP~~~TeeeL~eeFs-kv~egvv~vIv~~~~~~kgKnRGFAFVeF~s~edAa~AirkL 202 (578)
T TIGR01648 134 ISV----------DNCRLFVGGIPKNKKREEILEEFS-KVTEGVVDVIVYHSAADKKKNRGFAFVEYESHRAAAMARRKL 202 (578)
T ss_pred ccc----------cCceeEeecCCcchhhHHHHHHhh-cccCCceEEEEeccccccCccCceEEEEcCCHHHHHHHHHHh
Confidence 543 258899999999999999999999 7754 4444332 33345678999999999999999999988
Q ss_pred CC--ceecceeeEEccCCCCCCCCCCCCCCCcceEEeCCcchhhhHHHHhhcc
Q 021958 237 NG--VFCSTRPMRIGPATNKKTVSGQQQYPKGTFLVVGTGFCLQIKFKFKYFI 287 (305)
Q Consensus 237 ~~--~~~~g~~l~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 287 (305)
+. ..++|+.|+|+|+.++..... ........+++++....-..+.|+++|
T Consensus 203 ~~gki~l~Gr~I~VdwA~p~~~~d~-~~~~~~k~LfVgNL~~~~tee~L~~~F 254 (578)
T TIGR01648 203 MPGRIQLWGHVIAVDWAEPEEEVDE-DVMAKVKILYVRNLMTTTTEEIIEKSF 254 (578)
T ss_pred hccceEecCceEEEEeecccccccc-cccccccEEEEeCCCCCCCHHHHHHHH
Confidence 64 458999999999987654432 222344567777643332233344333
No 11
>KOG0131 consensus Splicing factor 3b, subunit 4 [RNA processing and modification]
Probab=99.97 E-value=1.2e-30 Score=198.64 Aligned_cols=176 Identities=27% Similarity=0.475 Sum_probs=159.4
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
+.+...+||||||+..++++.|+++|-+.|.|.++++.+|+.+...+|||||+|.++++|+.|++.|+...+ .|+.|+
T Consensus 5 ~rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~Ef~~eedadYAikiln~VkL--YgrpIr 82 (203)
T KOG0131|consen 5 ERNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFAEFRTEEDADYAIKILNMVKL--YGRPIR 82 (203)
T ss_pred ccCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhhhcccccceeEEEEechhhhHHHHHHHHHHHh--cCceeE
Confidence 456678999999999999999999999999999999999999999999999999999999999999998777 889999
Q ss_pred eeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeE-EEEEecCCCCCcccEEEEEeCCHHHHHHHHHH
Q 021958 157 LNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKG-AKVVIDRLTGRTKGYGFVRFGDESEQLRAMTE 235 (305)
Q Consensus 157 v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~-i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~ 235 (305)
++.+.. .......+.+|||+||..+++|..|...|+ .||.+.+ -.++++..+|.++|||||.|.+.+.+.+|+..
T Consensus 83 v~kas~---~~~nl~vganlfvgNLd~~vDe~~L~dtFs-afG~l~~~P~i~rd~~tg~~~~~g~i~~~sfeasd~ai~s 158 (203)
T KOG0131|consen 83 VNKASA---HQKNLDVGANLFVGNLDPEVDEKLLYDTFS-AFGVLISPPKIMRDPDTGNPKGFGFINYASFEASDAAIGS 158 (203)
T ss_pred EEeccc---ccccccccccccccccCcchhHHHHHHHHH-hccccccCCcccccccCCCCCCCeEEechhHHHHHHHHHH
Confidence 998872 223334458999999999999999999999 9999765 37888888999999999999999999999999
Q ss_pred hCCceecceeeEEccCCCCCCCC
Q 021958 236 MNGVFCSTRPMRIGPATNKKTVS 258 (305)
Q Consensus 236 l~~~~~~g~~l~v~~~~~~~~~~ 258 (305)
++|..+++|.++|+++..+...+
T Consensus 159 ~ngq~l~nr~itv~ya~k~~~kg 181 (203)
T KOG0131|consen 159 MNGQYLCNRPITVSYAFKKDTKG 181 (203)
T ss_pred hccchhcCCceEEEEEEecCCCc
Confidence 99999999999999988766554
No 12
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.97 E-value=2.4e-29 Score=237.60 Aligned_cols=173 Identities=23% Similarity=0.432 Sum_probs=152.9
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
...+|||+||+.++++++|+++|+.||.|.+|++..+. +|+++|||||+|.+.++|.+|++.++|..+ .++.+.+.+
T Consensus 87 ~~~~vfV~nLp~~~~~~~L~~~F~~~G~i~~~~i~~~~-~g~skg~afV~F~~~e~A~~Ai~~lng~~~--~~~~i~v~~ 163 (562)
T TIGR01628 87 GVGNIFVKNLDKSVDNKALFDTFSKFGNILSCKVATDE-NGKSRGYGFVHFEKEESAKAAIQKVNGMLL--NDKEVYVGR 163 (562)
T ss_pred CCCceEEcCCCccCCHHHHHHHHHhcCCcceeEeeecC-CCCcccEEEEEECCHHHHHHHHHHhcccEe--cCceEEEec
Confidence 45689999999999999999999999999999999985 688999999999999999999999999988 567777765
Q ss_pred cCCCCCC-CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCC
Q 021958 160 ASFGAGE-KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNG 238 (305)
Q Consensus 160 ~~~~~~~-~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~ 238 (305)
....... .......++|||+||+.++++++|+++|+ .||.|.++.+..+. +|+++|||||+|++.++|.+|++.|+|
T Consensus 164 ~~~~~~~~~~~~~~~~~l~V~nl~~~~tee~L~~~F~-~fG~i~~~~i~~~~-~g~~~G~afV~F~~~e~A~~Av~~l~g 241 (562)
T TIGR01628 164 FIKKHEREAAPLKKFTNLYVKNLDPSVNEDKLRELFA-KFGEITSAAVMKDG-SGRSRGFAFVNFEKHEDAAKAVEEMNG 241 (562)
T ss_pred cccccccccccccCCCeEEEeCCCCcCCHHHHHHHHH-hcCCEEEEEEEECC-CCCcccEEEEEECCHHHHHHHHHHhCC
Confidence 5433222 22344567899999999999999999999 99999999999885 789999999999999999999999999
Q ss_pred ceec----ceeeEEccCCCCCCC
Q 021958 239 VFCS----TRPMRIGPATNKKTV 257 (305)
Q Consensus 239 ~~~~----g~~l~v~~~~~~~~~ 257 (305)
..+. |+.|.|.++..+...
T Consensus 242 ~~i~~~~~g~~l~v~~a~~k~er 264 (562)
T TIGR01628 242 KKIGLAKEGKKLYVGRAQKRAER 264 (562)
T ss_pred cEecccccceeeEeecccChhhh
Confidence 9999 999999998776544
No 13
>KOG0145 consensus RNA-binding protein ELAV/HU (RRM superfamily) [RNA processing and modification]
Probab=99.97 E-value=5.9e-30 Score=205.06 Aligned_cols=173 Identities=27% Similarity=0.493 Sum_probs=159.4
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
++..+.|.|..||..+|+|+|+.+|...|+|++|++++|+.+|.+.||+||.|.++++|++|+..|||.++ ..+.|+|
T Consensus 38 ~~skTNLIvNYLPQ~MTqdE~rSLF~SiGeiEScKLvRDKitGqSLGYGFVNYv~p~DAe~AintlNGLrL--Q~KTIKV 115 (360)
T KOG0145|consen 38 DESKTNLIVNYLPQNMTQDELRSLFGSIGEIESCKLVRDKITGQSLGYGFVNYVRPKDAEKAINTLNGLRL--QNKTIKV 115 (360)
T ss_pred CcccceeeeeecccccCHHHHHHHhhcccceeeeeeeeccccccccccceeeecChHHHHHHHhhhcceee--ccceEEE
Confidence 45556799999999999999999999999999999999999999999999999999999999999999999 6799999
Q ss_pred eccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhC
Q 021958 158 NWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMN 237 (305)
Q Consensus 158 ~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~ 237 (305)
.++... .+......|||.+||...+..||.++|+ .||.|.-.+|+.|..+|.++|.|||+|+..++|+.|++.||
T Consensus 116 SyARPS----s~~Ik~aNLYvSGlPktMtqkelE~iFs-~fGrIItSRiL~dqvtg~srGVgFiRFDKr~EAe~AIk~lN 190 (360)
T KOG0145|consen 116 SYARPS----SDSIKDANLYVSGLPKTMTQKELEQIFS-PFGRIITSRILVDQVTGLSRGVGFIRFDKRIEAEEAIKGLN 190 (360)
T ss_pred EeccCC----hhhhcccceEEecCCccchHHHHHHHHH-HhhhhhhhhhhhhcccceecceeEEEecchhHHHHHHHhcc
Confidence 999865 4456678999999999999999999999 99999999999999999999999999999999999999999
Q ss_pred Cceec--ceeeEEccCCCCCCC
Q 021958 238 GVFCS--TRPMRIGPATNKKTV 257 (305)
Q Consensus 238 ~~~~~--g~~l~v~~~~~~~~~ 257 (305)
|..-. ...|.|+|++.....
T Consensus 191 G~~P~g~tepItVKFannPsq~ 212 (360)
T KOG0145|consen 191 GQKPSGCTEPITVKFANNPSQK 212 (360)
T ss_pred CCCCCCCCCCeEEEecCCcccc
Confidence 98854 468999999876544
No 14
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.97 E-value=2.9e-28 Score=228.12 Aligned_cols=170 Identities=17% Similarity=0.260 Sum_probs=141.0
Q ss_pred CCCCCCceEEEcCCCcccCHHHHhHhhhhcC------------CcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHh
Q 021958 76 GQPGEIRTLWIGDLQYWMDETYLNTCFAHTG------------EVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTF 143 (305)
Q Consensus 76 ~~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G------------~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 143 (305)
......++|||||||..+|+++|+++|..++ .|..+.+ ++.+|||||+|.+.++|..|+ .|
T Consensus 170 ~~~~~~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~------~~~kg~afVeF~~~e~A~~Al-~l 242 (509)
T TIGR01642 170 QATRQARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNI------NKEKNFAFLEFRTVEEATFAM-AL 242 (509)
T ss_pred cCCccccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEE------CCCCCEEEEEeCCHHHHhhhh-cC
Confidence 4456778999999999999999999999752 2333333 455789999999999999999 69
Q ss_pred CCCCCCCCCcceeeeccCCCCC-------------------------CCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcC
Q 021958 144 NGTPMPNGEQNFRLNWASFGAG-------------------------EKRDDTPDHTIFVGDLAADVTDYMLQETFRARY 198 (305)
Q Consensus 144 ~g~~~~~~~~~i~v~~~~~~~~-------------------------~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~ 198 (305)
+|..+ .++.|+|.+...... .........+|||+||+..+++++|+++|+ .|
T Consensus 243 ~g~~~--~g~~l~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~-~~ 319 (509)
T TIGR01642 243 DSIIY--SNVFLKIRRPHDYIPVPQITPEVSQKNPDDNAKNVEKLVNSTTVLDSKDRIYIGNLPLYLGEDQIKELLE-SF 319 (509)
T ss_pred CCeEe--eCceeEecCccccCCccccCCCCCCCCCcccccccccccccccCCCCCCEEEEeCCCCCCCHHHHHHHHH-hc
Confidence 99988 567787764432110 000123457899999999999999999999 99
Q ss_pred CCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCC
Q 021958 199 PSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKK 255 (305)
Q Consensus 199 G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~ 255 (305)
|.|..+.++.+..+|.++|||||+|.+.++|..|+..|+|..|+|+.|.|.++....
T Consensus 320 G~i~~~~~~~~~~~g~~~g~afv~f~~~~~a~~A~~~l~g~~~~~~~l~v~~a~~~~ 376 (509)
T TIGR01642 320 GDLKAFNLIKDIATGLSKGYAFCEYKDPSVTDVAIAALNGKDTGDNKLHVQRACVGA 376 (509)
T ss_pred CCeeEEEEEecCCCCCcCeEEEEEECCHHHHHHHHHHcCCCEECCeEEEEEECccCC
Confidence 999999999998899999999999999999999999999999999999999986543
No 15
>KOG0145 consensus RNA-binding protein ELAV/HU (RRM superfamily) [RNA processing and modification]
Probab=99.96 E-value=1.2e-28 Score=197.48 Aligned_cols=175 Identities=25% Similarity=0.392 Sum_probs=159.3
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
-....|||++||..+|..+|+++|+.||.|...+|..|..+|.+||.+||.|...++|+.|++.|||..-.+....|.|.
T Consensus 125 Ik~aNLYvSGlPktMtqkelE~iFs~fGrIItSRiL~dqvtg~srGVgFiRFDKr~EAe~AIk~lNG~~P~g~tepItVK 204 (360)
T KOG0145|consen 125 IKDANLYVSGLPKTMTQKELEQIFSPFGRIITSRILVDQVTGLSRGVGFIRFDKRIEAEEAIKGLNGQKPSGCTEPITVK 204 (360)
T ss_pred hcccceEEecCCccchHHHHHHHHHHhhhhhhhhhhhhcccceecceeEEEecchhHHHHHHHhccCCCCCCCCCCeEEE
Confidence 34567999999999999999999999999999999999999999999999999999999999999999988888899999
Q ss_pred ccCCCCCCC-----------------------------------------------------------CCCCCCceEEEc
Q 021958 159 WASFGAGEK-----------------------------------------------------------RDDTPDHTIFVG 179 (305)
Q Consensus 159 ~~~~~~~~~-----------------------------------------------------------~~~~~~~~l~v~ 179 (305)
++....... ......++|||.
T Consensus 205 FannPsq~t~~a~ls~ly~sp~rr~~Gp~hh~~~r~r~~~~~~~~~~~~rfsP~~~d~m~~l~~~~lp~~~~~g~ciFvY 284 (360)
T KOG0145|consen 205 FANNPSQKTNQALLSQLYQSPARRYGGPMHHQAQRFRLDNLLNPHAAQARFSPMTIDGMSGLAGVNLPGGPGGGWCIFVY 284 (360)
T ss_pred ecCCcccccchhhhHHhhcCccccCCCcccchhhhhccccccchhhhhccCCCccccccceeeeeccCCCCCCeeEEEEE
Confidence 887641110 011357899999
Q ss_pred CCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCC
Q 021958 180 DLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNK 254 (305)
Q Consensus 180 nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~ 254 (305)
||..+.+|..|.++|. .||-|.+|+|++|..+++.||||||.+.+-++|..|+..|||..+++|.|.|.|..++
T Consensus 285 NLspd~de~~LWQlFg-pFGAv~nVKvirD~ttnkCKGfgFVtMtNYdEAamAi~sLNGy~lg~rvLQVsFKtnk 358 (360)
T KOG0145|consen 285 NLSPDADESILWQLFG-PFGAVTNVKVIRDFTTNKCKGFGFVTMTNYDEAAMAIASLNGYRLGDRVLQVSFKTNK 358 (360)
T ss_pred ecCCCchHhHHHHHhC-cccceeeEEEEecCCcccccceeEEEecchHHHHHHHHHhcCccccceEEEEEEecCC
Confidence 9999999999999999 9999999999999999999999999999999999999999999999999999998765
No 16
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.96 E-value=3.4e-27 Score=218.02 Aligned_cols=167 Identities=18% Similarity=0.237 Sum_probs=142.3
Q ss_pred CCCCceEEEcCCCc-ccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 78 PGEIRTLWIGDLQY-WMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 78 ~~~~~~l~v~nLp~-~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
....++|||+||+. .+|+++|+++|+.||.|.++++++++ +|||||+|.+.++|.+|++.|+|..| .|+.|+
T Consensus 272 ~~~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~~~-----~g~afV~f~~~~~A~~Ai~~lng~~l--~g~~l~ 344 (481)
T TIGR01649 272 GGPGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMKNK-----KETALIEMADPYQAQLALTHLNGVKL--FGKPLR 344 (481)
T ss_pred CCCCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEE--CCceEE
Confidence 34568999999997 69999999999999999999999863 58999999999999999999999999 678888
Q ss_pred eeccCCCCCCC-------------C--------------------CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCC--e
Q 021958 157 LNWASFGAGEK-------------R--------------------DDTPDHTIFVGDLAADVTDYMLQETFRARYPS--T 201 (305)
Q Consensus 157 v~~~~~~~~~~-------------~--------------------~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~--i 201 (305)
|.++....... . ...++.+|||+|||.++++++|+++|+ .||. |
T Consensus 345 v~~s~~~~~~~~~~~~~~~~~~~~~d~~~~~~~r~~~~~~~~~~~~~~ps~~L~v~NLp~~~tee~L~~lF~-~~G~~~i 423 (481)
T TIGR01649 345 VCPSKQQNVQPPREGQLDDGLTSYKDYSSSRNHRFKKPGSANKNNIQPPSATLHLSNIPLSVSEEDLKELFA-ENGVHKV 423 (481)
T ss_pred EEEcccccccCCCCCcCcCCCcccccccCCccccCCCcccccccccCCCCcEEEEecCCCCCCHHHHHHHHH-hcCCccc
Confidence 88775431100 0 012467899999999999999999999 9998 8
Q ss_pred eEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceeccee------eEEccCCCC
Q 021958 202 KGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRP------MRIGPATNK 254 (305)
Q Consensus 202 ~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~------l~v~~~~~~ 254 (305)
..+++.... + ..+++|||+|++.++|.+|+..||++.|+|+. |+|+|++++
T Consensus 424 ~~ik~~~~~-~-~~~~~gfVeF~~~e~A~~Al~~ln~~~l~~~~~~~~~~lkv~fs~~~ 480 (481)
T TIGR01649 424 KKFKFFPKD-N-ERSKMGLLEWESVEDAVEALIALNHHQLNEPNGSAPYHLKVSFSTSR 480 (481)
T ss_pred eEEEEecCC-C-CcceeEEEEcCCHHHHHHHHHHhcCCccCCCCCCccceEEEEeccCC
Confidence 888886543 3 35889999999999999999999999999985 999999865
No 17
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.96 E-value=1.5e-27 Score=219.53 Aligned_cols=170 Identities=18% Similarity=0.295 Sum_probs=141.9
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCC-cEEEEEEe-eCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGE-VVAVKVIR-NKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~-v~~~~~~~-~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
...++|||+|||.++++++|.+.|++++. +.++.+.. ...+++++|||||+|.+.++|..|++.|+...+...++.|.
T Consensus 136 ~~~~rLFVgNLP~~~TeeeL~eeFskv~egvv~vIv~~~~~~kgKnRGFAFVeF~s~edAa~AirkL~~gki~l~Gr~I~ 215 (578)
T TIGR01648 136 VDNCRLFVGGIPKNKKREEILEEFSKVTEGVVDVIVYHSAADKKKNRGFAFVEYESHRAAAMARRKLMPGRIQLWGHVIA 215 (578)
T ss_pred ccCceeEeecCCcchhhHHHHHHhhcccCCceEEEEeccccccCccCceEEEEcCCHHHHHHHHHHhhccceEecCceEE
Confidence 34688999999999999999999999864 44444432 23456889999999999999999999887554444688999
Q ss_pred eeccCCCCCCC-CCCCCCceEEEcCCCcCCCHHHHHHHHhhcC--CCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHH
Q 021958 157 LNWASFGAGEK-RDDTPDHTIFVGDLAADVTDYMLQETFRARY--PSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAM 233 (305)
Q Consensus 157 v~~~~~~~~~~-~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~--G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~ 233 (305)
|+|+....... ......++|||+||+.++++++|+++|+ .| |.|++|+++ ++||||+|++.++|.+|+
T Consensus 216 VdwA~p~~~~d~~~~~~~k~LfVgNL~~~~tee~L~~~F~-~f~~G~I~rV~~~--------rgfAFVeF~s~e~A~kAi 286 (578)
T TIGR01648 216 VDWAEPEEEVDEDVMAKVKILYVRNLMTTTTEEIIEKSFS-EFKPGKVERVKKI--------RDYAFVHFEDREDAVKAM 286 (578)
T ss_pred EEeecccccccccccccccEEEEeCCCCCCCHHHHHHHHH-hcCCCceEEEEee--------cCeEEEEeCCHHHHHHHH
Confidence 99998654332 2234467899999999999999999999 99 999999875 459999999999999999
Q ss_pred HHhCCceecceeeEEccCCCCCCC
Q 021958 234 TEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 234 ~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
+.||+..|+|+.|+|+|+++....
T Consensus 287 ~~lnG~~i~Gr~I~V~~Akp~~~~ 310 (578)
T TIGR01648 287 DELNGKELEGSEIEVTLAKPVDKK 310 (578)
T ss_pred HHhCCCEECCEEEEEEEccCCCcc
Confidence 999999999999999999886543
No 18
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.95 E-value=4.2e-27 Score=217.37 Aligned_cols=162 Identities=14% Similarity=0.106 Sum_probs=138.1
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHh--CCCCCCCCCcceeee
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTF--NGTPMPNGEQNFRLN 158 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l--~g~~~~~~~~~i~v~ 158 (305)
+++|||+|||.++++++|+++|+.||.|.++.++++ ++||||+|.+.++|.+|++.+ ++..+ .++.|+|.
T Consensus 2 s~vv~V~nLp~~~te~~L~~~f~~fG~V~~v~i~~~------k~~afVef~~~e~A~~Ai~~~~~~~~~l--~g~~l~v~ 73 (481)
T TIGR01649 2 SPVVHVRNLPQDVVEADLVEALIPFGPVSYVMMLPG------KRQALVEFEDEESAKACVNFATSVPIYI--RGQPAFFN 73 (481)
T ss_pred ccEEEEcCCCCCCCHHHHHHHHHhcCCeeEEEEECC------CCEEEEEeCchHHHHHHHHHhhcCCceE--cCeEEEEE
Confidence 589999999999999999999999999999999864 469999999999999999875 55666 78999999
Q ss_pred ccCCCCCCCCC--------CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHH
Q 021958 159 WASFGAGEKRD--------DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQL 230 (305)
Q Consensus 159 ~~~~~~~~~~~--------~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~ 230 (305)
|+......+.. .....+|+|+||++++++++|+++|+ .||.|.+|.|..+. + +++|||+|.+.++|.
T Consensus 74 ~s~~~~~~~~~~~~~~~~~~~~~~~v~v~nl~~~vt~~~L~~~F~-~~G~V~~v~i~~~~--~--~~~afVef~~~~~A~ 148 (481)
T TIGR01649 74 YSTSQEIKRDGNSDFDSAGPNKVLRVIVENPMYPITLDVLYQIFN-PYGKVLRIVTFTKN--N--VFQALVEFESVNSAQ 148 (481)
T ss_pred ecCCcccccCCCCcccCCCCCceEEEEEcCCCCCCCHHHHHHHHh-ccCCEEEEEEEecC--C--ceEEEEEECCHHHHH
Confidence 98754322211 11234799999999999999999999 99999999998763 2 468999999999999
Q ss_pred HHHHHhCCceecc--eeeEEccCCCCC
Q 021958 231 RAMTEMNGVFCST--RPMRIGPATNKK 255 (305)
Q Consensus 231 ~a~~~l~~~~~~g--~~l~v~~~~~~~ 255 (305)
+|++.|||..|.| +.|+|.|++.+.
T Consensus 149 ~A~~~Lng~~i~~~~~~l~v~~sk~~~ 175 (481)
T TIGR01649 149 HAKAALNGADIYNGCCTLKIEYAKPTR 175 (481)
T ss_pred HHHHHhcCCcccCCceEEEEEEecCCC
Confidence 9999999999854 689999988643
No 19
>KOG0127 consensus Nucleolar protein fibrillarin NOP77 (RRM superfamily) [RNA processing and modification]
Probab=99.95 E-value=2e-27 Score=208.20 Aligned_cols=220 Identities=25% Similarity=0.386 Sum_probs=178.9
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
.-+|.|.|||+.+...+|..+|+.||.|.+|.|.+.+ .|+-.|||||.|....+|..|++.+||..| .|+.|-++|+
T Consensus 117 k~rLIIRNLPf~~k~~dLk~vFs~~G~V~Ei~IP~k~-dgklcGFaFV~fk~~~dA~~Al~~~N~~~i--~gR~VAVDWA 193 (678)
T KOG0127|consen 117 KWRLIIRNLPFKCKKPDLKNVFSNFGKVVEIVIPRKK-DGKLCGFAFVQFKEKKDAEKALEFFNGNKI--DGRPVAVDWA 193 (678)
T ss_pred cceEEeecCCcccCcHHHHHHHhhcceEEEEEcccCC-CCCccceEEEEEeeHHHHHHHHHhccCcee--cCceeEEeee
Confidence 6689999999999999999999999999999999776 456669999999999999999999999999 7799999999
Q ss_pred CCCC---------------------------------------------C-------------C----------------
Q 021958 161 SFGA---------------------------------------------G-------------E---------------- 166 (305)
Q Consensus 161 ~~~~---------------------------------------------~-------------~---------------- 166 (305)
-.+. . .
T Consensus 194 V~Kd~ye~ta~~~~~s~Kk~~~eEed~e~~~d~~~~~~~Ed~e~d~edeEe~D~~se~~ee~~~~Eee~~~vDd~e~S~~ 273 (678)
T KOG0127|consen 194 VDKDTYEDTAHEEKQSLKKAVKEEEDKEADEDDGKDFDEEDGEEDSEDEEETDGNSEAFEEGEESEEEEDDVDDEESSGK 273 (678)
T ss_pred cccccccccchhhhhhhhhccchhhhcccccccccccchhcccccccccccccccchhhhcccccccccccccccccccc
Confidence 5430 0 0
Q ss_pred ------------CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHH
Q 021958 167 ------------KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMT 234 (305)
Q Consensus 167 ------------~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~ 234 (305)
+.+.....+|||+||++++++++|.++|+ +||.|.++.|+.++.|+.++|.|||.|.+..+|.+||.
T Consensus 274 ~~~~k~~q~k~~~en~~~~~tVFvRNL~fD~tEEel~~~fs-kFG~v~ya~iV~~k~T~~skGtAFv~Fkt~~~~~~ci~ 352 (678)
T KOG0127|consen 274 KESDKKAQNKTTRENITEGKTVFVRNLPFDTTEEELKEHFS-KFGEVKYAIIVKDKDTGHSKGTAFVKFKTQIAAQNCIE 352 (678)
T ss_pred CcccchhccccccccccccceEEEecCCccccHHHHHHHHH-hhccceeEEEEeccCCCCcccceEEEeccHHHHHHHHH
Confidence 00002347999999999999999999999 99999999999999999999999999999999999998
Q ss_pred Hh-----CC-ceecceeeEEccCCCCCCCCC-------CCCCCCcceEEeC-----------CcchhhhHHHHhhcccee
Q 021958 235 EM-----NG-VFCSTRPMRIGPATNKKTVSG-------QQQYPKGTFLVVG-----------TGFCLQIKFKFKYFIGWL 290 (305)
Q Consensus 235 ~l-----~~-~~~~g~~l~v~~~~~~~~~~~-------~~~~~~~~~~~~g-----------~~~~~~~~~~~~~~~g~~ 290 (305)
.. .| ..++||.|+|..+-.+..... ......+.++... .|.+..+|++-..+....
T Consensus 353 ~Aspa~e~g~~ll~GR~Lkv~~Av~RkeA~dmeqkk~~Kk~~gkrNLyLa~EG~I~~gt~aAeglS~~Dm~kRer~~~~k 432 (678)
T KOG0127|consen 353 AASPASEDGSVLLDGRLLKVTLAVTRKEAADMEQKKKRKKPKGKRNLYLAREGLIRDGTPAAEGLSATDMAKRERIAERK 432 (678)
T ss_pred hcCccCCCceEEEeccEEeeeeccchHHHHHHHHHhhhhccCCccceeeeccCccccCChhhcccchhhHHHHHHHHHHH
Confidence 76 23 568999999999877655432 1112223233222 366667788777777677
Q ss_pred eEEEEEeeeeccCC
Q 021958 291 IFIIIFFNISTSST 304 (305)
Q Consensus 291 ~~~~~~~~~~~~~~ 304 (305)
.+.+..=||..|+|
T Consensus 433 ~k~lknpnlhlSrt 446 (678)
T KOG0127|consen 433 RKKLKNPNLHLSRT 446 (678)
T ss_pred HHhhcCCceeeehh
Confidence 77888888888876
No 20
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.95 E-value=1.9e-26 Score=215.84 Aligned_cols=172 Identities=19% Similarity=0.332 Sum_probs=146.0
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...++|||+|||..+|+++|+++|+.||.|..+.++.+..+|.++|||||+|.+.++|..|++.|+|..| .++.|.|.
T Consensus 293 ~~~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~~~g~~~g~afv~f~~~~~a~~A~~~l~g~~~--~~~~l~v~ 370 (509)
T TIGR01642 293 DSKDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDIATGLSKGYAFCEYKDPSVTDVAIAALNGKDT--GDNKLHVQ 370 (509)
T ss_pred CCCCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecCCCCCcCeEEEEEECCHHHHHHHHHHcCCCEE--CCeEEEEE
Confidence 4467999999999999999999999999999999999988999999999999999999999999999999 66788888
Q ss_pred ccCCCCCCC------------------------CCCCCCceEEEcCCCcC--C--------CHHHHHHHHhhcCCCeeEE
Q 021958 159 WASFGAGEK------------------------RDDTPDHTIFVGDLAAD--V--------TDYMLQETFRARYPSTKGA 204 (305)
Q Consensus 159 ~~~~~~~~~------------------------~~~~~~~~l~v~nlp~~--~--------~~~~l~~~f~~~~G~i~~i 204 (305)
++....... ....++.+|+|.|+... + ..++|+++|+ +||.|..|
T Consensus 371 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~v~~l~N~~~~~~l~~d~~~~~~~edl~~~f~-~~G~v~~v 449 (509)
T TIGR01642 371 RACVGANQATIDTSNGMAPVTLLAKALSQSILQIGGKPTKVVQLTNLVTGDDLMDDEEYEEIYEDVKTEFS-KYGPLINI 449 (509)
T ss_pred ECccCCCCCCccccccccccccccccchhhhccccCCCceEEEeccCCchhHhcCcchHHHHHHHHHHHHH-hcCCeeEE
Confidence 875321110 01235678999998542 1 2368999999 99999999
Q ss_pred EEEecC---CCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 205 KVVIDR---LTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 205 ~i~~~~---~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
.|..+. .++.+.|+|||+|.+.++|.+|+..|||..|+|+.|.|.|...
T Consensus 450 ~i~~~~~~~~~~~~~G~~fV~F~~~e~A~~A~~~lnGr~~~gr~v~~~~~~~ 501 (509)
T TIGR01642 450 VIPRPNGDRNSTPGVGKVFLEYADVRSAEKAMEGMNGRKFNDRVVVAAFYGE 501 (509)
T ss_pred EeeccCcCCCcCCCcceEEEEECCHHHHHHHHHHcCCCEECCeEEEEEEeCH
Confidence 998652 3455689999999999999999999999999999999999764
No 21
>KOG0117 consensus Heterogeneous nuclear ribonucleoprotein R (RRM superfamily) [RNA processing and modification]
Probab=99.95 E-value=4.6e-27 Score=201.53 Aligned_cols=173 Identities=19% Similarity=0.282 Sum_probs=151.9
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCC-cEEEEEEeeCC-CCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcce
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGE-VVAVKVIRNKQ-TGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNF 155 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~-v~~~~~~~~~~-~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i 155 (305)
....++|||||||.+.++++|.+.+++.++ |++|.+..+.. ..++||||||+|.+...|..|..+|..-.|...+..+
T Consensus 161 Svan~RLFiG~IPK~k~keeIlee~~kVteGVvdVivy~~p~dk~KNRGFaFveYe~H~~Aa~aRrKl~~g~~klwgn~~ 240 (506)
T KOG0117|consen 161 SVANCRLFIGNIPKTKKKEEILEEMKKVTEGVVDVIVYPSPDDKTKNRGFAFVEYESHRAAAMARRKLMPGKIKLWGNAI 240 (506)
T ss_pred eeecceeEeccCCccccHHHHHHHHHhhCCCeeEEEEecCccccccccceEEEEeecchhHHHHHhhccCCceeecCCcc
Confidence 345689999999999999999999999987 77887777653 3689999999999999999999988766676678999
Q ss_pred eeeccCCCCCCCC-CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHH
Q 021958 156 RLNWASFGAGEKR-DDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMT 234 (305)
Q Consensus 156 ~v~~~~~~~~~~~-~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~ 234 (305)
.|+|+........ .....+.|||+||+.++|+|.|+++|+ +||.|++|+.++| ||||.|.++++|.+|++
T Consensus 241 tVdWAep~~e~ded~ms~VKvLYVRNL~~~tTeE~lk~~F~-~~G~veRVkk~rD--------YaFVHf~eR~davkAm~ 311 (506)
T KOG0117|consen 241 TVDWAEPEEEPDEDTMSKVKVLYVRNLMESTTEETLKKLFN-EFGKVERVKKPRD--------YAFVHFAEREDAVKAMK 311 (506)
T ss_pred eeeccCcccCCChhhhhheeeeeeeccchhhhHHHHHHHHH-hccceEEeecccc--------eeEEeecchHHHHHHHH
Confidence 9999997754432 335567899999999999999999999 9999999988755 99999999999999999
Q ss_pred HhCCceecceeeEEccCCCCCCCCC
Q 021958 235 EMNGVFCSTRPMRIGPATNKKTVSG 259 (305)
Q Consensus 235 ~l~~~~~~g~~l~v~~~~~~~~~~~ 259 (305)
.+||++|+|..|.|.++++..+.+.
T Consensus 312 ~~ngkeldG~~iEvtLAKP~~k~k~ 336 (506)
T KOG0117|consen 312 ETNGKELDGSPIEVTLAKPVDKKKK 336 (506)
T ss_pred HhcCceecCceEEEEecCChhhhcc
Confidence 9999999999999999999766644
No 22
>KOG0127 consensus Nucleolar protein fibrillarin NOP77 (RRM superfamily) [RNA processing and modification]
Probab=99.95 E-value=1.5e-26 Score=202.73 Aligned_cols=175 Identities=20% Similarity=0.347 Sum_probs=156.2
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
..||||++||++++.++|.++|+.+|.|..+.++.++.++.++||+||.|...+++.+|+..+++..+ .|+.++++.+
T Consensus 5 g~TlfV~~lp~~~~~~qL~e~FS~vGPik~~~vVt~~gs~~~RGfgfVtFam~ED~qrA~~e~~~~kf--~Gr~l~v~~A 82 (678)
T KOG0127|consen 5 GATLFVSRLPFSSTGEQLEEFFSYVGPIKHAVVVTNKGSSEKRGFGFVTFAMEEDVQRALAETEQSKF--EGRILNVDPA 82 (678)
T ss_pred CceEEEecCCCccchhHHHHhhhcccCcceeEEecCCCcccccCccceeeehHhHHHHHHHHhhcCcc--cceecccccc
Confidence 37999999999999999999999999999999999999999999999999999999999999999988 6788888888
Q ss_pred CCCCCCC----------------------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccE
Q 021958 161 SFGAGEK----------------------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGY 218 (305)
Q Consensus 161 ~~~~~~~----------------------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~ 218 (305)
..+...+ .-..+..+|.|+|||+.+.+.+|..+|+ .||.|.+|.|.+.. .|+-.||
T Consensus 83 ~~R~r~e~~~~~e~~~veK~~~q~~~~k~~v~~~k~rLIIRNLPf~~k~~dLk~vFs-~~G~V~Ei~IP~k~-dgklcGF 160 (678)
T KOG0127|consen 83 KKRARSEEVEKGENKAVEKPIEQKRPTKAKVDLPKWRLIIRNLPFKCKKPDLKNVFS-NFGKVVEIVIPRKK-DGKLCGF 160 (678)
T ss_pred cccccchhcccccchhhhcccccCCcchhhccCccceEEeecCCcccCcHHHHHHHh-hcceEEEEEcccCC-CCCccce
Confidence 7653322 0023478999999999999999999999 99999999999776 4555699
Q ss_pred EEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCCCC
Q 021958 219 GFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTVSG 259 (305)
Q Consensus 219 afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~~~ 259 (305)
|||.|....+|.+|++.+|+..|+||.|.|+|+-.+.....
T Consensus 161 aFV~fk~~~dA~~Al~~~N~~~i~gR~VAVDWAV~Kd~ye~ 201 (678)
T KOG0127|consen 161 AFVQFKEKKDAEKALEFFNGNKIDGRPVAVDWAVDKDTYED 201 (678)
T ss_pred EEEEEeeHHHHHHHHHhccCceecCceeEEeeecccccccc
Confidence 99999999999999999999999999999999988776644
No 23
>KOG0148 consensus Apoptosis-promoting RNA-binding protein TIA-1/TIAR (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=99.94 E-value=3.6e-26 Score=184.40 Aligned_cols=178 Identities=30% Similarity=0.466 Sum_probs=138.8
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
+++.++||||||+.++||+.|..+|++.|.+.+++++.| .+++
T Consensus 3 ~~~prtlyvgnld~~vte~~i~~lf~qig~v~~~k~i~~-------------------------------------e~~v 45 (321)
T KOG0148|consen 3 SDEPRTLYVGNLDSTVTEDFIATLFNQIGSVTKTKVIFD-------------------------------------ELKV 45 (321)
T ss_pred CCCCceEEeeccChhhHHHHHHHHHHhccccccceeehh-------------------------------------hhcc
Confidence 466799999999999999999999999999999999876 2344
Q ss_pred eccCCCCCCC-CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 158 NWASFGAGEK-RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 158 ~~~~~~~~~~-~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
.|+......+ ........+||+.|..+++.|+|++.|. +||+|.+++|++|..|+++||||||.|.+.++|++||..|
T Consensus 46 ~wa~~p~nQsk~t~~~hfhvfvgdls~eI~~e~lr~aF~-pFGevS~akvirD~~T~KsKGYgFVSf~~k~dAEnAI~~M 124 (321)
T KOG0148|consen 46 NWATAPGNQSKPTSNQHFHVFVGDLSPEIDNEKLREAFA-PFGEVSDAKVIRDMNTGKSKGYGFVSFPNKEDAENAIQQM 124 (321)
T ss_pred ccccCcccCCCCccccceeEEehhcchhcchHHHHHHhc-cccccccceEeecccCCcccceeEEeccchHHHHHHHHHh
Confidence 4444432222 2223366799999999999999999999 9999999999999999999999999999999999999999
Q ss_pred CCceecceeeEEccCCCCCCCCC----------CCCCCCcceEEeCCcchhhhHHHHhhccceeeEE
Q 021958 237 NGVFCSTRPMRIGPATNKKTVSG----------QQQYPKGTFLVVGTGFCLQIKFKFKYFIGWLIFI 293 (305)
Q Consensus 237 ~~~~~~g~~l~v~~~~~~~~~~~----------~~~~~~~~~~~~g~~~~~~~~~~~~~~~g~~~~~ 293 (305)
||..|++|.|+-.|+.+|..... -+...+.....+|+....-..+.+++.|.-+..|
T Consensus 125 nGqWlG~R~IRTNWATRKp~e~n~~~ltfdeV~NQssp~NtsVY~G~I~~~lte~~mr~~Fs~fG~I 191 (321)
T KOG0148|consen 125 NGQWLGRRTIRTNWATRKPSEMNGKPLTFDEVYNQSSPDNTSVYVGNIASGLTEDLMRQTFSPFGPI 191 (321)
T ss_pred CCeeeccceeeccccccCccccCCCCccHHHHhccCCCCCceEEeCCcCccccHHHHHHhcccCCcc
Confidence 99999999999999998774432 2223444555666543322234444444444333
No 24
>KOG0146 consensus RNA-binding protein ETR-3 (RRM superfamily) [RNA processing and modification]
Probab=99.94 E-value=3e-26 Score=184.53 Aligned_cols=176 Identities=24% Similarity=0.436 Sum_probs=158.7
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCC-CCCCCCcceeee
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGT-PMPNGEQNFRLN 158 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~-~~~~~~~~i~v~ 158 (305)
+.++||||.|...-.|||++.+|..||.+.+|.+.+.. .|.+||||||.|.+..+|..||..|+|. .+.|....+.|.
T Consensus 18 ~drklfvgml~kqq~e~dvrrlf~pfG~~~e~tvlrg~-dg~sKGCAFVKf~s~~eAqaAI~aLHgSqTmpGASSSLVVK 96 (371)
T KOG0146|consen 18 DDRKLFVGMLNKQQSEDDVRRLFQPFGNIEECTVLRGP-DGNSKGCAFVKFSSHAEAQAAINALHGSQTMPGASSSLVVK 96 (371)
T ss_pred cchhhhhhhhcccccHHHHHHHhcccCCcceeEEecCC-CCCCCCceEEEeccchHHHHHHHHhcccccCCCCccceEEE
Confidence 67899999999999999999999999999999999986 5999999999999999999999999886 566667778888
Q ss_pred ccCCC---------------------------------------------------------------------------
Q 021958 159 WASFG--------------------------------------------------------------------------- 163 (305)
Q Consensus 159 ~~~~~--------------------------------------------------------------------------- 163 (305)
+++.+
T Consensus 97 ~ADTdkER~lRRMQQma~qlGm~~Pl~l~~g~~~ay~qalmQqQa~~~at~~~~~L~p~~~~~~~~mQ~~aA~~angl~A 176 (371)
T KOG0146|consen 97 FADTDKERTLRRMQQMAGQLGMFNPLALPFGAYGAYAQALMQQQAALLATVAGPYLSPMAAFAAAQMQQMAALNANGLAA 176 (371)
T ss_pred eccchHHHHHHHHHHHHHHhcccCccccccchhHHHHHHHHHHHHHHHHhhcccccChhhhhHHHHHHHHHHHhhccccc
Confidence 88764
Q ss_pred --------------------------------------------------------------------------------
Q 021958 164 -------------------------------------------------------------------------------- 163 (305)
Q Consensus 164 -------------------------------------------------------------------------------- 163 (305)
T Consensus 177 ~Pv~p~s~~~~pp~~~a~~~~~~~A~~g~ng~~~l~~q~~gqpa~~~vy~ng~~pypaQsp~va~~lq~a~~g~~~Y~Aa 256 (371)
T KOG0146|consen 177 APVTPASGGSTPPGIGATAVPGIPAPIGVNGFTGLPPQPNGQPAAEAVYANGLHPYPAQSPTVADPLQQAYAGVQQYAAA 256 (371)
T ss_pred CCcCccccCCCCCcccccccCCcccccccccccCCCCCCCCCcchhHHhhcCCccCCCCCccccchhhhhhhhHHHHhhh
Confidence
Q ss_pred -------------------CCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeC
Q 021958 164 -------------------AGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFG 224 (305)
Q Consensus 164 -------------------~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~ 224 (305)
.....+....|.|||..||.++.+.||.+.|. .||.|.+.++..|+.|+.+|.|+||.|+
T Consensus 257 ypaays~v~~~~pq~p~~~~qqqreGPeGCNlFIYHLPQEFgDaEliQmF~-PFGhivSaKVFvDRATNQSKCFGFVSfD 335 (371)
T KOG0146|consen 257 YPAAYSPISQAFPQPPPLLPQQQREGPEGCNLFIYHLPQEFGDAELIQMFL-PFGHIVSAKVFVDRATNQSKCFGFVSFD 335 (371)
T ss_pred cchhhhhhhhcCCCCcchhhhhhhcCCCcceEEEEeCchhhccHHHHHHhc-cccceeeeeeeehhccccccceeeEecC
Confidence 00012225689999999999999999999999 9999999999999999999999999999
Q ss_pred CHHHHHHHHHHhCCceecceeeEEccCCCCCCC
Q 021958 225 DESEQLRAMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 225 ~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
+..+|..||..|||..|+-++|+|...++++..
T Consensus 336 Np~SaQaAIqAMNGFQIGMKRLKVQLKRPkdan 368 (371)
T KOG0146|consen 336 NPASAQAAIQAMNGFQIGMKRLKVQLKRPKDAN 368 (371)
T ss_pred CchhHHHHHHHhcchhhhhhhhhhhhcCccccC
Confidence 999999999999999999999999998887654
No 25
>KOG0109 consensus RNA-binding protein LARK, contains RRM and retroviral-type Zn-finger domains [RNA processing and modification; General function prediction only]
Probab=99.94 E-value=1.2e-26 Score=188.95 Aligned_cols=152 Identities=21% Similarity=0.437 Sum_probs=140.5
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
.+|||||||.++++.+|+.+|++||.|.+|.|+++ ||||..++...|+.|+..|+|..| .|..|+|..++
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlECDIvKN--------YgFVHiEdktaaedairNLhgYtL--hg~nInVeaSk 72 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFEQYGKVLECDIVKN--------YGFVHIEDKTAAEDAIRNLHGYTL--HGVNINVEASK 72 (346)
T ss_pred cchhccCCCcccchHHHHHHHHhhCceEeeeeecc--------cceEEeecccccHHHHhhccccee--cceEEEEEecc
Confidence 47999999999999999999999999999999986 999999999999999999999999 56888887776
Q ss_pred CCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee
Q 021958 162 FGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC 241 (305)
Q Consensus 162 ~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~ 241 (305)
.+ ...+.+|+|+||...++.++|+..|+ +||.|.+++|+ ++|+||.|+-.++|..|++.|++.++
T Consensus 73 sK------sk~stkl~vgNis~tctn~ElRa~fe-~ygpviecdiv--------kdy~fvh~d~~eda~~air~l~~~~~ 137 (346)
T KOG0109|consen 73 SK------SKASTKLHVGNISPTCTNQELRAKFE-KYGPVIECDIV--------KDYAFVHFDRAEDAVEAIRGLDNTEF 137 (346)
T ss_pred cc------CCCccccccCCCCccccCHHHhhhhc-ccCCceeeeee--------cceeEEEEeeccchHHHHhccccccc
Confidence 43 45688999999999999999999999 99999999998 45999999999999999999999999
Q ss_pred cceeeEEccCCCCCCCC
Q 021958 242 STRPMRIGPATNKKTVS 258 (305)
Q Consensus 242 ~g~~l~v~~~~~~~~~~ 258 (305)
.|++++|..++.+.+..
T Consensus 138 ~gk~m~vq~stsrlrta 154 (346)
T KOG0109|consen 138 QGKRMHVQLSTSRLRTA 154 (346)
T ss_pred ccceeeeeeeccccccC
Confidence 99999999999877664
No 26
>KOG0124 consensus Polypyrimidine tract-binding protein PUF60 (RRM superfamily) [RNA processing and modification]
Probab=99.93 E-value=4.3e-26 Score=191.00 Aligned_cols=171 Identities=18% Similarity=0.386 Sum_probs=156.0
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
.++||||.|.+.+.||.|+..|..||.|++|.+..|+.|++.||||||+|+-.|.|..|++.+||..+ +||.|++.+.
T Consensus 113 McRvYVGSIsfEl~EDtiR~AF~PFGPIKSInMSWDp~T~kHKgFAFVEYEvPEaAqLAlEqMNg~ml--GGRNiKVgrP 190 (544)
T KOG0124|consen 113 MCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQML--GGRNIKVGRP 190 (544)
T ss_pred hHheeeeeeEEEechHHHHhhccCCCCcceeecccccccccccceEEEEEeCcHHHHHHHHHhccccc--cCccccccCC
Confidence 47899999999999999999999999999999999999999999999999999999999999999999 8899999876
Q ss_pred CCCCCCC-------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHH
Q 021958 161 SFGAGEK-------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAM 233 (305)
Q Consensus 161 ~~~~~~~-------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~ 233 (305)
....... .+...-++|||..+..+++|+||+.+|+ .||+|.+|.+.+++..+..+||+|++|.+..+-..|+
T Consensus 191 sNmpQAQpiID~vqeeAk~fnRiYVaSvHpDLSe~DiKSVFE-AFG~I~~C~LAr~pt~~~HkGyGfiEy~n~qs~~eAi 269 (544)
T KOG0124|consen 191 SNMPQAQPIIDMVQEEAKKFNRIYVASVHPDLSETDIKSVFE-AFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAI 269 (544)
T ss_pred CCCcccchHHHHHHHHHHhhheEEeeecCCCccHHHHHHHHH-hhcceeeEEeeccCCCCCccceeeEEeccccchHHHh
Confidence 5442221 2234568999999999999999999999 9999999999999988899999999999999999999
Q ss_pred HHhCCceecceeeEEccCCCC
Q 021958 234 TEMNGVFCSTRPMRIGPATNK 254 (305)
Q Consensus 234 ~~l~~~~~~g~~l~v~~~~~~ 254 (305)
..||=+.++|..|+|..+-..
T Consensus 270 asMNlFDLGGQyLRVGk~vTP 290 (544)
T KOG0124|consen 270 ASMNLFDLGGQYLRVGKCVTP 290 (544)
T ss_pred hhcchhhcccceEecccccCC
Confidence 999999999999999876543
No 27
>KOG0123 consensus Polyadenylate-binding protein (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=99.93 E-value=9.2e-25 Score=192.58 Aligned_cols=198 Identities=24% Similarity=0.363 Sum_probs=161.4
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
..|||| +++|+.+|.+.|+.+|.+.++++.+|. + +.|||||.|.+.++|++|++++|...+ .++.+++.|+.
T Consensus 2 ~sl~vg---~~v~e~~l~~~f~~~~~v~s~rvc~d~-t--slgy~yvnf~~~~da~~A~~~~n~~~~--~~~~~rim~s~ 73 (369)
T KOG0123|consen 2 ASLYVG---PDVTEAMLFDKFSPAGPVLSIRVCRDA-T--SLGYAYVNFQQPADAERALDTMNFDVL--KGKPIRIMWSQ 73 (369)
T ss_pred CceecC---CcCChHHHHHHhcccCCceeEEEeecC-C--ccceEEEecCCHHHHHHHHHHcCCccc--CCcEEEeehhc
Confidence 368999 999999999999999999999999997 5 999999999999999999999999999 78999999998
Q ss_pred CCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee
Q 021958 162 FGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC 241 (305)
Q Consensus 162 ~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~ 241 (305)
.... .|||.||+.+++..+|.+.|+ .||+|.+|++..+. +| ++|| ||+|++.++|.+|+..+||..+
T Consensus 74 rd~~---------~~~i~nl~~~~~~~~~~d~f~-~~g~ilS~kv~~~~-~g-~kg~-FV~f~~e~~a~~ai~~~ng~ll 140 (369)
T KOG0123|consen 74 RDPS---------LVFIKNLDESIDNKSLYDTFS-EFGNILSCKVATDE-NG-SKGY-FVQFESEESAKKAIEKLNGMLL 140 (369)
T ss_pred cCCc---------eeeecCCCcccCcHHHHHHHH-hhcCeeEEEEEEcC-CC-ceee-EEEeCCHHHHHHHHHHhcCccc
Confidence 6532 299999999999999999999 99999999999996 45 8999 9999999999999999999999
Q ss_pred cceeeEEccCCCCCCCCCCC--CCCCcceEEeCCcchhhhHHHHhhccceeeEEEEEeeee
Q 021958 242 STRPMRIGPATNKKTVSGQQ--QYPKGTFLVVGTGFCLQIKFKFKYFIGWLIFIIIFFNIS 300 (305)
Q Consensus 242 ~g~~l~v~~~~~~~~~~~~~--~~~~~~~~~~g~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 300 (305)
.|+.|.|.....+..+.... ....-+..++.+.........+..++.....|...+=|-
T Consensus 141 ~~kki~vg~~~~~~er~~~~~~~~~~~t~v~vk~~~~~~~~~~l~~~f~~~g~i~s~~v~~ 201 (369)
T KOG0123|consen 141 NGKKIYVGLFERKEEREAPLGEYKKRFTNVYVKNLEEDSTDEELKDLFSAYGSITSVAVMR 201 (369)
T ss_pred CCCeeEEeeccchhhhcccccchhhhhhhhheeccccccchHHHHHhhcccCcceEEEEee
Confidence 99999999888766653211 111112223333222333445666666665555554443
No 28
>KOG0147 consensus Transcriptional coactivator CAPER (RRM superfamily) [Transcription]
Probab=99.93 E-value=4.5e-26 Score=200.55 Aligned_cols=209 Identities=20% Similarity=0.332 Sum_probs=175.5
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
...+.++||+--|...++..+|+++|+.+|.|.+++++.|+.+++++|.|||+|.+.+....|+ .|.|+.+ .|.+|.
T Consensus 175 eERd~Rtvf~~qla~r~~pRdL~efFs~~gkVrdVriI~Dr~s~rskgi~Yvef~D~~sVp~ai-aLsGqrl--lg~pv~ 251 (549)
T KOG0147|consen 175 EERDQRTVFCMQLARRNPPRDLEEFFSIVGKVRDVRIIGDRNSRRSKGIAYVEFCDEQSVPLAI-ALSGQRL--LGVPVI 251 (549)
T ss_pred hHHhHHHHHHHHHhhcCCchhHHHHHHhhcCcceeEeeccccchhhcceeEEEEecccchhhHh-hhcCCcc--cCceeE
Confidence 4567899999999999999999999999999999999999999999999999999999999999 7899998 456666
Q ss_pred eeccCCCCCC----------CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCH
Q 021958 157 LNWASFGAGE----------KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDE 226 (305)
Q Consensus 157 v~~~~~~~~~----------~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~ 226 (305)
+......... +.-..+-..|||+||.+++++++|+..|+ .||.|+.|.+..|.++|.++|||||+|.+.
T Consensus 252 vq~sEaeknr~a~~s~a~~~k~~~~p~~rl~vgnLHfNite~~lr~ife-pfg~Ie~v~l~~d~~tG~skgfGfi~f~~~ 330 (549)
T KOG0147|consen 252 VQLSEAEKNRAANASPALQGKGFTGPMRRLYVGNLHFNITEDMLRGIFE-PFGKIENVQLTKDSETGRSKGFGFITFVNK 330 (549)
T ss_pred ecccHHHHHHHHhccccccccccccchhhhhhcccccCchHHHHhhhcc-CcccceeeeeccccccccccCcceEEEecH
Confidence 6544322100 11122333499999999999999999999 999999999999999999999999999999
Q ss_pred HHHHHHHHHhCCceecceeeEEccCCCCCCCCCC------CCCCCcceEEeCCcchhhhHHHHhhccce
Q 021958 227 SEQLRAMTEMNGVFCSTRPMRIGPATNKKTVSGQ------QQYPKGTFLVVGTGFCLQIKFKFKYFIGW 289 (305)
Q Consensus 227 ~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~~~~------~~~~~~~~~~~g~~~~~~~~~~~~~~~g~ 289 (305)
++|.+|+..|||..|.||.|+|.....+...... .....+..+..|.++..|+|++|+++.|.
T Consensus 331 ~~ar~a~e~lngfelAGr~ikV~~v~~r~~~~~a~~~~~d~D~~d~~gl~~~~~g~~Ql~~kla~~~~~ 399 (549)
T KOG0147|consen 331 EDARKALEQLNGFELAGRLIKVSVVTERVDTKEAAVTQFDFDEDDRQGLSLGSGGRNQLMAKLAEGKGR 399 (549)
T ss_pred HHHHHHHHHhccceecCceEEEEEeeeecccccccccccccchhhccccccccccHHHHHHHHhccCCc
Confidence 9999999999999999999999987776655421 12223456677778899999999998873
No 29
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.93 E-value=3.1e-24 Score=198.32 Aligned_cols=166 Identities=19% Similarity=0.375 Sum_probs=141.4
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
.++|||+||+..+|+++|+++|+.||.|..+.+..+..+|+++|||||+|.+.++|.+|++.|+|..| .++.|+|.|+
T Consensus 186 ~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~~~g~~~g~afV~f~~~e~A~~A~~~l~g~~i--~g~~i~v~~a 263 (457)
T TIGR01622 186 FLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDPETGRSKGFGFIQFHDAEEAKEALEVMNGFEL--AGRPIKVGYA 263 (457)
T ss_pred CCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcCCCCccceEEEEEECCHHHHHHHHHhcCCcEE--CCEEEEEEEc
Confidence 68999999999999999999999999999999999998899999999999999999999999999888 6788999885
Q ss_pred CCCCC------------------------------------C--------------------------------------
Q 021958 161 SFGAG------------------------------------E-------------------------------------- 166 (305)
Q Consensus 161 ~~~~~------------------------------------~-------------------------------------- 166 (305)
..... .
T Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 343 (457)
T TIGR01622 264 QDSTYLLDAANTFEDIDKQQQMGKNLNTEEREQLMEKLDRDDGDGGLLIPGTGSKIALMQKLQRDGIIDPNIPSRYATGA 343 (457)
T ss_pred cCCCccccchhhhccccccccCCcCCCccchHHHHHhhccCCCCccccCCCccchhhhhccccccccccccccccccccc
Confidence 32100 0
Q ss_pred ------------CCCCCCCceEEEcCCCcCCC----------HHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeC
Q 021958 167 ------------KRDDTPDHTIFVGDLAADVT----------DYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFG 224 (305)
Q Consensus 167 ------------~~~~~~~~~l~v~nlp~~~~----------~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~ 224 (305)
........+|+|.||-...+ .+||++.|+ +||.|..|.|.. ....|++||+|.
T Consensus 344 ~~~~~~~~~~~~~~~~~~~~~l~l~n~~~~~~~~~~~~~~~~~~dv~~e~~-k~G~v~~v~v~~----~~~~G~~fV~F~ 418 (457)
T TIGR01622 344 LAIMARNSFVPSTNNNLATTCLVLSNMFDPATEEEPNFDNEILDDVKEECS-KYGGVVHIYVDT----KNSAGKIYLKFS 418 (457)
T ss_pred cccccCCCCCCcccCCCCCcEEEEecCCCCcccccchHHHHHHHHHHHHHH-hcCCeeEEEEeC----CCCceeEEEEEC
Confidence 00113467889999844433 368999999 999999998863 345799999999
Q ss_pred CHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 225 DESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 225 ~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
+.++|.+|++.|||..|+|+.|.|.|...
T Consensus 419 ~~e~A~~A~~~lnGr~f~gr~i~~~~~~~ 447 (457)
T TIGR01622 419 SVDAALAAFQALNGRYFGGKMITAAFVVN 447 (457)
T ss_pred CHHHHHHHHHHhcCcccCCeEEEEEEEcH
Confidence 99999999999999999999999998754
No 30
>KOG0144 consensus RNA-binding protein CUGBP1/BRUNO (RRM superfamily) [RNA processing and modification]
Probab=99.93 E-value=3.5e-25 Score=189.21 Aligned_cols=176 Identities=23% Similarity=0.413 Sum_probs=159.2
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCC-CCCCCCcceeee
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGT-PMPNGEQNFRLN 158 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~-~~~~~~~~i~v~ 158 (305)
+.++||||-|+..+||.+|+++|++||.|++|.|.+|. .+.+||||||.|.+.+.|..|++.|||. .++|...++-|.
T Consensus 123 ~e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd~-~~~sRGcaFV~fstke~A~~Aika~ng~~tmeGcs~PLVVk 201 (510)
T KOG0144|consen 123 EERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRDP-DGLSRGCAFVKFSTKEMAVAAIKALNGTQTMEGCSQPLVVK 201 (510)
T ss_pred cchhhhhhhccccccHHHHHHHHHhhCccchhhheecc-cccccceeEEEEehHHHHHHHHHhhccceeeccCCCceEEE
Confidence 37899999999999999999999999999999999996 5999999999999999999999999986 678889999999
Q ss_pred ccCCC---------------------------------------------------------------------------
Q 021958 159 WASFG--------------------------------------------------------------------------- 163 (305)
Q Consensus 159 ~~~~~--------------------------------------------------------------------------- 163 (305)
|++..
T Consensus 202 FADtqkdk~~~~lqq~~~~~~qql~~~~~~~n~~~~~~l~~~~~~~~Qq~~~sqn~g~l~g~~~L~~l~a~~~qq~~~~~ 281 (510)
T KOG0144|consen 202 FADTQKDKDGKRLQQLNPALLQQLGNGQNPQNLASLGALSNGYQGPQQQTQQSQNVGTLGGLPPLGPLNATQLQQAAALA 281 (510)
T ss_pred ecccCCCchHHHHHhhhHHHHHHhcCCCCccchhhhhccCcccCchhhhccccCCCcccccccCCCCcchhHHHHHHHhh
Confidence 99765
Q ss_pred --------------------------------------------------------------------------------
Q 021958 164 -------------------------------------------------------------------------------- 163 (305)
Q Consensus 164 -------------------------------------------------------------------------------- 163 (305)
T Consensus 282 ~~~ta~q~~~~s~q~~pl~~qts~~~~~~~~~~~~~~ss~~~~s~~~~aq~~~~q~~p~t~~~~n~~~~~a~a~~~sp~a 361 (510)
T KOG0144|consen 282 AAATAAQKTASSTQGLPLRTQTSFPGSQTSPQSASAPSSSLSTSQNPLAQLGARQTFPGTPANYNLAGGMAGAGTTSPVA 361 (510)
T ss_pred hhcccccCCCCCcccCccccccCCccccCCCccccCccccCcccccchhhhhHhhcCCCCchhcccccccccccccCccc
Confidence
Q ss_pred -----------------------------------------------------CCCCCCCCCCceEEEcCCCcCCCHHHH
Q 021958 164 -----------------------------------------------------AGEKRDDTPDHTIFVGDLAADVTDYML 190 (305)
Q Consensus 164 -----------------------------------------------------~~~~~~~~~~~~l~v~nlp~~~~~~~l 190 (305)
.....+......|||.+||.++-+.||
T Consensus 362 a~~~~lq~~~ltp~~~~~~~~~tQa~q~~~q~a~~a~~~l~~q~~~~qq~~~~~~~q~eGpeGanlfiyhlPqefgdq~l 441 (510)
T KOG0144|consen 362 ASLANLQQIGLTPFAGAAALDHTQAMQQYAQSANLAAPGLVGQQATTQQAQMVGNGQVEGPEGANLFIYHLPQEFGDQDL 441 (510)
T ss_pred ccccccccccCCChhhhhhHhHHHhhhHhhhhhhhcccchhhhhHhhhhhhcccCccccCCCccceeeeeCchhhhhHHH
Confidence 000011145678999999999999999
Q ss_pred HHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCC
Q 021958 191 QETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 191 ~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
...|. .||.|.+.++..|+.||-++.|+||.|++..+|.+||..|||..|++++++|.....+.+.
T Consensus 442 ~~~f~-pfG~Vlsakvfidk~tnlskcfgfvSyen~~sa~~aI~amngfQig~KrlkVQlk~~~~np 507 (510)
T KOG0144|consen 442 IATFQ-PFGGVLSAKVFIDKVTNLSKCFGFVSYENAQSAQNAISAMNGFQIGSKRLKVQLKRDRNNP 507 (510)
T ss_pred HHHhc-cccceeEEEEEEecccCHhhhcCcccccchhhhHHHHHHhcchhhccccceEEeeeccCCC
Confidence 99999 9999999999999999999999999999999999999999999999999999987776554
No 31
>KOG4205 consensus RNA-binding protein musashi/mRNA cleavage and polyadenylation factor I complex, subunit HRP1 [RNA processing and modification]
Probab=99.89 E-value=3.1e-23 Score=176.75 Aligned_cols=176 Identities=24% Similarity=0.428 Sum_probs=151.7
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
.+.++||||+|.++++++.|+++|.+||+|.++.+++|+.+++++||+||+|++.+...+++.. .-..| .++.|...
T Consensus 4 ~~~~KlfiGgisw~ttee~Lr~yf~~~Gev~d~~vm~d~~t~rsrgFgfv~f~~~~~v~~vl~~-~~h~~--dgr~ve~k 80 (311)
T KOG4205|consen 4 GESGKLFIGGLSWETTEESLREYFSQFGEVTDCVVMRDPSTGRSRGFGFVTFATPEGVDAVLNA-RTHKL--DGRSVEPK 80 (311)
T ss_pred cCCcceeecCcCccccHHHHHHHhcccCceeeEEEeccCCCCCcccccceecCCCcchheeecc-ccccc--CCccccce
Confidence 3778999999999999999999999999999999999999999999999999999999888743 33344 45666666
Q ss_pred ccCCCCCCC--CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 159 WASFGAGEK--RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 159 ~~~~~~~~~--~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
.+..+.... .......+|||++|+.++++++++++|+ +||.|..+.++.|..+.+.+||+||.|.+.+++.+++. .
T Consensus 81 ~av~r~~~~~~~~~~~tkkiFvGG~~~~~~e~~~r~yfe-~~g~v~~~~~~~d~~~~~~rgFgfv~~~~e~sVdkv~~-~ 158 (311)
T KOG4205|consen 81 RAVSREDQTKVGRHLRTKKIFVGGLPPDTTEEDFKDYFE-QFGKVADVVIMYDKTTSRPRGFGFVTFDSEDSVDKVTL-Q 158 (311)
T ss_pred eccCcccccccccccceeEEEecCcCCCCchHHHhhhhh-ccceeEeeEEeecccccccccceeeEeccccccceecc-c
Confidence 665543322 2223577999999999999999999999 99999999999999999999999999999999999976 4
Q ss_pred CCceecceeeEEccCCCCCCCCC
Q 021958 237 NGVFCSTRPMRIGPATNKKTVSG 259 (305)
Q Consensus 237 ~~~~~~g~~l~v~~~~~~~~~~~ 259 (305)
+-+.|+|+.+.|..|.++.....
T Consensus 159 ~f~~~~gk~vevkrA~pk~~~~~ 181 (311)
T KOG4205|consen 159 KFHDFNGKKVEVKRAIPKEVMQS 181 (311)
T ss_pred ceeeecCceeeEeeccchhhccc
Confidence 77889999999999999877643
No 32
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.89 E-value=1e-21 Score=181.41 Aligned_cols=82 Identities=21% Similarity=0.380 Sum_probs=76.3
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...++|||+||+.++++++|+++|+.||.|.++++.+|+.+++++|||||+|.+.++|.+|++.||+..+ +|+.|+|.
T Consensus 202 ~~~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~~tgksKGfGFVeFe~~e~A~kAI~amNg~el--gGr~LrV~ 279 (612)
T TIGR01645 202 KKFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDL--GGQYLRVG 279 (612)
T ss_pred cccceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecCCCCCcCCeEEEEECCHHHHHHHHHHhCCCee--CCeEEEEE
Confidence 3457999999999999999999999999999999999998899999999999999999999999999998 67889988
Q ss_pred ccCC
Q 021958 159 WASF 162 (305)
Q Consensus 159 ~~~~ 162 (305)
++..
T Consensus 280 kAi~ 283 (612)
T TIGR01645 280 KCVT 283 (612)
T ss_pred ecCC
Confidence 7754
No 33
>KOG0110 consensus RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.88 E-value=1.6e-22 Score=182.84 Aligned_cols=172 Identities=20% Similarity=0.353 Sum_probs=146.4
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCC---CCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQT---GQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~---~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
++|||.||++++|.++|...|...|.|.++.|...+.. -.|.|||||+|.+.++|.+|++.|+|..++ |+.+.+.
T Consensus 516 t~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFVEF~~~e~A~~a~k~lqgtvld--GH~l~lk 593 (725)
T KOG0110|consen 516 TKLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFVEFAKPESAQAALKALQGTVLD--GHKLELK 593 (725)
T ss_pred hhhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEEEecCHHHHHHHHHHhcCceec--CceEEEE
Confidence 33999999999999999999999999999988876532 135599999999999999999999999995 5666666
Q ss_pred ccCCCC-----CCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHH
Q 021958 159 WASFGA-----GEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAM 233 (305)
Q Consensus 159 ~~~~~~-----~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~ 233 (305)
++.... ..........+|+|+|||+..+..+++++|. .||.|.+|+|......+.++|||||+|-+..+|.+|+
T Consensus 594 ~S~~k~~~~~gK~~~~kk~~tKIlVRNipFeAt~rEVr~LF~-aFGqlksvRlPKK~~k~a~rGF~Fv~f~t~~ea~nA~ 672 (725)
T KOG0110|consen 594 ISENKPASTVGKKKSKKKKGTKILVRNIPFEATKREVRKLFT-AFGQLKSVRLPKKIGKGAHRGFGFVDFLTPREAKNAF 672 (725)
T ss_pred eccCccccccccccccccccceeeeeccchHHHHHHHHHHHh-cccceeeeccchhhcchhhccceeeeccCcHHHHHHH
Confidence 665211 1112223367999999999999999999999 9999999999987666778999999999999999999
Q ss_pred HHhCCceecceeeEEccCCCCCC
Q 021958 234 TEMNGVFCSTRPMRIGPATNKKT 256 (305)
Q Consensus 234 ~~l~~~~~~g~~l~v~~~~~~~~ 256 (305)
.+|..+-+.||+|.+.|++....
T Consensus 673 ~al~STHlyGRrLVLEwA~~d~~ 695 (725)
T KOG0110|consen 673 DALGSTHLYGRRLVLEWAKSDNT 695 (725)
T ss_pred HhhcccceechhhheehhccchH
Confidence 99999999999999999987543
No 34
>KOG0123 consensus Polyadenylate-binding protein (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=99.86 E-value=4.8e-21 Score=169.08 Aligned_cols=168 Identities=24% Similarity=0.468 Sum_probs=146.3
Q ss_pred eEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccCC
Q 021958 83 TLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWASF 162 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~~ 162 (305)
.|||.||+.+++..+|.++|+.||.|.+|++..+. +| ++|| ||+|+++++|.+|++.+||..+ .++.+.+.....
T Consensus 78 ~~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~~-~g-~kg~-FV~f~~e~~a~~ai~~~ng~ll--~~kki~vg~~~~ 152 (369)
T KOG0123|consen 78 LVFIKNLDESIDNKSLYDTFSEFGNILSCKVATDE-NG-SKGY-FVQFESEESAKKAIEKLNGMLL--NGKKIYVGLFER 152 (369)
T ss_pred eeeecCCCcccCcHHHHHHHHhhcCeeEEEEEEcC-CC-ceee-EEEeCCHHHHHHHHHHhcCccc--CCCeeEEeeccc
Confidence 39999999999999999999999999999999996 35 9999 9999999999999999999999 566777665554
Q ss_pred CCCCC----CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCC
Q 021958 163 GAGEK----RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNG 238 (305)
Q Consensus 163 ~~~~~----~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~ 238 (305)
+.... .....-..++|.|+..+++++.|..+|. .||.|.++.++.+. .+++++|+||+|++.++|..|+..|++
T Consensus 153 ~~er~~~~~~~~~~~t~v~vk~~~~~~~~~~l~~~f~-~~g~i~s~~v~~~~-~g~~~~~gfv~f~~~e~a~~av~~l~~ 230 (369)
T KOG0123|consen 153 KEEREAPLGEYKKRFTNVYVKNLEEDSTDEELKDLFS-AYGSITSVAVMRDS-IGKSKGFGFVNFENPEDAKKAVETLNG 230 (369)
T ss_pred hhhhcccccchhhhhhhhheeccccccchHHHHHhhc-ccCcceEEEEeecC-CCCCCCccceeecChhHHHHHHHhccC
Confidence 32211 1223456799999999999999999999 99999999999986 677999999999999999999999999
Q ss_pred ceecceeeEEccCCCCCCC
Q 021958 239 VFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 239 ~~~~g~~l~v~~~~~~~~~ 257 (305)
..+++..+.|..+..+...
T Consensus 231 ~~~~~~~~~V~~aqkk~e~ 249 (369)
T KOG0123|consen 231 KIFGDKELYVGRAQKKSER 249 (369)
T ss_pred CcCCccceeecccccchhh
Confidence 9999999999988774433
No 35
>KOG4206 consensus Spliceosomal protein snRNP-U1A/U2B [RNA processing and modification]
Probab=99.83 E-value=4.3e-19 Score=140.97 Aligned_cols=161 Identities=21% Similarity=0.421 Sum_probs=140.7
Q ss_pred CceEEEcCCCcccCHHHHhH----hhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 81 IRTLWIGDLQYWMDETYLNT----CFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~----~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
..||||.||+..+..++|+. +|++||.|.+|...+ +.+.+|-|||.|.+.+.|-.|+..|+|..+ .|+.++
T Consensus 9 n~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~k---t~KmRGQA~VvFk~~~~As~A~r~l~gfpF--ygK~mr 83 (221)
T KOG4206|consen 9 NGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAFK---TPKMRGQAFVVFKETEAASAALRALQGFPF--YGKPMR 83 (221)
T ss_pred CceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEecC---CCCccCceEEEecChhHHHHHHHHhcCCcc--cCchhh
Confidence 34999999999999999999 999999999988765 568899999999999999999999999999 778899
Q ss_pred eeccCCCCCC----------------------------------------------CCCCCCCceEEEcCCCcCCCHHHH
Q 021958 157 LNWASFGAGE----------------------------------------------KRDDTPDHTIFVGDLAADVTDYML 190 (305)
Q Consensus 157 v~~~~~~~~~----------------------------------------------~~~~~~~~~l~v~nlp~~~~~~~l 190 (305)
+.++..+... .....+...||+.|||.+++.+.|
T Consensus 84 iqyA~s~sdii~~~~~~~v~~~~k~~~~~~~~~~~~~~~ng~~~~~~~~~~p~p~~~~~~ppn~ilf~~niP~es~~e~l 163 (221)
T KOG4206|consen 84 IQYAKSDSDIIAQAPGTFVEKEKKINGEILARIKQPLDTNGHFYNMNRMNLPPPFLAQMAPPNNILFLTNIPSESESEML 163 (221)
T ss_pred eecccCccchhhccCceeccccCccccccccccCCcccccccccccccccCCCCccccCCCCceEEEEecCCcchhHHHH
Confidence 8888764111 011346778999999999999999
Q ss_pred HHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceec-ceeeEEccCC
Q 021958 191 QETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCS-TRPMRIGPAT 252 (305)
Q Consensus 191 ~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~-g~~l~v~~~~ 252 (305)
..+|. .|...++++++... .+.|||+|.+...|..|...+.+..|. ...++|.+++
T Consensus 164 ~~lf~-qf~g~keir~i~~~-----~~iAfve~~~d~~a~~a~~~lq~~~it~~~~m~i~~a~ 220 (221)
T KOG4206|consen 164 SDLFE-QFPGFKEIRLIPPR-----SGIAFVEFLSDRQASAAQQALQGFKITKKNTMQITFAK 220 (221)
T ss_pred HHHHh-hCcccceeEeccCC-----CceeEEecchhhhhHHHhhhhccceeccCceEEecccC
Confidence 99999 99999999998765 789999999999999999999999885 8889998875
No 36
>KOG1457 consensus RNA binding protein (contains RRM repeats) [General function prediction only]
Probab=99.81 E-value=8.5e-19 Score=138.14 Aligned_cols=161 Identities=19% Similarity=0.283 Sum_probs=126.9
Q ss_pred CCCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEee-CCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCC-CCc
Q 021958 76 GQPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRN-KQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPN-GEQ 153 (305)
Q Consensus 76 ~~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~-~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~-~~~ 153 (305)
.++..-+||||.+||.++...+|+.+|..|-..+.+.+... +...-.+-+|||+|.+..+|..|++.|||.+++- .+.
T Consensus 29 ~~~~~VRTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~~~~~~pvaFatF~s~q~A~aamnaLNGvrFDpE~~s 108 (284)
T KOG1457|consen 29 DEPGAVRTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKGDQVCKPVAFATFTSHQFALAAMNALNGVRFDPETGS 108 (284)
T ss_pred ccccccceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCCCccccceEEEEecchHHHHHHHHHhcCeeeccccCc
Confidence 34567899999999999999999999999877766666543 3333455799999999999999999999999863 456
Q ss_pred ceeeeccCCCCCCC------------------------------------------------------------------
Q 021958 154 NFRLNWASFGAGEK------------------------------------------------------------------ 167 (305)
Q Consensus 154 ~i~v~~~~~~~~~~------------------------------------------------------------------ 167 (305)
.++++.++.....+
T Consensus 109 tLhiElAKSNtK~kr~k~sgtP~~s~al~~~~~~~~qr~sa~~qhd~~l~~p~~l~~~~~a~al~~~~~t~~~~l~a~~~ 188 (284)
T KOG1457|consen 109 TLHIELAKSNTKRKRRKGSGTPGSSPALVIDNRNKEQRKSADDQHDEGLSDPDELQEPGNADALKENDTTKSEALSAPDS 188 (284)
T ss_pred eeEeeehhcCcccccCCCCCCCCCCccccccccChhhcccchhhccccccCccccCCccccccCCCccccchhhhhhhhh
Confidence 67776665420000
Q ss_pred ----------------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHH
Q 021958 168 ----------------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 168 ----------------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~ 231 (305)
.....+.+|||.||..+++|++|+.+|+ .|-....++|... .| ...||++|++.+.|..
T Consensus 189 ~~P~a~a~l~ks~q~~~~~~acstlfianl~~~~~ed~l~~~~~-~~~gf~~l~~~~~--~g--~~vaf~~~~~~~~at~ 263 (284)
T KOG1457|consen 189 KAPSANAHLEKSSQGGSGARACSTLFIANLGPNCTEDELKQLLS-RYPGFHILKIRAR--GG--MPVAFADFEEIEQATD 263 (284)
T ss_pred cCCcccchhhhhhcccccchhhhhHhhhccCCCCCHHHHHHHHH-hCCCceEEEEecC--CC--cceEeecHHHHHHHHH
Confidence 0013467899999999999999999999 9998877777432 33 5579999999999999
Q ss_pred HHHHhCCcee
Q 021958 232 AMTEMNGVFC 241 (305)
Q Consensus 232 a~~~l~~~~~ 241 (305)
|+..|.|..|
T Consensus 264 am~~lqg~~~ 273 (284)
T KOG1457|consen 264 AMNHLQGNLL 273 (284)
T ss_pred HHHHhhccee
Confidence 9999998776
No 37
>KOG0147 consensus Transcriptional coactivator CAPER (RRM superfamily) [Transcription]
Probab=99.81 E-value=1.6e-19 Score=159.55 Aligned_cols=165 Identities=19% Similarity=0.359 Sum_probs=131.1
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
+.+.||||||..++++++|+.+|+.||.|..|.+.+|..+|+++||+|++|.+.++|++|++.|||..| .|+.|+|..
T Consensus 277 p~~rl~vgnLHfNite~~lr~ifepfg~Ie~v~l~~d~~tG~skgfGfi~f~~~~~ar~a~e~lngfel--AGr~ikV~~ 354 (549)
T KOG0147|consen 277 PMRRLYVGNLHFNITEDMLRGIFEPFGKIENVQLTKDSETGRSKGFGFITFVNKEDARKALEQLNGFEL--AGRLIKVSV 354 (549)
T ss_pred chhhhhhcccccCchHHHHhhhccCcccceeeeeccccccccccCcceEEEecHHHHHHHHHHhcccee--cCceEEEEE
Confidence 334499999999999999999999999999999999988999999999999999999999999999766 778887654
Q ss_pred cCCCCCCC--------------------------------------------------------------------CCC-
Q 021958 160 ASFGAGEK--------------------------------------------------------------------RDD- 170 (305)
Q Consensus 160 ~~~~~~~~--------------------------------------------------------------------~~~- 170 (305)
........ ...
T Consensus 355 v~~r~~~~~a~~~~~d~D~~d~~gl~~~~~g~~Ql~~kla~~~~~~~~s~~~~~l~~~~~~~~~~~~~~~~~~~~~~p~~ 434 (549)
T KOG0147|consen 355 VTERVDTKEAAVTQFDFDEDDRQGLSLGSGGRNQLMAKLAEGKGRSLPSTAISALLLLAKLASAAQFNGVVRVRSVDPAD 434 (549)
T ss_pred eeeecccccccccccccchhhccccccccccHHHHHHHHhccCCccccchhhhHHHhccccchHHhhcCCcCccccCccc
Confidence 43320000 000
Q ss_pred ------CCCceEEEcCC--CcCCC--------HHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHH
Q 021958 171 ------TPDHTIFVGDL--AADVT--------DYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMT 234 (305)
Q Consensus 171 ------~~~~~l~v~nl--p~~~~--------~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~ 234 (305)
.++.++.+.|+ |.+.| .||+.+.++ +||.|..|.|..+ +.|+.||.|.+.+.|..|++
T Consensus 435 ~~p~~~i~t~C~lL~nMFdpstete~n~d~eI~edV~Eec~-k~g~v~hi~vd~n-----s~g~VYvrc~s~~~A~~a~~ 508 (549)
T KOG0147|consen 435 ASPAFDIPTQCLLLSNMFDPSTETEPNWDQEIREDVIEECG-KHGKVCHIFVDKN-----SAGCVYVRCPSAEAAGTAVK 508 (549)
T ss_pred cccccCCccHHHHHhhcCCcccccCcchhhHHHHHHHHHHH-hcCCeeEEEEccC-----CCceEEEecCcHHHHHHHHH
Confidence 23445666665 11111 267888888 9999977766333 24899999999999999999
Q ss_pred HhCCceecceeeEEccCC
Q 021958 235 EMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 235 ~l~~~~~~g~~l~v~~~~ 252 (305)
+|||.+|.|+.|++.|-.
T Consensus 509 alhgrWF~gr~Ita~~~~ 526 (549)
T KOG0147|consen 509 ALHGRWFAGRMITAKYLP 526 (549)
T ss_pred HHhhhhhccceeEEEEee
Confidence 999999999999999854
No 38
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.80 E-value=6.6e-19 Score=135.88 Aligned_cols=87 Identities=33% Similarity=0.637 Sum_probs=81.5
Q ss_pred CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEc
Q 021958 170 DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIG 249 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~ 249 (305)
....++|||+||++++++++|+++|+ +||.|.+++|+.|..+++++|||||+|++.++|++|++.|++..|+|+.|+|+
T Consensus 31 ~~~~~~lfVgnL~~~~te~~L~~~F~-~~G~I~~v~i~~d~~tg~~kGfaFV~F~~~e~A~~Al~~lng~~i~Gr~l~V~ 109 (144)
T PLN03134 31 RLMSTKLFIGGLSWGTDDASLRDAFA-HFGDVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKELNGRHIRVN 109 (144)
T ss_pred cCCCCEEEEeCCCCCCCHHHHHHHHh-cCCCeEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHcCCCEECCEEEEEE
Confidence 34578999999999999999999999 99999999999999999999999999999999999999999999999999999
Q ss_pred cCCCCCCC
Q 021958 250 PATNKKTV 257 (305)
Q Consensus 250 ~~~~~~~~ 257 (305)
|++.+...
T Consensus 110 ~a~~~~~~ 117 (144)
T PLN03134 110 PANDRPSA 117 (144)
T ss_pred eCCcCCCC
Confidence 99876543
No 39
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.78 E-value=2e-18 Score=151.79 Aligned_cols=113 Identities=26% Similarity=0.343 Sum_probs=92.2
Q ss_pred CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEE
Q 021958 169 DDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRI 248 (305)
Q Consensus 169 ~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v 248 (305)
.....++|||+||+.++++++|+++|+ .||.|++|+|+.|..+++++|||||+|.+.++|++|++.|++..|.++.|+|
T Consensus 103 ~~~~~~~LfVgnLp~~~te~~L~~lF~-~~G~V~~v~i~~d~~tg~srGyaFVeF~~~e~A~~Ai~~LnG~~l~gr~i~V 181 (346)
T TIGR01659 103 TNNSGTNLIVNYLPQDMTDRELYALFR-TIGPINTCRIMRDYKTGYSFGYAFVDFGSEADSQRAIKNLNGITVRNKRLKV 181 (346)
T ss_pred CCCCCcEEEEeCCCCCCCHHHHHHHHH-hcCCEEEEEEEecCCCCccCcEEEEEEccHHHHHHHHHHcCCCccCCceeee
Confidence 445678999999999999999999999 9999999999999989999999999999999999999999999999999999
Q ss_pred ccCCCCCCCCCCCCCCCcceEEeCCcch----hhhHHHHhhccc
Q 021958 249 GPATNKKTVSGQQQYPKGTFLVVGTGFC----LQIKFKFKYFIG 288 (305)
Q Consensus 249 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~----~~~~~~~~~~~g 288 (305)
.|+++.... .....+++++... .++.+.|.+|+.
T Consensus 182 ~~a~p~~~~------~~~~~lfV~nLp~~vtee~L~~~F~~fG~ 219 (346)
T TIGR01659 182 SYARPGGES------IKDTNLYVTNLPRTITDDQLDTIFGKYGQ 219 (346)
T ss_pred ecccccccc------cccceeEEeCCCCcccHHHHHHHHHhcCC
Confidence 998764322 1233466664322 233455555543
No 40
>KOG1548 consensus Transcription elongation factor TAT-SF1 [Transcription]
Probab=99.77 E-value=3.7e-17 Score=136.82 Aligned_cols=172 Identities=16% Similarity=0.284 Sum_probs=139.7
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcE--------EEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCC
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVV--------AVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMP 149 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~--------~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~ 149 (305)
......|||+|||.++|.+++.++|++||.|. .|++.++. .|..+|-|++.|...+++..|++.|++..+
T Consensus 131 ~~~Nt~VYVsgLP~DiT~dE~~~~~sKcGiI~~d~~t~epk~KlYrd~-~G~lKGDaLc~y~K~ESVeLA~~ilDe~~~- 208 (382)
T KOG1548|consen 131 PKVNTSVYVSGLPLDITVDEFAEVMSKCGIIMRDPQTGEPKVKLYRDN-QGKLKGDALCCYIKRESVELAIKILDEDEL- 208 (382)
T ss_pred cccCceEEecCCCCcccHHHHHHHHHhcceEeccCCCCCeeEEEEecC-CCCccCceEEEeecccHHHHHHHHhCcccc-
Confidence 34456699999999999999999999999764 37888886 499999999999999999999999999999
Q ss_pred CCCcceeeeccCCCCCC----------------------------------CCCCCCCceEEEcCCC----cCCC-----
Q 021958 150 NGEQNFRLNWASFGAGE----------------------------------KRDDTPDHTIFVGDLA----ADVT----- 186 (305)
Q Consensus 150 ~~~~~i~v~~~~~~~~~----------------------------------~~~~~~~~~l~v~nlp----~~~~----- 186 (305)
+|+.++|..|.+.... .......++|.|.|+= ...+
T Consensus 209 -rg~~~rVerAkfq~Kge~~~~~k~k~k~~~~kk~~k~q~k~~dw~pd~~~~sk~r~~~tVi~kn~Ftp~~~~~~~~l~~ 287 (382)
T KOG1548|consen 209 -RGKKLRVERAKFQMKGEYDASKKEKGKCKDKKKLKKQQQKLLDWRPDRDDPSKARADRTVILKNMFTPEDFEKNPDLLN 287 (382)
T ss_pred -cCcEEEEehhhhhhccCcCcccccccccccHHHHHHHHHhhcccCCCccccccccCCcEEEeeecCCHHHhccCHHHHH
Confidence 6899999888764100 1112346788899871 1222
Q ss_pred --HHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCC
Q 021958 187 --DYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 187 --~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
.++|++.++ +||.|.+|.|.- ....|.+.|.|.+.++|..|++.|+|+.|+||.|..+....+...
T Consensus 288 dlkedl~eec~-K~G~v~~vvv~d----~hPdGvvtV~f~n~eeA~~ciq~m~GR~fdgRql~A~i~DG~t~~ 355 (382)
T KOG1548|consen 288 DLKEDLTEECE-KFGQVRKVVVYD----RHPDGVVTVSFRNNEEADQCIQTMDGRWFDGRQLTASIWDGKTKF 355 (382)
T ss_pred HHHHHHHHHHH-HhCCcceEEEec----cCCCceeEEEeCChHHHHHHHHHhcCeeecceEEEEEEeCCccee
Confidence 367888898 999999998863 345889999999999999999999999999999999876655444
No 41
>KOG0105 consensus Alternative splicing factor ASF/SF2 (RRM superfamily) [RNA processing and modification]
Probab=99.77 E-value=2e-17 Score=126.82 Aligned_cols=151 Identities=17% Similarity=0.231 Sum_probs=124.8
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...++|||||||.++.+.+|+++|-+||.|.+|.+... ...-.||||+|++..+|+.|+..-+|..+ .+..++|.
T Consensus 4 r~~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r---~g~ppfafVeFEd~RDAeDAiygRdGYdy--dg~rLRVE 78 (241)
T KOG0105|consen 4 RNSRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNR---PGPPPFAFVEFEDPRDAEDAIYGRDGYDY--DGCRLRVE 78 (241)
T ss_pred cccceEEecCCCcchhhccHHHHHhhhcceEEEEeccC---CCCCCeeEEEecCccchhhhhhccccccc--CcceEEEE
Confidence 46789999999999999999999999999999987543 23456999999999999999998899988 45677777
Q ss_pred ccCCCCCCC----------------------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcc
Q 021958 159 WASFGAGEK----------------------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTK 216 (305)
Q Consensus 159 ~~~~~~~~~----------------------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~ 216 (305)
++....... ........|.|.+||..-+.+||+.++. +-|.|....+.+|
T Consensus 79 fprggr~s~~~~G~y~gggrgGgg~gg~rgppsrrSe~RVvVsGLp~SgSWQDLKDHmR-eaGdvCfadv~rD------- 150 (241)
T KOG0105|consen 79 FPRGGRSSSDRRGSYSGGGRGGGGGGGRRGPPSRRSEYRVVVSGLPPSGSWQDLKDHMR-EAGDVCFADVQRD------- 150 (241)
T ss_pred eccCCCcccccccccCCCCCCCCCCCcccCCcccccceeEEEecCCCCCchHHHHHHHH-hhCCeeeeeeecc-------
Confidence 765431110 0112345799999999999999999998 9999998888766
Q ss_pred cEEEEEeCCHHHHHHHHHHhCCceec
Q 021958 217 GYGFVRFGDESEQLRAMTEMNGVFCS 242 (305)
Q Consensus 217 g~afV~f~~~~~A~~a~~~l~~~~~~ 242 (305)
|++.|+|...|+-+-|+..|....+.
T Consensus 151 g~GvV~~~r~eDMkYAvr~ld~~~~~ 176 (241)
T KOG0105|consen 151 GVGVVEYLRKEDMKYAVRKLDDQKFR 176 (241)
T ss_pred cceeeeeeehhhHHHHHHhhcccccc
Confidence 37899999999999999999887653
No 42
>KOG4211 consensus Splicing factor hnRNP-F and related RNA-binding proteins [RNA processing and modification]
Probab=99.76 E-value=1.7e-17 Score=144.89 Aligned_cols=166 Identities=17% Similarity=0.266 Sum_probs=131.8
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
....-|-+.+||+++|++||.++|+.++ |+++.+.+ .+|+..|-|||+|.+++++++|+++ +...+ ..+.|.|-
T Consensus 8 ~~~~~vr~rGLPwsat~~ei~~Ff~~~~-I~~~~~~r--~~Gr~sGeA~Ve~~seedv~~Alkk-dR~~m--g~RYIEVf 81 (510)
T KOG4211|consen 8 STAFEVRLRGLPWSATEKEILDFFSNCG-IENLEIPR--RNGRPSGEAYVEFTSEEDVEKALKK-DRESM--GHRYIEVF 81 (510)
T ss_pred CcceEEEecCCCccccHHHHHHHHhcCc-eeEEEEec--cCCCcCcceEEEeechHHHHHHHHh-hHHHh--CCceEEEE
Confidence 3445678899999999999999999985 67765555 4799999999999999999999965 76666 67778876
Q ss_pred ccCCCCCCC-------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeE-EEEEecCCCCCcccEEEEEeCCHHHHH
Q 021958 159 WASFGAGEK-------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKG-AKVVIDRLTGRTKGYGFVRFGDESEQL 230 (305)
Q Consensus 159 ~~~~~~~~~-------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~-i~i~~~~~~~~~~g~afV~f~~~~~A~ 230 (305)
-+.....+. ........|.+++||+.++++||.++|+ -.-.|.. |.++.+. .+++.|-|||+|++.+.|+
T Consensus 82 ~~~~~e~d~~~~~~g~~s~~~d~vVRLRGLPfscte~dI~~FFa-GL~Iv~~gi~l~~d~-rgR~tGEAfVqF~sqe~ae 159 (510)
T KOG4211|consen 82 TAGGAEADWVMRPGGPNSSANDGVVRLRGLPFSCTEEDIVEFFA-GLEIVPDGILLPMDQ-RGRPTGEAFVQFESQESAE 159 (510)
T ss_pred ccCCccccccccCCCCCCCCCCceEEecCCCccCcHHHHHHHhc-CCcccccceeeeccC-CCCcccceEEEecCHHHHH
Confidence 664332211 1224667899999999999999999998 5555554 4455554 7889999999999999999
Q ss_pred HHHHHhCCceecceeeEEccCCC
Q 021958 231 RAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 231 ~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
+|+.. |...|+-|-|.|..+..
T Consensus 160 ~Al~r-hre~iGhRYIEvF~Ss~ 181 (510)
T KOG4211|consen 160 IALGR-HRENIGHRYIEVFRSSR 181 (510)
T ss_pred HHHHH-HHHhhccceEEeehhHH
Confidence 99986 77889999999976654
No 43
>KOG4212 consensus RNA-binding protein hnRNP-M [RNA processing and modification]
Probab=99.75 E-value=5.1e-17 Score=139.72 Aligned_cols=168 Identities=17% Similarity=0.303 Sum_probs=140.3
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhh-hcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFA-HTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~-~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
...+.+||.|||+++.+.+|+++|. +.|.|..|.+..|. +|+++|||.|||+++|.+++|++.|+...+ .++.+.+
T Consensus 42 ~r~R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D~-~GK~rGcavVEFk~~E~~qKa~E~lnk~~~--~GR~l~v 118 (608)
T KOG4212|consen 42 ARDRSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFDE-SGKARGCAVVEFKDPENVQKALEKLNKYEV--NGRELVV 118 (608)
T ss_pred cccceEEEecCcchhhhHhHHHHHHHhcCceEeeeeeccc-CCCcCCceEEEeeCHHHHHHHHHHhhhccc--cCceEEE
Confidence 3456799999999999999999996 67899999999996 799999999999999999999999999888 5566655
Q ss_pred eccCCC-------------------------------------------CCC------C---------------------
Q 021958 158 NWASFG-------------------------------------------AGE------K--------------------- 167 (305)
Q Consensus 158 ~~~~~~-------------------------------------------~~~------~--------------------- 167 (305)
.-.... ... +
T Consensus 119 KEd~d~q~~~~~~~~r~g~~~f~~~~~~q~G~~~l~~~g~ggG~~~Rg~~~~D~Dr~sr~~~t~t~~~~~~~~~~~~lfg 198 (608)
T KOG4212|consen 119 KEDHDEQRDQYGRIVRDGGGGFGGGGGVQGGNGGLNGGGGGGGDRDRGFSRRDDDRLSRRNNTNTMSNDYNNSSNYNLFG 198 (608)
T ss_pred eccCchhhhhhhheeeccCcccccCcceecccccccccCCCCccccCCCCcccccccccccCccccccccccchhhhccc
Confidence 433221 000 0
Q ss_pred -----------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 168 -----------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 168 -----------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
-......++||+||.+.+....|++.|. .-|.|+.+.+-.|+ .|.++|+|.++|++.-+|..||..|
T Consensus 199 l~~~Flr~~h~f~pPl~~k~fvanl~~~vg~~kL~qvfg-mAGkv~~vdf~idK-eG~s~G~~vi~y~hpveavqaIsml 276 (608)
T KOG4212|consen 199 LSASFLRSLHIFSPPLHNKVFVANLDYKVGNKKLKQVFG-MAGKVQSVDFSIDK-EGNSRGFAVIEYDHPVEAVQAISML 276 (608)
T ss_pred chhhhhhhccCCCCCccceeeeeccccccchHHHHHHhc-cceeeeeeceeecc-ccccCCeeEEEecchHHHHHHHHhh
Confidence 0113456899999999999999999998 99999999999997 5689999999999999999999999
Q ss_pred CCceecceeeEEccC
Q 021958 237 NGVFCSTRPMRIGPA 251 (305)
Q Consensus 237 ~~~~~~g~~l~v~~~ 251 (305)
++.-+.+++..+...
T Consensus 277 ~~~g~~~~~~~~Rl~ 291 (608)
T KOG4212|consen 277 DRQGLFDRRMTVRLD 291 (608)
T ss_pred ccCCCccccceeecc
Confidence 986677777777663
No 44
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.74 E-value=2.2e-17 Score=127.37 Aligned_cols=84 Identities=31% Similarity=0.542 Sum_probs=78.6
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
....++|||+||+.++++++|+++|++||.|.++.++.|+.+++++|||||+|.+.++|++|++.|++..| .++.|+|
T Consensus 31 ~~~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV~F~~~e~A~~Al~~lng~~i--~Gr~l~V 108 (144)
T PLN03134 31 RLMSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKEL--NGRHIRV 108 (144)
T ss_pred cCCCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHcCCCEE--CCEEEEE
Confidence 45577999999999999999999999999999999999999999999999999999999999999999999 6789999
Q ss_pred eccCCC
Q 021958 158 NWASFG 163 (305)
Q Consensus 158 ~~~~~~ 163 (305)
+|+..+
T Consensus 109 ~~a~~~ 114 (144)
T PLN03134 109 NPANDR 114 (144)
T ss_pred EeCCcC
Confidence 998754
No 45
>KOG0110 consensus RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.71 E-value=3.4e-16 Score=142.27 Aligned_cols=165 Identities=17% Similarity=0.269 Sum_probs=131.1
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...+.++|+|||..+..++|..+|..||.|..+.+... | --++|+|.+..+|+.|.+.|....+ ....+.+.
T Consensus 383 rs~~vil~kNlpa~t~~~elt~~F~~fG~i~rvllp~~---G---~~aiv~fl~p~eAr~Afrklaysr~--k~~plyle 454 (725)
T KOG0110|consen 383 RSDTVILVKNLPAGTLSEELTEAFLRFGEIGRVLLPPG---G---TGAIVEFLNPLEARKAFRKLAYSRF--KSAPLYLE 454 (725)
T ss_pred hhcceeeeccCccccccHHHHHHhhcccccceeecCcc---c---ceeeeeecCccchHHHHHHhchhhh--ccCccccc
Confidence 34578999999999999999999999999988844322 1 1499999999999999999888877 34455554
Q ss_pred ccCCCCCC---------------------------------CC------------CC-CCCceEEEcCCCcCCCHHHHHH
Q 021958 159 WASFGAGE---------------------------------KR------------DD-TPDHTIFVGDLAADVTDYMLQE 192 (305)
Q Consensus 159 ~~~~~~~~---------------------------------~~------------~~-~~~~~l~v~nlp~~~~~~~l~~ 192 (305)
|+...... .. .. ...++|||.||+++.+.++|..
T Consensus 455 ~aP~dvf~~~pka~~~~~e~~~~~ee~~~Er~s~~d~~v~eD~d~te~ss~a~~a~~~~~~t~lfvkNlnf~Tt~e~l~~ 534 (725)
T KOG0110|consen 455 WAPEDVFTEDPKADDLSAESRSKMEENPSERVSAEDGQVEEDKDPTEESSLARVAEDEETETKLFVKNLNFDTTLEDLED 534 (725)
T ss_pred cChhhhccCCccccccccccccccccCcceecccccccccccCCccccccchhhhhccccchhhhhhcCCcccchhHHHH
Confidence 44321000 00 00 1123399999999999999999
Q ss_pred HHhhcCCCeeEEEEEecCCC---CCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCC
Q 021958 193 TFRARYPSTKGAKVVIDRLT---GRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 193 ~f~~~~G~i~~i~i~~~~~~---~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
+|. .+|.|.++.|...+.. -.|.|||||+|.+.++|.+|++.|+|+.|+|+.|.|+++.
T Consensus 535 ~F~-k~G~VlS~~I~kkkd~~~k~lSmGfgFVEF~~~e~A~~a~k~lqgtvldGH~l~lk~S~ 596 (725)
T KOG0110|consen 535 LFS-KQGTVLSIEISKKKDPANKYLSMGFGFVEFAKPESAQAALKALQGTVLDGHKLELKISE 596 (725)
T ss_pred HHH-hcCeEEEEEEeccccccccccccceeEEEecCHHHHHHHHHHhcCceecCceEEEEecc
Confidence 999 9999999988765422 1356999999999999999999999999999999999998
No 46
>KOG0106 consensus Alternative splicing factor SRp55/B52/SRp75 (RRM superfamily) [RNA processing and modification]
Probab=99.70 E-value=3.1e-17 Score=131.76 Aligned_cols=152 Identities=18% Similarity=0.363 Sum_probs=127.4
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
..||||+|++.+.+.+|+.+|..||.+.++.+.. ||+||+|.+..+|..|+..+++..+.+.. +.+.|+.
T Consensus 2 ~rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~mk~--------gf~fv~fed~rda~Dav~~l~~~~l~~e~--~vve~~r 71 (216)
T KOG0106|consen 2 PRVYIGRLPYRARERDVERFFKGYGKIPDADMKN--------GFGFVEFEDPRDADDAVHDLDGKELCGER--LVVEHAR 71 (216)
T ss_pred CceeecccCCccchhHHHHHHhhccccccceeec--------ccceeccCchhhhhcccchhcCceeccee--eeeeccc
Confidence 4799999999999999999999999998887643 58999999999999999999999996533 7777776
Q ss_pred CCCCC----------------CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCC
Q 021958 162 FGAGE----------------KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGD 225 (305)
Q Consensus 162 ~~~~~----------------~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~ 225 (305)
..... .......+.+.|.|+..++.+.+|.+.|. .+|.+..... ..+++||+|++
T Consensus 72 ~~~~~~g~~~~g~r~~~~~~~~~p~~s~~r~~~~~~~~r~~~qdl~d~~~-~~g~~~~~~~--------~~~~~~v~Fs~ 142 (216)
T KOG0106|consen 72 GKRRGRGRPRGGDRRSDSRRYRPPSRTHFRLIVRNLSLRVSWQDLKDHFR-PAGEVTYVDA--------RRNFAFVEFSE 142 (216)
T ss_pred ccccccCCCCCCCccchhhccCCcccccceeeeccchhhhhHHHHhhhhc-ccCCCchhhh--------hccccceeehh
Confidence 32100 01134567799999999999999999999 9999955444 26689999999
Q ss_pred HHHHHHHHHHhCCceecceeeEEccCC
Q 021958 226 ESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 226 ~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
.++|.+|+..|++..+.|+.|++....
T Consensus 143 ~~da~ra~~~l~~~~~~~~~l~~~~~~ 169 (216)
T KOG0106|consen 143 QEDAKRALEKLDGKKLNGRRISVEKNS 169 (216)
T ss_pred hhhhhhcchhccchhhcCceeeecccC
Confidence 999999999999999999999995433
No 47
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.69 E-value=2e-16 Score=107.20 Aligned_cols=70 Identities=34% Similarity=0.659 Sum_probs=67.2
Q ss_pred EEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 176 IFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
|||+|||.++++++|+++|+ .||.|..+.+..+ .++..+++|||+|.+.++|++|++.|+|..++|+.|+
T Consensus 1 l~v~nlp~~~t~~~l~~~f~-~~g~i~~~~~~~~-~~~~~~~~a~V~F~~~~~a~~a~~~l~g~~~~~~~ir 70 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFS-QFGKIESIKVMRN-SSGKSKGYAFVEFESEEDAEKALEELNGKKINGRKIR 70 (70)
T ss_dssp EEEESETTTSSHHHHHHHHH-TTSTEEEEEEEEE-TTSSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred cEEcCCCCcCCHHHHHHHHH-Hhhhccccccccc-ccccccceEEEEEcCHHHHHHHHHHcCCCEECccCcC
Confidence 79999999999999999999 9999999999998 5889999999999999999999999999999999986
No 48
>KOG0122 consensus Translation initiation factor 3, subunit g (eIF-3g) [Translation, ribosomal structure and biogenesis]
Probab=99.67 E-value=2.6e-16 Score=125.94 Aligned_cols=83 Identities=25% Similarity=0.415 Sum_probs=80.0
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
...++|.|.||+.+++|++|+++|. .||.|.++.+.+|++||.++|||||.|.+.++|.+||+.|||.-++.-.|+|.|
T Consensus 187 ~D~~tvRvtNLsed~~E~dL~eLf~-~fg~i~rvylardK~TG~~kGFAFVtF~sRddA~rAI~~LnG~gyd~LILrvEw 265 (270)
T KOG0122|consen 187 DDEATVRVTNLSEDMREDDLEELFR-PFGPITRVYLARDKETGLSKGFAFVTFESRDDAARAIADLNGYGYDNLILRVEW 265 (270)
T ss_pred CccceeEEecCccccChhHHHHHhh-ccCccceeEEEEccccCcccceEEEEEecHHHHHHHHHHccCcccceEEEEEEe
Confidence 3568899999999999999999999 999999999999999999999999999999999999999999999999999999
Q ss_pred CCCC
Q 021958 251 ATNK 254 (305)
Q Consensus 251 ~~~~ 254 (305)
++++
T Consensus 266 skP~ 269 (270)
T KOG0122|consen 266 SKPS 269 (270)
T ss_pred cCCC
Confidence 9986
No 49
>KOG0124 consensus Polypyrimidine tract-binding protein PUF60 (RRM superfamily) [RNA processing and modification]
Probab=99.66 E-value=1.5e-15 Score=128.10 Aligned_cols=172 Identities=19% Similarity=0.288 Sum_probs=139.3
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
+....++|||..+..+.+++||+..|+.||+|..|.+-++...+..+||+|++|.+...-..|+..+|-..+ +|..++
T Consensus 206 eAk~fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~pt~~~HkGyGfiEy~n~qs~~eAiasMNlFDL--GGQyLR 283 (544)
T KOG0124|consen 206 EAKKFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDL--GGQYLR 283 (544)
T ss_pred HHHhhheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccCCCCCccceeeEEeccccchHHHhhhcchhhc--ccceEe
Confidence 345678999999999999999999999999999999999998899999999999999999999998887777 788888
Q ss_pred eeccCCC-------------------------------------------------------------------------
Q 021958 157 LNWASFG------------------------------------------------------------------------- 163 (305)
Q Consensus 157 v~~~~~~------------------------------------------------------------------------- 163 (305)
+..+-..
T Consensus 284 VGk~vTPP~aLl~Pat~s~~P~aaaVAaAAaTAKi~A~eAvAg~avlg~~G~~~~vSpA~~aa~p~~~l~qa~~a~~~pg 363 (544)
T KOG0124|consen 284 VGKCVTPPDALLQPATVSAIPAAAAVAAAAATAKIMAAEAVAGSAVLGTVGAPGLVSPAPRAAQPLGTLPQAVMAAQAPG 363 (544)
T ss_pred cccccCCCchhcCCCCcccCchHHHHHHHHHHHHHHHHHHhccCCcccccCCccccCccccccCCCCCccccchhccCCc
Confidence 8766443
Q ss_pred -------------------------------------CCCC------------------------------------CCC
Q 021958 164 -------------------------------------AGEK------------------------------------RDD 170 (305)
Q Consensus 164 -------------------------------------~~~~------------------------------------~~~ 170 (305)
..++ -..
T Consensus 364 vi~~vtP~~P~iP~~i~p~g~v~P~LA~ppT~g~L~kkkeKe~eelqpkl~~~~~L~~QE~msI~G~sARhlvMqkLmR~ 443 (544)
T KOG0124|consen 364 VITGVTPARPPIPVTIPPVGVVNPILASPPTLGLLEKKKEKEEEELQPKLERPEMLSEQEHMSISGSSARHLVMQKLMRK 443 (544)
T ss_pred eeccCCCCCCCCCccCCCcceechhhcCCCchhhcchhhhhhHhhhcccccCHHHhhhhhCccccCccHHHHHHHHHhcc
Confidence 0000 001
Q ss_pred CCCceEEEcCC--CcCCCH---HHHHHHHhhcCCCeeEEEEEecCCCCCc----ccEEEEEeCCHHHHHHHHHHhCCcee
Q 021958 171 TPDHTIFVGDL--AADVTD---YMLQETFRARYPSTKGAKVVIDRLTGRT----KGYGFVRFGDESEQLRAMTEMNGVFC 241 (305)
Q Consensus 171 ~~~~~l~v~nl--p~~~~~---~~l~~~f~~~~G~i~~i~i~~~~~~~~~----~g~afV~f~~~~~A~~a~~~l~~~~~ 241 (305)
..++.+.++|+ |.+++| .+|.+.+. +||.|.++.|...+.++.. ----||+|+...++.+|+..|+|+.|
T Consensus 444 ~~S~VivLRNMV~P~DiDe~LegEi~EECg-KfG~V~rViI~nekq~e~edaeiiVKIFVefS~~~e~~rak~ALdGRfF 522 (544)
T KOG0124|consen 444 QESTVIVLRNMVDPKDIDEDLEGEITEECG-KFGAVNRVIIYNEKQGEEEDAEIIVKIFVEFSIASETHRAKQALDGRFF 522 (544)
T ss_pred ccCcEEEEeccCChhhhhhHHHHHHHHHHh-cccceeEEEEEecccccccchhhhheeeeeechhhHHHHHHHhhcccee
Confidence 35677888886 556654 68999998 9999999988766533311 12369999999999999999999999
Q ss_pred cceeeEEccC
Q 021958 242 STRPMRIGPA 251 (305)
Q Consensus 242 ~g~~l~v~~~ 251 (305)
+||++..+..
T Consensus 523 gGr~VvAE~Y 532 (544)
T KOG0124|consen 523 GGRKVVAEVY 532 (544)
T ss_pred cCceeehhhh
Confidence 9999987643
No 50
>KOG1190 consensus Polypyrimidine tract-binding protein [RNA processing and modification]
Probab=99.66 E-value=2.9e-15 Score=128.03 Aligned_cols=161 Identities=16% Similarity=0.259 Sum_probs=133.9
Q ss_pred CceEEEcCCC-cccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 81 IRTLWIGDLQ-YWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 81 ~~~l~v~nLp-~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
...|.|.||. ..+|.+-|..+|+-||.|.+|+|..++.+ -|+|++.+...|.-|++.|+|..+ .++.|++.+
T Consensus 297 n~vllvsnln~~~VT~d~LftlFgvYGdVqRVkil~nkkd-----~ALIQmsd~~qAqLA~~hL~g~~l--~gk~lrvt~ 369 (492)
T KOG1190|consen 297 NVVLLVSNLNEEAVTPDVLFTLFGVYGDVQRVKILYNKKD-----NALIQMSDGQQAQLAMEHLEGHKL--YGKKLRVTL 369 (492)
T ss_pred ceEEEEecCchhccchhHHHHHHhhhcceEEEEeeecCCc-----ceeeeecchhHHHHHHHHhhccee--cCceEEEee
Confidence 6788999996 77899999999999999999999998643 699999999999999999999999 678999988
Q ss_pred cCCCCCCC-------------------------------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEe
Q 021958 160 ASFGAGEK-------------------------------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVI 208 (305)
Q Consensus 160 ~~~~~~~~-------------------------------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~ 208 (305)
++.....- ....++.+|++.|+|.+++||+|++.|.+..|.|+-.++.
T Consensus 370 SKH~~vqlp~egq~d~glT~dy~~spLhrfkkpgsKN~~ni~PpsatlHlsnip~svsee~lk~~f~~~g~~vkafkff- 448 (492)
T KOG1190|consen 370 SKHTNVQLPREGQEDQGLTKDYGNSPLHRFKKPGSKNYQNIFPPSATLHLSNIPPSVSEEDLKNLFQEPGGQVKAFKFF- 448 (492)
T ss_pred ccCccccCCCCCCccccccccCCCCchhhccCcccccccccCCchhheeeccCCcccchhHHHHhhhcCCceEEeeeec-
Confidence 87641110 0114677999999999999999999999444444544442
Q ss_pred cCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecce-eeEEccCCC
Q 021958 209 DRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTR-PMRIGPATN 253 (305)
Q Consensus 209 ~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~-~l~v~~~~~ 253 (305)
++.+.+|++.+++.|+|..|+-.+|++.+++. .++|+|++.
T Consensus 449 ----~kd~kmal~q~~sveeA~~ali~~hnh~lgen~hlRvSFSks 490 (492)
T KOG1190|consen 449 ----QKDRKMALPQLESVEEAIQALIDLHNHYLGENHHLRVSFSKS 490 (492)
T ss_pred ----CCCcceeecccCChhHhhhhccccccccCCCCceEEEEeecc
Confidence 33477999999999999999999999998765 999999875
No 51
>KOG0126 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.65 E-value=2.2e-17 Score=126.22 Aligned_cols=109 Identities=22% Similarity=0.436 Sum_probs=90.7
Q ss_pred HHHhCCCCCCCCCcceeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEE
Q 021958 140 LQTFNGTPMPNGEQNFRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYG 219 (305)
Q Consensus 140 ~~~l~g~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~a 219 (305)
+..||.+++..+.. -+.+|.. ....+..|||||||+++||.||..+|+ +||+|.+|.+++|+.||+++|||
T Consensus 10 i~~lne~Elq~g~~-~~~SWH~-------~YkdsA~Iyiggl~~~LtEgDil~VFS-qyGe~vdinLiRDk~TGKSKGFa 80 (219)
T KOG0126|consen 10 IQKLNERELQLGIA-DKKSWHQ-------EYKDSAYIYIGGLPYELTEGDILCVFS-QYGEIVDINLIRDKKTGKSKGFA 80 (219)
T ss_pred HHHhhHHhhccccc-cccchhh-------hcccceEEEECCCcccccCCcEEEEee-ccCceEEEEEEecCCCCcccceE
Confidence 44556555543222 2455553 234468999999999999999999999 99999999999999999999999
Q ss_pred EEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCC
Q 021958 220 FVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 220 fV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
|+.|++..+..-|+..|||..|.||.|+|+........
T Consensus 81 FLcYEDQRSTILAVDN~NGiki~gRtirVDHv~~Yk~p 118 (219)
T KOG0126|consen 81 FLCYEDQRSTILAVDNLNGIKILGRTIRVDHVSNYKKP 118 (219)
T ss_pred EEEecCccceEEEEeccCCceecceeEEeeecccccCC
Confidence 99999999999999999999999999999987665544
No 52
>KOG0149 consensus Predicted RNA-binding protein SEB4 (RRM superfamily) [General function prediction only]
Probab=99.64 E-value=3.5e-16 Score=124.78 Aligned_cols=81 Identities=33% Similarity=0.458 Sum_probs=74.2
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
..-.+||||+|++++..|+|+++|+ +||+|.+..|+.|+.+|++||||||+|.|.++|.+|++.- +-.|+||+.-|+.
T Consensus 10 T~~TKifVggL~w~T~~~~l~~yFe-qfGeI~eavvitd~~t~rskGyGfVTf~d~~aa~rAc~dp-~piIdGR~aNcnl 87 (247)
T KOG0149|consen 10 TTFTKIFVGGLAWETHKETLRRYFE-QFGEIVEAVVITDKNTGRSKGYGFVTFRDAEAATRACKDP-NPIIDGRKANCNL 87 (247)
T ss_pred ceEEEEEEcCcccccchHHHHHHHH-HhCceEEEEEEeccCCccccceeeEEeecHHHHHHHhcCC-CCcccccccccch
Confidence 3457899999999999999999999 9999999999999999999999999999999999999863 3669999999988
Q ss_pred CCC
Q 021958 251 ATN 253 (305)
Q Consensus 251 ~~~ 253 (305)
+.-
T Consensus 88 A~l 90 (247)
T KOG0149|consen 88 ASL 90 (247)
T ss_pred hhh
Confidence 765
No 53
>KOG0125 consensus Ataxin 2-binding protein (RRM superfamily) [General function prediction only]
Probab=99.64 E-value=1e-15 Score=127.45 Aligned_cols=89 Identities=18% Similarity=0.377 Sum_probs=80.0
Q ss_pred CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 168 RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 168 ~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
......++|+|.|||+.+.|-||+.+|+ +||.|.+|.|+.+. --|||||||+|++.+||++|.++|||..+.||+|.
T Consensus 91 ~s~~~pkRLhVSNIPFrFRdpDL~aMF~-kfG~VldVEIIfNE--RGSKGFGFVTmen~~dadRARa~LHgt~VEGRkIE 167 (376)
T KOG0125|consen 91 SSKDTPKRLHVSNIPFRFRDPDLRAMFE-KFGKVLDVEIIFNE--RGSKGFGFVTMENPADADRARAELHGTVVEGRKIE 167 (376)
T ss_pred CCCCCCceeEeecCCccccCccHHHHHH-hhCceeeEEEEecc--CCCCccceEEecChhhHHHHHHHhhcceeeceEEE
Confidence 3345568999999999999999999999 99999999999984 34599999999999999999999999999999999
Q ss_pred EccCCCCCCCCC
Q 021958 248 IGPATNKKTVSG 259 (305)
Q Consensus 248 v~~~~~~~~~~~ 259 (305)
|..++.+....+
T Consensus 168 Vn~ATarV~n~K 179 (376)
T KOG0125|consen 168 VNNATARVHNKK 179 (376)
T ss_pred EeccchhhccCC
Confidence 999998765544
No 54
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.63 E-value=3.2e-15 Score=101.35 Aligned_cols=70 Identities=24% Similarity=0.556 Sum_probs=65.0
Q ss_pred EEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 176 IFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
|+|+|||.++++++|+++|+ .||.|..+++..++. +..+++|||+|.+.++|.+|++.+++..++|+.|+
T Consensus 1 v~i~nlp~~~~~~~l~~~f~-~~g~v~~v~~~~~~~-~~~~~~a~v~f~~~~~a~~al~~~~~~~~~g~~l~ 70 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFS-RFGPVEKVRLIKNKD-GQSRGFAFVEFSSEEDAKRALELLNGKEIDGRKLR 70 (70)
T ss_dssp EEEESSTTT--HHHHHHHCT-TSSBEEEEEEEESTT-SSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred CEEeCCCCCCCHHHHHHHHH-hcCCcceEEEEeeec-cccCCEEEEEeCCHHHHHHHHHHCCCcEECCEEcC
Confidence 79999999999999999999 999999999999976 89999999999999999999999999999999885
No 55
>KOG0121 consensus Nuclear cap-binding protein complex, subunit CBP20 (RRM superfamily) [RNA processing and modification]
Probab=99.63 E-value=9.8e-16 Score=110.54 Aligned_cols=81 Identities=22% Similarity=0.375 Sum_probs=76.9
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
.+++|||+||.+.++||+|.++|+ ++|.|..|.+-.|+.+...-|||||+|.+.++|+.|++-++|..++.+.|+|+|.
T Consensus 35 ~S~tvyVgNlSfyttEEqiyELFs-~cG~irriiMGLdr~kktpCGFCFVeyy~~~dA~~AlryisgtrLddr~ir~D~D 113 (153)
T KOG0121|consen 35 KSCTVYVGNLSFYTTEEQIYELFS-KCGDIRRIIMGLDRFKKTPCGFCFVEYYSRDDAEDALRYISGTRLDDRPIRIDWD 113 (153)
T ss_pred hcceEEEeeeeeeecHHHHHHHHH-hccchheeEeccccCCcCccceEEEEEecchhHHHHHHHhccCcccccceeeecc
Confidence 368999999999999999999999 9999999999999888899999999999999999999999999999999999995
Q ss_pred CC
Q 021958 252 TN 253 (305)
Q Consensus 252 ~~ 253 (305)
-.
T Consensus 114 ~G 115 (153)
T KOG0121|consen 114 AG 115 (153)
T ss_pred cc
Confidence 44
No 56
>KOG0122 consensus Translation initiation factor 3, subunit g (eIF-3g) [Translation, ribosomal structure and biogenesis]
Probab=99.63 E-value=1.6e-15 Score=121.49 Aligned_cols=85 Identities=27% Similarity=0.448 Sum_probs=79.7
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
..++.++|-|.||+.++++++|+++|..||.|..+.+.+|+.||.++|||||.|.+.++|.+||+.|||.-+ ....++
T Consensus 185 ~R~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK~TG~~kGFAFVtF~sRddA~rAI~~LnG~gy--d~LILr 262 (270)
T KOG0122|consen 185 ERDDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDKETGLSKGFAFVTFESRDDAARAIADLNGYGY--DNLILR 262 (270)
T ss_pred cCCccceeEEecCccccChhHHHHHhhccCccceeEEEEccccCcccceEEEEEecHHHHHHHHHHccCccc--ceEEEE
Confidence 456788999999999999999999999999999999999999999999999999999999999999999998 568899
Q ss_pred eeccCCC
Q 021958 157 LNWASFG 163 (305)
Q Consensus 157 v~~~~~~ 163 (305)
+.|++..
T Consensus 263 vEwskP~ 269 (270)
T KOG0122|consen 263 VEWSKPS 269 (270)
T ss_pred EEecCCC
Confidence 9999753
No 57
>KOG1190 consensus Polypyrimidine tract-binding protein [RNA processing and modification]
Probab=99.62 E-value=2e-14 Score=123.00 Aligned_cols=166 Identities=16% Similarity=0.329 Sum_probs=136.7
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccc-eEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEG-YGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g-~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
..-..++|+|+-..++-|-|..+|++||.|..|.-... ..| .|+|+|.+.+.|..|...|+|+.|..+...+++
T Consensus 148 n~vLr~iie~m~ypVslDVLHqvFS~fG~VlKIiTF~K-----nn~FQALvQy~d~~sAq~AK~aLdGqnIyngcCtLrI 222 (492)
T KOG1190|consen 148 NPVLRTIIENMFYPVSLDVLHQVFSKFGFVLKIITFTK-----NNGFQALVQYTDAVSAQAAKLALDGQNIYNGCCTLRI 222 (492)
T ss_pred ceeEEEEeccceeeeEHHHHHHHHhhcceeEEEEEEec-----ccchhhhhhccchhhHHHHHHhccCCcccCceeEEEe
Confidence 34456888999999999999999999999887754432 123 489999999999999999999999888888998
Q ss_pred eccCCCC--------------------C-------------------------------------CCCCCC--CCceEEE
Q 021958 158 NWASFGA--------------------G-------------------------------------EKRDDT--PDHTIFV 178 (305)
Q Consensus 158 ~~~~~~~--------------------~-------------------------------------~~~~~~--~~~~l~v 178 (305)
++++... . ...... .+..|.|
T Consensus 223 d~Sklt~LnvKynndkSRDyTnp~LP~gd~~p~l~~~~~aa~~~~~~~~g~p~aip~~~~~a~~a~~~~~~~~~n~vllv 302 (492)
T KOG1190|consen 223 DFSKLTDLNVKYNNDKSRDYTNPDLPVGDGQPSLDQLMAAAFGSVPAVHGAPLAIPSGAAGANAADGKIESPSANVVLLV 302 (492)
T ss_pred ehhhcccceeeccccccccccCCCCCCCccccccchhhhccccccccccCCcccCCccchhhcccccccccCCCceEEEE
Confidence 8876430 0 000001 1477888
Q ss_pred cCCCcC-CCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCC
Q 021958 179 GDLAAD-VTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKK 255 (305)
Q Consensus 179 ~nlp~~-~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~ 255 (305)
.||..+ +|.+.|..+|. .||.|.+|+|+.++ +.-|+|+|.+...|.-|++.|+|..+.|++|+|.+++-..
T Consensus 303 snln~~~VT~d~LftlFg-vYGdVqRVkil~nk-----kd~ALIQmsd~~qAqLA~~hL~g~~l~gk~lrvt~SKH~~ 374 (492)
T KOG1190|consen 303 SNLNEEAVTPDVLFTLFG-VYGDVQRVKILYNK-----KDNALIQMSDGQQAQLAMEHLEGHKLYGKKLRVTLSKHTN 374 (492)
T ss_pred ecCchhccchhHHHHHHh-hhcceEEEEeeecC-----CcceeeeecchhHHHHHHHHhhcceecCceEEEeeccCcc
Confidence 888765 89999999999 99999999999886 3569999999999999999999999999999999988643
No 58
>KOG0121 consensus Nuclear cap-binding protein complex, subunit CBP20 (RRM superfamily) [RNA processing and modification]
Probab=99.62 E-value=1.1e-15 Score=110.29 Aligned_cols=82 Identities=22% Similarity=0.505 Sum_probs=76.4
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
-..+++||||||...++|++|+++|+++|.|..|.+-.|+.+....|||||+|.+.++|..|++.++|..+ ..+.|++
T Consensus 33 ~r~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpCGFCFVeyy~~~dA~~AlryisgtrL--ddr~ir~ 110 (153)
T KOG0121|consen 33 LRKSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPCGFCFVEYYSRDDAEDALRYISGTRL--DDRPIRI 110 (153)
T ss_pred HhhcceEEEeeeeeeecHHHHHHHHHhccchheeEeccccCCcCccceEEEEEecchhHHHHHHHhccCcc--cccceee
Confidence 35679999999999999999999999999999999999999999999999999999999999999999999 5688888
Q ss_pred eccC
Q 021958 158 NWAS 161 (305)
Q Consensus 158 ~~~~ 161 (305)
+|..
T Consensus 111 D~D~ 114 (153)
T KOG0121|consen 111 DWDA 114 (153)
T ss_pred eccc
Confidence 8764
No 59
>KOG4212 consensus RNA-binding protein hnRNP-M [RNA processing and modification]
Probab=99.62 E-value=1.9e-14 Score=124.04 Aligned_cols=74 Identities=28% Similarity=0.359 Sum_probs=68.1
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
..++|+|+|||+++|++.|++-|. .||.|.++.|+ ++|+++| .|.|.+.++|++|+..|+|..++||.|+|.|.
T Consensus 535 Ka~qIiirNlP~dfTWqmlrDKfr-e~G~v~yadim---e~GkskG--VVrF~s~edAEra~a~Mngs~l~Gr~I~V~y~ 608 (608)
T KOG4212|consen 535 KACQIIIRNLPFDFTWQMLRDKFR-EIGHVLYADIM---ENGKSKG--VVRFFSPEDAERACALMNGSRLDGRNIKVTYF 608 (608)
T ss_pred cccEEEEecCCccccHHHHHHHHH-hccceehhhhh---ccCCccc--eEEecCHHHHHHHHHHhccCcccCceeeeeeC
Confidence 467899999999999999999999 99999999994 3577776 99999999999999999999999999999873
No 60
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.61 E-value=3e-15 Score=101.33 Aligned_cols=70 Identities=34% Similarity=0.669 Sum_probs=64.7
Q ss_pred EEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 84 LWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 84 l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
|||+|||.++|+++|+++|+.||.+..+.+..+ .++..++||||+|.+.++|++|++.++|..+ .++.|+
T Consensus 1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~-~~~~~~~~a~V~F~~~~~a~~a~~~l~g~~~--~~~~ir 70 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRN-SSGKSKGYAFVEFESEEDAEKALEELNGKKI--NGRKIR 70 (70)
T ss_dssp EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEE-TTSSEEEEEEEEESSHHHHHHHHHHHTTEEE--TTEEEE
T ss_pred cEEcCCCCcCCHHHHHHHHHHhhhccccccccc-ccccccceEEEEEcCHHHHHHHHHHcCCCEE--CccCcC
Confidence 799999999999999999999999999999998 5789999999999999999999999999998 455543
No 61
>KOG0107 consensus Alternative splicing factor SRp20/9G8 (RRM superfamily) [RNA processing and modification]
Probab=99.60 E-value=2.3e-15 Score=114.68 Aligned_cols=79 Identities=22% Similarity=0.412 Sum_probs=74.0
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
..++|||+||+..+++.||..+|. .||.|.+|.|..++ .|||||+|++..+|+.|+..|+|..|+|..|+|+.+
T Consensus 9 ~~~kVYVGnL~~~a~k~eLE~~F~-~yG~lrsvWvArnP-----PGfAFVEFed~RDA~DAvr~LDG~~~cG~r~rVE~S 82 (195)
T KOG0107|consen 9 GNTKVYVGNLGSRATKRELERAFS-KYGPLRSVWVARNP-----PGFAFVEFEDPRDAEDAVRYLDGKDICGSRIRVELS 82 (195)
T ss_pred CCceEEeccCCCCcchHHHHHHHH-hcCcceeEEEeecC-----CCceEEeccCcccHHHHHhhcCCccccCceEEEEee
Confidence 468999999999999999999999 99999999998776 899999999999999999999999999999999998
Q ss_pred CCCCC
Q 021958 252 TNKKT 256 (305)
Q Consensus 252 ~~~~~ 256 (305)
+....
T Consensus 83 ~G~~r 87 (195)
T KOG0107|consen 83 TGRPR 87 (195)
T ss_pred cCCcc
Confidence 77655
No 62
>KOG0131 consensus Splicing factor 3b, subunit 4 [RNA processing and modification]
Probab=99.60 E-value=2.6e-15 Score=115.07 Aligned_cols=118 Identities=26% Similarity=0.330 Sum_probs=97.3
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
....+|||+||+..++++.|.++|- +.|.|.++++.+|..++..+|||||+|.+.|+|+-|++.||...+.||.|+|..
T Consensus 7 nqd~tiyvgnld~kvs~~~l~EL~i-qagpVv~i~iPkDrv~~~~qGygF~Ef~~eedadYAikiln~VkLYgrpIrv~k 85 (203)
T KOG0131|consen 7 NQDATLYVGNLDEKVSEELLYELFI-QAGPVVNLHIPKDRVTQKHQGYGFAEFRTEEDADYAIKILNMVKLYGRPIRVNK 85 (203)
T ss_pred CCCceEEEecCCHHHHHHHHHHHHH-hcCceeeeecchhhhcccccceeEEEEechhhhHHHHHHHHHHHhcCceeEEEe
Confidence 4468999999999999999999999 999999999999999999999999999999999999999999999999999998
Q ss_pred CCCCCCCCCCCCCCCcceEEeCCcchhhhHHHHhhccceeeEEE
Q 021958 251 ATNKKTVSGQQQYPKGTFLVVGTGFCLQIKFKFKYFIGWLIFII 294 (305)
Q Consensus 251 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~g~~~~~~ 294 (305)
+... +.....+..+++|+....-+...|-..|..++.++
T Consensus 86 as~~-----~~nl~vganlfvgNLd~~vDe~~L~dtFsafG~l~ 124 (203)
T KOG0131|consen 86 ASAH-----QKNLDVGANLFVGNLDPEVDEKLLYDTFSAFGVLI 124 (203)
T ss_pred cccc-----cccccccccccccccCcchhHHHHHHHHHhccccc
Confidence 8722 22345668888988665444444444444444433
No 63
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.60 E-value=1.7e-14 Score=124.29 Aligned_cols=151 Identities=23% Similarity=0.368 Sum_probs=119.1
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
.++|||+||+.++|+++|+++|..||.+..+.+..++.+++++|||||+|.+.++|..|+..++|..+ .++.+.+.+.
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~f~~~~~~~~a~~~~~~~~~--~~~~~~v~~~ 192 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVEFESEESAEKAIEELNGKEL--EGRPLRVQKA 192 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeeccccCccCceEEEEecCHHHHHHHHHHcCCCeE--CCceeEeecc
Confidence 69999999999999999999999999999999999988999999999999999999999999999999 6788888874
Q ss_pred C----CCCCC----------------CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEE
Q 021958 161 S----FGAGE----------------KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGF 220 (305)
Q Consensus 161 ~----~~~~~----------------~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~af 220 (305)
. ..... .........+++.+++..++..++...|. .+|.+....+.............+
T Consensus 193 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 271 (306)
T COG0724 193 QPASQPRSELSNNLDASFAKKLSRGKALLLEKSDNLYVGNLPLKTAEEELADLFK-SRGDIVRASLPPSKDGKIPKSRSF 271 (306)
T ss_pred ccccccccccccccchhhhccccccccccccccceeeccccccccchhHHHHhcc-ccccceeeeccCCCCCcccccccc
Confidence 3 11111 12224567899999999999999999999 999997777665543333333444
Q ss_pred EEeCCHHHHHHHHH
Q 021958 221 VRFGDESEQLRAMT 234 (305)
Q Consensus 221 V~f~~~~~A~~a~~ 234 (305)
+.+.....+..+..
T Consensus 272 ~~~~~~~~~~~~~~ 285 (306)
T COG0724 272 VGNEASKDALESNS 285 (306)
T ss_pred cchhHHHhhhhhhc
Confidence 44444444444433
No 64
>KOG0113 consensus U1 small nuclear ribonucleoprotein (RRM superfamily) [RNA processing and modification]
Probab=99.60 E-value=6.1e-15 Score=121.36 Aligned_cols=89 Identities=25% Similarity=0.484 Sum_probs=82.8
Q ss_pred CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 168 RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 168 ~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
....+-+||||+-|+++++|.+|+..|+ .||.|+.|+|+.|..||+++|||||+|++..+...|.+..+|..|+|+.|.
T Consensus 96 a~gDPy~TLFv~RLnydT~EskLrreF~-~YG~IkrirlV~d~vTgkskGYAFIeye~erdm~~AYK~adG~~Idgrri~ 174 (335)
T KOG0113|consen 96 AIGDPYKTLFVARLNYDTSESKLRREFE-KYGPIKRIRLVRDKVTGKSKGYAFIEYEHERDMKAAYKDADGIKIDGRRIL 174 (335)
T ss_pred ccCCccceeeeeeccccccHHHHHHHHH-hcCcceeEEEeeecccCCccceEEEEeccHHHHHHHHHhccCceecCcEEE
Confidence 4457789999999999999999999999 999999999999999999999999999999999999999999999999999
Q ss_pred EccCCCCCCC
Q 021958 248 IGPATNKKTV 257 (305)
Q Consensus 248 v~~~~~~~~~ 257 (305)
|++-..+...
T Consensus 175 VDvERgRTvk 184 (335)
T KOG0113|consen 175 VDVERGRTVK 184 (335)
T ss_pred EEeccccccc
Confidence 9987665444
No 65
>KOG0120 consensus Splicing factor U2AF, large subunit (RRM superfamily) [RNA processing and modification]
Probab=99.60 E-value=1.2e-14 Score=130.29 Aligned_cols=174 Identities=17% Similarity=0.372 Sum_probs=136.1
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
......++||++|+...+++++.+++..||.+....++.+..+|.++||||.+|.+......|+..|||+.+ ++..+.
T Consensus 285 ~~~~~~ki~v~~lp~~l~~~q~~Ell~~fg~lk~f~lv~d~~~g~skg~af~ey~dpsvtd~A~agLnGm~l--gd~~lv 362 (500)
T KOG0120|consen 285 VPDSPNKIFVGGLPLYLTEDQVKELLDSFGPLKAFRLVKDSATGNSKGFAFCEYCDPSVTDQAIAGLNGMQL--GDKKLV 362 (500)
T ss_pred cccccchhhhccCcCccCHHHHHHHHHhcccchhheeecccccccccceeeeeeeCCcchhhhhcccchhhh--cCceeE
Confidence 345668899999999999999999999999999999999998899999999999999999999999999998 556666
Q ss_pred eeccCCCCCCC--------------------CCCCCCceEEEcCC--CcCC-CH-------HHHHHHHhhcCCCeeEEEE
Q 021958 157 LNWASFGAGEK--------------------RDDTPDHTIFVGDL--AADV-TD-------YMLQETFRARYPSTKGAKV 206 (305)
Q Consensus 157 v~~~~~~~~~~--------------------~~~~~~~~l~v~nl--p~~~-~~-------~~l~~~f~~~~G~i~~i~i 206 (305)
+..+......- ....++..|.+.|+ +.++ ++ |+++..+. +||.|..|.+
T Consensus 363 vq~A~~g~~~~~~~~~~~~~~~~~i~~~~~q~~g~~t~Vl~L~n~Vt~deLkdd~EyeeIlEdvr~ec~-k~g~v~~v~i 441 (500)
T KOG0120|consen 363 VQRAIVGASNANVNFNISQSQVPGIPLLMTQMAGIPTEVLCLTNVVTPDELKDDEEYEEILEDVRTECA-KFGAVRSVEI 441 (500)
T ss_pred eehhhccchhccccCCccccccccchhhhcccCCCcchhhhhhhcCCHHHhcchHHHHHHHHHHHHHhc-ccCceeEEec
Confidence 66554321110 11123333444443 1111 11 56777787 9999999999
Q ss_pred EecC---CCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 207 VIDR---LTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 207 ~~~~---~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
.++. ...-..|--||+|.+.+++++|..+|+|..|.||.|...|...
T Consensus 442 pr~~~~~~~~~G~GkVFVefas~ed~qrA~~~L~GrKF~nRtVvtsYyde 491 (500)
T KOG0120|consen 442 PRPYPDENPVPGTGKVFVEFADTEDSQRAMEELTGRKFANRTVVASYYDE 491 (500)
T ss_pred CCCCCCCCcCCCcccEEEEecChHHHHHHHHHccCceeCCcEEEEEecCH
Confidence 8772 1233467789999999999999999999999999999998654
No 66
>KOG0125 consensus Ataxin 2-binding protein (RRM superfamily) [General function prediction only]
Probab=99.59 E-value=1.5e-14 Score=120.56 Aligned_cols=86 Identities=21% Similarity=0.398 Sum_probs=77.4
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
.....++|+|+|||+...|.||+.+|++||.|.+|.|+.+. .-+|||+||+|++.++|++|.++|+|..+ .||.|+
T Consensus 92 s~~~pkRLhVSNIPFrFRdpDL~aMF~kfG~VldVEIIfNE--RGSKGFGFVTmen~~dadRARa~LHgt~V--EGRkIE 167 (376)
T KOG0125|consen 92 SKDTPKRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEIIFNE--RGSKGFGFVTMENPADADRARAELHGTVV--EGRKIE 167 (376)
T ss_pred CCCCCceeEeecCCccccCccHHHHHHhhCceeeEEEEecc--CCCCccceEEecChhhHHHHHHHhhccee--eceEEE
Confidence 44556889999999999999999999999999999999873 57899999999999999999999999999 679999
Q ss_pred eeccCCCCCC
Q 021958 157 LNWASFGAGE 166 (305)
Q Consensus 157 v~~~~~~~~~ 166 (305)
|+.+..+-..
T Consensus 168 Vn~ATarV~n 177 (376)
T KOG0125|consen 168 VNNATARVHN 177 (376)
T ss_pred Eeccchhhcc
Confidence 9999876433
No 67
>KOG0149 consensus Predicted RNA-binding protein SEB4 (RRM superfamily) [General function prediction only]
Probab=99.58 E-value=3.7e-15 Score=118.97 Aligned_cols=82 Identities=27% Similarity=0.473 Sum_probs=70.9
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
+...++||||+|++.+..++|+++|++||+|++..++.|+.+|++|||+||+|++.+.|.+|++. -.-.| .||...+
T Consensus 9 DT~~TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~~t~rskGyGfVTf~d~~aa~rAc~d-p~piI--dGR~aNc 85 (247)
T KOG0149|consen 9 DTTFTKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDKNTGRSKGYGFVTFRDAEAATRACKD-PNPII--DGRKANC 85 (247)
T ss_pred CceEEEEEEcCcccccchHHHHHHHHHhCceEEEEEEeccCCccccceeeEEeecHHHHHHHhcC-CCCcc--ccccccc
Confidence 44567899999999999999999999999999999999999999999999999999999999975 33345 4566666
Q ss_pred eccCC
Q 021958 158 NWASF 162 (305)
Q Consensus 158 ~~~~~ 162 (305)
+.+..
T Consensus 86 nlA~l 90 (247)
T KOG0149|consen 86 NLASL 90 (247)
T ss_pred chhhh
Confidence 66654
No 68
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.58 E-value=1.2e-14 Score=119.91 Aligned_cols=78 Identities=17% Similarity=0.226 Sum_probs=71.8
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCC
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
.++|||+||++.+++++|+++|+ .||.|++|+|..|.. .+|||||+|++.++|..|+. |+|..|+|+.|+|.++.
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS-~~G~I~~V~I~~d~~---~~GfAFVtF~d~eaAe~All-LnG~~l~gr~V~Vt~a~ 78 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFS-FSGDIEYVEMQSENE---RSQIAYVTFKDPQGAETALL-LSGATIVDQSVTITPAE 78 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHH-hcCCeEEEEEeecCC---CCCEEEEEeCcHHHHHHHHH-hcCCeeCCceEEEEecc
Confidence 57999999999999999999999 999999999998853 47899999999999999995 99999999999999987
Q ss_pred CCC
Q 021958 253 NKK 255 (305)
Q Consensus 253 ~~~ 255 (305)
...
T Consensus 79 ~~~ 81 (260)
T PLN03120 79 DYQ 81 (260)
T ss_pred CCC
Confidence 544
No 69
>KOG0114 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.55 E-value=3.4e-14 Score=98.77 Aligned_cols=85 Identities=14% Similarity=0.250 Sum_probs=75.9
Q ss_pred CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 168 RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 168 ~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
.....++.|||+|||+++|.|+..++|. +||.|..|+|-..++ .+|-|||.|++..+|.+|++.|+|..++++.+.
T Consensus 13 lppevnriLyirNLp~~ITseemydlFG-kyg~IrQIRiG~~k~---TrGTAFVVYedi~dAk~A~dhlsg~n~~~ryl~ 88 (124)
T KOG0114|consen 13 LPPEVNRILYIRNLPFKITSEEMYDLFG-KYGTIRQIRIGNTKE---TRGTAFVVYEDIFDAKKACDHLSGYNVDNRYLV 88 (124)
T ss_pred CChhhheeEEEecCCccccHHHHHHHhh-cccceEEEEecCccC---cCceEEEEehHhhhHHHHHHHhcccccCCceEE
Confidence 3445678999999999999999999999 999999999976543 489999999999999999999999999999999
Q ss_pred EccCCCCCC
Q 021958 248 IGPATNKKT 256 (305)
Q Consensus 248 v~~~~~~~~ 256 (305)
|-|..+-..
T Consensus 89 vlyyq~~~~ 97 (124)
T KOG0114|consen 89 VLYYQPEDA 97 (124)
T ss_pred EEecCHHHH
Confidence 999876443
No 70
>KOG0130 consensus RNA-binding protein RBM8/Tsunagi (RRM superfamily) [General function prediction only]
Probab=99.55 E-value=9e-15 Score=106.52 Aligned_cols=87 Identities=23% Similarity=0.342 Sum_probs=81.0
Q ss_pred CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEE
Q 021958 169 DDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRI 248 (305)
Q Consensus 169 ~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v 248 (305)
.......|||.++..+.+|++|.+.|. .||+|++|.+..|..+|..+|||+|+|++.++|.+|+..+||..+.|..|.|
T Consensus 68 rSVEGwIi~VtgvHeEatEedi~d~F~-dyGeiKNihLNLDRRtGy~KGYaLvEYet~keAq~A~~~~Ng~~ll~q~v~V 146 (170)
T KOG0130|consen 68 RSVEGWIIFVTGVHEEATEEDIHDKFA-DYGEIKNIHLNLDRRTGYVKGYALVEYETLKEAQAAIDALNGAELLGQNVSV 146 (170)
T ss_pred cceeeEEEEEeccCcchhHHHHHHHHh-hcccccceeeccccccccccceeeeehHhHHHHHHHHHhccchhhhCCceeE
Confidence 345578999999999999999999999 9999999999999999999999999999999999999999999999999999
Q ss_pred ccCCCCCC
Q 021958 249 GPATNKKT 256 (305)
Q Consensus 249 ~~~~~~~~ 256 (305)
+|+=-+..
T Consensus 147 Dw~Fv~gp 154 (170)
T KOG0130|consen 147 DWCFVKGP 154 (170)
T ss_pred EEEEecCC
Confidence 99755444
No 71
>KOG0129 consensus Predicted RNA-binding protein (RRM superfamily) [Translation, ribosomal structure and biogenesis]
Probab=99.54 E-value=2.5e-13 Score=119.98 Aligned_cols=198 Identities=19% Similarity=0.301 Sum_probs=135.1
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCC-C--CCccc---eEEEEEcCHHHHHHHHHHhCCCCCCC
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQ-T--GQIEG---YGFIEFISRAGAERVLQTFNGTPMPN 150 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~-~--~~~~g---~afV~f~~~~~a~~a~~~l~g~~~~~ 150 (305)
...-+++||||+||++++|+.|...|..||.+ .+.+..... . --.+| |+|+.|+++..+..-+.+..- ..
T Consensus 255 ~~~~S~KVFvGGlp~dise~~i~~~F~~FGs~-~VdWP~k~~~~~~~ppkGs~~YvflvFe~E~sV~~Ll~aC~~---~~ 330 (520)
T KOG0129|consen 255 SPRYSRKVFVGGLPWDITEAQINASFGQFGSV-KVDWPGKANSRGRAPPKGSYGYVFLVFEDERSVQSLLSACSE---GE 330 (520)
T ss_pred ccccccceeecCCCccccHHHHHhhcccccce-EeecCCCccccccCCCCCcccEEEEEecchHHHHHHHHHHhh---cc
Confidence 34567899999999999999999999999985 333332111 1 23456 999999999999888765422 11
Q ss_pred CCcceeeecc-----------------CCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCC
Q 021958 151 GEQNFRLNWA-----------------SFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTG 213 (305)
Q Consensus 151 ~~~~i~v~~~-----------------~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~ 213 (305)
.+..+++.-. ..-......-.+.+|||||+||.-++.++|..+|++.||.|..+.|-.|++-+
T Consensus 331 ~~~yf~vss~~~k~k~VQIrPW~laDs~fv~d~sq~lDprrTVFVGgvprpl~A~eLA~imd~lyGgV~yaGIDtD~k~K 410 (520)
T KOG0129|consen 331 GNYYFKVSSPTIKDKEVQIRPWVLADSDFVLDHNQPIDPRRTVFVGGLPRPLTAEELAMIMEDLFGGVLYVGIDTDPKLK 410 (520)
T ss_pred cceEEEEecCcccccceeEEeeEeccchhhhccCcccCccceEEecCCCCcchHHHHHHHHHHhcCceEEEEeccCcccC
Confidence 1222222222 21122234457899999999999999999999999999999999999998889
Q ss_pred CcccEEEEEeCCHHHHHHHHHHh----CCceecceeeEEccCCCCCCC----CCCCCCCCcceEEeCCcchhhh
Q 021958 214 RTKGYGFVRFGDESEQLRAMTEM----NGVFCSTRPMRIGPATNKKTV----SGQQQYPKGTFLVVGTGFCLQI 279 (305)
Q Consensus 214 ~~~g~afV~f~~~~~A~~a~~~l----~~~~~~g~~l~v~~~~~~~~~----~~~~~~~~~~~~~~g~~~~~~~ 279 (305)
..+|-|-|+|.+..+..+||.+- +...| .++|.|+..--.... ....-...-+-+|+.+..|+|.
T Consensus 411 YPkGaGRVtFsnqqsYi~AIsarFvql~h~d~-~KRVEIkPYv~eDq~CdeC~g~~c~~q~aPfFC~n~~C~QY 483 (520)
T KOG0129|consen 411 YPKGAGRVTFSNQQAYIKAISARFVQLDHTDI-DKRVEIKPYVMEDQLCDECGGRRCGGQFAPFFCRNATCFQY 483 (520)
T ss_pred CCCCcceeeecccHHHHHHHhhheEEEecccc-ceeeeecceeccccchhhhcCeeccCccCCcccCCccHHhh
Confidence 99999999999999999999741 11122 235555532210000 0111112334567777777776
No 72
>KOG4207 consensus Predicted splicing factor, SR protein superfamily [RNA processing and modification]
Probab=99.54 E-value=1.2e-14 Score=113.57 Aligned_cols=84 Identities=24% Similarity=0.405 Sum_probs=78.9
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
.-..|.|.||-+.++-++|+.+|+ +||.|-+|.|..|..|+.++|||||.|.+..+|+.|+++|+|..++|+.|+|.++
T Consensus 12 gm~SLkVdNLTyRTspd~LrrvFe-kYG~vgDVyIPrdr~Tr~sRgFaFVrf~~k~daedA~damDG~~ldgRelrVq~a 90 (256)
T KOG4207|consen 12 GMTSLKVDNLTYRTSPDDLRRVFE-KYGRVGDVYIPRDRYTRQSRGFAFVRFHDKRDAEDALDAMDGAVLDGRELRVQMA 90 (256)
T ss_pred cceeEEecceeccCCHHHHHHHHH-HhCcccceecccccccccccceeEEEeeecchHHHHHHhhcceeeccceeeehhh
Confidence 356799999999999999999999 9999999999999999999999999999999999999999999999999999988
Q ss_pred CCCCC
Q 021958 252 TNKKT 256 (305)
Q Consensus 252 ~~~~~ 256 (305)
+-...
T Consensus 91 rygr~ 95 (256)
T KOG4207|consen 91 RYGRP 95 (256)
T ss_pred hcCCC
Confidence 75544
No 73
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.53 E-value=3.7e-14 Score=96.08 Aligned_cols=69 Identities=33% Similarity=0.594 Sum_probs=62.2
Q ss_pred EEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcce
Q 021958 84 LWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNF 155 (305)
Q Consensus 84 l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i 155 (305)
|||+|||.++++++|+++|+.+|.|..+.+..++. +..+|+|||+|.+.++|.+|++.+++..+ .|+.|
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~~~~~-~~~~~~a~v~f~~~~~a~~al~~~~~~~~--~g~~l 69 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLIKNKD-GQSRGFAFVEFSSEEDAKRALELLNGKEI--DGRKL 69 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTSSBEEEEEEEESTT-SSEEEEEEEEESSHHHHHHHHHHHTTEEE--TTEEE
T ss_pred CEEeCCCCCCCHHHHHHHHHhcCCcceEEEEeeec-cccCCEEEEEeCCHHHHHHHHHHCCCcEE--CCEEc
Confidence 79999999999999999999999999999999976 89999999999999999999999988888 45554
No 74
>PLN03213 repressor of silencing 3; Provisional
Probab=99.53 E-value=3.7e-14 Score=124.07 Aligned_cols=82 Identities=16% Similarity=0.294 Sum_probs=74.4
Q ss_pred CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCH--HHHHHHHHHhCCceecceee
Q 021958 169 DDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDE--SEQLRAMTEMNGVFCSTRPM 246 (305)
Q Consensus 169 ~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~--~~A~~a~~~l~~~~~~g~~l 246 (305)
......+||||||.++++++||+..|. .||.|.+|.|++. +| ||||||+|.+. .++.+||..|||..+.||.|
T Consensus 6 s~~~gMRIYVGNLSydVTEDDLravFS-eFGsVkdVEIpRE--TG--RGFAFVEMssdddaEeeKAISaLNGAEWKGR~L 80 (759)
T PLN03213 6 SGGGGVRLHVGGLGESVGRDDLLKIFS-PMGTVDAVEFVRT--KG--RSFAYIDFSPSSTNSLTKLFSTYNGCVWKGGRL 80 (759)
T ss_pred cCCcceEEEEeCCCCCCCHHHHHHHHH-hcCCeeEEEEecc--cC--CceEEEEecCCcHHHHHHHHHHhcCCeecCcee
Confidence 334568999999999999999999999 9999999999944 66 99999999987 78999999999999999999
Q ss_pred EEccCCCCC
Q 021958 247 RIGPATNKK 255 (305)
Q Consensus 247 ~v~~~~~~~ 255 (305)
+|..+++..
T Consensus 81 KVNKAKP~Y 89 (759)
T PLN03213 81 RLEKAKEHY 89 (759)
T ss_pred EEeeccHHH
Confidence 999998843
No 75
>KOG0107 consensus Alternative splicing factor SRp20/9G8 (RRM superfamily) [RNA processing and modification]
Probab=99.53 E-value=2.8e-14 Score=108.82 Aligned_cols=78 Identities=22% Similarity=0.392 Sum_probs=68.8
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...++||||||+..+++.||+..|..||.+.+++|-..+ .|||||||++..+|+.|+..|+|..| .+..|+|.
T Consensus 8 ~~~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvArnP-----PGfAFVEFed~RDA~DAvr~LDG~~~--cG~r~rVE 80 (195)
T KOG0107|consen 8 NGNTKVYVGNLGSRATKRELERAFSKYGPLRSVWVARNP-----PGFAFVEFEDPRDAEDAVRYLDGKDI--CGSRIRVE 80 (195)
T ss_pred CCCceEEeccCCCCcchHHHHHHHHhcCcceeEEEeecC-----CCceEEeccCcccHHHHHhhcCCccc--cCceEEEE
Confidence 346899999999999999999999999999999998764 47999999999999999999999999 56667776
Q ss_pred ccCCC
Q 021958 159 WASFG 163 (305)
Q Consensus 159 ~~~~~ 163 (305)
.+.-.
T Consensus 81 ~S~G~ 85 (195)
T KOG0107|consen 81 LSTGR 85 (195)
T ss_pred eecCC
Confidence 66543
No 76
>smart00362 RRM_2 RNA recognition motif.
Probab=99.52 E-value=1.4e-13 Score=93.14 Aligned_cols=72 Identities=36% Similarity=0.696 Sum_probs=67.4
Q ss_pred eEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEc
Q 021958 175 TIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIG 249 (305)
Q Consensus 175 ~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~ 249 (305)
+|+|+||+.++++++|+++|. .||.|.++.+..+. +.++++|||+|.+.++|.+|++.+++..++|+.|+|+
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~-~~g~v~~~~~~~~~--~~~~~~~~v~f~~~~~a~~a~~~~~~~~~~~~~i~v~ 72 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFS-KFGPIESVKIPKDT--GKSKGFAFVEFESEEDAEKAIEALNGTKLGGRPLRVE 72 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHH-hcCCEEEEEEecCC--CCCCceEEEEeCCHHHHHHHHHHhCCcEECCEEEeeC
Confidence 589999999999999999999 99999999998775 6788999999999999999999999999999999874
No 77
>KOG0120 consensus Splicing factor U2AF, large subunit (RRM superfamily) [RNA processing and modification]
Probab=99.51 E-value=5.5e-14 Score=126.14 Aligned_cols=171 Identities=21% Similarity=0.371 Sum_probs=136.0
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhc-----------CC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhC
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHT-----------GE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFN 144 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~-----------G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 144 (305)
.....+.+||++++..++++.+..+|..- |. +..+.+... +.|||++|.+.++|..++ .++
T Consensus 171 ~t~q~~r~~v~~~~~~~~e~~~~~f~~~~~~~~gl~~~~~g~~~~s~~~n~~------~nfa~ie~~s~~~at~~~-~~~ 243 (500)
T KOG0120|consen 171 ATRQARRLYVGNIPFTSNEESMMSFFNSRMHASGLNQAPDGPSFVSVQLNLE------KNFAFIEFRSISEATEAM-ALD 243 (500)
T ss_pred hhhhhhhhcccccCCccCcHhhhhhhhhhhhhcccccCCCCCceeeeeeccc------ccceeEEecCCCchhhhh-ccc
Confidence 34566889999999999999999998654 33 556655444 459999999999999998 557
Q ss_pred CCCCCCCCcceeeeccCCC-------------------CCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEE
Q 021958 145 GTPMPNGEQNFRLNWASFG-------------------AGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAK 205 (305)
Q Consensus 145 g~~~~~~~~~i~v~~~~~~-------------------~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~ 205 (305)
+..+ .+..+++...... ...........+++|++|+..+++.++.+++. .||.+....
T Consensus 244 ~~~f--~g~~~~~~r~~d~~~~p~~~~~~~~~~~~~~~~~~t~~~~~~~ki~v~~lp~~l~~~q~~Ell~-~fg~lk~f~ 320 (500)
T KOG0120|consen 244 GIIF--EGRPLKIRRPHDYQPVPGITLSPSQLGKVGLLPASTDVPDSPNKIFVGGLPLYLTEDQVKELLD-SFGPLKAFR 320 (500)
T ss_pred chhh--CCCCceecccccccCCccchhhhccccccCCcccccCcccccchhhhccCcCccCHHHHHHHHH-hcccchhhe
Confidence 7666 3344443222111 01112234567899999999999999999999 999999999
Q ss_pred EEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCC
Q 021958 206 VVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 206 i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
++.|..+|.++||||.+|.+...+..|+..|||..++++.|.|..+-.....
T Consensus 321 lv~d~~~g~skg~af~ey~dpsvtd~A~agLnGm~lgd~~lvvq~A~~g~~~ 372 (500)
T KOG0120|consen 321 LVKDSATGNSKGFAFCEYCDPSVTDQAIAGLNGMQLGDKKLVVQRAIVGASN 372 (500)
T ss_pred eecccccccccceeeeeeeCCcchhhhhcccchhhhcCceeEeehhhccchh
Confidence 9999999999999999999999999999999999999999999987765444
No 78
>KOG0108 consensus mRNA cleavage and polyadenylation factor I complex, subunit RNA15 [RNA processing and modification]
Probab=99.51 E-value=3.7e-14 Score=126.54 Aligned_cols=83 Identities=31% Similarity=0.550 Sum_probs=80.2
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
+.+||||+++++++++|..+|+ ..|.|.+++++.|.++|+.+||||++|.+.++|.+|++.|||.+++||+|+|.|+..
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~-~~g~v~s~~~v~D~~tG~~~G~~f~~~~~~~~~~~a~~~lNg~~~~gr~l~v~~~~~ 97 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFS-GVGPVLSFRLVYDRETGKPKGFGFCEFTDEETAERAIRNLNGAEFNGRKLRVNYASN 97 (435)
T ss_pred cceEecCCCCcccHHHHHHHHh-ccCccceeeecccccCCCcCceeeEecCchhhHHHHHHhcCCcccCCceEEeecccc
Confidence 8999999999999999999999 999999999999999999999999999999999999999999999999999999987
Q ss_pred CCCC
Q 021958 254 KKTV 257 (305)
Q Consensus 254 ~~~~ 257 (305)
....
T Consensus 98 ~~~~ 101 (435)
T KOG0108|consen 98 RKNA 101 (435)
T ss_pred cchh
Confidence 6654
No 79
>smart00360 RRM RNA recognition motif.
Probab=99.51 E-value=1.3e-13 Score=92.93 Aligned_cols=71 Identities=34% Similarity=0.653 Sum_probs=67.1
Q ss_pred EcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEc
Q 021958 178 VGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIG 249 (305)
Q Consensus 178 v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~ 249 (305)
|+||+.++++++|+++|+ .||.|.++.+..+..++.++|+|||+|.+.++|.+|++.+++..++|+.|+|+
T Consensus 1 i~~l~~~~~~~~l~~~f~-~~g~v~~~~i~~~~~~~~~~~~a~v~f~~~~~a~~a~~~~~~~~~~~~~~~v~ 71 (71)
T smart00360 1 VGNLPPDVTEEELRELFS-KFGKIESVRLVRDKDTGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKVK 71 (71)
T ss_pred CCCCCcccCHHHHHHHHH-hhCCEeEEEEEeCCCCCCCCceEEEEeCCHHHHHHHHHHcCCCeeCCcEEEeC
Confidence 579999999999999999 99999999999988778999999999999999999999999999999999874
No 80
>KOG0226 consensus RNA-binding proteins [General function prediction only]
Probab=99.51 E-value=3.3e-13 Score=108.88 Aligned_cols=171 Identities=28% Similarity=0.464 Sum_probs=133.9
Q ss_pred CceEEEcCCCcccCHHH-H--hHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 81 IRTLWIGDLQYWMDETY-L--NTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~-l--~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
....+++++-..+..+- | ...|+.+-.+...+++++. .+..++++|+.|...+.-.++-.+-+++.+ +...|++
T Consensus 96 vf~p~~~~~g~~v~pep~lp~~~~f~~~p~L~ktk~v~~~-p~~~~~~~~~~~k~s~a~~k~~~~~~~Kki--~~~~VR~ 172 (290)
T KOG0226|consen 96 VFRPFQSNAGATVNPEPPLPLPVVFSEYPSLVKTKLVRDR-PQPIRPEAFESFKASDALLKAETEKEKKKI--GKPPVRL 172 (290)
T ss_pred cccccccccccccCCCCCCcchhhhccchhhhhhhhhhcC-CCccCcccccCcchhhhhhhhccccccccc--cCcceee
Confidence 34566666666665554 3 5667777777777777775 477788999999988877777766666666 5555776
Q ss_pred eccCCCCCC--CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHH
Q 021958 158 NWASFGAGE--KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTE 235 (305)
Q Consensus 158 ~~~~~~~~~--~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~ 235 (305)
..+..-... -.-.....+||.+.|..+++++.|...|. +|-.....++++|+.+|+++||+||.|.+..++..|++.
T Consensus 173 a~gtswedPsl~ew~~~DfRIfcgdlgNevnd~vl~raf~-Kfpsf~~akviRdkRTgKSkgygfVSf~~pad~~rAmre 251 (290)
T KOG0226|consen 173 AAGTSWEDPSLAEWDEDDFRIFCGDLGNEVNDDVLARAFK-KFPSFQKAKVIRDKRTGKSKGYGFVSFRDPADYVRAMRE 251 (290)
T ss_pred ccccccCCcccccCccccceeecccccccccHHHHHHHHH-hccchhhccccccccccccccceeeeecCHHHHHHHHHh
Confidence 554432111 12334568999999999999999999999 999999999999999999999999999999999999999
Q ss_pred hCCceecceeeEEccCCCCC
Q 021958 236 MNGVFCSTRPMRIGPATNKK 255 (305)
Q Consensus 236 l~~~~~~g~~l~v~~~~~~~ 255 (305)
|+|.+++.|.|++.-+.-+.
T Consensus 252 m~gkyVgsrpiklRkS~wke 271 (290)
T KOG0226|consen 252 MNGKYVGSRPIKLRKSEWKE 271 (290)
T ss_pred hcccccccchhHhhhhhHHh
Confidence 99999999999987655444
No 81
>KOG0111 consensus Cyclophilin-type peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=99.50 E-value=1.4e-14 Score=114.17 Aligned_cols=86 Identities=27% Similarity=0.481 Sum_probs=81.7
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
..++||||+|..+++|.-|...|- .||.|.+|.+..|-++++.+|||||+|...|+|..|+..||+.++.||.|+|.++
T Consensus 9 ~KrtlYVGGladeVtekvLhaAFI-PFGDI~dIqiPlDyesqkHRgFgFVefe~aEDAaaAiDNMnesEL~GrtirVN~A 87 (298)
T KOG0111|consen 9 QKRTLYVGGLADEVTEKVLHAAFI-PFGDIKDIQIPLDYESQKHRGFGFVEFEEAEDAAAAIDNMNESELFGRTIRVNLA 87 (298)
T ss_pred cceeEEeccchHHHHHHHHHhccc-cccchhhcccccchhcccccceeEEEeeccchhHHHhhcCchhhhcceeEEEeec
Confidence 458999999999999999999999 9999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCCC
Q 021958 252 TNKKTVS 258 (305)
Q Consensus 252 ~~~~~~~ 258 (305)
++-+...
T Consensus 88 kP~kike 94 (298)
T KOG0111|consen 88 KPEKIKE 94 (298)
T ss_pred CCccccC
Confidence 9866554
No 82
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.50 E-value=1.2e-13 Score=114.11 Aligned_cols=76 Identities=18% Similarity=0.339 Sum_probs=69.6
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
.++|||+||++.+|+++|+++|+.||.|.++.|..++. .+|||||+|.+.++|+.|+ .|+|..| .++.|+|.++
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~~---~~GfAFVtF~d~eaAe~Al-lLnG~~l--~gr~V~Vt~a 77 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSENE---RSQIAYVTFKDPQGAETAL-LLSGATI--VDQSVTITPA 77 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecCC---CCCEEEEEeCcHHHHHHHH-HhcCCee--CCceEEEEec
Confidence 57999999999999999999999999999999998853 5689999999999999999 5999999 6799999888
Q ss_pred CC
Q 021958 161 SF 162 (305)
Q Consensus 161 ~~ 162 (305)
..
T Consensus 78 ~~ 79 (260)
T PLN03120 78 ED 79 (260)
T ss_pred cC
Confidence 63
No 83
>KOG0108 consensus mRNA cleavage and polyadenylation factor I complex, subunit RNA15 [RNA processing and modification]
Probab=99.49 E-value=3.5e-14 Score=126.73 Aligned_cols=80 Identities=29% Similarity=0.618 Sum_probs=77.6
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
+.|||||+|+++++++|.++|+..|.|.+++++.|+.+|+.+||||++|.+.++|..|++.|+|.++ .++.++++|+.
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~~~~~~~~~~a~~~lNg~~~--~gr~l~v~~~~ 96 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCEFTDEETAERAIRNLNGAEF--NGRKLRVNYAS 96 (435)
T ss_pred cceEecCCCCcccHHHHHHHHhccCccceeeecccccCCCcCceeeEecCchhhHHHHHHhcCCccc--CCceEEeeccc
Confidence 8999999999999999999999999999999999999999999999999999999999999999999 88999999997
Q ss_pred CC
Q 021958 162 FG 163 (305)
Q Consensus 162 ~~ 163 (305)
..
T Consensus 97 ~~ 98 (435)
T KOG0108|consen 97 NR 98 (435)
T ss_pred cc
Confidence 65
No 84
>KOG1365 consensus RNA-binding protein Fusilli, contains RRM domain [RNA processing and modification; General function prediction only]
Probab=99.48 E-value=2e-13 Score=115.99 Aligned_cols=168 Identities=14% Similarity=0.213 Sum_probs=129.5
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhc----CCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcc
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHT----GEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQN 154 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~----G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~ 154 (305)
...-.|-+++||+++++.++.++|..- |..+.+.++... +|+..|-|||.|..+++|..|+.+ +...| +.|.
T Consensus 159 ~~qvivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~rp-dgrpTGdAFvlfa~ee~aq~aL~k-hrq~i--GqRY 234 (508)
T KOG1365|consen 159 ENQVIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTRP-DGRPTGDAFVLFACEEDAQFALRK-HRQNI--GQRY 234 (508)
T ss_pred ccceEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEECC-CCCcccceEEEecCHHHHHHHHHH-HHHHH--hHHH
Confidence 445567789999999999999999622 235566666654 689999999999999999999965 44444 4555
Q ss_pred eeeeccCCC---------------------------CCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCC-eeE--E
Q 021958 155 FRLNWASFG---------------------------AGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPS-TKG--A 204 (305)
Q Consensus 155 i~v~~~~~~---------------------------~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~--i 204 (305)
|++-+++.. ...-.......+|.+++||++.+.|||..+|. .|-. |.. |
T Consensus 235 IElFRSTaaEvqqvlnr~~s~pLi~~~~sp~~p~~p~~~~p~~~~kdcvRLRGLPy~AtvEdIL~Flg-dFa~~i~f~gV 313 (508)
T KOG1365|consen 235 IELFRSTAAEVQQVLNREVSEPLIPGLTSPLLPGGPARLVPPTRSKDCVRLRGLPYEATVEDILDFLG-DFATDIRFQGV 313 (508)
T ss_pred HHHHHHhHHHHHHHHHhhccccccCCCCCCCCCCCccccCCCCCCCCeeEecCCChhhhHHHHHHHHH-HHhhhccccee
Confidence 555444321 00001223477899999999999999999998 6765 333 7
Q ss_pred EEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCC
Q 021958 205 KVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 205 ~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
+++.+. .|+..|-|||+|.+.|+|..|..+.|++...+|.|.|.-+.
T Consensus 314 Hmv~N~-qGrPSGeAFIqm~nae~a~aaaqk~hk~~mk~RYiEvfp~S 360 (508)
T KOG1365|consen 314 HMVLNG-QGRPSGEAFIQMRNAERARAAAQKCHKKLMKSRYIEVFPCS 360 (508)
T ss_pred EEEEcC-CCCcChhhhhhhhhhHHHHHHHHHHHHhhcccceEEEeecc
Confidence 777774 78999999999999999999999999988899999997654
No 85
>KOG0113 consensus U1 small nuclear ribonucleoprotein (RRM superfamily) [RNA processing and modification]
Probab=99.48 E-value=2.1e-13 Score=112.34 Aligned_cols=85 Identities=21% Similarity=0.397 Sum_probs=76.3
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
..+..+||||+-|+.+++|.+|+..|+.||.|+.|+|++|+.||+++|||||+|+++.+...|.+..+|..|++....|.
T Consensus 97 ~gDPy~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~vTgkskGYAFIeye~erdm~~AYK~adG~~Idgrri~VD 176 (335)
T KOG0113|consen 97 IGDPYKTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDKVTGKSKGYAFIEYEHERDMKAAYKDADGIKIDGRRILVD 176 (335)
T ss_pred cCCccceeeeeeccccccHHHHHHHHHhcCcceeEEEeeecccCCccceEEEEeccHHHHHHHHHhccCceecCcEEEEE
Confidence 45788999999999999999999999999999999999999999999999999999999999999999999966554455
Q ss_pred eeccC
Q 021958 157 LNWAS 161 (305)
Q Consensus 157 v~~~~ 161 (305)
+....
T Consensus 177 vERgR 181 (335)
T KOG0113|consen 177 VERGR 181 (335)
T ss_pred ecccc
Confidence 54443
No 86
>KOG1456 consensus Heterogeneous nuclear ribonucleoprotein L (contains RRM repeats) [RNA processing and modification]
Probab=99.47 E-value=9.3e-13 Score=111.65 Aligned_cols=167 Identities=15% Similarity=0.147 Sum_probs=131.8
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
..+-.|+|++|-..+++.+|.+.++.||.|..+..+..+ ..|.|+|++.+.|++++.--....+...+...-++
T Consensus 29 ~~spvvhvr~l~~~v~eadl~eal~~fG~i~yvt~~P~~------r~alvefedi~~akn~Vnfaa~n~i~i~gq~Al~N 102 (494)
T KOG1456|consen 29 NPSPVVHVRGLHQGVVEADLVEALSNFGPIAYVTCMPHK------RQALVEFEDIEGAKNCVNFAADNQIYIAGQQALFN 102 (494)
T ss_pred CCCceEEEeccccccchhHHHHHHhcCCceEEEEecccc------ceeeeeeccccchhhheehhccCcccccCchhhcc
Confidence 456689999999999999999999999999888777654 37999999999999998654444454467777777
Q ss_pred ccCCCCCCCC---CCCCCceEEEc--CCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHH
Q 021958 159 WASFGAGEKR---DDTPDHTIFVG--DLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAM 233 (305)
Q Consensus 159 ~~~~~~~~~~---~~~~~~~l~v~--nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~ 233 (305)
+++....++. ...++..|.+. |--+.+|.+-|..++. ..|.|.+|.|... + .-.|.|+|++.+.|++|.
T Consensus 103 yStsq~i~R~g~es~~pN~VLl~TIlNp~YpItvDVly~Icn-p~GkVlRIvIfkk--n---gVQAmVEFdsv~~AqrAk 176 (494)
T KOG1456|consen 103 YSTSQCIERPGDESATPNKVLLFTILNPQYPITVDVLYTICN-PQGKVLRIVIFKK--N---GVQAMVEFDSVEVAQRAK 176 (494)
T ss_pred cchhhhhccCCCCCCCCCeEEEEEeecCccccchhhhhhhcC-CCCceEEEEEEec--c---ceeeEEeechhHHHHHHH
Confidence 7765544432 22344444444 4455789999999998 9999999988765 3 346999999999999999
Q ss_pred HHhCCcee--cceeeEEccCCCCCCC
Q 021958 234 TEMNGVFC--STRPMRIGPATNKKTV 257 (305)
Q Consensus 234 ~~l~~~~~--~g~~l~v~~~~~~~~~ 257 (305)
..|||..| +-+.|+|+|+++.+-.
T Consensus 177 ~alNGADIYsGCCTLKIeyAkP~rln 202 (494)
T KOG1456|consen 177 AALNGADIYSGCCTLKIEYAKPTRLN 202 (494)
T ss_pred hhcccccccccceeEEEEecCcceee
Confidence 99999987 5589999999986544
No 87
>KOG0126 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.47 E-value=7.3e-15 Score=112.50 Aligned_cols=82 Identities=24% Similarity=0.488 Sum_probs=76.5
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
.++.-|||||||.+.||.||.-.|++||.|++|.+++|+.||+++||||+.|++..+...|+..|||..| .+|.|+|+
T Consensus 33 kdsA~Iyiggl~~~LtEgDil~VFSqyGe~vdinLiRDk~TGKSKGFaFLcYEDQRSTILAVDN~NGiki--~gRtirVD 110 (219)
T KOG0126|consen 33 KDSAYIYIGGLPYELTEGDILCVFSQYGEIVDINLIRDKKTGKSKGFAFLCYEDQRSTILAVDNLNGIKI--LGRTIRVD 110 (219)
T ss_pred ccceEEEECCCcccccCCcEEEEeeccCceEEEEEEecCCCCcccceEEEEecCccceEEEEeccCCcee--cceeEEee
Confidence 4567899999999999999999999999999999999999999999999999999999999999999999 77999998
Q ss_pred ccCC
Q 021958 159 WASF 162 (305)
Q Consensus 159 ~~~~ 162 (305)
....
T Consensus 111 Hv~~ 114 (219)
T KOG0126|consen 111 HVSN 114 (219)
T ss_pred eccc
Confidence 6543
No 88
>KOG1456 consensus Heterogeneous nuclear ribonucleoprotein L (contains RRM repeats) [RNA processing and modification]
Probab=99.47 E-value=8.1e-12 Score=106.02 Aligned_cols=202 Identities=20% Similarity=0.201 Sum_probs=150.4
Q ss_pred cCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccCCCCCC
Q 021958 87 GDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWASFGAGE 166 (305)
Q Consensus 87 ~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~~~~~~ 166 (305)
-|--..+|.|-|..+....|.|.+|.|++. +| -.|.|||++.+.|.+|.+.|||..|..+.-.++|++++.....
T Consensus 128 lNp~YpItvDVly~Icnp~GkVlRIvIfkk--ng---VQAmVEFdsv~~AqrAk~alNGADIYsGCCTLKIeyAkP~rln 202 (494)
T KOG1456|consen 128 LNPQYPITVDVLYTICNPQGKVLRIVIFKK--NG---VQAMVEFDSVEVAQRAKAALNGADIYSGCCTLKIEYAKPTRLN 202 (494)
T ss_pred ecCccccchhhhhhhcCCCCceEEEEEEec--cc---eeeEEeechhHHHHHHHhhcccccccccceeEEEEecCcceee
Confidence 344478899999999999999999988765 22 2699999999999999999999999888888999888764000
Q ss_pred ------------------------------------------------------------------------------CC
Q 021958 167 ------------------------------------------------------------------------------KR 168 (305)
Q Consensus 167 ------------------------------------------------------------------------------~~ 168 (305)
..
T Consensus 203 V~knd~DtwDyTlp~~~~~~~~g~~~~~r~~~p~~~~~~pss~~G~h~~y~sg~~~~p~~~~P~r~~~~~~~~~g~a~p~ 282 (494)
T KOG1456|consen 203 VQKNDKDTWDYTLPDLRGPYDPGRNHYDRQRQPAPLGYHPSSRGGGHSGYYSGDRHGPPHPPPSRYRDGYRDGRGYASPG 282 (494)
T ss_pred eeecCCccccccCCCCCCCCCCCCCCCccccCCCccCCChhhcCCCCCCCcccccCCCCCCCCCCCccccccCCCCCCCC
Confidence 00
Q ss_pred CCCCCceEEEcCCCcC-CCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 169 DDTPDHTIFVGDLAAD-VTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 169 ~~~~~~~l~v~nlp~~-~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
....++.+.|.+|+.. .+-+.|..+|- .||.|++|++++.+ .|-|.|++.|..+.++|+..||+..+-|.+|.
T Consensus 283 g~~~g~VmMVyGLdh~k~N~drlFNl~C-lYGNV~rvkFmkTk-----~gtamVemgd~~aver~v~hLnn~~lfG~kl~ 356 (494)
T KOG1456|consen 283 GGAPGCVMMVYGLDHGKMNCDRLFNLFC-LYGNVERVKFMKTK-----PGTAMVEMGDAYAVERAVTHLNNIPLFGGKLN 356 (494)
T ss_pred CCCCCcEEEEEeccccccchhhhhhhhh-hcCceeeEEEeecc-----cceeEEEcCcHHHHHHHHHHhccCccccceEE
Confidence 1134678999999886 66788999998 99999999999876 67899999999999999999999999999999
Q ss_pred EccCCCCCCCCCC-CCCCCcc---eEEeC--CcchhhhHHHHhhccceeeEEEEEeee
Q 021958 248 IGPATNKKTVSGQ-QQYPKGT---FLVVG--TGFCLQIKFKFKYFIGWLIFIIIFFNI 299 (305)
Q Consensus 248 v~~~~~~~~~~~~-~~~~~~~---~~~~g--~~~~~~~~~~~~~~~g~~~~~~~~~~~ 299 (305)
|.+++...-.-.+ --...+. ..+.+ +..-+.-....+..+..-+.||-|||.
T Consensus 357 v~~SkQ~~v~~~~pflLpDgSpSfKdys~SkNnRFssp~qAsKNrIq~Ps~vLHffNa 414 (494)
T KOG1456|consen 357 VCVSKQNFVSPVQPFLLPDGSPSFKDYSGSKNNRFSSPEQASKNRIQPPSNVLHFFNA 414 (494)
T ss_pred EeeccccccccCCceecCCCCcchhhcccccccccCChhHhhcccccCCcceeEEecC
Confidence 9988764433210 0001111 01111 111111134555667788899999985
No 89
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.46 E-value=4.5e-13 Score=108.94 Aligned_cols=77 Identities=14% Similarity=0.123 Sum_probs=70.1
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
...+|||+||++.+++++|+++|+ .||.|.+|+|.+|. ..+++|||+|++.++|..|+ .|+|..|.++.|.|..+
T Consensus 4 ~g~TV~V~NLS~~tTE~dLrefFS-~~G~I~~V~I~~D~---et~gfAfVtF~d~~aaetAl-lLnGa~l~d~~I~It~~ 78 (243)
T PLN03121 4 GGYTAEVTNLSPKATEKDVYDFFS-HCGAIEHVEIIRSG---EYACTAYVTFKDAYALETAV-LLSGATIVDQRVCITRW 78 (243)
T ss_pred CceEEEEecCCCCCCHHHHHHHHH-hcCCeEEEEEecCC---CcceEEEEEECCHHHHHHHH-hcCCCeeCCceEEEEeC
Confidence 357999999999999999999999 99999999999884 44689999999999999999 58999999999999876
Q ss_pred CC
Q 021958 252 TN 253 (305)
Q Consensus 252 ~~ 253 (305)
..
T Consensus 79 ~~ 80 (243)
T PLN03121 79 GQ 80 (243)
T ss_pred cc
Confidence 54
No 90
>KOG4207 consensus Predicted splicing factor, SR protein superfamily [RNA processing and modification]
Probab=99.44 E-value=1.8e-13 Score=107.10 Aligned_cols=85 Identities=25% Similarity=0.473 Sum_probs=78.3
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
+-+...+|.|-||-..++.++|+.+|++||.|-+|.|.+|+.|+.++|||||.|.+..+|+.|+++|+|..+ .|+.|+
T Consensus 9 dv~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr~Tr~sRgFaFVrf~~k~daedA~damDG~~l--dgRelr 86 (256)
T KOG4207|consen 9 DVEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDRYTRQSRGFAFVRFHDKRDAEDALDAMDGAVL--DGRELR 86 (256)
T ss_pred CcccceeEEecceeccCCHHHHHHHHHHhCcccceecccccccccccceeEEEeeecchHHHHHHhhcceee--ccceee
Confidence 446678899999999999999999999999999999999999999999999999999999999999999999 567888
Q ss_pred eeccCCC
Q 021958 157 LNWASFG 163 (305)
Q Consensus 157 v~~~~~~ 163 (305)
|..+...
T Consensus 87 Vq~aryg 93 (256)
T KOG4207|consen 87 VQMARYG 93 (256)
T ss_pred ehhhhcC
Confidence 8777654
No 91
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.44 E-value=1.4e-12 Score=88.66 Aligned_cols=74 Identities=36% Similarity=0.668 Sum_probs=68.6
Q ss_pred eEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 175 TIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 175 ~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
+|+|+|||..+++++|+++|+ .||.|..+.+..+..+ ..+++|||+|.+.++|..|++.+++..++|+.+.|.+
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~-~~g~i~~~~~~~~~~~-~~~~~~~v~f~s~~~a~~a~~~~~~~~~~~~~~~v~~ 74 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFS-KFGKVESVRIVRDKDT-KSKGFAFVEFEDEEDAEKALEALNGKELGGRPLRVEF 74 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHH-hcCCEEEEEEeeCCCC-CcceEEEEEECCHHHHHHHHHHhCCCeECCeEEEEeC
Confidence 489999999999999999999 9999999999987644 6789999999999999999999999999999999875
No 92
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=99.42 E-value=1.1e-12 Score=84.63 Aligned_cols=56 Identities=25% Similarity=0.563 Sum_probs=50.6
Q ss_pred HHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 190 LQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 190 l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
|.++|+ +||.|.++.+..+. +++|||+|.+.++|.+|++.|||..++|+.|+|+|+
T Consensus 1 L~~~f~-~fG~V~~i~~~~~~-----~~~a~V~f~~~~~A~~a~~~l~~~~~~g~~l~V~~a 56 (56)
T PF13893_consen 1 LYKLFS-KFGEVKKIKIFKKK-----RGFAFVEFASVEDAQKAIEQLNGRQFNGRPLKVSYA 56 (56)
T ss_dssp HHHHHT-TTS-EEEEEEETTS-----TTEEEEEESSHHHHHHHHHHHTTSEETTEEEEEEEE
T ss_pred ChHHhC-CcccEEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEECCcEEEEEEC
Confidence 678999 99999999997553 689999999999999999999999999999999985
No 93
>PLN03213 repressor of silencing 3; Provisional
Probab=99.41 E-value=8.6e-13 Score=115.67 Aligned_cols=79 Identities=25% Similarity=0.383 Sum_probs=71.0
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCH--HHHHHHHHHhCCCCCCCCCccee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISR--AGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~--~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
....+||||||.+.+++++|+..|+.||.|.++.|++. +| ||||||+|.+. .++.+|+..|||..+ .|+.|+
T Consensus 8 ~~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIpRE--TG--RGFAFVEMssdddaEeeKAISaLNGAEW--KGR~LK 81 (759)
T PLN03213 8 GGGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFVRT--KG--RSFAYIDFSPSSTNSLTKLFSTYNGCVW--KGGRLR 81 (759)
T ss_pred CcceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEecc--cC--CceEEEEecCCcHHHHHHHHHHhcCCee--cCceeE
Confidence 44578999999999999999999999999999999954 56 89999999987 789999999999999 779999
Q ss_pred eeccCCC
Q 021958 157 LNWASFG 163 (305)
Q Consensus 157 v~~~~~~ 163 (305)
|+.++..
T Consensus 82 VNKAKP~ 88 (759)
T PLN03213 82 LEKAKEH 88 (759)
T ss_pred EeeccHH
Confidence 9888654
No 94
>KOG0105 consensus Alternative splicing factor ASF/SF2 (RRM superfamily) [RNA processing and modification]
Probab=99.41 E-value=1.4e-12 Score=100.31 Aligned_cols=80 Identities=18% Similarity=0.336 Sum_probs=71.8
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
..++|||+|||.++.+.||..+|. +||.|..|.+.... ....||||+|++..+|+.||..-+|..++|+.|+|+|.
T Consensus 5 ~~~~iyvGNLP~diRekeieDlFy-Kyg~i~~ieLK~r~---g~ppfafVeFEd~RDAeDAiygRdGYdydg~rLRVEfp 80 (241)
T KOG0105|consen 5 NSRRIYVGNLPGDIREKEIEDLFY-KYGRIREIELKNRP---GPPPFAFVEFEDPRDAEDAIYGRDGYDYDGCRLRVEFP 80 (241)
T ss_pred ccceEEecCCCcchhhccHHHHHh-hhcceEEEEeccCC---CCCCeeEEEecCccchhhhhhcccccccCcceEEEEec
Confidence 468999999999999999999999 99999999986442 23679999999999999999999999999999999998
Q ss_pred CCCC
Q 021958 252 TNKK 255 (305)
Q Consensus 252 ~~~~ 255 (305)
....
T Consensus 81 rggr 84 (241)
T KOG0105|consen 81 RGGR 84 (241)
T ss_pred cCCC
Confidence 7653
No 95
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.41 E-value=1.3e-12 Score=112.44 Aligned_cols=79 Identities=38% Similarity=0.687 Sum_probs=76.6
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCC
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
..+|||+||+.++++++|.++|. .||.|..+.+..|..++.++|||||+|.+.++|..|+..++|..|.|+.|+|.++.
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~-~~g~~~~~~~~~d~~~~~~~g~~~v~f~~~~~~~~a~~~~~~~~~~~~~~~v~~~~ 193 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFK-KFGPVKRVRLVRDRETGKSRGFAFVEFESEESAEKAIEELNGKELEGRPLRVQKAQ 193 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHH-hcCceeEEEeeeccccCccCceEEEEecCHHHHHHHHHHcCCCeECCceeEeeccc
Confidence 69999999999999999999999 99999999999998899999999999999999999999999999999999999965
No 96
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.40 E-value=1.7e-12 Score=105.64 Aligned_cols=76 Identities=17% Similarity=0.227 Sum_probs=67.7
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
...+|||+||++.+|+++|+++|+.||.|.+|+|++|. +.+++|||+|.+.++|+.|+ .|+|..| .++.|.|..
T Consensus 4 ~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D~---et~gfAfVtF~d~~aaetAl-lLnGa~l--~d~~I~It~ 77 (243)
T PLN03121 4 GGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRSG---EYACTAYVTFKDAYALETAV-LLSGATI--VDQRVCITR 77 (243)
T ss_pred CceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecCC---CcceEEEEEECCHHHHHHHH-hcCCCee--CCceEEEEe
Confidence 45799999999999999999999999999999999884 45679999999999999999 7899999 567788766
Q ss_pred cC
Q 021958 160 AS 161 (305)
Q Consensus 160 ~~ 161 (305)
..
T Consensus 78 ~~ 79 (243)
T PLN03121 78 WG 79 (243)
T ss_pred Cc
Confidence 54
No 97
>smart00362 RRM_2 RNA recognition motif.
Probab=99.39 E-value=2.6e-12 Score=86.74 Aligned_cols=71 Identities=32% Similarity=0.640 Sum_probs=63.9
Q ss_pred eEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 83 TLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
+|||+|||..+++++|+++|+.||.+..+.+..+. +.++|+|||+|.+.++|.+|++.+++..+ .++.+++
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~~--~~~~~~~~v~f~~~~~a~~a~~~~~~~~~--~~~~i~v 71 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKDT--GKSKGFAFVEFESEEDAEKAIEALNGTKL--GGRPLRV 71 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecCC--CCCCceEEEEeCCHHHHHHHHHHhCCcEE--CCEEEee
Confidence 58999999999999999999999999999998876 77889999999999999999999999888 4555554
No 98
>KOG0114 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.38 E-value=2.8e-12 Score=89.33 Aligned_cols=81 Identities=15% Similarity=0.294 Sum_probs=69.5
Q ss_pred CCCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcce
Q 021958 76 GQPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNF 155 (305)
Q Consensus 76 ~~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i 155 (305)
..+...+-|||.|||.++|.+++.++|.+||.|..|++--.+ ..+|-|||.|++..+|++|++.|+|..+ .++.+
T Consensus 13 lppevnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k---~TrGTAFVVYedi~dAk~A~dhlsg~n~--~~ryl 87 (124)
T KOG0114|consen 13 LPPEVNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTK---ETRGTAFVVYEDIFDAKKACDHLSGYNV--DNRYL 87 (124)
T ss_pred CChhhheeEEEecCCccccHHHHHHHhhcccceEEEEecCcc---CcCceEEEEehHhhhHHHHHHHhccccc--CCceE
Confidence 344667899999999999999999999999999999986554 4678999999999999999999999998 55766
Q ss_pred eeeccC
Q 021958 156 RLNWAS 161 (305)
Q Consensus 156 ~v~~~~ 161 (305)
.+-+-.
T Consensus 88 ~vlyyq 93 (124)
T KOG0114|consen 88 VVLYYQ 93 (124)
T ss_pred EEEecC
Confidence 665443
No 99
>KOG0111 consensus Cyclophilin-type peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=99.37 E-value=5.4e-13 Score=105.32 Aligned_cols=86 Identities=22% Similarity=0.396 Sum_probs=80.8
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...++||||+|..++|+.-|...|-.||.|.+|.+..|-.+++.+|||||+|...|+|..|+..+|+.++ .|+.|+|+
T Consensus 8 ~~KrtlYVGGladeVtekvLhaAFIPFGDI~dIqiPlDyesqkHRgFgFVefe~aEDAaaAiDNMnesEL--~GrtirVN 85 (298)
T KOG0111|consen 8 NQKRTLYVGGLADEVTEKVLHAAFIPFGDIKDIQIPLDYESQKHRGFGFVEFEEAEDAAAAIDNMNESEL--FGRTIRVN 85 (298)
T ss_pred ccceeEEeccchHHHHHHHHHhccccccchhhcccccchhcccccceeEEEeeccchhHHHhhcCchhhh--cceeEEEe
Confidence 5678999999999999999999999999999999999999999999999999999999999999999999 88999999
Q ss_pred ccCCCCCC
Q 021958 159 WASFGAGE 166 (305)
Q Consensus 159 ~~~~~~~~ 166 (305)
++......
T Consensus 86 ~AkP~kik 93 (298)
T KOG0111|consen 86 LAKPEKIK 93 (298)
T ss_pred ecCCcccc
Confidence 99876443
No 100
>KOG0130 consensus RNA-binding protein RBM8/Tsunagi (RRM superfamily) [General function prediction only]
Probab=99.37 E-value=1.2e-12 Score=95.45 Aligned_cols=83 Identities=20% Similarity=0.405 Sum_probs=77.0
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
.+.-.|||.++...+|+++|.+.|..||.|+.+++..|+.||-.+|||+|+|++.++|.+|+..+||..+ -+..+.|+
T Consensus 70 VEGwIi~VtgvHeEatEedi~d~F~dyGeiKNihLNLDRRtGy~KGYaLvEYet~keAq~A~~~~Ng~~l--l~q~v~VD 147 (170)
T KOG0130|consen 70 VEGWIIFVTGVHEEATEEDIHDKFADYGEIKNIHLNLDRRTGYVKGYALVEYETLKEAQAAIDALNGAEL--LGQNVSVD 147 (170)
T ss_pred eeeEEEEEeccCcchhHHHHHHHHhhcccccceeeccccccccccceeeeehHhHHHHHHHHHhccchhh--hCCceeEE
Confidence 4556799999999999999999999999999999999999999999999999999999999999999999 56888888
Q ss_pred ccCCC
Q 021958 159 WASFG 163 (305)
Q Consensus 159 ~~~~~ 163 (305)
|+--+
T Consensus 148 w~Fv~ 152 (170)
T KOG0130|consen 148 WCFVK 152 (170)
T ss_pred EEEec
Confidence 88644
No 101
>smart00361 RRM_1 RNA recognition motif.
Probab=99.37 E-value=4e-12 Score=85.82 Aligned_cols=63 Identities=17% Similarity=0.410 Sum_probs=53.2
Q ss_pred HHHHHHHHh---hcCCCeeEEE-EEecCCC--CCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEc
Q 021958 187 DYMLQETFR---ARYPSTKGAK-VVIDRLT--GRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIG 249 (305)
Q Consensus 187 ~~~l~~~f~---~~~G~i~~i~-i~~~~~~--~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~ 249 (305)
+++|+++|+ ..||.|.++. +..+..+ +.++|||||+|.+.++|.+|++.|||..++||.|++.
T Consensus 2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~f~~~~dA~~A~~~l~g~~~~gr~l~~~ 70 (70)
T smart00361 2 DEDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYITFERSEDAARAIVDLNGRYFDGRTVKAE 70 (70)
T ss_pred chhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEEECCHHHHHHHHHHhCCCEECCEEEEeC
Confidence 455555552 1799999995 7777666 8999999999999999999999999999999999873
No 102
>KOG0109 consensus RNA-binding protein LARK, contains RRM and retroviral-type Zn-finger domains [RNA processing and modification; General function prediction only]
Probab=99.36 E-value=1.2e-12 Score=107.69 Aligned_cols=75 Identities=21% Similarity=0.437 Sum_probs=70.0
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
.+|||+|||.++++.+|+.+|+ +||+|.+|+|++ .||||..++...|..|+..|||..|+|..|.|+-+++
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe-~ygkVlECDIvK--------NYgFVHiEdktaaedairNLhgYtLhg~nInVeaSks 73 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFE-QYGKVLECDIVK--------NYGFVHIEDKTAAEDAIRNLHGYTLHGVNINVEASKS 73 (346)
T ss_pred cchhccCCCcccchHHHHHHHH-hhCceEeeeeec--------ccceEEeecccccHHHHhhcccceecceEEEEEeccc
Confidence 4799999999999999999999 999999999994 4999999999999999999999999999999999988
Q ss_pred CCCC
Q 021958 254 KKTV 257 (305)
Q Consensus 254 ~~~~ 257 (305)
|...
T Consensus 74 Ksk~ 77 (346)
T KOG0109|consen 74 KSKA 77 (346)
T ss_pred cCCC
Confidence 8433
No 103
>smart00360 RRM RNA recognition motif.
Probab=99.36 E-value=4.2e-12 Score=85.36 Aligned_cols=70 Identities=30% Similarity=0.621 Sum_probs=63.5
Q ss_pred EcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 86 IGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 86 v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
|+||+..+++++|+++|+.||.|..+.+..++.++.++|+|||+|.+.++|.+|++.+++..+ .++.+++
T Consensus 1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~~~~~~~~~a~v~f~~~~~a~~a~~~~~~~~~--~~~~~~v 70 (71)
T smart00360 1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDKDTGKSKGFAFVEFESEEDAEKALEALNGKEL--DGRPLKV 70 (71)
T ss_pred CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCCCCCCCCceEEEEeCCHHHHHHHHHHcCCCee--CCcEEEe
Confidence 579999999999999999999999999999887789999999999999999999999998888 4565554
No 104
>KOG0146 consensus RNA-binding protein ETR-3 (RRM superfamily) [RNA processing and modification]
Probab=99.33 E-value=2.8e-12 Score=104.19 Aligned_cols=99 Identities=19% Similarity=0.330 Sum_probs=82.1
Q ss_pred cceeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHH
Q 021958 153 QNFRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRA 232 (305)
Q Consensus 153 ~~i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a 232 (305)
|.|.|..+..+ .....+++||||-|.+.-.|||++.+|. .||.|++|.+.+.. +|.+||+|||+|.+..+|..|
T Consensus 3 rpiqvkpadse----srg~~drklfvgml~kqq~e~dvrrlf~-pfG~~~e~tvlrg~-dg~sKGCAFVKf~s~~eAqaA 76 (371)
T KOG0146|consen 3 RPIQVKPADSE----SRGGDDRKLFVGMLNKQQSEDDVRRLFQ-PFGNIEECTVLRGP-DGNSKGCAFVKFSSHAEAQAA 76 (371)
T ss_pred CCccccccccc----cCCccchhhhhhhhcccccHHHHHHHhc-ccCCcceeEEecCC-CCCCCCceEEEeccchHHHHH
Confidence 44555555433 2233578999999999999999999999 99999999999886 899999999999999999999
Q ss_pred HHHhCCce-e--cceeeEEccCCCCCCC
Q 021958 233 MTEMNGVF-C--STRPMRIGPATNKKTV 257 (305)
Q Consensus 233 ~~~l~~~~-~--~g~~l~v~~~~~~~~~ 257 (305)
|..|||.. + ....|.|+|+...+++
T Consensus 77 I~aLHgSqTmpGASSSLVVK~ADTdkER 104 (371)
T KOG0146|consen 77 INALHGSQTMPGASSSLVVKFADTDKER 104 (371)
T ss_pred HHHhcccccCCCCccceEEEeccchHHH
Confidence 99999976 3 4478999998765554
No 105
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.30 E-value=2.6e-11 Score=82.25 Aligned_cols=73 Identities=33% Similarity=0.653 Sum_probs=65.2
Q ss_pred eEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 83 TLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
+|+|+|||..+++++|+++|+.+|.+..+.+..+..+ ..+|+|||+|.+.++|..|++.+++..+ .++.+.+.
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~~~~~-~~~~~~~v~f~s~~~a~~a~~~~~~~~~--~~~~~~v~ 73 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVRDKDT-KSKGFAFVEFEDEEDAEKALEALNGKEL--GGRPLRVE 73 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEeeCCCC-CcceEEEEEECCHHHHHHHHHHhCCCeE--CCeEEEEe
Confidence 5899999999999999999999999999999987643 7789999999999999999999999887 55666654
No 106
>KOG0415 consensus Predicted peptidyl prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=99.29 E-value=5.3e-12 Score=106.45 Aligned_cols=84 Identities=25% Similarity=0.424 Sum_probs=80.1
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
.+.+.|||..|..-++++||.-+|+ .||.|.+|.|++|..||.+-.||||+|++.+++++|.-+|++..|++|+|.|+|
T Consensus 237 PPeNVLFVCKLNPVTtDeDLeiIFS-rFG~i~sceVIRD~ktgdsLqyaFiEFen~escE~AyFKMdNvLIDDrRIHVDF 315 (479)
T KOG0415|consen 237 PPENVLFVCKLNPVTTDEDLEIIFS-RFGKIVSCEVIRDRKTGDSLQYAFIEFENKESCEQAYFKMDNVLIDDRRIHVDF 315 (479)
T ss_pred CCcceEEEEecCCcccccchhhHHh-hcccceeeeEEecccccchhheeeeeecchhhHHHHHhhhcceeeccceEEeeh
Confidence 5788999999999999999999999 999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCC
Q 021958 251 ATNKK 255 (305)
Q Consensus 251 ~~~~~ 255 (305)
+.+-.
T Consensus 316 SQSVs 320 (479)
T KOG0415|consen 316 SQSVS 320 (479)
T ss_pred hhhhh
Confidence 87633
No 107
>KOG4454 consensus RNA binding protein (RRM superfamily) [General function prediction only]
Probab=99.28 E-value=8.7e-13 Score=104.17 Aligned_cols=141 Identities=22% Similarity=0.333 Sum_probs=118.6
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
....++|||+|+...++++.|.++|-+.|.|..+.|..++ .++.+ ||||+|.++..+.-|++.+||..+. ++.+++
T Consensus 6 ae~drtl~v~n~~~~v~eelL~ElfiqaGPV~kv~ip~~~-d~~~k-Fa~v~f~~E~sv~~a~~L~ng~~l~--~~e~q~ 81 (267)
T KOG4454|consen 6 AEMDRTLLVQNMYSGVSEELLSELFIQAGPVYKVGIPSGQ-DQEQK-FAYVFFPNENSVQLAGQLENGDDLE--EDEEQR 81 (267)
T ss_pred cchhhHHHHHhhhhhhhHHHHHHHhhccCceEEEeCCCCc-cCCCc-eeeeecccccchhhhhhhcccchhc--cchhhc
Confidence 3567899999999999999999999999999999888776 45666 9999999999999999999999884 344333
Q ss_pred eccCCCCCCCCCCCCCceEEEcC----CCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHH
Q 021958 158 NWASFGAGEKRDDTPDHTIFVGD----LAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAM 233 (305)
Q Consensus 158 ~~~~~~~~~~~~~~~~~~l~v~n----lp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~ 233 (305)
+++.|+ |...++++.+.+.|+ .-|.++.+++..+. +|.++.++|+.+....+.-.++
T Consensus 82 -----------------~~r~G~shapld~r~~~ei~~~v~s-~a~p~~~~R~~~~~-d~rnrn~~~~~~qr~~~~P~~~ 142 (267)
T KOG4454|consen 82 -----------------TLRCGNSHAPLDERVTEEILYEVFS-QAGPIEGVRIPTDN-DGRNRNFGFVTYQRLCAVPFAL 142 (267)
T ss_pred -----------------ccccCCCcchhhhhcchhhheeeec-ccCCCCCccccccc-cCCccCccchhhhhhhcCcHHh
Confidence 455566 788899999999999 99999999999886 5888999999998887777777
Q ss_pred HHhCCcee
Q 021958 234 TEMNGVFC 241 (305)
Q Consensus 234 ~~l~~~~~ 241 (305)
..-.+...
T Consensus 143 ~~y~~l~~ 150 (267)
T KOG4454|consen 143 DLYQGLEL 150 (267)
T ss_pred hhhcccCc
Confidence 76665543
No 108
>KOG4205 consensus RNA-binding protein musashi/mRNA cleavage and polyadenylation factor I complex, subunit HRP1 [RNA processing and modification]
Probab=99.28 E-value=1.1e-11 Score=106.18 Aligned_cols=122 Identities=19% Similarity=0.237 Sum_probs=98.3
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
..++|||++|.+++++|.|+++|. .||+|.++.+++|..++.++||+||+|++.+...+++.. ..+.|+|+.|.+..+
T Consensus 5 ~~~KlfiGgisw~ttee~Lr~yf~-~~Gev~d~~vm~d~~t~rsrgFgfv~f~~~~~v~~vl~~-~~h~~dgr~ve~k~a 82 (311)
T KOG4205|consen 5 ESGKLFIGGLSWETTEESLREYFS-QFGEVTDCVVMRDPSTGRSRGFGFVTFATPEGVDAVLNA-RTHKLDGRSVEPKRA 82 (311)
T ss_pred CCcceeecCcCccccHHHHHHHhc-ccCceeeEEEeccCCCCCcccccceecCCCcchheeecc-cccccCCccccceec
Confidence 568999999999999999999998 999999999999999999999999999999999988875 568899999999999
Q ss_pred CCCCCCCCCCCCCCcceEEeCCcchhhhHHHHhhccceeeEEEE
Q 021958 252 TNKKTVSGQQQYPKGTFLVVGTGFCLQIKFKFKYFIGWLIFIII 295 (305)
Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~g~~~~~~~ 295 (305)
.++...............++|.....-..+.|++++-.+.+|..
T Consensus 83 v~r~~~~~~~~~~~tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~ 126 (311)
T KOG4205|consen 83 VSREDQTKVGRHLRTKKIFVGGLPPDTTEEDFKDYFEQFGKVAD 126 (311)
T ss_pred cCcccccccccccceeEEEecCcCCCCchHHHhhhhhccceeEe
Confidence 98887755444444456677754444445566666555554443
No 109
>KOG4210 consensus Nuclear localization sequence binding protein [Transcription]
Probab=99.24 E-value=1.1e-11 Score=105.93 Aligned_cols=175 Identities=18% Similarity=0.254 Sum_probs=139.5
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
....++|++++...+.+.+...++..+|......+........++|++++.|...+.+..++.......+. +..+...
T Consensus 86 ~~~~~~f~g~~s~~~e~~~~~~~~~~~g~~~~~~~S~~~~~~~sk~~~s~~f~~ks~~~~~l~~s~~~~~~--~~~~~~d 163 (285)
T KOG4210|consen 86 GSSSTFFVGELSENIEESEDDNFSSEAGLRVDARSSSLEDSLSSKGGLSVHFAGKSQFFAALEESGSKVLD--GNKGEKD 163 (285)
T ss_pred cccccccccccccchhhccccccchhhcCcccchhhhhccccccccceeeccccHHHHHHHHHhhhccccc--cccccCc
Confidence 45788999999999999989999999998878877776667899999999999999999999653323332 2222222
Q ss_pred ccCCCC------CCCCCCCCCceEE-EcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHH
Q 021958 159 WASFGA------GEKRDDTPDHTIF-VGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 159 ~~~~~~------~~~~~~~~~~~l~-v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~ 231 (305)
...... ..+.......+++ |+||+..+++++|+.+|. .+|.|..+++..+..++..+|||+|+|.+...+..
T Consensus 164 l~~~~~~~~~n~~~~~~~~~s~~~~~~~~~~f~~~~d~~~~~~~-~~~~i~~~r~~~~~~s~~~kg~a~~~~~~~~~~~~ 242 (285)
T KOG4210|consen 164 LNTRRGLRPKNKLSRLSSGPSDTIFFVGELDFSLTRDDLKEHFV-SSGEITSVRLPTDEESGDSKGFAYVDFSAGNSKKL 242 (285)
T ss_pred ccccccccccchhcccccCccccceeecccccccchHHHhhhcc-CcCcceeeccCCCCCccchhhhhhhhhhhchhHHH
Confidence 222211 1112223344455 999999999999999998 99999999999999999999999999999999999
Q ss_pred HHHHhCCceecceeeEEccCCCCCCC
Q 021958 232 AMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 232 a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
++.. ....+.++.+.+.+..++...
T Consensus 243 ~~~~-~~~~~~~~~~~~~~~~~~~~~ 267 (285)
T KOG4210|consen 243 ALND-QTRSIGGRPLRLEEDEPRPKS 267 (285)
T ss_pred Hhhc-ccCcccCcccccccCCCCccc
Confidence 9987 788899999999998876544
No 110
>KOG4206 consensus Spliceosomal protein snRNP-U1A/U2B [RNA processing and modification]
Probab=99.23 E-value=4.2e-11 Score=95.71 Aligned_cols=85 Identities=20% Similarity=0.383 Sum_probs=75.7
Q ss_pred CCCceEEEcCCCcCCCHHHHHH----HHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceee
Q 021958 171 TPDHTIFVGDLAADVTDYMLQE----TFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPM 246 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~----~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l 246 (305)
.++.||||.||+..+..++|++ +|+ .||.|.+|.... +.+.+|.|||.|++.+.|..|++.|+|..+.|+.+
T Consensus 7 ~pn~TlYInnLnekI~~~elkrsL~~LFs-qfG~ildI~a~k---t~KmRGQA~VvFk~~~~As~A~r~l~gfpFygK~m 82 (221)
T KOG4206|consen 7 NPNGTLYINNLNEKIKKDELKRSLYLLFS-QFGKILDISAFK---TPKMRGQAFVVFKETEAASAALRALQGFPFYGKPM 82 (221)
T ss_pred CCCceEeehhccccccHHHHHHHHHHHHH-hhCCeEEEEecC---CCCccCceEEEecChhHHHHHHHHhcCCcccCchh
Confidence 3445999999999999999998 999 999998887763 56889999999999999999999999999999999
Q ss_pred EEccCCCCCCCCC
Q 021958 247 RIGPATNKKTVSG 259 (305)
Q Consensus 247 ~v~~~~~~~~~~~ 259 (305)
+|.|++.+.....
T Consensus 83 riqyA~s~sdii~ 95 (221)
T KOG4206|consen 83 RIQYAKSDSDIIA 95 (221)
T ss_pred heecccCccchhh
Confidence 9999998766543
No 111
>KOG4208 consensus Nucleolar RNA-binding protein NIFK [General function prediction only]
Probab=99.23 E-value=4e-11 Score=94.27 Aligned_cols=86 Identities=17% Similarity=0.312 Sum_probs=78.3
Q ss_pred CCCCCCceEEEcCCCcCCCHHHHHHHHhhcC-CCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceee
Q 021958 168 RDDTPDHTIFVGDLAADVTDYMLQETFRARY-PSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPM 246 (305)
Q Consensus 168 ~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~-G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l 246 (305)
........++|+.++..+.+.++..+|. +| |.|.++++.+++.||.++|||||+|++.+.|..|-+.||+..+.++.|
T Consensus 44 p~~~~~g~~~~~~~p~g~~e~~~~~~~~-q~~g~v~r~rlsRnkrTGNSKgYAFVEFEs~eVA~IaAETMNNYLl~e~lL 122 (214)
T KOG4208|consen 44 PEQEIEGVVYVDHIPHGFFETEILNYFR-QFGGTVTRFRLSRNKRTGNSKGYAFVEFESEEVAKIAAETMNNYLLMEHLL 122 (214)
T ss_pred CccCCccceeecccccchhHHHHhhhhh-hcCCeeEEEEeecccccCCcCceEEEEeccHHHHHHHHHHhhhhhhhhhee
Confidence 4445667899999999999999999998 66 888888888999999999999999999999999999999999999999
Q ss_pred EEccCCCC
Q 021958 247 RIGPATNK 254 (305)
Q Consensus 247 ~v~~~~~~ 254 (305)
.|.+-.+.
T Consensus 123 ~c~vmppe 130 (214)
T KOG4208|consen 123 ECHVMPPE 130 (214)
T ss_pred eeEEeCch
Confidence 99987665
No 112
>smart00361 RRM_1 RNA recognition motif.
Probab=99.21 E-value=6.6e-11 Score=79.86 Aligned_cols=61 Identities=18% Similarity=0.298 Sum_probs=53.3
Q ss_pred HHHHhHhhh----hcCCcEEEE-EEeeCCC--CCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 95 ETYLNTCFA----HTGEVVAVK-VIRNKQT--GQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 95 ~~~l~~~f~----~~G~v~~~~-~~~~~~~--~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
+++|+++|+ .||.|.++. +..++.+ +.++|||||+|.+.++|.+|++.|+|..+ .++.|++
T Consensus 2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~f~~~~dA~~A~~~l~g~~~--~gr~l~~ 69 (70)
T smart00361 2 DEDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYITFERSEDAARAIVDLNGRYF--DGRTVKA 69 (70)
T ss_pred chhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEEECCHHHHHHHHHHhCCCEE--CCEEEEe
Confidence 578899998 999999995 7777666 89999999999999999999999999999 5566654
No 113
>KOG4660 consensus Protein Mei2, essential for commitment to meiosis, and related proteins [Cell cycle control, cell division, chromosome partitioning]
Probab=99.20 E-value=8.3e-11 Score=105.07 Aligned_cols=70 Identities=24% Similarity=0.303 Sum_probs=61.4
Q ss_pred CCCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCC
Q 021958 76 GQPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPN 150 (305)
Q Consensus 76 ~~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~ 150 (305)
..+..+++|+|-|||..+++++|.++|+.||+|+.++..+. .+|.+||+|-|..+|++|+++|++.+|.+
T Consensus 70 ~~~~~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir~t~~-----~~~~~~v~FyDvR~A~~Alk~l~~~~~~~ 139 (549)
T KOG4660|consen 70 EKDMNQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIRETPN-----KRGIVFVEFYDVRDAERALKALNRREIAG 139 (549)
T ss_pred cccCccceEEEEecCCcCCHHHHHHHHHhhcchhhhhcccc-----cCceEEEEEeehHhHHHHHHHHHHHHhhh
Confidence 34567899999999999999999999999999999765444 46799999999999999999999988843
No 114
>KOG0112 consensus Large RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.19 E-value=1.5e-11 Score=115.02 Aligned_cols=167 Identities=18% Similarity=0.288 Sum_probs=138.3
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
+....++||+|||+..+++.+|+..|..+|.|.+|.|-.-+. +...-|+||.|.+.+.+-.|.-.+.+..|..+...+.
T Consensus 368 D~~atrTLf~Gnl~~kl~eseiR~af~e~gkve~VDiKtP~~-~~esa~~f~~~~n~dmtp~ak~e~s~~~I~~g~~r~g 446 (975)
T KOG0112|consen 368 DFRATRTLFLGNLDSKLTESEIRPAFDESGKVEEVDIKTPHI-KTESAYAFVSLLNTDMTPSAKFEESGPLIGNGTHRIG 446 (975)
T ss_pred chhhhhhhhhcCcccchhhhhhhhhhhhhccccccccccCCC-CcccchhhhhhhccccCcccchhhcCCccccCccccc
Confidence 456678999999999999999999999999999998866532 4445599999999999999998988888854433333
Q ss_pred eeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 157 LNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 157 v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
+... .......+++++|..++....|...|. .||.|..|.+-.. .-||+|.|++...|..|+..|
T Consensus 447 lG~~--------kst~ttr~~sgglg~w~p~~~l~r~fd-~fGpir~Idy~hg------q~yayi~yes~~~aq~a~~~~ 511 (975)
T KOG0112|consen 447 LGQP--------KSTPTTRLQSGGLGPWSPVSRLNREFD-RFGPIRIIDYRHG------QPYAYIQYESPPAAQAATHDM 511 (975)
T ss_pred cccc--------ccccceeeccCCCCCCChHHHHHHHhh-ccCcceeeecccC------CcceeeecccCccchhhHHHH
Confidence 3321 345678999999999999999999998 9999998877322 569999999999999999999
Q ss_pred CCceecc--eeeEEccCCCCCCCCC
Q 021958 237 NGVFCST--RPMRIGPATNKKTVSG 259 (305)
Q Consensus 237 ~~~~~~g--~~l~v~~~~~~~~~~~ 259 (305)
.|..|+| +.++|.|+..-...-+
T Consensus 512 rgap~G~P~~r~rvdla~~~~~~Pq 536 (975)
T KOG0112|consen 512 RGAPLGGPPRRLRVDLASPPGATPQ 536 (975)
T ss_pred hcCcCCCCCcccccccccCCCCChh
Confidence 9999976 7899999987655533
No 115
>KOG0132 consensus RNA polymerase II C-terminal domain-binding protein RA4, contains RPR and RRM domains [RNA processing and modification; Transcription]
Probab=99.16 E-value=6.2e-11 Score=109.30 Aligned_cols=81 Identities=19% Similarity=0.365 Sum_probs=74.8
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
..++|||||+|+..++|.||..+|+ .||+|.+|.++.. ++||||.+.+..+|.+|+.+|++..+.++.|+|.|
T Consensus 419 V~SrTLwvG~i~k~v~e~dL~~~fe-efGeiqSi~li~~------R~cAfI~M~~RqdA~kalqkl~n~kv~~k~Iki~W 491 (894)
T KOG0132|consen 419 VCSRTLWVGGIPKNVTEQDLANLFE-EFGEIQSIILIPP------RGCAFIKMVRRQDAEKALQKLSNVKVADKTIKIAW 491 (894)
T ss_pred EeeeeeeeccccchhhHHHHHHHHH-hcccceeEeeccC------CceeEEEEeehhHHHHHHHHHhcccccceeeEEee
Confidence 3478999999999999999999999 9999999998755 89999999999999999999999999999999999
Q ss_pred CCCCCCCC
Q 021958 251 ATNKKTVS 258 (305)
Q Consensus 251 ~~~~~~~~ 258 (305)
+..+.-+.
T Consensus 492 a~g~G~ks 499 (894)
T KOG0132|consen 492 AVGKGPKS 499 (894)
T ss_pred eccCCcch
Confidence 98876654
No 116
>KOG4211 consensus Splicing factor hnRNP-F and related RNA-binding proteins [RNA processing and modification]
Probab=99.15 E-value=6.6e-10 Score=97.96 Aligned_cols=164 Identities=18% Similarity=0.230 Sum_probs=120.4
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEE-EEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVA-VKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~-~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
.....|-+.+||+.+|++||.++|+-.-.+.. +.+..+. .+++.|-|||.|++.+.|++|+.. +...| ..+.|.|
T Consensus 101 ~~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d~-rgR~tGEAfVqF~sqe~ae~Al~r-hre~i--GhRYIEv 176 (510)
T KOG4211|consen 101 ANDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMDQ-RGRPTGEAFVQFESQESAEIALGR-HRENI--GHRYIEV 176 (510)
T ss_pred CCCceEEecCCCccCcHHHHHHHhcCCcccccceeeeccC-CCCcccceEEEecCHHHHHHHHHH-HHHhh--ccceEEe
Confidence 45568999999999999999999998765555 4455554 578999999999999999999964 33333 3333333
Q ss_pred eccCCC--------------------------------------------------------------------------
Q 021958 158 NWASFG-------------------------------------------------------------------------- 163 (305)
Q Consensus 158 ~~~~~~-------------------------------------------------------------------------- 163 (305)
..+...
T Consensus 177 F~Ss~~e~~~~~~~~~~~~~rpGpy~~~~a~Rg~~d~~~~~~~~~~~~r~g~~~~g~~g~~~~~~~~d~~~~gs~~~~~~ 256 (510)
T KOG4211|consen 177 FRSSRAEVKRAAGPGDGRVGRPGPYDRPGAPRGGYDYGQGRDPGRNATRYGAGGEGYYGFSRYPSLQDYGNFGSYGGGRD 256 (510)
T ss_pred ehhHHHHHHhhccccccccCCCCccccccCCccccccccccCCCccccccccccCCccccccCccccccccccccccccc
Confidence 222110
Q ss_pred ---CC----CC-------CC-CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHH
Q 021958 164 ---AG----EK-------RD-DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESE 228 (305)
Q Consensus 164 ---~~----~~-------~~-~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~ 228 (305)
.. .+ .. ......++.++||+..++.++..+|+ ......|.|-... +|+..|-|+|+|.+.++
T Consensus 257 ~~~~~~~g~~~~g~~g~~~~~~~~g~fv~MRGlpy~a~~~di~nfFs--pl~p~~v~i~ig~-dGr~TGEAdveF~t~ed 333 (510)
T KOG4211|consen 257 PNYPVSSGPHRQGGAGDYGNGGPGGHFVHMRGLPYDATENDIANFFS--PLNPYRVHIEIGP-DGRATGEADVEFATGED 333 (510)
T ss_pred cccCCCCCcccCCCcccccCCCCCCceeeecCCCccCCCcchhhhcC--CCCceeEEEEeCC-CCccCCcceeecccchh
Confidence 00 00 00 01226788999999999999999997 4455578777764 89999999999999999
Q ss_pred HHHHHHHhCCceecceeeEEcc
Q 021958 229 QLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 229 A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
|..|+.. ++..+..+-|....
T Consensus 334 av~Amsk-d~anm~hrYVElFl 354 (510)
T KOG4211|consen 334 AVGAMGK-DGANMGHRYVELFL 354 (510)
T ss_pred hHhhhcc-CCcccCcceeeecc
Confidence 9999875 67777777777654
No 117
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=99.15 E-value=1.2e-10 Score=74.88 Aligned_cols=56 Identities=29% Similarity=0.644 Sum_probs=49.4
Q ss_pred HhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 98 LNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 98 l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
|+++|++||.|.++.+..+. +++|||+|.+.++|.+|++.|+|..+ .++.+++.|+
T Consensus 1 L~~~f~~fG~V~~i~~~~~~-----~~~a~V~f~~~~~A~~a~~~l~~~~~--~g~~l~V~~a 56 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKKK-----RGFAFVEFASVEDAQKAIEQLNGRQF--NGRPLKVSYA 56 (56)
T ss_dssp HHHHHTTTS-EEEEEEETTS-----TTEEEEEESSHHHHHHHHHHHTTSEE--TTEEEEEEEE
T ss_pred ChHHhCCcccEEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEE--CCcEEEEEEC
Confidence 67899999999999987653 57999999999999999999999999 6788999875
No 118
>KOG4307 consensus RNA binding protein RBM12/SWAN [General function prediction only]
Probab=99.15 E-value=1.2e-09 Score=99.72 Aligned_cols=169 Identities=12% Similarity=0.069 Sum_probs=124.1
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
+.+.+-+.+.+.+.++.+++++|... .+....+..+...+...|.++|+|....++.+|++. +...+ ..|.+++..
T Consensus 310 d~~y~~~~gm~fn~~~nd~rkfF~g~-~~~~~~l~~~~v~~~~tG~~~v~f~~~~~~q~A~~r-n~~~~--~~R~~q~~P 385 (944)
T KOG4307|consen 310 DKYYNNYKGMEFNNDFNDGRKFFPGR-NAQSTDLSENRVAPPQTGRKTVMFTPQAPFQNAFTR-NPSDD--VNRPFQTGP 385 (944)
T ss_pred hhheeeecccccccccchhhhhcCcc-cccccchhhhhcCCCcCCceEEEecCcchHHHHHhc-Cchhh--hhcceeecC
Confidence 34555667899999999999998743 255666666655555578999999999999999864 22222 244444433
Q ss_pred cCCC---------------------------------CCCC--CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeE-
Q 021958 160 ASFG---------------------------------AGEK--RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKG- 203 (305)
Q Consensus 160 ~~~~---------------------------------~~~~--~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~- 203 (305)
.... .... .....+.+|||..||..+++.++.+.|. .--.|++
T Consensus 386 ~g~~~~~~a~~~~~~~~~~~~~~~hg~p~~~pr~~~~~gq~vp~P~~ag~~lyv~~lP~~t~~~~~v~~f~-~~~~Ved~ 464 (944)
T KOG4307|consen 386 PGNLGRNGAPPFQAGVPPPVIQNNHGRPIAPPRAMVRPGQNVPFPGGAGGALYVFQLPVMTPIVPPVNKFM-GAAAVEDF 464 (944)
T ss_pred CCccccccCccccccCCCCcccccCCCCCCCcccccCCCCCCCCCCCccceEEeccCCccccccchhhhhh-hhhhhhhe
Confidence 3221 1111 2224577999999999999999999998 5556666
Q ss_pred EEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCC
Q 021958 204 AKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNK 254 (305)
Q Consensus 204 i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~ 254 (305)
|.|.... +++.++.|||.|...+++..|+..-+.++++.|.|+|+-...+
T Consensus 465 I~lt~~P-~~~~~~~afv~F~~~~a~~~a~~~~~k~y~G~r~irv~si~~~ 514 (944)
T KOG4307|consen 465 IELTRLP-TDLLRPAAFVAFIHPTAPLTASSVKTKFYPGHRIIRVDSIADY 514 (944)
T ss_pred eEeccCC-cccccchhhheeccccccchhhhcccccccCceEEEeechhhH
Confidence 6666554 7888999999999999999998887888999999999865443
No 119
>KOG0153 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.14 E-value=2.3e-10 Score=96.54 Aligned_cols=80 Identities=20% Similarity=0.367 Sum_probs=70.9
Q ss_pred CCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh-CCceeccee
Q 021958 167 KRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM-NGVFCSTRP 245 (305)
Q Consensus 167 ~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l-~~~~~~g~~ 245 (305)
..++....+|||++|...++|.+|+++|. +||+|+++++... +++|||+|.+.++|+.|..++ +...|+|++
T Consensus 222 pPeD~~I~tLyIg~l~d~v~e~dIrdhFy-qyGeirsi~~~~~------~~CAFv~ftTR~aAE~Aae~~~n~lvI~G~R 294 (377)
T KOG0153|consen 222 PPEDTSIKTLYIGGLNDEVLEQDIRDHFY-QYGEIRSIRILPR------KGCAFVTFTTREAAEKAAEKSFNKLVINGFR 294 (377)
T ss_pred CCcccceeEEEecccccchhHHHHHHHHh-hcCCeeeEEeecc------cccceeeehhhHHHHHHHHhhcceeeecceE
Confidence 35566778999999999999999999999 9999999999866 679999999999999988766 445589999
Q ss_pred eEEccCCC
Q 021958 246 MRIGPATN 253 (305)
Q Consensus 246 l~v~~~~~ 253 (305)
|+|.|+.+
T Consensus 295 l~i~Wg~~ 302 (377)
T KOG0153|consen 295 LKIKWGRP 302 (377)
T ss_pred EEEEeCCC
Confidence 99999998
No 120
>KOG0128 consensus RNA-binding protein SART3 (RRM superfamily) [RNA processing and modification]
Probab=99.09 E-value=1.1e-11 Score=115.44 Aligned_cols=150 Identities=15% Similarity=0.232 Sum_probs=131.3
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
....++||+||+..+.+.+|...|..+|.+..+++...+.+++.+|+||++|...+++.+|+.-.++..+ +
T Consensus 665 R~~~~~fvsnl~~~~~~~dl~~~~~~~~~~e~vqi~~h~n~~~~rG~~Y~~F~~~~~~~aaV~f~d~~~~---g------ 735 (881)
T KOG0128|consen 665 RDLIKIFVSNLSPKMSEEDLSERFSPSGTIEVVQIVIHKNEKRFRGKAYVEFLKPEHAGAAVAFRDSCFF---G------ 735 (881)
T ss_pred HHHHHHHHhhcchhhcCchhhhhcCccchhhhHHHHHHhhccccccceeeEeecCCchhhhhhhhhhhhh---h------
Confidence 3456899999999999999999999999988887776677899999999999999999999965444433 1
Q ss_pred ccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCC
Q 021958 159 WASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNG 238 (305)
Q Consensus 159 ~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~ 238 (305)
...++|.|.|+..|.++|+.++. .+|++.+++++..+ .|+.+|.|+|.|.+..+|.+++...+.
T Consensus 736 --------------K~~v~i~g~pf~gt~e~~k~l~~-~~gn~~~~~~vt~r-~gkpkg~a~v~y~~ea~~s~~~~s~d~ 799 (881)
T KOG0128|consen 736 --------------KISVAISGPPFQGTKEELKSLAS-KTGNVTSLRLVTVR-AGKPKGKARVDYNTEADASRKVASVDV 799 (881)
T ss_pred --------------hhhhheeCCCCCCchHHHHhhcc-ccCCccccchhhhh-ccccccceeccCCCcchhhhhcccchh
Confidence 36799999999999999999999 99999999988775 799999999999999999999998888
Q ss_pred ceecceeeEEccCCC
Q 021958 239 VFCSTRPMRIGPATN 253 (305)
Q Consensus 239 ~~~~g~~l~v~~~~~ 253 (305)
..+.-+.+.|..+++
T Consensus 800 ~~~rE~~~~v~vsnp 814 (881)
T KOG0128|consen 800 AGKRENNGEVQVSNP 814 (881)
T ss_pred hhhhhcCccccccCC
Confidence 888888888887655
No 121
>KOG4307 consensus RNA binding protein RBM12/SWAN [General function prediction only]
Probab=99.08 E-value=1.6e-09 Score=99.08 Aligned_cols=76 Identities=20% Similarity=0.235 Sum_probs=66.5
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
+.|-+.|+|++++-+||.++|. -|-.+-.-.+++..+.|+.+|-|.|.|++.++|.+|...|+++.|.+|+|++..
T Consensus 868 ~V~~~~n~Pf~v~l~dI~~FF~-dY~~~p~sI~~r~nd~G~pTGe~mvAfes~~eAr~A~~dl~~~~i~nr~V~l~i 943 (944)
T KOG4307|consen 868 RVLSCNNFPFDVTLEDIVEFFN-DYEPDPNSIRIRRNDDGVPTGECMVAFESQEEARRASMDLDGQKIRNRVVSLRI 943 (944)
T ss_pred eEEEecCCCccccHHHHHHHhc-ccccCCCceeEeecCCCCcccceeEeecCHHHHHhhhhccccCcccceeEEEEe
Confidence 4788999999999999999999 887765444444446899999999999999999999999999999999998863
No 122
>KOG4208 consensus Nucleolar RNA-binding protein NIFK [General function prediction only]
Probab=99.08 E-value=4.2e-10 Score=88.56 Aligned_cols=81 Identities=25% Similarity=0.320 Sum_probs=71.3
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhc-CCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHT-GEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~-G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
.....+||..++.-+.+.++..+|.++ |.+..+++.+++.||.++|||||+|++.+.|.-|.+.||+..+ .++.+.+
T Consensus 47 ~~~g~~~~~~~p~g~~e~~~~~~~~q~~g~v~r~rlsRnkrTGNSKgYAFVEFEs~eVA~IaAETMNNYLl--~e~lL~c 124 (214)
T KOG4208|consen 47 EIEGVVYVDHIPHGFFETEILNYFRQFGGTVTRFRLSRNKRTGNSKGYAFVEFESEEVAKIAAETMNNYLL--MEHLLEC 124 (214)
T ss_pred CCccceeecccccchhHHHHhhhhhhcCCeeEEEEeecccccCCcCceEEEEeccHHHHHHHHHHhhhhhh--hhheeee
Confidence 445679999999999999999999998 6788889989999999999999999999999999999999988 5666666
Q ss_pred eccC
Q 021958 158 NWAS 161 (305)
Q Consensus 158 ~~~~ 161 (305)
.+-.
T Consensus 125 ~vmp 128 (214)
T KOG4208|consen 125 HVMP 128 (214)
T ss_pred EEeC
Confidence 5443
No 123
>KOG0132 consensus RNA polymerase II C-terminal domain-binding protein RA4, contains RPR and RRM domains [RNA processing and modification; Transcription]
Probab=99.07 E-value=2.9e-10 Score=104.95 Aligned_cols=107 Identities=23% Similarity=0.372 Sum_probs=87.0
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
++|||||+|+.++++.||.++|+.||.|.+|.++.. +|||||.+...++|.+|+.+|+...+ ..+.|++.|+
T Consensus 421 SrTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~~------R~cAfI~M~~RqdA~kalqkl~n~kv--~~k~Iki~Wa 492 (894)
T KOG0132|consen 421 SRTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIPP------RGCAFIKMVRRQDAEKALQKLSNVKV--ADKTIKIAWA 492 (894)
T ss_pred eeeeeeccccchhhHHHHHHHHHhcccceeEeeccC------CceeEEEEeehhHHHHHHHHHhcccc--cceeeEEeee
Confidence 489999999999999999999999999999998765 46999999999999999999997777 7899999999
Q ss_pred CCCCCCC-CCCCCCceEEEcCCCcCCCHHHHHHHHh
Q 021958 161 SFGAGEK-RDDTPDHTIFVGDLAADVTDYMLQETFR 195 (305)
Q Consensus 161 ~~~~~~~-~~~~~~~~l~v~nlp~~~~~~~l~~~f~ 195 (305)
....... ..+.....+=|.-||++.-.++|+.+++
T Consensus 493 ~g~G~kse~k~~wD~~lGVt~IP~~kLt~dl~~~~e 528 (894)
T KOG0132|consen 493 VGKGPKSEYKDYWDVELGVTYIPWEKLTDDLEAWCE 528 (894)
T ss_pred ccCCcchhhhhhhhcccCeeEeehHhcCHHHHHhhh
Confidence 8654332 2333444556777888755555766664
No 124
>KOG4661 consensus Hsp27-ERE-TATA-binding protein/Scaffold attachment factor (SAF-B) [Transcription]
Probab=99.03 E-value=7.8e-10 Score=98.90 Aligned_cols=82 Identities=23% Similarity=0.386 Sum_probs=76.8
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
.-.+.|||.+|...+...||+.+|+ +||.|...+|+.+..+--.+.|+||++.+.++|.+||+.||.+.|.|+.|.|..
T Consensus 403 ~~gRNlWVSGLSstTRAtDLKnlFS-KyGKVvGAKVVTNaRsPGaRCYGfVTMSts~eAtkCI~hLHrTELHGrmISVEk 481 (940)
T KOG4661|consen 403 TLGRNLWVSGLSSTTRATDLKNLFS-KYGKVVGAKVVTNARSPGARCYGFVTMSTSAEATKCIEHLHRTELHGRMISVEK 481 (940)
T ss_pred ccccceeeeccccchhhhHHHHHHH-HhcceeceeeeecCCCCCcceeEEEEecchHHHHHHHHHhhhhhhcceeeeeee
Confidence 4567899999999999999999999 999999999999977777799999999999999999999999999999999998
Q ss_pred CCC
Q 021958 251 ATN 253 (305)
Q Consensus 251 ~~~ 253 (305)
+++
T Consensus 482 aKN 484 (940)
T KOG4661|consen 482 AKN 484 (940)
T ss_pred ccc
Confidence 876
No 125
>KOG0415 consensus Predicted peptidyl prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=99.00 E-value=7e-10 Score=93.82 Aligned_cols=82 Identities=21% Similarity=0.421 Sum_probs=76.0
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
...+.|||..|.+-+|+++|.-+|+.||.|.+|.+++|..||.+..||||+|++.+++++|.-+|+...| ..+.|.|+
T Consensus 237 PPeNVLFVCKLNPVTtDeDLeiIFSrFG~i~sceVIRD~ktgdsLqyaFiEFen~escE~AyFKMdNvLI--DDrRIHVD 314 (479)
T KOG0415|consen 237 PPENVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDRKTGDSLQYAFIEFENKESCEQAYFKMDNVLI--DDRRIHVD 314 (479)
T ss_pred CCcceEEEEecCCcccccchhhHHhhcccceeeeEEecccccchhheeeeeecchhhHHHHHhhhcceee--ccceEEee
Confidence 3457999999999999999999999999999999999999999999999999999999999999999998 56888888
Q ss_pred ccCC
Q 021958 159 WASF 162 (305)
Q Consensus 159 ~~~~ 162 (305)
++..
T Consensus 315 FSQS 318 (479)
T KOG0415|consen 315 FSQS 318 (479)
T ss_pred hhhh
Confidence 8764
No 126
>KOG1365 consensus RNA-binding protein Fusilli, contains RRM domain [RNA processing and modification; General function prediction only]
Probab=98.98 E-value=3.7e-09 Score=90.43 Aligned_cols=170 Identities=16% Similarity=0.174 Sum_probs=118.0
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
.++...+-.++|++..++.+|..+|.-.....-.+.+-....|+..|.+.|.|.+.|.-+.|++. +...+ .++.+.+
T Consensus 57 ~~~~vvvRaRglpwq~Sd~~ia~ff~gl~ia~gg~aKOG~~qgrRnge~lvrf~d~e~RdlalkR-hkhh~--g~ryiev 133 (508)
T KOG1365|consen 57 ADDNVVVRARGLPWQSSDQDIARFFKGLNIANGGRALCLNAQGRRNGEALVRFVDPEGRDLALKR-HKHHM--GTRYIEV 133 (508)
T ss_pred cCcceEEEecCCCCCcccCCHHHHHhhhhccccceeeeehhhhccccceEEEecCchhhhhhhHh-hhhhc--cCCceee
Confidence 34455677799999999999999997553322222222223477789999999999998888865 33333 5666776
Q ss_pred eccCCCCCC-----------C-CCCCCCceEEEcCCCcCCCHHHHHHHHhhc---CCCeeEEEEEecCCCCCcccEEEEE
Q 021958 158 NWASFGAGE-----------K-RDDTPDHTIFVGDLAADVTDYMLQETFRAR---YPSTKGAKVVIDRLTGRTKGYGFVR 222 (305)
Q Consensus 158 ~~~~~~~~~-----------~-~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~---~G~i~~i~i~~~~~~~~~~g~afV~ 222 (305)
-.+.....- . ......-.|.+++||+++++.|+.++|... -|..+.|.++... +|+.+|-|||.
T Consensus 134 Yka~ge~f~~iagg~s~e~~~flsk~~qvivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~rp-dgrpTGdAFvl 212 (508)
T KOG1365|consen 134 YKATGEEFLKIAGGTSNEAAPFLSKENQVIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTRP-DGRPTGDAFVL 212 (508)
T ss_pred eccCchhheEecCCccccCCCCCCcccceEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEECC-CCCcccceEEE
Confidence 555432110 0 111234568899999999999999999411 3455666666554 78999999999
Q ss_pred eCCHHHHHHHHHHhCCceecceeeEEccCC
Q 021958 223 FGDESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 223 f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
|...++|..||.+ |...|+-|.|.+..++
T Consensus 213 fa~ee~aq~aL~k-hrq~iGqRYIElFRST 241 (508)
T KOG1365|consen 213 FACEEDAQFALRK-HRQNIGQRYIELFRST 241 (508)
T ss_pred ecCHHHHHHHHHH-HHHHHhHHHHHHHHHh
Confidence 9999999999986 5555666666664443
No 127
>KOG0153 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=98.89 E-value=6.9e-09 Score=87.79 Aligned_cols=82 Identities=22% Similarity=0.424 Sum_probs=70.5
Q ss_pred CCCCCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCc
Q 021958 74 QQGQPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQ 153 (305)
Q Consensus 74 ~~~~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~ 153 (305)
-+.++....+|||++|...+++.+|++.|.+||+|+++.+...+ ++|||+|.+.+.|+.|.+++-. .+...|.
T Consensus 221 epPeD~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~~~~------~CAFv~ftTR~aAE~Aae~~~n-~lvI~G~ 293 (377)
T KOG0153|consen 221 EPPEDTSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRILPRK------GCAFVTFTTREAAEKAAEKSFN-KLVINGF 293 (377)
T ss_pred CCCcccceeEEEecccccchhHHHHHHHHhhcCCeeeEEeeccc------ccceeeehhhHHHHHHHHhhcc-eeeecce
Confidence 34466788999999999999999999999999999999998764 4999999999999999877544 3334789
Q ss_pred ceeeeccCC
Q 021958 154 NFRLNWASF 162 (305)
Q Consensus 154 ~i~v~~~~~ 162 (305)
.++|.|+..
T Consensus 294 Rl~i~Wg~~ 302 (377)
T KOG0153|consen 294 RLKIKWGRP 302 (377)
T ss_pred EEEEEeCCC
Confidence 999999987
No 128
>KOG1548 consensus Transcription elongation factor TAT-SF1 [Transcription]
Probab=98.86 E-value=2.4e-08 Score=84.48 Aligned_cols=85 Identities=15% Similarity=0.173 Sum_probs=75.9
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeE--------EEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceec
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKG--------AKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCS 242 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~--------i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~ 242 (305)
..+..|||.|||.++|.+++.++|+ +||.|.+ |++.++. .|+-+|=|+|.|-..++..-|++.|++..+.
T Consensus 132 ~~Nt~VYVsgLP~DiT~dE~~~~~s-KcGiI~~d~~t~epk~KlYrd~-~G~lKGDaLc~y~K~ESVeLA~~ilDe~~~r 209 (382)
T KOG1548|consen 132 KVNTSVYVSGLPLDITVDEFAEVMS-KCGIIMRDPQTGEPKVKLYRDN-QGKLKGDALCCYIKRESVELAIKILDEDELR 209 (382)
T ss_pred ccCceEEecCCCCcccHHHHHHHHH-hcceEeccCCCCCeeEEEEecC-CCCccCceEEEeecccHHHHHHHHhCccccc
Confidence 3456799999999999999999999 9999875 6777775 6999999999999999999999999999999
Q ss_pred ceeeEEccCCCCCCC
Q 021958 243 TRPMRIGPATNKKTV 257 (305)
Q Consensus 243 g~~l~v~~~~~~~~~ 257 (305)
|+.|+|..|+-....
T Consensus 210 g~~~rVerAkfq~Kg 224 (382)
T KOG1548|consen 210 GKKLRVERAKFQMKG 224 (382)
T ss_pred CcEEEEehhhhhhcc
Confidence 999999998765544
No 129
>KOG0533 consensus RRM motif-containing protein [RNA processing and modification]
Probab=98.82 E-value=1.6e-08 Score=83.57 Aligned_cols=85 Identities=31% Similarity=0.430 Sum_probs=76.4
Q ss_pred CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEE
Q 021958 169 DDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRI 248 (305)
Q Consensus 169 ~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v 248 (305)
......+|.|.||+..++++||+++|. .||.++.+-+.+|. .|.+.|.|-|.|...++|.+|++.++|..++|+.|++
T Consensus 79 ~~~~~~~v~v~NL~~~V~~~Dl~eLF~-~~~~~~r~~vhy~~-~G~s~Gta~v~~~r~~DA~~avk~~~gv~ldG~~mk~ 156 (243)
T KOG0533|consen 79 NETRSTKVNVSNLPYGVIDADLKELFA-EFGELKRVAVHYDR-AGRSLGTADVSFNRRDDAERAVKKYNGVALDGRPMKI 156 (243)
T ss_pred cCCCcceeeeecCCcCcchHHHHHHHH-HhccceEEeeccCC-CCCCCccceeeecchHhHHHHHHHhcCcccCCceeee
Confidence 334457899999999999999999999 99999999999886 8999999999999999999999999999999999999
Q ss_pred ccCCCCC
Q 021958 249 GPATNKK 255 (305)
Q Consensus 249 ~~~~~~~ 255 (305)
.......
T Consensus 157 ~~i~~~~ 163 (243)
T KOG0533|consen 157 EIISSPS 163 (243)
T ss_pred EEecCcc
Confidence 8765543
No 130
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=98.81 E-value=4.7e-08 Score=69.28 Aligned_cols=81 Identities=20% Similarity=0.253 Sum_probs=69.0
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcC-CCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceec----ceeeEE
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARY-PSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCS----TRPMRI 248 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~-G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~----g~~l~v 248 (305)
+||.|+|+|...+.++|.+++.+.+ |...-+.+..|..++.+.|||||.|.+.+.|.+-.+.++|..+. .+...|
T Consensus 2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~~N~GYAFVNf~~~~~~~~F~~~f~g~~w~~~~s~Kvc~i 81 (97)
T PF04059_consen 2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNKCNLGYAFVNFTSPQAAIRFYKAFNGKKWPNFNSKKVCEI 81 (97)
T ss_pred eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCCCceEEEEEEcCCHHHHHHHHHHHcCCccccCCCCcEEEE
Confidence 6899999999999999999987433 55677788889889999999999999999999999999998864 466777
Q ss_pred ccCCCC
Q 021958 249 GPATNK 254 (305)
Q Consensus 249 ~~~~~~ 254 (305)
.||+-+
T Consensus 82 ~yAriQ 87 (97)
T PF04059_consen 82 SYARIQ 87 (97)
T ss_pred ehhHhh
Confidence 777643
No 131
>KOG1457 consensus RNA binding protein (contains RRM repeats) [General function prediction only]
Probab=98.77 E-value=7e-08 Score=76.96 Aligned_cols=86 Identities=24% Similarity=0.310 Sum_probs=72.4
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecC-CCCCcccEEEEEeCCHHHHHHHHHHhCCcee---cceee
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDR-LTGRTKGYGFVRFGDESEQLRAMTEMNGVFC---STRPM 246 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~-~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~---~g~~l 246 (305)
...+||||.+||.++.-.+|..+|. .|-..+.+.+.... .....+-+|||.|.+..+|.+|+++|||..| .+..|
T Consensus 32 ~~VRTLFVSGLP~DvKpREiynLFR-~f~GYEgslLK~Tsk~~~~~~pvaFatF~s~q~A~aamnaLNGvrFDpE~~stL 110 (284)
T KOG1457|consen 32 GAVRTLFVSGLPNDVKPREIYNLFR-RFHGYEGSLLKYTSKGDQVCKPVAFATFTSHQFALAAMNALNGVRFDPETGSTL 110 (284)
T ss_pred cccceeeeccCCcccCHHHHHHHhc-cCCCccceeeeeccCCCccccceEEEEecchHHHHHHHHHhcCeeeccccCcee
Confidence 4578999999999999999999998 89888887776542 2233467999999999999999999999998 57899
Q ss_pred EEccCCCCCCC
Q 021958 247 RIGPATNKKTV 257 (305)
Q Consensus 247 ~v~~~~~~~~~ 257 (305)
+|++++...+.
T Consensus 111 hiElAKSNtK~ 121 (284)
T KOG1457|consen 111 HIELAKSNTKR 121 (284)
T ss_pred EeeehhcCccc
Confidence 99998775544
No 132
>KOG4661 consensus Hsp27-ERE-TATA-binding protein/Scaffold attachment factor (SAF-B) [Transcription]
Probab=98.75 E-value=3e-08 Score=89.01 Aligned_cols=83 Identities=24% Similarity=0.370 Sum_probs=75.6
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
.-.+.|||.+|...+...+|+++|++||.|+-.+++.+..+.-.+.|+||++.+.++|.+||+.|+...+ .|+.|.|.
T Consensus 403 ~~gRNlWVSGLSstTRAtDLKnlFSKyGKVvGAKVVTNaRsPGaRCYGfVTMSts~eAtkCI~hLHrTEL--HGrmISVE 480 (940)
T KOG4661|consen 403 TLGRNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNARSPGARCYGFVTMSTSAEATKCIEHLHRTEL--HGRMISVE 480 (940)
T ss_pred ccccceeeeccccchhhhHHHHHHHHhcceeceeeeecCCCCCcceeEEEEecchHHHHHHHHHhhhhhh--cceeeeee
Confidence 4468899999999999999999999999999999999877777788999999999999999999999999 67899998
Q ss_pred ccCCC
Q 021958 159 WASFG 163 (305)
Q Consensus 159 ~~~~~ 163 (305)
.++..
T Consensus 481 kaKNE 485 (940)
T KOG4661|consen 481 KAKNE 485 (940)
T ss_pred ecccC
Confidence 88765
No 133
>KOG0226 consensus RNA-binding proteins [General function prediction only]
Probab=98.75 E-value=1.1e-08 Score=83.07 Aligned_cols=82 Identities=28% Similarity=0.481 Sum_probs=74.1
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
+....+||.|.|-.+++++.|...|.+|-.....++++|+.+|+++||+||.|.+..++.+|+++++|+.+ +.+.|++
T Consensus 187 ~~~DfRIfcgdlgNevnd~vl~raf~Kfpsf~~akviRdkRTgKSkgygfVSf~~pad~~rAmrem~gkyV--gsrpikl 264 (290)
T KOG0226|consen 187 DEDDFRIFCGDLGNEVNDDVLARAFKKFPSFQKAKVIRDKRTGKSKGYGFVSFRDPADYVRAMREMNGKYV--GSRPIKL 264 (290)
T ss_pred ccccceeecccccccccHHHHHHHHHhccchhhccccccccccccccceeeeecCHHHHHHHHHhhccccc--ccchhHh
Confidence 45567899999999999999999999999888899999999999999999999999999999999999999 6677776
Q ss_pred eccC
Q 021958 158 NWAS 161 (305)
Q Consensus 158 ~~~~ 161 (305)
..+.
T Consensus 265 RkS~ 268 (290)
T KOG0226|consen 265 RKSE 268 (290)
T ss_pred hhhh
Confidence 5443
No 134
>KOG0116 consensus RasGAP SH3 binding protein rasputin, contains NTF2 and RRM domains [Signal transduction mechanisms]
Probab=98.73 E-value=3.5e-08 Score=88.06 Aligned_cols=81 Identities=21% Similarity=0.354 Sum_probs=69.4
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCC
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
..+|||+|||.+++.++|+++|+ .||.|+...|......++...||||+|.+.+++..|+.+ +-..+++++|.|+-.+
T Consensus 288 ~~~i~V~nlP~da~~~~l~~~Fk-~FG~Ik~~~I~vr~~~~~~~~fgFV~f~~~~~~~~~i~A-sp~~ig~~kl~Veek~ 365 (419)
T KOG0116|consen 288 GLGIFVKNLPPDATPAELEEVFK-QFGPIKEGGIQVRSPGGKNPCFGFVEFENAAAVQNAIEA-SPLEIGGRKLNVEEKR 365 (419)
T ss_pred ccceEeecCCCCCCHHHHHHHHh-hcccccccceEEeccCCCcCceEEEEEeecchhhhhhhc-CccccCCeeEEEEecc
Confidence 34599999999999999999999 999999988877543455559999999999999999987 5778999999998766
Q ss_pred CCC
Q 021958 253 NKK 255 (305)
Q Consensus 253 ~~~ 255 (305)
...
T Consensus 366 ~~~ 368 (419)
T KOG0116|consen 366 PGF 368 (419)
T ss_pred ccc
Confidence 533
No 135
>KOG4660 consensus Protein Mei2, essential for commitment to meiosis, and related proteins [Cell cycle control, cell division, chromosome partitioning]
Probab=98.73 E-value=9.2e-09 Score=92.21 Aligned_cols=72 Identities=22% Similarity=0.330 Sum_probs=66.2
Q ss_pred CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 170 DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
+.+..+|+|-|||..+++++|.++|+ .||+|+.|+.-... +|..||+|.|+.+|++|+++|++..|.|+.|+
T Consensus 72 ~~~~~~L~v~nl~~~Vsn~~L~~~f~-~yGeir~ir~t~~~-----~~~~~v~FyDvR~A~~Alk~l~~~~~~~~~~k 143 (549)
T KOG4660|consen 72 DMNQGTLVVFNLPRSVSNDTLLRIFG-AYGEIREIRETPNK-----RGIVFVEFYDVRDAERALKALNRREIAGKRIK 143 (549)
T ss_pred cCccceEEEEecCCcCCHHHHHHHHH-hhcchhhhhccccc-----CceEEEEEeehHhHHHHHHHHHHHHhhhhhhc
Confidence 45678999999999999999999999 99999997764443 88999999999999999999999999999999
No 136
>KOG4209 consensus Splicing factor RNPS1, SR protein superfamily [RNA processing and modification]
Probab=98.73 E-value=1.6e-08 Score=83.83 Aligned_cols=85 Identities=16% Similarity=0.284 Sum_probs=78.7
Q ss_pred CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 168 RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 168 ~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
........+||+|+.+.++.+++...|+ -||.|..+.+..|...+.++|||||+|.+.+.+.+|+. |++..|.|+.+.
T Consensus 96 ~~~~d~~sv~v~nvd~~~t~~~~e~hf~-~Cg~i~~~ti~~d~~~~~~k~~~yvef~~~~~~~~ay~-l~gs~i~~~~i~ 173 (231)
T KOG4209|consen 96 QKEVDAPSVWVGNVDFLVTLTKIELHFE-SCGGINRVTVPKDKFRGHPKGFAYVEFSSYELVEEAYK-LDGSEIPGPAIE 173 (231)
T ss_pred hhccCCceEEEeccccccccchhhheee-ccCCccceeeeccccCCCcceeEEEecccHhhhHHHhh-cCCcccccccce
Confidence 3445678999999999999999999999 99999999999999899999999999999999999999 999999999999
Q ss_pred EccCCCC
Q 021958 248 IGPATNK 254 (305)
Q Consensus 248 v~~~~~~ 254 (305)
|.+.+-+
T Consensus 174 vt~~r~~ 180 (231)
T KOG4209|consen 174 VTLKRTN 180 (231)
T ss_pred eeeeeee
Confidence 9987665
No 137
>KOG2193 consensus IGF-II mRNA-binding protein IMP, contains RRM and KH domains [RNA processing and modification; General function prediction only]
Probab=98.70 E-value=1.6e-09 Score=93.62 Aligned_cols=155 Identities=14% Similarity=0.262 Sum_probs=121.2
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
.++|++||.+.++..+|+.+|.....-.+-.++.. .||+||.+.+...|.+|++.++|+.- ..|..+.+..+.
T Consensus 2 nklyignL~p~~~psdl~svfg~ak~~~~g~fl~k------~gyafvd~pdq~wa~kaie~~sgk~e-lqGkr~e~~~sv 74 (584)
T KOG2193|consen 2 NKLYIGNLSPQVTPSDLESVFGDAKIPGSGQFLVK------SGYAFVDCPDQQWANKAIETLSGKVE-LQGKRQEVEHSV 74 (584)
T ss_pred CcccccccCCCCChHHHHHHhccccCCCCcceeee------cceeeccCCchhhhhhhHHhhchhhh-hcCceeeccchh
Confidence 47999999999999999999976532222222222 25999999999999999999888643 257777777665
Q ss_pred CCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEE-ecCCCCCcccEEEEEeCCHHHHHHHHHHhCCce
Q 021958 162 FGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVV-IDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVF 240 (305)
Q Consensus 162 ~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~-~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~ 240 (305)
.+ ...++++-|.|++.....+.|..++. .||.++.|..+ .+.+ ....-|+|.+.+.+..|+..|+|..
T Consensus 75 ~k------kqrsrk~Qirnippql~wevld~Ll~-qyg~ve~~eqvnt~~e----tavvnvty~~~~~~~~ai~kl~g~Q 143 (584)
T KOG2193|consen 75 PK------KQRSRKIQIRNIPPQLQWEVLDSLLA-QYGTVENCEQVNTDSE----TAVVNVTYSAQQQHRQAIHKLNGPQ 143 (584)
T ss_pred hH------HHHhhhhhHhcCCHHHHHHHHHHHHh-ccCCHhHhhhhccchH----HHHHHHHHHHHHHHHHHHHhhcchH
Confidence 33 23356799999999999999999998 99999998764 2322 2223378999999999999999999
Q ss_pred ecceeeEEccCCCC
Q 021958 241 CSTRPMRIGPATNK 254 (305)
Q Consensus 241 ~~g~~l~v~~~~~~ 254 (305)
+.+..++|.|.-..
T Consensus 144 ~en~~~k~~YiPde 157 (584)
T KOG2193|consen 144 LENQHLKVGYIPDE 157 (584)
T ss_pred hhhhhhhcccCchh
Confidence 99999999986543
No 138
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=98.69 E-value=1.9e-07 Score=66.17 Aligned_cols=81 Identities=21% Similarity=0.316 Sum_probs=68.3
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhc--CCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCC--CCcceee
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHT--GEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPN--GEQNFRL 157 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~--G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~--~~~~i~v 157 (305)
+||.|.|||...|.++|.+++... |...-+.+..|-.++.+.|||||.|.+.+.|.+..+.++|..+.. ..+..++
T Consensus 2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~~N~GYAFVNf~~~~~~~~F~~~f~g~~w~~~~s~Kvc~i 81 (97)
T PF04059_consen 2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNKCNLGYAFVNFTSPQAAIRFYKAFNGKKWPNFNSKKVCEI 81 (97)
T ss_pred eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCCCceEEEEEEcCCHHHHHHHHHHHcCCccccCCCCcEEEE
Confidence 689999999999999999999754 556667778888889999999999999999999999999998853 2455667
Q ss_pred eccCC
Q 021958 158 NWASF 162 (305)
Q Consensus 158 ~~~~~ 162 (305)
.||..
T Consensus 82 ~yAri 86 (97)
T PF04059_consen 82 SYARI 86 (97)
T ss_pred ehhHh
Confidence 77653
No 139
>KOG0533 consensus RRM motif-containing protein [RNA processing and modification]
Probab=98.64 E-value=1.3e-07 Score=78.30 Aligned_cols=82 Identities=23% Similarity=0.372 Sum_probs=71.2
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
......+|+|.||+..++++||+++|+.||.+..+.+..++ +|++.|.|-|.|...++|..|++.++|..++ ++.++
T Consensus 79 ~~~~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vhy~~-~G~s~Gta~v~~~r~~DA~~avk~~~gv~ld--G~~mk 155 (243)
T KOG0533|consen 79 NETRSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVHYDR-AGRSLGTADVSFNRRDDAERAVKKYNGVALD--GRPMK 155 (243)
T ss_pred cCCCcceeeeecCCcCcchHHHHHHHHHhccceEEeeccCC-CCCCCccceeeecchHhHHHHHHHhcCcccC--Cceee
Confidence 34455789999999999999999999999999888888886 6999999999999999999999999997774 55566
Q ss_pred eeccC
Q 021958 157 LNWAS 161 (305)
Q Consensus 157 v~~~~ 161 (305)
+....
T Consensus 156 ~~~i~ 160 (243)
T KOG0533|consen 156 IEIIS 160 (243)
T ss_pred eEEec
Confidence 65554
No 140
>KOG0106 consensus Alternative splicing factor SRp55/B52/SRp75 (RRM superfamily) [RNA processing and modification]
Probab=98.62 E-value=4e-08 Score=79.53 Aligned_cols=72 Identities=21% Similarity=0.417 Sum_probs=66.7
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
..+||++|++.+.+.+|..+|. .||.|.++.+. .||+||+|.+..+|..|+..||+..|+|..+.|.|++.
T Consensus 2 ~rv~vg~~~~~~~~~d~E~~f~-~yg~~~d~~mk--------~gf~fv~fed~rda~Dav~~l~~~~l~~e~~vve~~r~ 72 (216)
T KOG0106|consen 2 PRVYIGRLPYRARERDVERFFK-GYGKIPDADMK--------NGFGFVEFEDPRDADDAVHDLDGKELCGERLVVEHARG 72 (216)
T ss_pred CceeecccCCccchhHHHHHHh-hccccccceee--------cccceeccCchhhhhcccchhcCceecceeeeeecccc
Confidence 4689999999999999999999 99999999875 56889999999999999999999999999999999985
Q ss_pred C
Q 021958 254 K 254 (305)
Q Consensus 254 ~ 254 (305)
.
T Consensus 73 ~ 73 (216)
T KOG0106|consen 73 K 73 (216)
T ss_pred c
Confidence 3
No 141
>KOG4209 consensus Splicing factor RNPS1, SR protein superfamily [RNA processing and modification]
Probab=98.53 E-value=1.4e-07 Score=78.29 Aligned_cols=83 Identities=18% Similarity=0.428 Sum_probs=73.7
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
...+.+.+||+|+...+|.+++...|+.||.+..+.+..|+.++.++||+||+|.+.+.+.+++. |+|..| .++.+.
T Consensus 97 ~~~d~~sv~v~nvd~~~t~~~~e~hf~~Cg~i~~~ti~~d~~~~~~k~~~yvef~~~~~~~~ay~-l~gs~i--~~~~i~ 173 (231)
T KOG4209|consen 97 KEVDAPSVWVGNVDFLVTLTKIELHFESCGGINRVTVPKDKFRGHPKGFAYVEFSSYELVEEAYK-LDGSEI--PGPAIE 173 (231)
T ss_pred hccCCceEEEeccccccccchhhheeeccCCccceeeeccccCCCcceeEEEecccHhhhHHHhh-cCCccc--ccccce
Confidence 34566899999999999999999999999999999999999888999999999999999999996 899999 567777
Q ss_pred eeccCC
Q 021958 157 LNWASF 162 (305)
Q Consensus 157 v~~~~~ 162 (305)
+.+...
T Consensus 174 vt~~r~ 179 (231)
T KOG4209|consen 174 VTLKRT 179 (231)
T ss_pred eeeeee
Confidence 666553
No 142
>KOG4454 consensus RNA binding protein (RRM superfamily) [General function prediction only]
Probab=98.53 E-value=3.3e-08 Score=78.60 Aligned_cols=80 Identities=18% Similarity=0.163 Sum_probs=71.3
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
...++|||+|+...++||-|.++|- .-|.|..+.|..+. .++.+ ||||.|++..+..-|++.+||..+.++.++|.+
T Consensus 7 e~drtl~v~n~~~~v~eelL~Elfi-qaGPV~kv~ip~~~-d~~~k-Fa~v~f~~E~sv~~a~~L~ng~~l~~~e~q~~~ 83 (267)
T KOG4454|consen 7 EMDRTLLVQNMYSGVSEELLSELFI-QAGPVYKVGIPSGQ-DQEQK-FAYVFFPNENSVQLAGQLENGDDLEEDEEQRTL 83 (267)
T ss_pred chhhHHHHHhhhhhhhHHHHHHHhh-ccCceEEEeCCCCc-cCCCc-eeeeecccccchhhhhhhcccchhccchhhccc
Confidence 3468999999999999999999999 99999999888775 56666 999999999999999999999999999998887
Q ss_pred CCC
Q 021958 251 ATN 253 (305)
Q Consensus 251 ~~~ 253 (305)
-..
T Consensus 84 r~G 86 (267)
T KOG4454|consen 84 RCG 86 (267)
T ss_pred ccC
Confidence 443
No 143
>KOG0116 consensus RasGAP SH3 binding protein rasputin, contains NTF2 and RRM domains [Signal transduction mechanisms]
Probab=98.52 E-value=6.1e-07 Score=80.28 Aligned_cols=79 Identities=15% Similarity=0.294 Sum_probs=62.7
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
....+|||+|||.+++.++|+++|..||.|+...|......++..+||||+|++.+++..|+.+ + .+..+++.+.|.
T Consensus 286 ~~~~~i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~~~~~~~~fgFV~f~~~~~~~~~i~A-s--p~~ig~~kl~Ve 362 (419)
T KOG0116|consen 286 ADGLGIFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRSPGGKNPCFGFVEFENAAAVQNAIEA-S--PLEIGGRKLNVE 362 (419)
T ss_pred ecccceEeecCCCCCCHHHHHHHHhhcccccccceEEeccCCCcCceEEEEEeecchhhhhhhc-C--ccccCCeeEEEE
Confidence 3455699999999999999999999999999988877543355558999999999999999966 4 333356666665
Q ss_pred cc
Q 021958 159 WA 160 (305)
Q Consensus 159 ~~ 160 (305)
-.
T Consensus 363 ek 364 (419)
T KOG0116|consen 363 EK 364 (419)
T ss_pred ec
Confidence 33
No 144
>KOG0151 consensus Predicted splicing regulator, contains RRM, SWAP and RPR domains [General function prediction only]
Probab=98.50 E-value=2.7e-07 Score=84.99 Aligned_cols=83 Identities=27% Similarity=0.517 Sum_probs=74.3
Q ss_pred CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCC---CCCcccEEEEEeCCHHHHHHHHHHhCCceecceee
Q 021958 170 DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRL---TGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPM 246 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~---~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l 246 (305)
+..++.|||+||+..++++.|...|. .||.|.+++|+.... ....+.+|||.|-+..+|++|++.|+|..+.++.+
T Consensus 171 DP~TTNlyv~Nlnpsv~E~~ll~tfG-rfgPlasvKimwpRtEeEk~r~r~cgfvafmnR~D~era~k~lqg~iv~~~e~ 249 (877)
T KOG0151|consen 171 DPQTTNLYVGNLNPSVDENFLLRTFG-RFGPLASVKIMWPRTEEEKRRERNCGFVAFMNRADAERALKELQGIIVMEYEM 249 (877)
T ss_pred CCcccceeeecCCccccHHHHHHHhc-ccCcccceeeecccchhhhccccccceeeehhhhhHHHHHHHhcceeeeeeee
Confidence 45678899999999999999999999 999999999986532 23457799999999999999999999999999999
Q ss_pred EEccCCC
Q 021958 247 RIGPATN 253 (305)
Q Consensus 247 ~v~~~~~ 253 (305)
++.|++.
T Consensus 250 K~gWgk~ 256 (877)
T KOG0151|consen 250 KLGWGKA 256 (877)
T ss_pred eeccccc
Confidence 9999954
No 145
>KOG0151 consensus Predicted splicing regulator, contains RRM, SWAP and RPR domains [General function prediction only]
Probab=98.49 E-value=2.3e-07 Score=85.40 Aligned_cols=83 Identities=28% Similarity=0.553 Sum_probs=72.1
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCC---CCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcc
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQ---TGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQN 154 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~---~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~ 154 (305)
+...+.|||+||++.+++++|...|..||.|.+++|+.-+. ..+.+.|+||.|-+..+|++|++.|+|..+ .+..
T Consensus 171 DP~TTNlyv~Nlnpsv~E~~ll~tfGrfgPlasvKimwpRtEeEk~r~r~cgfvafmnR~D~era~k~lqg~iv--~~~e 248 (877)
T KOG0151|consen 171 DPQTTNLYVGNLNPSVDENFLLRTFGRFGPLASVKIMWPRTEEEKRRERNCGFVAFMNRADAERALKELQGIIV--MEYE 248 (877)
T ss_pred CCcccceeeecCCccccHHHHHHHhcccCcccceeeecccchhhhccccccceeeehhhhhHHHHHHHhcceee--eeee
Confidence 34467799999999999999999999999999999987542 245567999999999999999999999999 6788
Q ss_pred eeeeccCC
Q 021958 155 FRLNWASF 162 (305)
Q Consensus 155 i~v~~~~~ 162 (305)
+++.|++.
T Consensus 249 ~K~gWgk~ 256 (877)
T KOG0151|consen 249 MKLGWGKA 256 (877)
T ss_pred eeeccccc
Confidence 99999854
No 146
>KOG4676 consensus Splicing factor, arginine/serine-rich [RNA processing and modification]
Probab=98.45 E-value=6.9e-08 Score=82.97 Aligned_cols=164 Identities=12% Similarity=0.065 Sum_probs=116.5
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCC---CCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQT---GQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~---~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
..|.|.||.+++|.++++.+|...|.|.+++++..... ......|||.|.+...+..|....|...|+ ...|.+-
T Consensus 8 ~vIqvanispsat~dqm~tlFg~lGkI~elrlyp~~~d~~~pv~sRtcyVkf~d~~sv~vaQhLtntvfvd--raliv~p 85 (479)
T KOG4676|consen 8 GVIQVANISPSATKDQMQTLFGNLGKIPELRLYPNVDDSKIPVISRTCYVKFLDSQSVTVAQHLTNTVFVD--RALIVRP 85 (479)
T ss_pred ceeeecccCchhhHHHHHHHHhhccccccccccCCCCCccCcceeeeEEEeccCCcceeHHhhhccceeee--eeEEEEe
Confidence 48999999999999999999999999999999874422 344558999999999999888655554442 2222222
Q ss_pred ccCCCC--------------C-------------------------------------CCCCCCCCceEEEcCCCcCCCH
Q 021958 159 WASFGA--------------G-------------------------------------EKRDDTPDHTIFVGDLAADVTD 187 (305)
Q Consensus 159 ~~~~~~--------------~-------------------------------------~~~~~~~~~~l~v~nlp~~~~~ 187 (305)
+..... . ......-.++++|++|...+..
T Consensus 86 ~~~~~~p~r~af~~l~~~navprll~pdg~Lp~~~~lt~~nh~p~ailktP~Lp~~~~A~kleeirRt~~v~sl~~~~~l 165 (479)
T KOG4676|consen 86 YGDEVIPDRFAFVELADQNAVPRLLPPDGVLPGDRPLTKINHSPNAILKTPELPPQAAAKKLEEIRRTREVQSLISAAIL 165 (479)
T ss_pred cCCCCCccHHHHHhcCcccccccccCCCCccCCCCccccccCCccceecCCCCChHhhhhhhHHHHhhhhhhcchhhhcc
Confidence 221110 0 0000122467999999999999
Q ss_pred HHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 188 YMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 188 ~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
.++.++|. .+|+|.+.++.. +...-+|.++|....+...|+.. +|..+.-...++...++
T Consensus 166 ~e~~e~f~-r~Gev~ya~~as----k~~s~~c~~sf~~qts~~halr~-~gre~k~qhsr~ai~kP 225 (479)
T KOG4676|consen 166 PESGESFE-RKGEVSYAHTAS----KSRSSSCSHSFRKQTSSKHALRS-HGRERKRQHSRRAIIKP 225 (479)
T ss_pred hhhhhhhh-hcchhhhhhhhc----cCCCcchhhhHhhhhhHHHHHHh-cchhhhhhhhhhhhcCc
Confidence 99999999 999999888753 34456788999999999999875 77776544444433333
No 147
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=98.44 E-value=1.5e-06 Score=58.66 Aligned_cols=72 Identities=14% Similarity=0.171 Sum_probs=46.2
Q ss_pred ceEEEcCCCcCCCHHH----HHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEc
Q 021958 174 HTIFVGDLAADVTDYM----LQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIG 249 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~----l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~ 249 (305)
..|+|.|||.+.+-.. |++++..-.|.|.+|. .+.|+|.|.+.+.|.+|.+.|+|..+.|++|.|+
T Consensus 3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v~----------~~tAilrF~~~~~A~RA~KRmegEdVfG~kI~v~ 72 (90)
T PF11608_consen 3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSVS----------GGTAILRFPNQEFAERAQKRMEGEDVFGNKISVS 72 (90)
T ss_dssp EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE------------TT-EEEEESSHHHHHHHHHHHTT--SSSS--EEE
T ss_pred cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEEe----------CCEEEEEeCCHHHHHHHHHhhcccccccceEEEE
Confidence 4699999999988765 4555552334564441 4579999999999999999999999999999999
Q ss_pred cCCCCC
Q 021958 250 PATNKK 255 (305)
Q Consensus 250 ~~~~~~ 255 (305)
|.....
T Consensus 73 ~~~~~r 78 (90)
T PF11608_consen 73 FSPKNR 78 (90)
T ss_dssp SS--S-
T ss_pred EcCCcc
Confidence 985543
No 148
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=98.24 E-value=3.4e-06 Score=78.82 Aligned_cols=7 Identities=29% Similarity=0.282 Sum_probs=2.7
Q ss_pred HHHHHHh
Q 021958 230 LRAMTEM 236 (305)
Q Consensus 230 ~~a~~~l 236 (305)
..|++++
T Consensus 805 t~ACEE~ 811 (1102)
T KOG1924|consen 805 TAACEEL 811 (1102)
T ss_pred HHHHHHH
Confidence 3344433
No 149
>KOG0115 consensus RNA-binding protein p54nrb (RRM superfamily) [RNA processing and modification]
Probab=98.20 E-value=4.3e-06 Score=68.38 Aligned_cols=103 Identities=20% Similarity=0.355 Sum_probs=85.4
Q ss_pred HHHHHHHHHhCCCCCCCCCcceeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCC
Q 021958 134 AGAERVLQTFNGTPMPNGEQNFRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTG 213 (305)
Q Consensus 134 ~~a~~a~~~l~g~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~ 213 (305)
.-|..|..+|++... .++.+++.|+. ...|+|.||...++.+.|.+.|+ .||.|+...++.|. .+
T Consensus 5 t~ae~ak~eLd~~~~--~~~~lr~rfa~-----------~a~l~V~nl~~~~sndll~~~f~-~fg~~e~av~~vD~-r~ 69 (275)
T KOG0115|consen 5 TLAEIAKRELDGRFP--KGRSLRVRFAM-----------HAELYVVNLMQGASNDLLEQAFR-RFGPIERAVAKVDD-RG 69 (275)
T ss_pred cHHHHHHHhcCCCCC--CCCceEEEeec-----------cceEEEEecchhhhhHHHHHhhh-hcCccchheeeecc-cc
Confidence 456667778899988 78999999997 36799999999999999999999 99999998888885 68
Q ss_pred CcccEEEEEeCCHHHHHHHHHHhCCce----ecceeeEEccC
Q 021958 214 RTKGYGFVRFGDESEQLRAMTEMNGVF----CSTRPMRIGPA 251 (305)
Q Consensus 214 ~~~g~afV~f~~~~~A~~a~~~l~~~~----~~g~~l~v~~~ 251 (305)
+..+-++|+|...-.|.+|+..++-.- ..++...|...
T Consensus 70 k~t~eg~v~~~~k~~a~~a~rr~~~~g~~~~~~~~p~~VeP~ 111 (275)
T KOG0115|consen 70 KPTREGIVEFAKKPNARKAARRCREGGFGGTTGGRPVGVEPM 111 (275)
T ss_pred cccccchhhhhcchhHHHHHHHhccCccccCCCCCccCCChh
Confidence 888999999999999999999874322 35566555543
No 150
>KOG1995 consensus Conserved Zn-finger protein [General function prediction only]
Probab=98.17 E-value=2.2e-06 Score=73.38 Aligned_cols=86 Identities=21% Similarity=0.243 Sum_probs=78.2
Q ss_pred CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeE--------EEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee
Q 021958 170 DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKG--------AKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC 241 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~--------i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~ 241 (305)
.....+|||-+|+..+++++|..+|. ++|.|.. |.|-+|++|+..|+-|.|.|++...|+.|+.-++++.+
T Consensus 63 ~s~~~ti~v~g~~d~~~~~~~~~~f~-qcg~ikrnK~t~kPki~~y~dkeT~~~KGeatvS~~D~~~akaai~~~agkdf 141 (351)
T KOG1995|consen 63 KSDNETIFVWGCPDSVCENDNADFFL-QCGVIKRNKRTGKPKIKIYTDKETGAPKGEATVSYEDPPAAKAAIEWFAGKDF 141 (351)
T ss_pred ccccccceeeccCccchHHHHHHHHh-hcceeccCCCCCCcchhccccccccCcCCceeeeecChhhhhhhhhhhccccc
Confidence 44567899999999999999999999 9999864 66778889999999999999999999999999999999
Q ss_pred cceeeEEccCCCCCC
Q 021958 242 STRPMRIGPATNKKT 256 (305)
Q Consensus 242 ~g~~l~v~~~~~~~~ 256 (305)
.|..|+|.++..+..
T Consensus 142 ~gn~ikvs~a~~r~~ 156 (351)
T KOG1995|consen 142 CGNTIKVSLAERRTG 156 (351)
T ss_pred cCCCchhhhhhhccC
Confidence 999999999887664
No 151
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=98.14 E-value=5e-06 Score=60.38 Aligned_cols=59 Identities=14% Similarity=0.246 Sum_probs=39.2
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCc
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGV 239 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~ 239 (305)
+.|+|.++...++.++|++.|+ .||.|..|.+... ...|+|.|.+.++|..|+..+...
T Consensus 2 ~il~~~g~~~~~~re~iK~~f~-~~g~V~yVD~~~G------~~~g~VRf~~~~~A~~a~~~~~~~ 60 (105)
T PF08777_consen 2 CILKFSGLGEPTSREDIKEAFS-QFGEVAYVDFSRG------DTEGYVRFKTPEAAQKALEKLKEA 60 (105)
T ss_dssp -EEEEEE--SS--HHHHHHHT--SS--EEEEE--TT-------SEEEEEESS---HHHHHHHHHHT
T ss_pred eEEEEecCCCCcCHHHHHHHHH-hcCCcceEEecCC------CCEEEEEECCcchHHHHHHHHHhc
Confidence 5789999999999999999999 9999999888654 346999999999999999887544
No 152
>KOG1855 consensus Predicted RNA-binding protein [General function prediction only]
Probab=98.13 E-value=3.9e-06 Score=73.33 Aligned_cols=80 Identities=19% Similarity=0.204 Sum_probs=63.6
Q ss_pred CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEec---CCCCC----------cccEEEEEeCCHHHHHHHHH
Q 021958 168 RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVID---RLTGR----------TKGYGFVRFGDESEQLRAMT 234 (305)
Q Consensus 168 ~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~---~~~~~----------~~g~afV~f~~~~~A~~a~~ 234 (305)
...-++++|.+.|||.+-..+.|.++|+ .+|.|..|+|+.. ..+++ .+-+|+|+|+..+.|.+|.+
T Consensus 226 ~eel~srtivaenLP~Dh~~enl~kiFg-~~G~IksIRIckPgaip~d~r~~~~~~~~~~tk~~AlvEye~~~~A~KA~e 304 (484)
T KOG1855|consen 226 EEELPSRTIVAENLPLDHSYENLSKIFG-TVGSIKSIRICKPGAIPEDVRGFPKKYFELQTKECALVEYEEVEAARKARE 304 (484)
T ss_pred ccccccceEEEecCCcchHHHHHHHHhh-cccceeeeeecCCCCCCcccccCCccchhhhhhhhhhhhhhhhHHHHHHHH
Confidence 3445789999999999999999999999 9999999999976 32221 25689999999999999999
Q ss_pred HhCCceecceeeEE
Q 021958 235 EMNGVFCSTRPMRI 248 (305)
Q Consensus 235 ~l~~~~~~g~~l~v 248 (305)
.|+...-+-..++|
T Consensus 305 ~~~~e~~wr~glkv 318 (484)
T KOG1855|consen 305 LLNPEQNWRMGLKV 318 (484)
T ss_pred hhchhhhhhhcchh
Confidence 88665443333333
No 153
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=98.07 E-value=9.1e-06 Score=68.91 Aligned_cols=83 Identities=18% Similarity=0.286 Sum_probs=64.6
Q ss_pred CceEEEcCCCcCCCHHH------HHHHHhhcCCCeeEEEEEecCC-CCCcccEE--EEEeCCHHHHHHHHHHhCCceecc
Q 021958 173 DHTIFVGDLAADVTDYM------LQETFRARYPSTKGAKVVIDRL-TGRTKGYG--FVRFGDESEQLRAMTEMNGVFCST 243 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~------l~~~f~~~~G~i~~i~i~~~~~-~~~~~g~a--fV~f~~~~~A~~a~~~l~~~~~~g 243 (305)
.+-+||-+|+..+-.|+ -.++|. +||.|..|.|.+... .+...+.+ ||+|.+.|+|.+||.+.+|..++|
T Consensus 114 KNLvYVigi~pkva~Ee~~~vLk~~eyFG-QyGkI~KIvvNkkt~s~nst~~h~gvYITy~~kedAarcIa~vDgs~~DG 192 (480)
T COG5175 114 KNLVYVIGIPPKVADEEVAPVLKRHEYFG-QYGKIKKIVVNKKTSSLNSTASHAGVYITYSTKEDAARCIAEVDGSLLDG 192 (480)
T ss_pred cceeEEecCCCCCCcccccccccchhhhh-hccceeEEEecccccccccccccceEEEEecchHHHHHHHHHhccccccC
Confidence 35688889998877766 246788 999998887765431 12222323 899999999999999999999999
Q ss_pred eeeEEccCCCCCC
Q 021958 244 RPMRIGPATNKKT 256 (305)
Q Consensus 244 ~~l~v~~~~~~~~ 256 (305)
|.|+..|..-+.-
T Consensus 193 r~lkatYGTTKYC 205 (480)
T COG5175 193 RVLKATYGTTKYC 205 (480)
T ss_pred ceEeeecCchHHH
Confidence 9999999887543
No 154
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=97.94 E-value=1.3e-05 Score=68.20 Aligned_cols=75 Identities=19% Similarity=0.393 Sum_probs=65.6
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCC--cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCC
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGE--VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNG 151 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~--v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~ 151 (305)
......++|||||-|.+|++||.+.+...|- +.++++..++.+|.+||||+|...+....++.++.|-.++|.|.
T Consensus 76 ~~Grk~~~YvGNL~W~TTD~DL~~A~~S~G~~~~~dmKFFENR~NGQSKG~AL~~~~SdAa~Kq~MeiLP~k~iHGQ 152 (498)
T KOG4849|consen 76 SEGRKYCCYVGNLLWYTTDADLLKALQSTGLAQFADMKFFENRTNGQSKGYALLVLNSDAAVKQTMEILPTKTIHGQ 152 (498)
T ss_pred ccCceEEEEecceeEEeccHHHHHHHHhhhHHHHhhhhhhhcccCCcccceEEEEecchHHHHHHHHhcccceecCC
Confidence 4456678999999999999999999987775 77889999999999999999999999999999988877777653
No 155
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=97.92 E-value=8e-05 Score=50.50 Aligned_cols=69 Identities=17% Similarity=0.292 Sum_probs=45.2
Q ss_pred ceEEEcCCCcccCHHHHhH----hhhhcCC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCccee
Q 021958 82 RTLWIGDLQYWMDETYLNT----CFAHTGE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFR 156 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~----~f~~~G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~ 156 (305)
..|||.|||.+.+...|+. ++..||. |.++. .+.|.|.|.+.+.|.+|.+.|+|..+ .|..|.
T Consensus 3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v~----------~~tAilrF~~~~~A~RA~KRmegEdV--fG~kI~ 70 (90)
T PF11608_consen 3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSVS----------GGTAILRFPNQEFAERAQKRMEGEDV--FGNKIS 70 (90)
T ss_dssp EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE------------TT-EEEEESSHHHHHHHHHHHTT--S--SSS--E
T ss_pred cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEEe----------CCEEEEEeCCHHHHHHHHHhhccccc--ccceEE
Confidence 4699999999888776655 4557765 54441 24799999999999999999999988 678888
Q ss_pred eeccCC
Q 021958 157 LNWASF 162 (305)
Q Consensus 157 v~~~~~ 162 (305)
+.+...
T Consensus 71 v~~~~~ 76 (90)
T PF11608_consen 71 VSFSPK 76 (90)
T ss_dssp EESS--
T ss_pred EEEcCC
Confidence 887743
No 156
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=97.86 E-value=2.1e-05 Score=66.76 Aligned_cols=112 Identities=8% Similarity=0.183 Sum_probs=75.6
Q ss_pred CceEEEcCCCcccCHHHH------hHhhhhcCCcEEEEEEeeCCC-CCccce--EEEEEcCHHHHHHHHHHhCCCCCCCC
Q 021958 81 IRTLWIGDLQYWMDETYL------NTCFAHTGEVVAVKVIRNKQT-GQIEGY--GFIEFISRAGAERVLQTFNGTPMPNG 151 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l------~~~f~~~G~v~~~~~~~~~~~-~~~~g~--afV~f~~~~~a~~a~~~l~g~~~~~~ 151 (305)
..-+||-+|++.+..+++ .++|.+||.|..|.+.+...+ ....+. .||+|.+.|+|.+||...+|..+ .
T Consensus 114 KNLvYVigi~pkva~Ee~~~vLk~~eyFGQyGkI~KIvvNkkt~s~nst~~h~gvYITy~~kedAarcIa~vDgs~~--D 191 (480)
T COG5175 114 KNLVYVIGIPPKVADEEVAPVLKRHEYFGQYGKIKKIVVNKKTSSLNSTASHAGVYITYSTKEDAARCIAEVDGSLL--D 191 (480)
T ss_pred cceeEEecCCCCCCcccccccccchhhhhhccceeEEEecccccccccccccceEEEEecchHHHHHHHHHhccccc--c
Confidence 356899999988877763 368999999998877664311 222222 39999999999999999999999 5
Q ss_pred CcceeeeccCCCCCC----CCCCCCCceEEEcCCCc---CCCHHHHHHHH
Q 021958 152 EQNFRLNWASFGAGE----KRDDTPDHTIFVGDLAA---DVTDYMLQETF 194 (305)
Q Consensus 152 ~~~i~v~~~~~~~~~----~~~~~~~~~l~v~nlp~---~~~~~~l~~~f 194 (305)
|+.++..+...+--. ...-....+.|+..-.. .++.+||...-
T Consensus 192 Gr~lkatYGTTKYCtsYLRn~~CpNp~CMyLHEpg~e~Ds~tK~el~n~q 241 (480)
T COG5175 192 GRVLKATYGTTKYCTSYLRNAVCPNPDCMYLHEPGPEKDSLTKDELCNSQ 241 (480)
T ss_pred CceEeeecCchHHHHHHHcCCCCCCCCeeeecCCCcccccccHHHHhhhh
Confidence 799999888754111 11122334555533222 35667765543
No 157
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=97.86 E-value=1.5e-05 Score=67.94 Aligned_cols=80 Identities=16% Similarity=0.180 Sum_probs=69.2
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCC--eeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEE
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPS--TKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRI 248 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~--i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v 248 (305)
....++|||||-|++|++||.+.+. ..|. +.++++..+..+|.+||||+|...+..+.++.++.|-.+.|.|..-.|
T Consensus 78 Grk~~~YvGNL~W~TTD~DL~~A~~-S~G~~~~~dmKFFENR~NGQSKG~AL~~~~SdAa~Kq~MeiLP~k~iHGQ~P~V 156 (498)
T KOG4849|consen 78 GRKYCCYVGNLLWYTTDADLLKALQ-STGLAQFADMKFFENRTNGQSKGYALLVLNSDAAVKQTMEILPTKTIHGQSPTV 156 (498)
T ss_pred CceEEEEecceeEEeccHHHHHHHH-hhhHHHHhhhhhhhcccCCcccceEEEEecchHHHHHHHHhcccceecCCCCee
Confidence 4457899999999999999999997 6664 667788888889999999999999999999999999999999987666
Q ss_pred ccC
Q 021958 249 GPA 251 (305)
Q Consensus 249 ~~~ 251 (305)
--+
T Consensus 157 ~~~ 159 (498)
T KOG4849|consen 157 LSY 159 (498)
T ss_pred ecc
Confidence 443
No 158
>KOG0128 consensus RNA-binding protein SART3 (RRM superfamily) [RNA processing and modification]
Probab=97.85 E-value=1.3e-06 Score=82.35 Aligned_cols=158 Identities=16% Similarity=0.124 Sum_probs=113.4
Q ss_pred CceEEEcCCCcccCHH-HHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 81 IRTLWIGDLQYWMDET-YLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~-~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
.+...+.++.+..... ..+..|..+|.|+.+++...........+.++++....+++.|... .+..+ ..+...+..
T Consensus 571 ~~e~~s~~v~p~~~~ke~~~~~~k~~~~vekv~~p~~g~k~h~q~~~~~~~s~~~~~esat~p-a~~~~--a~~~~av~~ 647 (881)
T KOG0128|consen 571 RREKESTNVYPEQQKKEIQRRQFKGEGNVEKVNGPKRGFKAHEQPQQQKVQSKHGSAESATVP-AGGAL--ANRSAAVGL 647 (881)
T ss_pred hhhhcccCCCcchhhHHhhHHHhhcccccccccCccccccccccchhhhhhccccchhhcccc-ccccc--CCccccCCC
Confidence 3456666776665544 6778899999999998766322222222789999999999888743 55555 445555554
Q ss_pred cCCCCCCCCCC------CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHH
Q 021958 160 ASFGAGEKRDD------TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAM 233 (305)
Q Consensus 160 ~~~~~~~~~~~------~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~ 233 (305)
+.......... ....++||.||+..+.++||...|. .+|.+..+.+.....+++.+|+|+++|.+.+++.+|+
T Consensus 648 ad~~~~~~~~kvs~n~~R~~~~~fvsnl~~~~~~~dl~~~~~-~~~~~e~vqi~~h~n~~~~rG~~Y~~F~~~~~~~aaV 726 (881)
T KOG0128|consen 648 ADAEEKEENFKVSPNEIRDLIKIFVSNLSPKMSEEDLSERFS-PSGTIEVVQIVIHKNEKRFRGKAYVEFLKPEHAGAAV 726 (881)
T ss_pred CCchhhhhccCcCchHHHHHHHHHHhhcchhhcCchhhhhcC-ccchhhhHHHHHHhhccccccceeeEeecCCchhhhh
Confidence 44332211111 1235799999999999999999999 9999998887766678899999999999999999998
Q ss_pred HHhCCceec
Q 021958 234 TEMNGVFCS 242 (305)
Q Consensus 234 ~~l~~~~~~ 242 (305)
....++.++
T Consensus 727 ~f~d~~~~g 735 (881)
T KOG0128|consen 727 AFRDSCFFG 735 (881)
T ss_pred hhhhhhhhh
Confidence 765554444
No 159
>KOG1996 consensus mRNA splicing factor [RNA processing and modification]
Probab=97.82 E-value=6.1e-05 Score=62.82 Aligned_cols=66 Identities=14% Similarity=0.128 Sum_probs=53.8
Q ss_pred HHHHHHHHhhcCCCeeEEEEEecCCCCC-cccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 187 DYMLQETFRARYPSTKGAKVVIDRLTGR-TKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 187 ~~~l~~~f~~~~G~i~~i~i~~~~~~~~-~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
++++++.++ +||.|.+|.|..+...-. ..---||+|+..++|.+|+-.|||.+|+||.++..|.+-
T Consensus 300 ede~keEce-Kyg~V~~viifeip~~p~deavRiFveF~r~e~aiKA~VdlnGRyFGGr~v~A~Fyn~ 366 (378)
T KOG1996|consen 300 EDETKEECE-KYGKVGNVIIFEIPSQPEDEAVRIFVEFERVESAIKAVVDLNGRYFGGRVVSACFYNL 366 (378)
T ss_pred HHHHHHHHH-hhcceeeEEEEecCCCccchhheeeeeeccHHHHHHHHHhcCCceecceeeeheeccH
Confidence 367888998 999999998887642211 122379999999999999999999999999999888653
No 160
>KOG1995 consensus Conserved Zn-finger protein [General function prediction only]
Probab=97.77 E-value=2.8e-05 Score=66.80 Aligned_cols=84 Identities=15% Similarity=0.237 Sum_probs=73.5
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcE--------EEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCC
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVV--------AVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMP 149 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~--------~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~ 149 (305)
.....+|||.+|+..+++++|.++|.++|.|. .|++-+++.|++.|+-|.|.|++...|+.|+..++++.+.
T Consensus 63 ~s~~~ti~v~g~~d~~~~~~~~~~f~qcg~ikrnK~t~kPki~~y~dkeT~~~KGeatvS~~D~~~akaai~~~agkdf~ 142 (351)
T KOG1995|consen 63 KSDNETIFVWGCPDSVCENDNADFFLQCGVIKRNKRTGKPKIKIYTDKETGAPKGEATVSYEDPPAAKAAIEWFAGKDFC 142 (351)
T ss_pred ccccccceeeccCccchHHHHHHHHhhcceeccCCCCCCcchhccccccccCcCCceeeeecChhhhhhhhhhhcccccc
Confidence 45567899999999999999999999999764 3677788899999999999999999999999999999994
Q ss_pred CCCcceeeeccCCC
Q 021958 150 NGEQNFRLNWASFG 163 (305)
Q Consensus 150 ~~~~~i~v~~~~~~ 163 (305)
+..|+|..+...
T Consensus 143 --gn~ikvs~a~~r 154 (351)
T KOG1995|consen 143 --GNTIKVSLAERR 154 (351)
T ss_pred --CCCchhhhhhhc
Confidence 477888777654
No 161
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=97.68 E-value=0.00011 Score=66.53 Aligned_cols=79 Identities=20% Similarity=0.241 Sum_probs=63.7
Q ss_pred CCCceEEEcCCCcC------CCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee-cc
Q 021958 171 TPDHTIFVGDLAAD------VTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC-ST 243 (305)
Q Consensus 171 ~~~~~l~v~nlp~~------~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~-~g 243 (305)
.-...|+|.|+|.- .-..-|.++|+ ++|.|....+..+.++| .+||.|++|.+..+|..|++.|||+.| .+
T Consensus 56 g~D~vVvv~g~PvV~~~rl~klk~vl~kvfs-k~gk~vn~~~P~~e~gg-tkG~lf~E~~~~~~A~~aVK~l~G~~ldkn 133 (698)
T KOG2314|consen 56 GFDSVVVVDGAPVVGPARLEKLKKVLTKVFS-KAGKIVNMYYPIDEEGG-TKGYLFVEYASMRDAKKAVKSLNGKRLDKN 133 (698)
T ss_pred CcceEEEECCCcccChhHHHHHHHHHHHHHH-hhccccceeeccCccCC-eeeEEEEEecChhhHHHHHHhcccceeccc
Confidence 44678999998863 22345778898 99999999999887555 899999999999999999999999997 45
Q ss_pred eeeEEccC
Q 021958 244 RPMRIGPA 251 (305)
Q Consensus 244 ~~l~v~~~ 251 (305)
++..|..-
T Consensus 134 Htf~v~~f 141 (698)
T KOG2314|consen 134 HTFFVRLF 141 (698)
T ss_pred ceEEeehh
Confidence 66666543
No 162
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=97.67 E-value=0.00011 Score=53.28 Aligned_cols=58 Identities=21% Similarity=0.320 Sum_probs=38.2
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCC
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNG 145 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g 145 (305)
+.|.|.++...++.++|++.|+.||.|..|.+.+.-. -|||.|.+.+.|+.|+..+..
T Consensus 2 ~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G~~------~g~VRf~~~~~A~~a~~~~~~ 59 (105)
T PF08777_consen 2 CILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRGDT------EGYVRFKTPEAAQKALEKLKE 59 (105)
T ss_dssp -EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT-S------EEEEEESS---HHHHHHHHHH
T ss_pred eEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCCCC------EEEEEECCcchHHHHHHHHHh
Confidence 4688899999999999999999999999998876432 699999999999999987643
No 163
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=97.63 E-value=0.00036 Score=49.91 Aligned_cols=78 Identities=12% Similarity=0.158 Sum_probs=52.2
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEE-EEecC------CCCCcccEEEEEeCCHHHHHHHHHHhCCceecce-
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAK-VVIDR------LTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTR- 244 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~-i~~~~------~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~- 244 (305)
...|.|-+.|.. ....|.++|+ +||+|.+.. +..+. .......+-.|+|++..+|.+||.+ ||..++|.
T Consensus 6 ~~wVtVFGfp~~-~~~~Vl~~F~-~~G~Ile~~~~~~~~~~~~~~~~~~~~NWi~I~Y~~~~~A~rAL~~-NG~i~~g~~ 82 (100)
T PF05172_consen 6 ETWVTVFGFPPS-ASNQVLRHFS-SFGTILEHFEVLRSSSGINPYPIPSGGNWIHITYDNPLSAQRALQK-NGTIFSGSL 82 (100)
T ss_dssp CCEEEEE---GG-GHHHHHHHHH-CCS-EECEEGGG----------E-CCTTEEEEEESSHHHHHHHHTT-TTEEETTCE
T ss_pred CeEEEEEccCHH-HHHHHHHHHH-hcceEEEeecccccccccccccCCCCCCEEEEECCCHHHHHHHHHh-CCeEEcCcE
Confidence 467889999998 5566788888 999998764 11100 0012266889999999999999986 99999875
Q ss_pred eeEEccCCC
Q 021958 245 PMRIGPATN 253 (305)
Q Consensus 245 ~l~v~~~~~ 253 (305)
.+-|.++++
T Consensus 83 mvGV~~~~~ 91 (100)
T PF05172_consen 83 MVGVKPCDP 91 (100)
T ss_dssp EEEEEE-HH
T ss_pred EEEEEEcHH
Confidence 555777743
No 164
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=97.58 E-value=0.0002 Score=44.98 Aligned_cols=52 Identities=13% Similarity=0.302 Sum_probs=41.4
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHH
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAM 233 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~ 233 (305)
+.|-|.+.+.+..+ ++..+|. .||+|..+.+. ....+.+|+|.+..+|++|+
T Consensus 2 ~wI~V~Gf~~~~~~-~vl~~F~-~fGeI~~~~~~------~~~~~~~l~y~~~~~ae~Al 53 (53)
T PF14605_consen 2 TWISVSGFPPDLAE-EVLEHFA-SFGEIVDIYVP------ESTNWMYLKYKSRKDAEKAL 53 (53)
T ss_pred cEEEEEeECchHHH-HHHHHHH-hcCCEEEEEcC------CCCcEEEEEECCHHHHHhhC
Confidence 56788888877654 4556888 89999998875 22568999999999999985
No 165
>KOG4210 consensus Nuclear localization sequence binding protein [Transcription]
Probab=97.57 E-value=7.2e-05 Score=64.35 Aligned_cols=82 Identities=20% Similarity=0.490 Sum_probs=70.9
Q ss_pred CCCceEE-EcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceee
Q 021958 79 GEIRTLW-IGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRL 157 (305)
Q Consensus 79 ~~~~~l~-v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v 157 (305)
....++| |++|+.++++++|+..|..+|.|..+++..+..++..+|||||+|.+...+..++.. ....+ .++.+.+
T Consensus 182 ~~s~~~~~~~~~~f~~~~d~~~~~~~~~~~i~~~r~~~~~~s~~~kg~a~~~~~~~~~~~~~~~~-~~~~~--~~~~~~~ 258 (285)
T KOG4210|consen 182 GPSDTIFFVGELDFSLTRDDLKEHFVSSGEITSVRLPTDEESGDSKGFAYVDFSAGNSKKLALND-QTRSI--GGRPLRL 258 (285)
T ss_pred CccccceeecccccccchHHHhhhccCcCcceeeccCCCCCccchhhhhhhhhhhchhHHHHhhc-ccCcc--cCccccc
Confidence 4445556 999999999999999999999999999999999999999999999999999999865 56666 6788888
Q ss_pred eccCCC
Q 021958 158 NWASFG 163 (305)
Q Consensus 158 ~~~~~~ 163 (305)
.+....
T Consensus 259 ~~~~~~ 264 (285)
T KOG4210|consen 259 EEDEPR 264 (285)
T ss_pred ccCCCC
Confidence 777654
No 166
>KOG2202 consensus U2 snRNP splicing factor, small subunit, and related proteins [RNA processing and modification]
Probab=97.46 E-value=5.4e-05 Score=62.17 Aligned_cols=66 Identities=15% Similarity=0.249 Sum_probs=55.5
Q ss_pred HHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCC
Q 021958 188 YMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNK 254 (305)
Q Consensus 188 ~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~ 254 (305)
|||...|+.+||+|+++.|..+. .-.-.|=++|.|...++|++|++.||+..+.|+.|...++.-.
T Consensus 83 Ed~f~E~~~kygEiee~~Vc~Nl-~~hl~GNVYV~f~~Ee~ae~a~~~lnnRw~~G~pi~ae~~pvT 148 (260)
T KOG2202|consen 83 EDVFTELEDKYGEIEELNVCDNL-GDHLVGNVYVKFRSEEDAEAALEDLNNRWYNGRPIHAELSPVT 148 (260)
T ss_pred HHHHHHHHHHhhhhhhhhhhccc-chhhhhhhhhhcccHHHHHHHHHHHcCccccCCcceeeecCcC
Confidence 56777777799999999887664 2344777999999999999999999999999999999887653
No 167
>KOG3152 consensus TBP-binding protein, activator of basal transcription (contains rrm motif) [Transcription]
Probab=97.43 E-value=0.00014 Score=59.72 Aligned_cols=73 Identities=12% Similarity=0.186 Sum_probs=62.1
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCC--------CCcc----cEEEEEeCCHHHHHHHHHHhCCc
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLT--------GRTK----GYGFVRFGDESEQLRAMTEMNGV 239 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~--------~~~~----g~afV~f~~~~~A~~a~~~l~~~ 239 (305)
....||+++||...+-.-|+++|+ .||.|-+|.+.....+ |.++ .-|.|+|.+...|.++...||+.
T Consensus 73 k~GVvylS~IPp~m~~~rlReil~-~yGeVGRvylqpE~~s~~~~r~~~~~n~~~~y~EGWvEF~~KrvAK~iAe~Lnn~ 151 (278)
T KOG3152|consen 73 KTGVVYLSNIPPYMDPVRLREILS-QYGEVGRVYLQPEDDSKRAARKRKGGNYKKLYSEGWVEFISKRVAKRIAELLNNT 151 (278)
T ss_pred cceEEEeccCCCccCHHHHHHHHH-hccccceEEecchhhHHHHHHhhcCCCccccchhHHHHHHHHHHHHHHHHHhCCC
Confidence 467899999999999999999999 9999999998765443 2222 23679999999999999999999
Q ss_pred eeccee
Q 021958 240 FCSTRP 245 (305)
Q Consensus 240 ~~~g~~ 245 (305)
.|+|++
T Consensus 152 ~Iggkk 157 (278)
T KOG3152|consen 152 PIGGKK 157 (278)
T ss_pred ccCCCC
Confidence 999875
No 168
>KOG2416 consensus Acinus (induces apoptotic chromatin condensation) [Chromatin structure and dynamics]
Probab=97.35 E-value=0.00014 Score=66.34 Aligned_cols=80 Identities=13% Similarity=0.156 Sum_probs=67.1
Q ss_pred CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee---cceee
Q 021958 170 DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC---STRPM 246 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~---~g~~l 246 (305)
...++.|+|.||=.-+|.-+|+.++...+|.|++..| | +-+..|||.|.+.++|.+...+|||..+ +++.|
T Consensus 441 ~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~Wm--D----kIKShCyV~yss~eEA~atr~AlhnV~WP~sNPK~L 514 (718)
T KOG2416|consen 441 KEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEFWM--D----KIKSHCYVSYSSVEEAAATREALHNVQWPPSNPKHL 514 (718)
T ss_pred CCccceEeeecccccchHHHHHHHHhhccCchHHHHH--H----HhhcceeEecccHHHHHHHHHHHhccccCCCCCcee
Confidence 3567899999999999999999999867777877733 2 2266899999999999999999999885 77999
Q ss_pred EEccCCCCC
Q 021958 247 RIGPATNKK 255 (305)
Q Consensus 247 ~v~~~~~~~ 255 (305)
.+.|.....
T Consensus 515 ~adf~~~de 523 (718)
T KOG2416|consen 515 IADFVRADE 523 (718)
T ss_pred EeeecchhH
Confidence 999987643
No 169
>KOG0129 consensus Predicted RNA-binding protein (RRM superfamily) [Translation, ribosomal structure and biogenesis]
Probab=97.35 E-value=0.0022 Score=57.91 Aligned_cols=65 Identities=25% Similarity=0.371 Sum_probs=50.0
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecC---CCCCccc---EEEEEeCCHHHHHHHHHHhC
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDR---LTGRTKG---YGFVRFGDESEQLRAMTEMN 237 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~---~~~~~~g---~afV~f~~~~~A~~a~~~l~ 237 (305)
.-+++||||+||++++|+.|...|. .||.+. +...... ..--.+| |+|+-|+++.+....+.+..
T Consensus 257 ~~S~KVFvGGlp~dise~~i~~~F~-~FGs~~-VdWP~k~~~~~~~ppkGs~~YvflvFe~E~sV~~Ll~aC~ 327 (520)
T KOG0129|consen 257 RYSRKVFVGGLPWDITEAQINASFG-QFGSVK-VDWPGKANSRGRAPPKGSYGYVFLVFEDERSVQSLLSACS 327 (520)
T ss_pred ccccceeecCCCccccHHHHHhhcc-cccceE-eecCCCccccccCCCCCcccEEEEEecchHHHHHHHHHHh
Confidence 4578999999999999999999999 999976 4444211 1113456 99999999998887776554
No 170
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=97.33 E-value=0.00055 Score=42.99 Aligned_cols=52 Identities=21% Similarity=0.315 Sum_probs=41.2
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHH
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVL 140 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~ 140 (305)
+.|-|.+.+.+.. +++...|..||+|..+.+-.. ..+.||.|.+..+|++|+
T Consensus 2 ~wI~V~Gf~~~~~-~~vl~~F~~fGeI~~~~~~~~------~~~~~l~y~~~~~ae~Al 53 (53)
T PF14605_consen 2 TWISVSGFPPDLA-EEVLEHFASFGEIVDIYVPES------TNWMYLKYKSRKDAEKAL 53 (53)
T ss_pred cEEEEEeECchHH-HHHHHHHHhcCCEEEEEcCCC------CcEEEEEECCHHHHHhhC
Confidence 5678888887666 456669999999999877522 238999999999999985
No 171
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=97.30 E-value=0.00094 Score=60.78 Aligned_cols=78 Identities=19% Similarity=0.330 Sum_probs=60.3
Q ss_pred CCCCceEEEcCCC--cccCHHHH----hHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCC
Q 021958 78 PGEIRTLWIGDLQ--YWMDETYL----NTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNG 151 (305)
Q Consensus 78 ~~~~~~l~v~nLp--~~~t~~~l----~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~ 151 (305)
+.....|+|.|+| .....+.| ..+|+++|.+....+..+.. |..+||.|++|.+..+|+.|++.|||..++ +
T Consensus 55 eg~D~vVvv~g~PvV~~~rl~klk~vl~kvfsk~gk~vn~~~P~~e~-ggtkG~lf~E~~~~~~A~~aVK~l~G~~ld-k 132 (698)
T KOG2314|consen 55 EGFDSVVVVDGAPVVGPARLEKLKKVLTKVFSKAGKIVNMYYPIDEE-GGTKGYLFVEYASMRDAKKAVKSLNGKRLD-K 132 (698)
T ss_pred CCcceEEEECCCcccChhHHHHHHHHHHHHHHhhccccceeeccCcc-CCeeeEEEEEecChhhHHHHHHhcccceec-c
Confidence 3556778999998 33334444 45689999999988887765 559999999999999999999999999986 3
Q ss_pred Ccceee
Q 021958 152 EQNFRL 157 (305)
Q Consensus 152 ~~~i~v 157 (305)
...+.+
T Consensus 133 nHtf~v 138 (698)
T KOG2314|consen 133 NHTFFV 138 (698)
T ss_pred cceEEe
Confidence 444443
No 172
>KOG3152 consensus TBP-binding protein, activator of basal transcription (contains rrm motif) [Transcription]
Probab=97.29 E-value=0.00026 Score=58.09 Aligned_cols=71 Identities=25% Similarity=0.442 Sum_probs=59.6
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCC--------CCccc----eEEEEEcCHHHHHHHHHHhCCCC
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQT--------GQIEG----YGFIEFISRAGAERVLQTFNGTP 147 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~--------~~~~g----~afV~f~~~~~a~~a~~~l~g~~ 147 (305)
..-.||+++||+.++...|+++|+.||.|-.|.+.....+ |.+++ -|+|+|.+...|..+...||+..
T Consensus 73 k~GVvylS~IPp~m~~~rlReil~~yGeVGRvylqpE~~s~~~~r~~~~~n~~~~y~EGWvEF~~KrvAK~iAe~Lnn~~ 152 (278)
T KOG3152|consen 73 KTGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQPEDDSKRAARKRKGGNYKKLYSEGWVEFISKRVAKRIAELLNNTP 152 (278)
T ss_pred cceEEEeccCCCccCHHHHHHHHHhccccceEEecchhhHHHHHHhhcCCCccccchhHHHHHHHHHHHHHHHHHhCCCc
Confidence 3457999999999999999999999999999988776544 22222 36799999999999999999999
Q ss_pred CCC
Q 021958 148 MPN 150 (305)
Q Consensus 148 ~~~ 150 (305)
|.|
T Consensus 153 Igg 155 (278)
T KOG3152|consen 153 IGG 155 (278)
T ss_pred cCC
Confidence 954
No 173
>PF08952 DUF1866: Domain of unknown function (DUF1866) ; InterPro: IPR015047 This domain, found in synaptojanin, has no known function. ; PDB: 1UFW_A 2DNR_A.
Probab=97.09 E-value=0.004 Score=47.37 Aligned_cols=79 Identities=24% Similarity=0.322 Sum_probs=52.2
Q ss_pred CCCCceEEEcCCC------cCCCH---HHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCce
Q 021958 170 DTPDHTIFVGDLA------ADVTD---YMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVF 240 (305)
Q Consensus 170 ~~~~~~l~v~nlp------~~~~~---~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~ 240 (305)
..+..||.|.-+. ..+.+ .+|.+.|. .||++.-+++.-+ .-.|+|.+-++|.+|+. ++|..
T Consensus 24 GPpDaTVvVsv~~~~~~e~~~Fdd~l~~~ll~~~~-~~GevvLvRfv~~--------~mwVTF~dg~sALaals-~dg~~ 93 (146)
T PF08952_consen 24 GPPDATVVVSVDSPSEDEDSSFDDNLMDELLQKFA-QYGEVVLVRFVGD--------TMWVTFRDGQSALAALS-LDGIQ 93 (146)
T ss_dssp --TT-EEEEEECS-SCCCCHS--HHHHHHHHHHHH-CCS-ECEEEEETT--------CEEEEESSCHHHHHHHH-GCCSE
T ss_pred CCCCceEEEEecCCCccccCcCCHHHHHHHHHHHH-hCCceEEEEEeCC--------eEEEEECccHHHHHHHc-cCCcE
Confidence 3456777776555 23333 36778888 8999887877643 46899999999999997 69999
Q ss_pred ecceeeEEccCCCCCCCC
Q 021958 241 CSTRPMRIGPATNKKTVS 258 (305)
Q Consensus 241 ~~g~~l~v~~~~~~~~~~ 258 (305)
++|+.|+|+..++.....
T Consensus 94 v~g~~l~i~LKtpdW~~~ 111 (146)
T PF08952_consen 94 VNGRTLKIRLKTPDWLKG 111 (146)
T ss_dssp ETTEEEEEEE--------
T ss_pred ECCEEEEEEeCCccHHHH
Confidence 999999999988876654
No 174
>KOG1855 consensus Predicted RNA-binding protein [General function prediction only]
Probab=97.07 E-value=0.00069 Score=59.65 Aligned_cols=69 Identities=14% Similarity=0.229 Sum_probs=57.4
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEee---CCC--CC--------ccceEEEEEcCHHHHHHHHHHhC
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRN---KQT--GQ--------IEGYGFIEFISRAGAERVLQTFN 144 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~---~~~--~~--------~~g~afV~f~~~~~a~~a~~~l~ 144 (305)
.-.+++|.+-|||.+-.-+.|.++|..+|.|..|+|..- ..+ +. .+-+|+|+|+..+.|.+|.+.++
T Consensus 228 el~srtivaenLP~Dh~~enl~kiFg~~G~IksIRIckPgaip~d~r~~~~~~~~~~tk~~AlvEye~~~~A~KA~e~~~ 307 (484)
T KOG1855|consen 228 ELPSRTIVAENLPLDHSYENLSKIFGTVGSIKSIRICKPGAIPEDVRGFPKKYFELQTKECALVEYEEVEAARKARELLN 307 (484)
T ss_pred ccccceEEEecCCcchHHHHHHHHhhcccceeeeeecCCCCCCcccccCCccchhhhhhhhhhhhhhhhHHHHHHHHhhc
Confidence 347899999999999999999999999999999999875 222 21 24589999999999999998774
Q ss_pred CC
Q 021958 145 GT 146 (305)
Q Consensus 145 g~ 146 (305)
..
T Consensus 308 ~e 309 (484)
T KOG1855|consen 308 PE 309 (484)
T ss_pred hh
Confidence 43
No 175
>KOG2253 consensus U1 snRNP complex, subunit SNU71 and related PWI-motif proteins [RNA processing and modification]
Probab=97.00 E-value=0.00018 Score=66.63 Aligned_cols=144 Identities=11% Similarity=0.210 Sum_probs=91.2
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeecc
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWA 160 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~ 160 (305)
..+|||+|+-..+..+.++.++..+|.|.+++... |||++|.....+.+|+..++-..+.+.+....++..
T Consensus 40 ~~~vfv~~~~~~~s~~~~~~il~~~g~v~s~kr~~---------fgf~~f~~~~~~~ra~r~~t~~~~~~~kl~~~~d~q 110 (668)
T KOG2253|consen 40 RDTVFVGNISYLVSQEFWKSILAKSGFVPSWKRDK---------FGFCEFLKHIGDLRASRLLTELNIDDQKLIENVDEQ 110 (668)
T ss_pred CceeEecchhhhhhHHHHHHHHhhCCcchhhhhhh---------hcccchhhHHHHHHHHHHhcccCCCcchhhccchhh
Confidence 46899999999999999999999999988876543 899999999999999988888888665555555422
Q ss_pred CCCCCCC------------CCCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHH
Q 021958 161 SFGAGEK------------RDDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESE 228 (305)
Q Consensus 161 ~~~~~~~------------~~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~ 228 (305)
......+ ......+-.+|+|++..+.+....+.|. --+.....+-..+. .+..+.++|-+|.+...
T Consensus 111 ~~~n~~k~~~~~~~~~~~f~p~~srr~e~i~~k~~~l~~~~~~~~~~-is~s~~s~~~~~e~-d~h~~e~~~~~~~s~~~ 188 (668)
T KOG2253|consen 111 TIENADKEKSIANKESHKFVPSSSRRQESIQNKPLSLDEQIHKKSLQ-ISSSAASRRQIAEA-DDHCLELEKTETESNSA 188 (668)
T ss_pred hhcCccccccchhhhhcccCCchhHHHHHhhccccchhHHHHHHHHh-ccchhhhhhhhHHH-HHHHHHHHHhhcccccc
Confidence 2211111 1111245577788887777766666665 33333332222221 22223344555554444
Q ss_pred HHHHHHH
Q 021958 229 QLRAMTE 235 (305)
Q Consensus 229 A~~a~~~ 235 (305)
+..+...
T Consensus 189 ~~~~~~~ 195 (668)
T KOG2253|consen 189 LSKEAES 195 (668)
T ss_pred cCccccc
Confidence 4444333
No 176
>PF08675 RNA_bind: RNA binding domain; InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=96.97 E-value=0.0055 Score=41.67 Aligned_cols=56 Identities=13% Similarity=0.223 Sum_probs=42.6
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCC
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNG 238 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~ 238 (305)
....+|. +|.++...||.++|+ .||.|. |..+.|. -|||...+.+.|..|+..+..
T Consensus 9 dHVFhlt-FPkeWK~~DI~qlFs-pfG~I~-VsWi~dT-------SAfV~l~~r~~~~~v~~~~~~ 64 (87)
T PF08675_consen 9 DHVFHLT-FPKEWKTSDIYQLFS-PFGQIY-VSWINDT-------SAFVALHNRDQAKVVMNTLKK 64 (87)
T ss_dssp CCEEEEE---TT--HHHHHHHCC-CCCCEE-EEEECTT-------EEEEEECCCHHHHHHHHHHTT
T ss_pred ceEEEEe-CchHhhhhhHHHHhc-cCCcEE-EEEEcCC-------cEEEEeecHHHHHHHHHHhcc
Confidence 3455665 999999999999999 999986 6666552 599999999999999988763
No 177
>KOG0112 consensus Large RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=96.80 E-value=0.00049 Score=65.83 Aligned_cols=81 Identities=19% Similarity=0.302 Sum_probs=69.7
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
..+.+||++||+..+++.+|+..|. .+|.|.+|.|..-. -+..--||||.|.+...+-.|+..+.+..|..-.+++.+
T Consensus 370 ~atrTLf~Gnl~~kl~eseiR~af~-e~gkve~VDiKtP~-~~~esa~~f~~~~n~dmtp~ak~e~s~~~I~~g~~r~gl 447 (975)
T KOG0112|consen 370 RATRTLFLGNLDSKLTESEIRPAFD-ESGKVEEVDIKTPH-IKTESAYAFVSLLNTDMTPSAKFEESGPLIGNGTHRIGL 447 (975)
T ss_pred hhhhhhhhcCcccchhhhhhhhhhh-hhccccccccccCC-CCcccchhhhhhhccccCcccchhhcCCccccCcccccc
Confidence 4678999999999999999999999 99999999997663 233355899999999999999999999998777777777
Q ss_pred CCC
Q 021958 251 ATN 253 (305)
Q Consensus 251 ~~~ 253 (305)
...
T Consensus 448 G~~ 450 (975)
T KOG0112|consen 448 GQP 450 (975)
T ss_pred ccc
Confidence 654
No 178
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=96.77 E-value=0.0044 Score=44.37 Aligned_cols=79 Identities=15% Similarity=0.104 Sum_probs=49.2
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEE-EEeeC------CCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCC
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVK-VIRNK------QTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNG 151 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~-~~~~~------~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~ 151 (305)
...+-|.|-+.|.. ....|.+.|++||.|.+.. +.++. .......+-.|+|.+..+|.+||.+ ||..+.+
T Consensus 4 ~~~~wVtVFGfp~~-~~~~Vl~~F~~~G~Ile~~~~~~~~~~~~~~~~~~~~NWi~I~Y~~~~~A~rAL~~-NG~i~~g- 80 (100)
T PF05172_consen 4 DSETWVTVFGFPPS-ASNQVLRHFSSFGTILEHFEVLRSSSGINPYPIPSGGNWIHITYDNPLSAQRALQK-NGTIFSG- 80 (100)
T ss_dssp GGCCEEEEE---GG-GHHHHHHHHHCCS-EECEEGGG----------E-CCTTEEEEEESSHHHHHHHHTT-TTEEETT-
T ss_pred cCCeEEEEEccCHH-HHHHHHHHHHhcceEEEeecccccccccccccCCCCCCEEEEECCCHHHHHHHHHh-CCeEEcC-
Confidence 34566888899888 5577889999999987764 10100 0012234889999999999999965 9988843
Q ss_pred Ccceeeecc
Q 021958 152 EQNFRLNWA 160 (305)
Q Consensus 152 ~~~i~v~~~ 160 (305)
...+-|.+.
T Consensus 81 ~~mvGV~~~ 89 (100)
T PF05172_consen 81 SLMVGVKPC 89 (100)
T ss_dssp CEEEEEEE-
T ss_pred cEEEEEEEc
Confidence 233444443
No 179
>PF08675 RNA_bind: RNA binding domain; InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=96.76 E-value=0.0069 Score=41.20 Aligned_cols=54 Identities=17% Similarity=0.249 Sum_probs=41.4
Q ss_pred eEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhC
Q 021958 83 TLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFN 144 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~ 144 (305)
.||--..|......||.++|+.||.| .|..+-|. -|||.+.+.+.|..++..+.
T Consensus 10 HVFhltFPkeWK~~DI~qlFspfG~I-~VsWi~dT-------SAfV~l~~r~~~~~v~~~~~ 63 (87)
T PF08675_consen 10 HVFHLTFPKEWKTSDIYQLFSPFGQI-YVSWINDT-------SAFVALHNRDQAKVVMNTLK 63 (87)
T ss_dssp CEEEEE--TT--HHHHHHHCCCCCCE-EEEEECTT-------EEEEEECCCHHHHHHHHHHT
T ss_pred eEEEEeCchHhhhhhHHHHhccCCcE-EEEEEcCC-------cEEEEeecHHHHHHHHHHhc
Confidence 34444499999999999999999997 66777664 69999999999999998775
No 180
>PF10309 DUF2414: Protein of unknown function (DUF2414); InterPro: IPR019416 This entry contains proteins that have no known function.
Probab=96.65 E-value=0.014 Score=37.68 Aligned_cols=53 Identities=15% Similarity=0.237 Sum_probs=44.3
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhc----CCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHh
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHT----GEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTF 143 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~----G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 143 (305)
.+|+|.|+. +++.++|+.+|..| + ...+.++-|. -|-|.|.+.+.|.+|+..|
T Consensus 6 eavhirGvd-~lsT~dI~~y~~~y~~~~~-~~~IEWIdDt-------ScNvvf~d~~~A~~AL~~L 62 (62)
T PF10309_consen 6 EAVHIRGVD-ELSTDDIKAYFSEYFDEEG-PFRIEWIDDT-------SCNVVFKDEETAARALVAL 62 (62)
T ss_pred ceEEEEcCC-CCCHHHHHHHHHHhcccCC-CceEEEecCC-------cEEEEECCHHHHHHHHHcC
Confidence 579999995 47789999999999 4 4578888775 5999999999999999754
No 181
>PF07576 BRAP2: BRCA1-associated protein 2; InterPro: IPR011422 These proteins include BRCA1-associated protein 2 (BRAP2), which binds nuclear localisation signals (NLSs) in vitro and in yeast two-hybrid screening []. These proteins share a region of sequence similarity at their N terminus. They also have IPR001607 from INTERPRO at the C terminus.
Probab=96.61 E-value=0.029 Score=40.92 Aligned_cols=67 Identities=19% Similarity=0.263 Sum_probs=50.4
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCC
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMP 149 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~ 149 (305)
...+.+...+..++.++|..+.+.+-. |..+++++|. ..++-.+++.|.+.++|+...+.+||+.+.
T Consensus 13 ~~~~~l~vp~~~~~~d~l~~f~~~~~~~i~~~riird~--~pnrymVLikF~~~~~Ad~Fy~~fNGk~Fn 80 (110)
T PF07576_consen 13 STLCCLAVPPYMTPSDFLLFFGAPFREDIEHIRIIRDG--TPNRYMVLIKFRDQESADEFYEEFNGKPFN 80 (110)
T ss_pred ceEEEEEeCcccccHHHHHHhhhcccccEEEEEEeeCC--CCceEEEEEEECCHHHHHHHHHHhCCCccC
Confidence 344555556667777777766666544 7788888874 335567899999999999999999999884
No 182
>PF10309 DUF2414: Protein of unknown function (DUF2414); InterPro: IPR019416 This entry contains proteins that have no known function.
Probab=96.59 E-value=0.016 Score=37.39 Aligned_cols=55 Identities=16% Similarity=0.239 Sum_probs=44.1
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcC---CCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARY---PSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~---G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
...|+|.|++. ++.+||+.+|. .| ....+|..+-|. -|-|.|.+.+.|.+||.+|
T Consensus 5 peavhirGvd~-lsT~dI~~y~~-~y~~~~~~~~IEWIdDt-------ScNvvf~d~~~A~~AL~~L 62 (62)
T PF10309_consen 5 PEAVHIRGVDE-LSTDDIKAYFS-EYFDEEGPFRIEWIDDT-------SCNVVFKDEETAARALVAL 62 (62)
T ss_pred eceEEEEcCCC-CCHHHHHHHHH-HhcccCCCceEEEecCC-------cEEEEECCHHHHHHHHHcC
Confidence 46799999854 78889999998 77 235568888764 3779999999999999864
No 183
>PF07576 BRAP2: BRCA1-associated protein 2; InterPro: IPR011422 These proteins include BRCA1-associated protein 2 (BRAP2), which binds nuclear localisation signals (NLSs) in vitro and in yeast two-hybrid screening []. These proteins share a region of sequence similarity at their N terminus. They also have IPR001607 from INTERPRO at the C terminus.
Probab=96.45 E-value=0.05 Score=39.70 Aligned_cols=69 Identities=10% Similarity=0.015 Sum_probs=50.4
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecc
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCST 243 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g 243 (305)
...+.+...+..++-++|..+.+.--..|..++|++|. ..++-.++++|.+.++|..-....||+.++.
T Consensus 13 ~~~~~l~vp~~~~~~d~l~~f~~~~~~~i~~~riird~--~pnrymVLikF~~~~~Ad~Fy~~fNGk~Fns 81 (110)
T PF07576_consen 13 STLCCLAVPPYMTPSDFLLFFGAPFREDIEHIRIIRDG--TPNRYMVLIKFRDQESADEFYEEFNGKPFNS 81 (110)
T ss_pred ceEEEEEeCcccccHHHHHHhhhcccccEEEEEEeeCC--CCceEEEEEEECCHHHHHHHHHHhCCCccCC
Confidence 34444555566666677776666233447788898884 3357789999999999999999999998643
No 184
>KOG2193 consensus IGF-II mRNA-binding protein IMP, contains RRM and KH domains [RNA processing and modification; General function prediction only]
Probab=96.44 E-value=0.0021 Score=56.53 Aligned_cols=78 Identities=22% Similarity=0.391 Sum_probs=60.9
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCce-ecceeeEEccCC
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVF-CSTRPMRIGPAT 252 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~-~~g~~l~v~~~~ 252 (305)
+++|++||...++-.||..+|.+.--.... .++. -.||+||.+.+...|.+|++.++|+. +.|.++.|.++-
T Consensus 2 nklyignL~p~~~psdl~svfg~ak~~~~g-~fl~------k~gyafvd~pdq~wa~kaie~~sgk~elqGkr~e~~~sv 74 (584)
T KOG2193|consen 2 NKLYIGNLSPQVTPSDLESVFGDAKIPGSG-QFLV------KSGYAFVDCPDQQWANKAIETLSGKVELQGKRQEVEHSV 74 (584)
T ss_pred CcccccccCCCCChHHHHHHhccccCCCCc-ceee------ecceeeccCCchhhhhhhHHhhchhhhhcCceeeccchh
Confidence 468999999999999999999732111111 1111 16799999999999999999999876 899999999988
Q ss_pred CCCCCC
Q 021958 253 NKKTVS 258 (305)
Q Consensus 253 ~~~~~~ 258 (305)
++..+.
T Consensus 75 ~kkqrs 80 (584)
T KOG2193|consen 75 PKKQRS 80 (584)
T ss_pred hHHHHh
Confidence 766553
No 185
>KOG2416 consensus Acinus (induces apoptotic chromatin condensation) [Chromatin structure and dynamics]
Probab=96.42 E-value=0.003 Score=57.97 Aligned_cols=80 Identities=13% Similarity=0.161 Sum_probs=63.6
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhh-hcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCC-CCCcc
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFA-HTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMP-NGEQN 154 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~-~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~-~~~~~ 154 (305)
-...++.|||.||-.-.|.-+|++++. .+|.|.+.+|= +-+..|||.|.+.++|.+.+.+|+|..+. ++.+.
T Consensus 440 R~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~WmD------kIKShCyV~yss~eEA~atr~AlhnV~WP~sNPK~ 513 (718)
T KOG2416|consen 440 RKEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEFWMD------KIKSHCYVSYSSVEEAAATREALHNVQWPPSNPKH 513 (718)
T ss_pred CCCccceEeeecccccchHHHHHHHHhhccCchHHHHHH------HhhcceeEecccHHHHHHHHHHHhccccCCCCCce
Confidence 345678999999999999999999999 56667776442 23447999999999999999999998775 45666
Q ss_pred eeeeccCC
Q 021958 155 FRLNWASF 162 (305)
Q Consensus 155 i~v~~~~~ 162 (305)
|.+.|...
T Consensus 514 L~adf~~~ 521 (718)
T KOG2416|consen 514 LIADFVRA 521 (718)
T ss_pred eEeeecch
Confidence 77777654
No 186
>KOG2202 consensus U2 snRNP splicing factor, small subunit, and related proteins [RNA processing and modification]
Probab=96.41 E-value=0.0019 Score=53.31 Aligned_cols=55 Identities=20% Similarity=0.307 Sum_probs=45.4
Q ss_pred hcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 104 HTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 104 ~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
+||+|+++.+..+. .-.-.|-+||.|..+++|++|++.||+..+ .|++|...+..
T Consensus 92 kygEiee~~Vc~Nl-~~hl~GNVYV~f~~Ee~ae~a~~~lnnRw~--~G~pi~ae~~p 146 (260)
T KOG2202|consen 92 KYGEIEELNVCDNL-GDHLVGNVYVKFRSEEDAEAALEDLNNRWY--NGRPIHAELSP 146 (260)
T ss_pred Hhhhhhhhhhhccc-chhhhhhhhhhcccHHHHHHHHHHHcCccc--cCCcceeeecC
Confidence 89999998777654 235567899999999999999999999999 66777776554
No 187
>KOG1996 consensus mRNA splicing factor [RNA processing and modification]
Probab=96.30 E-value=0.012 Score=49.42 Aligned_cols=65 Identities=18% Similarity=0.154 Sum_probs=50.6
Q ss_pred HHHHhHhhhhcCCcEEEEEEeeCCCCCcc-ceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 95 ETYLNTCFAHTGEVVAVKVIRNKQTGQIE-GYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 95 ~~~l~~~f~~~G~v~~~~~~~~~~~~~~~-g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
++++.+..++||.|..|.|..+...-... ---||+|...++|.+|+-.|||+.+ +|+.++..+-.
T Consensus 300 ede~keEceKyg~V~~viifeip~~p~deavRiFveF~r~e~aiKA~VdlnGRyF--GGr~v~A~Fyn 365 (378)
T KOG1996|consen 300 EDETKEECEKYGKVGNVIIFEIPSQPEDEAVRIFVEFERVESAIKAVVDLNGRYF--GGRVVSACFYN 365 (378)
T ss_pred HHHHHHHHHhhcceeeEEEEecCCCccchhheeeeeeccHHHHHHHHHhcCCcee--cceeeeheecc
Confidence 45666778999999999888775332222 2379999999999999999999999 78888876543
No 188
>PF15023 DUF4523: Protein of unknown function (DUF4523)
Probab=96.21 E-value=0.016 Score=43.54 Aligned_cols=75 Identities=13% Similarity=0.169 Sum_probs=56.5
Q ss_pred CCCCceEEEcCCCcCC----CHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceeccee
Q 021958 170 DTPDHTIFVGDLAADV----TDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRP 245 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~----~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~ 245 (305)
+.+-.+|.|+=|..+. +...+.+.++ .||.|.+|... | +.-|.|.|+|..+|-+|+.+++. ..-|..
T Consensus 83 epPMsTIVVRWlkknm~~~edl~sV~~~Ls-~fGpI~SVT~c-----G--rqsavVvF~d~~SAC~Av~Af~s-~~pgtm 153 (166)
T PF15023_consen 83 EPPMSTIVVRWLKKNMQPTEDLKSVIQRLS-VFGPIQSVTLC-----G--RQSAVVVFKDITSACKAVSAFQS-RAPGTM 153 (166)
T ss_pred CCCceeEEeehhhhcCChHHHHHHHHHHHH-hcCCcceeeec-----C--CceEEEEehhhHHHHHHHHhhcC-CCCCce
Confidence 4556778887665543 3344556677 99999999874 3 44699999999999999999875 677888
Q ss_pred eEEccCCC
Q 021958 246 MRIGPATN 253 (305)
Q Consensus 246 l~v~~~~~ 253 (305)
+.++|-..
T Consensus 154 ~qCsWqqr 161 (166)
T PF15023_consen 154 FQCSWQQR 161 (166)
T ss_pred EEeecccc
Confidence 89988543
No 189
>KOG0115 consensus RNA-binding protein p54nrb (RRM superfamily) [RNA processing and modification]
Probab=96.13 E-value=0.024 Score=46.89 Aligned_cols=61 Identities=16% Similarity=0.285 Sum_probs=54.4
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHh
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTF 143 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 143 (305)
..|||.||..-++.+.+.+.|+.||.|....++.|- .++..+-++|+|...-.|.+|+..+
T Consensus 32 a~l~V~nl~~~~sndll~~~f~~fg~~e~av~~vD~-r~k~t~eg~v~~~~k~~a~~a~rr~ 92 (275)
T KOG0115|consen 32 AELYVVNLMQGASNDLLEQAFRRFGPIERAVAKVDD-RGKPTREGIVEFAKKPNARKAARRC 92 (275)
T ss_pred ceEEEEecchhhhhHHHHHhhhhcCccchheeeecc-cccccccchhhhhcchhHHHHHHHh
Confidence 679999999999999999999999999887777763 5788888999999999999999876
No 190
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=96.01 E-value=0.019 Score=46.06 Aligned_cols=63 Identities=16% Similarity=0.074 Sum_probs=46.7
Q ss_pred CHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhC--CceecceeeEEccCCCCC
Q 021958 186 TDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMN--GVFCSTRPMRIGPATNKK 255 (305)
Q Consensus 186 ~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~--~~~~~g~~l~v~~~~~~~ 255 (305)
..+.|+++|. .|+.+..+.++.. -+-..|.|.+.++|.+|...|+ +..+.|..++|.|+....
T Consensus 8 ~~~~l~~l~~-~~~~~~~~~~L~s------FrRi~v~f~~~~~A~~~r~~l~~~~~~~~g~~l~~yf~~~~~ 72 (184)
T PF04847_consen 8 NLAELEELFS-TYDPPVQFSPLKS------FRRIRVVFESPESAQRARQLLHWDGTSFNGKRLRVYFGQPTP 72 (184)
T ss_dssp -HHHHHHHHH-TT-SS-EEEEETT------TTEEEEE-SSTTHHHHHHHTST--TSEETTEE-EEE----SS
T ss_pred hHHHHHHHHH-hcCCceEEEEcCC------CCEEEEEeCCHHHHHHHHHHhcccccccCCCceEEEEccccc
Confidence 4578999998 9999988877754 4458899999999999999999 999999999999985443
No 191
>KOG2591 consensus c-Mpl binding protein, contains La domain [Signal transduction mechanisms]
Probab=95.79 E-value=0.035 Score=50.84 Aligned_cols=97 Identities=9% Similarity=0.184 Sum_probs=66.5
Q ss_pred CHHHHHHHHHHhCCCCCCCCCcceeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHhh-cCCCeeEEEEEecC
Q 021958 132 SRAGAERVLQTFNGTPMPNGEQNFRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFRA-RYPSTKGAKVVIDR 210 (305)
Q Consensus 132 ~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~~-~~G~i~~i~i~~~~ 210 (305)
|.+-...++++.-+..++.++..++-. ...+.|.|+.|+..+-+|+++.+|+. -+-.+.+|.+..+.
T Consensus 146 DvdLI~Evlresp~VqvDekgekVrp~------------~kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa~N~ 213 (684)
T KOG2591|consen 146 DVDLIVEVLRESPNVQVDEKGEKVRPN------------HKRCIVILREIPETTPIEVVKALFKGENCPKVISCEFAHND 213 (684)
T ss_pred chHHHHHHHhcCCCceeccCccccccC------------cceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeeeecC
Confidence 444455566666666665544444433 23578889999999999999999972 25667778776652
Q ss_pred CCCCcccEEEEEeCCHHHHHHHHHHhCC--ceecceeeE
Q 021958 211 LTGRTKGYGFVRFGDESEQLRAMTEMNG--VFCSTRPMR 247 (305)
Q Consensus 211 ~~~~~~g~afV~f~~~~~A~~a~~~l~~--~~~~g~~l~ 247 (305)
+ =||+|++..||..|.+.|.. +.|-|+.|.
T Consensus 214 --n-----WyITfesd~DAQqAykylreevk~fqgKpIm 245 (684)
T KOG2591|consen 214 --N-----WYITFESDTDAQQAYKYLREEVKTFQGKPIM 245 (684)
T ss_pred --c-----eEEEeecchhHHHHHHHHHHHHHhhcCcchh
Confidence 2 48999999999999877653 225555443
No 192
>KOG2068 consensus MOT2 transcription factor [Transcription]
Probab=95.75 E-value=0.0051 Score=52.88 Aligned_cols=83 Identities=17% Similarity=0.267 Sum_probs=64.2
Q ss_pred ceEEEcCCCcCCCHHHHH---HHHhhcCCCeeEEEEEecCC--CCC-cccEEEEEeCCHHHHHHHHHHhCCceecceeeE
Q 021958 174 HTIFVGDLAADVTDYMLQ---ETFRARYPSTKGAKVVIDRL--TGR-TKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMR 247 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~---~~f~~~~G~i~~i~i~~~~~--~~~-~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~ 247 (305)
.-+||-+|+..+.++++. +.|. +||.|..|.+..+.. .+. ...-++|+|...++|..||...+|..++|+.++
T Consensus 78 nlvyvvgl~~~~ade~~l~~~eyfg-qygki~ki~~~~~~S~~s~~~~~~s~yITy~~~eda~rci~~v~g~~~dg~~lk 156 (327)
T KOG2068|consen 78 NLVYVVGLPLDLADESVLERTEYFG-QYGKINKIVKNKDPSSSSSSGGTCSVYITYEEEEDADRCIDDVDGFVDDGRALK 156 (327)
T ss_pred hhhhhhCCCccccchhhhhCccccc-ccccceEEeecCCcccccCCCCCCcccccccchHhhhhHHHHhhhHHhhhhhhH
Confidence 557788888877666554 4577 889999888877652 111 122389999999999999999999999999999
Q ss_pred EccCCCCCCC
Q 021958 248 IGPATNKKTV 257 (305)
Q Consensus 248 v~~~~~~~~~ 257 (305)
..+...+...
T Consensus 157 a~~gttkycs 166 (327)
T KOG2068|consen 157 ASLGTTKYCS 166 (327)
T ss_pred HhhCCCcchh
Confidence 9998887654
No 193
>PF08952 DUF1866: Domain of unknown function (DUF1866) ; InterPro: IPR015047 This domain, found in synaptojanin, has no known function. ; PDB: 1UFW_A 2DNR_A.
Probab=95.72 E-value=0.026 Score=42.96 Aligned_cols=54 Identities=22% Similarity=0.384 Sum_probs=41.9
Q ss_pred HHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 97 YLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 97 ~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
+|.+.|+.||.+.-++++.+ .-+|+|.+.+.|.+|+ .++|..+ .|+.+++....
T Consensus 52 ~ll~~~~~~GevvLvRfv~~--------~mwVTF~dg~sALaal-s~dg~~v--~g~~l~i~LKt 105 (146)
T PF08952_consen 52 ELLQKFAQYGEVVLVRFVGD--------TMWVTFRDGQSALAAL-SLDGIQV--NGRTLKIRLKT 105 (146)
T ss_dssp HHHHHHHCCS-ECEEEEETT--------CEEEEESSCHHHHHHH-HGCCSEE--TTEEEEEEE--
T ss_pred HHHHHHHhCCceEEEEEeCC--------eEEEEECccHHHHHHH-ccCCcEE--CCEEEEEEeCC
Confidence 66777889999887777764 4899999999999999 6799999 66777775444
No 194
>KOG4676 consensus Splicing factor, arginine/serine-rich [RNA processing and modification]
Probab=95.70 E-value=0.015 Score=50.88 Aligned_cols=76 Identities=14% Similarity=0.152 Sum_probs=60.5
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCC---CCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEcc
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRL---TGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGP 250 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~---~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~ 250 (305)
..|.|.||...++.+++..+|. -.|.|..+.+..+.. -......|||.|.|..++..|- .|.++.+=|+.|.|-.
T Consensus 8 ~vIqvanispsat~dqm~tlFg-~lGkI~elrlyp~~~d~~~pv~sRtcyVkf~d~~sv~vaQ-hLtntvfvdraliv~p 85 (479)
T KOG4676|consen 8 GVIQVANISPSATKDQMQTLFG-NLGKIPELRLYPNVDDSKIPVISRTCYVKFLDSQSVTVAQ-HLTNTVFVDRALIVRP 85 (479)
T ss_pred ceeeecccCchhhHHHHHHHHh-hccccccccccCCCCCccCcceeeeEEEeccCCcceeHHh-hhccceeeeeeEEEEe
Confidence 4899999999999999999998 999999999876422 2334668999999999988885 4666777667666654
Q ss_pred C
Q 021958 251 A 251 (305)
Q Consensus 251 ~ 251 (305)
+
T Consensus 86 ~ 86 (479)
T KOG4676|consen 86 Y 86 (479)
T ss_pred c
Confidence 3
No 195
>KOG3671 consensus Actin regulatory protein (Wiskott-Aldrich syndrome protein) [Signal transduction mechanisms; Cytoskeleton]
Probab=95.64 E-value=0.15 Score=46.36 Aligned_cols=31 Identities=13% Similarity=0.073 Sum_probs=20.4
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCcEEE
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEVVAV 111 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~ 111 (305)
-+.+-...|+..--.++|..-+.+-|.++.+
T Consensus 481 l~~ve~t~~~~~dgR~~LmaqIRqG~~Lk~v 511 (569)
T KOG3671|consen 481 LKKVETTALSSGDGRDALMAQIRQGGQLKKV 511 (569)
T ss_pred ccceeeccCcCcccHHHHHHHHHhccccccc
Confidence 3456667777666777888877765554444
No 196
>KOG2135 consensus Proteins containing the RNA recognition motif [General function prediction only]
Probab=95.53 E-value=0.0063 Score=54.47 Aligned_cols=73 Identities=14% Similarity=0.099 Sum_probs=57.8
Q ss_pred eEEEcCCCcCC-CHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 175 TIFVGDLAADV-TDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 175 ~l~v~nlp~~~-~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
.|-+.-.+... +.++|...|. +||+|..|.+-.. --.|.|+|.+..+|-+|.. .++..|++|.|+|.|.++
T Consensus 374 ~l~lek~~~glnt~a~ln~hfA-~fG~i~n~qv~~~------~~~a~vTF~t~aeag~a~~-s~~avlnnr~iKl~whnp 445 (526)
T KOG2135|consen 374 PLALEKSPFGLNTIADLNPHFA-QFGEIENIQVDYS------SLHAVVTFKTRAEAGEAYA-SHGAVLNNRFIKLFWHNP 445 (526)
T ss_pred hhhhhccCCCCchHhhhhhhhh-hcCccccccccCc------hhhheeeeeccccccchhc-cccceecCceeEEEEecC
Confidence 34444445554 4589999999 9999999988544 3358999999999988875 599999999999999988
Q ss_pred CC
Q 021958 254 KK 255 (305)
Q Consensus 254 ~~ 255 (305)
..
T Consensus 446 s~ 447 (526)
T KOG2135|consen 446 SP 447 (526)
T ss_pred Cc
Confidence 44
No 197
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=95.44 E-value=0.022 Score=45.57 Aligned_cols=83 Identities=13% Similarity=0.175 Sum_probs=51.5
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhc-CCCe---eEEEEEecCCC--CCcccEEEEEeCCHHHHHHHHHHhCCceecc--
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRAR-YPST---KGAKVVIDRLT--GRTKGYGFVRFGDESEQLRAMTEMNGVFCST-- 243 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~-~G~i---~~i~i~~~~~~--~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g-- 243 (305)
...+|.|++||..+|++++.+.+. . ++.- .++.-...... .....-|+|.|.+.++...-...++|+.|-+
T Consensus 6 ~~~KvVIR~LPP~LteeeF~~~i~-~~l~~~~~w~y~~g~~~~~~~~~~~~SRaYi~F~~~~~~~~F~~~~~g~~F~D~k 84 (176)
T PF03467_consen 6 EGTKVVIRRLPPNLTEEEFWEQIS-PWLPDEWDWYYFQGKYGKKSFKPPTYSRAYINFKNPEDLLEFRDRFDGHVFVDSK 84 (176)
T ss_dssp ---EEEEEEE-TTS-HHHHCCCCS-S--SSE---EEEEEEES-SSSTTS--EEEEEEESSCHHHHHHHHHCTTEEEE-TT
T ss_pred cCceEEEeCCCCCCCHHHHHHHhh-hhcccccceEEEecCCCCccCCCCcceEEEEEeCCHHHHHHHHHhcCCcEEECCC
Confidence 356899999999999999999777 5 6665 33332222111 1223459999999999999999999988632
Q ss_pred ---eeeEEccCCCCC
Q 021958 244 ---RPMRIGPATNKK 255 (305)
Q Consensus 244 ---~~l~v~~~~~~~ 255 (305)
....|.+|--+.
T Consensus 85 g~~~~~~VE~Apyqk 99 (176)
T PF03467_consen 85 GNEYPAVVEFAPYQK 99 (176)
T ss_dssp S-EEEEEEEE-SS--
T ss_pred CCCcceeEEEcchhc
Confidence 345666665543
No 198
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=95.20 E-value=0.17 Score=33.17 Aligned_cols=55 Identities=25% Similarity=0.393 Sum_probs=43.9
Q ss_pred CCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEE
Q 021958 184 DVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRI 248 (305)
Q Consensus 184 ~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v 248 (305)
.++-++++..|. .|+- .+|..|. .| =||.|.+.++|++|....+|..+.+..|.+
T Consensus 11 ~~~v~d~K~~Lr-~y~~---~~I~~d~-----tG-fYIvF~~~~Ea~rC~~~~~~~~~f~y~m~M 65 (66)
T PF11767_consen 11 GVTVEDFKKRLR-KYRW---DRIRDDR-----TG-FYIVFNDSKEAERCFRAEDGTLFFTYRMQM 65 (66)
T ss_pred CccHHHHHHHHh-cCCc---ceEEecC-----CE-EEEEECChHHHHHHHHhcCCCEEEEEEEEe
Confidence 467889999998 8875 3455554 33 379999999999999999999988877765
No 199
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=95.08 E-value=0.028 Score=44.92 Aligned_cols=84 Identities=13% Similarity=0.097 Sum_probs=50.1
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhh-cCCc---EEEEEEeeCCC--CCccceEEEEEcCHHHHHHHHHHhCCCCCCCC-
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAH-TGEV---VAVKVIRNKQT--GQIEGYGFIEFISRAGAERVLQTFNGTPMPNG- 151 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~-~G~v---~~~~~~~~~~~--~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~- 151 (305)
....+|.|++||+++|++++++.++. ++.- ..+.-...... ...-.-|||.|.+.+++......++|..+...
T Consensus 5 ~~~~KvVIR~LPP~LteeeF~~~i~~~l~~~~~w~y~~g~~~~~~~~~~~~SRaYi~F~~~~~~~~F~~~~~g~~F~D~k 84 (176)
T PF03467_consen 5 KEGTKVVIRRLPPNLTEEEFWEQISPWLPDEWDWYYFQGKYGKKSFKPPTYSRAYINFKNPEDLLEFRDRFDGHVFVDSK 84 (176)
T ss_dssp ----EEEEEEE-TTS-HHHHCCCCSS--SSE---EEEEEEES-SSSTTS--EEEEEEESSCHHHHHHHHHCTTEEEE-TT
T ss_pred ccCceEEEeCCCCCCCHHHHHHHhhhhcccccceEEEecCCCCccCCCCcceEEEEEeCCHHHHHHHHHhcCCcEEECCC
Confidence 45568999999999999999998776 5554 23321122111 12234699999999999999999999766321
Q ss_pred --CcceeeeccCC
Q 021958 152 --EQNFRLNWASF 162 (305)
Q Consensus 152 --~~~i~v~~~~~ 162 (305)
.....|.+|-.
T Consensus 85 g~~~~~~VE~Apy 97 (176)
T PF03467_consen 85 GNEYPAVVEFAPY 97 (176)
T ss_dssp S-EEEEEEEE-SS
T ss_pred CCCcceeEEEcch
Confidence 22334555544
No 200
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=94.91 E-value=0.87 Score=38.79 Aligned_cols=157 Identities=11% Similarity=0.171 Sum_probs=96.7
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCC-------CCCccceEEEEEcCHHHHHHHHH----HhCCCCC
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQ-------TGQIEGYGFIEFISRAGAERVLQ----TFNGTPM 148 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~-------~~~~~g~afV~f~~~~~a~~a~~----~l~g~~~ 148 (305)
..|.|...|+..+++-..+..-|.+||.|++|.++.+.. .........+.|-+.+.+...-. .|....-
T Consensus 14 rTRSLLfeNv~~sidLh~Fl~~fv~~~pIESiYL~~~~~~~~d~~~~d~~~~SilLSFlsr~~CLdFYNnvLQrLsEfK~ 93 (309)
T PF10567_consen 14 RTRSLLFENVNNSIDLHSFLTKFVKFGPIESIYLIKSNDKPSDDYNDDKNNQSILLSFLSREICLDFYNNVLQRLSEFKT 93 (309)
T ss_pred eeHHHHHhhccccccHHHHHHHhhccCceeEEEEecCCCcccccccccccceEEEEeeechHHHHHHHHHHHHHHHHHHH
Confidence 457799999999999999999999999999999998751 12334568889999888765432 2221111
Q ss_pred CCCCcceeeeccCCCCCC----------------------CCCCCCCceEEEcCCCcCCCHHH-HHHHH---hhcCC---
Q 021958 149 PNGEQNFRLNWASFGAGE----------------------KRDDTPDHTIFVGDLAADVTDYM-LQETF---RARYP--- 199 (305)
Q Consensus 149 ~~~~~~i~v~~~~~~~~~----------------------~~~~~~~~~l~v~nlp~~~~~~~-l~~~f---~~~~G--- 199 (305)
......+.+.+....... -.....++.|.|. +...+.+++ +.+.+ . .-+
T Consensus 94 ~L~S~~L~lsFV~l~y~~~~~~~~~~~~~~~~~~~~L~~~i~~~gATRSl~Ie-F~~~~~~~dl~~~kL~fL~-~~~n~R 171 (309)
T PF10567_consen 94 KLKSESLTLSFVSLNYQKKTDPNDEEADFSDYLVASLQYNIINRGATRSLAIE-FKDPVDKDDLIEKKLPFLK-NSNNKR 171 (309)
T ss_pred hcCCcceeEEEEEEeccccccccccccchhhHHhhhhhheeecCCcceEEEEE-ecCccchhHHHHHhhhhhc-cCCCce
Confidence 112334444443311000 0122445666664 334443333 33222 2 223
Q ss_pred -CeeEEEEEecCC--CCCcccEEEEEeCCHHHHHHHHHHhCC
Q 021958 200 -STKGAKVVIDRL--TGRTKGYGFVRFGDESEQLRAMTEMNG 238 (305)
Q Consensus 200 -~i~~i~i~~~~~--~~~~~g~afV~f~~~~~A~~a~~~l~~ 238 (305)
.|++|.++...+ ..-.+.||.++|-++.-|...+.-|..
T Consensus 172 YVlEsIDlVna~~~~~~Fp~~YaILtFlnIsMAiEV~dYlk~ 213 (309)
T PF10567_consen 172 YVLESIDLVNADEPSKHFPKNYAILTFLNISMAIEVLDYLKS 213 (309)
T ss_pred EEEEEEEEeccCcccccCCcceEEEeehhHHhHHHHHHHHHh
Confidence 377788775422 334588999999999999999887763
No 201
>KOG4285 consensus Mitotic phosphoprotein [Cell cycle control, cell division, chromosome partitioning]
Probab=94.71 E-value=0.14 Score=43.43 Aligned_cols=75 Identities=15% Similarity=0.251 Sum_probs=55.4
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecce-eeEEcc
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTR-PMRIGP 250 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~-~l~v~~ 250 (305)
....|-|.+++..-. ..|..+|+ +||+|.+.... ....+-.|.|.++-+|.+||.+ +|+.|+|. .|-|..
T Consensus 196 ~D~WVTVfGFppg~~-s~vL~~F~-~cG~Vvkhv~~------~ngNwMhirYssr~~A~KALsk-ng~ii~g~vmiGVkp 266 (350)
T KOG4285|consen 196 ADTWVTVFGFPPGQV-SIVLNLFS-RCGEVVKHVTP------SNGNWMHIRYSSRTHAQKALSK-NGTIIDGDVMIGVKP 266 (350)
T ss_pred ccceEEEeccCccch-hHHHHHHH-hhCeeeeeecC------CCCceEEEEecchhHHHHhhhh-cCeeeccceEEeeee
Confidence 356788888887644 45777888 99999755432 2255889999999999999987 88888774 456666
Q ss_pred CCCCC
Q 021958 251 ATNKK 255 (305)
Q Consensus 251 ~~~~~ 255 (305)
+..+.
T Consensus 267 CtDks 271 (350)
T KOG4285|consen 267 CTDKS 271 (350)
T ss_pred cCCHH
Confidence 55543
No 202
>PF15023 DUF4523: Protein of unknown function (DUF4523)
Probab=94.64 E-value=0.13 Score=38.80 Aligned_cols=73 Identities=15% Similarity=0.209 Sum_probs=53.5
Q ss_pred CCCceEEEcCCCccc----CHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcc
Q 021958 79 GEIRTLWIGDLQYWM----DETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQN 154 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~----t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~ 154 (305)
..-.+|.|.=|..++ +...|...++.||.|.++...-. --|.|.|++..+|-+|+.++.... -|..
T Consensus 84 pPMsTIVVRWlkknm~~~edl~sV~~~Ls~fGpI~SVT~cGr-------qsavVvF~d~~SAC~Av~Af~s~~---pgtm 153 (166)
T PF15023_consen 84 PPMSTIVVRWLKKNMQPTEDLKSVIQRLSVFGPIQSVTLCGR-------QSAVVVFKDITSACKAVSAFQSRA---PGTM 153 (166)
T ss_pred CCceeEEeehhhhcCChHHHHHHHHHHHHhcCCcceeeecCC-------ceEEEEehhhHHHHHHHHhhcCCC---CCce
Confidence 344577776555444 44455566789999999977543 269999999999999999887643 4778
Q ss_pred eeeeccC
Q 021958 155 FRLNWAS 161 (305)
Q Consensus 155 i~v~~~~ 161 (305)
+++.|..
T Consensus 154 ~qCsWqq 160 (166)
T PF15023_consen 154 FQCSWQQ 160 (166)
T ss_pred EEeeccc
Confidence 8888875
No 203
>KOG0804 consensus Cytoplasmic Zn-finger protein BRAP2 (BRCA1 associated protein) [General function prediction only]
Probab=94.43 E-value=0.25 Score=44.42 Aligned_cols=70 Identities=14% Similarity=0.165 Sum_probs=59.3
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecc
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCST 243 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g 243 (305)
.+..|+|-.+|..++-.||..++....-.|..++|++|.. .++-..+|+|.+.++|..-.+.+||+.|+.
T Consensus 73 ~~~mLcilaVP~~mt~~Dll~F~~~~~~~I~~irivRd~~--pnrymvLIkFr~q~da~~Fy~efNGk~Fn~ 142 (493)
T KOG0804|consen 73 SSTMLCILAVPAYMTSHDLLRFCASFIKQISDIRIVRDGM--PNRYMVLIKFRDQADADTFYEEFNGKQFNS 142 (493)
T ss_pred CCcEEEEEeccccccHHHHHHHHHHHhhhhheeEEeecCC--CceEEEEEEeccchhHHHHHHHcCCCcCCC
Confidence 3788999999999999999999874445689999999742 336678999999999999999999988643
No 204
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=94.40 E-value=0.092 Score=36.52 Aligned_cols=69 Identities=17% Similarity=0.330 Sum_probs=44.2
Q ss_pred EEEEEcCHHHHHHHHHHhCCCCCCCCCcc--eeeeccCCCCCCC---CCCCCCceEEEcCCCcCCCHHHHHHHHh
Q 021958 126 GFIEFISRAGAERVLQTFNGTPMPNGEQN--FRLNWASFGAGEK---RDDTPDHTIFVGDLAADVTDYMLQETFR 195 (305)
Q Consensus 126 afV~f~~~~~a~~a~~~l~g~~~~~~~~~--i~v~~~~~~~~~~---~~~~~~~~l~v~nlp~~~~~~~l~~~f~ 195 (305)
|+|+|.+..-|.+.++. ....+...+.. +++..-......+ ......++|.|.|||...++|+|++.++
T Consensus 1 AlITF~e~~VA~~i~~~-~~~~v~l~~~~~~V~v~P~~~~~~~k~qv~~~vs~rtVlvsgip~~l~ee~l~D~Le 74 (88)
T PF07292_consen 1 ALITFEEEGVAQRILKK-KKHPVPLEDCCVRVKVSPVTLGHLQKFQVFSGVSKRTVLVSGIPDVLDEEELRDKLE 74 (88)
T ss_pred CEEEeCcHHHHHHHHhC-CEEEEEECCEEEEEEEEeEecCCceEEEEEEcccCCEEEEeCCCCCCChhhheeeEE
Confidence 68999999999998854 22222222222 3333222221111 2334578999999999999999998886
No 205
>KOG4574 consensus RNA-binding protein (contains RRM and Pumilio-like repeats) [General function prediction only]
Probab=94.21 E-value=0.17 Score=48.92 Aligned_cols=76 Identities=16% Similarity=0.166 Sum_probs=66.5
Q ss_pred ceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 82 RTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
.+.++.|..-..+-.-|..++.+||.|.+.+..++-+ .|.|+|.+.+.|..|+++++|+++-..|.+.+|.+++
T Consensus 299 p~~~~~nn~v~~tSssL~~l~s~yg~v~s~wtlr~~N------~alvs~~s~~sai~a~dAl~gkevs~~g~Ps~V~~ak 372 (1007)
T KOG4574|consen 299 PKQSLENNAVNLTSSSLATLCSDYGSVASAWTLRDLN------MALVSFSSVESAILALDALQGKEVSVTGAPSRVSFAK 372 (1007)
T ss_pred chhhhhcccccchHHHHHHHHHhhcchhhheeccccc------chhhhhHHHHHHHHhhhhhcCCcccccCCceeEEecc
Confidence 3556677777888999999999999999999988754 7999999999999999999999988788889998887
Q ss_pred CC
Q 021958 162 FG 163 (305)
Q Consensus 162 ~~ 163 (305)
..
T Consensus 373 ~~ 374 (1007)
T KOG4574|consen 373 TL 374 (1007)
T ss_pred cc
Confidence 54
No 206
>KOG4574 consensus RNA-binding protein (contains RRM and Pumilio-like repeats) [General function prediction only]
Probab=94.20 E-value=0.027 Score=54.08 Aligned_cols=73 Identities=19% Similarity=0.214 Sum_probs=60.7
Q ss_pred EEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee--cceeeEEccCCCC
Q 021958 177 FVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC--STRPMRIGPATNK 254 (305)
Q Consensus 177 ~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~--~g~~l~v~~~~~~ 254 (305)
.+.|..-..+-.-|..+|+ .||.|.+++.+++ -.+|.|+|.+.+.|..|+++|+|+++ -|-..+|.+++.-
T Consensus 302 ~~~nn~v~~tSssL~~l~s-~yg~v~s~wtlr~------~N~alvs~~s~~sai~a~dAl~gkevs~~g~Ps~V~~ak~~ 374 (1007)
T KOG4574|consen 302 SLENNAVNLTSSSLATLCS-DYGSVASAWTLRD------LNMALVSFSSVESAILALDALQGKEVSVTGAPSRVSFAKTL 374 (1007)
T ss_pred hhhcccccchHHHHHHHHH-hhcchhhheeccc------ccchhhhhHHHHHHHHhhhhhcCCcccccCCceeEEecccc
Confidence 3344455667788999999 9999999999887 45799999999999999999999884 7778999998865
Q ss_pred CC
Q 021958 255 KT 256 (305)
Q Consensus 255 ~~ 256 (305)
.-
T Consensus 375 ~~ 376 (1007)
T KOG4574|consen 375 PM 376 (1007)
T ss_pred cc
Confidence 43
No 207
>KOG2068 consensus MOT2 transcription factor [Transcription]
Probab=93.98 E-value=0.035 Score=47.87 Aligned_cols=80 Identities=10% Similarity=0.218 Sum_probs=58.9
Q ss_pred ceEEEcCCCcccCHHHHh---HhhhhcCCcEEEEEEeeCC-CCCccc--eEEEEEcCHHHHHHHHHHhCCCCCCCCCcce
Q 021958 82 RTLWIGDLQYWMDETYLN---TCFAHTGEVVAVKVIRNKQ-TGQIEG--YGFIEFISRAGAERVLQTFNGTPMPNGEQNF 155 (305)
Q Consensus 82 ~~l~v~nLp~~~t~~~l~---~~f~~~G~v~~~~~~~~~~-~~~~~g--~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i 155 (305)
+-+||-+|+..+..+++. ++|.+||.|..+.+.++.. .....+ -+||+|...++|..||...+|..++ ++.+
T Consensus 78 nlvyvvgl~~~~ade~~l~~~eyfgqygki~ki~~~~~~S~~s~~~~~~s~yITy~~~eda~rci~~v~g~~~d--g~~l 155 (327)
T KOG2068|consen 78 NLVYVVGLPLDLADESVLERTEYFGQYGKINKIVKNKDPSSSSSSGGTCSVYITYEEEEDADRCIDDVDGFVDD--GRAL 155 (327)
T ss_pred hhhhhhCCCccccchhhhhCcccccccccceEEeecCCcccccCCCCCCcccccccchHhhhhHHHHhhhHHhh--hhhh
Confidence 568888998777555544 4688999999998888652 111111 3799999999999999999998884 4667
Q ss_pred eeeccCCC
Q 021958 156 RLNWASFG 163 (305)
Q Consensus 156 ~v~~~~~~ 163 (305)
+..+...+
T Consensus 156 ka~~gttk 163 (327)
T KOG2068|consen 156 KASLGTTK 163 (327)
T ss_pred HHhhCCCc
Confidence 77666654
No 208
>KOG2253 consensus U1 snRNP complex, subunit SNU71 and related PWI-motif proteins [RNA processing and modification]
Probab=93.90 E-value=0.068 Score=50.09 Aligned_cols=72 Identities=18% Similarity=0.216 Sum_probs=63.5
Q ss_pred CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEE
Q 021958 169 DDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRI 248 (305)
Q Consensus 169 ~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v 248 (305)
...+..++||+|+-..+.++-++.++. .+|.|.++.... |||+.|.....+.+|+..++...++|..+.+
T Consensus 36 ~~~~~~~vfv~~~~~~~s~~~~~~il~-~~g~v~s~kr~~---------fgf~~f~~~~~~~ra~r~~t~~~~~~~kl~~ 105 (668)
T KOG2253|consen 36 PLPPRDTVFVGNISYLVSQEFWKSILA-KSGFVPSWKRDK---------FGFCEFLKHIGDLRASRLLTELNIDDQKLIE 105 (668)
T ss_pred CCCCCceeEecchhhhhhHHHHHHHHh-hCCcchhhhhhh---------hcccchhhHHHHHHHHHHhcccCCCcchhhc
Confidence 345678999999999999999999998 999998776532 8999999999999999999999999988877
Q ss_pred cc
Q 021958 249 GP 250 (305)
Q Consensus 249 ~~ 250 (305)
.-
T Consensus 106 ~~ 107 (668)
T KOG2253|consen 106 NV 107 (668)
T ss_pred cc
Confidence 65
No 209
>KOG0804 consensus Cytoplasmic Zn-finger protein BRAP2 (BRCA1 associated protein) [General function prediction only]
Probab=93.00 E-value=0.4 Score=43.12 Aligned_cols=67 Identities=19% Similarity=0.250 Sum_probs=57.3
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCC
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMP 149 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~ 149 (305)
.+.|+|-.+|..+|..||..+...+-. |.++++++|.. ..+-..++.|.+.++|....+.+||+.+.
T Consensus 74 ~~mLcilaVP~~mt~~Dll~F~~~~~~~I~~irivRd~~--pnrymvLIkFr~q~da~~Fy~efNGk~Fn 141 (493)
T KOG0804|consen 74 STMLCILAVPAYMTSHDLLRFCASFIKQISDIRIVRDGM--PNRYMVLIKFRDQADADTFYEEFNGKQFN 141 (493)
T ss_pred CcEEEEEeccccccHHHHHHHHHHHhhhhheeEEeecCC--CceEEEEEEeccchhHHHHHHHcCCCcCC
Confidence 788999999999999999999886644 89999999642 23345899999999999999999999874
No 210
>PF03880 DbpA: DbpA RNA binding domain ; InterPro: IPR005580 This RNA binding domain is found at the C terminus of a number of DEAD helicase proteins [].; PDB: 2G0C_A 3MOJ_B.
Probab=92.75 E-value=0.52 Score=31.68 Aligned_cols=59 Identities=14% Similarity=0.216 Sum_probs=34.9
Q ss_pred cCCCHHHHHHHHhhcCCC-----eeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccC
Q 021958 183 ADVTDYMLQETFRARYPS-----TKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPA 251 (305)
Q Consensus 183 ~~~~~~~l~~~f~~~~G~-----i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~ 251 (305)
..++..+|..++. .-+. |-.|+|. ..|+||+-... .|..+++.|++..++|+.++|+.+
T Consensus 11 dg~~~~~iv~~i~-~~~gi~~~~IG~I~I~--------~~~S~vev~~~-~a~~v~~~l~~~~~~gk~v~ve~A 74 (74)
T PF03880_consen 11 DGLTPRDIVGAIC-NEAGIPGRDIGRIDIF--------DNFSFVEVPEE-VAEKVLEALNGKKIKGKKVRVERA 74 (74)
T ss_dssp GT--HHHHHHHHH-TCTTB-GGGEEEEEE---------SS-EEEEE-TT--HHHHHHHHTT--SSS----EEE-
T ss_pred cCCCHHHHHHHHH-hccCCCHHhEEEEEEe--------eeEEEEEECHH-HHHHHHHHhcCCCCCCeeEEEEEC
Confidence 3578888888887 4444 4456665 33789987765 788899999999999999999754
No 211
>KOG2591 consensus c-Mpl binding protein, contains La domain [Signal transduction mechanisms]
Probab=90.62 E-value=0.51 Score=43.61 Aligned_cols=57 Identities=12% Similarity=0.149 Sum_probs=46.7
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhh--cCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHH
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAH--TGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQT 142 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~--~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 142 (305)
...+.|.|+.|+..+-.++|+.+|+. |-.+.+|.+-.+. -=||+|++..+|..|.+.
T Consensus 173 ~kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa~N~-------nWyITfesd~DAQqAyky 231 (684)
T KOG2591|consen 173 HKRCIVILREIPETTPIEVVKALFKGENCPKVISCEFAHND-------NWYITFESDTDAQQAYKY 231 (684)
T ss_pred cceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeeeecC-------ceEEEeecchhHHHHHHH
Confidence 44566888999999999999999974 5668888877653 369999999999999764
No 212
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton]
Probab=90.40 E-value=1.1 Score=43.26 Aligned_cols=15 Identities=0% Similarity=-0.066 Sum_probs=7.0
Q ss_pred ccEEEEEeCCHHHHH
Q 021958 216 KGYGFVRFGDESEQL 230 (305)
Q Consensus 216 ~g~afV~f~~~~~A~ 230 (305)
++.-.+.|...+.++
T Consensus 604 r~~tLlh~iv~~i~e 618 (830)
T KOG1923|consen 604 RSMTLLHYIVLTIAE 618 (830)
T ss_pred cceeeeehhhHHHHH
Confidence 444455555444443
No 213
>KOG4285 consensus Mitotic phosphoprotein [Cell cycle control, cell division, chromosome partitioning]
Probab=90.32 E-value=0.66 Score=39.58 Aligned_cols=60 Identities=22% Similarity=0.200 Sum_probs=44.6
Q ss_pred eEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCC
Q 021958 83 TLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPN 150 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~ 150 (305)
=|-|-+++.... ..|..+|++||.|.+...- ....+-+|.|.+..+|++||.+ +|+.|.+
T Consensus 199 WVTVfGFppg~~-s~vL~~F~~cG~Vvkhv~~------~ngNwMhirYssr~~A~KALsk-ng~ii~g 258 (350)
T KOG4285|consen 199 WVTVFGFPPGQV-SIVLNLFSRCGEVVKHVTP------SNGNWMHIRYSSRTHAQKALSK-NGTIIDG 258 (350)
T ss_pred eEEEeccCccch-hHHHHHHHhhCeeeeeecC------CCCceEEEEecchhHHHHhhhh-cCeeecc
Confidence 355556665443 5678899999998776543 2234899999999999999965 8888853
No 214
>KOG1923 consensus Rac1 GTPase effector FRL [Signal transduction mechanisms; Cytoskeleton]
Probab=90.29 E-value=1.5 Score=42.25 Aligned_cols=10 Identities=10% Similarity=-0.160 Sum_probs=4.6
Q ss_pred ceEEEcCCCc
Q 021958 82 RTLWIGDLQY 91 (305)
Q Consensus 82 ~~l~v~nLp~ 91 (305)
.++|-.-+++
T Consensus 386 ~tvf~~~~De 395 (830)
T KOG1923|consen 386 GTVFHELNDE 395 (830)
T ss_pred cchhhhhhHH
Confidence 3555444443
No 215
>KOG2318 consensus Uncharacterized conserved protein [Function unknown]
Probab=89.29 E-value=1.4 Score=41.11 Aligned_cols=84 Identities=14% Similarity=0.148 Sum_probs=61.9
Q ss_pred CCCceEEEcCCCcC-CCHHHHHHHHhhcC----CCeeEEEEEecCC----------CCC---------------------
Q 021958 171 TPDHTIFVGDLAAD-VTDYMLQETFRARY----PSTKGAKVVIDRL----------TGR--------------------- 214 (305)
Q Consensus 171 ~~~~~l~v~nlp~~-~~~~~l~~~f~~~~----G~i~~i~i~~~~~----------~~~--------------------- 214 (305)
..+++|-|.|+.|+ +.-+||.-+|+ .| |.|.+|.|..... .|.
T Consensus 172 ~~T~RLAVvNMDWd~v~AkDL~v~~n-SFlP~gGsilSV~IYpSeFGkeRM~eEeV~GP~~el~~~~e~~~~s~sD~ee~ 250 (650)
T KOG2318|consen 172 EETKRLAVVNMDWDRVKAKDLFVLFN-SFLPKGGSILSVKIYPSEFGKERMKEEEVHGPPKELFKPVEEYKESESDDEEE 250 (650)
T ss_pred cccceeeEeccccccccHHHHHHHHH-hhcCCCCceeEEEechhhhhHHHhhhhcccCChhhhccccccCcccccchhhh
Confidence 46789999999997 78889988887 44 5788888753210 111
Q ss_pred ----------------cccEEEEEeCCHHHHHHHHHHhCCceecc--eeeEEccCCCCC
Q 021958 215 ----------------TKGYGFVRFGDESEQLRAMTEMNGVFCST--RPMRIGPATNKK 255 (305)
Q Consensus 215 ----------------~~g~afV~f~~~~~A~~a~~~l~~~~~~g--~~l~v~~~~~~~ 255 (305)
.-.||.|+|.+++.|.+....++|..+.. ..|-+.|.-...
T Consensus 251 ~~~~~~kLR~Yq~~rLkYYyAVvecDsi~tA~~vYe~CDG~EfEsS~~~~DLRFIPDdm 309 (650)
T KOG2318|consen 251 EDVDREKLRQYQLNRLKYYYAVVECDSIETAKAVYEECDGIEFESSANKLDLRFIPDDM 309 (650)
T ss_pred hhHHHHHHHHHHhhhheeEEEEEEecCchHHHHHHHhcCcceeccccceeeeeecCCCC
Confidence 13479999999999999999999999854 555555654433
No 216
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=89.05 E-value=0.94 Score=36.42 Aligned_cols=62 Identities=18% Similarity=0.195 Sum_probs=43.4
Q ss_pred CHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhC--CCCCCCCCcceeeeccCCC
Q 021958 94 DETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFN--GTPMPNGEQNFRLNWASFG 163 (305)
Q Consensus 94 t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~--g~~~~~~~~~i~v~~~~~~ 163 (305)
..+.|+++|..++.+..+...+.- +-..|.|.+.+.|.+|...|+ +..+ .+..+++.++...
T Consensus 8 ~~~~l~~l~~~~~~~~~~~~L~sF------rRi~v~f~~~~~A~~~r~~l~~~~~~~--~g~~l~~yf~~~~ 71 (184)
T PF04847_consen 8 NLAELEELFSTYDPPVQFSPLKSF------RRIRVVFESPESAQRARQLLHWDGTSF--NGKRLRVYFGQPT 71 (184)
T ss_dssp -HHHHHHHHHTT-SS-EEEEETTT------TEEEEE-SSTTHHHHHHHTST--TSEE--TTEE-EEE----S
T ss_pred hHHHHHHHHHhcCCceEEEEcCCC------CEEEEEeCCHHHHHHHHHHhccccccc--CCCceEEEEcccc
Confidence 457899999999998888776542 248999999999999999988 7777 5677888877543
No 217
>PF03880 DbpA: DbpA RNA binding domain ; InterPro: IPR005580 This RNA binding domain is found at the C terminus of a number of DEAD helicase proteins [].; PDB: 2G0C_A 3MOJ_B.
Probab=88.97 E-value=1.9 Score=28.94 Aligned_cols=58 Identities=17% Similarity=0.334 Sum_probs=34.8
Q ss_pred cccCHHHHhHhhhhcCC-----cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 91 YWMDETYLNTCFAHTGE-----VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 91 ~~~t~~~l~~~f~~~G~-----v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
..++..+|..++...+. |-.+.+..+ |+||+... +.|..+++.|++..+ .++.+++..
T Consensus 11 dg~~~~~iv~~i~~~~gi~~~~IG~I~I~~~--------~S~vev~~-~~a~~v~~~l~~~~~--~gk~v~ve~ 73 (74)
T PF03880_consen 11 DGLTPRDIVGAICNEAGIPGRDIGRIDIFDN--------FSFVEVPE-EVAEKVLEALNGKKI--KGKKVRVER 73 (74)
T ss_dssp GT--HHHHHHHHHTCTTB-GGGEEEEEE-SS---------EEEEE-T-T-HHHHHHHHTT--S--SS----EEE
T ss_pred cCCCHHHHHHHHHhccCCCHHhEEEEEEeee--------EEEEEECH-HHHHHHHHHhcCCCC--CCeeEEEEE
Confidence 56788899999987654 556666543 89999776 477889999999999 567777654
No 218
>KOG2135 consensus Proteins containing the RNA recognition motif [General function prediction only]
Probab=88.00 E-value=0.26 Score=44.55 Aligned_cols=75 Identities=20% Similarity=0.297 Sum_probs=57.6
Q ss_pred CCceEEEcCCCccc-CHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeee
Q 021958 80 EIRTLWIGDLQYWM-DETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLN 158 (305)
Q Consensus 80 ~~~~l~v~nLp~~~-t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~ 158 (305)
+.+.|-+...+... +-.+|...|.+||.|..|.+-.... -|.|+|.+..+|-+|- ...+..| .++.|++.
T Consensus 371 dhs~l~lek~~~glnt~a~ln~hfA~fG~i~n~qv~~~~~------~a~vTF~t~aeag~a~-~s~~avl--nnr~iKl~ 441 (526)
T KOG2135|consen 371 DHSPLALEKSPFGLNTIADLNPHFAQFGEIENIQVDYSSL------HAVVTFKTRAEAGEAY-ASHGAVL--NNRFIKLF 441 (526)
T ss_pred ccchhhhhccCCCCchHhhhhhhhhhcCccccccccCchh------hheeeeeccccccchh-cccccee--cCceeEEE
Confidence 34455555566544 6789999999999999998765522 5999999999996666 4578888 67999999
Q ss_pred ccCCC
Q 021958 159 WASFG 163 (305)
Q Consensus 159 ~~~~~ 163 (305)
|....
T Consensus 442 whnps 446 (526)
T KOG2135|consen 442 WHNPS 446 (526)
T ss_pred EecCC
Confidence 98754
No 219
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=87.01 E-value=3 Score=27.34 Aligned_cols=49 Identities=18% Similarity=0.186 Sum_probs=38.3
Q ss_pred cccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCC
Q 021958 91 YWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPM 148 (305)
Q Consensus 91 ~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~ 148 (305)
..++-++++..+.+|+- .+|..|+ +| -||.|.+..+|+++....+|..+
T Consensus 10 ~~~~v~d~K~~Lr~y~~---~~I~~d~-tG-----fYIvF~~~~Ea~rC~~~~~~~~~ 58 (66)
T PF11767_consen 10 HGVTVEDFKKRLRKYRW---DRIRDDR-TG-----FYIVFNDSKEAERCFRAEDGTLF 58 (66)
T ss_pred CCccHHHHHHHHhcCCc---ceEEecC-CE-----EEEEECChHHHHHHHHhcCCCEE
Confidence 35678999999999964 2444554 22 58999999999999999888876
No 220
>PF13689 DUF4154: Domain of unknown function (DUF4154)
Probab=86.13 E-value=5.2 Score=30.75 Aligned_cols=89 Identities=15% Similarity=0.238 Sum_probs=59.5
Q ss_pred CHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCCCCCCCCCC
Q 021958 186 TDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTVSGQQQYPK 265 (305)
Q Consensus 186 ~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~~~~~~~~~ 265 (305)
+|.+|+..| .|.-+..+.+.... ....+-++.+.+..- ..++..+.++.++|+.+.|.....-.. ...
T Consensus 1 ~e~~lkAa~--l~nf~~f~~WP~~~---~~~~~~icv~g~~~~-~~~L~~l~~~~~~~~~i~v~~~~~~~~------~~~ 68 (145)
T PF13689_consen 1 KEYQLKAAY--LYNFAKFIEWPDSA---PSSPFRICVLGDDPF-AEALSTLAGKQVGGRPIRVRRLSSPNE------ISG 68 (145)
T ss_pred CHHHHHHHH--HHHhHhhccCCCCC---CCCCeEEEEECChHH-HHHHHHhhhcccCCCcEEEEECCCCcc------ccc
Confidence 356777777 36666666665442 234456666666664 456778889999999999987643222 367
Q ss_pred cceEEeCCcchhhhHHHHhhc
Q 021958 266 GTFLVVGTGFCLQIKFKFKYF 286 (305)
Q Consensus 266 ~~~~~~g~~~~~~~~~~~~~~ 286 (305)
+..++++.....+..+.++..
T Consensus 69 C~ilyi~~~~~~~~~~i~~~~ 89 (145)
T PF13689_consen 69 CHILYISSSESSQLPEILRKL 89 (145)
T ss_pred ccEEEECCCChHHHHHHHHhc
Confidence 889999988777665555544
No 221
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=85.90 E-value=1.8 Score=36.90 Aligned_cols=84 Identities=5% Similarity=0.103 Sum_probs=62.7
Q ss_pred CCCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecC-------CCCCcccEEEEEeCCHHHHHHH----HHHhC
Q 021958 169 DDTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDR-------LTGRTKGYGFVRFGDESEQLRA----MTEMN 237 (305)
Q Consensus 169 ~~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~-------~~~~~~g~afV~f~~~~~A~~a----~~~l~ 237 (305)
+.-.++.|.+.|+..+++-..+...|. +||.|++|.++.+. +..+......+-|-+.+.+..- ++.|.
T Consensus 11 D~YrTRSLLfeNv~~sidLh~Fl~~fv-~~~pIESiYL~~~~~~~~d~~~~d~~~~SilLSFlsr~~CLdFYNnvLQrLs 89 (309)
T PF10567_consen 11 DEYRTRSLLFENVNNSIDLHSFLTKFV-KFGPIESIYLIKSNDKPSDDYNDDKNNQSILLSFLSREICLDFYNNVLQRLS 89 (309)
T ss_pred ccceeHHHHHhhccccccHHHHHHHhh-ccCceeEEEEecCCCcccccccccccceEEEEeeechHHHHHHHHHHHHHHH
Confidence 344578899999999999999999998 99999999998764 1123345678999999987654 34444
Q ss_pred C--ceecceeeEEccCCC
Q 021958 238 G--VFCSTRPMRIGPATN 253 (305)
Q Consensus 238 ~--~~~~g~~l~v~~~~~ 253 (305)
. ..+....|++.|..-
T Consensus 90 EfK~~L~S~~L~lsFV~l 107 (309)
T PF10567_consen 90 EFKTKLKSESLTLSFVSL 107 (309)
T ss_pred HHHHhcCCcceeEEEEEE
Confidence 3 337777888877653
No 222
>KOG4410 consensus 5-formyltetrahydrofolate cyclo-ligase [Coenzyme transport and metabolism]
Probab=83.94 E-value=5.8 Score=33.72 Aligned_cols=48 Identities=23% Similarity=0.337 Sum_probs=35.9
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCe-eEEEEEecCCCCCcccEEEEEeCCHH
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPST-KGAKVVIDRLTGRTKGYGFVRFGDES 227 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i-~~i~i~~~~~~~~~~g~afV~f~~~~ 227 (305)
..-|+++||+.++.-.||+..+. +-|.+ .++.+. | ..|-||+.|.+..
T Consensus 330 ~~di~~~nl~rd~rv~dlk~~lr-~~~~~pm~iswk-----g-~~~k~flh~~~~~ 378 (396)
T KOG4410|consen 330 KTDIKLTNLSRDIRVKDLKSELR-KRECTPMSISWK-----G-HFGKCFLHFGNRK 378 (396)
T ss_pred ccceeeccCccccchHHHHHHHH-hcCCCceeEeee-----c-CCcceeEecCCcc
Confidence 45699999999999999999998 65543 344442 2 2667999997753
No 223
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification]
Probab=81.71 E-value=1.5 Score=44.40 Aligned_cols=28 Identities=14% Similarity=0.099 Sum_probs=14.2
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCC
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGE 107 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~ 107 (305)
..++++|--+-..+-.+.|+.+.+.++.
T Consensus 71 kak~~~v~t~ka~~PpeHLrki~~~~sd 98 (2365)
T COG5178 71 KAKTLHVLTLKAPIPPEHLRKIQSPCSD 98 (2365)
T ss_pred hhhheeeeccCCCCCHHHHHhhhCcccc
Confidence 3445555555555555555555554443
No 224
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=81.41 E-value=2.6 Score=32.53 Aligned_cols=116 Identities=13% Similarity=-0.006 Sum_probs=68.0
Q ss_pred eEEEcCCC--cccCHHHHhHhhhh-cCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeec
Q 021958 83 TLWIGDLQ--YWMDETYLNTCFAH-TGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNW 159 (305)
Q Consensus 83 ~l~v~nLp--~~~t~~~l~~~f~~-~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~ 159 (305)
...||.+- ...+...|...+.+ ++....+.+..- ..++..+.|.+++++.++++. ....+ .+..+.+..
T Consensus 17 ~~lVg~~l~~~~~~~~~l~~~l~~~W~~~~~~~i~~l-----~~~~fl~~F~~~~d~~~vl~~-~p~~~--~~~~~~l~~ 88 (153)
T PF14111_consen 17 LCLVGRVLSPKPISLSALEQELAKIWKLKGGVKIRDL-----GDNLFLFQFESEEDRQRVLKG-GPWNF--NGHFLILQR 88 (153)
T ss_pred eEEEEEECCCCCCCHHHHHHHHHHHhCCCCcEEEEEe-----CCCeEEEEEEeccceeEEEec-ccccc--cccchhhhh
Confidence 34455542 34567777777764 343323333332 125899999999999988742 22233 344444433
Q ss_pred cCCCCC--CCCCCCCCceEEEcCCCcC-CCHHHHHHHHhhcCCCeeEEEEE
Q 021958 160 ASFGAG--EKRDDTPDHTIFVGDLAAD-VTDYMLQETFRARYPSTKGAKVV 207 (305)
Q Consensus 160 ~~~~~~--~~~~~~~~~~l~v~nlp~~-~~~~~l~~~f~~~~G~i~~i~i~ 207 (305)
-..... ...-.....-|.|.|||.. .+++-++++.+ .+|.+.+++..
T Consensus 89 W~~~~~~~~~~~~~~~vWVri~glP~~~~~~~~~~~i~~-~iG~~i~vD~~ 138 (153)
T PF14111_consen 89 WSPDFNPSEVKFEHIPVWVRIYGLPLHLWSEEILKAIGS-KIGEPIEVDEN 138 (153)
T ss_pred hcccccccccceeccchhhhhccCCHHHhhhHHHHHHHH-hcCCeEEEEcC
Confidence 332111 1111223455788899998 66777777777 99999877664
No 225
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=81.00 E-value=7.1 Score=26.94 Aligned_cols=58 Identities=9% Similarity=0.163 Sum_probs=43.4
Q ss_pred EEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 176 IFVGDLAADVTDYMLQETFRARYPS-TKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
-|.-.++.+.+..++++.++..||. |.+|....-. .+ ..-|||++..-++|.+....+
T Consensus 23 ~y~F~V~~~anK~eIK~AvE~lf~VkV~~VnT~~~~-~~--~KKA~V~L~~g~~A~~va~ki 81 (84)
T PRK14548 23 KLTFIVDRRATKPDIKRAVEELFDVKVEKVNTLITP-KG--EKKAYVKLAEEYDAEEIASRL 81 (84)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcC-CC--cEEEEEEeCCCCcHHHHHHhh
Confidence 4444568889999999999977875 6677766554 22 345999999999998886654
No 226
>KOG4483 consensus Uncharacterized conserved protein [Function unknown]
Probab=80.73 E-value=6.2 Score=35.24 Aligned_cols=62 Identities=16% Similarity=0.064 Sum_probs=50.2
Q ss_pred CCCCCCCCCceEEEcCCCcccCHHHHhHhhhhcCC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHH
Q 021958 73 QQQGQPGEIRTLWIGDLQYWMDETYLNTCFAHTGE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQ 141 (305)
Q Consensus 73 ~~~~~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~ 141 (305)
....+.+-.+.|=|.++|.....+||...|+.|+. -.+|+++-|. .+|..|.+...|..||.
T Consensus 383 ~ll~e~dlpHVlEIydfp~efkteDll~~f~~yq~kgfdIkWvDdt-------halaVFss~~~AaeaLt 445 (528)
T KOG4483|consen 383 PLLRESDLPHVLEIYDFPDEFKTEDLLKAFETYQNKGFDIKWVDDT-------HALAVFSSVNRAAEALT 445 (528)
T ss_pred CCCCcccccceeEeccCchhhccHHHHHHHHHhhcCCceeEEeecc-------eeEEeecchHHHHHHhh
Confidence 33445556678899999999999999999999976 4567777765 69999999999999984
No 227
>KOG4483 consensus Uncharacterized conserved protein [Function unknown]
Probab=80.53 E-value=10 Score=34.00 Aligned_cols=57 Identities=19% Similarity=0.201 Sum_probs=46.8
Q ss_pred CCCceEEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHH
Q 021958 171 TPDHTIFVGDLAADVTDYMLQETFRARYPS-TKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTE 235 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~ 235 (305)
.-...|-|.++|..+..+||...|. .|+. =.+|.++.|. .||..|.+...|..||..
T Consensus 389 dlpHVlEIydfp~efkteDll~~f~-~yq~kgfdIkWvDdt-------halaVFss~~~AaeaLt~ 446 (528)
T KOG4483|consen 389 DLPHVLEIYDFPDEFKTEDLLKAFE-TYQNKGFDIKWVDDT-------HALAVFSSVNRAAEALTL 446 (528)
T ss_pred cccceeEeccCchhhccHHHHHHHH-HhhcCCceeEEeecc-------eeEEeecchHHHHHHhhc
Confidence 3457899999999999999999998 8876 3345565542 799999999999999975
No 228
>KOG4672 consensus Uncharacterized conserved low complexity protein [Function unknown]
Probab=79.55 E-value=6.4 Score=35.24 Aligned_cols=10 Identities=20% Similarity=0.039 Sum_probs=5.7
Q ss_pred CHHHHHHHHh
Q 021958 186 TDYMLQETFR 195 (305)
Q Consensus 186 ~~~~l~~~f~ 195 (305)
|.+|....|-
T Consensus 471 tkDDaY~~FM 480 (487)
T KOG4672|consen 471 TKDDAYNAFM 480 (487)
T ss_pred cchHHHHHHH
Confidence 5566665553
No 229
>KOG4019 consensus Calcineurin-mediated signaling pathway inhibitor DSCR1 [Signal transduction mechanisms; General function prediction only]
Probab=79.06 E-value=2.2 Score=33.76 Aligned_cols=43 Identities=9% Similarity=0.037 Sum_probs=36.2
Q ss_pred ccEEEEEeCCHHHHHHHHHHhCCceecce-eeEEccCCCCCCCC
Q 021958 216 KGYGFVRFGDESEQLRAMTEMNGVFCSTR-PMRIGPATNKKTVS 258 (305)
Q Consensus 216 ~g~afV~f~~~~~A~~a~~~l~~~~~~g~-~l~v~~~~~~~~~~ 258 (305)
.+.-.|.|.+.+.|.+|..++++..|.|. .++.-|+.......
T Consensus 51 frrvRi~f~~p~~a~~a~i~~~~~~f~~~~~~k~yfaQ~~~~~~ 94 (193)
T KOG4019|consen 51 FRRVRINFSNPEAAADARIKLHSTSFNGKNELKLYFAQPGHPES 94 (193)
T ss_pred hceeEEeccChhHHHHHHHHhhhcccCCCceEEEEEccCCCccc
Confidence 45567899999999999999999999888 88888887765554
No 230
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=78.59 E-value=10 Score=25.69 Aligned_cols=58 Identities=9% Similarity=0.143 Sum_probs=43.1
Q ss_pred EEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 176 IFVGDLAADVTDYMLQETFRARYPS-TKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
-|+-.++.+.+..+|++.++..||. |.+|....-+ .+ ..-|||++..-++|.+.-..+
T Consensus 16 ~y~F~V~~~anK~eIK~avE~lf~VkV~~Vnt~~~~-~~--~KKA~VtL~~g~~a~~va~k~ 74 (77)
T TIGR03636 16 KLTFIVDRKATKGDIKRAVEKLFDVKVEKVNTLITP-RG--EKKAYVKLAEEYAAEEIASRL 74 (77)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcC-CC--ceEEEEEECCCCcHHHHHHhh
Confidence 4556678899999999999967875 6666665543 22 345999999988888876554
No 231
>PF07530 PRE_C2HC: Associated with zinc fingers; InterPro: IPR006579 This domain is present in proteins found exclusively in the arthropods, including a number of Drosophila species, the silk moth and the gypsy moth. These proteins are possibly involved in RNA binding or single strand DNA binding.
Probab=78.38 E-value=4.3 Score=26.78 Aligned_cols=65 Identities=8% Similarity=0.095 Sum_probs=44.5
Q ss_pred HHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCC
Q 021958 188 YMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKK 255 (305)
Q Consensus 188 ~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~ 255 (305)
++|.+.|.+..-.|..+.-+....++.....-||+.+...+...++ +=..+++..|+|+....+.
T Consensus 2 ~~I~~~L~~~G~~v~~i~~~~~~~~k~pl~mf~veL~p~~~~k~i~---~Ik~l~~~~V~vE~~~k~~ 66 (68)
T PF07530_consen 2 EEIKEELKDQGHPVRNIHNMHSRNTKKPLNMFFVELEPKPNNKEIY---KIKTLCGQRVKVERPRKRR 66 (68)
T ss_pred HHHHHHHHHcCCceEEEEccccCCCCCCceEEEEeeccCcccccee---ehHhhCCeEEEEecCCCCC
Confidence 4677777733345777777777666777778888888776644443 3366899999998765543
No 232
>KOG2318 consensus Uncharacterized conserved protein [Function unknown]
Probab=78.30 E-value=11 Score=35.41 Aligned_cols=84 Identities=18% Similarity=0.278 Sum_probs=64.3
Q ss_pred CCCCceEEEcCCCc-ccCHHHHhHhhhhc----CCcEEEEEEeeCC----------CCC---------------------
Q 021958 78 PGEIRTLWIGDLQY-WMDETYLNTCFAHT----GEVVAVKVIRNKQ----------TGQ--------------------- 121 (305)
Q Consensus 78 ~~~~~~l~v~nLp~-~~t~~~l~~~f~~~----G~v~~~~~~~~~~----------~~~--------------------- 121 (305)
...+++|-|-|+.| .+...+|.-+|+.| |.|.+|.|..... .|.
T Consensus 171 ~~~T~RLAVvNMDWd~v~AkDL~v~~nSFlP~gGsilSV~IYpSeFGkeRM~eEeV~GP~~el~~~~e~~~~s~sD~ee~ 250 (650)
T KOG2318|consen 171 GEETKRLAVVNMDWDRVKAKDLFVLFNSFLPKGGSILSVKIYPSEFGKERMKEEEVHGPPKELFKPVEEYKESESDDEEE 250 (650)
T ss_pred ccccceeeEeccccccccHHHHHHHHHhhcCCCCceeEEEechhhhhHHHhhhhcccCChhhhccccccCcccccchhhh
Confidence 45678999999996 56999999998876 4688888764321 121
Q ss_pred ----------------ccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccC
Q 021958 122 ----------------IEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWAS 161 (305)
Q Consensus 122 ----------------~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~ 161 (305)
..-||.|+|.+.+.|.+..+.++|..+...+..+.+.|..
T Consensus 251 ~~~~~~kLR~Yq~~rLkYYyAVvecDsi~tA~~vYe~CDG~EfEsS~~~~DLRFIP 306 (650)
T KOG2318|consen 251 EDVDREKLRQYQLNRLKYYYAVVECDSIETAKAVYEECDGIEFESSANKLDLRFIP 306 (650)
T ss_pred hhHHHHHHHHHHhhhheeEEEEEEecCchHHHHHHHhcCcceeccccceeeeeecC
Confidence 1237889999999999999999999998777777776654
No 233
>KOG2891 consensus Surface glycoprotein [General function prediction only]
Probab=72.51 E-value=13 Score=31.62 Aligned_cols=129 Identities=16% Similarity=0.340 Sum_probs=72.4
Q ss_pred eEEEEEcCHHH----HHHHHHHhCCCCCCCCC--cceee-------eccCCC-----------CCCCCCCCCCceEEEcC
Q 021958 125 YGFIEFISRAG----AERVLQTFNGTPMPNGE--QNFRL-------NWASFG-----------AGEKRDDTPDHTIFVGD 180 (305)
Q Consensus 125 ~afV~f~~~~~----a~~a~~~l~g~~~~~~~--~~i~v-------~~~~~~-----------~~~~~~~~~~~~l~v~n 180 (305)
.-||.|+-+-. ..+.+..|+|..+...| ..++| ++...- ..+-.......+|++.+
T Consensus 77 id~iifeael~n~gimkk~l~~ldgfsiklsgfad~lkvka~eakidfpsrhdwdd~fm~~kdmdemkpgerpdti~la~ 156 (445)
T KOG2891|consen 77 IDFIIFEAELENKGIMKKFLACLDGFSIKLSGFADILKVKAAEAKIDFPSRHDWDDFFMDAKDMDEMKPGERPDTIHLAG 156 (445)
T ss_pred cceEEeeHhhhhhhHHHHHHHHhcCCeeeecccchHHhhhHHhhcCCCCcccchHHHHhhhhhhhccCCCCCCCceeecC
Confidence 67888865443 44556668888775332 22332 222110 00112234456888888
Q ss_pred CCcC------------CCHHHHHHHHhhcCCCeeEEEEEec-----CCCCCc-----ccEE---------EEEeCCHHHH
Q 021958 181 LAAD------------VTDYMLQETFRARYPSTKGAKVVID-----RLTGRT-----KGYG---------FVRFGDESEQ 229 (305)
Q Consensus 181 lp~~------------~~~~~l~~~f~~~~G~i~~i~i~~~-----~~~~~~-----~g~a---------fV~f~~~~~A 229 (305)
||-. -+++-|+..|+ .||.|..|.|..- .-+|+. .||+ ||+|-.....
T Consensus 157 ip~kwf~lkedg~~dlpse~rlr~a~e-afg~ir~vdipicdplr~~mn~kisgiq~~gfg~g~dlffeayvqfmeykgf 235 (445)
T KOG2891|consen 157 IPCKWFALKEDGSEDLPSEDRLRKAFE-AFGEIRNVDIPICDPLREEMNGKISGIQFHGFGFGGDLFFEAYVQFMEYKGF 235 (445)
T ss_pred CcceeeeecccccccCChHHHHHHHHH-HhccceecCCcccchhHHHhcCccccceeeccccCcchhHHHHHHHHHHHhH
Confidence 8753 35788999999 9999998887531 113332 3333 3444444445
Q ss_pred HHHHHHhCCcee----cc----eeeEEccCCCC
Q 021958 230 LRAMTEMNGVFC----ST----RPMRIGPATNK 254 (305)
Q Consensus 230 ~~a~~~l~~~~~----~g----~~l~v~~~~~~ 254 (305)
..|+..|.|..+ +| -.++|+|.+++
T Consensus 236 a~amdalr~~k~akk~d~~ffqanvkvdfdrsr 268 (445)
T KOG2891|consen 236 AQAMDALRGMKLAKKGDDGFFQANVKVDFDRSR 268 (445)
T ss_pred HHHHHHHhcchHHhhcCCcccccccccccchhh
Confidence 566666666542 33 35777776554
No 234
>KOG3424 consensus 40S ribosomal protein S24 [Translation, ribosomal structure and biogenesis]
Probab=66.12 E-value=25 Score=25.79 Aligned_cols=47 Identities=17% Similarity=0.334 Sum_probs=31.0
Q ss_pred CCCHHHHHHHHhhcCCCeeEEEEE----ecCCCCCcccEEEEEeCCHHHHHH
Q 021958 184 DVTDYMLQETFRARYPSTKGAKVV----IDRLTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 184 ~~~~~~l~~~f~~~~G~i~~i~i~----~~~~~~~~~g~afV~f~~~~~A~~ 231 (305)
.++.+||++-+...|-.-.++.++ ..-..|++.|||.| |++.+.|.+
T Consensus 34 ~vsK~EirEKla~mYkt~~d~V~vfgfrt~~GggkstgfalI-Ydsve~akk 84 (132)
T KOG3424|consen 34 NVSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTGFALI-YDSVEYAKK 84 (132)
T ss_pred CCCHHHHHHHHHHHhcCCcceEEEEEeeeccCCcccceeeee-eehHHHHHh
Confidence 467888988887667654343333 33456788889876 677666553
No 235
>TIGR02542 B_forsyth_147 Bacteroides forsythus 147-residue repeat. The longest predicted protein in Bacteroides forsythus ATCC 43037 is over 3000 residues long and lacks homology to other known proteins. Immediately after the signal sequence are four tandem repeats, approximately 147 residues long. This model describes that repeat. This model describes that repeat.
Probab=62.32 E-value=24 Score=25.76 Aligned_cols=115 Identities=19% Similarity=0.201 Sum_probs=59.6
Q ss_pred CCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccCCCCCCCC
Q 021958 89 LQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWASFGAGEKR 168 (305)
Q Consensus 89 Lp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~~~~~~~~ 168 (305)
||+-+ +.|.++|+.=|+|.++..+..- .+ + .|+-.++|..-...+ .|++.-.......+-
T Consensus 11 lPPYT--nKLSDYfeSPGKI~svItvtqy-------------pd-n---dal~~~~G~lE~vDg-~i~IGs~q~~~sV~i 70 (145)
T TIGR02542 11 LPPYT--NKLSDYFESPGKIQSVITVTQY-------------PD-N---DALLYVHGTLEQVDG-NIRIGSGQTPASVRI 70 (145)
T ss_pred cCCcc--chhhHHhcCCCceEEEEEEecc-------------CC-c---hhhheeeeehhhccC-cEEEccCCCcccEEE
Confidence 55544 5689999999999888655432 11 1 122233333211112 245543332211111
Q ss_pred CC--CCCceEEEcCCCcCCCHHHHHHHHhh--cCCCeeEEEEEecCCCCCcccEEEEEeCCH
Q 021958 169 DD--TPDHTIFVGDLAADVTDYMLQETFRA--RYPSTKGAKVVIDRLTGRTKGYGFVRFGDE 226 (305)
Q Consensus 169 ~~--~~~~~l~v~nlp~~~~~~~l~~~f~~--~~G~i~~i~i~~~~~~~~~~g~afV~f~~~ 226 (305)
.. ..+..+ --|+.+|..+++++|++ .|-.|.+-.+.+|-.-.-+-..||.-|...
T Consensus 71 ~gTPsgnnv~---F~PYTlT~~e~r~iF~Epm~YQGITReQV~rdGLP~GsYRiCFrL~~~~ 129 (145)
T TIGR02542 71 QGTPSGNNVI---FPPYTLTYNELRQIFREPMVYQGITREQVQRDGLPEGSYRICFRLFNAT 129 (145)
T ss_pred ecCCCCCcee---cCceeeeHHHHHHHHhhhhhhccccHHHHhhcCCCCCceEEEEEEeccc
Confidence 11 111222 23678899999999973 244455444555533333455788877655
No 236
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=61.68 E-value=35 Score=23.55 Aligned_cols=58 Identities=10% Similarity=0.115 Sum_probs=42.8
Q ss_pred eEEEcCCCcccCHHHHhHhhhh-cCC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHh
Q 021958 83 TLWIGDLQYWMDETYLNTCFAH-TGE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTF 143 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~-~G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 143 (305)
.-|+-.++.+++..++++.++. ||- |.+++...-+. + .-=|||.+...+.|......+
T Consensus 22 n~y~F~V~~~anK~eIK~AvE~lf~VkV~~VnT~~~~~-~--~KKA~V~L~~g~~A~~va~ki 81 (84)
T PRK14548 22 NKLTFIVDRRATKPDIKRAVEELFDVKVEKVNTLITPK-G--EKKAYVKLAEEYDAEEIASRL 81 (84)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcCC-C--cEEEEEEeCCCCcHHHHHHhh
Confidence 4566667889999999999987 554 67777665541 2 224999999999998876554
No 237
>KOG4410 consensus 5-formyltetrahydrofolate cyclo-ligase [Coenzyme transport and metabolism]
Probab=60.82 E-value=19 Score=30.70 Aligned_cols=48 Identities=13% Similarity=0.082 Sum_probs=35.8
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcCCc-EEEEEEeeCCCCCccceEEEEEcCHH
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTGEV-VAVKVIRNKQTGQIEGYGFIEFISRA 134 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G~v-~~~~~~~~~~~~~~~g~afV~f~~~~ 134 (305)
..-||++||+.++.-.||+..+.+.+.+ .++.+. -..|-||+.|.+..
T Consensus 330 ~~di~~~nl~rd~rv~dlk~~lr~~~~~pm~iswk------g~~~k~flh~~~~~ 378 (396)
T KOG4410|consen 330 KTDIKLTNLSRDIRVKDLKSELRKRECTPMSISWK------GHFGKCFLHFGNRK 378 (396)
T ss_pred ccceeeccCccccchHHHHHHHHhcCCCceeEeee------cCCcceeEecCCcc
Confidence 4459999999999999999999987763 333332 23457999997653
No 238
>COG5178 PRP8 U5 snRNP spliceosome subunit [RNA processing and modification]
Probab=58.83 E-value=10 Score=38.79 Aligned_cols=8 Identities=25% Similarity=0.733 Sum_probs=3.6
Q ss_pred ceEEEcCC
Q 021958 82 RTLWIGDL 89 (305)
Q Consensus 82 ~~l~v~nL 89 (305)
++-|+|-|
T Consensus 108 krs~lgal 115 (2365)
T COG5178 108 KRSYLGAL 115 (2365)
T ss_pred hHhhhhhh
Confidence 34455544
No 239
>PTZ00191 60S ribosomal protein L23a; Provisional
Probab=57.87 E-value=40 Score=25.90 Aligned_cols=59 Identities=10% Similarity=0.208 Sum_probs=40.9
Q ss_pred eEEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 175 TIFVGDLAADVTDYMLQETFRARYPS-TKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 175 ~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
.-++-.++...+..+|++.++..|+. |..|..+.-. .| ..-|||++....+|......+
T Consensus 83 N~yvF~Vd~kAnK~qIK~AVEklf~VkV~kVNTli~p-~g--~KKA~V~L~~~~~aidva~ki 142 (145)
T PTZ00191 83 NTLVFIVDQRANKTQIKKAVEKLYDVKVVKVNTLITP-DG--LKKAYIRLSPDVDALDVANKI 142 (145)
T ss_pred CEEEEEEcCCCCHHHHHHHHHHHhCCeeEEEEeEEcC-CC--ceEEEEEECCCCcHHHHHHhh
Confidence 34555678889999999999866775 5566655543 22 335999998877776655543
No 240
>PHA03378 EBNA-3B; Provisional
Probab=57.48 E-value=65 Score=31.31 Aligned_cols=23 Identities=9% Similarity=-0.113 Sum_probs=11.9
Q ss_pred CceEEEcCCC--cccCHHHHhHhhh
Q 021958 81 IRTLWIGDLQ--YWMDETYLNTCFA 103 (305)
Q Consensus 81 ~~~l~v~nLp--~~~t~~~l~~~f~ 103 (305)
...|....++ ..-+..-|++++.
T Consensus 806 ~~q~~~~~~~gq~gp~k~ilrqllt 830 (991)
T PHA03378 806 SMQLMPRAAPGQQGPTKQILRQLLT 830 (991)
T ss_pred cceecCCCCccccCcHHHHHHHHhh
Confidence 3445444444 3345666666654
No 241
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=56.75 E-value=51 Score=22.30 Aligned_cols=58 Identities=10% Similarity=0.101 Sum_probs=42.1
Q ss_pred eEEEcCCCcccCHHHHhHhhhh-cCC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHh
Q 021958 83 TLWIGDLQYWMDETYLNTCFAH-TGE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTF 143 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~-~G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l 143 (305)
.-|+-.++.+++..++++.++. ||- |.+++...-+. + .-=|||.+...+.|...-..+
T Consensus 15 n~y~F~V~~~anK~eIK~avE~lf~VkV~~Vnt~~~~~-~--~KKA~VtL~~g~~a~~va~k~ 74 (77)
T TIGR03636 15 NKLTFIVDRKATKGDIKRAVEKLFDVKVEKVNTLITPR-G--EKKAYVKLAEEYAAEEIASRL 74 (77)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcCC-C--ceEEEEEECCCCcHHHHHHhh
Confidence 4667778899999999999987 553 66666655432 2 124999999988888776553
No 242
>PF03468 XS: XS domain; InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important. The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=56.12 E-value=23 Score=26.08 Aligned_cols=40 Identities=23% Similarity=0.258 Sum_probs=24.7
Q ss_pred cCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHH
Q 021958 93 MDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAG 135 (305)
Q Consensus 93 ~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~ 135 (305)
++.++|.+.|+.|..+ .++...++ ....|++.|+|.+.-.
T Consensus 29 ~~~~~l~~~l~~f~p~-kv~~l~~~--~gh~g~aiv~F~~~w~ 68 (116)
T PF03468_consen 29 MSNEELLDKLAEFNPL-KVKPLYGK--QGHTGFAIVEFNKDWS 68 (116)
T ss_dssp --SHHHHHHHHH---S-EEEEEEET--TEEEEEEEEE--SSHH
T ss_pred cCHHHHHHHHHhcCCc-eeEECcCC--CCCcEEEEEEECCChH
Confidence 3568899999999887 46666665 3567899999976443
No 243
>PF03468 XS: XS domain; InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important. The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=53.53 E-value=19 Score=26.59 Aligned_cols=50 Identities=12% Similarity=0.309 Sum_probs=27.3
Q ss_pred eEEEcCCCcC---------CCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHH
Q 021958 175 TIFVGDLAAD---------VTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESE 228 (305)
Q Consensus 175 ~l~v~nlp~~---------~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~ 228 (305)
++.|.|++.+ .+.++|.+.|. .|..++ ++.+.+. .-..|+++|+|.+.-+
T Consensus 10 mgIi~N~~~~~~~~g~~~g~~~~~l~~~l~-~f~p~k-v~~l~~~--~gh~g~aiv~F~~~w~ 68 (116)
T PF03468_consen 10 MGIIVNIPTEKDDDGRWVGMSNEELLDKLA-EFNPLK-VKPLYGK--QGHTGFAIVEFNKDWS 68 (116)
T ss_dssp EEEEE----EE-TTS-EE---SHHHHHHHH-H---SE-EEEEEET--TEEEEEEEEE--SSHH
T ss_pred EEEEEcCccccCCCCceeccCHHHHHHHHH-hcCCce-eEECcCC--CCCcEEEEEEECCChH
Confidence 4555666443 34578999998 798876 6666664 2458899999977543
No 244
>KOG2295 consensus C2H2 Zn-finger protein [General function prediction only]
Probab=52.37 E-value=2.8 Score=38.98 Aligned_cols=70 Identities=7% Similarity=0.064 Sum_probs=53.1
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCC
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMP 149 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~ 149 (305)
..++|++.|+.++++-.+|..+++.+-.+..+.+..+........+.+|+|.-.-....|+.+||+..+.
T Consensus 230 ke~sll~rni~Pnis~aeIe~~ck~i~~~lrfals~~~aek~~~r~lwv~fk~~~ni~~a~~aLn~irl~ 299 (648)
T KOG2295|consen 230 KECSLLVRNILPNISVAEIENLCKGIPGFLRFALSTINAEKNFERRLWVTFKRGTNIKEACWALNGIRLR 299 (648)
T ss_pred HHHHHHHhccCCcccHHHHHHHhccCchheeeeccCchHHHHHHHHhhHhhccccchHHHHHHhhhcccc
Confidence 4578999999999999999999998866666555443333455667899998777777777777887664
No 245
>KOG2295 consensus C2H2 Zn-finger protein [General function prediction only]
Probab=48.96 E-value=2.7 Score=39.08 Aligned_cols=72 Identities=10% Similarity=0.121 Sum_probs=53.7
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecce
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTR 244 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~ 244 (305)
..+.||+.|+...++..+|..+++ .+-.+..+.+..+........++.|+|+---.-..|+.+||+..+...
T Consensus 230 ke~sll~rni~Pnis~aeIe~~ck-~i~~~lrfals~~~aek~~~r~lwv~fk~~~ni~~a~~aLn~irl~s~ 301 (648)
T KOG2295|consen 230 KECSLLVRNILPNISVAEIENLCK-GIPGFLRFALSTINAEKNFERRLWVTFKRGTNIKEACWALNGIRLRSN 301 (648)
T ss_pred HHHHHHHhccCCcccHHHHHHHhc-cCchheeeeccCchHHHHHHHHhhHhhccccchHHHHHHhhhcccccc
Confidence 457799999999999999999998 887777666654433344466788999877777777777777665443
No 246
>PRK11901 hypothetical protein; Reviewed
Probab=48.82 E-value=38 Score=29.74 Aligned_cols=61 Identities=15% Similarity=0.194 Sum_probs=40.3
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEE--EeCCHHHHHHHHHHhCC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFV--RFGDESEQLRAMTEMNG 238 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV--~f~~~~~A~~a~~~l~~ 238 (305)
...+|-|..+ .+++.|..+.. +.+ +..+++..-..+|+.. |.+| .|.+.++|..|+..|-.
T Consensus 244 ~~YTLQL~Aa---s~~~~L~~f~~-~~~-L~~~~VYqT~RnGkpW-YVVvyG~Y~Sr~eAk~Ai~sLPa 306 (327)
T PRK11901 244 SHYTLQLSSA---SRSDTLNAYAK-KQN-LSHYHVYETKRDGKPW-YVLVSGNYASSAEAKRAIATLPA 306 (327)
T ss_pred CCeEEEeecC---CCHHHHHHHHH-HcC-cCceEEEEEEECCceE-EEEEecCcCCHHHHHHHHHhCCH
Confidence 3456666554 45777888876 665 4556665554455533 4443 79999999999998753
No 247
>KOG1830 consensus Wiskott Aldrich syndrome proteins [Cytoskeleton]
Probab=48.37 E-value=1.2e+02 Score=27.80 Aligned_cols=11 Identities=9% Similarity=0.132 Sum_probs=4.6
Q ss_pred ecceeeEEccC
Q 021958 241 CSTRPMRIGPA 251 (305)
Q Consensus 241 ~~g~~l~v~~~ 251 (305)
|-.|+|.|+|+
T Consensus 492 iLsRRiaveys 502 (518)
T KOG1830|consen 492 ILSRRIAVEYS 502 (518)
T ss_pred HHHHHHHHHhc
Confidence 33344444443
No 248
>PRK01178 rps24e 30S ribosomal protein S24e; Reviewed
Probab=47.94 E-value=58 Score=23.27 Aligned_cols=47 Identities=21% Similarity=0.280 Sum_probs=27.7
Q ss_pred CCCHHHHHHHHhhcCCCeeEEEEEecC----CCCCcccEEEEEeCCHHHHHH
Q 021958 184 DVTDYMLQETFRARYPSTKGAKVVIDR----LTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 184 ~~~~~~l~~~f~~~~G~i~~i~i~~~~----~~~~~~g~afV~f~~~~~A~~ 231 (305)
..+..+|++.+...|+.=.+..++..- ..|+++|||.| |+|.+.|.+
T Consensus 30 tpsr~eirekLa~~~~~~~~~vvv~~~~t~fG~g~s~G~a~I-Yds~e~~kk 80 (99)
T PRK01178 30 TPSRKDVRKKLAAMLNADKELVVVRKIKTEYGMGKSKGYAKV-YDDKERARK 80 (99)
T ss_pred CCCHHHHHHHHHHHHCcCCCEEEEEccCccCCCceEEEEEEE-ECCHHHHHh
Confidence 567888888887678753344444332 23455666644 566666554
No 249
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=47.34 E-value=27 Score=23.13 Aligned_cols=30 Identities=13% Similarity=0.204 Sum_probs=23.5
Q ss_pred cEEEEEeCCHHHHHHHHHHhCCceecceee
Q 021958 217 GYGFVRFGDESEQLRAMTEMNGVFCSTRPM 246 (305)
Q Consensus 217 g~afV~f~~~~~A~~a~~~l~~~~~~g~~l 246 (305)
.+.+|.|.+..+|.+|-+.|....+..+-+
T Consensus 2 ~~~~i~F~st~~a~~~ek~lk~~gi~~~li 31 (73)
T PF11823_consen 2 KYYLITFPSTHDAMKAEKLLKKNGIPVRLI 31 (73)
T ss_pred ceEEEEECCHHHHHHHHHHHHHCCCcEEEe
Confidence 368999999999999999887665544443
No 250
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=44.17 E-value=13 Score=25.93 Aligned_cols=25 Identities=20% Similarity=0.190 Sum_probs=21.1
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhh
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCF 102 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f 102 (305)
.-..++|.|.|||...++++|++.+
T Consensus 49 ~vs~rtVlvsgip~~l~ee~l~D~L 73 (88)
T PF07292_consen 49 GVSKRTVLVSGIPDVLDEEELRDKL 73 (88)
T ss_pred cccCCEEEEeCCCCCCChhhheeeE
Confidence 3456899999999999999998764
No 251
>smart00596 PRE_C2HC PRE_C2HC domain.
Probab=44.16 E-value=47 Score=21.92 Aligned_cols=64 Identities=9% Similarity=0.027 Sum_probs=42.1
Q ss_pred HHHHHHHhhcCC-CeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCC
Q 021958 188 YMLQETFRARYP-STKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKK 255 (305)
Q Consensus 188 ~~l~~~f~~~~G-~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~ 255 (305)
++|.+.|+ ..| .+..++-+....+...-.+-+|+.....+-.. .|+=+.++|+++.|+....+.
T Consensus 2 ~~I~~~L~-~~G~~v~~i~~m~~~~~r~P~nmf~vel~~~~~~~~---Il~ik~Lg~~~V~VEr~~k~~ 66 (69)
T smart00596 2 SQIEEALK-DIGFPVLFIHNMLNRDTKNPQNMFEVELVPAANGKE---ILNIKTLGGQRVTVERPHKRK 66 (69)
T ss_pred HHHHHHHH-HcCCceeEEEcccccCCCCcceeEEEEeeecCCCcc---eEeehhhCCeeEEEecCcccC
Confidence 35677777 555 47777777776656666777888766543333 334456899999998765443
No 252
>PF02714 DUF221: Domain of unknown function DUF221; InterPro: IPR003864 This domain is found in a family of hypothetical transmembrane proteins none of which have any known function, the aligned region is at 538 residues at maximum length.; GO: 0016020 membrane
Probab=44.06 E-value=54 Score=28.79 Aligned_cols=56 Identities=20% Similarity=0.255 Sum_probs=36.8
Q ss_pred EEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHHHh
Q 021958 126 GFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQETFR 195 (305)
Q Consensus 126 afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~f~ 195 (305)
|||+|++..+|..|++.+.... ...+++..+.. ...|.=.||.....+..++..+.
T Consensus 1 aFVtF~~~~~a~~~~q~~~~~~----~~~~~v~~APe----------P~DI~W~NL~~~~~~r~~R~~~~ 56 (325)
T PF02714_consen 1 AFVTFNSQKSAQIALQLLLSKR----PNSWRVSPAPE----------PDDIIWENLSISSKQRFLRRIIV 56 (325)
T ss_pred CEEEECCHHHHHHHHHHHhcCC----CCCceEeeCCC----------cccccccccCCChHHHHHHHHHH
Confidence 7999999999999998654433 23345554432 23355578877777776666654
No 253
>PF15513 DUF4651: Domain of unknown function (DUF4651)
Probab=43.91 E-value=62 Score=20.86 Aligned_cols=19 Identities=11% Similarity=0.004 Sum_probs=15.4
Q ss_pred HHHHHHHhhcCCCeeEEEEE
Q 021958 188 YMLQETFRARYPSTKGAKVV 207 (305)
Q Consensus 188 ~~l~~~f~~~~G~i~~i~i~ 207 (305)
.+|+++|+ ..|.|.-+.+.
T Consensus 9 ~~iR~~fs-~lG~I~vLYvn 27 (62)
T PF15513_consen 9 AEIRQFFS-QLGEIAVLYVN 27 (62)
T ss_pred HHHHHHHH-hcCcEEEEEEc
Confidence 68999999 99999855553
No 254
>PF02714 DUF221: Domain of unknown function DUF221; InterPro: IPR003864 This domain is found in a family of hypothetical transmembrane proteins none of which have any known function, the aligned region is at 538 residues at maximum length.; GO: 0016020 membrane
Probab=42.45 E-value=29 Score=30.54 Aligned_cols=37 Identities=19% Similarity=0.327 Sum_probs=26.3
Q ss_pred EEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCC
Q 021958 219 GFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 219 afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
|||+|++..+|..|++.+.... ++.+++..+-+.++-
T Consensus 1 aFVtF~~~~~a~~~~q~~~~~~--~~~~~v~~APeP~DI 37 (325)
T PF02714_consen 1 AFVTFNSQKSAQIALQLLLSKR--PNSWRVSPAPEPDDI 37 (325)
T ss_pred CEEEECCHHHHHHHHHHHhcCC--CCCceEeeCCCcccc
Confidence 7999999999999999765543 345566665544433
No 255
>PF12091 DUF3567: Protein of unknown function (DUF3567); InterPro: IPR021951 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are about 90 amino acids in length. This protein has a conserved EIVDK sequence motif.
Probab=41.57 E-value=24 Score=24.29 Aligned_cols=56 Identities=23% Similarity=0.397 Sum_probs=32.9
Q ss_pred cceEEEEEcCHHHHHHHHHHhCCCCCCCCCcceeeeccCCCCCCCCCCCCCceEEEcCC-------------CcCCCHHH
Q 021958 123 EGYGFIEFISRAGAERVLQTFNGTPMPNGEQNFRLNWASFGAGEKRDDTPDHTIFVGDL-------------AADVTDYM 189 (305)
Q Consensus 123 ~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~i~v~~~~~~~~~~~~~~~~~~l~v~nl-------------p~~~~~~~ 189 (305)
..||.|+|.-.++. .++. ..|.+|. +....+.|||.+. ..+-+.|+
T Consensus 8 d~y~VV~~~~~~~~-~~l~-~gGyEIV-------------------DK~~~rEifi~G~~Ae~Fr~~V~~li~~~Pt~Ee 66 (85)
T PF12091_consen 8 DNYCVVEFPPDAGH-PALA-RGGYEIV-------------------DKNARREIFIDGSWAEMFREDVQALIASEPTQEE 66 (85)
T ss_pred CceEEEEecCCCCc-cchh-cCCcEEe-------------------ecCCCceEEeCcHHHHHHHHHHHHHHhcCCCHHH
Confidence 34999999533332 3332 2555552 2233577888874 44567778
Q ss_pred HHHHHhhcCCC
Q 021958 190 LQETFRARYPS 200 (305)
Q Consensus 190 l~~~f~~~~G~ 200 (305)
+.+++. .|..
T Consensus 67 vDdfL~-~y~~ 76 (85)
T PF12091_consen 67 VDDFLG-GYDA 76 (85)
T ss_pred HHHHHH-HHHH
Confidence 888876 6654
No 256
>KOG4213 consensus RNA-binding protein La [RNA processing and modification]
Probab=41.28 E-value=39 Score=26.86 Aligned_cols=58 Identities=19% Similarity=0.190 Sum_probs=38.1
Q ss_pred CCceEEEcCCCcccCHHHHhHhhh-hcCCcEEEEEEeeCCC-CCccceEEEEEcCHHHHHHHHHH
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFA-HTGEVVAVKVIRNKQT-GQIEGYGFIEFISRAGAERVLQT 142 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~-~~G~v~~~~~~~~~~~-~~~~g~afV~f~~~~~a~~a~~~ 142 (305)
..+++|.. .|+++|.++.. .-|.+..+.+.+.... ...+|--||+|.+.+.|.+.++.
T Consensus 110 ~~r~v~~K-----~td~ql~~l~qw~~~k~~nv~mr~~~~k~~~fkGsvkv~f~tk~qa~a~~~~ 169 (205)
T KOG4213|consen 110 KERTVYKK-----ITDDQLDDLNQWASGKGHNVKMRRHGNKAHPFKGSVKVTFQTKEQAFANDDT 169 (205)
T ss_pred HHhhhhcc-----CCHHHHHHHHHHhcccceEeeccccCCCCCCCCCceEEEeecHHHHHhhhhh
Confidence 34566665 45555555432 1167777777665321 26678899999999999987754
No 257
>COG5193 LHP1 La protein, small RNA-binding pol III transcript stabilizing protein and related La-motif-containing proteins involved in translation [Posttranslational modification, protein turnover, chaperones / Translation, ribosomal structure and biogenesis]
Probab=40.89 E-value=13 Score=33.36 Aligned_cols=63 Identities=16% Similarity=0.113 Sum_probs=50.7
Q ss_pred CCCceEEEcCCCcccCHH--------HHhHhhhh--cCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHH
Q 021958 79 GEIRTLWIGDLQYWMDET--------YLNTCFAH--TGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQ 141 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~--------~l~~~f~~--~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~ 141 (305)
...+.+|+.+.....+.+ ++...|.. .+.+..++..++..+..++|..|++|...+.+.++..
T Consensus 172 ~~qr~~y~n~fG~e~~~~a~~~e~~~d~~~~~p~h~h~~~~~i~~rrd~~nkn~~gSv~~efk~~~~~q~~nn 244 (438)
T COG5193 172 QMQRDVYQNGFGKEDVNNASRPEQQEDLEIQFPPHYHAPPSQIRNRRDWLNKNFRGSVFVEFKYFREAQRFNN 244 (438)
T ss_pred hHhhhHHhhcCCcccccccccchhhhhHHhhCCCcccCChhhccchhhhhhccccCcccccccChHHHHHHhc
Confidence 345678888877655555 88889988 6778888888887677888999999999999998874
No 258
>KOG1985 consensus Vesicle coat complex COPII, subunit SEC24/subunit SFB2 [Intracellular trafficking, secretion, and vesicular transport]
Probab=39.97 E-value=4.1e+02 Score=26.81 Aligned_cols=19 Identities=16% Similarity=0.273 Sum_probs=12.2
Q ss_pred CCCCceEEEcCCCcCCCHH
Q 021958 170 DTPDHTIFVGDLAADVTDY 188 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~~ 188 (305)
..+..-+||-.+.....+.
T Consensus 292 P~Pavy~FliDVS~~a~ks 310 (887)
T KOG1985|consen 292 PQPAVYVFLIDVSISAIKS 310 (887)
T ss_pred CCCceEEEEEEeehHhhhh
Confidence 3456667777777665555
No 259
>PTZ00071 40S ribosomal protein S24; Provisional
Probab=39.19 E-value=1e+02 Score=23.28 Aligned_cols=47 Identities=17% Similarity=0.386 Sum_probs=27.2
Q ss_pred CCCHHHHHHHHhhcCC-CeeEEEEEe----cCCCCCcccEEEEEeCCHHHHHH
Q 021958 184 DVTDYMLQETFRARYP-STKGAKVVI----DRLTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 184 ~~~~~~l~~~f~~~~G-~i~~i~i~~----~~~~~~~~g~afV~f~~~~~A~~ 231 (305)
..+..||++.+...|+ .=.+..++. .-..|.+.|||.| |+|.+.|..
T Consensus 35 TpSr~eirekLA~~~~v~d~~~Vvv~~~~T~fG~g~StG~a~I-Yds~e~~kk 86 (132)
T PTZ00071 35 TVSKKDIKEKLAKQYKVADARTIVLFGFKTKFGGGKTTGFGLI-YDNLAALKK 86 (132)
T ss_pred CCCHHHHHHHHHHHhCCCCCCEEEEEccEecCCCceEEEEEEE-ECCHHHHHh
Confidence 5788899999887888 222222222 2234566666654 566555443
No 260
>KOG2893 consensus Zn finger protein [General function prediction only]
Probab=38.92 E-value=2.3e+02 Score=23.69 Aligned_cols=14 Identities=7% Similarity=-0.014 Sum_probs=6.7
Q ss_pred CCcCCCHHHHHHHH
Q 021958 181 LAADVTDYMLQETF 194 (305)
Q Consensus 181 lp~~~~~~~l~~~f 194 (305)
-+.+++.|+-+..+
T Consensus 318 pdedisleerraql 331 (341)
T KOG2893|consen 318 PDEDISLEERRAQL 331 (341)
T ss_pred CcccccHHHHhhhh
Confidence 34455555544443
No 261
>KOG4365 consensus Uncharacterized conserved protein [Function unknown]
Probab=37.74 E-value=6 Score=35.77 Aligned_cols=78 Identities=5% Similarity=-0.202 Sum_probs=59.2
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCC
Q 021958 174 HTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATN 253 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~ 253 (305)
.+.++..++...+++++.-.|. -||.|.-+.....-..+..+-.+||.-.+ .+|..|++.+.-..+.|..++|..+..
T Consensus 4 ~~~~l~d~~~~~~~~~~~~~~~-d~~~i~~~d~~~~~~~~~~~v~~f~~~~~-~~~~~~i~~~k~q~~~~~~~r~~~~~~ 81 (572)
T KOG4365|consen 4 MKKSLKDSVASNNKDQNSMKHE-DPSIISMEDGSPYVNGSLGEVTPFQHAKK-ANGPNYIQPQKRQTTFESQDRKAVSPS 81 (572)
T ss_pred hhhhHhhcccccccchhhhhcc-CCcceeeccCCccccCCcceeeeeeeeec-cCcccccCHHHHhhhhhhhhhhhcCch
Confidence 4567888999999999999999 99999988877665566667778877654 456677776666667777777776654
No 262
>PRK10905 cell division protein DamX; Validated
Probab=37.45 E-value=84 Score=27.59 Aligned_cols=60 Identities=15% Similarity=0.105 Sum_probs=37.6
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEE--EEeCCHHHHHHHHHHhC
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGF--VRFGDESEQLRAMTEMN 237 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~af--V~f~~~~~A~~a~~~l~ 237 (305)
...+|-|+.+. +++.|+++.. +.| +....+.....+|+.. |-+ =.|.+.++|++|+..|-
T Consensus 246 ~~YTLQL~A~S---s~~~l~~fak-Klg-L~~y~vy~TtRnGkpW-YVV~yG~YaSraeAk~AiakLP 307 (328)
T PRK10905 246 SHYTLQLSSSS---NYDNLNGWAK-KEN-LKNYVVYETTRNGQPW-YVLVSGVYASKEEAKRAVSTLP 307 (328)
T ss_pred CceEEEEEecC---CHHHHHHHHH-HcC-CCceEEEEeccCCceE-EEEEecCCCCHHHHHHHHHHCC
Confidence 35666666554 4577777776 664 3444444444455422 332 37899999999999875
No 263
>COG5638 Uncharacterized conserved protein [Function unknown]
Probab=35.82 E-value=1.9e+02 Score=26.28 Aligned_cols=142 Identities=15% Similarity=0.174 Sum_probs=73.1
Q ss_pred CCCCCceEEEcCCCc-ccCHHHHhHhhhhc----CCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCC
Q 021958 77 QPGEIRTLWIGDLQY-WMDETYLNTCFAHT----GEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNG 151 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~-~~t~~~l~~~f~~~----G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~ 151 (305)
+...+..|-|-|+.| .+...+|.-.|+.| |.+..+.|... +|..+.-| .+...|-.-+..
T Consensus 142 ~G~~tkrLAvVnmDWd~v~a~DLf~~fsSf~P~ggkl~kV~iyps------------efGkeRm~---~e~vqGpprdif 206 (622)
T COG5638 142 EGNPTKRLAVVNMDWDRVDAKDLFKIFSSFLPYGGKLSKVKIYPS------------EFGKERMA---AEHVQGPPRDIF 206 (622)
T ss_pred CCCcccceeEeecccccchHHHHHHHHHhhCCCCCccceeEechh------------hhhHHHHh---HhhccCCchhhc
Confidence 456678899999985 56888888888766 45666766554 23333222 223333211000
Q ss_pred C----cceeeeccCCCCCCCCCCCCCceEEE-cCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCH
Q 021958 152 E----QNFRLNWASFGAGEKRDDTPDHTIFV-GDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDE 226 (305)
Q Consensus 152 ~----~~i~v~~~~~~~~~~~~~~~~~~l~v-~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~ 226 (305)
. ..-.-.+..... ..+...+.-++| +.+..+++-..|+++- +.+++ ..||.|++.+.
T Consensus 207 ~~~d~~~ssqk~~~dn~--~sd~d~g~d~~~Egd~g~e~d~~~lrqyq------lerlr----------yYyAvvec~d~ 268 (622)
T COG5638 207 TPADNQPSSQKFGDDNV--FSDRDAGEDALIEGDRGNEFDMVKLRQYQ------LERLR----------YYYAVVECEDI 268 (622)
T ss_pred cccccCcchhccCCccc--hhhhhcchhhhhhcccccchhHHHHHHHH------hhhhe----------eEEEEEEeccc
Confidence 0 000000000000 000111122333 2334444444555442 22233 34899999999
Q ss_pred HHHHHHHHHhCCceecc--eeeEEccC
Q 021958 227 SEQLRAMTEMNGVFCST--RPMRIGPA 251 (305)
Q Consensus 227 ~~A~~a~~~l~~~~~~g--~~l~v~~~ 251 (305)
+.+......++|..+.. ..+-+.|.
T Consensus 269 ~tsK~iY~~CDG~Eye~san~~DLRfv 295 (622)
T COG5638 269 ETSKNIYSACDGVEYENSANVLDLRFV 295 (622)
T ss_pred hhhHHHHhccCccccccccceeeeeec
Confidence 99999999999987643 44444443
No 264
>COG5193 LHP1 La protein, small RNA-binding pol III transcript stabilizing protein and related La-motif-containing proteins involved in translation [Posttranslational modification, protein turnover, chaperones / Translation, ribosomal structure and biogenesis]
Probab=35.54 E-value=26 Score=31.56 Aligned_cols=63 Identities=16% Similarity=0.144 Sum_probs=50.0
Q ss_pred CCCceEEEcCCCcCCCH--------HHHHHHHhhc--CCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHH
Q 021958 171 TPDHTIFVGDLAADVTD--------YMLQETFRAR--YPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMT 234 (305)
Q Consensus 171 ~~~~~l~v~nlp~~~~~--------~~l~~~f~~~--~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~ 234 (305)
..-+.+|+.++..+-.. +++...|. . .+.+..++..++..+..++|.-|++|+....|.+.+.
T Consensus 172 ~~qr~~y~n~fG~e~~~~a~~~e~~~d~~~~~p-~h~h~~~~~i~~rrd~~nkn~~gSv~~efk~~~~~q~~nn 244 (438)
T COG5193 172 QMQRDVYQNGFGKEDVNNASRPEQQEDLEIQFP-PHYHAPPSQIRNRRDWLNKNFRGSVFVEFKYFREAQRFNN 244 (438)
T ss_pred hHhhhHHhhcCCcccccccccchhhhhHHhhCC-CcccCChhhccchhhhhhccccCcccccccChHHHHHHhc
Confidence 33456777777776544 48999998 7 6778888888887778889999999999999998864
No 265
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=35.02 E-value=41 Score=25.65 Aligned_cols=68 Identities=9% Similarity=0.043 Sum_probs=44.4
Q ss_pred CCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCCCCCCC
Q 021958 184 DVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPATNKKTV 257 (305)
Q Consensus 184 ~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~~~~~~ 257 (305)
..+-..|.+.+....+....+.+..-. .++..++|.+.+++.++++. ....++|..+.++..++....
T Consensus 28 ~~~~~~l~~~l~~~W~~~~~~~i~~l~-----~~~fl~~F~~~~d~~~vl~~-~p~~~~~~~~~l~~W~~~~~~ 95 (153)
T PF14111_consen 28 PISLSALEQELAKIWKLKGGVKIRDLG-----DNLFLFQFESEEDRQRVLKG-GPWNFNGHFLILQRWSPDFNP 95 (153)
T ss_pred CCCHHHHHHHHHHHhCCCCcEEEEEeC-----CCeEEEEEEeccceeEEEec-ccccccccchhhhhhcccccc
Confidence 355666666665333333334443221 56889999999999999873 445678888888877754443
No 266
>KOG1925 consensus Rac1 GTPase effector FHOS [Signal transduction mechanisms; Cytoskeleton]
Probab=34.52 E-value=40 Score=31.42 Aligned_cols=24 Identities=25% Similarity=0.292 Sum_probs=15.1
Q ss_pred CceEEEcCCCcccCHHHHhHhhhh
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAH 104 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~ 104 (305)
..++|-.-=+-.++...|..+|+.
T Consensus 306 ~~t~W~s~D~~~~D~~r~~~LFEs 329 (817)
T KOG1925|consen 306 CATLWASLDPVSVDTARLEHLFES 329 (817)
T ss_pred cchhhhccCcceecHHHHHHHHHH
Confidence 344544433455788889999964
No 267
>PF08734 GYD: GYD domain; InterPro: IPR014845 These proteins of unknown function are usually less than 100 amino acids in length. They may belong to the dimeric alpha/beta barrel superfamily.
Probab=34.00 E-value=1.6e+02 Score=20.45 Aligned_cols=47 Identities=13% Similarity=-0.016 Sum_probs=35.1
Q ss_pred HHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhC
Q 021958 187 DYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMN 237 (305)
Q Consensus 187 ~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~ 237 (305)
.+.+++++++..|+++++.+.. |.-.....+++.|.+.|.++.-.+.
T Consensus 22 ~~a~~~~~e~~Gg~l~~~y~t~----G~yD~v~i~eaPD~~~a~~~~l~i~ 68 (91)
T PF08734_consen 22 AEAVRALIEALGGKLKSFYWTL----GEYDFVVIVEAPDDETAAAASLAIR 68 (91)
T ss_pred HHHHHHHHHHcCCEEEEEEEec----CCCCEEEEEEcCCHHHHHHHHHHHH
Confidence 4667888885667788888764 4446678899999999888765554
No 268
>PF14893 PNMA: PNMA
Probab=33.90 E-value=25 Score=31.13 Aligned_cols=77 Identities=10% Similarity=0.152 Sum_probs=44.3
Q ss_pred CCCceEEEcCCCcccCHHHHhHhhhh-cCCcEEEEEEee---CCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcc
Q 021958 79 GEIRTLWIGDLQYWMDETYLNTCFAH-TGEVVAVKVIRN---KQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQN 154 (305)
Q Consensus 79 ~~~~~l~v~nLp~~~t~~~l~~~f~~-~G~v~~~~~~~~---~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~ 154 (305)
+..+.|.|.+||.++++++|++.+.. .-.+-..++... +..+ ..-++|+|...-+-.. -=..|.|+|-.
T Consensus 16 ~~~r~lLv~giP~dc~~~ei~e~l~~~l~plg~yrvl~~~f~~~~~--~~aalve~~e~~n~~~-----iP~~i~g~gg~ 88 (331)
T PF14893_consen 16 DPQRALLVLGIPEDCEEAEIEEALQAALSPLGRYRVLGKMFRREEN--AKAALVEFAEDVNYSL-----IPREIPGKGGP 88 (331)
T ss_pred ChhhhheeecCCCCCCHHHHHHHHHHhhcccccceehhhHhhhhcc--cceeeeecccccchhh-----CchhcCCCCCc
Confidence 55688999999999999999998754 222222222211 1111 2357888865333211 22345566666
Q ss_pred eeeeccCC
Q 021958 155 FRLNWASF 162 (305)
Q Consensus 155 i~v~~~~~ 162 (305)
++|-+...
T Consensus 89 W~Vv~~p~ 96 (331)
T PF14893_consen 89 WRVVFKPP 96 (331)
T ss_pred eEEEecCC
Confidence 77655443
No 269
>KOG0156 consensus Cytochrome P450 CYP2 subfamily [Secondary metabolites biosynthesis, transport and catabolism]
Probab=33.49 E-value=1.4e+02 Score=28.24 Aligned_cols=66 Identities=11% Similarity=0.094 Sum_probs=46.7
Q ss_pred CCCCceEEEcCCCcCCCH---HHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceee
Q 021958 170 DTPDHTIFVGDLAADVTD---YMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPM 246 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~---~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l 246 (305)
..+..-=.||||..-... .-+.++-+ +||.|..+++-. .-.|.-.+.+.|+.|+.. ++..+.+|..
T Consensus 29 PGP~~lPiIGnl~~l~~~~~h~~~~~ls~-~yGpi~tl~lG~---------~~~Vviss~~~akE~l~~-~d~~fa~Rp~ 97 (489)
T KOG0156|consen 29 PGPPPLPIIGNLHQLGSLPPHRSFRKLSK-KYGPVFTLRLGS---------VPVVVISSYEAAKEVLVK-QDLEFADRPD 97 (489)
T ss_pred cCCCCCCccccHHHcCCCchhHHHHHHHH-HhCCeEEEEecC---------ceEEEECCHHHHHHHHHh-CCccccCCCC
Confidence 344555668887664333 34444444 999999887732 236888999999999986 8888988886
No 270
>PRK11901 hypothetical protein; Reviewed
Probab=32.85 E-value=3.5e+02 Score=23.94 Aligned_cols=63 Identities=22% Similarity=0.259 Sum_probs=40.5
Q ss_pred CCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceEE--EEEcCHHHHHHHHHHhCC
Q 021958 78 PGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYGF--IEFISRAGAERVLQTFNG 145 (305)
Q Consensus 78 ~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~af--V~f~~~~~a~~a~~~l~g 145 (305)
+....+|-|..+ .+++.|..|..+++ +..+++.+...+|+. .|.. =.|.+.++|+.|+..|-.
T Consensus 242 p~~~YTLQL~Aa---s~~~~L~~f~~~~~-L~~~~VYqT~RnGkp-WYVVvyG~Y~Sr~eAk~Ai~sLPa 306 (327)
T PRK11901 242 PASHYTLQLSSA---SRSDTLNAYAKKQN-LSHYHVYETKRDGKP-WYVLVSGNYASSAEAKRAIATLPA 306 (327)
T ss_pred CCCCeEEEeecC---CCHHHHHHHHHHcC-cCceEEEEEEECCce-EEEEEecCcCCHHHHHHHHHhCCH
Confidence 344556655554 55788888888776 345556554433433 2433 369999999999988643
No 271
>KOG4008 consensus rRNA processing protein RRP7 [RNA processing and modification]
Probab=32.83 E-value=41 Score=28.00 Aligned_cols=35 Identities=17% Similarity=0.267 Sum_probs=29.9
Q ss_pred CCCCCceEEEcCCCcccCHHHHhHhhhhcCCcEEE
Q 021958 77 QPGEIRTLWIGDLQYWMDETYLNTCFAHTGEVVAV 111 (305)
Q Consensus 77 ~~~~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~ 111 (305)
...+..+||+-|+|...|++.|.++.+.+|-+...
T Consensus 36 ~~~eKd~lfl~Nvp~~~tee~lkr~vsqlg~vq~~ 70 (261)
T KOG4008|consen 36 NSNEKDCLFLVNVPLLSTEEHLKRFVSQLGHVQEL 70 (261)
T ss_pred ccccccceeeecccccccHHHHHHHHHHhhhhhhe
Confidence 45677899999999999999999999999865443
No 272
>PF00403 HMA: Heavy-metal-associated domain; InterPro: IPR006121 Proteins that transport heavy metals in micro-organisms and mammals share similarities in their sequences and structures. These proteins provide an important focus for research, some being involved in bacterial resistance to toxic metals, such as lead and cadmium, while others are involved in inherited human syndromes, such as Wilson's and Menke's diseases []. A conserved domain has been found in a number of these heavy metal transport or detoxification proteins []. The domain, which has been termed Heavy-Metal-Associated (HMA), contains two conserved cysteines that are probably involved in metal binding. Structure solution of the fourth HMA domain of the Menke's copper transporting ATPase shows a well-defined structure comprising a four-stranded antiparallel beta-sheet and two alpha helices packed in an alpha-beta sandwich fold []. This fold is common to other domains and is classified as "ferredoxin-like".; GO: 0046872 metal ion binding, 0030001 metal ion transport; PDB: 2VOY_A 1P6T_A 1KQK_A 2RML_A 1JWW_A 3K7R_F 1FES_A 1CC8_A 1FD8_A 2GGP_A ....
Probab=32.76 E-value=1.3e+02 Score=18.75 Aligned_cols=54 Identities=15% Similarity=0.264 Sum_probs=39.0
Q ss_pred eEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCH----HHHHHHHHH
Q 021958 175 TIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDE----SEQLRAMTE 235 (305)
Q Consensus 175 ~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~----~~A~~a~~~ 235 (305)
++.|.|+.=.--...+.+.+. ..-.|.++.+... .+.+-|.|... ++..++++.
T Consensus 1 t~~v~~m~C~~C~~~v~~~l~-~~~GV~~v~vd~~------~~~v~v~~~~~~~~~~~i~~~i~~ 58 (62)
T PF00403_consen 1 TFKVPGMTCEGCAKKVEKALS-KLPGVKSVKVDLE------TKTVTVTYDPDKTSIEKIIEAIEK 58 (62)
T ss_dssp EEEEESTTSHHHHHHHHHHHH-TSTTEEEEEEETT------TTEEEEEESTTTSCHHHHHHHHHH
T ss_pred CEEECCcccHHHHHHHHHHHh-cCCCCcEEEEECC------CCEEEEEEecCCCCHHHHHHHHHH
Confidence 467778877777889999998 7878888887544 34677888754 455555554
No 273
>PF03439 Spt5-NGN: Early transcription elongation factor of RNA pol II, NGN section; InterPro: IPR005100 Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4 []. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit []. ; PDB: 3H7H_B 3QQC_D 3P8B_B 2EXU_A.
Probab=32.76 E-value=1.1e+02 Score=20.86 Aligned_cols=27 Identities=26% Similarity=0.320 Sum_probs=22.3
Q ss_pred cccEEEEEeCCHHHHHHHHHHhCCcee
Q 021958 215 TKGYGFVRFGDESEQLRAMTEMNGVFC 241 (305)
Q Consensus 215 ~~g~afV~f~~~~~A~~a~~~l~~~~~ 241 (305)
-+||-|||=.+..+..+|++.+.+...
T Consensus 43 lkGyIyVEA~~~~~V~~ai~gi~~i~~ 69 (84)
T PF03439_consen 43 LKGYIYVEAERESDVKEAIRGIRHIRG 69 (84)
T ss_dssp STSEEEEEESSHHHHHHHHTT-TTEEE
T ss_pred CceEEEEEeCCHHHHHHHHhcccceee
Confidence 499999999999999999988766443
No 274
>KOG2675 consensus Adenylate cyclase-associated protein (CAP/Srv2p) [Cytoskeleton; Signal transduction mechanisms]
Probab=32.75 E-value=25 Score=32.05 Aligned_cols=14 Identities=14% Similarity=0.183 Sum_probs=8.8
Q ss_pred EEEEeCCHHHHHHH
Q 021958 219 GFVRFGDESEQLRA 232 (305)
Q Consensus 219 afV~f~~~~~A~~a 232 (305)
-|++|.-.|.-...
T Consensus 453 Dy~EfpvPEQfkt~ 466 (480)
T KOG2675|consen 453 DYVEFPVPEQFKTK 466 (480)
T ss_pred CcccccChHHHhhh
Confidence 36777777765544
No 275
>cd00874 RNA_Cyclase_Class_II RNA 3' phosphate cyclase domain (class II). These proteins function as RNA cyclase to catalyze the ATP-dependent conversion of 3'-phosphate to a 2'.3'-cyclic phosphodiester at the end of RNA molecule. A conserved catalytic histidine residue is found in all members of this subfamily.
Probab=31.04 E-value=3.8e+02 Score=23.79 Aligned_cols=118 Identities=14% Similarity=0.166 Sum_probs=62.8
Q ss_pred ceEEEc---CCCcccCHHHHhHhh----hhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcc
Q 021958 82 RTLWIG---DLQYWMDETYLNTCF----AHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQN 154 (305)
Q Consensus 82 ~~l~v~---nLp~~~t~~~l~~~f----~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~ 154 (305)
.+|-+. |.+++-+-|.++..| +++|.-.++++.+.- -.++|-+-|.|.-.-. +.+....+...++.
T Consensus 110 ~~l~l~GgT~~~~sPsvD~~~~v~lP~l~~~G~~~~l~v~rRG--~yP~GgGeV~~~v~p~-----~~l~~i~l~~~g~i 182 (326)
T cd00874 110 STVTISGGTDVPWAPPIDYLRNVTLPLLERMGIEAELEVLRRG--FYPRGGGEVVLTVEPS-----KLLPPLLLEERGEI 182 (326)
T ss_pred EEEEEEcccCCCCCCCHHHHHHHHHHHHHhCCCcEEEEEEeCC--cCCCCCEEEEEEEecc-----cCCCcceeecCCCe
Confidence 345553 456666777777654 788876677776642 2344445555543211 11122222212222
Q ss_pred eeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHHHHH---Hhhc-CCCeeEEEEEecCCCCCcccEEEEEeC
Q 021958 155 FRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYMLQET---FRAR-YPSTKGAKVVIDRLTGRTKGYGFVRFG 224 (305)
Q Consensus 155 i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l~~~---f~~~-~G~i~~i~i~~~~~~~~~~g~afV~f~ 224 (305)
.++. -..+..+++..+.+.++... |+ + ++ .++.+..+...+.+.|++++-+.
T Consensus 183 ~~ir---------------g~~~~~~l~~~va~r~~~~a~~~L~-~~~~--~dv~i~~~~~~~~s~G~~i~L~a 238 (326)
T cd00874 183 EKIR---------------GISHAANLPPHVAERQAEAAAALLR-KALG--LQIEIEPEDQSALGPGSGIVLWA 238 (326)
T ss_pred EEEE---------------EEEEEccCCHHHHHHHHHHHHHHHh-hccC--CCeEEEEEecCCCCCCEEEEEEE
Confidence 2222 35678899998887655444 44 4 32 23455544444677788765544
No 276
>PF00564 PB1: PB1 domain; InterPro: IPR000270 The Phox and Bem1p domain, is present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.; GO: 0005515 protein binding; PDB: 1IPG_A 1IP9_A 2KFK_A 1WMH_A 1VD2_A 1WI0_A 1OEY_C 1PQS_A 1Q1O_A 1TZ1_A ....
Probab=29.85 E-value=1.6e+02 Score=19.55 Aligned_cols=63 Identities=14% Similarity=0.259 Sum_probs=41.6
Q ss_pred cCCCcCCCHHHHHHHHhhcCCCe-eEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEE
Q 021958 179 GDLAADVTDYMLQETFRARYPST-KGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRI 248 (305)
Q Consensus 179 ~nlp~~~~~~~l~~~f~~~~G~i-~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v 248 (305)
-.++..++-++|.......||.. ..+.+..-.+.| -.|...+.++-..|++..... +++.+++
T Consensus 16 ~~~~~~~s~~~L~~~i~~~~~~~~~~~~l~Y~D~dg-----D~V~i~sd~Dl~~a~~~~~~~--~~~~lrl 79 (84)
T PF00564_consen 16 ISLPSDVSFDDLRSKIREKFGLLDEDFQLKYKDEDG-----DLVTISSDEDLQEAIEQAKES--GSKTLRL 79 (84)
T ss_dssp EEECSTSHHHHHHHHHHHHHTTSTSSEEEEEEETTS-----SEEEESSHHHHHHHHHHHHHC--TTSCEEE
T ss_pred EEcCCCCCHHHHHHHHHHHhCCCCccEEEEeeCCCC-----CEEEeCCHHHHHHHHHHHHhc--CCCcEEE
Confidence 45677778888888887677764 334444332233 479999999999999876533 3444444
No 277
>PF01282 Ribosomal_S24e: Ribosomal protein S24e; InterPro: IPR001976 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This family contains the S24e ribosomal proteins from eukaryotes and archaebacteria. These proteins have 101 to 148 amino acids.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2V94_B 1YWX_A 2G1D_A 3IZ6_U 1XN9_A 2XZM_P 2XZN_P 3U5G_Y 3J16_D 3IZB_U ....
Probab=28.95 E-value=2e+02 Score=19.80 Aligned_cols=48 Identities=8% Similarity=0.158 Sum_probs=24.7
Q ss_pred cCCCHHHHHHHHhhcCCCeeE----EEEEecCCCCCcccEEEEEeCCHHHHHH
Q 021958 183 ADVTDYMLQETFRARYPSTKG----AKVVIDRLTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 183 ~~~~~~~l~~~f~~~~G~i~~----i~i~~~~~~~~~~g~afV~f~~~~~A~~ 231 (305)
...+..+|++.+...|+.=.+ ..+...-..+.+.|||.| |+|.+.+.+
T Consensus 11 ~Tpsr~ei~~klA~~~~~~~~~ivv~~~~t~fG~~~s~g~a~I-Yd~~e~~kk 62 (84)
T PF01282_consen 11 PTPSRKEIREKLAAMLNVDPDLIVVFGIKTEFGGGKSTGFAKI-YDSAEALKK 62 (84)
T ss_dssp SS--HHHHHHHHHHHHTSTGCCEEEEEEEESSSSSEEEEEEEE-ESSHHHHHH
T ss_pred CCCCHHHHHHHHHHHhCCCCCeEEEeccEecCCCceEEEEEEE-eCCHHHHHH
Confidence 456778888887766665112 122233334455665544 566665553
No 278
>KOG1295 consensus Nonsense-mediated decay protein Upf3 [RNA processing and modification]
Probab=27.97 E-value=58 Score=29.21 Aligned_cols=70 Identities=14% Similarity=0.094 Sum_probs=48.5
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCC-cEEEEEEeeCCC--CCccceEEEEEcCHHHHHHHHHHhCCCCCC
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGE-VVAVKVIRNKQT--GQIEGYGFIEFISRAGAERVLQTFNGTPMP 149 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~-v~~~~~~~~~~~--~~~~g~afV~f~~~~~a~~a~~~l~g~~~~ 149 (305)
....|.|..||+..++.+|.+.+..+-. +....+...... ..-.+.+||.|...++.......++|..+.
T Consensus 6 ~~~Kvv~rrlpp~l~~~~~~eqi~p~~~~v~~~~F~~a~~s~~~~~ysrayinFk~~~dv~ef~~~f~g~ifl 78 (376)
T KOG1295|consen 6 AKVKVVVRRLPPKLTEEQLLEQINPFPEHVNWEFFAKADESLRNHKYSRAYINFKNPEDVEEFRRRFDGYIFL 78 (376)
T ss_pred cceeeeeecCCCcccHHHHhhhcCCCccccchheeccccccchhhhhhhhhhccccHHHHHHHHhhCCceEEe
Confidence 4567899999999999999988876543 222233311111 122567899999999988888888887653
No 279
>PTZ00191 60S ribosomal protein L23a; Provisional
Probab=27.71 E-value=2.2e+02 Score=21.90 Aligned_cols=57 Identities=14% Similarity=0.131 Sum_probs=38.4
Q ss_pred eEEEcCCCcccCHHHHhHhhhh-cCC-cEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHH
Q 021958 83 TLWIGDLQYWMDETYLNTCFAH-TGE-VVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQT 142 (305)
Q Consensus 83 ~l~v~nLp~~~t~~~l~~~f~~-~G~-v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~ 142 (305)
.-|+-.++...+..+|.+.++. |+. |..|..+.-+. |.- =|||.+....+|......
T Consensus 83 N~yvF~Vd~kAnK~qIK~AVEklf~VkV~kVNTli~p~-g~K--KA~V~L~~~~~aidva~k 141 (145)
T PTZ00191 83 NTLVFIVDQRANKTQIKKAVEKLYDVKVVKVNTLITPD-GLK--KAYIRLSPDVDALDVANK 141 (145)
T ss_pred CEEEEEEcCCCCHHHHHHHHHHHhCCeeEEEEeEEcCC-Cce--EEEEEECCCCcHHHHHHh
Confidence 4566667889999999999986 553 56665554432 222 499999877776555433
No 280
>cd06405 PB1_Mekk2_3 The PB1 domain is present in the two mitogen-activated protein kinase kinases MEKK2 and MEKK3 which are two members of the signaling kinase cascade involved in angiogenesis and early cardiovascular development. The PB1 domain of MEKK2 (and/or MEKK3) interacts with the PB1 domain of another member of the kinase cascade Map2k5. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, f
Probab=27.59 E-value=2e+02 Score=19.39 Aligned_cols=63 Identities=11% Similarity=0.105 Sum_probs=47.3
Q ss_pred EcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEc
Q 021958 178 VGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIG 249 (305)
Q Consensus 178 v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~ 249 (305)
|--++..+.-+||.......||.-.++.+..+. -.|-..+.++-.+|++.++. .-..+.|+|-
T Consensus 13 Ii~f~RPvkf~dl~~kv~~afGq~mdl~ytn~e--------L~iPl~~Q~DLDkAie~ld~-s~~~ksLRil 75 (79)
T cd06405 13 IIQFPRPVKFKDLQQKVTTAFGQPMDLHYTNNE--------LLIPLKNQEDLDRAIELLDR-SPHMKSLRIL 75 (79)
T ss_pred EEecCCCccHHHHHHHHHHHhCCeeeEEEeccc--------EEEeccCHHHHHHHHHHHcc-CccccceeEe
Confidence 445678888999998888899999888775331 56888999999999998775 4444455543
No 281
>PF14893 PNMA: PNMA
Probab=27.54 E-value=62 Score=28.72 Aligned_cols=53 Identities=17% Similarity=0.349 Sum_probs=32.3
Q ss_pred CCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEE---ecCCCCCcccEEEEEeCCH
Q 021958 172 PDHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVV---IDRLTGRTKGYGFVRFGDE 226 (305)
Q Consensus 172 ~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~---~~~~~~~~~g~afV~f~~~ 226 (305)
..+.|.|.+||.++++++|.+.+....-.+-..++. ...+.+ ..-|+|+|...
T Consensus 17 ~~r~lLv~giP~dc~~~ei~e~l~~~l~plg~yrvl~~~f~~~~~--~~aalve~~e~ 72 (331)
T PF14893_consen 17 PQRALLVLGIPEDCEEAEIEEALQAALSPLGRYRVLGKMFRREEN--AKAALVEFAED 72 (331)
T ss_pred hhhhheeecCCCCCCHHHHHHHHHHhhcccccceehhhHhhhhcc--cceeeeecccc
Confidence 357799999999999999998886322222222222 111112 34577777654
No 282
>CHL00030 rpl23 ribosomal protein L23
Probab=26.82 E-value=2.3e+02 Score=19.94 Aligned_cols=35 Identities=6% Similarity=-0.049 Sum_probs=25.8
Q ss_pred eEEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEec
Q 021958 175 TIFVGDLAADVTDYMLQETFRARYPS-TKGAKVVID 209 (305)
Q Consensus 175 ~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~ 209 (305)
.-|+-.++.+.+..+|++.++..||. |..|..+.-
T Consensus 20 n~y~F~V~~~anK~eIK~avE~lf~VkV~~VNt~~~ 55 (93)
T CHL00030 20 NQYTFDVDSGSTKTEIKHWIELFFGVKVIAVNSHRL 55 (93)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHhCCeEEEEEEEEc
Confidence 35555678899999999999966875 666665544
No 283
>PF01690 PLRV_ORF5: Potato leaf roll virus readthrough protein; InterPro: IPR002929 This family consists mainly of the Potato leafroll virus (PLrV) read through protein otherwise known as the minor capsid protein. This is generated via a readthrough of open reading frame 3, the coat protein, allowing transcription of open reading frame 5 to give an extended coat protein with a large C-terminal addition or read through domain []. The read through protein is essential for the circulative aphid transmission of PLrV [] and Beet western yellows virus []. The N-terminal region of the luteovirus readthrough domain determines virus binding to Buchnera GroEL and is essential for virus persistence in the aphid [].; GO: 0019028 viral capsid
Probab=26.32 E-value=64 Score=29.84 Aligned_cols=10 Identities=20% Similarity=0.431 Sum_probs=5.1
Q ss_pred eEEEEEcCHH
Q 021958 125 YGFIEFISRA 134 (305)
Q Consensus 125 ~afV~f~~~~ 134 (305)
-|+|.|.+..
T Consensus 125 ~GlIAY~~~~ 134 (465)
T PF01690_consen 125 DGLIAYDNSS 134 (465)
T ss_pred eeeEEecCcc
Confidence 3555555544
No 284
>PLN02805 D-lactate dehydrogenase [cytochrome]
Probab=25.75 E-value=1.3e+02 Score=28.84 Aligned_cols=50 Identities=6% Similarity=0.080 Sum_probs=36.7
Q ss_pred HHHHHHHH---hhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhC
Q 021958 187 DYMLQETF---RARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMN 237 (305)
Q Consensus 187 ~~~l~~~f---~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~ 237 (305)
.-||..+| +..+|.|.++.+...+. -..+..+++.|.+.++|.+|+..+.
T Consensus 280 g~dL~~l~~GseGtLGIIT~~tlrl~p~-P~~~~~~~~~f~~~~~a~~av~~i~ 332 (555)
T PLN02805 280 GYDLTRLVIGSEGTLGVITEVTLRLQKI-PQHSVVAMCNFPTIKDAADVAIATM 332 (555)
T ss_pred CccHHHHhccCCCceEEEEEEEEEeecC-CcceEEEEEEcCCHHHHHHHHHHHH
Confidence 35788887 33688899988865542 2335678899999999999887754
No 285
>KOG1984 consensus Vesicle coat complex COPII, subunit SFB3 [Intracellular trafficking, secretion, and vesicular transport]
Probab=25.30 E-value=3.7e+02 Score=27.38 Aligned_cols=12 Identities=8% Similarity=-0.042 Sum_probs=5.2
Q ss_pred cceeeEEccCCC
Q 021958 242 STRPMRIGPATN 253 (305)
Q Consensus 242 ~g~~l~v~~~~~ 253 (305)
+||+..+.++..
T Consensus 357 ~gr~f~Cn~C~~ 368 (1007)
T KOG1984|consen 357 GGRKFICNFCGS 368 (1007)
T ss_pred CCceEEecCCCc
Confidence 344444444433
No 286
>KOG1295 consensus Nonsense-mediated decay protein Upf3 [RNA processing and modification]
Probab=24.57 E-value=97 Score=27.84 Aligned_cols=68 Identities=12% Similarity=0.191 Sum_probs=46.6
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEecCC--CCCcccEEEEEeCCHHHHHHHHHHhCCcee
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPS-TKGAKVVIDRL--TGRTKGYGFVRFGDESEQLRAMTEMNGVFC 241 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~~~--~~~~~g~afV~f~~~~~A~~a~~~l~~~~~ 241 (305)
-..+.|.+||...++++|.+.+. .|-. +....+..... ...-.+.|+|.|...++...-...++|+.+
T Consensus 7 ~~Kvv~rrlpp~l~~~~~~eqi~-p~~~~v~~~~F~~a~~s~~~~~ysrayinFk~~~dv~ef~~~f~g~if 77 (376)
T KOG1295|consen 7 KVKVVVRRLPPKLTEEQLLEQIN-PFPEHVNWEFFAKADESLRNHKYSRAYINFKNPEDVEEFRRRFDGYIF 77 (376)
T ss_pred ceeeeeecCCCcccHHHHhhhcC-CCccccchheeccccccchhhhhhhhhhccccHHHHHHHHhhCCceEE
Confidence 46789999999999999999887 5543 22222221110 112256789999999997777777777664
No 287
>KOG4008 consensus rRNA processing protein RRP7 [RNA processing and modification]
Probab=24.37 E-value=74 Score=26.57 Aligned_cols=35 Identities=9% Similarity=0.090 Sum_probs=28.8
Q ss_pred CCCCceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEE
Q 021958 170 DTPDHTIFVGDLAADVTDYMLQETFRARYPSTKGAK 205 (305)
Q Consensus 170 ~~~~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~ 205 (305)
.....+||+-|+|...+++-|.++.+ .+|.+..+.
T Consensus 37 ~~eKd~lfl~Nvp~~~tee~lkr~vs-qlg~vq~~~ 71 (261)
T KOG4008|consen 37 SNEKDCLFLVNVPLLSTEEHLKRFVS-QLGHVQELL 71 (261)
T ss_pred cccccceeeecccccccHHHHHHHHH-Hhhhhhhee
Confidence 34468899999999999999999998 888665443
No 288
>PF04026 SpoVG: SpoVG; InterPro: IPR007170 This is a stage V sporulation protein G. It is essential for sporulation and specific to stage V sporulation in Bacillus megaterium and Bacillus subtilis []. In B. subtilis, expression decreases after 30-60 minutes of cold shock [].; GO: 0030435 sporulation resulting in formation of a cellular spore; PDB: 2IA9_F 2I9X_B 2I9Z_A.
Probab=24.31 E-value=1.5e+02 Score=20.36 Aligned_cols=47 Identities=11% Similarity=0.238 Sum_probs=29.8
Q ss_pred CeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee--cceeeEEccCCC
Q 021958 200 STKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC--STRPMRIGPATN 253 (305)
Q Consensus 200 ~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~--~g~~l~v~~~~~ 253 (305)
.|.+++|..-...|+-+++|-|+|.+ ++ .+++..+ +...+.|.+-+.
T Consensus 2 ~itdVri~~~~~~~~lka~asV~~dd------~f-~I~~ikVieg~~GlFVaMPs~ 50 (84)
T PF04026_consen 2 KITDVRIRKIEPEGKLKAFASVTFDD------CF-VIHDIKVIEGEKGLFVAMPSR 50 (84)
T ss_dssp -EEEEEEEETTSSSSEEEEEEEEETT------TE-EEEEEEEEEETTEEEEE--EE
T ss_pred ccEEEEEEEecCCCCEEEEEEEEECC------EE-EEEeEEEEECCCCcEEECCCc
Confidence 36778887766668889999999988 11 2455443 345577766443
No 289
>PRK10905 cell division protein DamX; Validated
Probab=23.60 E-value=1.7e+02 Score=25.80 Aligned_cols=60 Identities=20% Similarity=0.117 Sum_probs=36.9
Q ss_pred CCceEEEcCCCcccCHHHHhHhhhhcCCcEEEEEEeeCCCCCccceE--EEEEcCHHHHHHHHHHhC
Q 021958 80 EIRTLWIGDLQYWMDETYLNTCFAHTGEVVAVKVIRNKQTGQIEGYG--FIEFISRAGAERVLQTFN 144 (305)
Q Consensus 80 ~~~~l~v~nLp~~~t~~~l~~~f~~~G~v~~~~~~~~~~~~~~~g~a--fV~f~~~~~a~~a~~~l~ 144 (305)
...+|-|+.+ .+++.|+++..+.|. ....+.....+|+. .|- +=.|.+.++|++|+..|-
T Consensus 246 ~~YTLQL~A~---Ss~~~l~~fakKlgL-~~y~vy~TtRnGkp-WYVV~yG~YaSraeAk~AiakLP 307 (328)
T PRK10905 246 SHYTLQLSSS---SNYDNLNGWAKKENL-KNYVVYETTRNGQP-WYVLVSGVYASKEEAKRAVSTLP 307 (328)
T ss_pred CceEEEEEec---CCHHHHHHHHHHcCC-CceEEEEeccCCce-EEEEEecCCCCHHHHHHHHHHCC
Confidence 3345555544 566888888888764 33334443333432 232 336999999999998763
No 290
>PRK12280 rplW 50S ribosomal protein L23; Reviewed
Probab=23.44 E-value=3e+02 Score=21.60 Aligned_cols=35 Identities=0% Similarity=0.037 Sum_probs=27.1
Q ss_pred eEEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEec
Q 021958 175 TIFVGDLAADVTDYMLQETFRARYPS-TKGAKVVID 209 (305)
Q Consensus 175 ~l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~ 209 (305)
..|+-.++.+.+..+|++.++..||. |..|..+.-
T Consensus 23 N~ytF~V~~~anK~eIK~AVE~iF~VkV~~VNT~~~ 58 (158)
T PRK12280 23 NVYTFKVDRRANKIEIKKAVEFIFKVKVLKVNIFNV 58 (158)
T ss_pred CEEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEec
Confidence 35777789999999999999977876 666666543
No 291
>PF00276 Ribosomal_L23: Ribosomal protein L23; InterPro: IPR013025 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This domain is found in both eukaryotic L25 and prokaryotic and eukaryotic L23 proteins.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3DF4_T 3DF2_T 4A1E_R 4A1A_R 4A1C_R 4A17_R 2HGU_W 2HGQ_W 1N88_A 2HGJ_W ....
Probab=23.22 E-value=1.3e+02 Score=20.96 Aligned_cols=34 Identities=12% Similarity=0.127 Sum_probs=25.4
Q ss_pred EEEcCCCcCCCHHHHHHHHhhcCCC-eeEEEEEec
Q 021958 176 IFVGDLAADVTDYMLQETFRARYPS-TKGAKVVID 209 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~~~~G~-i~~i~i~~~ 209 (305)
-+.-.++.+.+..||++.++..||. |.+|....-
T Consensus 22 ~~tF~V~~~atK~~Ik~aie~iy~V~V~~Vnt~~~ 56 (91)
T PF00276_consen 22 QYTFEVDPRATKTEIKEAIEKIYGVKVKKVNTMNY 56 (91)
T ss_dssp EEEEEETTTSTHHHHHHHHHHHHTSEEEEEEEEEE
T ss_pred EEEEEEeCCCCHHHHHHHHHhhcCCCeeEEEEeEe
Confidence 4445678899999999999988886 556665533
No 292
>cd02568 PseudoU_synth_PUS1_PUS2 PseudoU_synth_PUS1_PUS2: Pseudouridine synthase, PUS1/ PUS2 like. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus1p, S. cerevisiae Pus2p, Caenorhabditis elegans Pus1p and human PUS1. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. S. cerevisiae Pus1p catalyzes the formation of psi34 and psi36 in the intron-containing tRNAIle, psi35 in the intron-containing tRNATyr, psi27 and/or psi28 in several yeast cytoplasmic tRNAs and, psi44 in U2 small nuclear RNA (U2 snRNA). The presence of the intron is required for the formation of psi 34, 35 and 36. In addition S. cerevisiae PUS1 makes are psi 26, 65 and 67. C. elegans Pus1p does not modify psi44 in U2 snRNA. Mouse Pus1p makes psi27/28 in pre- tRNASer , tRNAVal and tRNAIle, psi 34/36 in tRNAIle and, psi 32 and potentially 67 in tRNAVal. Psi44 in U
Probab=22.86 E-value=2e+02 Score=24.25 Aligned_cols=115 Identities=12% Similarity=0.048 Sum_probs=55.1
Q ss_pred CCHHHHHHHHhhcCCCe--------eEEEEEec--CCCCCcccEEEEEeCCHHH-------HHHHHHHhCCceecc-eee
Q 021958 185 VTDYMLQETFRARYPST--------KGAKVVID--RLTGRTKGYGFVRFGDESE-------QLRAMTEMNGVFCST-RPM 246 (305)
Q Consensus 185 ~~~~~l~~~f~~~~G~i--------~~i~i~~~--~~~~~~~g~afV~f~~~~~-------A~~a~~~l~~~~~~g-~~l 246 (305)
+-+++|.+.+. +.+.| ..+.+..- .+.|.+...-+|.|..... ..+.+..||...-.+ +.+
T Consensus 22 TVq~~le~aL~-~~~~~~~~~~~~~~~~~~~~agRTD~GVHA~~qv~~~~~~~~~~~~~~~~~~~~~~lN~~Lp~dI~v~ 100 (245)
T cd02568 22 TIEGELERALV-KAGAISESNAGDPKKIGFSRAARTDKGVHAARNVVSLKVIIDDPEGLGIGEDLVEKLNSHLPEDIRVF 100 (245)
T ss_pred ChHHHHHHHHH-HcCCcCccccCChhhcceeeecccccchhheeeEEEEEEeccccccccchHHHHHHHHccCCCceEEE
Confidence 55788999998 77765 34555432 2345544444555543221 334455566543222 233
Q ss_pred EEccCCCCCCCCCCCCCCCcceEEeCCcchhhhHHHHhhccceeeEEEEEeeeecc
Q 021958 247 RIGPATNKKTVSGQQQYPKGTFLVVGTGFCLQIKFKFKYFIGWLIFIIIFFNISTS 302 (305)
Q Consensus 247 ~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 302 (305)
.|......-..+.......-.+++ ... ...+.+++.+....+.--++|-|+|.+
T Consensus 101 ~~~~v~~~F~aR~~a~~R~Y~Y~i-~~~-~~~d~~~m~~a~~~~~G~HdF~~F~~~ 154 (245)
T cd02568 101 GITRVTKSFNARKACDSRTYEYLL-PTF-ALETLDRFNEILKLYVGTHNFHNFTVR 154 (245)
T ss_pred EEEcCCCCCCcchhhhcceeEEEC-Ccc-cHhHHHHHHHHHHHhhCccccccCccc
Confidence 333332222222111112222222 211 113455555555555666788888875
No 293
>COG5353 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.50 E-value=3.7e+02 Score=20.80 Aligned_cols=56 Identities=11% Similarity=0.127 Sum_probs=39.7
Q ss_pred CceEEEcCCCcccCHHHHhHhhhhcC---CcEEEEEEeeCCC---------CCccc-eEEEEEcCHHHH
Q 021958 81 IRTLWIGDLQYWMDETYLNTCFAHTG---EVVAVKVIRNKQT---------GQIEG-YGFIEFISRAGA 136 (305)
Q Consensus 81 ~~~l~v~nLp~~~t~~~l~~~f~~~G---~v~~~~~~~~~~~---------~~~~g-~afV~f~~~~~a 136 (305)
..+||+..+...+++++.++..++-+ .+.++.+-+.+.. ...+. |-+|.|++-...
T Consensus 87 ~~KI~~k~asqGISe~~a~~~i~kE~~~~e~~~V~Lg~e~~~PiWEV~y~dkeg~~s~~~vdFetG~~l 155 (161)
T COG5353 87 DGKIYSKKASQGISEEDARAIISKEKAVKEIKSVTLGREKEKPIWEVTYLDKEGRLSFYYVDFETGKEL 155 (161)
T ss_pred CCeEEEEehhcCCCHHHHHHHHhhhccccceeEEEEEeeCCceeEEEeecCccCcceEEEEEeccchhh
Confidence 36888889999999999999987653 4666666554432 12233 888999876543
No 294
>PRK10629 EnvZ/OmpR regulon moderator; Provisional
Probab=22.50 E-value=3.4e+02 Score=20.34 Aligned_cols=59 Identities=7% Similarity=0.037 Sum_probs=40.0
Q ss_pred CCHHHHHHHHhhcCCC-eeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCceecceeeEEccCC
Q 021958 185 VTDYMLQETFRARYPS-TKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFCSTRPMRIGPAT 252 (305)
Q Consensus 185 ~~~~~l~~~f~~~~G~-i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~~g~~l~v~~~~ 252 (305)
.+-+.+.+.+. .-|. ++++..- .+-..|.|.+.++-.+|.+.|....-++-.|.+..+.
T Consensus 50 ~~~~~v~~~L~-~~gI~~ksi~~~--------~~~~~irf~~~~~Ql~Ak~vL~~~L~~~y~VAlnl~p 109 (127)
T PRK10629 50 PDGFYVYQHLD-ANGIHIKSITPE--------NDSLLIRFDSPEQSAAAKEVLDRTLPHGYIIAQQDDN 109 (127)
T ss_pred chHHHHHHHHH-HCCCCcceEEee--------CCEEEEEECCHHHHHHHHHHHHHHcCCCCEEEEecCC
Confidence 56678899998 5543 4444432 2247899999999999988887655455566655543
No 295
>smart00666 PB1 PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
Probab=22.39 E-value=2.4e+02 Score=18.61 Aligned_cols=57 Identities=12% Similarity=0.143 Sum_probs=39.7
Q ss_pred EEEcCCCcCCCHHHHHHHHhhcCCCe-eEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhC
Q 021958 176 IFVGDLAADVTDYMLQETFRARYPST-KGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMN 237 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~~~~G~i-~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~ 237 (305)
++.-.++..++-++|......+|+.- ..+.+....+.| -+|...+.++-..|+....
T Consensus 12 ~~~~~~~~~~s~~dL~~~i~~~~~~~~~~~~l~Y~Dedg-----d~v~l~sd~Dl~~a~~~~~ 69 (81)
T smart00666 12 TRRLSVPRDISFEDLRSKVAKRFGLDNQSFTLKYQDEDG-----DLVSLTSDEDLEEAIEEYD 69 (81)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCCCCCeEEEEECCCC-----CEEEecCHHHHHHHHHHHH
Confidence 44445677899999999888788753 234444332233 3799999999999998754
No 296
>PF11411 DNA_ligase_IV: DNA ligase IV; InterPro: IPR021536 DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface []. ; GO: 0003910 DNA ligase (ATP) activity; PDB: 3II6_Y 2E2W_A 1IK9_C.
Probab=21.83 E-value=65 Score=18.24 Aligned_cols=16 Identities=25% Similarity=0.210 Sum_probs=10.2
Q ss_pred cccCHHHHhHhhhhcC
Q 021958 91 YWMDETYLNTCFAHTG 106 (305)
Q Consensus 91 ~~~t~~~l~~~f~~~G 106 (305)
.++++++|++.|.+.+
T Consensus 19 ~Dtd~~~Lk~vF~~i~ 34 (36)
T PF11411_consen 19 VDTDEDQLKEVFNRIK 34 (36)
T ss_dssp S---HHHHHHHHHCS-
T ss_pred ccCCHHHHHHHHHHhc
Confidence 4678999999998764
No 297
>TIGR03399 RNA_3prim_cycl RNA 3'-phosphate cyclase. Members of this protein family are RNA 3'-phosphate cyclase (6.5.1.4), an enzyme whose function is conserved from E. coli to human. The modification this enzyme performs enables certain RNA ligations to occur, although the full biological roll for this enzyme is not fully described. This model separates this enzyme from a related protein, present only in eukaryotes, localized to the nucleolus, and involved in ribosomal modification.
Probab=21.78 E-value=5.7e+02 Score=22.69 Aligned_cols=119 Identities=15% Similarity=0.223 Sum_probs=64.6
Q ss_pred ceEEEc---CCCcccCHHHHhHh----hhhcCCcEEEEEEeeCCCCCccceEEEEEcCHHHHHHHHHHhCCCCCCCCCcc
Q 021958 82 RTLWIG---DLQYWMDETYLNTC----FAHTGEVVAVKVIRNKQTGQIEGYGFIEFISRAGAERVLQTFNGTPMPNGEQN 154 (305)
Q Consensus 82 ~~l~v~---nLp~~~t~~~l~~~----f~~~G~v~~~~~~~~~~~~~~~g~afV~f~~~~~a~~a~~~l~g~~~~~~~~~ 154 (305)
.+|-+. |.+++-+-|.++.. ++++|.-.++++.+.- -.++|=+=|.|.-.-. +.+....+...++.
T Consensus 112 ~~l~l~GgT~~~~sPsvDy~~~v~lP~l~~~G~~~~l~v~rRG--~yP~GGGeV~~~i~p~-----~~l~~i~l~~~G~i 184 (326)
T TIGR03399 112 SRVTVSGGTDVPWAPPVDYLRNVFLPLLERMGIRAELELLRRG--FYPRGGGEVRLRVEPV-----KKLKPLELEERGEL 184 (326)
T ss_pred eEEEEEcccCCCCCCCHHHHHHHHHHHHHhCCCcEEEEEEeCC--cCCCCCEEEEEEEccc-----cCCCceeeecCCce
Confidence 455553 45566677777765 4788886777776643 2344445555543221 11222233223333
Q ss_pred eeeeccCCCCCCCCCCCCCceEEEcCCCcCCCHHHH---HHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeC
Q 021958 155 FRLNWASFGAGEKRDDTPDHTIFVGDLAADVTDYML---QETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFG 224 (305)
Q Consensus 155 i~v~~~~~~~~~~~~~~~~~~l~v~nlp~~~~~~~l---~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~ 224 (305)
.++. ...+..+++..+-+.++ ++.+. .++.-.++.+... ..+.+.|++++-+.
T Consensus 185 ~~i~---------------g~~~~~~l~~~va~r~~~~a~~~L~-~~~~~~~i~~~~~-~~~~s~G~~i~L~a 240 (326)
T TIGR03399 185 LRVS---------------GIAHAANLPAHVAERMAKAAREELR-KLGLDPEIEIEVL-DKGLGPGSGIVLWA 240 (326)
T ss_pred EEEE---------------EEEEEccCCHHHHHHHHHHHHHHHH-hhCCCceEEEEec-cCCCCCcEEEEEEE
Confidence 3332 36778899998877544 44455 5554334443332 45677788776554
No 298
>COG5470 Uncharacterized conserved protein [Function unknown]
Probab=21.51 E-value=2.4e+02 Score=19.99 Aligned_cols=21 Identities=24% Similarity=0.069 Sum_probs=16.5
Q ss_pred CCccceEEEEEcCHHHHHHHH
Q 021958 120 GQIEGYGFIEFISRAGAERVL 140 (305)
Q Consensus 120 ~~~~g~afV~f~~~~~a~~a~ 140 (305)
........|+|.+.+.|+.+-
T Consensus 50 w~ptr~vviEFps~~~ar~~y 70 (96)
T COG5470 50 WRPTRNVVIEFPSLEAARDCY 70 (96)
T ss_pred CCcccEEEEEcCCHHHHHHHh
Confidence 343557899999999998775
No 299
>COG5353 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.43 E-value=3.9e+02 Score=20.68 Aligned_cols=55 Identities=9% Similarity=0.144 Sum_probs=38.0
Q ss_pred ceEEEcCCCcCCCHHHHHHHHhhc--CCCeeEEEEEecCC---------CCCccc-EEEEEeCCHHH
Q 021958 174 HTIFVGDLAADVTDYMLQETFRAR--YPSTKGAKVVIDRL---------TGRTKG-YGFVRFGDESE 228 (305)
Q Consensus 174 ~~l~v~nlp~~~~~~~l~~~f~~~--~G~i~~i~i~~~~~---------~~~~~g-~afV~f~~~~~ 228 (305)
.+|++.-+...+++++.++..+.. -+++..|.+-+..+ +...+. |-+|.|++-..
T Consensus 88 ~KI~~k~asqGISe~~a~~~i~kE~~~~e~~~V~Lg~e~~~PiWEV~y~dkeg~~s~~~vdFetG~~ 154 (161)
T COG5353 88 GKIYSKKASQGISEEDARAIISKEKAVKEIKSVTLGREKEKPIWEVTYLDKEGRLSFYYVDFETGKE 154 (161)
T ss_pred CeEEEEehhcCCCHHHHHHHHhhhccccceeEEEEEeeCCceeEEEeecCccCcceEEEEEeccchh
Confidence 689999999999999999998733 35566676654432 112233 77888887543
No 300
>PHA01632 hypothetical protein
Probab=21.39 E-value=1e+02 Score=19.29 Aligned_cols=20 Identities=15% Similarity=0.303 Sum_probs=16.8
Q ss_pred EEEcCCCcCCCHHHHHHHHh
Q 021958 176 IFVGDLAADVTDYMLQETFR 195 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~ 195 (305)
|.|..+|..-|+++|++.+.
T Consensus 19 ilieqvp~kpteeelrkvlp 38 (64)
T PHA01632 19 ILIEQVPQKPTEEELRKVLP 38 (64)
T ss_pred EehhhcCCCCCHHHHHHHHH
Confidence 44578899999999999875
No 301
>PRK11558 putative ssRNA endonuclease; Provisional
Probab=21.22 E-value=2.2e+02 Score=20.24 Aligned_cols=49 Identities=16% Similarity=0.399 Sum_probs=28.8
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCC
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGD 225 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~ 225 (305)
..-+||+++...+.+.--..+-+ .++.= ++.++... ++. .||+|-++.+
T Consensus 27 ~~GVyVg~~S~rVRd~lW~~v~~-~~~~G-~avmv~~~-~~e-qG~~~~t~G~ 75 (97)
T PRK11558 27 RAGVYVGDVSRRIREMIWQQVTQ-LAEEG-NVVMAWAT-NTE-SGFEFQTFGE 75 (97)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHH-hCCCC-cEEEEEcC-CCC-CCcEEEecCC
Confidence 46699999988887754444443 44432 23333321 333 4899888665
No 302
>PF14657 Integrase_AP2: AP2-like DNA-binding integrase domain
Probab=21.00 E-value=1.9e+02 Score=16.96 Aligned_cols=30 Identities=23% Similarity=0.292 Sum_probs=21.0
Q ss_pred ecCCCCCcccEEEEEeCCHHHHHHHHHHhC
Q 021958 208 IDRLTGRTKGYGFVRFGDESEQLRAMTEMN 237 (305)
Q Consensus 208 ~~~~~~~~~g~afV~f~~~~~A~~a~~~l~ 237 (305)
.+..+|+.+..--=-|.+..+|..++..+.
T Consensus 9 ~~~~~Gkrk~~~k~GF~TkkeA~~~~~~~~ 38 (46)
T PF14657_consen 9 YDDETGKRKQKTKRGFKTKKEAEKALAKIE 38 (46)
T ss_pred EECCCCCEEEEEcCCCCcHHHHHHHHHHHH
Confidence 455577765444334999999999987653
No 303
>PF09707 Cas_Cas2CT1978: CRISPR-associated protein (Cas_Cas2CT1978); InterPro: IPR010152 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799 from INTERPRO. Cas2 is one of four protein families (Cas1 to Cas4) that are associated with CRISPR elements and always occur near a repeat cluster, usually in the order cas3-cas4-cas1-cas2. The function of Cas2 (and Cas1) is unknown. Cas3 proteins appear to be helicases while Cas4 proteins resemble RecB-type exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression [].
Probab=20.93 E-value=2.9e+02 Score=19.15 Aligned_cols=48 Identities=19% Similarity=0.394 Sum_probs=30.0
Q ss_pred CceEEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeC
Q 021958 173 DHTIFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFG 224 (305)
Q Consensus 173 ~~~l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~ 224 (305)
..-+||+|++..+.|.-...+.+ ..+.=. +.++... .+ ..||+|-...
T Consensus 25 ~~GVyVg~~s~rVRe~lW~~v~~-~~~~G~-a~m~~~~-~n-eqG~~~~t~G 72 (86)
T PF09707_consen 25 RPGVYVGNVSARVRERLWERVTE-WIGDGS-AVMVWSD-NN-EQGFDFRTLG 72 (86)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHh-hCCCcc-EEEEEcc-CC-CCCEEEEEeC
Confidence 46799999999988875555554 333333 3333332 22 5889888763
No 304
>COG2004 RPS24A Ribosomal protein S24E [Translation, ribosomal structure and biogenesis]
Probab=20.76 E-value=3.4e+02 Score=19.73 Aligned_cols=48 Identities=21% Similarity=0.229 Sum_probs=28.6
Q ss_pred cCCCHHHHHHHHhhcCCCeeEEEEEecC----CCCCcccEEEEEeCCHHHHHH
Q 021958 183 ADVTDYMLQETFRARYPSTKGAKVVIDR----LTGRTKGYGFVRFGDESEQLR 231 (305)
Q Consensus 183 ~~~~~~~l~~~f~~~~G~i~~i~i~~~~----~~~~~~g~afV~f~~~~~A~~ 231 (305)
...+..+|++.+...+|.=.+..++..- ..+.++|||-| |.|.+.|..
T Consensus 30 ~TPSr~evrekla~~l~~d~e~VvV~~ikt~fG~~~s~g~akI-Y~s~e~~~~ 81 (107)
T COG2004 30 PTPSRKEVREKLAAMLGADKELVVVDYIKTEFGKGRSKGYAKI-YDSVERAKK 81 (107)
T ss_pred CCCCHHHHHHHHHHHHCCCcceEEEEehhhhcCCcceeEEEEE-ECCHHHHHh
Confidence 4567788888887677764444444332 23455666644 566666553
No 305
>KOG0862 consensus Synaptobrevin/VAMP-like protein SEC22 [Intracellular trafficking, secretion, and vesicular transport]
Probab=20.75 E-value=61 Score=26.63 Aligned_cols=31 Identities=19% Similarity=0.385 Sum_probs=20.7
Q ss_pred HHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCH
Q 021958 188 YMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDE 226 (305)
Q Consensus 188 ~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~ 226 (305)
|||.+.|...||.-. .+-.+|-|+||+|++.
T Consensus 89 edL~~EF~~~~~~~~--------~~~~~RPY~FieFD~~ 119 (216)
T KOG0862|consen 89 EDLAQEFDKSYGKNI--------IQPASRPYAFIEFDTF 119 (216)
T ss_pred HHHHHHHHHhccccc--------CCccCCCeeEEehhHH
Confidence 667777765666421 1225588999999874
No 306
>PRK13259 regulatory protein SpoVG; Reviewed
Probab=20.74 E-value=1.9e+02 Score=20.50 Aligned_cols=47 Identities=13% Similarity=0.191 Sum_probs=29.7
Q ss_pred CeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHhCCcee--cceeeEEccCCC
Q 021958 200 STKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEMNGVFC--STRPMRIGPATN 253 (305)
Q Consensus 200 ~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l~~~~~--~g~~l~v~~~~~ 253 (305)
+|.+|+|..-...|+-++||-|+|.+. + ..|+..+ +.+.|.|..-+.
T Consensus 2 ~ITdVri~~~~~~g~lka~asit~dd~------f-vI~~ikVieg~~GlFVaMPs~ 50 (94)
T PRK13259 2 EVTDVRLRKVNTEGRMKAIVSITFDNE------F-VVHDIRVIEGNNGLFIAMPSK 50 (94)
T ss_pred eEEEEEEEEeCCCCcEEEEEEEEECCE------E-EEeeeEEEECCCCeEEECcCc
Confidence 467777776555688899999999882 1 2344443 234566665443
No 307
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=20.63 E-value=98 Score=17.45 Aligned_cols=18 Identities=11% Similarity=-0.031 Sum_probs=14.7
Q ss_pred CCCHHHHHHHHhhcCCCee
Q 021958 184 DVTDYMLQETFRARYPSTK 202 (305)
Q Consensus 184 ~~~~~~l~~~f~~~~G~i~ 202 (305)
.+++++|++.+. .+|.+.
T Consensus 3 tWs~~~L~~wL~-~~gi~~ 20 (38)
T PF10281_consen 3 TWSDSDLKSWLK-SHGIPV 20 (38)
T ss_pred CCCHHHHHHHHH-HcCCCC
Confidence 578999999998 888653
No 308
>PRK14950 DNA polymerase III subunits gamma and tau; Provisional
Probab=20.38 E-value=5e+02 Score=25.17 Aligned_cols=12 Identities=8% Similarity=0.027 Sum_probs=4.7
Q ss_pred HHHHHHhhcCCC
Q 021958 189 MLQETFRARYPS 200 (305)
Q Consensus 189 ~l~~~f~~~~G~ 200 (305)
.|.+.+...+|.
T Consensus 524 ~i~~~l~~~~g~ 535 (585)
T PRK14950 524 ITEELLSNFVGK 535 (585)
T ss_pred HHHHHHHHHhCC
Confidence 344444433343
No 309
>cd06408 PB1_NoxR The PB1 domain is present in the Epichloe festucae NoxR protein (NADPH oxidase regulator), a key regulator of NADPH oxidase isoform, NoxA. NoxA is essential for growth control of the fungal endophyte in plant tissue in the process of symbiotic interaction between a fungi and its plant host. The Epichloe festucae p67(phox)-like regulator, NoxR, dispensable in culture but essential in plants for the symbiotic interaction. Plants infected with a noxR deletion mutant show severe stunting and premature senescence, whereas hyphae in the meristematic tissues show increased branching leading to increased fungal colonization of pseudostem and leaf blade tissue. The PB1 domain is a modular domain mediating specific protein-protein interactions which a play role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is
Probab=20.11 E-value=3.1e+02 Score=19.01 Aligned_cols=55 Identities=13% Similarity=0.184 Sum_probs=38.4
Q ss_pred EEEcCCCcCCCHHHHHHHHhhcCCCeeEEEEEecCCCCCcccEEEEEeCCHHHHHHHHHHh
Q 021958 176 IFVGDLAADVTDYMLQETFRARYPSTKGAKVVIDRLTGRTKGYGFVRFGDESEQLRAMTEM 236 (305)
Q Consensus 176 l~v~nlp~~~~~~~l~~~f~~~~G~i~~i~i~~~~~~~~~~g~afV~f~~~~~A~~a~~~l 236 (305)
+.+--++.+++-++|.+.+.++||--..+.|..-.+ | .+|+..|.++-..|+...
T Consensus 13 v~~i~v~~~i~f~dL~~kIrdkf~~~~~~~iKykDE-G-----D~iti~sq~DLd~Ai~~a 67 (86)
T cd06408 13 TRYIMIGPDTGFADFEDKIRDKFGFKRRLKIKMKDD-G-----DMITMGDQDDLDMAIDTA 67 (86)
T ss_pred EEEEEcCCCCCHHHHHHHHHHHhCCCCceEEEEEcC-C-----CCccccCHHHHHHHHHHH
Confidence 444467888888888777776777655555554433 3 578999998888887653
Done!