Query 034021
Match_columns 106
No_of_seqs 110 out of 1624
Neff 10.2
Searched_HMMs 46136
Date Fri Mar 29 08:54:24 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/034021.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/034021hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03134 glycine-rich RNA-bind 99.9 5E-21 1.1E-25 113.6 12.8 83 23-105 31-113 (144)
2 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.8 1.7E-19 3.7E-24 120.2 12.2 79 26-104 269-347 (352)
3 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.8 1.7E-19 3.8E-24 120.2 11.3 81 25-105 2-82 (352)
4 KOG0122 Translation initiation 99.8 2.2E-19 4.7E-24 112.0 10.6 84 22-105 185-268 (270)
5 TIGR01659 sex-lethal sex-letha 99.8 2.4E-19 5.2E-24 119.3 11.0 85 21-105 102-186 (346)
6 PF00076 RRM_1: RNA recognitio 99.8 6E-19 1.3E-23 92.5 10.0 70 29-99 1-70 (70)
7 PF14259 RRM_6: RNA recognitio 99.8 1.4E-17 3.1E-22 87.6 10.2 70 29-99 1-70 (70)
8 TIGR01659 sex-lethal sex-letha 99.8 9.5E-18 2.1E-22 111.7 11.0 82 24-105 191-274 (346)
9 TIGR01645 half-pint poly-U bin 99.8 1.4E-17 3.1E-22 116.7 11.4 80 25-104 203-282 (612)
10 KOG0107 Alternative splicing f 99.8 6.3E-18 1.4E-22 101.0 8.2 76 24-104 8-83 (195)
11 TIGR01648 hnRNP-R-Q heterogene 99.7 2.7E-17 5.9E-22 115.0 11.1 100 2-103 35-135 (578)
12 TIGR01628 PABP-1234 polyadenyl 99.7 2.5E-17 5.4E-22 115.8 10.8 79 28-106 2-80 (562)
13 KOG0149 Predicted RNA-binding 99.7 7.6E-18 1.7E-22 104.6 7.2 78 25-103 11-88 (247)
14 TIGR01645 half-pint poly-U bin 99.7 2.4E-17 5.2E-22 115.6 10.0 79 25-103 106-184 (612)
15 PLN03120 nucleic acid binding 99.7 7.4E-17 1.6E-21 102.6 10.1 75 26-104 4-78 (260)
16 TIGR01642 U2AF_lg U2 snRNP aux 99.7 1.7E-16 3.6E-21 110.3 12.1 80 25-104 294-373 (509)
17 TIGR01622 SF-CC1 splicing fact 99.7 1.5E-16 3.2E-21 109.4 11.2 79 26-104 186-264 (457)
18 KOG0117 Heterogeneous nuclear 99.7 1.3E-16 2.9E-21 106.7 10.3 99 5-104 63-162 (506)
19 KOG0113 U1 small nuclear ribon 99.7 1.1E-16 2.4E-21 102.5 9.1 80 24-103 99-178 (335)
20 TIGR01622 SF-CC1 splicing fact 99.7 2.4E-16 5.3E-21 108.3 11.3 82 22-104 85-166 (457)
21 smart00362 RRM_2 RNA recogniti 99.7 3.2E-16 7E-21 81.7 9.3 72 28-101 1-72 (72)
22 KOG0145 RNA-binding protein EL 99.7 1.3E-16 2.7E-21 101.0 8.7 84 22-105 37-120 (360)
23 KOG0125 Ataxin 2-binding prote 99.7 1.5E-16 3.2E-21 103.0 8.5 84 20-105 90-173 (376)
24 TIGR01628 PABP-1234 polyadenyl 99.7 3.6E-16 7.9E-21 109.9 11.0 81 24-105 283-363 (562)
25 KOG4207 Predicted splicing fac 99.7 1.5E-16 3.2E-21 97.6 7.1 85 21-105 8-92 (256)
26 KOG0114 Predicted RNA-binding 99.7 5.8E-16 1.3E-20 85.5 8.7 83 20-105 12-94 (124)
27 KOG0148 Apoptosis-promoting RN 99.7 4.6E-16 9.9E-21 98.7 9.5 77 22-104 160-236 (321)
28 PLN03213 repressor of silencin 99.7 4.7E-16 1E-20 105.7 9.7 79 23-105 7-87 (759)
29 smart00360 RRM RNA recognition 99.7 9.4E-16 2E-20 79.6 8.4 71 31-101 1-71 (71)
30 PLN03121 nucleic acid binding 99.7 1.6E-15 3.6E-20 95.3 10.4 75 25-103 4-78 (243)
31 KOG0108 mRNA cleavage and poly 99.7 5.1E-16 1.1E-20 105.4 8.3 79 27-105 19-97 (435)
32 KOG0111 Cyclophilin-type pepti 99.7 1.2E-16 2.5E-21 99.0 4.6 84 23-106 7-90 (298)
33 KOG0121 Nuclear cap-binding pr 99.6 1E-15 2.2E-20 87.5 7.0 81 23-103 33-113 (153)
34 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.6 3.9E-15 8.4E-20 103.2 11.1 77 24-105 273-350 (481)
35 cd00590 RRM RRM (RNA recogniti 99.6 1.2E-14 2.7E-19 76.0 10.1 74 28-102 1-74 (74)
36 KOG0126 Predicted RNA-binding 99.6 4.9E-17 1.1E-21 97.9 0.6 81 22-102 31-111 (219)
37 KOG0148 Apoptosis-promoting RN 99.6 1.4E-15 3.1E-20 96.4 7.2 80 26-105 62-141 (321)
38 COG0724 RNA-binding proteins ( 99.6 6.4E-15 1.4E-19 94.0 10.1 79 26-104 115-193 (306)
39 TIGR01648 hnRNP-R-Q heterogene 99.6 1.4E-14 3E-19 101.6 11.0 73 25-105 232-306 (578)
40 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.6 1.4E-14 3E-19 100.5 10.5 74 26-105 2-77 (481)
41 KOG0105 Alternative splicing f 99.6 9.1E-15 2E-19 88.5 6.5 78 24-104 4-81 (241)
42 KOG0130 RNA-binding protein RB 99.6 1.2E-14 2.6E-19 83.9 6.6 80 25-104 71-150 (170)
43 KOG0131 Splicing factor 3b, su 99.6 9.1E-15 2E-19 88.1 6.4 82 23-104 6-87 (203)
44 KOG0144 RNA-binding protein CU 99.6 2.6E-14 5.5E-19 95.6 8.1 90 17-106 25-117 (510)
45 KOG0117 Heterogeneous nuclear 99.6 2.1E-14 4.6E-19 96.3 7.7 72 26-105 259-330 (506)
46 PF13893 RRM_5: RNA recognitio 99.5 9.4E-14 2E-18 70.0 8.0 56 43-103 1-56 (56)
47 KOG0144 RNA-binding protein CU 99.5 9.4E-15 2E-19 97.7 5.3 80 25-105 123-205 (510)
48 KOG0124 Polypyrimidine tract-b 99.5 1E-14 2.2E-19 96.2 4.6 77 26-102 113-189 (544)
49 KOG0127 Nucleolar protein fibr 99.5 1E-13 2.2E-18 95.2 8.3 78 26-104 117-194 (678)
50 KOG0145 RNA-binding protein EL 99.5 3.6E-13 7.9E-18 85.5 10.0 82 22-103 274-355 (360)
51 KOG0127 Nucleolar protein fibr 99.5 3.4E-13 7.3E-18 92.7 10.0 82 22-103 288-375 (678)
52 smart00361 RRM_1 RNA recogniti 99.5 3.6E-13 7.7E-18 70.9 8.0 62 40-101 2-70 (70)
53 KOG4206 Spliceosomal protein s 99.5 2E-13 4.4E-18 84.7 7.9 78 26-106 9-90 (221)
54 KOG0131 Splicing factor 3b, su 99.5 5.5E-14 1.2E-18 84.7 4.9 102 5-106 72-177 (203)
55 KOG0147 Transcriptional coacti 99.5 1.2E-13 2.7E-18 94.4 6.3 74 29-102 281-354 (549)
56 KOG0123 Polyadenylate-binding 99.5 5.9E-13 1.3E-17 89.5 8.7 95 4-104 57-151 (369)
57 KOG0146 RNA-binding protein ET 99.4 1.5E-13 3.3E-18 87.5 5.3 87 19-105 278-364 (371)
58 KOG0415 Predicted peptidyl pro 99.4 3.7E-13 8.1E-18 88.5 7.0 84 21-104 234-317 (479)
59 KOG0124 Polypyrimidine tract-b 99.4 4.1E-13 8.9E-18 88.7 7.1 100 3-102 174-286 (544)
60 KOG4212 RNA-binding protein hn 99.4 8.8E-13 1.9E-17 88.8 8.7 76 26-102 44-120 (608)
61 TIGR01642 U2AF_lg U2 snRNP aux 99.4 2.5E-12 5.5E-17 89.6 9.7 75 21-102 170-256 (509)
62 KOG0123 Polyadenylate-binding 99.4 3.2E-12 6.9E-17 86.0 8.1 73 28-106 3-75 (369)
63 KOG0109 RNA-binding protein LA 99.4 1.9E-12 4.1E-17 83.2 5.8 70 27-104 3-72 (346)
64 KOG0110 RNA-binding protein (R 99.3 1.1E-12 2.4E-17 92.1 4.4 81 26-106 613-693 (725)
65 KOG0109 RNA-binding protein LA 99.3 3.4E-12 7.4E-17 82.0 5.7 75 23-105 75-149 (346)
66 KOG0110 RNA-binding protein (R 99.3 1.3E-11 2.8E-16 86.9 8.1 77 27-104 516-596 (725)
67 KOG0153 Predicted RNA-binding 99.3 2.4E-11 5.2E-16 79.6 8.2 79 21-105 223-302 (377)
68 PF04059 RRM_2: RNA recognitio 99.3 9.3E-11 2E-15 65.0 9.3 79 26-104 1-85 (97)
69 KOG4208 Nucleolar RNA-binding 99.3 4.4E-11 9.5E-16 73.5 8.3 83 21-103 44-127 (214)
70 KOG0146 RNA-binding protein ET 99.3 2.3E-11 5E-16 77.7 6.3 80 25-105 18-100 (371)
71 KOG1548 Transcription elongati 99.2 8.5E-11 1.9E-15 77.1 8.3 82 22-104 130-219 (382)
72 KOG0533 RRM motif-containing p 99.2 1.5E-10 3.2E-15 73.7 8.5 82 22-104 79-160 (243)
73 KOG0132 RNA polymerase II C-te 99.2 6.4E-11 1.4E-15 84.3 7.5 73 26-104 421-493 (894)
74 KOG4205 RNA-binding protein mu 99.2 4E-11 8.6E-16 78.8 5.1 80 25-105 5-84 (311)
75 KOG1457 RNA binding protein (c 99.2 7.3E-10 1.6E-14 69.2 10.3 85 22-106 30-118 (284)
76 KOG0116 RasGAP SH3 binding pro 99.2 1.6E-10 3.5E-15 78.6 7.2 78 26-104 288-365 (419)
77 KOG4211 Splicing factor hnRNP- 99.1 5.6E-10 1.2E-14 76.1 8.7 80 22-105 6-85 (510)
78 KOG4212 RNA-binding protein hn 99.1 5.8E-10 1.3E-14 75.5 7.7 80 19-103 529-608 (608)
79 KOG4660 Protein Mei2, essentia 99.1 2.2E-10 4.7E-15 79.0 4.5 72 23-99 72-143 (549)
80 KOG4661 Hsp27-ERE-TATA-binding 99.0 1.3E-09 2.7E-14 76.1 7.6 82 24-105 403-484 (940)
81 KOG0151 Predicted splicing reg 99.0 1.8E-09 3.9E-14 76.6 8.1 84 21-104 169-255 (877)
82 KOG4205 RNA-binding protein mu 99.0 7.1E-10 1.5E-14 73.0 5.6 79 26-105 97-175 (311)
83 KOG4209 Splicing factor RNPS1, 99.0 2.2E-09 4.7E-14 68.3 6.2 83 20-103 95-177 (231)
84 KOG0226 RNA-binding proteins [ 98.9 1.4E-09 3.1E-14 69.0 4.2 83 21-103 185-267 (290)
85 KOG0106 Alternative splicing f 98.9 1.5E-09 3.2E-14 67.9 4.1 70 27-104 2-71 (216)
86 KOG1190 Polypyrimidine tract-b 98.9 1.8E-08 3.8E-13 67.8 8.2 74 26-104 297-371 (492)
87 KOG4454 RNA binding protein (R 98.8 2.3E-09 4.9E-14 66.8 2.0 79 22-102 5-83 (267)
88 KOG4211 Splicing factor hnRNP- 98.8 2.6E-08 5.7E-13 68.2 7.2 78 25-104 102-180 (510)
89 KOG0120 Splicing factor U2AF, 98.7 2.9E-08 6.2E-13 68.8 4.9 87 19-105 282-368 (500)
90 PF11608 Limkain-b1: Limkain b 98.6 2.2E-07 4.9E-12 49.8 6.3 68 27-104 3-75 (90)
91 KOG0147 Transcriptional coacti 98.6 1.2E-08 2.7E-13 70.5 1.2 85 19-104 172-256 (549)
92 KOG4206 Spliceosomal protein s 98.6 4.4E-07 9.6E-12 56.8 7.9 78 23-105 143-221 (221)
93 KOG1995 Conserved Zn-finger pr 98.5 2.4E-07 5.1E-12 61.4 4.1 85 21-105 61-153 (351)
94 PF08777 RRM_3: RNA binding mo 98.4 9.3E-07 2E-11 49.9 5.8 69 27-101 2-75 (105)
95 KOG4307 RNA binding protein RB 98.4 1.8E-06 3.9E-11 61.8 8.1 76 26-102 867-943 (944)
96 KOG1365 RNA-binding protein Fu 98.4 5.4E-07 1.2E-11 60.5 4.4 81 23-104 277-360 (508)
97 COG5175 MOT2 Transcriptional r 98.3 2.8E-06 6.1E-11 56.4 6.6 78 26-103 114-200 (480)
98 KOG3152 TBP-binding protein, a 98.3 3.6E-07 7.9E-12 58.2 2.3 73 25-97 73-157 (278)
99 KOG2314 Translation initiation 98.3 7E-06 1.5E-10 57.7 8.6 79 22-101 54-139 (698)
100 KOG1457 RNA binding protein (c 98.3 1.3E-06 2.8E-11 55.0 4.4 64 27-94 211-274 (284)
101 KOG4849 mRNA cleavage factor I 98.3 1.1E-06 2.4E-11 58.5 3.9 79 22-100 76-156 (498)
102 KOG1456 Heterogeneous nuclear 98.2 1.8E-05 3.8E-10 53.3 8.0 70 32-106 128-199 (494)
103 KOG4210 Nuclear localization s 98.1 1.9E-06 4E-11 56.6 3.0 80 25-105 183-263 (285)
104 KOG0106 Alternative splicing f 98.1 2.9E-06 6.4E-11 53.3 3.6 75 23-105 96-170 (216)
105 KOG1855 Predicted RNA-binding 98.1 4.7E-06 1E-10 56.7 4.0 74 19-92 224-310 (484)
106 KOG1456 Heterogeneous nuclear 98.1 5.1E-05 1.1E-09 51.2 8.5 78 24-106 285-363 (494)
107 PF08952 DUF1866: Domain of un 98.1 4.4E-05 9.5E-10 45.3 7.3 57 41-106 51-107 (146)
108 PF05172 Nup35_RRM: Nup53/35/4 98.1 5.5E-05 1.2E-09 42.3 7.3 77 25-103 5-89 (100)
109 KOG1190 Polypyrimidine tract-b 98.0 2.8E-05 6.1E-10 52.8 6.5 76 25-105 413-490 (492)
110 KOG0120 Splicing factor U2AF, 98.0 3.5E-05 7.6E-10 53.9 7.1 65 41-105 424-491 (500)
111 KOG1365 RNA-binding protein Fu 98.0 3.5E-06 7.6E-11 56.7 2.0 73 26-100 161-237 (508)
112 PF14605 Nup35_RRM_2: Nup53/35 98.0 3.4E-05 7.4E-10 38.2 5.1 52 27-85 2-53 (53)
113 KOG0129 Predicted RNA-binding 98.0 6.8E-05 1.5E-09 52.2 8.0 65 23-88 256-326 (520)
114 KOG0129 Predicted RNA-binding 98.0 4E-05 8.6E-10 53.3 6.6 66 22-87 366-432 (520)
115 KOG1548 Transcription elongati 97.9 7.5E-05 1.6E-09 49.8 7.4 78 22-103 261-349 (382)
116 PF10309 DUF2414: Protein of u 97.8 0.00031 6.7E-09 35.8 6.7 56 25-88 4-62 (62)
117 PF08675 RNA_bind: RNA binding 97.7 0.00061 1.3E-08 36.7 6.9 57 25-90 8-64 (87)
118 KOG0112 Large RNA-binding prot 97.6 9.5E-05 2.1E-09 54.5 4.7 77 22-104 451-529 (975)
119 KOG0105 Alternative splicing f 97.6 0.00065 1.4E-08 42.0 7.1 65 22-93 111-175 (241)
120 KOG0128 RNA-binding protein SA 97.5 4.4E-05 9.5E-10 55.8 1.4 79 26-105 736-814 (881)
121 PF15023 DUF4523: Protein of u 97.3 0.0044 9.5E-08 36.8 7.5 81 15-103 75-159 (166)
122 KOG2202 U2 snRNP splicing fact 97.2 0.00017 3.6E-09 46.3 1.5 62 41-103 83-145 (260)
123 KOG1996 mRNA splicing factor [ 97.2 0.0025 5.3E-08 42.1 6.7 64 40-103 300-364 (378)
124 KOG2416 Acinus (induces apopto 97.2 0.00045 9.7E-09 49.2 3.6 78 22-105 440-521 (718)
125 KOG4307 RNA binding protein RB 97.2 0.00035 7.5E-09 50.7 3.1 78 24-102 432-510 (944)
126 KOG4676 Splicing factor, argin 97.2 0.0012 2.6E-08 45.0 5.4 75 27-102 8-85 (479)
127 PF03467 Smg4_UPF3: Smg-4/UPF3 97.2 0.0012 2.6E-08 40.7 5.0 71 23-93 4-80 (176)
128 PF07576 BRAP2: BRCA1-associat 96.9 0.029 6.2E-07 32.0 9.4 67 27-95 13-81 (110)
129 KOG2193 IGF-II mRNA-binding pr 96.9 0.0017 3.7E-08 44.8 4.0 69 28-103 3-73 (584)
130 KOG0128 RNA-binding protein SA 96.9 3.9E-05 8.5E-10 56.1 -3.9 68 27-94 668-735 (881)
131 KOG0115 RNA-binding protein p5 96.9 0.002 4.2E-08 41.6 3.9 61 27-88 32-92 (275)
132 KOG0112 Large RNA-binding prot 96.8 0.00024 5.2E-09 52.5 -0.6 81 22-103 368-448 (975)
133 KOG4660 Protein Mei2, essentia 96.8 0.0051 1.1E-07 43.6 5.6 55 50-104 413-471 (549)
134 PF04847 Calcipressin: Calcipr 96.4 0.019 4.1E-07 35.7 6.2 60 39-104 8-69 (184)
135 KOG0804 Cytoplasmic Zn-finger 95.9 0.075 1.6E-06 37.2 7.3 69 25-95 73-142 (493)
136 KOG2068 MOT2 transcription fac 95.9 0.0034 7.4E-08 41.9 0.9 79 25-103 76-160 (327)
137 KOG2135 Proteins containing th 95.8 0.0068 1.5E-07 42.3 2.0 75 24-105 370-445 (526)
138 PF03880 DbpA: DbpA RNA bindin 95.5 0.16 3.4E-06 26.7 6.8 67 28-103 2-74 (74)
139 KOG4574 RNA-binding protein (c 95.0 0.021 4.6E-07 42.6 2.6 72 27-104 299-372 (1007)
140 KOG2591 c-Mpl binding protein, 94.6 0.089 1.9E-06 37.8 4.7 69 26-101 175-247 (684)
141 KOG4285 Mitotic phosphoprotein 94.2 0.69 1.5E-05 31.0 7.8 64 26-97 197-260 (350)
142 PF11767 SET_assoc: Histone ly 92.8 0.74 1.6E-05 23.8 6.8 56 37-101 11-66 (66)
143 KOG4483 Uncharacterized conser 92.6 0.53 1.2E-05 32.8 5.6 57 24-87 389-446 (528)
144 KOG2253 U1 snRNP complex, subu 91.6 0.23 5E-06 36.3 3.1 71 22-101 36-106 (668)
145 PF15513 DUF4651: Domain of un 90.0 1 2.3E-05 22.9 3.8 21 41-61 9-29 (62)
146 KOG4210 Nuclear localization s 90.0 0.28 6.1E-06 32.6 2.3 74 25-98 87-160 (285)
147 PF07530 PRE_C2HC: Associated 88.4 1.4 3.1E-05 22.8 3.8 62 41-105 2-64 (68)
148 KOG4676 Splicing factor, argin 88.1 0.027 5.9E-07 38.7 -3.5 65 25-94 150-214 (479)
149 KOG2318 Uncharacterized conser 87.3 4.6 0.0001 29.6 6.8 75 25-99 173-299 (650)
150 smart00596 PRE_C2HC PRE_C2HC d 85.0 2.2 4.7E-05 22.3 3.3 62 41-105 2-64 (69)
151 KOG4410 5-formyltetrahydrofola 84.1 6.2 0.00014 26.5 5.8 47 26-78 330-377 (396)
152 KOG2295 C2H2 Zn-finger protein 75.3 0.29 6.4E-06 35.3 -2.4 72 25-96 230-301 (648)
153 KOG2891 Surface glycoprotein [ 73.0 7.8 0.00017 26.0 3.8 70 23-92 146-246 (445)
154 PF07292 NID: Nmi/IFP 35 domai 72.9 2.7 5.9E-05 23.0 1.5 25 24-48 50-74 (88)
155 PF02714 DUF221: Domain of unk 70.5 7.3 0.00016 26.1 3.4 22 71-92 1-22 (325)
156 KOG2193 IGF-II mRNA-binding pr 69.6 0.11 2.4E-06 36.2 -5.3 75 25-102 79-153 (584)
157 KOG4454 RNA binding protein (R 69.5 0.87 1.9E-05 29.3 -1.0 66 26-92 80-149 (267)
158 PF03468 XS: XS domain; Inter 67.7 21 0.00046 20.5 5.2 45 39-86 30-75 (116)
159 PRK14548 50S ribosomal protein 64.5 21 0.00046 19.3 5.4 56 30-88 24-81 (84)
160 KOG1295 Nonsense-mediated deca 63.9 14 0.00031 25.7 3.8 69 25-93 6-77 (376)
161 COG5193 LHP1 La protein, small 63.0 3.9 8.4E-05 28.6 1.0 64 23-86 171-244 (438)
162 KOG4019 Calcineurin-mediated s 63.0 9.4 0.0002 23.9 2.5 73 26-104 10-88 (193)
163 COG0724 RNA-binding proteins ( 61.9 16 0.00035 22.8 3.7 41 22-62 221-261 (306)
164 TIGR03636 L23_arch archaeal ri 61.0 24 0.00051 18.8 5.2 57 29-88 16-74 (77)
165 KOG4213 RNA-binding protein La 55.0 24 0.00052 22.2 3.3 56 26-87 111-169 (205)
166 COG5507 Uncharacterized conser 50.0 32 0.00068 19.4 3.0 22 67-88 65-86 (117)
167 TIGR02542 B_forsyth_147 Bacter 49.6 31 0.00067 20.1 3.0 46 35-80 83-131 (145)
168 PF10567 Nab6_mRNP_bdg: RNA-re 49.1 62 0.0013 22.0 4.7 57 24-80 13-76 (309)
169 PF02946 GTF2I: GTF2I-like rep 46.5 34 0.00075 18.2 2.7 31 26-56 34-72 (76)
170 PF11411 DNA_ligase_IV: DNA li 43.8 21 0.00045 16.1 1.4 16 36-51 19-34 (36)
171 PF03439 Spt5-NGN: Early trans 42.0 42 0.00091 17.9 2.7 26 67-92 43-68 (84)
172 PRK11901 hypothetical protein; 41.9 1.2E+02 0.0025 21.0 6.2 67 20-90 239-306 (327)
173 COG5584 Predicted small secret 40.2 69 0.0015 18.0 3.8 31 33-63 29-59 (103)
174 PHA01632 hypothetical protein 36.4 42 0.00091 16.7 1.9 19 31-49 21-39 (64)
175 PF14111 DUF4283: Domain of un 34.5 28 0.0006 20.4 1.4 31 29-59 107-138 (153)
176 KOG4008 rRNA processing protei 34.3 45 0.00098 21.9 2.4 34 22-55 36-69 (261)
177 smart00457 MACPF membrane-atta 32.5 41 0.0009 20.8 2.0 28 31-58 30-59 (194)
178 PF15407 Spo7_2_N: Sporulation 32.2 8.6 0.00019 19.9 -0.9 22 21-42 22-43 (67)
179 PF15063 TC1: Thyroid cancer p 32.1 30 0.00066 18.4 1.1 28 26-53 25-52 (79)
180 COG3254 Uncharacterized conser 32.0 1E+02 0.0022 17.5 5.2 42 41-85 27-68 (105)
181 PF00054 Laminin_G_1: Laminin 30.5 13 0.00027 21.5 -0.5 11 25-35 91-101 (131)
182 cd00027 BRCT Breast Cancer Sup 30.0 71 0.0015 15.1 3.1 26 27-52 2-27 (72)
183 PRK01178 rps24e 30S ribosomal 29.5 1.1E+02 0.0024 17.1 5.1 47 37-84 30-81 (99)
184 PRK10905 cell division protein 29.3 2E+02 0.0043 20.0 6.0 68 19-90 240-308 (328)
185 KOG3424 40S ribosomal protein 28.7 1.3E+02 0.0028 17.6 5.1 46 36-82 33-83 (132)
186 KOG0156 Cytochrome P450 CYP2 s 28.1 1.9E+02 0.0041 21.1 4.9 58 31-98 37-97 (489)
187 PF14893 PNMA: PNMA 27.8 52 0.0011 22.7 2.0 26 24-49 16-41 (331)
188 PF03108 DBD_Tnp_Mut: MuDR fam 27.3 77 0.0017 15.8 2.2 15 74-88 9-23 (67)
189 PF13046 DUF3906: Protein of u 26.6 1E+02 0.0022 15.8 2.6 33 39-73 31-63 (64)
190 KOG1134 Uncharacterized conser 26.3 1.1E+02 0.0025 23.5 3.6 35 66-100 303-337 (728)
191 COG0030 KsgA Dimethyladenosine 25.2 1.7E+02 0.0037 19.5 3.9 28 26-53 95-122 (259)
192 PF01823 MACPF: MAC/Perforin d 24.7 46 0.001 20.5 1.2 23 31-53 53-78 (212)
193 PF12007 DUF3501: Protein of u 24.6 2E+02 0.0042 18.3 4.6 47 39-92 65-111 (192)
194 PF10281 Ish1: Putative stress 24.3 78 0.0017 14.0 1.7 16 37-52 3-18 (38)
195 PF09902 DUF2129: Uncharacteri 23.9 1.2E+02 0.0027 15.8 4.2 38 46-92 16-53 (71)
196 PF11823 DUF3343: Protein of u 23.3 1.2E+02 0.0026 15.5 3.1 23 70-92 3-25 (73)
197 PF12623 Hen1_L: RNA repair, l 22.5 2.4E+02 0.0052 18.6 4.6 64 25-88 117-183 (245)
198 PF06613 KorB_C: KorB C-termin 22.1 1.3E+02 0.0027 15.3 2.3 12 66-77 29-40 (60)
199 PF13773 DUF4170: Domain of un 22.1 1.4E+02 0.003 15.6 2.8 31 71-101 28-58 (69)
200 PF00398 RrnaAD: Ribosomal RNA 21.9 1.2E+02 0.0027 19.7 2.9 24 25-48 96-119 (262)
201 COG5594 Uncharacterized integr 21.4 1.3E+02 0.0028 23.6 3.1 36 68-103 357-393 (827)
202 TIGR00755 ksgA dimethyladenosi 21.1 1.8E+02 0.0039 18.8 3.5 23 29-51 97-119 (253)
203 PF05573 NosL: NosL; InterPro 21.0 2E+02 0.0043 17.1 3.4 22 68-89 114-135 (149)
204 PHA01627 DNA binding protein 20.8 1.8E+02 0.004 16.6 3.7 35 71-105 39-73 (107)
205 PRK08559 nusG transcription an 20.7 1.5E+02 0.0032 17.8 2.8 26 67-92 45-70 (153)
206 PF05189 RTC_insert: RNA 3'-te 20.6 1.7E+02 0.0036 16.1 5.1 46 28-73 12-62 (103)
207 cd06257 DnaJ DnaJ domain or J- 20.1 1.2E+02 0.0025 14.0 2.1 19 32-50 6-24 (55)
No 1
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.88 E-value=5e-21 Score=113.63 Aligned_cols=83 Identities=55% Similarity=0.896 Sum_probs=78.2
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
....++|||+|||+++++++|+++|++||.|..+.++.+..++.+++||||+|.+.++|++|+..|++..+.|++|+|.+
T Consensus 31 ~~~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV~F~~~e~A~~Al~~lng~~i~Gr~l~V~~ 110 (144)
T PLN03134 31 RLMSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKELNGRHIRVNP 110 (144)
T ss_pred cCCCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHcCCCEECCEEEEEEe
Confidence 34567899999999999999999999999999999999998999999999999999999999999999999999999999
Q ss_pred ccC
Q 034021 103 AHK 105 (106)
Q Consensus 103 s~~ 105 (106)
+.+
T Consensus 111 a~~ 113 (144)
T PLN03134 111 AND 113 (144)
T ss_pred CCc
Confidence 764
No 2
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.83 E-value=1.7e-19 Score=120.24 Aligned_cols=79 Identities=38% Similarity=0.581 Sum_probs=75.7
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
.++|||+|||.++++++|+++|++||.|..+.+++++.++.++|||||+|.+.++|.+|+..|||..++|++|+|.|..
T Consensus 269 ~~~lfV~NL~~~~~e~~L~~~F~~fG~v~~v~i~~d~~t~~skG~aFV~F~~~~~A~~Ai~~lnG~~~~gr~i~V~~~~ 347 (352)
T TIGR01661 269 GYCIFVYNLSPDTDETVLWQLFGPFGAVQNVKIIRDLTTNQCKGYGFVSMTNYDEAAMAILSLNGYTLGNRVLQVSFKT 347 (352)
T ss_pred CcEEEEeCCCCCCCHHHHHHHHHhCCCeEEEEEeEcCCCCCccceEEEEECCHHHHHHHHHHhCCCEECCeEEEEEEcc
Confidence 4579999999999999999999999999999999999899999999999999999999999999999999999999964
No 3
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.82 E-value=1.7e-19 Score=120.17 Aligned_cols=81 Identities=36% Similarity=0.629 Sum_probs=76.9
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
...+|||+|||.++++++|+++|+.||+|..+.+++++.+|.++|||||+|.+.++|++|+..|+|..+.|++|+|.+++
T Consensus 2 ~~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~~~g~s~g~afV~f~~~~~A~~Ai~~l~g~~l~g~~i~v~~a~ 81 (352)
T TIGR01661 2 SKTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDKVTGQSLGYGFVNYVRPEDAEKAVNSLNGLRLQNKTIKVSYAR 81 (352)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcCCCCccceEEEEEECcHHHHHHHHhhcccEEECCeeEEEEeec
Confidence 36789999999999999999999999999999999998889999999999999999999999999999999999999876
Q ss_pred C
Q 034021 105 K 105 (106)
Q Consensus 105 ~ 105 (106)
+
T Consensus 82 ~ 82 (352)
T TIGR01661 82 P 82 (352)
T ss_pred c
Confidence 4
No 4
>KOG0122 consensus Translation initiation factor 3, subunit g (eIF-3g) [Translation, ribosomal structure and biogenesis]
Probab=99.82 E-value=2.2e-19 Score=112.02 Aligned_cols=84 Identities=30% Similarity=0.492 Sum_probs=80.3
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.-.+..+|.|.||+.++++++|+++|.+||.|..+.+.+++.+|.++|||||.|.+.++|++|+..|+|+-.++--|+|+
T Consensus 185 ~R~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK~TG~~kGFAFVtF~sRddA~rAI~~LnG~gyd~LILrvE 264 (270)
T KOG0122|consen 185 ERDDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDKETGLSKGFAFVTFESRDDAARAIADLNGYGYDNLILRVE 264 (270)
T ss_pred cCCccceeEEecCccccChhHHHHHhhccCccceeEEEEccccCcccceEEEEEecHHHHHHHHHHccCcccceEEEEEE
Confidence 33467889999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eccC
Q 034021 102 YAHK 105 (106)
Q Consensus 102 ~s~~ 105 (106)
||+|
T Consensus 265 wskP 268 (270)
T KOG0122|consen 265 WSKP 268 (270)
T ss_pred ecCC
Confidence 9987
No 5
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.82 E-value=2.4e-19 Score=119.32 Aligned_cols=85 Identities=34% Similarity=0.535 Sum_probs=79.7
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEE
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRV 100 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v 100 (106)
......++|||++||+++++++|+++|+.||.|..++++.+..++.++|||||+|.+.++|++|+..|++..+.+++|+|
T Consensus 102 ~~~~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~~tg~srGyaFVeF~~~e~A~~Ai~~LnG~~l~gr~i~V 181 (346)
T TIGR01659 102 DTNNSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDYKTGYSFGYAFVDFGSEADSQRAIKNLNGITVRNKRLKV 181 (346)
T ss_pred CCCCCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCccCcEEEEEEccHHHHHHHHHHcCCCccCCceeee
Confidence 34556899999999999999999999999999999999999889999999999999999999999999999999999999
Q ss_pred EeccC
Q 034021 101 NYAHK 105 (106)
Q Consensus 101 ~~s~~ 105 (106)
.++++
T Consensus 182 ~~a~p 186 (346)
T TIGR01659 182 SYARP 186 (346)
T ss_pred ecccc
Confidence 98764
No 6
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.82 E-value=6e-19 Score=92.53 Aligned_cols=70 Identities=43% Similarity=0.792 Sum_probs=66.4
Q ss_pred EEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 29 LFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 29 l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
|||+|||.++++++|+++|+.||.+..+.+..+ .++..+++|||+|.+.++|++|+..++|..+.|+.|+
T Consensus 1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~-~~~~~~~~a~V~F~~~~~a~~a~~~l~g~~~~~~~ir 70 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRN-SSGKSKGYAFVEFESEEDAEKALEELNGKKINGRKIR 70 (70)
T ss_dssp EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEE-TTSSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred cEEcCCCCcCCHHHHHHHHHHhhhccccccccc-ccccccceEEEEEcCHHHHHHHHHHcCCCEECccCcC
Confidence 799999999999999999999999999999887 4788899999999999999999999999999999886
No 7
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.77 E-value=1.4e-17 Score=87.57 Aligned_cols=70 Identities=37% Similarity=0.686 Sum_probs=64.4
Q ss_pred EEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 29 LFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 29 l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
|||+|||+++++++|+++|+.+|.|..+.+...+. +..+++|||+|.+.++|.+|+..+++..++|++|+
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~~~~~-~~~~~~a~v~f~~~~~a~~al~~~~~~~~~g~~l~ 70 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLIKNKD-GQSRGFAFVEFSSEEDAKRALELLNGKEIDGRKLR 70 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTSSBEEEEEEEESTT-SSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEEE
T ss_pred CEEeCCCCCCCHHHHHHHHHhcCCcceEEEEeeec-cccCCEEEEEeCCHHHHHHHHHHCCCcEECCEEcC
Confidence 78999999999999999999999999999999875 88999999999999999999999998999999885
No 8
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.77 E-value=9.5e-18 Score=111.73 Aligned_cols=82 Identities=28% Similarity=0.574 Sum_probs=75.5
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCC--eEEEEE
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDG--RNIRVN 101 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g--~~l~v~ 101 (106)
...++|||+|||.++++++|+++|++||.|..+.++++..+++.+++|||+|.+.++|++|++.|++..+.+ ++|+|.
T Consensus 191 ~~~~~lfV~nLp~~vtee~L~~~F~~fG~V~~v~i~~d~~tg~~kG~aFV~F~~~e~A~~Ai~~lng~~~~g~~~~l~V~ 270 (346)
T TIGR01659 191 IKDTNLYVTNLPRTITDDQLDTIFGKYGQIVQKNILRDKLTGTPRGVAFVRFNKREEAQEAISALNNVIPEGGSQPLTVR 270 (346)
T ss_pred cccceeEEeCCCCcccHHHHHHHHHhcCCEEEEEEeecCCCCccceEEEEEECCHHHHHHHHHHhCCCccCCCceeEEEE
Confidence 356789999999999999999999999999999999998899999999999999999999999999998865 689998
Q ss_pred eccC
Q 034021 102 YAHK 105 (106)
Q Consensus 102 ~s~~ 105 (106)
+++.
T Consensus 271 ~a~~ 274 (346)
T TIGR01659 271 LAEE 274 (346)
T ss_pred ECCc
Confidence 8753
No 9
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.76 E-value=1.4e-17 Score=116.74 Aligned_cols=80 Identities=25% Similarity=0.468 Sum_probs=76.0
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
..++|||+|||.++++++|+++|+.||.|..+.+.+++.++.++|||||+|.+.++|.+|+..||+..++|+.|+|.++-
T Consensus 203 ~~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~~tgksKGfGFVeFe~~e~A~kAI~amNg~elgGr~LrV~kAi 282 (612)
T TIGR01645 203 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDLGGQYLRVGKCV 282 (612)
T ss_pred ccceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecCCCCCcCCeEEEEECCHHHHHHHHHHhCCCeeCCeEEEEEecC
Confidence 34789999999999999999999999999999999998888899999999999999999999999999999999998864
No 10
>KOG0107 consensus Alternative splicing factor SRp20/9G8 (RRM superfamily) [RNA processing and modification]
Probab=99.76 E-value=6.3e-18 Score=100.98 Aligned_cols=76 Identities=30% Similarity=0.522 Sum_probs=71.0
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
...++|||+||+..+++.+|+..|..||++..+++.+.+ .|||||+|++..+|+.|+..|+|..|.|.+++|+++
T Consensus 8 ~~~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvArnP-----PGfAFVEFed~RDA~DAvr~LDG~~~cG~r~rVE~S 82 (195)
T KOG0107|consen 8 NGNTKVYVGNLGSRATKRELERAFSKYGPLRSVWVARNP-----PGFAFVEFEDPRDAEDAVRYLDGKDICGSRIRVELS 82 (195)
T ss_pred CCCceEEeccCCCCcchHHHHHHHHhcCcceeEEEeecC-----CCceEEeccCcccHHHHHhhcCCccccCceEEEEee
Confidence 457899999999999999999999999999999997754 579999999999999999999999999999999997
Q ss_pred c
Q 034021 104 H 104 (106)
Q Consensus 104 ~ 104 (106)
.
T Consensus 83 ~ 83 (195)
T KOG0107|consen 83 T 83 (195)
T ss_pred c
Confidence 5
No 11
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.74 E-value=2.7e-17 Score=114.99 Aligned_cols=100 Identities=27% Similarity=0.425 Sum_probs=85.5
Q ss_pred ccccCceeeccCccccccccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHH
Q 034021 2 QWLNGSVRFSHSVRFTRLLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAA 81 (106)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a 81 (106)
..+|+...+..++..+. ...+...++|||+|||+++++++|+++|+.+|.|..++++++ .++.++|||||+|.+.++|
T Consensus 35 ~~~~g~r~~g~Pp~~~~-~~~p~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D-~sG~sRGfaFV~F~~~e~A 112 (578)
T TIGR01648 35 VQENGQRKYGGPPPGWS-GVQPGRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMD-FSGQNRGYAFVTFCGKEEA 112 (578)
T ss_pred cccCCcccCCCCCCccc-CCCCCCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEEC-CCCCccceEEEEeCCHHHH
Confidence 45677777777776653 344556799999999999999999999999999999999999 5899999999999999999
Q ss_pred HHHHHHhCCceeC-CeEEEEEec
Q 034021 82 GEAIKEMDGLLLD-GRNIRVNYA 103 (106)
Q Consensus 82 ~~al~~l~~~~~~-g~~l~v~~s 103 (106)
++|+..|++..+. ++.|.|..+
T Consensus 113 ~~Ai~~lng~~i~~Gr~l~V~~S 135 (578)
T TIGR01648 113 KEAVKLLNNYEIRPGRLLGVCIS 135 (578)
T ss_pred HHHHHHcCCCeecCCcccccccc
Confidence 9999999999885 677766654
No 12
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.74 E-value=2.5e-17 Score=115.77 Aligned_cols=79 Identities=37% Similarity=0.630 Sum_probs=75.4
Q ss_pred eEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccCC
Q 034021 28 KLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHKD 106 (106)
Q Consensus 28 ~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~~ 106 (106)
+|||+|||.++++++|+++|+.||.|..+++.++..+++++|||||+|.+.++|++|+..+++..+.|+.|+|.|+.+|
T Consensus 2 sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~~t~~s~G~afV~F~~~~~A~~Al~~ln~~~i~gk~i~i~~s~~~ 80 (562)
T TIGR01628 2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDSVTRRSLGYGYVNFQNPADAERALETMNFKRLGGKPIRIMWSQRD 80 (562)
T ss_pred eEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHhCCCEECCeeEEeeccccc
Confidence 6999999999999999999999999999999999888999999999999999999999999999999999999998653
No 13
>KOG0149 consensus Predicted RNA-binding protein SEB4 (RRM superfamily) [General function prediction only]
Probab=99.74 E-value=7.6e-18 Score=104.65 Aligned_cols=78 Identities=45% Similarity=0.777 Sum_probs=71.5
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
.-++|||+||+|.+..++|+++|+.||+|.+..++.|+.+|+++||+||.|+|.++|.+||+. .+..|+|++..+.+|
T Consensus 11 ~~TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~~t~rskGyGfVTf~d~~aa~rAc~d-p~piIdGR~aNcnlA 88 (247)
T KOG0149|consen 11 TFTKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDKNTGRSKGYGFVTFRDAEAATRACKD-PNPIIDGRKANCNLA 88 (247)
T ss_pred eEEEEEEcCcccccchHHHHHHHHHhCceEEEEEEeccCCccccceeeEEeecHHHHHHHhcC-CCCcccccccccchh
Confidence 457899999999999999999999999999999999999999999999999999999999994 566789998777664
No 14
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.74 E-value=2.4e-17 Score=115.62 Aligned_cols=79 Identities=30% Similarity=0.643 Sum_probs=75.3
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
..++|||+|||+++++++|+++|+.||.|..+.++.++.+++++|||||+|.+.++|++|+..+||..++|+.|+|.+.
T Consensus 106 ~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~~TgkskGfAFVeF~s~e~A~~Ai~~lnG~~i~GR~IkV~rp 184 (612)
T TIGR01645 106 IMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQMLGGRNIKVGRP 184 (612)
T ss_pred CCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecCCCCCcCCeEEEEeCcHHHHHHHHHhcCCeEEecceeeeccc
Confidence 4679999999999999999999999999999999999889999999999999999999999999999999999999754
No 15
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.73 E-value=7.4e-17 Score=102.60 Aligned_cols=75 Identities=17% Similarity=0.292 Sum_probs=69.4
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
.++|||+|||+.+++++|+++|+.||.|..+.++.+.. .+|+|||+|.+.++++.|+. |+|..+.|+.|.|.++.
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~~---~~GfAFVtF~d~eaAe~All-LnG~~l~gr~V~Vt~a~ 78 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSENE---RSQIAYVTFKDPQGAETALL-LSGATIVDQSVTITPAE 78 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecCC---CCCEEEEEeCcHHHHHHHHH-hcCCeeCCceEEEEecc
Confidence 57899999999999999999999999999999988753 45799999999999999996 99999999999999875
No 16
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.72 E-value=1.7e-16 Score=110.32 Aligned_cols=80 Identities=25% Similarity=0.471 Sum_probs=76.1
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
..++|||+|||..+++++|+++|+.||.|..+.++.+..+|.++|||||+|.+.++|..|+..|+|..++|+.|.|.++.
T Consensus 294 ~~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~~~g~~~g~afv~f~~~~~a~~A~~~l~g~~~~~~~l~v~~a~ 373 (509)
T TIGR01642 294 SKDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDIATGLSKGYAFCEYKDPSVTDVAIAALNGKDTGDNKLHVQRAC 373 (509)
T ss_pred CCCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecCCCCCcCeEEEEEECCHHHHHHHHHHcCCCEECCeEEEEEECc
Confidence 45799999999999999999999999999999999998889999999999999999999999999999999999999874
No 17
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.71 E-value=1.5e-16 Score=109.41 Aligned_cols=79 Identities=38% Similarity=0.720 Sum_probs=75.7
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
.++|||+|||.++++++|+++|+.||.|..+.+..+..+|..+|||||+|.+.++|.+|+..|+|..+.|++|+|.++.
T Consensus 186 ~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~~~g~~~g~afV~f~~~e~A~~A~~~l~g~~i~g~~i~v~~a~ 264 (457)
T TIGR01622 186 FLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDPETGRSKGFGFIQFHDAEEAKEALEVMNGFELAGRPIKVGYAQ 264 (457)
T ss_pred CCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcCCCCccceEEEEEECCHHHHHHHHHhcCCcEECCEEEEEEEcc
Confidence 6899999999999999999999999999999999998888899999999999999999999999999999999999964
No 18
>KOG0117 consensus Heterogeneous nuclear ribonucleoprotein R (RRM superfamily) [RNA processing and modification]
Probab=99.71 E-value=1.3e-16 Score=106.67 Aligned_cols=99 Identities=27% Similarity=0.449 Sum_probs=85.3
Q ss_pred cCceeeccCccccccccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHH
Q 034021 5 NGSVRFSHSVRFTRLLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEA 84 (106)
Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 84 (106)
+++.-+.-++.. +....+...+.|||++||.++.+++|..+|++.|.|-++++++++.+|.++|||||.|.+.+.|+.|
T Consensus 63 ~gqrk~ggPpP~-weg~~p~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~~sG~nRGYAFVtf~~Ke~Aq~A 141 (506)
T KOG0117|consen 63 NGQRKYGGPPPG-WEGPPPPRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDPFSGDNRGYAFVTFCTKEEAQEA 141 (506)
T ss_pred ccccccCCCCCc-ccCCCCCCCceEEecCCCccccchhhHHHHHhccceeeEEEeecccCCCCcceEEEEeecHHHHHHH
Confidence 344445555555 3345557889999999999999999999999999999999999988999999999999999999999
Q ss_pred HHHhCCcee-CCeEEEEEecc
Q 034021 85 IKEMDGLLL-DGRNIRVNYAH 104 (106)
Q Consensus 85 l~~l~~~~~-~g~~l~v~~s~ 104 (106)
++.||+.+| .|+.|.|..|-
T Consensus 142 ik~lnn~Eir~GK~igvc~Sv 162 (506)
T KOG0117|consen 142 IKELNNYEIRPGKLLGVCVSV 162 (506)
T ss_pred HHHhhCccccCCCEeEEEEee
Confidence 999999987 57888887753
No 19
>KOG0113 consensus U1 small nuclear ribonucleoprotein (RRM superfamily) [RNA processing and modification]
Probab=99.71 E-value=1.1e-16 Score=102.47 Aligned_cols=80 Identities=36% Similarity=0.709 Sum_probs=76.1
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
..=+||||+.|+.++++..|++.|+.||+|..|.++++..+|+++|||||+|++..++.+|-+..+|..|+|+.|.|.+-
T Consensus 99 DPy~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~vTgkskGYAFIeye~erdm~~AYK~adG~~Idgrri~VDvE 178 (335)
T KOG0113|consen 99 DPYKTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDKVTGKSKGYAFIEYEHERDMKAAYKDADGIKIDGRRILVDVE 178 (335)
T ss_pred CccceeeeeeccccccHHHHHHHHHhcCcceeEEEeeecccCCccceEEEEeccHHHHHHHHHhccCceecCcEEEEEec
Confidence 34589999999999999999999999999999999999999999999999999999999999999999999999999763
No 20
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.71 E-value=2.4e-16 Score=108.33 Aligned_cols=82 Identities=26% Similarity=0.420 Sum_probs=76.2
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.+...++|||+|||..+++++|+++|+.||.|..+.++.+..++..+|||||+|.+.++|++|+. ++|..+.|++|.|.
T Consensus 85 ~~~~~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~~~~~skg~afVeF~~~e~A~~Al~-l~g~~~~g~~i~v~ 163 (457)
T TIGR01622 85 AERDDRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDRNSRRSKGVAYVEFYDVESVIKALA-LTGQMLLGRPIIVQ 163 (457)
T ss_pred cccCCcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecCCCCCcceEEEEEECCHHHHHHHHH-hCCCEECCeeeEEe
Confidence 34457899999999999999999999999999999999998889999999999999999999998 89999999999998
Q ss_pred ecc
Q 034021 102 YAH 104 (106)
Q Consensus 102 ~s~ 104 (106)
++.
T Consensus 164 ~~~ 166 (457)
T TIGR01622 164 SSQ 166 (457)
T ss_pred ecc
Confidence 764
No 21
>smart00362 RRM_2 RNA recognition motif.
Probab=99.71 E-value=3.2e-16 Score=81.71 Aligned_cols=72 Identities=43% Similarity=0.724 Sum_probs=66.3
Q ss_pred eEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 28 KLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 28 ~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
+|+|+|||.+++.++|+++|+.||.+..+.+..+. +..++++||+|.+.+.|+.|+..+++..+.|++++|+
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~~--~~~~~~~~v~f~~~~~a~~a~~~~~~~~~~~~~i~v~ 72 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKDT--GKSKGFAFVEFESEEDAEKAIEALNGTKLGGRPLRVE 72 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecCC--CCCCceEEEEeCCHHHHHHHHHHhCCcEECCEEEeeC
Confidence 48999999999999999999999999999888765 6678899999999999999999999999999998873
No 22
>KOG0145 consensus RNA-binding protein ELAV/HU (RRM superfamily) [RNA processing and modification]
Probab=99.70 E-value=1.3e-16 Score=100.98 Aligned_cols=84 Identities=37% Similarity=0.653 Sum_probs=79.9
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.+...+.+.|.-||..+++++++.+|...|.|+.|++++|+-+|.+.||+||.|.++++|++|+..|||..+..+.|+|+
T Consensus 37 t~~skTNLIvNYLPQ~MTqdE~rSLF~SiGeiEScKLvRDKitGqSLGYGFVNYv~p~DAe~AintlNGLrLQ~KTIKVS 116 (360)
T KOG0145|consen 37 TDESKTNLIVNYLPQNMTQDELRSLFGSIGEIESCKLVRDKITGQSLGYGFVNYVRPKDAEKAINTLNGLRLQNKTIKVS 116 (360)
T ss_pred cCcccceeeeeecccccCHHHHHHHhhcccceeeeeeeeccccccccccceeeecChHHHHHHHhhhcceeeccceEEEE
Confidence 34567889999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred eccC
Q 034021 102 YAHK 105 (106)
Q Consensus 102 ~s~~ 105 (106)
|++|
T Consensus 117 yARP 120 (360)
T KOG0145|consen 117 YARP 120 (360)
T ss_pred eccC
Confidence 9986
No 23
>KOG0125 consensus Ataxin 2-binding protein (RRM superfamily) [General function prediction only]
Probab=99.69 E-value=1.5e-16 Score=103.05 Aligned_cols=84 Identities=27% Similarity=0.570 Sum_probs=75.1
Q ss_pred ccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 20 LSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 20 ~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
.......++|+|+|||+...+.||+..|.+||.|.++.|+... ..++||+||.|++.+++++|-.+|||..|.|++|+
T Consensus 90 s~s~~~pkRLhVSNIPFrFRdpDL~aMF~kfG~VldVEIIfNE--RGSKGFGFVTmen~~dadRARa~LHgt~VEGRkIE 167 (376)
T KOG0125|consen 90 SSSKDTPKRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEIIFNE--RGSKGFGFVTMENPADADRARAELHGTVVEGRKIE 167 (376)
T ss_pred CCCCCCCceeEeecCCccccCccHHHHHHhhCceeeEEEEecc--CCCCccceEEecChhhHHHHHHHhhcceeeceEEE
Confidence 3444557899999999999999999999999999999998764 44789999999999999999999999999999999
Q ss_pred EEeccC
Q 034021 100 VNYAHK 105 (106)
Q Consensus 100 v~~s~~ 105 (106)
|..+..
T Consensus 168 Vn~ATa 173 (376)
T KOG0125|consen 168 VNNATA 173 (376)
T ss_pred Eeccch
Confidence 988753
No 24
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.69 E-value=3.6e-16 Score=109.92 Aligned_cols=81 Identities=33% Similarity=0.620 Sum_probs=76.0
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
....+|||+||+.++++++|+++|+.||.|..+.++.+. ++..+|+|||+|.+.++|.+|+..+||..++|++|.|.++
T Consensus 283 ~~~~~l~V~nl~~~~~~~~L~~~F~~~G~i~~~~i~~d~-~g~~~g~gfV~f~~~~~A~~A~~~~~g~~~~gk~l~V~~a 361 (562)
T TIGR01628 283 AQGVNLYVKNLDDTVTDEKLRELFSECGEITSAKVMLDE-KGVSRGFGFVCFSNPEEANRAVTEMHGRMLGGKPLYVALA 361 (562)
T ss_pred cCCCEEEEeCCCCccCHHHHHHHHHhcCCeEEEEEEECC-CCCcCCeEEEEeCCHHHHHHHHHHhcCCeeCCceeEEEec
Confidence 345789999999999999999999999999999999984 7889999999999999999999999999999999999998
Q ss_pred cC
Q 034021 104 HK 105 (106)
Q Consensus 104 ~~ 105 (106)
.+
T Consensus 362 ~~ 363 (562)
T TIGR01628 362 QR 363 (562)
T ss_pred cC
Confidence 65
No 25
>KOG4207 consensus Predicted splicing factor, SR protein superfamily [RNA processing and modification]
Probab=99.68 E-value=1.5e-16 Score=97.56 Aligned_cols=85 Identities=38% Similarity=0.611 Sum_probs=79.0
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEE
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRV 100 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v 100 (106)
.....-..|.|-||.+-++.++|+.+|++||.|-.|.|+++..++..+|||||.|.+..+|+.|+.+|+|.+++|+.|+|
T Consensus 8 Pdv~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr~Tr~sRgFaFVrf~~k~daedA~damDG~~ldgRelrV 87 (256)
T KOG4207|consen 8 PDVEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDRYTRQSRGFAFVRFHDKRDAEDALDAMDGAVLDGRELRV 87 (256)
T ss_pred CCcccceeEEecceeccCCHHHHHHHHHHhCcccceecccccccccccceeEEEeeecchHHHHHHhhcceeeccceeee
Confidence 34445678999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EeccC
Q 034021 101 NYAHK 105 (106)
Q Consensus 101 ~~s~~ 105 (106)
+.|+-
T Consensus 88 q~ary 92 (256)
T KOG4207|consen 88 QMARY 92 (256)
T ss_pred hhhhc
Confidence 98753
No 26
>KOG0114 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.68 E-value=5.8e-16 Score=85.54 Aligned_cols=83 Identities=23% Similarity=0.378 Sum_probs=73.1
Q ss_pred ccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 20 LSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 20 ~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
..++..++.|||+|||++++.+++.++|..||.|..+++-..++ .+|.|||.|++..+|.+|+..|.|..+.++.+.
T Consensus 12 rlppevnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k~---TrGTAFVVYedi~dAk~A~dhlsg~n~~~ryl~ 88 (124)
T KOG0114|consen 12 RLPPEVNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTKE---TRGTAFVVYEDIFDAKKACDHLSGYNVDNRYLV 88 (124)
T ss_pred CCChhhheeEEEecCCccccHHHHHHHhhcccceEEEEecCccC---cCceEEEEehHhhhHHHHHHHhcccccCCceEE
Confidence 34455678999999999999999999999999999999865543 578999999999999999999999999999999
Q ss_pred EEeccC
Q 034021 100 VNYAHK 105 (106)
Q Consensus 100 v~~s~~ 105 (106)
|-+-.+
T Consensus 89 vlyyq~ 94 (124)
T KOG0114|consen 89 VLYYQP 94 (124)
T ss_pred EEecCH
Confidence 987543
No 27
>KOG0148 consensus Apoptosis-promoting RNA-binding protein TIA-1/TIAR (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=99.68 E-value=4.6e-16 Score=98.67 Aligned_cols=77 Identities=29% Similarity=0.616 Sum_probs=71.8
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
...++++||++||+..++++.|++.|++||+|.+++++.++ ||+||.|.+.++|.+|+..+|+.+|.|+.++..
T Consensus 160 ssp~NtsVY~G~I~~~lte~~mr~~Fs~fG~I~EVRvFk~q------GYaFVrF~tkEaAahAIv~mNntei~G~~VkCs 233 (321)
T KOG0148|consen 160 SSPDNTSVYVGNIASGLTEDLMRQTFSPFGPIQEVRVFKDQ------GYAFVRFETKEAAAHAIVQMNNTEIGGQLVRCS 233 (321)
T ss_pred CCCCCceEEeCCcCccccHHHHHHhcccCCcceEEEEeccc------ceEEEEecchhhHHHHHHHhcCceeCceEEEEe
Confidence 34467999999999999999999999999999999998774 799999999999999999999999999999999
Q ss_pred ecc
Q 034021 102 YAH 104 (106)
Q Consensus 102 ~s~ 104 (106)
|.+
T Consensus 234 WGK 236 (321)
T KOG0148|consen 234 WGK 236 (321)
T ss_pred ccc
Confidence 865
No 28
>PLN03213 repressor of silencing 3; Provisional
Probab=99.68 E-value=4.7e-16 Score=105.71 Aligned_cols=79 Identities=19% Similarity=0.437 Sum_probs=72.0
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCH--HHHHHHHHHhCCceeCCeEEEE
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASE--AAAGEAIKEMDGLLLDGRNIRV 100 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~--~~a~~al~~l~~~~~~g~~l~v 100 (106)
.....+|||+||++.+++++|+..|+.||.|..+.|++. +| +|||||+|.+. .++.+|+..|+|..+.|+.|+|
T Consensus 7 ~~~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIpRE--TG--RGFAFVEMssdddaEeeKAISaLNGAEWKGR~LKV 82 (759)
T PLN03213 7 GGGGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFVRT--KG--RSFAYIDFSPSSTNSLTKLFSTYNGCVWKGGRLRL 82 (759)
T ss_pred CCcceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEecc--cC--CceEEEEecCCcHHHHHHHHHHhcCCeecCceeEE
Confidence 345688999999999999999999999999999999944 45 89999999976 7899999999999999999999
Q ss_pred EeccC
Q 034021 101 NYAHK 105 (106)
Q Consensus 101 ~~s~~ 105 (106)
..|++
T Consensus 83 NKAKP 87 (759)
T PLN03213 83 EKAKE 87 (759)
T ss_pred eeccH
Confidence 99875
No 29
>smart00360 RRM RNA recognition motif.
Probab=99.67 E-value=9.4e-16 Score=79.64 Aligned_cols=71 Identities=38% Similarity=0.725 Sum_probs=65.9
Q ss_pred EcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 31 IGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 31 v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
|+|||..+++++|+++|+.||.+..+.+...+.++..+++|||+|.+.++|.+|+..+++..++|++++|+
T Consensus 1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~~~~~~~~~a~v~f~~~~~a~~a~~~~~~~~~~~~~~~v~ 71 (71)
T smart00360 1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDKDTGKSKGFAFVEFESEEDAEKALEALNGKELDGRPLKVK 71 (71)
T ss_pred CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCCCCCCCCceEEEEeCCHHHHHHHHHHcCCCeeCCcEEEeC
Confidence 57899999999999999999999999998887678889999999999999999999999999999998874
No 30
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.67 E-value=1.6e-15 Score=95.32 Aligned_cols=75 Identities=19% Similarity=0.217 Sum_probs=68.4
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
..++|+|+||++.+++++|++||+.||.|..+.++++. ...++|||+|.+.+.++.|+. |+|..|.+++|.|...
T Consensus 4 ~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D~---et~gfAfVtF~d~~aaetAll-LnGa~l~d~~I~It~~ 78 (243)
T PLN03121 4 GGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRSG---EYACTAYVTFKDAYALETAVL-LSGATIVDQRVCITRW 78 (243)
T ss_pred CceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecCC---CcceEEEEEECCHHHHHHHHh-cCCCeeCCceEEEEeC
Confidence 45899999999999999999999999999999999874 334799999999999999997 9999999999999863
No 31
>KOG0108 consensus mRNA cleavage and polyadenylation factor I complex, subunit RNA15 [RNA processing and modification]
Probab=99.66 E-value=5.1e-16 Score=105.41 Aligned_cols=79 Identities=34% Similarity=0.771 Sum_probs=76.9
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
+.+||+|+|+++++++|..+|+..|.|..++++.|+.+|+.+||+|++|.+.+++..|++.|+|.++.|++|+|.|+..
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~~~~~~~~~~a~~~lNg~~~~gr~l~v~~~~~ 97 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCEFTDEETAERAIRNLNGAEFNGRKLRVNYASN 97 (435)
T ss_pred cceEecCCCCcccHHHHHHHHhccCccceeeecccccCCCcCceeeEecCchhhHHHHHHhcCCcccCCceEEeecccc
Confidence 8899999999999999999999999999999999999999999999999999999999999999999999999999754
No 32
>KOG0111 consensus Cyclophilin-type peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=99.66 E-value=1.2e-16 Score=98.96 Aligned_cols=84 Identities=38% Similarity=0.641 Sum_probs=79.8
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
....++|||++|..++++.-|...|-+||+|..+.++.+-.+++++||+||+|...++|.+|+.-||+.++.|+.|+|.+
T Consensus 7 a~~KrtlYVGGladeVtekvLhaAFIPFGDI~dIqiPlDyesqkHRgFgFVefe~aEDAaaAiDNMnesEL~GrtirVN~ 86 (298)
T KOG0111|consen 7 ANQKRTLYVGGLADEVTEKVLHAAFIPFGDIKDIQIPLDYESQKHRGFGFVEFEEAEDAAAAIDNMNESELFGRTIRVNL 86 (298)
T ss_pred cccceeEEeccchHHHHHHHHHhccccccchhhcccccchhcccccceeEEEeeccchhHHHhhcCchhhhcceeEEEee
Confidence 34578999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccCC
Q 034021 103 AHKD 106 (106)
Q Consensus 103 s~~~ 106 (106)
++|.
T Consensus 87 AkP~ 90 (298)
T KOG0111|consen 87 AKPE 90 (298)
T ss_pred cCCc
Confidence 9874
No 33
>KOG0121 consensus Nuclear cap-binding protein complex, subunit CBP20 (RRM superfamily) [RNA processing and modification]
Probab=99.64 E-value=1e-15 Score=87.54 Aligned_cols=81 Identities=27% Similarity=0.518 Sum_probs=74.2
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
...+++|||+||...+++++|.++|+..|+|..+-+-.++.+..+-|++||+|.+.++|..|++-+++..++.+.|++.|
T Consensus 33 ~r~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpCGFCFVeyy~~~dA~~AlryisgtrLddr~ir~D~ 112 (153)
T KOG0121|consen 33 LRKSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPCGFCFVEYYSRDDAEDALRYISGTRLDDRPIRIDW 112 (153)
T ss_pred HhhcceEEEeeeeeeecHHHHHHHHHhccchheeEeccccCCcCccceEEEEEecchhHHHHHHHhccCcccccceeeec
Confidence 34579999999999999999999999999999998877776777789999999999999999999999999999999876
Q ss_pred c
Q 034021 103 A 103 (106)
Q Consensus 103 s 103 (106)
-
T Consensus 113 D 113 (153)
T KOG0121|consen 113 D 113 (153)
T ss_pred c
Confidence 3
No 34
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.64 E-value=3.9e-15 Score=103.21 Aligned_cols=77 Identities=22% Similarity=0.349 Sum_probs=70.8
Q ss_pred CCCceEEEcCCCC-CCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 24 NSRTKLFIGGLSY-DTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 24 ~~~~~l~v~~lp~-~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
...++|||+|||. .+++++|+++|+.||.|..++++++. +|+|||+|.+.++|..|+..|+|..+.|++|+|.+
T Consensus 273 ~~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~~~-----~g~afV~f~~~~~A~~Ai~~lng~~l~g~~l~v~~ 347 (481)
T TIGR01649 273 GPGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMKNK-----KETALIEMADPYQAQLALTHLNGVKLFGKPLRVCP 347 (481)
T ss_pred CCCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEECCceEEEEE
Confidence 3567999999998 69999999999999999999998763 57999999999999999999999999999999999
Q ss_pred ccC
Q 034021 103 AHK 105 (106)
Q Consensus 103 s~~ 105 (106)
++.
T Consensus 348 s~~ 350 (481)
T TIGR01649 348 SKQ 350 (481)
T ss_pred ccc
Confidence 864
No 35
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.63 E-value=1.2e-14 Score=76.01 Aligned_cols=74 Identities=35% Similarity=0.717 Sum_probs=67.7
Q ss_pred eEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 28 KLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 28 ~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
+|+|+|||..+++++++++|+.+|.+..+.+..++.+ ...+++||+|.+.++|+.|+..+++..+.|+++.|.+
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~~~~~-~~~~~~~v~f~s~~~a~~a~~~~~~~~~~~~~~~v~~ 74 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVRDKDT-KSKGFAFVEFEDEEDAEKALEALNGKELGGRPLRVEF 74 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEeeCCCC-CcceEEEEEECCHHHHHHHHHHhCCCeECCeEEEEeC
Confidence 4889999999999999999999999999999877633 6688999999999999999999999999999999874
No 36
>KOG0126 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.63 E-value=4.9e-17 Score=97.87 Aligned_cols=81 Identities=31% Similarity=0.635 Sum_probs=76.1
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
+-.+..-|||+|||++.|+.+|.-+|+.||.|..+-++++..+|+++||||+.|+++.+..-|+.-|||+.|.|+.|+|.
T Consensus 31 ~YkdsA~Iyiggl~~~LtEgDil~VFSqyGe~vdinLiRDk~TGKSKGFaFLcYEDQRSTILAVDN~NGiki~gRtirVD 110 (219)
T KOG0126|consen 31 EYKDSAYIYIGGLPYELTEGDILCVFSQYGEIVDINLIRDKKTGKSKGFAFLCYEDQRSTILAVDNLNGIKILGRTIRVD 110 (219)
T ss_pred hcccceEEEECCCcccccCCcEEEEeeccCceEEEEEEecCCCCcccceEEEEecCccceEEEEeccCCceecceeEEee
Confidence 33456789999999999999999999999999999999999999999999999999999999999999999999999997
Q ss_pred e
Q 034021 102 Y 102 (106)
Q Consensus 102 ~ 102 (106)
.
T Consensus 111 H 111 (219)
T KOG0126|consen 111 H 111 (219)
T ss_pred e
Confidence 5
No 37
>KOG0148 consensus Apoptosis-promoting RNA-binding protein TIA-1/TIAR (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=99.63 E-value=1.4e-15 Score=96.43 Aligned_cols=80 Identities=41% Similarity=0.730 Sum_probs=76.3
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
-..++|+-|...++.++|++.|.+||.|.++++++|..+++++||+||.|.+.++|++|+..|+|..++++.|+-.||.+
T Consensus 62 hfhvfvgdls~eI~~e~lr~aF~pFGevS~akvirD~~T~KsKGYgFVSf~~k~dAEnAI~~MnGqWlG~R~IRTNWATR 141 (321)
T KOG0148|consen 62 HFHVFVGDLSPEIDNEKLREAFAPFGEVSDAKVIRDMNTGKSKGYGFVSFPNKEDAENAIQQMNGQWLGRRTIRTNWATR 141 (321)
T ss_pred ceeEEehhcchhcchHHHHHHhccccccccceEeecccCCcccceeEEeccchHHHHHHHHHhCCeeeccceeecccccc
Confidence 45688999999999999999999999999999999999999999999999999999999999999999999999998764
No 38
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.62 E-value=6.4e-15 Score=93.99 Aligned_cols=79 Identities=43% Similarity=0.782 Sum_probs=75.6
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
.++|||+|||.++++++|.++|..||.+..+.+..+..++..+|+|||+|.+.+++..|+..+++..+.|++|+|.++.
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~f~~~~~~~~a~~~~~~~~~~~~~~~v~~~~ 193 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVEFESEESAEKAIEELNGKELEGRPLRVQKAQ 193 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeeccccCccCceEEEEecCHHHHHHHHHHcCCCeECCceeEeeccc
Confidence 6999999999999999999999999999999999998789999999999999999999999999999999999999853
No 39
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.61 E-value=1.4e-14 Score=101.62 Aligned_cols=73 Identities=32% Similarity=0.560 Sum_probs=67.5
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccC--CCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQH--GEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~--g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
..++|||+|||.++++++|+++|+.| |.|..+.++ ++||||+|.+.++|++|+..||+..|.|+.|+|.+
T Consensus 232 ~~k~LfVgNL~~~~tee~L~~~F~~f~~G~I~rV~~~--------rgfAFVeF~s~e~A~kAi~~lnG~~i~Gr~I~V~~ 303 (578)
T TIGR01648 232 KVKILYVRNLMTTTTEEIIEKSFSEFKPGKVERVKKI--------RDYAFVHFEDREDAVKAMDELNGKELEGSEIEVTL 303 (578)
T ss_pred cccEEEEeCCCCCCCHHHHHHHHHhcCCCceEEEEee--------cCeEEEEeCCHHHHHHHHHHhCCCEECCEEEEEEE
Confidence 35789999999999999999999999 999998765 34999999999999999999999999999999999
Q ss_pred ccC
Q 034021 103 AHK 105 (106)
Q Consensus 103 s~~ 105 (106)
+++
T Consensus 304 Akp 306 (578)
T TIGR01648 304 AKP 306 (578)
T ss_pred ccC
Confidence 875
No 40
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.60 E-value=1.4e-14 Score=100.47 Aligned_cols=74 Identities=19% Similarity=0.266 Sum_probs=67.0
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh--CCceeCCeEEEEEec
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM--DGLLLDGRNIRVNYA 103 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l--~~~~~~g~~l~v~~s 103 (106)
.++|||+|||+++++++|+++|++||.|..+.++.+ +++|||+|.+.++|++|+..+ ++..+.|++|+|.++
T Consensus 2 s~vv~V~nLp~~~te~~L~~~f~~fG~V~~v~i~~~------k~~afVef~~~e~A~~Ai~~~~~~~~~l~g~~l~v~~s 75 (481)
T TIGR01649 2 SPVVHVRNLPQDVVEADLVEALIPFGPVSYVMMLPG------KRQALVEFEDEESAKACVNFATSVPIYIRGQPAFFNYS 75 (481)
T ss_pred ccEEEEcCCCCCCCHHHHHHHHHhcCCeeEEEEECC------CCEEEEEeCchHHHHHHHHHhhcCCceEcCeEEEEEec
Confidence 578999999999999999999999999999988753 469999999999999999864 778999999999998
Q ss_pred cC
Q 034021 104 HK 105 (106)
Q Consensus 104 ~~ 105 (106)
..
T Consensus 76 ~~ 77 (481)
T TIGR01649 76 TS 77 (481)
T ss_pred CC
Confidence 53
No 41
>KOG0105 consensus Alternative splicing factor ASF/SF2 (RRM superfamily) [RNA processing and modification]
Probab=99.57 E-value=9.1e-15 Score=88.52 Aligned_cols=78 Identities=27% Similarity=0.479 Sum_probs=69.4
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
...++|||+|||.++.+.+|+++|.+||.|..|.+...+ ....||||+|++..+|+.|+..-+|...+|++|+|+++
T Consensus 4 r~~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r~---g~ppfafVeFEd~RDAeDAiygRdGYdydg~rLRVEfp 80 (241)
T KOG0105|consen 4 RNSRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNRP---GPPPFAFVEFEDPRDAEDAIYGRDGYDYDGCRLRVEFP 80 (241)
T ss_pred cccceEEecCCCcchhhccHHHHHhhhcceEEEEeccCC---CCCCeeEEEecCccchhhhhhcccccccCcceEEEEec
Confidence 356889999999999999999999999999999885433 23569999999999999999999999999999999986
Q ss_pred c
Q 034021 104 H 104 (106)
Q Consensus 104 ~ 104 (106)
.
T Consensus 81 r 81 (241)
T KOG0105|consen 81 R 81 (241)
T ss_pred c
Confidence 4
No 42
>KOG0130 consensus RNA-binding protein RBM8/Tsunagi (RRM superfamily) [General function prediction only]
Probab=99.57 E-value=1.2e-14 Score=83.94 Aligned_cols=80 Identities=29% Similarity=0.524 Sum_probs=75.7
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
..-.|+|.++...+++++|.+.|..||.|.++-+-.+..+|-.+||++|+|.+...|++|+..+||..+.|..|.|.|+.
T Consensus 71 EGwIi~VtgvHeEatEedi~d~F~dyGeiKNihLNLDRRtGy~KGYaLvEYet~keAq~A~~~~Ng~~ll~q~v~VDw~F 150 (170)
T KOG0130|consen 71 EGWIIFVTGVHEEATEEDIHDKFADYGEIKNIHLNLDRRTGYVKGYALVEYETLKEAQAAIDALNGAELLGQNVSVDWCF 150 (170)
T ss_pred eeEEEEEeccCcchhHHHHHHHHhhcccccceeeccccccccccceeeeehHhHHHHHHHHHhccchhhhCCceeEEEEE
Confidence 45789999999999999999999999999999999998899999999999999999999999999999999999999863
No 43
>KOG0131 consensus Splicing factor 3b, subunit 4 [RNA processing and modification]
Probab=99.57 E-value=9.1e-15 Score=88.06 Aligned_cols=82 Identities=38% Similarity=0.593 Sum_probs=77.4
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
.....+|||+||+..++++-|.++|-..|+|..+.++++.-+..+.||||++|.++++|+=|++.|+.+.+.|++|+|..
T Consensus 6 rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~Ef~~eedadYAikiln~VkLYgrpIrv~k 85 (203)
T KOG0131|consen 6 RNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFAEFRTEEDADYAIKILNMVKLYGRPIRVNK 85 (203)
T ss_pred cCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhhhcccccceeEEEEechhhhHHHHHHHHHHHhcCceeEEEe
Confidence 34567999999999999999999999999999999999998999999999999999999999999999999999999998
Q ss_pred cc
Q 034021 103 AH 104 (106)
Q Consensus 103 s~ 104 (106)
+.
T Consensus 86 as 87 (203)
T KOG0131|consen 86 AS 87 (203)
T ss_pred cc
Confidence 75
No 44
>KOG0144 consensus RNA-binding protein CUGBP1/BRUNO (RRM superfamily) [RNA processing and modification]
Probab=99.55 E-value=2.6e-14 Score=95.64 Aligned_cols=90 Identities=32% Similarity=0.545 Sum_probs=78.5
Q ss_pred cccccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce-eCC
Q 034021 17 TRLLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL-LDG 95 (106)
Q Consensus 17 ~~~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~-~~g 95 (106)
......+.+.-++||+-||+.++|.+|+++|++||.|.+|.+++|+.++..+||+||.|.+.++|.+|+.+|++.. +.|
T Consensus 25 ~~~d~~d~~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk~t~~s~gcCFv~~~trk~a~~a~~Alhn~ktlpG 104 (510)
T KOG0144|consen 25 DHTDNPDGSAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDKSTGQSKGCCFVKYYTRKEADEAINALHNQKTLPG 104 (510)
T ss_pred CCCCCCCchhhhheeccCCccccHHHHHHHHHHhCceeEEEeecccccCcccceEEEEeccHHHHHHHHHHhhcccccCC
Confidence 3344455667889999999999999999999999999999999999999999999999999999999999998875 433
Q ss_pred --eEEEEEeccCC
Q 034021 96 --RNIRVNYAHKD 106 (106)
Q Consensus 96 --~~l~v~~s~~~ 106 (106)
.+|+|+++..|
T Consensus 105 ~~~pvqvk~Ad~E 117 (510)
T KOG0144|consen 105 MHHPVQVKYADGE 117 (510)
T ss_pred CCcceeecccchh
Confidence 68999887643
No 45
>KOG0117 consensus Heterogeneous nuclear ribonucleoprotein R (RRM superfamily) [RNA processing and modification]
Probab=99.55 E-value=2.1e-14 Score=96.30 Aligned_cols=72 Identities=38% Similarity=0.524 Sum_probs=68.1
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
-..|||+||+.++|++.|++.|+.||.|..++.++| ||||.|.+.++|-+|++.+|+.++.|..|.|.+|+|
T Consensus 259 VKvLYVRNL~~~tTeE~lk~~F~~~G~veRVkk~rD--------YaFVHf~eR~davkAm~~~ngkeldG~~iEvtLAKP 330 (506)
T KOG0117|consen 259 VKVLYVRNLMESTTEETLKKLFNEFGKVERVKKPRD--------YAFVHFAEREDAVKAMKETNGKELDGSPIEVTLAKP 330 (506)
T ss_pred eeeeeeeccchhhhHHHHHHHHHhccceEEeecccc--------eeEEeecchHHHHHHHHHhcCceecCceEEEEecCC
Confidence 478999999999999999999999999999977644 999999999999999999999999999999999986
No 46
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=99.54 E-value=9.4e-14 Score=70.01 Aligned_cols=56 Identities=38% Similarity=0.746 Sum_probs=50.0
Q ss_pred HHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 43 LKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 43 l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
|.++|++||.|..+.+.... ++++||+|.+.++|+.|+..|||..+.|++|+|+||
T Consensus 1 L~~~f~~fG~V~~i~~~~~~-----~~~a~V~f~~~~~A~~a~~~l~~~~~~g~~l~V~~a 56 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKKK-----RGFAFVEFASVEDAQKAIEQLNGRQFNGRPLKVSYA 56 (56)
T ss_dssp HHHHHTTTS-EEEEEEETTS-----TTEEEEEESSHHHHHHHHHHHTTSEETTEEEEEEEE
T ss_pred ChHHhCCcccEEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEECCcEEEEEEC
Confidence 57899999999999886543 469999999999999999999999999999999986
No 47
>KOG0144 consensus RNA-binding protein CUGBP1/BRUNO (RRM superfamily) [RNA processing and modification]
Probab=99.54 E-value=9.4e-15 Score=97.70 Aligned_cols=80 Identities=34% Similarity=0.604 Sum_probs=73.6
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce-eCC--eEEEEE
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL-LDG--RNIRVN 101 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~-~~g--~~l~v~ 101 (106)
.+++|||+.|++.+++.+++++|++||-|++|+++++. .+.++||+||.|...+-|..|++.|||.. +.| .+|.|+
T Consensus 123 ~e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd~-~~~sRGcaFV~fstke~A~~Aika~ng~~tmeGcs~PLVVk 201 (510)
T KOG0144|consen 123 EERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRDP-DGLSRGCAFVKFSTKEMAVAAIKALNGTQTMEGCSQPLVVK 201 (510)
T ss_pred cchhhhhhhccccccHHHHHHHHHhhCccchhhheecc-cccccceeEEEEehHHHHHHHHHhhccceeeccCCCceEEE
Confidence 37899999999999999999999999999999999998 78899999999999999999999999985 544 589999
Q ss_pred eccC
Q 034021 102 YAHK 105 (106)
Q Consensus 102 ~s~~ 105 (106)
++..
T Consensus 202 FADt 205 (510)
T KOG0144|consen 202 FADT 205 (510)
T ss_pred eccc
Confidence 9864
No 48
>KOG0124 consensus Polypyrimidine tract-binding protein PUF60 (RRM superfamily) [RNA processing and modification]
Probab=99.53 E-value=1e-14 Score=96.20 Aligned_cols=77 Identities=31% Similarity=0.667 Sum_probs=74.2
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
-+.|||+.|.+...++.|+..|.+||+|+.+.+-+++.+++++||+||+|+-++.|+-|++.+||..++|+.|+|..
T Consensus 113 McRvYVGSIsfEl~EDtiR~AF~PFGPIKSInMSWDp~T~kHKgFAFVEYEvPEaAqLAlEqMNg~mlGGRNiKVgr 189 (544)
T KOG0124|consen 113 MCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVEYEVPEAAQLALEQMNGQMLGGRNIKVGR 189 (544)
T ss_pred hHheeeeeeEEEechHHHHhhccCCCCcceeecccccccccccceEEEEEeCcHHHHHHHHHhccccccCccccccC
Confidence 47899999999999999999999999999999999999999999999999999999999999999999999999864
No 49
>KOG0127 consensus Nucleolar protein fibrillarin NOP77 (RRM superfamily) [RNA processing and modification]
Probab=99.51 E-value=1e-13 Score=95.16 Aligned_cols=78 Identities=27% Similarity=0.570 Sum_probs=72.3
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
.-.|.|+|||+.+..++|+.+|+.||.+..+.|++.. .|+..|||||+|....+|..|+..+|+..|+|++|-|.||-
T Consensus 117 k~rLIIRNLPf~~k~~dLk~vFs~~G~V~Ei~IP~k~-dgklcGFaFV~fk~~~dA~~Al~~~N~~~i~gR~VAVDWAV 194 (678)
T KOG0127|consen 117 KWRLIIRNLPFKCKKPDLKNVFSNFGKVVEIVIPRKK-DGKLCGFAFVQFKEKKDAEKALEFFNGNKIDGRPVAVDWAV 194 (678)
T ss_pred cceEEeecCCcccCcHHHHHHHhhcceEEEEEcccCC-CCCccceEEEEEeeHHHHHHHHHhccCceecCceeEEeeec
Confidence 5678999999999999999999999999999999777 55556999999999999999999999999999999999973
No 50
>KOG0145 consensus RNA-binding protein ELAV/HU (RRM superfamily) [RNA processing and modification]
Probab=99.51 E-value=3.6e-13 Score=85.48 Aligned_cols=82 Identities=32% Similarity=0.535 Sum_probs=77.0
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.+....+|||=||..++.+.-|+++|.+||.|..+++++|..+.+++||+||.+.+-++|..|+..|||..++++.|+|.
T Consensus 274 ~~~~g~ciFvYNLspd~de~~LWQlFgpFGAv~nVKvirD~ttnkCKGfgFVtMtNYdEAamAi~sLNGy~lg~rvLQVs 353 (360)
T KOG0145|consen 274 GPGGGWCIFVYNLSPDADESILWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVTMTNYDEAAMAIASLNGYRLGDRVLQVS 353 (360)
T ss_pred CCCCeeEEEEEecCCCchHhHHHHHhCcccceeeEEEEecCCcccccceeEEEecchHHHHHHHHHhcCccccceEEEEE
Confidence 34456889999999999999999999999999999999999889999999999999999999999999999999999998
Q ss_pred ec
Q 034021 102 YA 103 (106)
Q Consensus 102 ~s 103 (106)
+-
T Consensus 354 FK 355 (360)
T KOG0145|consen 354 FK 355 (360)
T ss_pred Ee
Confidence 73
No 51
>KOG0127 consensus Nucleolar protein fibrillarin NOP77 (RRM superfamily) [RNA processing and modification]
Probab=99.50 E-value=3.4e-13 Score=92.65 Aligned_cols=82 Identities=38% Similarity=0.660 Sum_probs=73.9
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh-----CC-ceeCC
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM-----DG-LLLDG 95 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l-----~~-~~~~g 95 (106)
......+|||+|||+++++++|...|+.||.+....++.++.++.++|.|||.|.++..+++||... .| +.+.|
T Consensus 288 n~~~~~tVFvRNL~fD~tEEel~~~fskFG~v~ya~iV~~k~T~~skGtAFv~Fkt~~~~~~ci~~Aspa~e~g~~ll~G 367 (678)
T KOG0127|consen 288 NITEGKTVFVRNLPFDTTEEELKEHFSKFGEVKYAIIVKDKDTGHSKGTAFVKFKTQIAAQNCIEAASPASEDGSVLLDG 367 (678)
T ss_pred cccccceEEEecCCccccHHHHHHHHHhhccceeEEEEeccCCCCcccceEEEeccHHHHHHHHHhcCccCCCceEEEec
Confidence 3344689999999999999999999999999999999999999999999999999999999999865 23 66899
Q ss_pred eEEEEEec
Q 034021 96 RNIRVNYA 103 (106)
Q Consensus 96 ~~l~v~~s 103 (106)
+.|.|..+
T Consensus 368 R~Lkv~~A 375 (678)
T KOG0127|consen 368 RLLKVTLA 375 (678)
T ss_pred cEEeeeec
Confidence 99999875
No 52
>smart00361 RRM_1 RNA recognition motif.
Probab=99.50 E-value=3.6e-13 Score=70.89 Aligned_cols=62 Identities=24% Similarity=0.535 Sum_probs=54.3
Q ss_pred HHHHHHHhc----cCCCeeEEE-EeecCCC--CCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 40 ETVLKDAFG----QHGEIIEVK-VICDRVT--GQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 40 ~~~l~~~f~----~~g~i~~~~-~~~~~~~--~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
+++|+++|+ .||.+..+. +..++.+ +..+|++||.|.+.++|.+|+..|||..+.|+.|+++
T Consensus 2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~f~~~~dA~~A~~~l~g~~~~gr~l~~~ 70 (70)
T smart00361 2 DEDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYITFERSEDAARAIVDLNGRYFDGRTVKAE 70 (70)
T ss_pred chhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEEECCHHHHHHHHHHhCCCEECCEEEEeC
Confidence 467888888 999999885 6666555 7889999999999999999999999999999999863
No 53
>KOG4206 consensus Spliceosomal protein snRNP-U1A/U2B [RNA processing and modification]
Probab=99.49 E-value=2e-13 Score=84.65 Aligned_cols=78 Identities=31% Similarity=0.572 Sum_probs=70.5
Q ss_pred CceEEEcCCCCCCCHHHHHH----HhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 26 RTKLFIGGLSYDTNETVLKD----AFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~----~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
..++||.||+..+..++|++ +|+.||.|..|.... +.+.+|-|||.|++.+.|-.|++.|+|..+.|+.++++
T Consensus 9 n~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~k---t~KmRGQA~VvFk~~~~As~A~r~l~gfpFygK~mriq 85 (221)
T KOG4206|consen 9 NGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAFK---TPKMRGQAFVVFKETEAASAALRALQGFPFYGKPMRIQ 85 (221)
T ss_pred CceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEecC---CCCccCceEEEecChhHHHHHHHHhcCCcccCchhhee
Confidence 33999999999999999888 999999999887653 56678999999999999999999999999999999999
Q ss_pred eccCC
Q 034021 102 YAHKD 106 (106)
Q Consensus 102 ~s~~~ 106 (106)
||+.|
T Consensus 86 yA~s~ 90 (221)
T KOG4206|consen 86 YAKSD 90 (221)
T ss_pred cccCc
Confidence 99764
No 54
>KOG0131 consensus Splicing factor 3b, subunit 4 [RNA processing and modification]
Probab=99.48 E-value=5.5e-14 Score=84.73 Aligned_cols=102 Identities=32% Similarity=0.562 Sum_probs=85.5
Q ss_pred cCceeeccCccccccc---cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCee-EEEEeecCCCCCceeEEEEEecCHHH
Q 034021 5 NGSVRFSHSVRFTRLL---SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEII-EVKVICDRVTGQSRGYGFVKFASEAA 80 (106)
Q Consensus 5 ~~~~~~~~~~~~~~~~---~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~-~~~~~~~~~~~~~~g~~fv~f~~~~~ 80 (106)
|.-.++.++..+.... ........+||+||.+++.+..|.+.|+.||.+. ..++++++.+|..++++|+.|.+.+.
T Consensus 72 n~VkLYgrpIrv~kas~~~~nl~vganlfvgNLd~~vDe~~L~dtFsafG~l~~~P~i~rd~~tg~~~~~g~i~~~sfea 151 (203)
T KOG0131|consen 72 NMVKLYGRPIRVNKASAHQKNLDVGANLFVGNLDPEVDEKLLYDTFSAFGVLISPPKIMRDPDTGNPKGFGFINYASFEA 151 (203)
T ss_pred HHHHhcCceeEEEecccccccccccccccccccCcchhHHHHHHHHHhccccccCCcccccccCCCCCCCeEEechhHHH
Confidence 3334455555555444 4444568899999999999999999999999864 45889999889999999999999999
Q ss_pred HHHHHHHhCCceeCCeEEEEEeccCC
Q 034021 81 AGEAIKEMDGLLLDGRNIRVNYAHKD 106 (106)
Q Consensus 81 a~~al~~l~~~~~~g~~l~v~~s~~~ 106 (106)
+.+|+..++|..+.++++.|.++.++
T Consensus 152 sd~ai~s~ngq~l~nr~itv~ya~k~ 177 (203)
T KOG0131|consen 152 SDAAIGSMNGQYLCNRPITVSYAFKK 177 (203)
T ss_pred HHHHHHHhccchhcCCceEEEEEEec
Confidence 99999999999999999999998763
No 55
>KOG0147 consensus Transcriptional coactivator CAPER (RRM superfamily) [Transcription]
Probab=99.47 E-value=1.2e-13 Score=94.44 Aligned_cols=74 Identities=34% Similarity=0.719 Sum_probs=71.8
Q ss_pred EEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 29 LFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 29 l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
+||+||.+++++++++..|++||.|..+.+.++..+|.++||+||+|.+.++|.+|+..|||.++-|+.|+|..
T Consensus 281 l~vgnLHfNite~~lr~ifepfg~Ie~v~l~~d~~tG~skgfGfi~f~~~~~ar~a~e~lngfelAGr~ikV~~ 354 (549)
T KOG0147|consen 281 LYVGNLHFNITEDMLRGIFEPFGKIENVQLTKDSETGRSKGFGFITFVNKEDARKALEQLNGFELAGRLIKVSV 354 (549)
T ss_pred hhhcccccCchHHHHhhhccCcccceeeeeccccccccccCcceEEEecHHHHHHHHHHhccceecCceEEEEE
Confidence 89999999999999999999999999999999988999999999999999999999999999999999999875
No 56
>KOG0123 consensus Polyadenylate-binding protein (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=99.45 E-value=5.9e-13 Score=89.52 Aligned_cols=95 Identities=35% Similarity=0.585 Sum_probs=77.1
Q ss_pred ccCceeeccCccccccccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHH
Q 034021 4 LNGSVRFSHSVRFTRLLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGE 83 (106)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 83 (106)
+|.......+.+..+.... ...+||.||+.+++...|.++|+.||.|..|++..+. +| .+|| ||+|.+.+.|++
T Consensus 57 ~n~~~~~~~~~rim~s~rd---~~~~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~~-~g-~kg~-FV~f~~e~~a~~ 130 (369)
T KOG0123|consen 57 MNFDVLKGKPIRIMWSQRD---PSLVFIKNLDESIDNKSLYDTFSEFGNILSCKVATDE-NG-SKGY-FVQFESEESAKK 130 (369)
T ss_pred cCCcccCCcEEEeehhccC---CceeeecCCCcccCcHHHHHHHHhhcCeeEEEEEEcC-CC-ceee-EEEeCCHHHHHH
Confidence 3444444444444333222 2239999999999999999999999999999999987 55 7889 999999999999
Q ss_pred HHHHhCCceeCCeEEEEEecc
Q 034021 84 AIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 84 al~~l~~~~~~g~~l~v~~s~ 104 (106)
|+..+||..+.+++|.|....
T Consensus 131 ai~~~ng~ll~~kki~vg~~~ 151 (369)
T KOG0123|consen 131 AIEKLNGMLLNGKKIYVGLFE 151 (369)
T ss_pred HHHHhcCcccCCCeeEEeecc
Confidence 999999999999999997654
No 57
>KOG0146 consensus RNA-binding protein ETR-3 (RRM superfamily) [RNA processing and modification]
Probab=99.45 E-value=1.5e-13 Score=87.46 Aligned_cols=87 Identities=30% Similarity=0.478 Sum_probs=80.3
Q ss_pred cccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEE
Q 034021 19 LLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNI 98 (106)
Q Consensus 19 ~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l 98 (106)
...+-.+.|.|||=.||.+..+.+|...|-+||-|...+++.|+.+..++.|+||.|++..++++|+..+||+.|+.++|
T Consensus 278 qqreGPeGCNlFIYHLPQEFgDaEliQmF~PFGhivSaKVFvDRATNQSKCFGFVSfDNp~SaQaAIqAMNGFQIGMKRL 357 (371)
T KOG0146|consen 278 QQREGPEGCNLFIYHLPQEFGDAELIQMFLPFGHIVSAKVFVDRATNQSKCFGFVSFDNPASAQAAIQAMNGFQIGMKRL 357 (371)
T ss_pred hhhcCCCcceEEEEeCchhhccHHHHHHhccccceeeeeeeehhccccccceeeEecCCchhHHHHHHHhcchhhhhhhh
Confidence 34556678999999999999999999999999999999999999888899999999999999999999999999999999
Q ss_pred EEEeccC
Q 034021 99 RVNYAHK 105 (106)
Q Consensus 99 ~v~~s~~ 105 (106)
+|.+.++
T Consensus 358 KVQLKRP 364 (371)
T KOG0146|consen 358 KVQLKRP 364 (371)
T ss_pred hhhhcCc
Confidence 9988654
No 58
>KOG0415 consensus Predicted peptidyl prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=99.45 E-value=3.7e-13 Score=88.48 Aligned_cols=84 Identities=31% Similarity=0.526 Sum_probs=78.6
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEE
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRV 100 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v 100 (106)
...++...|||..|.+-++.++|.-+|+.||.|..|.++++..+|....||||+|++.++.++|.-.|+++.|+.++|.|
T Consensus 234 d~~PPeNVLFVCKLNPVTtDeDLeiIFSrFG~i~sceVIRD~ktgdsLqyaFiEFen~escE~AyFKMdNvLIDDrRIHV 313 (479)
T KOG0415|consen 234 DVKPPENVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDRKTGDSLQYAFIEFENKESCEQAYFKMDNVLIDDRRIHV 313 (479)
T ss_pred ccCCCcceEEEEecCCcccccchhhHHhhcccceeeeEEecccccchhheeeeeecchhhHHHHHhhhcceeeccceEEe
Confidence 34456788999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred Eecc
Q 034021 101 NYAH 104 (106)
Q Consensus 101 ~~s~ 104 (106)
.+|.
T Consensus 314 DFSQ 317 (479)
T KOG0415|consen 314 DFSQ 317 (479)
T ss_pred ehhh
Confidence 8863
No 59
>KOG0124 consensus Polypyrimidine tract-binding protein PUF60 (RRM superfamily) [RNA processing and modification]
Probab=99.44 E-value=4.1e-13 Score=88.73 Aligned_cols=100 Identities=23% Similarity=0.419 Sum_probs=83.5
Q ss_pred cccCceeeccCccccccccCCCC-------------CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCcee
Q 034021 3 WLNGSVRFSHSVRFTRLLSDANS-------------RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRG 69 (106)
Q Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~-------------~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g 69 (106)
.||+.-.-.++..+.++..-++. =..|||..+..+.++++|+..|+.||.|..|.+.+.+..+.++|
T Consensus 174 qMNg~mlGGRNiKVgrPsNmpQAQpiID~vqeeAk~fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~pt~~~HkG 253 (544)
T KOG0124|consen 174 QMNGQMLGGRNIKVGRPSNMPQAQPIIDMVQEEAKKFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKG 253 (544)
T ss_pred HhccccccCccccccCCCCCcccchHHHHHHHHHHhhheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccCCCCCccc
Confidence 45555555555555444333332 27899999999999999999999999999999999998888999
Q ss_pred EEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 70 YGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 70 ~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
|+|++|.+..+...|+..+|-+.++|..|+|..
T Consensus 254 yGfiEy~n~qs~~eAiasMNlFDLGGQyLRVGk 286 (544)
T KOG0124|consen 254 YGFIEYNNLQSQSEAIASMNLFDLGGQYLRVGK 286 (544)
T ss_pred eeeEEeccccchHHHhhhcchhhcccceEeccc
Confidence 999999999999999999999999999999865
No 60
>KOG4212 consensus RNA-binding protein hnRNP-M [RNA processing and modification]
Probab=99.44 E-value=8.8e-13 Score=88.77 Aligned_cols=76 Identities=26% Similarity=0.483 Sum_probs=71.2
Q ss_pred CceEEEcCCCCCCCHHHHHHHhc-cCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFG-QHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~-~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
.+.+||+|||+++...+|++++. +.|.|..|.++.+. +++.+|+|.|+|++++.+++|++.|+.+.+.|+.|.|.-
T Consensus 44 ~R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D~-~GK~rGcavVEFk~~E~~qKa~E~lnk~~~~GR~l~vKE 120 (608)
T KOG4212|consen 44 DRSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFDE-SGKARGCAVVEFKDPENVQKALEKLNKYEVNGRELVVKE 120 (608)
T ss_pred cceEEEecCcchhhhHhHHHHHHHhcCceEeeeeeccc-CCCcCCceEEEeeCHHHHHHHHHHhhhccccCceEEEec
Confidence 35699999999999999999996 67999999999997 899999999999999999999999999999999999974
No 61
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.41 E-value=2.5e-12 Score=89.58 Aligned_cols=75 Identities=16% Similarity=0.398 Sum_probs=61.3
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccC------------CCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQH------------GEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~------------g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l 88 (106)
......++|||+|||.++++++|.++|+.+ +.|..+.+. ..+|||||+|.+.++|..||. |
T Consensus 170 ~~~~~~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~~------~~kg~afVeF~~~e~A~~Al~-l 242 (509)
T TIGR01642 170 QATRQARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNIN------KEKNFAFLEFRTVEEATFAMA-L 242 (509)
T ss_pred cCCccccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEEC------CCCCEEEEEeCCHHHHhhhhc-C
Confidence 344457899999999999999999999864 234444332 246799999999999999995 9
Q ss_pred CCceeCCeEEEEEe
Q 034021 89 DGLLLDGRNIRVNY 102 (106)
Q Consensus 89 ~~~~~~g~~l~v~~ 102 (106)
+|..+.|..|+|..
T Consensus 243 ~g~~~~g~~l~v~r 256 (509)
T TIGR01642 243 DSIIYSNVFLKIRR 256 (509)
T ss_pred CCeEeeCceeEecC
Confidence 99999999999864
No 62
>KOG0123 consensus Polyadenylate-binding protein (RRM superfamily) [RNA processing and modification; Translation, ribosomal structure and biogenesis]
Probab=99.38 E-value=3.2e-12 Score=86.03 Aligned_cols=73 Identities=34% Similarity=0.634 Sum_probs=69.5
Q ss_pred eEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccCC
Q 034021 28 KLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHKD 106 (106)
Q Consensus 28 ~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~~ 106 (106)
.+||+ .++++..|.+.|+++|++..+++.++. + +.|||||.|.+..+|++|+.++|...+.|+++++.|+.+|
T Consensus 3 sl~vg---~~v~e~~l~~~f~~~~~v~s~rvc~d~-t--slgy~yvnf~~~~da~~A~~~~n~~~~~~~~~rim~s~rd 75 (369)
T KOG0123|consen 3 SLYVG---PDVTEAMLFDKFSPAGPVLSIRVCRDA-T--SLGYAYVNFQQPADAERALDTMNFDVLKGKPIRIMWSQRD 75 (369)
T ss_pred ceecC---CcCChHHHHHHhcccCCceeEEEeecC-C--ccceEEEecCCHHHHHHHHHHcCCcccCCcEEEeehhccC
Confidence 58887 899999999999999999999999998 6 8999999999999999999999999999999999999876
No 63
>KOG0109 consensus RNA-binding protein LARK, contains RRM and retroviral-type Zn-finger domains [RNA processing and modification; General function prediction only]
Probab=99.36 E-value=1.9e-12 Score=83.15 Aligned_cols=70 Identities=37% Similarity=0.678 Sum_probs=65.8
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
.++||+|||..+++.+|+.+|++||.|.+|.|+. .||||..++...++.+++-|++..++|..|.|+-|+
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlECDIvK--------NYgFVHiEdktaaedairNLhgYtLhg~nInVeaSk 72 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFEQYGKVLECDIVK--------NYGFVHIEDKTAAEDAIRNLHGYTLHGVNINVEASK 72 (346)
T ss_pred cchhccCCCcccchHHHHHHHHhhCceEeeeeec--------ccceEEeecccccHHHHhhcccceecceEEEEEecc
Confidence 4689999999999999999999999999999973 499999999999999999999999999999999875
No 64
>KOG0110 consensus RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.34 E-value=1.1e-12 Score=92.10 Aligned_cols=81 Identities=23% Similarity=0.494 Sum_probs=75.3
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
.+.|.|+|||+.++..+++++|..||.+..++++.....+.++|+|||.|-++.+|.+|+.+|....+.|++|.++|++-
T Consensus 613 ~tKIlVRNipFeAt~rEVr~LF~aFGqlksvRlPKK~~k~a~rGF~Fv~f~t~~ea~nA~~al~STHlyGRrLVLEwA~~ 692 (725)
T KOG0110|consen 613 GTKILVRNIPFEATKREVRKLFTAFGQLKSVRLPKKIGKGAHRGFGFVDFLTPREAKNAFDALGSTHLYGRRLVLEWAKS 692 (725)
T ss_pred cceeeeeccchHHHHHHHHHHHhcccceeeeccchhhcchhhccceeeeccCcHHHHHHHHhhcccceechhhheehhcc
Confidence 46899999999999999999999999999999988755667899999999999999999999999999999999999986
Q ss_pred C
Q 034021 106 D 106 (106)
Q Consensus 106 ~ 106 (106)
|
T Consensus 693 d 693 (725)
T KOG0110|consen 693 D 693 (725)
T ss_pred c
Confidence 5
No 65
>KOG0109 consensus RNA-binding protein LARK, contains RRM and retroviral-type Zn-finger domains [RNA processing and modification; General function prediction only]
Probab=99.33 E-value=3.4e-12 Score=82.01 Aligned_cols=75 Identities=29% Similarity=0.539 Sum_probs=69.4
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
....++++|+||.+.++.++++..|.+||++.++.++++ |+||.|+-.+++..|++.|++.++.|++|+|++
T Consensus 75 sk~stkl~vgNis~tctn~ElRa~fe~ygpviecdivkd--------y~fvh~d~~eda~~air~l~~~~~~gk~m~vq~ 146 (346)
T KOG0109|consen 75 SKASTKLHVGNISPTCTNQELRAKFEKYGPVIECDIVKD--------YAFVHFDRAEDAVEAIRGLDNTEFQGKRMHVQL 146 (346)
T ss_pred CCCccccccCCCCccccCHHHhhhhcccCCceeeeeecc--------eeEEEEeeccchHHHHhcccccccccceeeeee
Confidence 445788999999999999999999999999999999754 999999999999999999999999999999998
Q ss_pred ccC
Q 034021 103 AHK 105 (106)
Q Consensus 103 s~~ 105 (106)
|..
T Consensus 147 sts 149 (346)
T KOG0109|consen 147 STS 149 (346)
T ss_pred ecc
Confidence 753
No 66
>KOG0110 consensus RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.31 E-value=1.3e-11 Score=86.86 Aligned_cols=77 Identities=32% Similarity=0.654 Sum_probs=69.0
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCC----ceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQ----SRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~----~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
+++||.|++++++.+++...|...|.|..+.|...+ +++ +.|||||+|.+.++|++|++.|+|..++|+.|.|++
T Consensus 516 t~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkk-d~~~k~lSmGfgFVEF~~~e~A~~a~k~lqgtvldGH~l~lk~ 594 (725)
T KOG0110|consen 516 TKLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKK-DPANKYLSMGFGFVEFAKPESAQAALKALQGTVLDGHKLELKI 594 (725)
T ss_pred hhhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccc-cccccccccceeEEEecCHHHHHHHHHHhcCceecCceEEEEe
Confidence 449999999999999999999999999998886665 332 459999999999999999999999999999999999
Q ss_pred cc
Q 034021 103 AH 104 (106)
Q Consensus 103 s~ 104 (106)
+.
T Consensus 595 S~ 596 (725)
T KOG0110|consen 595 SE 596 (725)
T ss_pred cc
Confidence 86
No 67
>KOG0153 consensus Predicted RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=99.29 E-value=2.4e-11 Score=79.64 Aligned_cols=79 Identities=28% Similarity=0.558 Sum_probs=68.2
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh-CCceeCCeEEE
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM-DGLLLDGRNIR 99 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l-~~~~~~g~~l~ 99 (106)
.++..-.+|||++|...+++.+|++.|.+||.|..+.+.... ++|||+|.+.+.|+.|..+. +...|.|.+|+
T Consensus 223 PeD~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~~~~------~CAFv~ftTR~aAE~Aae~~~n~lvI~G~Rl~ 296 (377)
T KOG0153|consen 223 PEDTSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRILPRK------GCAFVTFTTREAAEKAAEKSFNKLVINGFRLK 296 (377)
T ss_pred CcccceeEEEecccccchhHHHHHHHHhhcCCeeeEEeeccc------ccceeeehhhHHHHHHHHhhcceeeecceEEE
Confidence 455567899999999999999999999999999999887553 48999999999999888744 55568999999
Q ss_pred EEeccC
Q 034021 100 VNYAHK 105 (106)
Q Consensus 100 v~~s~~ 105 (106)
|.|+.+
T Consensus 297 i~Wg~~ 302 (377)
T KOG0153|consen 297 IKWGRP 302 (377)
T ss_pred EEeCCC
Confidence 999864
No 68
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=99.29 E-value=9.3e-11 Score=65.00 Aligned_cols=79 Identities=24% Similarity=0.320 Sum_probs=68.9
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccC--CCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeC----CeEEE
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQH--GEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLD----GRNIR 99 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~--g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~----g~~l~ 99 (106)
++||.|+|||...+.++|.+.+... |..-.+.++.|..++...|||||.|.+.+.+.+..+.++|..+. .+.+.
T Consensus 1 RTTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~~N~GYAFVNf~~~~~~~~F~~~f~g~~w~~~~s~Kvc~ 80 (97)
T PF04059_consen 1 RTTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNKCNLGYAFVNFTSPQAAIRFYKAFNGKKWPNFNSKKVCE 80 (97)
T ss_pred CeeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCCCceEEEEEEcCCHHHHHHHHHHHcCCccccCCCCcEEE
Confidence 4689999999999999999998643 67788899999989999999999999999999999999999874 45677
Q ss_pred EEecc
Q 034021 100 VNYAH 104 (106)
Q Consensus 100 v~~s~ 104 (106)
|.+|+
T Consensus 81 i~yAr 85 (97)
T PF04059_consen 81 ISYAR 85 (97)
T ss_pred EehhH
Confidence 77764
No 69
>KOG4208 consensus Nucleolar RNA-binding protein NIFK [General function prediction only]
Probab=99.28 E-value=4.4e-11 Score=73.54 Aligned_cols=83 Identities=24% Similarity=0.392 Sum_probs=73.8
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccC-CCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQH-GEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~-g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
.......-+|+..+|....+.++..+|.++ |.+...++-+...+|.++|||||+|++.+.|.-|...||+..+.++-|.
T Consensus 44 p~~~~~g~~~~~~~p~g~~e~~~~~~~~q~~g~v~r~rlsRnkrTGNSKgYAFVEFEs~eVA~IaAETMNNYLl~e~lL~ 123 (214)
T KOG4208|consen 44 PEQEIEGVVYVDHIPHGFFETEILNYFRQFGGTVTRFRLSRNKRTGNSKGYAFVEFESEEVAKIAAETMNNYLLMEHLLE 123 (214)
T ss_pred CccCCccceeecccccchhHHHHhhhhhhcCCeeEEEEeecccccCCcCceEEEEeccHHHHHHHHHHhhhhhhhhheee
Confidence 344456778999999999999999999988 6788888888888999999999999999999999999999999999988
Q ss_pred EEec
Q 034021 100 VNYA 103 (106)
Q Consensus 100 v~~s 103 (106)
+.+=
T Consensus 124 c~vm 127 (214)
T KOG4208|consen 124 CHVM 127 (214)
T ss_pred eEEe
Confidence 8763
No 70
>KOG0146 consensus RNA-binding protein ETR-3 (RRM superfamily) [RNA processing and modification]
Probab=99.25 E-value=2.3e-11 Score=77.68 Aligned_cols=80 Identities=33% Similarity=0.529 Sum_probs=71.7
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce-e--CCeEEEEE
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL-L--DGRNIRVN 101 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~-~--~g~~l~v~ 101 (106)
+.+++||+.|.+.-.+++++++|..||.+.+|.+.+.. +|..+|++||.|.+..++++|+..|+|-. + -+.-|.|+
T Consensus 18 ~drklfvgml~kqq~e~dvrrlf~pfG~~~e~tvlrg~-dg~sKGCAFVKf~s~~eAqaAI~aLHgSqTmpGASSSLVVK 96 (371)
T KOG0146|consen 18 DDRKLFVGMLNKQQSEDDVRRLFQPFGNIEECTVLRGP-DGNSKGCAFVKFSSHAEAQAAINALHGSQTMPGASSSLVVK 96 (371)
T ss_pred cchhhhhhhhcccccHHHHHHHhcccCCcceeEEecCC-CCCCCCceEEEeccchHHHHHHHHhcccccCCCCccceEEE
Confidence 67899999999999999999999999999999999987 88999999999999999999999999874 3 33568888
Q ss_pred eccC
Q 034021 102 YAHK 105 (106)
Q Consensus 102 ~s~~ 105 (106)
++.-
T Consensus 97 ~ADT 100 (371)
T KOG0146|consen 97 FADT 100 (371)
T ss_pred eccc
Confidence 8754
No 71
>KOG1548 consensus Transcription elongation factor TAT-SF1 [Transcription]
Probab=99.23 E-value=8.5e-11 Score=77.10 Aligned_cols=82 Identities=24% Similarity=0.427 Sum_probs=73.2
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCee--------EEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCcee
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEII--------EVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLL 93 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~--------~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~ 93 (106)
.+..+..|||+|||.++|.+++.++|+++|-|. .|++.++. .|..+|-|++.|...+++.-|+..|++..+
T Consensus 130 ~~~~Nt~VYVsgLP~DiT~dE~~~~~sKcGiI~~d~~t~epk~KlYrd~-~G~lKGDaLc~y~K~ESVeLA~~ilDe~~~ 208 (382)
T KOG1548|consen 130 EPKVNTSVYVSGLPLDITVDEFAEVMSKCGIIMRDPQTGEPKVKLYRDN-QGKLKGDALCCYIKRESVELAIKILDEDEL 208 (382)
T ss_pred ccccCceEEecCCCCcccHHHHHHHHHhcceEeccCCCCCeeEEEEecC-CCCccCceEEEeecccHHHHHHHHhCcccc
Confidence 344567799999999999999999999999763 36787776 599999999999999999999999999999
Q ss_pred CCeEEEEEecc
Q 034021 94 DGRNIRVNYAH 104 (106)
Q Consensus 94 ~g~~l~v~~s~ 104 (106)
.|+.|+|+.|+
T Consensus 209 rg~~~rVerAk 219 (382)
T KOG1548|consen 209 RGKKLRVERAK 219 (382)
T ss_pred cCcEEEEehhh
Confidence 99999999875
No 72
>KOG0533 consensus RRM motif-containing protein [RNA processing and modification]
Probab=99.21 E-value=1.5e-10 Score=73.67 Aligned_cols=82 Identities=32% Similarity=0.528 Sum_probs=74.4
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.+....+|+|.|||..+.+++|+++|..||.+..+.+.+++ .|.+.|.|-|.|...++|.++++.++++.++|+.|++.
T Consensus 79 ~~~~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vhy~~-~G~s~Gta~v~~~r~~DA~~avk~~~gv~ldG~~mk~~ 157 (243)
T KOG0533|consen 79 NETRSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVHYDR-AGRSLGTADVSFNRRDDAERAVKKYNGVALDGRPMKIE 157 (243)
T ss_pred cCCCcceeeeecCCcCcchHHHHHHHHHhccceEEeeccCC-CCCCCccceeeecchHhHHHHHHHhcCcccCCceeeeE
Confidence 44456889999999999999999999999988888888887 78889999999999999999999999999999999988
Q ss_pred ecc
Q 034021 102 YAH 104 (106)
Q Consensus 102 ~s~ 104 (106)
...
T Consensus 158 ~i~ 160 (243)
T KOG0533|consen 158 IIS 160 (243)
T ss_pred Eec
Confidence 754
No 73
>KOG0132 consensus RNA polymerase II C-terminal domain-binding protein RA4, contains RPR and RRM domains [RNA processing and modification; Transcription]
Probab=99.21 E-value=6.4e-11 Score=84.28 Aligned_cols=73 Identities=25% Similarity=0.525 Sum_probs=67.7
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
.+||||++|+..+++.+|...|+.||.|..|.+... +++|||......+|.+|+..|..+.+.++.|++.|+-
T Consensus 421 SrTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~~------R~cAfI~M~~RqdA~kalqkl~n~kv~~k~Iki~Wa~ 493 (894)
T KOG0132|consen 421 SRTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIPP------RGCAFIKMVRRQDAEKALQKLSNVKVADKTIKIAWAV 493 (894)
T ss_pred eeeeeeccccchhhHHHHHHHHHhcccceeEeeccC------CceeEEEEeehhHHHHHHHHHhcccccceeeEEeeec
Confidence 489999999999999999999999999999977543 5799999999999999999999999999999999973
No 74
>KOG4205 consensus RNA-binding protein musashi/mRNA cleavage and polyadenylation factor I complex, subunit HRP1 [RNA processing and modification]
Probab=99.18 E-value=4e-11 Score=78.84 Aligned_cols=80 Identities=40% Similarity=0.681 Sum_probs=72.4
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
..++++|++|+++++++.|++.|..||.+..+.+++++.++..++++||+|.+.+...+++. ...+.++|+.|.+..+.
T Consensus 5 ~~~KlfiGgisw~ttee~Lr~yf~~~Gev~d~~vm~d~~t~rsrgFgfv~f~~~~~v~~vl~-~~~h~~dgr~ve~k~av 83 (311)
T KOG4205|consen 5 ESGKLFIGGLSWETTEESLREYFSQFGEVTDCVVMRDPSTGRSRGFGFVTFATPEGVDAVLN-ARTHKLDGRSVEPKRAV 83 (311)
T ss_pred CCcceeecCcCccccHHHHHHHhcccCceeeEEEeccCCCCCcccccceecCCCcchheeec-ccccccCCccccceecc
Confidence 67899999999999999999999999999999999999999999999999999999998888 36677888888777654
Q ss_pred C
Q 034021 105 K 105 (106)
Q Consensus 105 ~ 105 (106)
+
T Consensus 84 ~ 84 (311)
T KOG4205|consen 84 S 84 (311)
T ss_pred C
Confidence 3
No 75
>KOG1457 consensus RNA binding protein (contains RRM repeats) [General function prediction only]
Probab=99.18 E-value=7.3e-10 Score=69.23 Aligned_cols=85 Identities=18% Similarity=0.322 Sum_probs=67.8
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeec-CCCCCceeEEEEEecCHHHHHHHHHHhCCceeC---CeE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICD-RVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLD---GRN 97 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~-~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~---g~~ 97 (106)
+...-+||||++||.++...+|.-+|..|..-+.+.+... +....++-+||+.|.+.+.|.+|+..|||+.++ +..
T Consensus 30 ~~~~VRTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~~~~~~pvaFatF~s~q~A~aamnaLNGvrFDpE~~st 109 (284)
T KOG1457|consen 30 EPGAVRTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKGDQVCKPVAFATFTSHQFALAAMNALNGVRFDPETGST 109 (284)
T ss_pred cccccceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCCCccccceEEEEecchHHHHHHHHHhcCeeeccccCce
Confidence 3444799999999999999999999998865544444332 223345679999999999999999999999985 678
Q ss_pred EEEEeccCC
Q 034021 98 IRVNYAHKD 106 (106)
Q Consensus 98 l~v~~s~~~ 106 (106)
|++++++.+
T Consensus 110 LhiElAKSN 118 (284)
T KOG1457|consen 110 LHIELAKSN 118 (284)
T ss_pred eEeeehhcC
Confidence 999988754
No 76
>KOG0116 consensus RasGAP SH3 binding protein rasputin, contains NTF2 and RRM domains [Signal transduction mechanisms]
Probab=99.16 E-value=1.6e-10 Score=78.59 Aligned_cols=78 Identities=28% Similarity=0.451 Sum_probs=67.4
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
..+|||.|||.+++..+|+++|..||.|....|......++...||||+|.+...++.++.+ +...++++++.|+.-+
T Consensus 288 ~~~i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~~~~~~~~fgFV~f~~~~~~~~~i~A-sp~~ig~~kl~Veek~ 365 (419)
T KOG0116|consen 288 GLGIFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRSPGGKNPCFGFVEFENAAAVQNAIEA-SPLEIGGRKLNVEEKR 365 (419)
T ss_pred ccceEeecCCCCCCHHHHHHHHhhcccccccceEEeccCCCcCceEEEEEeecchhhhhhhc-CccccCCeeEEEEecc
Confidence 35599999999999999999999999999998876543444448999999999999999996 6888999999998643
No 77
>KOG4211 consensus Splicing factor hnRNP-F and related RNA-binding proteins [RNA processing and modification]
Probab=99.13 E-value=5.6e-10 Score=76.13 Aligned_cols=80 Identities=25% Similarity=0.386 Sum_probs=68.4
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
+......|.+++|||++|+++|++||+.++ |..+.+.+. +|+..|-|||+|.+.+++++|++ .+...++.+.|.|.
T Consensus 6 e~~~~~~vr~rGLPwsat~~ei~~Ff~~~~-I~~~~~~r~--~Gr~sGeA~Ve~~seedv~~Alk-kdR~~mg~RYIEVf 81 (510)
T KOG4211|consen 6 EGSTAFEVRLRGLPWSATEKEILDFFSNCG-IENLEIPRR--NGRPSGEAYVEFTSEEDVEKALK-KDRESMGHRYIEVF 81 (510)
T ss_pred CCCcceEEEecCCCccccHHHHHHHHhcCc-eeEEEEecc--CCCcCcceEEEeechHHHHHHHH-hhHHHhCCceEEEE
Confidence 444567788999999999999999999875 777666554 68889999999999999999999 58888999999998
Q ss_pred eccC
Q 034021 102 YAHK 105 (106)
Q Consensus 102 ~s~~ 105 (106)
-+..
T Consensus 82 ~~~~ 85 (510)
T KOG4211|consen 82 TAGG 85 (510)
T ss_pred ccCC
Confidence 7643
No 78
>KOG4212 consensus RNA-binding protein hnRNP-M [RNA processing and modification]
Probab=99.10 E-value=5.8e-10 Score=75.48 Aligned_cols=80 Identities=33% Similarity=0.555 Sum_probs=69.9
Q ss_pred cccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEE
Q 034021 19 LLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNI 98 (106)
Q Consensus 19 ~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l 98 (106)
........++|+|+|||.+.|...|++-|..+|.+....|+ ++++.+| .|.|.+.++|+.|+..+++..++|+.|
T Consensus 529 a~gaarKa~qIiirNlP~dfTWqmlrDKfre~G~v~yadim---e~GkskG--VVrF~s~edAEra~a~Mngs~l~Gr~I 603 (608)
T KOG4212|consen 529 AVGAARKACQIIIRNLPFDFTWQMLRDKFREIGHVLYADIM---ENGKSKG--VVRFFSPEDAERACALMNGSRLDGRNI 603 (608)
T ss_pred cccccccccEEEEecCCccccHHHHHHHHHhccceehhhhh---ccCCccc--eEEecCHHHHHHHHHHhccCcccCcee
Confidence 34455567899999999999999999999999999988883 3566654 899999999999999999999999999
Q ss_pred EEEec
Q 034021 99 RVNYA 103 (106)
Q Consensus 99 ~v~~s 103 (106)
.|.|.
T Consensus 604 ~V~y~ 608 (608)
T KOG4212|consen 604 KVTYF 608 (608)
T ss_pred eeeeC
Confidence 99873
No 79
>KOG4660 consensus Protein Mei2, essential for commitment to meiosis, and related proteins [Cell cycle control, cell division, chromosome partitioning]
Probab=99.06 E-value=2.2e-10 Score=78.98 Aligned_cols=72 Identities=28% Similarity=0.412 Sum_probs=64.3
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
.-...+|+|-|||.++++++|++.|+.||.|+.++- +....++.||+|.|..+|++|+++|++.++.|++++
T Consensus 72 ~~~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir~-----t~~~~~~~~v~FyDvR~A~~Alk~l~~~~~~~~~~k 143 (549)
T KOG4660|consen 72 DMNQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIRE-----TPNKRGIVFVEFYDVRDAERALKALNRREIAGKRIK 143 (549)
T ss_pred cCccceEEEEecCCcCCHHHHHHHHHhhcchhhhhc-----ccccCceEEEEEeehHhHHHHHHHHHHHHhhhhhhc
Confidence 456789999999999999999999999999999644 344578999999999999999999999999998876
No 80
>KOG4661 consensus Hsp27-ERE-TATA-binding protein/Scaffold attachment factor (SAF-B) [Transcription]
Probab=99.04 E-value=1.3e-09 Score=76.08 Aligned_cols=82 Identities=32% Similarity=0.504 Sum_probs=74.1
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
...+.+||++|...+...+|+.+|++||.|...+++...-+...+.|+||.+.+..+|.+||..|+..++.|+.|.|+.+
T Consensus 403 ~~gRNlWVSGLSstTRAtDLKnlFSKyGKVvGAKVVTNaRsPGaRCYGfVTMSts~eAtkCI~hLHrTELHGrmISVEka 482 (940)
T KOG4661|consen 403 TLGRNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNARSPGARCYGFVTMSTSAEATKCIEHLHRTELHGRMISVEKA 482 (940)
T ss_pred ccccceeeeccccchhhhHHHHHHHHhcceeceeeeecCCCCCcceeEEEEecchHHHHHHHHHhhhhhhcceeeeeeec
Confidence 34688999999999999999999999999999988877656667889999999999999999999999999999999987
Q ss_pred cC
Q 034021 104 HK 105 (106)
Q Consensus 104 ~~ 105 (106)
+-
T Consensus 483 KN 484 (940)
T KOG4661|consen 483 KN 484 (940)
T ss_pred cc
Confidence 63
No 81
>KOG0151 consensus Predicted splicing regulator, contains RRM, SWAP and RPR domains [General function prediction only]
Probab=99.03 E-value=1.8e-09 Score=76.58 Aligned_cols=84 Identities=27% Similarity=0.566 Sum_probs=74.0
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCC---CCceeEEEEEecCHHHHHHHHHHhCCceeCCeE
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVT---GQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRN 97 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~---~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~ 97 (106)
..++..+.+||+||+..++++.|...|..||++..++++..... .....++||.|.+..++++|++.|+|..+.+..
T Consensus 169 dgDP~TTNlyv~Nlnpsv~E~~ll~tfGrfgPlasvKimwpRtEeEk~r~r~cgfvafmnR~D~era~k~lqg~iv~~~e 248 (877)
T KOG0151|consen 169 DGDPQTTNLYVGNLNPSVDENFLLRTFGRFGPLASVKIMWPRTEEEKRRERNCGFVAFMNRADAERALKELQGIIVMEYE 248 (877)
T ss_pred CCCCcccceeeecCCccccHHHHHHHhcccCcccceeeecccchhhhccccccceeeehhhhhHHHHHHHhcceeeeeee
Confidence 33556789999999999999999999999999999988877632 234679999999999999999999999999999
Q ss_pred EEEEecc
Q 034021 98 IRVNYAH 104 (106)
Q Consensus 98 l~v~~s~ 104 (106)
+++.|++
T Consensus 249 ~K~gWgk 255 (877)
T KOG0151|consen 249 MKLGWGK 255 (877)
T ss_pred eeecccc
Confidence 9999885
No 82
>KOG4205 consensus RNA-binding protein musashi/mRNA cleavage and polyadenylation factor I complex, subunit HRP1 [RNA processing and modification]
Probab=99.02 E-value=7.1e-10 Score=73.02 Aligned_cols=79 Identities=34% Similarity=0.595 Sum_probs=73.0
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
..+++|++||.++++++++++|.+||.|..+.++.+..+...++++||.|.+.+++.+++. ..-+.+.++.+.|..|-|
T Consensus 97 tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~~~~~~d~~~~~~rgFgfv~~~~e~sVdkv~~-~~f~~~~gk~vevkrA~p 175 (311)
T KOG4205|consen 97 TKKIFVGGLPPDTTEEDFKDYFEQFGKVADVVIMYDKTTSRPRGFGFVTFDSEDSVDKVTL-QKFHDFNGKKVEVKRAIP 175 (311)
T ss_pred eeEEEecCcCCCCchHHHhhhhhccceeEeeEEeecccccccccceeeEeccccccceecc-cceeeecCceeeEeeccc
Confidence 4589999999999999999999999999999999999899999999999999999999998 577789999999988755
No 83
>KOG4209 consensus Splicing factor RNPS1, SR protein superfamily [RNA processing and modification]
Probab=98.97 E-value=2.2e-09 Score=68.28 Aligned_cols=83 Identities=25% Similarity=0.426 Sum_probs=75.9
Q ss_pred ccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 20 LSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 20 ~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
.....+...+||+|+.+.++.++++..|+.+|.+..+.+..+...+..+|++||+|.+.+.++.++. |++-.+.|..+.
T Consensus 95 ~~~~~d~~sv~v~nvd~~~t~~~~e~hf~~Cg~i~~~ti~~d~~~~~~k~~~yvef~~~~~~~~ay~-l~gs~i~~~~i~ 173 (231)
T KOG4209|consen 95 RQKEVDAPSVWVGNVDFLVTLTKIELHFESCGGINRVTVPKDKFRGHPKGFAYVEFSSYELVEEAYK-LDGSEIPGPAIE 173 (231)
T ss_pred hhhccCCceEEEeccccccccchhhheeeccCCccceeeeccccCCCcceeEEEecccHhhhHHHhh-cCCcccccccce
Confidence 3556678899999999999999999999999999999999888788899999999999999999999 999999999998
Q ss_pred EEec
Q 034021 100 VNYA 103 (106)
Q Consensus 100 v~~s 103 (106)
|.+-
T Consensus 174 vt~~ 177 (231)
T KOG4209|consen 174 VTLK 177 (231)
T ss_pred eeee
Confidence 8864
No 84
>KOG0226 consensus RNA-binding proteins [General function prediction only]
Probab=98.93 E-value=1.4e-09 Score=68.97 Aligned_cols=83 Identities=34% Similarity=0.561 Sum_probs=75.1
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEE
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRV 100 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v 100 (106)
+.+.+...||.+-|..+++.+-+-+.|.+|-....-.++++.-+++.+||+||.|.+..++..|+++++|..++++.|++
T Consensus 185 ew~~~DfRIfcgdlgNevnd~vl~raf~Kfpsf~~akviRdkRTgKSkgygfVSf~~pad~~rAmrem~gkyVgsrpikl 264 (290)
T KOG0226|consen 185 EWDEDDFRIFCGDLGNEVNDDVLARAFKKFPSFQKAKVIRDKRTGKSKGYGFVSFRDPADYVRAMREMNGKYVGSRPIKL 264 (290)
T ss_pred cCccccceeecccccccccHHHHHHHHHhccchhhccccccccccccccceeeeecCHHHHHHHHHhhcccccccchhHh
Confidence 34556789999999999999999999999988888888999889999999999999999999999999999999999887
Q ss_pred Eec
Q 034021 101 NYA 103 (106)
Q Consensus 101 ~~s 103 (106)
+.+
T Consensus 265 RkS 267 (290)
T KOG0226|consen 265 RKS 267 (290)
T ss_pred hhh
Confidence 643
No 85
>KOG0106 consensus Alternative splicing factor SRp55/B52/SRp75 (RRM superfamily) [RNA processing and modification]
Probab=98.92 E-value=1.5e-09 Score=67.92 Aligned_cols=70 Identities=30% Similarity=0.578 Sum_probs=63.6
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
..+||++||+.+.+.++.++|..||.+..+.+. .+|+||+|.+..+|..|+..+++..+.+-++.|+++.
T Consensus 2 ~rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~mk--------~gf~fv~fed~rda~Dav~~l~~~~l~~e~~vve~~r 71 (216)
T KOG0106|consen 2 PRVYIGRLPYRARERDVERFFKGYGKIPDADMK--------NGFGFVEFEDPRDADDAVHDLDGKELCGERLVVEHAR 71 (216)
T ss_pred CceeecccCCccchhHHHHHHhhccccccceee--------cccceeccCchhhhhcccchhcCceecceeeeeeccc
Confidence 368999999999999999999999999988763 3588999999999999999999999999889988876
No 86
>KOG1190 consensus Polypyrimidine tract-binding protein [RNA processing and modification]
Probab=98.88 E-value=1.8e-08 Score=67.83 Aligned_cols=74 Identities=22% Similarity=0.441 Sum_probs=67.1
Q ss_pred CceEEEcCCCCC-CCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEecc
Q 034021 26 RTKLFIGGLSYD-TNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAH 104 (106)
Q Consensus 26 ~~~l~v~~lp~~-~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~ 104 (106)
+..|-|.||... +|.+.|..+|+.||+|.+++++..+. ..|+|+|.+...|+-|+..|+|..+.|++|+|.+|+
T Consensus 297 n~vllvsnln~~~VT~d~LftlFgvYGdVqRVkil~nkk-----d~ALIQmsd~~qAqLA~~hL~g~~l~gk~lrvt~SK 371 (492)
T KOG1190|consen 297 NVVLLVSNLNEEAVTPDVLFTLFGVYGDVQRVKILYNKK-----DNALIQMSDGQQAQLAMEHLEGHKLYGKKLRVTLSK 371 (492)
T ss_pred ceEEEEecCchhccchhHHHHHHhhhcceEEEEeeecCC-----cceeeeecchhHHHHHHHHhhcceecCceEEEeecc
Confidence 577888888655 89999999999999999999998763 479999999999999999999999999999999986
No 87
>KOG4454 consensus RNA binding protein (RRM superfamily) [General function prediction only]
Probab=98.80 E-value=2.3e-09 Score=66.84 Aligned_cols=79 Identities=18% Similarity=0.262 Sum_probs=70.7
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.....++|||.|+-..++++-|.++|-.-|+|..+.|.... +++.+ ||||.|.++.+.+-|+..+||..+.+..++++
T Consensus 5 aae~drtl~v~n~~~~v~eelL~ElfiqaGPV~kv~ip~~~-d~~~k-Fa~v~f~~E~sv~~a~~L~ng~~l~~~e~q~~ 82 (267)
T KOG4454|consen 5 AAEMDRTLLVQNMYSGVSEELLSELFIQAGPVYKVGIPSGQ-DQEQK-FAYVFFPNENSVQLAGQLENGDDLEEDEEQRT 82 (267)
T ss_pred CcchhhHHHHHhhhhhhhHHHHHHHhhccCceEEEeCCCCc-cCCCc-eeeeecccccchhhhhhhcccchhccchhhcc
Confidence 34457899999999999999999999999999999887776 67777 99999999999999999999999999888877
Q ss_pred e
Q 034021 102 Y 102 (106)
Q Consensus 102 ~ 102 (106)
+
T Consensus 83 ~ 83 (267)
T KOG4454|consen 83 L 83 (267)
T ss_pred c
Confidence 5
No 88
>KOG4211 consensus Splicing factor hnRNP-F and related RNA-binding proteins [RNA processing and modification]
Probab=98.80 E-value=2.6e-08 Score=68.15 Aligned_cols=78 Identities=21% Similarity=0.295 Sum_probs=64.1
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeE-EEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIE-VKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~-~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
....|.+++||+.+++++|.+||+..--+.. +-++.+. .+...|-|||+|.+.+.++.|+.. |...|+.+.|.|..|
T Consensus 102 ~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d~-rgR~tGEAfVqF~sqe~ae~Al~r-hre~iGhRYIEvF~S 179 (510)
T KOG4211|consen 102 NDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMDQ-RGRPTGEAFVQFESQESAEIALGR-HRENIGHRYIEVFRS 179 (510)
T ss_pred CCceEEecCCCccCcHHHHHHHhcCCcccccceeeeccC-CCCcccceEEEecCHHHHHHHHHH-HHHhhccceEEeehh
Confidence 4578999999999999999999997643333 4455555 666889999999999999999995 777889999998765
Q ss_pred c
Q 034021 104 H 104 (106)
Q Consensus 104 ~ 104 (106)
.
T Consensus 180 s 180 (510)
T KOG4211|consen 180 S 180 (510)
T ss_pred H
Confidence 4
No 89
>KOG0120 consensus Splicing factor U2AF, large subunit (RRM superfamily) [RNA processing and modification]
Probab=98.70 E-value=2.9e-08 Score=68.82 Aligned_cols=87 Identities=24% Similarity=0.489 Sum_probs=78.8
Q ss_pred cccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEE
Q 034021 19 LLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNI 98 (106)
Q Consensus 19 ~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l 98 (106)
.+.-......++|++||...++.++.+.+..||++....++.+..+|-++||+|.+|.+......|+..|||..++++++
T Consensus 282 ~t~~~~~~~ki~v~~lp~~l~~~q~~Ell~~fg~lk~f~lv~d~~~g~skg~af~ey~dpsvtd~A~agLnGm~lgd~~l 361 (500)
T KOG0120|consen 282 STDVPDSPNKIFVGGLPLYLTEDQVKELLDSFGPLKAFRLVKDSATGNSKGFAFCEYCDPSVTDQAIAGLNGMQLGDKKL 361 (500)
T ss_pred ccCcccccchhhhccCcCccCHHHHHHHHHhcccchhheeecccccccccceeeeeeeCCcchhhhhcccchhhhcCcee
Confidence 34445567889999999999999999999999999999999998888899999999999999999999999999999999
Q ss_pred EEEeccC
Q 034021 99 RVNYAHK 105 (106)
Q Consensus 99 ~v~~s~~ 105 (106)
.|+.+-.
T Consensus 362 vvq~A~~ 368 (500)
T KOG0120|consen 362 VVQRAIV 368 (500)
T ss_pred Eeehhhc
Confidence 9987643
No 90
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=98.65 E-value=2.2e-07 Score=49.85 Aligned_cols=68 Identities=22% Similarity=0.441 Sum_probs=46.2
Q ss_pred ceEEEcCCCCCCCHHH----HHHHhccCC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 27 TKLFIGGLSYDTNETV----LKDAFGQHG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~----l~~~f~~~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
..|+|.|||.+..... |++++..+| .|..+ . .+.|++.|.+.+.|.+|.+.|+|..+.|++|.|+
T Consensus 3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v-------~---~~tAilrF~~~~~A~RA~KRmegEdVfG~kI~v~ 72 (90)
T PF11608_consen 3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSV-------S---GGTAILRFPNQEFAERAQKRMEGEDVFGNKISVS 72 (90)
T ss_dssp EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE------------TT-EEEEESSHHHHHHHHHHHTT--SSSS--EEE
T ss_pred cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEE-------e---CCEEEEEeCCHHHHHHHHHhhcccccccceEEEE
Confidence 3589999999887655 555565665 56655 1 2479999999999999999999999999999999
Q ss_pred ecc
Q 034021 102 YAH 104 (106)
Q Consensus 102 ~s~ 104 (106)
+..
T Consensus 73 ~~~ 75 (90)
T PF11608_consen 73 FSP 75 (90)
T ss_dssp SS-
T ss_pred EcC
Confidence 873
No 91
>KOG0147 consensus Transcriptional coactivator CAPER (RRM superfamily) [Transcription]
Probab=98.61 E-value=1.2e-08 Score=70.46 Aligned_cols=85 Identities=25% Similarity=0.399 Sum_probs=76.5
Q ss_pred cccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEE
Q 034021 19 LLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNI 98 (106)
Q Consensus 19 ~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l 98 (106)
...++.+.+++++--++...+..+|.+||+.+|.|..+.++.+..++..+|.+||+|.+.+....|+. |.|..+.|-+|
T Consensus 172 l~~eERd~Rtvf~~qla~r~~pRdL~efFs~~gkVrdVriI~Dr~s~rskgi~Yvef~D~~sVp~aia-LsGqrllg~pv 250 (549)
T KOG0147|consen 172 LSPEERDQRTVFCMQLARRNPPRDLEEFFSIVGKVRDVRIIGDRNSRRSKGIAYVEFCDEQSVPLAIA-LSGQRLLGVPV 250 (549)
T ss_pred CCchHHhHHHHHHHHHhhcCCchhHHHHHHhhcCcceeEeeccccchhhcceeEEEEecccchhhHhh-hcCCcccCcee
Confidence 34555667888888888889999999999999999999999999999999999999999999999997 89999999999
Q ss_pred EEEecc
Q 034021 99 RVNYAH 104 (106)
Q Consensus 99 ~v~~s~ 104 (106)
.|..+.
T Consensus 251 ~vq~sE 256 (549)
T KOG0147|consen 251 IVQLSE 256 (549)
T ss_pred EecccH
Confidence 998754
No 92
>KOG4206 consensus Spliceosomal protein snRNP-U1A/U2B [RNA processing and modification]
Probab=98.61 E-value=4.4e-07 Score=56.79 Aligned_cols=78 Identities=26% Similarity=0.492 Sum_probs=69.0
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeC-CeEEEEE
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLD-GRNIRVN 101 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~-g~~l~v~ 101 (106)
......+++.|||..++.+.+..+|..|+....++++... .+.|||+|.+...+..|...+++..|. .+.++|.
T Consensus 143 ~ppn~ilf~~niP~es~~e~l~~lf~qf~g~keir~i~~~-----~~iAfve~~~d~~a~~a~~~lq~~~it~~~~m~i~ 217 (221)
T KOG4206|consen 143 APPNNILFLTNIPSESESEMLSDLFEQFPGFKEIRLIPPR-----SGIAFVEFLSDRQASAAQQALQGFKITKKNTMQIT 217 (221)
T ss_pred CCCceEEEEecCCcchhHHHHHHHHhhCcccceeEeccCC-----CceeEEecchhhhhHHHhhhhccceeccCceEEec
Confidence 4567889999999999999999999999989999887654 468999999999999999999999987 8899998
Q ss_pred eccC
Q 034021 102 YAHK 105 (106)
Q Consensus 102 ~s~~ 105 (106)
++++
T Consensus 218 ~a~K 221 (221)
T KOG4206|consen 218 FAKK 221 (221)
T ss_pred ccCC
Confidence 8864
No 93
>KOG1995 consensus Conserved Zn-finger protein [General function prediction only]
Probab=98.46 E-value=2.4e-07 Score=61.42 Aligned_cols=85 Identities=28% Similarity=0.384 Sum_probs=74.4
Q ss_pred cCCCCCceEEEcCCCCCCCHHHHHHHhccCCCee--------EEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce
Q 034021 21 SDANSRTKLFIGGLSYDTNETVLKDAFGQHGEII--------EVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~--------~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
.......++||-++|..+++.++.++|.++|.|. .+.+..+.++++.++-|.|.|.+...|++|+.-+++..
T Consensus 61 ~~~s~~~ti~v~g~~d~~~~~~~~~~f~qcg~ikrnK~t~kPki~~y~dkeT~~~KGeatvS~~D~~~akaai~~~agkd 140 (351)
T KOG1995|consen 61 ADKSDNETIFVWGCPDSVCENDNADFFLQCGVIKRNKRTGKPKIKIYTDKETGAPKGEATVSYEDPPAAKAAIEWFAGKD 140 (351)
T ss_pred ccccccccceeeccCccchHHHHHHHHhhcceeccCCCCCCcchhccccccccCcCCceeeeecChhhhhhhhhhhcccc
Confidence 3455678999999999999999999999998763 35677777899999999999999999999999999999
Q ss_pred eCCeEEEEEeccC
Q 034021 93 LDGRNIRVNYAHK 105 (106)
Q Consensus 93 ~~g~~l~v~~s~~ 105 (106)
+.++.|+|..+.+
T Consensus 141 f~gn~ikvs~a~~ 153 (351)
T KOG1995|consen 141 FCGNTIKVSLAER 153 (351)
T ss_pred ccCCCchhhhhhh
Confidence 9999999987653
No 94
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=98.45 E-value=9.3e-07 Score=49.92 Aligned_cols=69 Identities=28% Similarity=0.466 Sum_probs=43.2
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCC-----ceeCCeEEEEE
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDG-----LLLDGRNIRVN 101 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~-----~~~~g~~l~v~ 101 (106)
+.|.+.+++..++.++|++.|+.||.|..|.+.... ..|+|.|.+.+.|+.++..+.. ..+.+..+.++
T Consensus 2 ~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G~------~~g~VRf~~~~~A~~a~~~~~~~~~~~~~i~~~~~~~~ 75 (105)
T PF08777_consen 2 CILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRGD------TEGYVRFKTPEAAQKALEKLKEANDGKLKIKGKEVTLE 75 (105)
T ss_dssp -EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT-------SEEEEEESS---HHHHHHHHHHTTTS-B-TTSSSEEEE
T ss_pred eEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCCC------CEEEEEECCcchHHHHHHHHHhccCCceEEcCceEEEE
Confidence 468899999999999999999999999999886543 2799999999999999986533 34555555554
No 95
>KOG4307 consensus RNA binding protein RBM12/SWAN [General function prediction only]
Probab=98.43 E-value=1.8e-06 Score=61.84 Aligned_cols=76 Identities=17% Similarity=0.243 Sum_probs=65.5
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCe-eEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEI-IEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i-~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
.+.|-+.|+|++++-++|.+||..|-.+ ..|.+.+. ..|+..|.+.|-|++.++|.+|...|++..|.++.+++.+
T Consensus 867 p~V~~~~n~Pf~v~l~dI~~FF~dY~~~p~sI~~r~n-d~G~pTGe~mvAfes~~eAr~A~~dl~~~~i~nr~V~l~i 943 (944)
T KOG4307|consen 867 PRVLSCNNFPFDVTLEDIVEFFNDYEPDPNSIRIRRN-DDGVPTGECMVAFESQEEARRASMDLDGQKIRNRVVSLRI 943 (944)
T ss_pred CeEEEecCCCccccHHHHHHHhcccccCCCceeEeec-CCCCcccceeEeecCHHHHHhhhhccccCcccceeEEEEe
Confidence 3478889999999999999999999754 44444444 4799999999999999999999999999999999998865
No 96
>KOG1365 consensus RNA-binding protein Fusilli, contains RRM domain [RNA processing and modification; General function prediction only]
Probab=98.39 E-value=5.4e-07 Score=60.53 Aligned_cols=81 Identities=21% Similarity=0.345 Sum_probs=66.4
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCC-CeeE--EEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHG-EIIE--VKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g-~i~~--~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
.....+|.+++||+..+.++|.+||..|- .|.. +-++... .|...|-|||+|.+.+.+.+|....++....++.|.
T Consensus 277 ~~~kdcvRLRGLPy~AtvEdIL~FlgdFa~~i~f~gVHmv~N~-qGrPSGeAFIqm~nae~a~aaaqk~hk~~mk~RYiE 355 (508)
T KOG1365|consen 277 TRSKDCVRLRGLPYEATVEDILDFLGDFATDIRFQGVHMVLNG-QGRPSGEAFIQMRNAERARAAAQKCHKKLMKSRYIE 355 (508)
T ss_pred CCCCCeeEecCCChhhhHHHHHHHHHHHhhhcccceeEEEEcC-CCCcChhhhhhhhhhHHHHHHHHHHHHhhcccceEE
Confidence 33467899999999999999999999885 3444 5444444 577889999999999999999999888888899998
Q ss_pred EEecc
Q 034021 100 VNYAH 104 (106)
Q Consensus 100 v~~s~ 104 (106)
|..+.
T Consensus 356 vfp~S 360 (508)
T KOG1365|consen 356 VFPCS 360 (508)
T ss_pred Eeecc
Confidence 87653
No 97
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=98.33 E-value=2.8e-06 Score=56.42 Aligned_cols=78 Identities=23% Similarity=0.468 Sum_probs=60.9
Q ss_pred CceEEEcCCCCCCCHHH------HHHHhccCCCeeEEEEeecCCC-CCceeE--EEEEecCHHHHHHHHHHhCCceeCCe
Q 034021 26 RTKLFIGGLSYDTNETV------LKDAFGQHGEIIEVKVICDRVT-GQSRGY--GFVKFASEAAAGEAIKEMDGLLLDGR 96 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~------l~~~f~~~g~i~~~~~~~~~~~-~~~~g~--~fv~f~~~~~a~~al~~l~~~~~~g~ 96 (106)
..-+||-+||..+..++ -.++|.+||.|..+.+-+.... ....++ .||.|.+.++|.+|+...+|..++|+
T Consensus 114 KNLvYVigi~pkva~Ee~~~vLk~~eyFGQyGkI~KIvvNkkt~s~nst~~h~gvYITy~~kedAarcIa~vDgs~~DGr 193 (480)
T COG5175 114 KNLVYVIGIPPKVADEEVAPVLKRHEYFGQYGKIKKIVVNKKTSSLNSTASHAGVYITYSTKEDAARCIAEVDGSLLDGR 193 (480)
T ss_pred cceeEEecCCCCCCcccccccccchhhhhhccceeEEEecccccccccccccceEEEEecchHHHHHHHHHhccccccCc
Confidence 45689999998877655 3578999999999877544311 111222 49999999999999999999999999
Q ss_pred EEEEEec
Q 034021 97 NIRVNYA 103 (106)
Q Consensus 97 ~l~v~~s 103 (106)
.|+..|.
T Consensus 194 ~lkatYG 200 (480)
T COG5175 194 VLKATYG 200 (480)
T ss_pred eEeeecC
Confidence 9998875
No 98
>KOG3152 consensus TBP-binding protein, activator of basal transcription (contains rrm motif) [Transcription]
Probab=98.31 E-value=3.6e-07 Score=58.21 Aligned_cols=73 Identities=16% Similarity=0.277 Sum_probs=61.7
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCC--------CC----ceeEEEEEecCHHHHHHHHHHhCCce
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVT--------GQ----SRGYGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~--------~~----~~g~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
..-.||+++||+.+...-|+++|+.||.|-.+.+.....+ |. ...-|+|+|.+...|..+...||+..
T Consensus 73 k~GVvylS~IPp~m~~~rlReil~~yGeVGRvylqpE~~s~~~~r~~~~~n~~~~y~EGWvEF~~KrvAK~iAe~Lnn~~ 152 (278)
T KOG3152|consen 73 KTGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQPEDDSKRAARKRKGGNYKKLYSEGWVEFISKRVAKRIAELLNNTP 152 (278)
T ss_pred cceEEEeccCCCccCHHHHHHHHHhccccceEEecchhhHHHHHHhhcCCCccccchhHHHHHHHHHHHHHHHHHhCCCc
Confidence 4468999999999999999999999999999988665433 11 22348999999999999999999999
Q ss_pred eCCeE
Q 034021 93 LDGRN 97 (106)
Q Consensus 93 ~~g~~ 97 (106)
|+|++
T Consensus 153 Iggkk 157 (278)
T KOG3152|consen 153 IGGKK 157 (278)
T ss_pred cCCCC
Confidence 99864
No 99
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=98.31 E-value=7e-06 Score=57.66 Aligned_cols=79 Identities=25% Similarity=0.386 Sum_probs=63.2
Q ss_pred CCCCCceEEEcCCCCCCCH------HHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCC
Q 034021 22 DANSRTKLFIGGLSYDTNE------TVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDG 95 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~------~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g 95 (106)
.+.-...|.|.|+|---.. .-|.++|+++|++....++.+..+ ..+||.|++|.+..+|+.|++.|||+.++-
T Consensus 54 ~eg~D~vVvv~g~PvV~~~rl~klk~vl~kvfsk~gk~vn~~~P~~e~g-gtkG~lf~E~~~~~~A~~aVK~l~G~~ldk 132 (698)
T KOG2314|consen 54 AEGFDSVVVVDGAPVVGPARLEKLKKVLTKVFSKAGKIVNMYYPIDEEG-GTKGYLFVEYASMRDAKKAVKSLNGKRLDK 132 (698)
T ss_pred cCCcceEEEECCCcccChhHHHHHHHHHHHHHHhhccccceeeccCccC-CeeeEEEEEecChhhHHHHHHhcccceecc
Confidence 3355678889999865332 446778999999999999888744 489999999999999999999999998754
Q ss_pred -eEEEEE
Q 034021 96 -RNIRVN 101 (106)
Q Consensus 96 -~~l~v~ 101 (106)
++..|.
T Consensus 133 nHtf~v~ 139 (698)
T KOG2314|consen 133 NHTFFVR 139 (698)
T ss_pred cceEEee
Confidence 455544
No 100
>KOG1457 consensus RNA binding protein (contains RRM repeats) [General function prediction only]
Probab=98.30 E-value=1.3e-06 Score=54.99 Aligned_cols=64 Identities=25% Similarity=0.371 Sum_probs=54.6
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeC
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLD 94 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~ 94 (106)
.++||.||..++++++|+.+|+.|.....+++... +|. ..+|++|++.+.|..|+..|+|..+.
T Consensus 211 stlfianl~~~~~ed~l~~~~~~~~gf~~l~~~~~--~g~--~vaf~~~~~~~~at~am~~lqg~~~s 274 (284)
T KOG1457|consen 211 STLFIANLGPNCTEDELKQLLSRYPGFHILKIRAR--GGM--PVAFADFEEIEQATDAMNHLQGNLLS 274 (284)
T ss_pred hhHhhhccCCCCCHHHHHHHHHhCCCceEEEEecC--CCc--ceEeecHHHHHHHHHHHHHhhcceec
Confidence 68999999999999999999999987777776432 333 37999999999999999999998763
No 101
>KOG4849 consensus mRNA cleavage factor I subunit/CPSF subunit [RNA processing and modification]
Probab=98.27 E-value=1.1e-06 Score=58.47 Aligned_cols=79 Identities=23% Similarity=0.429 Sum_probs=68.7
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCC--CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHG--EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g--~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
-+.....+||+||-|.+|+++|.+.....| .+..++++..+.+|.++|||++...+....++.++.|...+|.|..-.
T Consensus 76 ~~Grk~~~YvGNL~W~TTD~DL~~A~~S~G~~~~~dmKFFENR~NGQSKG~AL~~~~SdAa~Kq~MeiLP~k~iHGQ~P~ 155 (498)
T KOG4849|consen 76 SEGRKYCCYVGNLLWYTTDADLLKALQSTGLAQFADMKFFENRTNGQSKGYALLVLNSDAAVKQTMEILPTKTIHGQSPT 155 (498)
T ss_pred ccCceEEEEecceeEEeccHHHHHHHHhhhHHHHhhhhhhhcccCCcccceEEEEecchHHHHHHHHhcccceecCCCCe
Confidence 344568899999999999999999998777 578888888888999999999999999999999999999999886544
Q ss_pred E
Q 034021 100 V 100 (106)
Q Consensus 100 v 100 (106)
|
T Consensus 156 V 156 (498)
T KOG4849|consen 156 V 156 (498)
T ss_pred e
Confidence 4
No 102
>KOG1456 consensus Heterogeneous nuclear ribonucleoprotein L (contains RRM repeats) [RNA processing and modification]
Probab=98.18 E-value=1.8e-05 Score=53.32 Aligned_cols=70 Identities=19% Similarity=0.314 Sum_probs=60.1
Q ss_pred cCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCC--eEEEEEeccCC
Q 034021 32 GGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDG--RNIRVNYAHKD 106 (106)
Q Consensus 32 ~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g--~~l~v~~s~~~ 106 (106)
-|--..++-+-|..++.+.|+|.++.|++. +|. -|.|+|++.+.|++|-..|||..|.. +.|+|+||+++
T Consensus 128 lNp~YpItvDVly~Icnp~GkVlRIvIfkk--ngV---QAmVEFdsv~~AqrAk~alNGADIYsGCCTLKIeyAkP~ 199 (494)
T KOG1456|consen 128 LNPQYPITVDVLYTICNPQGKVLRIVIFKK--NGV---QAMVEFDSVEVAQRAKAALNGADIYSGCCTLKIEYAKPT 199 (494)
T ss_pred ecCccccchhhhhhhcCCCCceEEEEEEec--cce---eeEEeechhHHHHHHHhhcccccccccceeEEEEecCcc
Confidence 354566888999999999999999988766 454 59999999999999999999999865 68999999885
No 103
>KOG4210 consensus Nuclear localization sequence binding protein [Transcription]
Probab=98.15 E-value=1.9e-06 Score=56.63 Aligned_cols=80 Identities=25% Similarity=0.521 Sum_probs=71.5
Q ss_pred CCceEE-EcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 25 SRTKLF-IGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 25 ~~~~l~-v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
...+++ |.+++.+++.++|...|..+|.|..+++...+.++..++++++.|........++.. +...+.++.+.+.+.
T Consensus 183 ~s~~~~~~~~~~f~~~~d~~~~~~~~~~~i~~~r~~~~~~s~~~kg~a~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 261 (285)
T KOG4210|consen 183 PSDTIFFVGELDFSLTRDDLKEHFVSSGEITSVRLPTDEESGDSKGFAYVDFSAGNSKKLALND-QTRSIGGRPLRLEED 261 (285)
T ss_pred ccccceeecccccccchHHHhhhccCcCcceeeccCCCCCccchhhhhhhhhhhchhHHHHhhc-ccCcccCcccccccC
Confidence 345566 999999999999999999999999999999988999999999999999999999986 788899999998876
Q ss_pred cC
Q 034021 104 HK 105 (106)
Q Consensus 104 ~~ 105 (106)
.+
T Consensus 262 ~~ 263 (285)
T KOG4210|consen 262 EP 263 (285)
T ss_pred CC
Confidence 54
No 104
>KOG0106 consensus Alternative splicing factor SRp55/B52/SRp75 (RRM superfamily) [RNA processing and modification]
Probab=98.14 E-value=2.9e-06 Score=53.34 Aligned_cols=75 Identities=31% Similarity=0.526 Sum_probs=63.7
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
....+.+.|.+++..+...+|.+.|..+|.+..... ..+.+||+|...+++..|+..+++..+.+++|.+..
T Consensus 96 ~~s~~r~~~~~~~~r~~~qdl~d~~~~~g~~~~~~~--------~~~~~~v~Fs~~~da~ra~~~l~~~~~~~~~l~~~~ 167 (216)
T KOG0106|consen 96 SRTHFRLIVRNLSLRVSWQDLKDHFRPAGEVTYVDA--------RRNFAFVEFSEQEDAKRALEKLDGKKLNGRRISVEK 167 (216)
T ss_pred ccccceeeeccchhhhhHHHHhhhhcccCCCchhhh--------hccccceeehhhhhhhhcchhccchhhcCceeeecc
Confidence 445678889999999999999999999998844333 245899999999999999999999999999999866
Q ss_pred ccC
Q 034021 103 AHK 105 (106)
Q Consensus 103 s~~ 105 (106)
+-+
T Consensus 168 ~~~ 170 (216)
T KOG0106|consen 168 NSR 170 (216)
T ss_pred cCc
Confidence 543
No 105
>KOG1855 consensus Predicted RNA-binding protein [General function prediction only]
Probab=98.09 E-value=4.7e-06 Score=56.69 Aligned_cols=74 Identities=15% Similarity=0.201 Sum_probs=59.9
Q ss_pred cccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeec---CCCCC----------ceeEEEEEecCHHHHHHHH
Q 034021 19 LLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICD---RVTGQ----------SRGYGFVKFASEAAAGEAI 85 (106)
Q Consensus 19 ~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~---~~~~~----------~~g~~fv~f~~~~~a~~al 85 (106)
...++...++|.+.|||.+-..+.|.++|+.+|.|..|+|+.. +.+.. .+-+|+|+|...+.|.+|.
T Consensus 224 ~~~eel~srtivaenLP~Dh~~enl~kiFg~~G~IksIRIckPgaip~d~r~~~~~~~~~~tk~~AlvEye~~~~A~KA~ 303 (484)
T KOG1855|consen 224 FDEEELPSRTIVAENLPLDHSYENLSKIFGTVGSIKSIRICKPGAIPEDVRGFPKKYFELQTKECALVEYEEVEAARKAR 303 (484)
T ss_pred ccccccccceEEEecCCcchHHHHHHHHhhcccceeeeeecCCCCCCcccccCCccchhhhhhhhhhhhhhhhHHHHHHH
Confidence 3445557889999999999999999999999999999999876 33221 2457999999999999999
Q ss_pred HHhCCce
Q 034021 86 KEMDGLL 92 (106)
Q Consensus 86 ~~l~~~~ 92 (106)
..++...
T Consensus 304 e~~~~e~ 310 (484)
T KOG1855|consen 304 ELLNPEQ 310 (484)
T ss_pred Hhhchhh
Confidence 9775443
No 106
>KOG1456 consensus Heterogeneous nuclear ribonucleoprotein L (contains RRM repeats) [RNA processing and modification]
Probab=98.07 E-value=5.1e-05 Score=51.20 Aligned_cols=78 Identities=19% Similarity=0.291 Sum_probs=68.3
Q ss_pred CCCceEEEcCCCCC-CCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 24 NSRTKLFIGGLSYD-TNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 24 ~~~~~l~v~~lp~~-~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
...+.+.|-+|... ++.+.+-.+|-.||.|..+++++.+ .|.|.|++-+....++|+..||+..+-|.+|.|..
T Consensus 285 ~~g~VmMVyGLdh~k~N~drlFNl~ClYGNV~rvkFmkTk-----~gtamVemgd~~aver~v~hLnn~~lfG~kl~v~~ 359 (494)
T KOG1456|consen 285 APGCVMMVYGLDHGKMNCDRLFNLFCLYGNVERVKFMKTK-----PGTAMVEMGDAYAVERAVTHLNNIPLFGGKLNVCV 359 (494)
T ss_pred CCCcEEEEEeccccccchhhhhhhhhhcCceeeEEEeecc-----cceeEEEcCcHHHHHHHHHHhccCccccceEEEee
Confidence 34678889999877 6677788889999999999998776 45899999999999999999999999999999999
Q ss_pred ccCC
Q 034021 103 AHKD 106 (106)
Q Consensus 103 s~~~ 106 (106)
|+.+
T Consensus 360 SkQ~ 363 (494)
T KOG1456|consen 360 SKQN 363 (494)
T ss_pred cccc
Confidence 8753
No 107
>PF08952 DUF1866: Domain of unknown function (DUF1866) ; InterPro: IPR015047 This domain, found in synaptojanin, has no known function. ; PDB: 1UFW_A 2DNR_A.
Probab=98.07 E-value=4.4e-05 Score=45.26 Aligned_cols=57 Identities=28% Similarity=0.524 Sum_probs=45.6
Q ss_pred HHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccCC
Q 034021 41 TVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHKD 106 (106)
Q Consensus 41 ~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~~ 106 (106)
.++.+.|..||.+.-++++.+ .-+|.|.+.++|.+|++ ++|..++|+.|+|++-.+|
T Consensus 51 ~~ll~~~~~~GevvLvRfv~~--------~mwVTF~dg~sALaals-~dg~~v~g~~l~i~LKtpd 107 (146)
T PF08952_consen 51 DELLQKFAQYGEVVLVRFVGD--------TMWVTFRDGQSALAALS-LDGIQVNGRTLKIRLKTPD 107 (146)
T ss_dssp HHHHHHHHCCS-ECEEEEETT--------CEEEEESSCHHHHHHHH-GCCSEETTEEEEEEE----
T ss_pred HHHHHHHHhCCceEEEEEeCC--------eEEEEECccHHHHHHHc-cCCcEECCEEEEEEeCCcc
Confidence 467788889998887777644 47999999999999999 8999999999999987765
No 108
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=98.06 E-value=5.5e-05 Score=42.30 Aligned_cols=77 Identities=17% Similarity=0.247 Sum_probs=50.4
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEE-EeecC------CCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeE
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVK-VICDR------VTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRN 97 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~-~~~~~------~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~ 97 (106)
..+-|.|-|.|.. ....+.+.|++||.|.+.. ..+.. .......+-.|.|+++.+|++||+ .||..+.|.-
T Consensus 5 ~~~wVtVFGfp~~-~~~~Vl~~F~~~G~Ile~~~~~~~~~~~~~~~~~~~~NWi~I~Y~~~~~A~rAL~-~NG~i~~g~~ 82 (100)
T PF05172_consen 5 SETWVTVFGFPPS-ASNQVLRHFSSFGTILEHFEVLRSSSGINPYPIPSGGNWIHITYDNPLSAQRALQ-KNGTIFSGSL 82 (100)
T ss_dssp GCCEEEEE---GG-GHHHHHHHHHCCS-EECEEGGG----------E-CCTTEEEEEESSHHHHHHHHT-TTTEEETTCE
T ss_pred CCeEEEEEccCHH-HHHHHHHHHHhcceEEEeecccccccccccccCCCCCCEEEEECCCHHHHHHHHH-hCCeEEcCcE
Confidence 3556888899998 4566889999999987664 11100 011234589999999999999999 5999998864
Q ss_pred E-EEEec
Q 034021 98 I-RVNYA 103 (106)
Q Consensus 98 l-~v~~s 103 (106)
| -|.+.
T Consensus 83 mvGV~~~ 89 (100)
T PF05172_consen 83 MVGVKPC 89 (100)
T ss_dssp EEEEEE-
T ss_pred EEEEEEc
Confidence 4 45554
No 109
>KOG1190 consensus Polypyrimidine tract-binding protein [RNA processing and modification]
Probab=98.01 E-value=2.8e-05 Score=52.83 Aligned_cols=76 Identities=18% Similarity=0.335 Sum_probs=62.3
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCC-eeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCe-EEEEEe
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGE-IIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGR-NIRVNY 102 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~-i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~-~l~v~~ 102 (106)
...++.++|+|.++++++++..|..-|. +..-++. ++.+.++++++.+.+.|..|+..++.+.+++. .++|++
T Consensus 413 psatlHlsnip~svsee~lk~~f~~~g~~vkafkff-----~kd~kmal~q~~sveeA~~ali~~hnh~lgen~hlRvSF 487 (492)
T KOG1190|consen 413 PSATLHLSNIPPSVSEEDLKNLFQEPGGQVKAFKFF-----QKDRKMALPQLESVEEAIQALIDLHNHYLGENHHLRVSF 487 (492)
T ss_pred chhheeeccCCcccchhHHHHhhhcCCceEEeeeec-----CCCcceeecccCChhHhhhhccccccccCCCCceEEEEe
Confidence 3468999999999999999999987765 4544443 22345999999999999999999999988765 899999
Q ss_pred ccC
Q 034021 103 AHK 105 (106)
Q Consensus 103 s~~ 105 (106)
|+.
T Consensus 488 Sks 490 (492)
T KOG1190|consen 488 SKS 490 (492)
T ss_pred ecc
Confidence 874
No 110
>KOG0120 consensus Splicing factor U2AF, large subunit (RRM superfamily) [RNA processing and modification]
Probab=98.01 E-value=3.5e-05 Score=53.92 Aligned_cols=65 Identities=18% Similarity=0.307 Sum_probs=53.7
Q ss_pred HHHHHHhccCCCeeEEEEeecCCC---CCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 41 TVLKDAFGQHGEIIEVKVICDRVT---GQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 41 ~~l~~~f~~~g~i~~~~~~~~~~~---~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
++++.-+++||.|..|.+.++... .-+.|-.||+|.+.+++++|+.+|+|..+.|+.+...|-..
T Consensus 424 Edvr~ec~k~g~v~~v~ipr~~~~~~~~~G~GkVFVefas~ed~qrA~~~L~GrKF~nRtVvtsYyde 491 (500)
T KOG0120|consen 424 EDVRTECAKFGAVRSVEIPRPYPDENPVPGTGKVFVEFADTEDSQRAMEELTGRKFANRTVVASYYDE 491 (500)
T ss_pred HHHHHHhcccCceeEEecCCCCCCCCcCCCcccEEEEecChHHHHHHHHHccCceeCCcEEEEEecCH
Confidence 456666779999999999887222 23467799999999999999999999999999999888654
No 111
>KOG1365 consensus RNA-binding protein Fusilli, contains RRM domain [RNA processing and modification; General function prediction only]
Probab=98.00 E-value=3.5e-06 Score=56.75 Aligned_cols=73 Identities=25% Similarity=0.371 Sum_probs=55.4
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccC----CCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEE
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQH----GEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRV 100 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~----g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v 100 (106)
.-.|..++||+++++.++.+||.+- |....+-++..+ +|+..|-||+.|..+++|+.|+.+ |.-.++-+.|.+
T Consensus 161 qvivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~rp-dgrpTGdAFvlfa~ee~aq~aL~k-hrq~iGqRYIEl 237 (508)
T KOG1365|consen 161 QVIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTRP-DGRPTGDAFVLFACEEDAQFALRK-HRQNIGQRYIEL 237 (508)
T ss_pred ceEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEECC-CCCcccceEEEecCHHHHHHHHHH-HHHHHhHHHHHH
Confidence 4567778999999999999999632 345566555555 788899999999999999999985 544455444443
No 112
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=97.99 E-value=3.4e-05 Score=38.21 Aligned_cols=52 Identities=27% Similarity=0.454 Sum_probs=41.4
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHH
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAI 85 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al 85 (106)
+.|-|.|.|.+..+. +.+.|..||.|....+.. ...+.++.|.+..+|++|+
T Consensus 2 ~wI~V~Gf~~~~~~~-vl~~F~~fGeI~~~~~~~------~~~~~~l~y~~~~~ae~Al 53 (53)
T PF14605_consen 2 TWISVSGFPPDLAEE-VLEHFASFGEIVDIYVPE------STNWMYLKYKSRKDAEKAL 53 (53)
T ss_pred cEEEEEeECchHHHH-HHHHHHhcCCEEEEEcCC------CCcEEEEEECCHHHHHhhC
Confidence 467889999887755 556888899999988852 2348999999999999885
No 113
>KOG0129 consensus Predicted RNA-binding protein (RRM superfamily) [Translation, ribosomal structure and biogenesis]
Probab=97.98 E-value=6.8e-05 Score=52.18 Aligned_cols=65 Identities=28% Similarity=0.536 Sum_probs=49.5
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCC-CC--Ccee---EEEEEecCHHHHHHHHHHh
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRV-TG--QSRG---YGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~-~~--~~~g---~~fv~f~~~~~a~~al~~l 88 (106)
....++|||++||++++++.|...|..||.+. +.+..... .+ ..+| |+|+.|+++.+++.-+.++
T Consensus 256 ~~~S~KVFvGGlp~dise~~i~~~F~~FGs~~-VdWP~k~~~~~~~ppkGs~~YvflvFe~E~sV~~Ll~aC 326 (520)
T KOG0129|consen 256 PRYSRKVFVGGLPWDITEAQINASFGQFGSVK-VDWPGKANSRGRAPPKGSYGYVFLVFEDERSVQSLLSAC 326 (520)
T ss_pred cccccceeecCCCccccHHHHHhhcccccceE-eecCCCccccccCCCCCcccEEEEEecchHHHHHHHHHH
Confidence 34568999999999999999999999999754 33432211 11 1355 9999999999999888754
No 114
>KOG0129 consensus Predicted RNA-binding protein (RRM superfamily) [Translation, ribosomal structure and biogenesis]
Probab=97.96 E-value=4e-05 Score=53.31 Aligned_cols=66 Identities=24% Similarity=0.331 Sum_probs=59.3
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhc-cCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHH
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFG-QHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKE 87 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~-~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~ 87 (106)
.-...+||||++||.-++.++|..+|. -||.|..+.|-.|++-+-.+|-|-|.|.+..+-.+|+.+
T Consensus 366 ~lDprrTVFVGgvprpl~A~eLA~imd~lyGgV~yaGIDtD~k~KYPkGaGRVtFsnqqsYi~AIsa 432 (520)
T KOG0129|consen 366 PIDPRRTVFVGGLPRPLTAEELAMIMEDLFGGVLYVGIDTDPKLKYPKGAGRVTFSNQQAYIKAISA 432 (520)
T ss_pred ccCccceEEecCCCCcchHHHHHHHHHHhcCceEEEEeccCcccCCCCCcceeeecccHHHHHHHhh
Confidence 334578999999999999999999998 699999999998876777889999999999999999983
No 115
>KOG1548 consensus Transcription elongation factor TAT-SF1 [Transcription]
Probab=97.94 E-value=7.5e-05 Score=49.77 Aligned_cols=78 Identities=22% Similarity=0.398 Sum_probs=61.1
Q ss_pred CCCCCceEEEcCC--C--CCCC-------HHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCC
Q 034021 22 DANSRTKLFIGGL--S--YDTN-------ETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDG 90 (106)
Q Consensus 22 ~~~~~~~l~v~~l--p--~~~~-------~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~ 90 (106)
.....++|.+.|+ | ...+ .++|.+-+++||.+..+.+.-.. +.|.+-|.|.+.+.|..|++.|+|
T Consensus 261 k~r~~~tVi~kn~Ftp~~~~~~~~l~~dlkedl~eec~K~G~v~~vvv~d~h----PdGvvtV~f~n~eeA~~ciq~m~G 336 (382)
T KOG1548|consen 261 KARADRTVILKNMFTPEDFEKNPDLLNDLKEDLTEECEKFGQVRKVVVYDRH----PDGVVTVSFRNNEEADQCIQTMDG 336 (382)
T ss_pred cccCCcEEEeeecCCHHHhccCHHHHHHHHHHHHHHHHHhCCcceEEEeccC----CCceeEEEeCChHHHHHHHHHhcC
Confidence 3345678888887 2 1222 35677778899999999776333 356899999999999999999999
Q ss_pred ceeCCeEEEEEec
Q 034021 91 LLLDGRNIRVNYA 103 (106)
Q Consensus 91 ~~~~g~~l~v~~s 103 (106)
..++|++|..++.
T Consensus 337 R~fdgRql~A~i~ 349 (382)
T KOG1548|consen 337 RWFDGRQLTASIW 349 (382)
T ss_pred eeecceEEEEEEe
Confidence 9999999988764
No 116
>PF10309 DUF2414: Protein of unknown function (DUF2414); InterPro: IPR019416 This entry contains proteins that have no known function.
Probab=97.80 E-value=0.00031 Score=35.83 Aligned_cols=56 Identities=20% Similarity=0.249 Sum_probs=45.0
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccC---CCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQH---GEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~---g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l 88 (106)
....|+|+|+. +++.++++.+|..| .....+.++-+. .+-|.|.+...|.+||..|
T Consensus 4 rpeavhirGvd-~lsT~dI~~y~~~y~~~~~~~~IEWIdDt-------ScNvvf~d~~~A~~AL~~L 62 (62)
T PF10309_consen 4 RPEAVHIRGVD-ELSTDDIKAYFSEYFDEEGPFRIEWIDDT-------SCNVVFKDEETAARALVAL 62 (62)
T ss_pred eeceEEEEcCC-CCCHHHHHHHHHHhcccCCCceEEEecCC-------cEEEEECCHHHHHHHHHcC
Confidence 35679999984 47778899999988 246788888775 5889999999999999854
No 117
>PF08675 RNA_bind: RNA binding domain; InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=97.67 E-value=0.00061 Score=36.72 Aligned_cols=57 Identities=14% Similarity=0.211 Sum_probs=42.2
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCC
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDG 90 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~ 90 (106)
.....+|+ +|..+...+|.++|++||.|.-- ++.+. .|||...+.+.+..++..+..
T Consensus 8 RdHVFhlt-FPkeWK~~DI~qlFspfG~I~Vs-Wi~dT-------SAfV~l~~r~~~~~v~~~~~~ 64 (87)
T PF08675_consen 8 RDHVFHLT-FPKEWKTSDIYQLFSPFGQIYVS-WINDT-------SAFVALHNRDQAKVVMNTLKK 64 (87)
T ss_dssp GCCEEEEE---TT--HHHHHHHCCCCCCEEEE-EECTT-------EEEEEECCCHHHHHHHHHHTT
T ss_pred cceEEEEe-CchHhhhhhHHHHhccCCcEEEE-EEcCC-------cEEEEeecHHHHHHHHHHhcc
Confidence 45566675 99999999999999999987644 44443 699999999999999987754
No 118
>KOG0112 consensus Large RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=97.64 E-value=9.5e-05 Score=54.51 Aligned_cols=77 Identities=29% Similarity=0.449 Sum_probs=66.3
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCC--eEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDG--RNIR 99 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g--~~l~ 99 (106)
.....+.+++++|+.+.....+.+.|..||.|..|.+.... .|++|+|.+...+++|+..+.|..+++ ++++
T Consensus 451 kst~ttr~~sgglg~w~p~~~l~r~fd~fGpir~Idy~hgq------~yayi~yes~~~aq~a~~~~rgap~G~P~~r~r 524 (975)
T KOG0112|consen 451 KSTPTTRLQSGGLGPWSPVSRLNREFDRFGPIRIIDYRHGQ------PYAYIQYESPPAAQAATHDMRGAPLGGPPRRLR 524 (975)
T ss_pred ccccceeeccCCCCCCChHHHHHHHhhccCcceeeecccCC------cceeeecccCccchhhHHHHhcCcCCCCCcccc
Confidence 44456789999999999999999999999999998775432 499999999999999999999999876 5688
Q ss_pred EEecc
Q 034021 100 VNYAH 104 (106)
Q Consensus 100 v~~s~ 104 (106)
|.++.
T Consensus 525 vdla~ 529 (975)
T KOG0112|consen 525 VDLAS 529 (975)
T ss_pred ccccc
Confidence 87765
No 119
>KOG0105 consensus Alternative splicing factor ASF/SF2 (RRM superfamily) [RNA processing and modification]
Probab=97.59 E-value=0.00065 Score=41.98 Aligned_cols=65 Identities=20% Similarity=0.305 Sum_probs=56.9
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCcee
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLL 93 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~ 93 (106)
.......|.|++||.+.+.++|++....-|.+....+.++- ++.|+|-..++++-|+..|+...+
T Consensus 111 srrSe~RVvVsGLp~SgSWQDLKDHmReaGdvCfadv~rDg-------~GvV~~~r~eDMkYAvr~ld~~~~ 175 (241)
T KOG0105|consen 111 SRRSEYRVVVSGLPPSGSWQDLKDHMREAGDVCFADVQRDG-------VGVVEYLRKEDMKYAVRKLDDQKF 175 (241)
T ss_pred ccccceeEEEecCCCCCchHHHHHHHHhhCCeeeeeeeccc-------ceeeeeeehhhHHHHHHhhccccc
Confidence 33456889999999999999999999999999999887763 799999999999999998876654
No 120
>KOG0128 consensus RNA-binding protein SART3 (RRM superfamily) [RNA processing and modification]
Probab=97.50 E-value=4.4e-05 Score=55.85 Aligned_cols=79 Identities=19% Similarity=0.323 Sum_probs=68.4
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
...++|+|.|+..|.+.++.+++++|.+..+.++..+ .|+.+|.+++.|.+..++.+++...+...+..+.+.|..+++
T Consensus 736 K~~v~i~g~pf~gt~e~~k~l~~~~gn~~~~~~vt~r-~gkpkg~a~v~y~~ea~~s~~~~s~d~~~~rE~~~~v~vsnp 814 (881)
T KOG0128|consen 736 KISVAISGPPFQGTKEELKSLASKTGNVTSLRLVTVR-AGKPKGKARVDYNTEADASRKVASVDVAGKRENNGEVQVSNP 814 (881)
T ss_pred hhhhheeCCCCCCchHHHHhhccccCCccccchhhhh-ccccccceeccCCCcchhhhhcccchhhhhhhcCccccccCC
Confidence 5678999999999999999999999999998877666 789999999999999999999988777777777777766554
No 121
>PF15023 DUF4523: Protein of unknown function (DUF4523)
Probab=97.26 E-value=0.0044 Score=36.81 Aligned_cols=81 Identities=12% Similarity=0.172 Sum_probs=57.3
Q ss_pred cccccccCCCCCceEEEcCCCCCCCH-HH---HHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCC
Q 034021 15 RFTRLLSDANSRTKLFIGGLSYDTNE-TV---LKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDG 90 (106)
Q Consensus 15 ~~~~~~~~~~~~~~l~v~~lp~~~~~-~~---l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~ 90 (106)
..+.....+..-.+|.|+=|.+++.. ++ +...++.||+|..+... ++..|.|.|+|..+|=+|+.+++.
T Consensus 75 ~lwkk~~kepPMsTIVVRWlkknm~~~edl~sV~~~Ls~fGpI~SVT~c-------GrqsavVvF~d~~SAC~Av~Af~s 147 (166)
T PF15023_consen 75 HLWKKNTKEPPMSTIVVRWLKKNMQPTEDLKSVIQRLSVFGPIQSVTLC-------GRQSAVVVFKDITSACKAVSAFQS 147 (166)
T ss_pred hhhcccCCCCCceeEEeehhhhcCChHHHHHHHHHHHHhcCCcceeeec-------CCceEEEEehhhHHHHHHHHhhcC
Confidence 33444445566678888887777643 34 44556789999988553 345799999999999999998765
Q ss_pred ceeCCeEEEEEec
Q 034021 91 LLLDGRNIRVNYA 103 (106)
Q Consensus 91 ~~~~g~~l~v~~s 103 (106)
...|..++..|-
T Consensus 148 -~~pgtm~qCsWq 159 (166)
T PF15023_consen 148 -RAPGTMFQCSWQ 159 (166)
T ss_pred -CCCCceEEeecc
Confidence 556666666653
No 122
>KOG2202 consensus U2 snRNP splicing factor, small subunit, and related proteins [RNA processing and modification]
Probab=97.22 E-value=0.00017 Score=46.33 Aligned_cols=62 Identities=26% Similarity=0.361 Sum_probs=48.3
Q ss_pred HHHHHHhc-cCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 41 TVLKDAFG-QHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 41 ~~l~~~f~-~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
+++...++ +||.|..+.+.... .-...|-++|.|...++|++|+..||+-.+.|++|...++
T Consensus 83 Ed~f~E~~~kygEiee~~Vc~Nl-~~hl~GNVYV~f~~Ee~ae~a~~~lnnRw~~G~pi~ae~~ 145 (260)
T KOG2202|consen 83 EDVFTELEDKYGEIEELNVCDNL-GDHLVGNVYVKFRSEEDAEAALEDLNNRWYNGRPIHAELS 145 (260)
T ss_pred HHHHHHHHHHhhhhhhhhhhccc-chhhhhhhhhhcccHHHHHHHHHHHcCccccCCcceeeec
Confidence 34444454 88999888554332 3345678999999999999999999999999999988765
No 123
>KOG1996 consensus mRNA splicing factor [RNA processing and modification]
Probab=97.21 E-value=0.0025 Score=42.06 Aligned_cols=64 Identities=19% Similarity=0.248 Sum_probs=50.9
Q ss_pred HHHHHHHhccCCCeeEEEEeecCCCCCce-eEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEec
Q 034021 40 ETVLKDAFGQHGEIIEVKVICDRVTGQSR-GYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYA 103 (106)
Q Consensus 40 ~~~l~~~f~~~g~i~~~~~~~~~~~~~~~-g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s 103 (106)
++++.+-+++||.|..|-|+..+...... ---||+|...+++.+|+-.|||..++|+.+...+.
T Consensus 300 ede~keEceKyg~V~~viifeip~~p~deavRiFveF~r~e~aiKA~VdlnGRyFGGr~v~A~Fy 364 (378)
T KOG1996|consen 300 EDETKEECEKYGKVGNVIIFEIPSQPEDEAVRIFVEFERVESAIKAVVDLNGRYFGGRVVSACFY 364 (378)
T ss_pred HHHHHHHHHhhcceeeEEEEecCCCccchhheeeeeeccHHHHHHHHHhcCCceecceeeeheec
Confidence 35677788899999998887775333222 23799999999999999999999999998876653
No 124
>KOG2416 consensus Acinus (induces apoptotic chromatin condensation) [Chromatin structure and dynamics]
Probab=97.20 E-value=0.00045 Score=49.19 Aligned_cols=78 Identities=14% Similarity=0.227 Sum_probs=62.2
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccC-CCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCcee---CCeE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQH-GEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLL---DGRN 97 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~-g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~---~g~~ 97 (106)
.......|||.||-.-.|.-+|+.++..- |.|...++-+ -+.+|||.|.+.++|.+...+|||+.+ +.+.
T Consensus 440 R~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~WmDk------IKShCyV~yss~eEA~atr~AlhnV~WP~sNPK~ 513 (718)
T KOG2416|consen 440 RKEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEFWMDK------IKSHCYVSYSSVEEAAATREALHNVQWPPSNPKH 513 (718)
T ss_pred CCCccceEeeecccccchHHHHHHHHhhccCchHHHHHHH------hhcceeEecccHHHHHHHHHHHhccccCCCCCce
Confidence 45567899999999999999999999854 5566663322 245899999999999999999999986 4567
Q ss_pred EEEEeccC
Q 034021 98 IRVNYAHK 105 (106)
Q Consensus 98 l~v~~s~~ 105 (106)
|.+.|...
T Consensus 514 L~adf~~~ 521 (718)
T KOG2416|consen 514 LIADFVRA 521 (718)
T ss_pred eEeeecch
Confidence 88877654
No 125
>KOG4307 consensus RNA binding protein RBM12/SWAN [General function prediction only]
Probab=97.20 E-value=0.00035 Score=50.66 Aligned_cols=78 Identities=19% Similarity=0.120 Sum_probs=64.6
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeE-EEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIE-VKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~-~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
.....|||..||..++..++.++|+..-.|+. |.+-+.+ ++..++.|||.|...+.+..|+..-+.+.++.+.|+|.-
T Consensus 432 ~ag~~lyv~~lP~~t~~~~~v~~f~~~~~Ved~I~lt~~P-~~~~~~~afv~F~~~~a~~~a~~~~~k~y~G~r~irv~s 510 (944)
T KOG4307|consen 432 GAGGALYVFQLPVMTPIVPPVNKFMGAAAVEDFIELTRLP-TDLLRPAAFVAFIHPTAPLTASSVKTKFYPGHRIIRVDS 510 (944)
T ss_pred CccceEEeccCCccccccchhhhhhhhhhhhheeEeccCC-cccccchhhheeccccccchhhhcccccccCceEEEeec
Confidence 34678999999999999999999987666655 7777766 677788999999998888888876677778888899874
No 126
>KOG4676 consensus Splicing factor, arginine/serine-rich [RNA processing and modification]
Probab=97.19 E-value=0.0012 Score=44.96 Aligned_cols=75 Identities=16% Similarity=0.216 Sum_probs=59.2
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCC---CCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVT---GQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~---~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
..|.|.||..+++.++++.+|...|.|..+.++....+ ......+||-|.|...+..|.. |.+..+-++.|.|-.
T Consensus 8 ~vIqvanispsat~dqm~tlFg~lGkI~elrlyp~~~d~~~pv~sRtcyVkf~d~~sv~vaQh-Ltntvfvdraliv~p 85 (479)
T KOG4676|consen 8 GVIQVANISPSATKDQMQTLFGNLGKIPELRLYPNVDDSKIPVISRTCYVKFLDSQSVTVAQH-LTNTVFVDRALIVRP 85 (479)
T ss_pred ceeeecccCchhhHHHHHHHHhhccccccccccCCCCCccCcceeeeEEEeccCCcceeHHhh-hccceeeeeeEEEEe
Confidence 47899999999999999999999999999988663322 2345689999999999988877 666666666655543
No 127
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=97.19 E-value=0.0012 Score=40.66 Aligned_cols=71 Identities=13% Similarity=0.133 Sum_probs=45.0
Q ss_pred CCCCceEEEcCCCCCCCHHHHHHHhcc-CCCe---eEEEEeecCC--CCCceeEEEEEecCHHHHHHHHHHhCCcee
Q 034021 23 ANSRTKLFIGGLSYDTNETVLKDAFGQ-HGEI---IEVKVICDRV--TGQSRGYGFVKFASEAAAGEAIKEMDGLLL 93 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~~~l~~~f~~-~g~i---~~~~~~~~~~--~~~~~g~~fv~f~~~~~a~~al~~l~~~~~ 93 (106)
......|.|++||.+++++++.+.+++ ++.- ....-..... .....+-|||.|.+.+++......++|+.+
T Consensus 4 ~~~~~KvVIR~LPP~LteeeF~~~i~~~l~~~~~w~y~~g~~~~~~~~~~~~SRaYi~F~~~~~~~~F~~~~~g~~F 80 (176)
T PF03467_consen 4 EKEGTKVVIRRLPPNLTEEEFWEQISPWLPDEWDWYYFQGKYGKKSFKPPTYSRAYINFKNPEDLLEFRDRFDGHVF 80 (176)
T ss_dssp -----EEEEEEE-TTS-HHHHCCCCSS--SSE---EEEEEEES-SSSTTS--EEEEEEESSCHHHHHHHHHCTTEEE
T ss_pred cccCceEEEeCCCCCCCHHHHHHHhhhhcccccceEEEecCCCCccCCCCcceEEEEEeCCHHHHHHHHHhcCCcEE
Confidence 345668999999999999998887776 6654 2222112211 112345699999999999999999999875
No 128
>PF07576 BRAP2: BRCA1-associated protein 2; InterPro: IPR011422 These proteins include BRCA1-associated protein 2 (BRAP2), which binds nuclear localisation signals (NLSs) in vitro and in yeast two-hybrid screening []. These proteins share a region of sequence similarity at their N terminus. They also have IPR001607 from INTERPRO at the C terminus.
Probab=96.89 E-value=0.029 Score=31.99 Aligned_cols=67 Identities=21% Similarity=0.240 Sum_probs=48.6
Q ss_pred ceEEE-cCCCCCCCHHHHHHHhccCC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCC
Q 034021 27 TKLFI-GGLSYDTNETVLKDAFGQHG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDG 95 (106)
Q Consensus 27 ~~l~v-~~lp~~~~~~~l~~~f~~~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g 95 (106)
..+.+ ...|..++.+.|..+.+.+- .|..++++++.. ..+=.+++.|++...|......+||..+..
T Consensus 13 ~~~~~l~vp~~~~~~d~l~~f~~~~~~~i~~~riird~~--pnrymVLikF~~~~~Ad~Fy~~fNGk~Fns 81 (110)
T PF07576_consen 13 STLCCLAVPPYMTPSDFLLFFGAPFREDIEHIRIIRDGT--PNRYMVLIKFRDQESADEFYEEFNGKPFNS 81 (110)
T ss_pred ceEEEEEeCcccccHHHHHHhhhcccccEEEEEEeeCCC--CceEEEEEEECCHHHHHHHHHHhCCCccCC
Confidence 33444 44455566666766666654 588899988753 245579999999999999999999998654
No 129
>KOG2193 consensus IGF-II mRNA-binding protein IMP, contains RRM and KH domains [RNA processing and modification; General function prediction only]
Probab=96.88 E-value=0.0017 Score=44.79 Aligned_cols=69 Identities=29% Similarity=0.473 Sum_probs=54.1
Q ss_pred eEEEcCCCCCCCHHHHHHHhccCC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce-eCCeEEEEEec
Q 034021 28 KLFIGGLSYDTNETVLKDAFGQHG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL-LDGRNIRVNYA 103 (106)
Q Consensus 28 ~l~v~~lp~~~~~~~l~~~f~~~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~-~~g~~l~v~~s 103 (106)
.+|++||...++..++..+|...- +...-.++ ..||+|+.+.+...+.++++.++|.. +.|.++.+..+
T Consensus 3 klyignL~p~~~psdl~svfg~ak~~~~g~fl~-------k~gyafvd~pdq~wa~kaie~~sgk~elqGkr~e~~~s 73 (584)
T KOG2193|consen 3 KLYIGNLSPQVTPSDLESVFGDAKIPGSGQFLV-------KSGYAFVDCPDQQWANKAIETLSGKVELQGKRQEVEHS 73 (584)
T ss_pred cccccccCCCCChHHHHHHhccccCCCCcceee-------ecceeeccCCchhhhhhhHHhhchhhhhcCceeeccch
Confidence 589999999999999999997531 11111111 24699999999999999999998884 88999888765
No 130
>KOG0128 consensus RNA-binding protein SART3 (RRM superfamily) [RNA processing and modification]
Probab=96.88 E-value=3.9e-05 Score=56.10 Aligned_cols=68 Identities=26% Similarity=0.435 Sum_probs=57.6
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeC
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLD 94 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~ 94 (106)
.++||+||+..+.+.++...|..+|.+..+.+......+..+|++|++|...+.+.+++....+..++
T Consensus 668 ~~~fvsnl~~~~~~~dl~~~~~~~~~~e~vqi~~h~n~~~~rG~~Y~~F~~~~~~~aaV~f~d~~~~g 735 (881)
T KOG0128|consen 668 IKIFVSNLSPKMSEEDLSERFSPSGTIEVVQIVIHKNEKRFRGKAYVEFLKPEHAGAAVAFRDSCFFG 735 (881)
T ss_pred HHHHHhhcchhhcCchhhhhcCccchhhhHHHHHHhhccccccceeeEeecCCchhhhhhhhhhhhhh
Confidence 57889999999999999999999998888777655557788999999999999999999865555444
No 131
>KOG0115 consensus RNA-binding protein p54nrb (RRM superfamily) [RNA processing and modification]
Probab=96.85 E-value=0.002 Score=41.62 Aligned_cols=61 Identities=25% Similarity=0.408 Sum_probs=53.0
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l 88 (106)
..|||.||+.-+..+.+.+.|+.||++..--+..+. .+++.+-++|.|.....+.+|+..+
T Consensus 32 a~l~V~nl~~~~sndll~~~f~~fg~~e~av~~vD~-r~k~t~eg~v~~~~k~~a~~a~rr~ 92 (275)
T KOG0115|consen 32 AELYVVNLMQGASNDLLEQAFRRFGPIERAVAKVDD-RGKPTREGIVEFAKKPNARKAARRC 92 (275)
T ss_pred ceEEEEecchhhhhHHHHHhhhhcCccchheeeecc-cccccccchhhhhcchhHHHHHHHh
Confidence 579999999999999999999999998766555554 5777889999999999999999876
No 132
>KOG0112 consensus Large RNA-binding protein (RRM superfamily) [General function prediction only]
Probab=96.79 E-value=0.00024 Score=52.53 Aligned_cols=81 Identities=23% Similarity=0.395 Sum_probs=67.3
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.....++|+++|++..+++.+|+..|..+|.+..+.+-... .+....++|+.|.+...+..++..+.+..|....+++.
T Consensus 368 D~~atrTLf~Gnl~~kl~eseiR~af~e~gkve~VDiKtP~-~~~esa~~f~~~~n~dmtp~ak~e~s~~~I~~g~~r~g 446 (975)
T KOG0112|consen 368 DFRATRTLFLGNLDSKLTESEIRPAFDESGKVEEVDIKTPH-IKTESAYAFVSLLNTDMTPSAKFEESGPLIGNGTHRIG 446 (975)
T ss_pred chhhhhhhhhcCcccchhhhhhhhhhhhhccccccccccCC-CCcccchhhhhhhccccCcccchhhcCCccccCccccc
Confidence 33457899999999999999999999999999999886664 33445699999999999999999899988877666665
Q ss_pred ec
Q 034021 102 YA 103 (106)
Q Consensus 102 ~s 103 (106)
+.
T Consensus 447 lG 448 (975)
T KOG0112|consen 447 LG 448 (975)
T ss_pred cc
Confidence 54
No 133
>KOG4660 consensus Protein Mei2, essential for commitment to meiosis, and related proteins [Cell cycle control, cell division, chromosome partitioning]
Probab=96.76 E-value=0.0051 Score=43.56 Aligned_cols=55 Identities=20% Similarity=0.243 Sum_probs=43.5
Q ss_pred CCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCcee---C-CeEEEEEecc
Q 034021 50 HGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLL---D-GRNIRVNYAH 104 (106)
Q Consensus 50 ~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~---~-g~~l~v~~s~ 104 (106)
.|.=.+++++.|..+.+..|||||.|.+.+.+..+.++.+|+.+ . .+...+.||+
T Consensus 413 ~gtYDFlYLPiDF~nkcNvGYAFINm~sp~ai~~F~kAFnGk~W~~FnS~Kia~itYAr 471 (549)
T KOG4660|consen 413 KGTYDFLYLPIDFKNKCNVGYAFINMTSPEAIIRFYKAFNGKKWEKFNSEKIASITYAR 471 (549)
T ss_pred cCccceEEeccccccccccceeEEeecCHHHHHHHHHHHcCCchhhhcceeeeeeehhh
Confidence 45556678888888888999999999999999999999999974 2 3445565553
No 134
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=96.45 E-value=0.019 Score=35.67 Aligned_cols=60 Identities=20% Similarity=0.263 Sum_probs=44.7
Q ss_pred CHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhC--CceeCCeEEEEEecc
Q 034021 39 NETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMD--GLLLDGRNIRVNYAH 104 (106)
Q Consensus 39 ~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~--~~~~~g~~l~v~~s~ 104 (106)
....|+++|..++.+.....+... +=..|.|.+.+.|.++...|+ +..+.|..+++.++.
T Consensus 8 ~~~~l~~l~~~~~~~~~~~~L~sF------rRi~v~f~~~~~A~~~r~~l~~~~~~~~g~~l~~yf~~ 69 (184)
T PF04847_consen 8 NLAELEELFSTYDPPVQFSPLKSF------RRIRVVFESPESAQRARQLLHWDGTSFNGKRLRVYFGQ 69 (184)
T ss_dssp -HHHHHHHHHTT-SS-EEEEETTT------TEEEEE-SSTTHHHHHHHTST--TSEETTEE-EEE---
T ss_pred hHHHHHHHHHhcCCceEEEEcCCC------CEEEEEeCCHHHHHHHHHHhcccccccCCCceEEEEcc
Confidence 347899999999988777666554 248899999999999999999 899999999998864
No 135
>KOG0804 consensus Cytoplasmic Zn-finger protein BRAP2 (BRCA1 associated protein) [General function prediction only]
Probab=95.88 E-value=0.075 Score=37.17 Aligned_cols=69 Identities=20% Similarity=0.215 Sum_probs=57.9
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCC
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDG 95 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g 95 (106)
....|.|-.+|..++-.++..|+..+- .|..++++++. -..+=..+|.|++..+|...-..+||..+..
T Consensus 73 ~~~mLcilaVP~~mt~~Dll~F~~~~~~~I~~irivRd~--~pnrymvLIkFr~q~da~~Fy~efNGk~Fn~ 142 (493)
T KOG0804|consen 73 SSTMLCILAVPAYMTSHDLLRFCASFIKQISDIRIVRDG--MPNRYMVLIKFRDQADADTFYEEFNGKQFNS 142 (493)
T ss_pred CCcEEEEEeccccccHHHHHHHHHHHhhhhheeEEeecC--CCceEEEEEEeccchhHHHHHHHcCCCcCCC
Confidence 377899999999999999999998764 58999999964 2234468999999999999999999997643
No 136
>KOG2068 consensus MOT2 transcription factor [Transcription]
Probab=95.87 E-value=0.0034 Score=41.88 Aligned_cols=79 Identities=22% Similarity=0.431 Sum_probs=58.0
Q ss_pred CCceEEEcCCCCCCCHHHHH---HHhccCCCeeEEEEeecCC--CC-CceeEEEEEecCHHHHHHHHHHhCCceeCCeEE
Q 034021 25 SRTKLFIGGLSYDTNETVLK---DAFGQHGEIIEVKVICDRV--TG-QSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNI 98 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~---~~f~~~g~i~~~~~~~~~~--~~-~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l 98 (106)
....+||-+|+.....+.+. ++|..||.|..+....+.. .+ .....++|.|...+++..|+...+|...+|+.+
T Consensus 76 qknlvyvvgl~~~~ade~~l~~~eyfgqygki~ki~~~~~~S~~s~~~~~~s~yITy~~~eda~rci~~v~g~~~dg~~l 155 (327)
T KOG2068|consen 76 QKNLVYVVGLPLDLADESVLERTEYFGQYGKINKIVKNKDPSSSSSSGGTCSVYITYEEEEDADRCIDDVDGFVDDGRAL 155 (327)
T ss_pred hhhhhhhhCCCccccchhhhhCcccccccccceEEeecCCcccccCCCCCCcccccccchHhhhhHHHHhhhHHhhhhhh
Confidence 34668888998877655443 4577889998887776541 11 123359999999999999999999999888876
Q ss_pred EEEec
Q 034021 99 RVNYA 103 (106)
Q Consensus 99 ~v~~s 103 (106)
+..+.
T Consensus 156 ka~~g 160 (327)
T KOG2068|consen 156 KASLG 160 (327)
T ss_pred HHhhC
Confidence 65543
No 137
>KOG2135 consensus Proteins containing the RNA recognition motif [General function prediction only]
Probab=95.76 E-value=0.0068 Score=42.34 Aligned_cols=75 Identities=25% Similarity=0.339 Sum_probs=60.4
Q ss_pred CCCceEEEcCCCCCC-CHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 24 NSRTKLFIGGLSYDT-NETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~-~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
.+.+.+-+...|+.. +-.+|...|.+||.|..|.+-... ..|.|.|.+..+|-.|-. .++..+.++.|+|.|
T Consensus 370 ~dhs~l~lek~~~glnt~a~ln~hfA~fG~i~n~qv~~~~------~~a~vTF~t~aeag~a~~-s~~avlnnr~iKl~w 442 (526)
T KOG2135|consen 370 VDHSPLALEKSPFGLNTIADLNPHFAQFGEIENIQVDYSS------LHAVVTFKTRAEAGEAYA-SHGAVLNNRFIKLFW 442 (526)
T ss_pred cccchhhhhccCCCCchHhhhhhhhhhcCccccccccCch------hhheeeeeccccccchhc-cccceecCceeEEEE
Confidence 345666677777775 458899999999999999886552 379999999999977776 589999999999999
Q ss_pred ccC
Q 034021 103 AHK 105 (106)
Q Consensus 103 s~~ 105 (106)
-++
T Consensus 443 hnp 445 (526)
T KOG2135|consen 443 HNP 445 (526)
T ss_pred ecC
Confidence 765
No 138
>PF03880 DbpA: DbpA RNA binding domain ; InterPro: IPR005580 This RNA binding domain is found at the C terminus of a number of DEAD helicase proteins [].; PDB: 2G0C_A 3MOJ_B.
Probab=95.48 E-value=0.16 Score=26.68 Aligned_cols=67 Identities=22% Similarity=0.444 Sum_probs=37.9
Q ss_pred eEEEc-CCCCCCCHHHHHHHhccCC-----CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 28 KLFIG-GLSYDTNETVLKDAFGQHG-----EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 28 ~l~v~-~lp~~~~~~~l~~~f~~~g-----~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
++||. |--..++..+|..++..-+ .|-.+.+.. .|+||+.... .+..++..|++..+.|++++|+
T Consensus 2 rl~in~Gr~dg~~~~~iv~~i~~~~gi~~~~IG~I~I~~--------~~S~vev~~~-~a~~v~~~l~~~~~~gk~v~ve 72 (74)
T PF03880_consen 2 RLFINVGRKDGLTPRDIVGAICNEAGIPGRDIGRIDIFD--------NFSFVEVPEE-VAEKVLEALNGKKIKGKKVRVE 72 (74)
T ss_dssp EEEES-SGGGT--HHHHHHHHHTCTTB-GGGEEEEEE-S--------S-EEEEE-TT--HHHHHHHHTT--SSS----EE
T ss_pred EEEEEcccccCCCHHHHHHHHHhccCCCHHhEEEEEEee--------eEEEEEECHH-HHHHHHHHhcCCCCCCeeEEEE
Confidence 35553 3344578888888887653 355666642 2788887765 6777888899999999999998
Q ss_pred ec
Q 034021 102 YA 103 (106)
Q Consensus 102 ~s 103 (106)
.|
T Consensus 73 ~A 74 (74)
T PF03880_consen 73 RA 74 (74)
T ss_dssp E-
T ss_pred EC
Confidence 75
No 139
>KOG4574 consensus RNA-binding protein (contains RRM and Pumilio-like repeats) [General function prediction only]
Probab=95.03 E-value=0.021 Score=42.63 Aligned_cols=72 Identities=19% Similarity=0.253 Sum_probs=59.5
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCcee--CCeEEEEEecc
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLL--DGRNIRVNYAH 104 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~--~g~~l~v~~s~ 104 (106)
.+.++.|.+-..+-..|..+|+.||.+...+.+++. .++.|+|...+.|-.|+.+++|+++ -|-+.+|.+|+
T Consensus 299 p~~~~~nn~v~~tSssL~~l~s~yg~v~s~wtlr~~------N~alvs~~s~~sai~a~dAl~gkevs~~g~Ps~V~~ak 372 (1007)
T KOG4574|consen 299 PKQSLENNAVNLTSSSLATLCSDYGSVASAWTLRDL------NMALVSFSSVESAILALDALQGKEVSVTGAPSRVSFAK 372 (1007)
T ss_pred chhhhhcccccchHHHHHHHHHhhcchhhheecccc------cchhhhhHHHHHHHHhhhhhcCCcccccCCceeEEecc
Confidence 344556666677778899999999999999888775 3899999999999999999999984 56678887775
No 140
>KOG2591 consensus c-Mpl binding protein, contains La domain [Signal transduction mechanisms]
Probab=94.61 E-value=0.089 Score=37.82 Aligned_cols=69 Identities=19% Similarity=0.274 Sum_probs=51.4
Q ss_pred CceEEEcCCCCCCCHHHHHHHhcc--CCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCC--ceeCCeEEEEE
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQ--HGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDG--LLLDGRNIRVN 101 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~--~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~--~~~~g~~l~v~ 101 (106)
+|.+.++-||..+..++++.+|.. +-.+..|.+-... -=||.|++..+|+.|.+-|.. .+|.|++|..+
T Consensus 175 RcIvilREIpettp~e~Vk~lf~~encPk~iscefa~N~-------nWyITfesd~DAQqAykylreevk~fqgKpImAR 247 (684)
T KOG2591|consen 175 RCIVILREIPETTPIEVVKALFKGENCPKVISCEFAHND-------NWYITFESDTDAQQAYKYLREEVKTFQGKPIMAR 247 (684)
T ss_pred eeEEEEeecCCCChHHHHHHHhccCCCCCceeeeeeecC-------ceEEEeecchhHHHHHHHHHHHHHhhcCcchhhh
Confidence 566777999999999999999975 4567777665443 368999999999999875533 34667665443
No 141
>KOG4285 consensus Mitotic phosphoprotein [Cell cycle control, cell division, chromosome partitioning]
Probab=94.23 E-value=0.69 Score=31.04 Aligned_cols=64 Identities=17% Similarity=0.345 Sum_probs=46.9
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeE
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRN 97 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~ 97 (106)
..=|.|-++|..-.. -+...|+.+|.|...... . + -.+-+|.|.+..+|++||. .+|..|+|..
T Consensus 197 D~WVTVfGFppg~~s-~vL~~F~~cG~Vvkhv~~--~-n---gNwMhirYssr~~A~KALs-kng~ii~g~v 260 (350)
T KOG4285|consen 197 DTWVTVFGFPPGQVS-IVLNLFSRCGEVVKHVTP--S-N---GNWMHIRYSSRTHAQKALS-KNGTIIDGDV 260 (350)
T ss_pred cceEEEeccCccchh-HHHHHHHhhCeeeeeecC--C-C---CceEEEEecchhHHHHhhh-hcCeeeccce
Confidence 344556677776443 467889999988765332 1 2 2488999999999999999 5999988764
No 142
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=92.77 E-value=0.74 Score=23.78 Aligned_cols=56 Identities=16% Similarity=0.155 Sum_probs=42.7
Q ss_pred CCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 37 DTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 37 ~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
.++-++++..+..|+- ..+ ..+. + -=||.|.+..+|+++....++..+.+.+|.++
T Consensus 11 ~~~v~d~K~~Lr~y~~-~~I--~~d~-t-----GfYIvF~~~~Ea~rC~~~~~~~~~f~y~m~M~ 66 (66)
T PF11767_consen 11 GVTVEDFKKRLRKYRW-DRI--RDDR-T-----GFYIVFNDSKEAERCFRAEDGTLFFTYRMQME 66 (66)
T ss_pred CccHHHHHHHHhcCCc-ceE--EecC-C-----EEEEEECChHHHHHHHHhcCCCEEEEEEEEeC
Confidence 4677899999999863 333 2222 2 36799999999999999999999888887753
No 143
>KOG4483 consensus Uncharacterized conserved protein [Function unknown]
Probab=92.64 E-value=0.53 Score=32.78 Aligned_cols=57 Identities=21% Similarity=0.158 Sum_probs=48.8
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHH
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKE 87 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~ 87 (106)
.-.+.|-|-++|.....++|...|+.|+ .-..|.|+-+. ++|..|.+...|..||-.
T Consensus 389 dlpHVlEIydfp~efkteDll~~f~~yq~kgfdIkWvDdt-------halaVFss~~~AaeaLt~ 446 (528)
T KOG4483|consen 389 DLPHVLEIYDFPDEFKTEDLLKAFETYQNKGFDIKWVDDT-------HALAVFSSVNRAAEALTL 446 (528)
T ss_pred cccceeEeccCchhhccHHHHHHHHHhhcCCceeEEeecc-------eeEEeecchHHHHHHhhc
Confidence 3467888999999999999999999996 46778888775 899999999999999983
No 144
>KOG2253 consensus U1 snRNP complex, subunit SNU71 and related PWI-motif proteins [RNA processing and modification]
Probab=91.60 E-value=0.23 Score=36.33 Aligned_cols=71 Identities=21% Similarity=0.345 Sum_probs=57.6
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
......++||+|+...+..+-+..+...+|-|..+... .|||..|........++..++...++|..+.+.
T Consensus 36 ~~~~~~~vfv~~~~~~~s~~~~~~il~~~g~v~s~kr~---------~fgf~~f~~~~~~~ra~r~~t~~~~~~~kl~~~ 106 (668)
T KOG2253|consen 36 PLPPRDTVFVGNISYLVSQEFWKSILAKSGFVPSWKRD---------KFGFCEFLKHIGDLRASRLLTELNIDDQKLIEN 106 (668)
T ss_pred CCCCCceeEecchhhhhhHHHHHHHHhhCCcchhhhhh---------hhcccchhhHHHHHHHHHHhcccCCCcchhhcc
Confidence 33456789999999999999999999999977655332 289999999999999999888888888776543
No 145
>PF15513 DUF4651: Domain of unknown function (DUF4651)
Probab=90.01 E-value=1 Score=22.92 Aligned_cols=21 Identities=29% Similarity=0.270 Sum_probs=17.2
Q ss_pred HHHHHHhccCCCeeEEEEeec
Q 034021 41 TVLKDAFGQHGEIIEVKVICD 61 (106)
Q Consensus 41 ~~l~~~f~~~g~i~~~~~~~~ 61 (106)
.+|+++|+..|.|.-+.+-.-
T Consensus 9 ~~iR~~fs~lG~I~vLYvn~~ 29 (62)
T PF15513_consen 9 AEIRQFFSQLGEIAVLYVNPY 29 (62)
T ss_pred HHHHHHHHhcCcEEEEEEccc
Confidence 679999999999988876433
No 146
>KOG4210 consensus Nuclear localization sequence binding protein [Transcription]
Probab=89.97 E-value=0.28 Score=32.60 Aligned_cols=74 Identities=23% Similarity=0.241 Sum_probs=55.0
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEE
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNI 98 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l 98 (106)
..++.|++++.+.+.+.+...++...|................++++++.|...+.+..++.........++.+
T Consensus 87 ~~~~~f~g~~s~~~e~~~~~~~~~~~g~~~~~~~S~~~~~~~sk~~~s~~f~~ks~~~~~l~~s~~~~~~~~~~ 160 (285)
T KOG4210|consen 87 SSSTFFVGELSENIEESEDDNFSSEAGLRVDARSSSLEDSLSSKGGLSVHFAGKSQFFAALEESGSKVLDGNKG 160 (285)
T ss_pred ccccccccccccchhhccccccchhhcCcccchhhhhccccccccceeeccccHHHHHHHHHhhhccccccccc
Confidence 46788999999999888888888888865555444433356678899999999999999999544345554443
No 147
>PF07530 PRE_C2HC: Associated with zinc fingers; InterPro: IPR006579 This domain is present in proteins found exclusively in the arthropods, including a number of Drosophila species, the silk moth and the gypsy moth. These proteins are possibly involved in RNA binding or single strand DNA binding.
Probab=88.43 E-value=1.4 Score=22.81 Aligned_cols=62 Identities=18% Similarity=0.259 Sum_probs=43.0
Q ss_pred HHHHHHhccCC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 41 TVLKDAFGQHG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 41 ~~l~~~f~~~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
++|.+-|...| .+.++.-+....++.....-||++....+... .++-..+++..++|+...+
T Consensus 2 ~~I~~~L~~~G~~v~~i~~~~~~~~k~pl~mf~veL~p~~~~k~---i~~Ik~l~~~~V~vE~~~k 64 (68)
T PF07530_consen 2 EEIKEELKDQGHPVRNIHNMHSRNTKKPLNMFFVELEPKPNNKE---IYKIKTLCGQRVKVERPRK 64 (68)
T ss_pred HHHHHHHHHcCCceEEEEccccCCCCCCceEEEEeeccCccccc---eeehHhhCCeEEEEecCCC
Confidence 46777787777 67777766666566677788888876655222 3344567888899987654
No 148
>KOG4676 consensus Splicing factor, arginine/serine-rich [RNA processing and modification]
Probab=88.07 E-value=0.027 Score=38.69 Aligned_cols=65 Identities=12% Similarity=0.101 Sum_probs=51.7
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeC
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLD 94 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~ 94 (106)
..++++|.+|+..+...++.+.|..+|.+....+.-. ....++-++|....+...|+. .+|.++.
T Consensus 150 irRt~~v~sl~~~~~l~e~~e~f~r~Gev~ya~~ask----~~s~~c~~sf~~qts~~halr-~~gre~k 214 (479)
T KOG4676|consen 150 IRRTREVQSLISAAILPESGESFERKGEVSYAHTASK----SRSSSCSHSFRKQTSSKHALR-SHGRERK 214 (479)
T ss_pred HHhhhhhhcchhhhcchhhhhhhhhcchhhhhhhhcc----CCCcchhhhHhhhhhHHHHHH-hcchhhh
Confidence 3588999999999999999999999998877655432 233467799999999999998 5776654
No 149
>KOG2318 consensus Uncharacterized conserved protein [Function unknown]
Probab=87.31 E-value=4.6 Score=29.65 Aligned_cols=75 Identities=21% Similarity=0.323 Sum_probs=56.5
Q ss_pred CCceEEEcCCCCC-CCHHHHHHHhccC----CCeeEEEEeecCC----------CCC-----------------------
Q 034021 25 SRTKLFIGGLSYD-TNETVLKDAFGQH----GEIIEVKVICDRV----------TGQ----------------------- 66 (106)
Q Consensus 25 ~~~~l~v~~lp~~-~~~~~l~~~f~~~----g~i~~~~~~~~~~----------~~~----------------------- 66 (106)
..+.|-|-|+.|+ +...+|.-+|+.| |.|..|.|....- .|.
T Consensus 173 ~T~RLAVvNMDWd~v~AkDL~v~~nSFlP~gGsilSV~IYpSeFGkeRM~eEeV~GP~~el~~~~e~~~~s~sD~ee~~~ 252 (650)
T KOG2318|consen 173 ETKRLAVVNMDWDRVKAKDLFVLFNSFLPKGGSILSVKIYPSEFGKERMKEEEVHGPPKELFKPVEEYKESESDDEEEED 252 (650)
T ss_pred ccceeeEeccccccccHHHHHHHHHhhcCCCCceeEEEechhhhhHHHhhhhcccCChhhhccccccCcccccchhhhhh
Confidence 4567888899998 7889999999876 5788887754420 011
Q ss_pred --------------ceeEEEEEecCHHHHHHHHHHhCCceeCCeEEE
Q 034021 67 --------------SRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIR 99 (106)
Q Consensus 67 --------------~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~ 99 (106)
.-=||.|+|.+...|.+.-..++|+.+......
T Consensus 253 ~~~~kLR~Yq~~rLkYYyAVvecDsi~tA~~vYe~CDG~EfEsS~~~ 299 (650)
T KOG2318|consen 253 VDREKLRQYQLNRLKYYYAVVECDSIETAKAVYEECDGIEFESSANK 299 (650)
T ss_pred HHHHHHHHHHhhhheeEEEEEEecCchHHHHHHHhcCcceeccccce
Confidence 112799999999999999999999998654433
No 150
>smart00596 PRE_C2HC PRE_C2HC domain.
Probab=84.96 E-value=2.2 Score=22.25 Aligned_cols=62 Identities=19% Similarity=0.283 Sum_probs=41.6
Q ss_pred HHHHHHhccCC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 41 TVLKDAFGQHG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 41 ~~l~~~f~~~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
.+|.+-|...| ++.++.-+....++.....-+|+.....+... .++-..+++.++.|+...+
T Consensus 2 ~~I~~~L~~~G~~v~~i~~m~~~~~r~P~nmf~vel~~~~~~~~---Il~ik~Lg~~~V~VEr~~k 64 (69)
T smart00596 2 SQIEEALKDIGFPVLFIHNMLNRDTKNPQNMFEVELVPAANGKE---ILNIKTLGGQRVTVERPHK 64 (69)
T ss_pred HHHHHHHHHcCCceeEEEcccccCCCCcceeEEEEeeecCCCcc---eEeehhhCCeeEEEecCcc
Confidence 46788888888 67777766666555566677887765543322 3444567888898887543
No 151
>KOG4410 consensus 5-formyltetrahydrofolate cyclo-ligase [Coenzyme transport and metabolism]
Probab=84.09 E-value=6.2 Score=26.55 Aligned_cols=47 Identities=21% Similarity=0.292 Sum_probs=36.1
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCC-eeEEEEeecCCCCCceeEEEEEecCH
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGE-IIEVKVICDRVTGQSRGYGFVKFASE 78 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~-i~~~~~~~~~~~~~~~g~~fv~f~~~ 78 (106)
..-||++|||.++.-.+|+..+.+.+. ...+.|. | +.+-+|+.|-+.
T Consensus 330 ~~di~~~nl~rd~rv~dlk~~lr~~~~~pm~iswk-----g-~~~k~flh~~~~ 377 (396)
T KOG4410|consen 330 KTDIKLTNLSRDIRVKDLKSELRKRECTPMSISWK-----G-HFGKCFLHFGNR 377 (396)
T ss_pred ccceeeccCccccchHHHHHHHHhcCCCceeEeee-----c-CCcceeEecCCc
Confidence 467999999999999999999988764 4555552 2 345699999763
No 152
>KOG2295 consensus C2H2 Zn-finger protein [General function prediction only]
Probab=75.34 E-value=0.29 Score=35.27 Aligned_cols=72 Identities=10% Similarity=0.104 Sum_probs=52.8
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCe
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGR 96 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~ 96 (106)
..+.++++|++.+.+-.+|...++.+..+..+.+-...........+++.|+-.-...-|+-+||++.+...
T Consensus 230 ke~sll~rni~Pnis~aeIe~~ck~i~~~lrfals~~~aek~~~r~lwv~fk~~~ni~~a~~aLn~irl~s~ 301 (648)
T KOG2295|consen 230 KECSLLVRNILPNISVAEIENLCKGIPGFLRFALSTINAEKNFERRLWVTFKRGTNIKEACWALNGIRLRSN 301 (648)
T ss_pred HHHHHHHhccCCcccHHHHHHHhccCchheeeeccCchHHHHHHHHhhHhhccccchHHHHHHhhhcccccc
Confidence 357889999999999999999999886555554433332333455788999877777777777888765443
No 153
>KOG2891 consensus Surface glycoprotein [General function prediction only]
Probab=73.02 E-value=7.8 Score=26.02 Aligned_cols=70 Identities=21% Similarity=0.463 Sum_probs=44.4
Q ss_pred CCCCceEEEcCCCCC------------CCHHHHHHHhccCCCeeEEEEeecC-----CCCCc-----eeE---------E
Q 034021 23 ANSRTKLFIGGLSYD------------TNETVLKDAFGQHGEIIEVKVICDR-----VTGQS-----RGY---------G 71 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~------------~~~~~l~~~f~~~g~i~~~~~~~~~-----~~~~~-----~g~---------~ 71 (106)
...+.+||+.+||-. .+++-|+..|..||.|..+.++... .+|+. .|| |
T Consensus 146 gerpdti~la~ip~kwf~lkedg~~dlpse~rlr~a~eafg~ir~vdipicdplr~~mn~kisgiq~~gfg~g~dlffea 225 (445)
T KOG2891|consen 146 GERPDTIHLAGIPCKWFALKEDGSEDLPSEDRLRKAFEAFGEIRNVDIPICDPLREEMNGKISGIQFHGFGFGGDLFFEA 225 (445)
T ss_pred CCCCCceeecCCcceeeeecccccccCChHHHHHHHHHHhccceecCCcccchhHHHhcCccccceeeccccCcchhHHH
Confidence 334678999998832 3467799999999999988775432 13332 222 3
Q ss_pred EEEecCHHHHHHHHHHhCCce
Q 034021 72 FVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 72 fv~f~~~~~a~~al~~l~~~~ 92 (106)
||+|...-....|+.+|.|..
T Consensus 226 yvqfmeykgfa~amdalr~~k 246 (445)
T KOG2891|consen 226 YVQFMEYKGFAQAMDALRGMK 246 (445)
T ss_pred HHHHHHHHhHHHHHHHHhcch
Confidence 355555555566666666654
No 154
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=72.88 E-value=2.7 Score=23.01 Aligned_cols=25 Identities=20% Similarity=0.384 Sum_probs=20.5
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhc
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFG 48 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~ 48 (106)
...++|-|+|||....+++|++.++
T Consensus 50 vs~rtVlvsgip~~l~ee~l~D~Le 74 (88)
T PF07292_consen 50 VSKRTVLVSGIPDVLDEEELRDKLE 74 (88)
T ss_pred ccCCEEEEeCCCCCCChhhheeeEE
Confidence 3567899999999999999886653
No 155
>PF02714 DUF221: Domain of unknown function DUF221; InterPro: IPR003864 This domain is found in a family of hypothetical transmembrane proteins none of which have any known function, the aligned region is at 538 residues at maximum length.; GO: 0016020 membrane
Probab=70.46 E-value=7.3 Score=26.09 Aligned_cols=22 Identities=27% Similarity=0.301 Sum_probs=18.4
Q ss_pred EEEEecCHHHHHHHHHHhCCce
Q 034021 71 GFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 71 ~fv~f~~~~~a~~al~~l~~~~ 92 (106)
|||.|++..+|+.+++.+....
T Consensus 1 aFVtF~~~~~a~~~~q~~~~~~ 22 (325)
T PF02714_consen 1 AFVTFNSQKSAQIALQLLLSKR 22 (325)
T ss_pred CEEEECCHHHHHHHHHHHhcCC
Confidence 6999999999999999655444
No 156
>KOG2193 consensus IGF-II mRNA-binding protein IMP, contains RRM and KH domains [RNA processing and modification; General function prediction only]
Probab=69.64 E-value=0.11 Score=36.22 Aligned_cols=75 Identities=17% Similarity=0.314 Sum_probs=58.7
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEEEe
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRVNY 102 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~ 102 (106)
..+.+.|+|+|.....+-+..+...||.+..+..+.-. + -....-+.|...+.++.++..++|..+.+..+++.|
T Consensus 79 rsrk~Qirnippql~wevld~Ll~qyg~ve~~eqvnt~-~--etavvnvty~~~~~~~~ai~kl~g~Q~en~~~k~~Y 153 (584)
T KOG2193|consen 79 RSRKIQIRNIPPQLQWEVLDSLLAQYGTVENCEQVNTD-S--ETAVVNVTYSAQQQHRQAIHKLNGPQLENQHLKVGY 153 (584)
T ss_pred HhhhhhHhcCCHHHHHHHHHHHHhccCCHhHhhhhccc-h--HHHHHHHHHHHHHHHHHHHHhhcchHhhhhhhhccc
Confidence 34668899999999999999999999999888653322 1 112344678889999999999999988887777765
No 157
>KOG4454 consensus RNA binding protein (RRM superfamily) [General function prediction only]
Probab=69.53 E-value=0.87 Score=29.29 Aligned_cols=66 Identities=30% Similarity=0.483 Sum_probs=50.4
Q ss_pred CceEEEcC----CCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce
Q 034021 26 RTKLFIGG----LSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 26 ~~~l~v~~----lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
..+++-++ |...++++.+...|+.-|++...++..+. ++..+.++++.+........++...++..
T Consensus 80 q~~~r~G~shapld~r~~~ei~~~v~s~a~p~~~~R~~~~~-d~rnrn~~~~~~qr~~~~P~~~~~y~~l~ 149 (267)
T KOG4454|consen 80 QRTLRCGNSHAPLDERVTEEILYEVFSQAGPIEGVRIPTDN-DGRNRNFGFVTYQRLCAVPFALDLYQGLE 149 (267)
T ss_pred hcccccCCCcchhhhhcchhhheeeecccCCCCCccccccc-cCCccCccchhhhhhhcCcHHhhhhcccC
Confidence 34455555 56667888888999999999998888776 57777889999888888777887666554
No 158
>PF03468 XS: XS domain; InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important. The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=67.74 E-value=21 Score=20.51 Aligned_cols=45 Identities=18% Similarity=0.339 Sum_probs=27.0
Q ss_pred CHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEec-CHHHHHHHHH
Q 034021 39 NETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFA-SEAAAGEAIK 86 (106)
Q Consensus 39 ~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~-~~~~a~~al~ 86 (106)
+.+.|++.|+.|.++. +..+... ..+.|+++|.|. +..-...|+.
T Consensus 30 ~~~~l~~~l~~f~p~k-v~~l~~~--~gh~g~aiv~F~~~w~Gf~~A~~ 75 (116)
T PF03468_consen 30 SNEELLDKLAEFNPLK-VKPLYGK--QGHTGFAIVEFNKDWSGFKNAMR 75 (116)
T ss_dssp -SHHHHHHHHH---SE-EEEEEET--TEEEEEEEEE--SSHHHHHHHHH
T ss_pred CHHHHHHHHHhcCCce-eEECcCC--CCCcEEEEEEECCChHHHHHHHH
Confidence 4578999999998765 4444554 246889999997 4565666665
No 159
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=64.50 E-value=21 Score=19.34 Aligned_cols=56 Identities=18% Similarity=0.222 Sum_probs=39.9
Q ss_pred EEcCCCCCCCHHHHHHHhcc-CC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh
Q 034021 30 FIGGLSYDTNETVLKDAFGQ-HG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 30 ~v~~lp~~~~~~~l~~~f~~-~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l 88 (106)
|+--.+..++..+|++.++. || .+..+....-+ .+ ..=|+|.+.....|......+
T Consensus 24 y~F~V~~~anK~eIK~AvE~lf~VkV~~VnT~~~~-~~--~KKA~V~L~~g~~A~~va~ki 81 (84)
T PRK14548 24 LTFIVDRRATKPDIKRAVEELFDVKVEKVNTLITP-KG--EKKAYVKLAEEYDAEEIASRL 81 (84)
T ss_pred EEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcC-CC--cEEEEEEeCCCCcHHHHHHhh
Confidence 33346889999999999986 67 57777665554 22 224999999888877766543
No 160
>KOG1295 consensus Nonsense-mediated decay protein Upf3 [RNA processing and modification]
Probab=63.94 E-value=14 Score=25.69 Aligned_cols=69 Identities=13% Similarity=0.189 Sum_probs=47.4
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCCC-eeEEEEeecCC--CCCceeEEEEEecCHHHHHHHHHHhCCcee
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHGE-IIEVKVICDRV--TGQSRGYGFVKFASEAAAGEAIKEMDGLLL 93 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g~-i~~~~~~~~~~--~~~~~g~~fv~f~~~~~a~~al~~l~~~~~ 93 (106)
.-..+.|.+||...++.++.+-+.++-. +....+..... .....+.+||.|....+.......++|+.+
T Consensus 6 ~~~Kvv~rrlpp~l~~~~~~eqi~p~~~~v~~~~F~~a~~s~~~~~ysrayinFk~~~dv~ef~~~f~g~if 77 (376)
T KOG1295|consen 6 AKVKVVVRRLPPKLTEEQLLEQINPFPEHVNWEFFAKADESLRNHKYSRAYINFKNPEDVEEFRRRFDGYIF 77 (376)
T ss_pred cceeeeeecCCCcccHHHHhhhcCCCccccchheeccccccchhhhhhhhhhccccHHHHHHHHhhCCceEE
Confidence 3457888999999999998888877632 22222221110 011246799999999998888888888864
No 161
>COG5193 LHP1 La protein, small RNA-binding pol III transcript stabilizing protein and related La-motif-containing proteins involved in translation [Posttranslational modification, protein turnover, chaperones / Translation, ribosomal structure and biogenesis]
Probab=63.04 E-value=3.9 Score=28.65 Aligned_cols=64 Identities=17% Similarity=0.206 Sum_probs=49.1
Q ss_pred CCCCceEEEcCCCCCCCH--------HHHHHHhcc--CCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHH
Q 034021 23 ANSRTKLFIGGLSYDTNE--------TVLKDAFGQ--HGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIK 86 (106)
Q Consensus 23 ~~~~~~l~v~~lp~~~~~--------~~l~~~f~~--~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~ 86 (106)
....+.+|+.+++..-.. +++..+|.. .+.+..+...++..+....|..|++|.....+++.+.
T Consensus 171 ~~~qr~~y~n~fG~e~~~~a~~~e~~~d~~~~~p~h~h~~~~~i~~rrd~~nkn~~gSv~~efk~~~~~q~~nn 244 (438)
T COG5193 171 SQMQRDVYQNGFGKEDVNNASRPEQQEDLEIQFPPHYHAPPSQIRNRRDWLNKNFRGSVFVEFKYFREAQRFNN 244 (438)
T ss_pred hhHhhhHHhhcCCcccccccccchhhhhHHhhCCCcccCChhhccchhhhhhccccCcccccccChHHHHHHhc
Confidence 334567788888766443 488999988 5678888887776666678889999999999998874
No 162
>KOG4019 consensus Calcineurin-mediated signaling pathway inhibitor DSCR1 [Signal transduction mechanisms; General function prediction only]
Probab=63.01 E-value=9.4 Score=23.91 Aligned_cols=73 Identities=14% Similarity=0.244 Sum_probs=47.9
Q ss_pred CceEEEcCCCCCCCH-----HHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCe-EEE
Q 034021 26 RTKLFIGGLSYDTNE-----TVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGR-NIR 99 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~-----~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~-~l~ 99 (106)
..++.+..++..+-. .....+|..|.......+++. .+..-|.|.+...+..|...+++..+.|. .++
T Consensus 10 p~~~~~c~i~~~VF~~~~~k~~~~~lFrq~n~~~~fq~lrs------frrvRi~f~~p~~a~~a~i~~~~~~f~~~~~~k 83 (193)
T KOG4019|consen 10 PTAIIACDIHEEVFVNREDKALFENLFRQINEDATFQLLRS------FRRVRINFSNPEAAADARIKLHSTSFNGKNELK 83 (193)
T ss_pred cceeeeecccHHhhccHHHHHHHHhHHhhhCcchHHHHHHh------hceeEEeccChhHHHHHHHHhhhcccCCCceEE
Confidence 344556666655322 234445556655555555443 24577999999999999999999998887 666
Q ss_pred EEecc
Q 034021 100 VNYAH 104 (106)
Q Consensus 100 v~~s~ 104 (106)
..++.
T Consensus 84 ~yfaQ 88 (193)
T KOG4019|consen 84 LYFAQ 88 (193)
T ss_pred EEEcc
Confidence 66553
No 163
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=61.93 E-value=16 Score=22.81 Aligned_cols=41 Identities=24% Similarity=0.443 Sum_probs=33.2
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecC
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDR 62 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~ 62 (106)
.......+++.+++..++...+...|..+|.+....+....
T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 (306)
T COG0724 221 LLEKSDNLYVGNLPLKTAEEELADLFKSRGDIVRASLPPSK 261 (306)
T ss_pred cccccceeeccccccccchhHHHHhccccccceeeeccCCC
Confidence 34456788999999999999999999999998666665444
No 164
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=61.05 E-value=24 Score=18.78 Aligned_cols=57 Identities=18% Similarity=0.170 Sum_probs=39.5
Q ss_pred EEEcCCCCCCCHHHHHHHhcc-CC-CeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHh
Q 034021 29 LFIGGLSYDTNETVLKDAFGQ-HG-EIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 29 l~v~~lp~~~~~~~l~~~f~~-~g-~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l 88 (106)
-|+-..+..++..+|++.++. || .+..+....-+. ...=|||.+.....|......+
T Consensus 16 ~y~F~V~~~anK~eIK~avE~lf~VkV~~Vnt~~~~~---~~KKA~VtL~~g~~a~~va~k~ 74 (77)
T TIGR03636 16 KLTFIVDRKATKGDIKRAVEKLFDVKVEKVNTLITPR---GEKKAYVKLAEEYAAEEIASRL 74 (77)
T ss_pred EEEEEECCCCCHHHHHHHHHHHhCCceEEEEeEEcCC---CceEEEEEECCCCcHHHHHHhh
Confidence 444457899999999999986 66 467776555442 2234999998877777665543
No 165
>KOG4213 consensus RNA-binding protein La [RNA processing and modification]
Probab=54.96 E-value=24 Score=22.19 Aligned_cols=56 Identities=16% Similarity=0.086 Sum_probs=35.8
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccC-CCeeEEEEeecCCCC--CceeEEEEEecCHHHHHHHHHH
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQH-GEIIEVKVICDRVTG--QSRGYGFVKFASEAAAGEAIKE 87 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~-g~i~~~~~~~~~~~~--~~~g~~fv~f~~~~~a~~al~~ 87 (106)
.+++|.. .+++++.++...- |.+.++...+.. ++ ..+|..||.|...+.+.+++..
T Consensus 111 ~r~v~~K-----~td~ql~~l~qw~~~k~~nv~mr~~~-~k~~~fkGsvkv~f~tk~qa~a~~~~ 169 (205)
T KOG4213|consen 111 ERTVYKK-----ITDDQLDDLNQWASGKGHNVKMRRHG-NKAHPFKGSVKVTFQTKEQAFANDDT 169 (205)
T ss_pred Hhhhhcc-----CCHHHHHHHHHHhcccceEeeccccC-CCCCCCCCceEEEeecHHHHHhhhhh
Confidence 4556554 4444444443221 677777664443 33 4678999999999999987763
No 166
>COG5507 Uncharacterized conserved protein [Function unknown]
Probab=50.04 E-value=32 Score=19.38 Aligned_cols=22 Identities=14% Similarity=0.323 Sum_probs=17.8
Q ss_pred ceeEEEEEecCHHHHHHHHHHh
Q 034021 67 SRGYGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 67 ~~g~~fv~f~~~~~a~~al~~l 88 (106)
---|++++|.+.+...++...+
T Consensus 65 ~VvFsW~~Y~skq~rDA~~~km 86 (117)
T COG5507 65 EVVFSWIEYPSKQVRDAANAKM 86 (117)
T ss_pred EEEEEEEEcCchhHHHHHHHHh
Confidence 3458999999999998887654
No 167
>TIGR02542 B_forsyth_147 Bacteroides forsythus 147-residue repeat. The longest predicted protein in Bacteroides forsythus ATCC 43037 is over 3000 residues long and lacks homology to other known proteins. Immediately after the signal sequence are four tandem repeats, approximately 147 residues long. This model describes that repeat. This model describes that repeat.
Probab=49.58 E-value=31 Score=20.08 Aligned_cols=46 Identities=20% Similarity=0.161 Sum_probs=29.8
Q ss_pred CCCCCHHHHHHHhc---cCCCeeEEEEeecCCCCCceeEEEEEecCHHH
Q 034021 35 SYDTNETVLKDAFG---QHGEIIEVKVICDRVTGQSRGYGFVKFASEAA 80 (106)
Q Consensus 35 p~~~~~~~l~~~f~---~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~ 80 (106)
|..++..+++++|+ .|..|..-.+.++-.....-..+|..|.....
T Consensus 83 PYTlT~~e~r~iF~Epm~YQGITReQV~rdGLP~GsYRiCFrL~~~~~~ 131 (145)
T TIGR02542 83 PYTLTYNELRQIFREPMVYQGITREQVQRDGLPEGSYRICFRLFNATQF 131 (145)
T ss_pred ceeeeHHHHHHHHhhhhhhccccHHHHhhcCCCCCceEEEEEEeccchh
Confidence 66789999999997 35556655555553333344567877766543
No 168
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=49.13 E-value=62 Score=22.02 Aligned_cols=57 Identities=12% Similarity=0.148 Sum_probs=41.1
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCC-------CCCceeEEEEEecCHHH
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRV-------TGQSRGYGFVKFASEAA 80 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~-------~~~~~g~~fv~f~~~~~ 80 (106)
-..+.|-..|+..++.-..+..-|-.||+|+.++++.+.. ..+......+.|-+.+.
T Consensus 13 YrTRSLLfeNv~~sidLh~Fl~~fv~~~pIESiYL~~~~~~~~d~~~~d~~~~SilLSFlsr~~ 76 (309)
T PF10567_consen 13 YRTRSLLFENVNNSIDLHSFLTKFVKFGPIESIYLIKSNDKPSDDYNDDKNNQSILLSFLSREI 76 (309)
T ss_pred ceeHHHHHhhccccccHHHHHHHhhccCceeEEEEecCCCcccccccccccceEEEEeeechHH
Confidence 3456677889998898888888899999999999987751 11123356677766654
No 169
>PF02946 GTF2I: GTF2I-like repeat; InterPro: IPR004212 This region of sequence similarity is found up to six times in a variety of proteins including general transcription factor II-I (GTF2I). It has been suggested that this may be a DNA binding domain [, ].; PDB: 2E3L_A 2D99_A 2DN4_A 2D9B_A 2EJE_A 1Q60_A 2DZR_A 2DN5_A 2DZQ_A 2ED2_A.
Probab=46.55 E-value=34 Score=18.21 Aligned_cols=31 Identities=16% Similarity=0.190 Sum_probs=17.6
Q ss_pred CceEEEcCCCCCCC--------HHHHHHHhccCCCeeEE
Q 034021 26 RTKLFIGGLSYDTN--------ETVLKDAFGQHGEIIEV 56 (106)
Q Consensus 26 ~~~l~v~~lp~~~~--------~~~l~~~f~~~g~i~~~ 56 (106)
...|+|.|||.++. -..|++++..-..|.++
T Consensus 34 p~~v~V~GLPegi~fr~P~~Y~i~~L~~IL~~~~~I~Fv 72 (76)
T PF02946_consen 34 PEAVYVQGLPEGIPFRRPSNYGIPRLEKILEASSRIRFV 72 (76)
T ss_dssp TTTEEEES--TT--SS-TTTS-HHHHHHHHHTTTT-EEE
T ss_pred CCcEEEEeCCCCCcCCCCCcCCHHHHHHHHHccCCcEEE
Confidence 34699999987654 46677777665555544
No 170
>PF11411 DNA_ligase_IV: DNA ligase IV; InterPro: IPR021536 DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface []. ; GO: 0003910 DNA ligase (ATP) activity; PDB: 3II6_Y 2E2W_A 1IK9_C.
Probab=43.80 E-value=21 Score=16.12 Aligned_cols=16 Identities=38% Similarity=0.542 Sum_probs=9.9
Q ss_pred CCCCHHHHHHHhccCC
Q 034021 36 YDTNETVLKDAFGQHG 51 (106)
Q Consensus 36 ~~~~~~~l~~~f~~~g 51 (106)
.++++++|++.|...+
T Consensus 19 ~Dtd~~~Lk~vF~~i~ 34 (36)
T PF11411_consen 19 VDTDEDQLKEVFNRIK 34 (36)
T ss_dssp S---HHHHHHHHHCS-
T ss_pred ccCCHHHHHHHHHHhc
Confidence 3567899999998754
No 171
>PF03439 Spt5-NGN: Early transcription elongation factor of RNA pol II, NGN section; InterPro: IPR005100 Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4 []. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit []. ; PDB: 3H7H_B 3QQC_D 3P8B_B 2EXU_A.
Probab=41.97 E-value=42 Score=17.91 Aligned_cols=26 Identities=27% Similarity=0.442 Sum_probs=21.8
Q ss_pred ceeEEEEEecCHHHHHHHHHHhCCce
Q 034021 67 SRGYGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 67 ~~g~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
.+||-||+=.+..++..|++.+.+..
T Consensus 43 lkGyIyVEA~~~~~V~~ai~gi~~i~ 68 (84)
T PF03439_consen 43 LKGYIYVEAERESDVKEAIRGIRHIR 68 (84)
T ss_dssp STSEEEEEESSHHHHHHHHTT-TTEE
T ss_pred CceEEEEEeCCHHHHHHHHhccccee
Confidence 57899999999999999999776654
No 172
>PRK11901 hypothetical protein; Reviewed
Probab=41.91 E-value=1.2e+02 Score=21.05 Aligned_cols=67 Identities=18% Similarity=0.137 Sum_probs=41.4
Q ss_pred ccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCCc-eeEEEEEecCHHHHHHHHHHhCC
Q 034021 20 LSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQS-RGYGFVKFASEAAAGEAIKEMDG 90 (106)
Q Consensus 20 ~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~-~g~~fv~f~~~~~a~~al~~l~~ 90 (106)
...+...++|.+... ..++.|..|....+ +..+++.....+|+. .-.-+-.|.+.++|..|+..|..
T Consensus 239 ~s~p~~~YTLQL~Aa---s~~~~L~~f~~~~~-L~~~~VYqT~RnGkpWYVVvyG~Y~Sr~eAk~Ai~sLPa 306 (327)
T PRK11901 239 SSAPASHYTLQLSSA---SRSDTLNAYAKKQN-LSHYHVYETKRDGKPWYVLVSGNYASSAEAKRAIATLPA 306 (327)
T ss_pred hcCCCCCeEEEeecC---CCHHHHHHHHHHcC-cCceEEEEEEECCceEEEEEecCcCCHHHHHHHHHhCCH
Confidence 334455677777654 34677888887765 344444443333432 12234568899999999998754
No 173
>COG5584 Predicted small secreted protein [Function unknown]
Probab=40.19 E-value=69 Score=17.96 Aligned_cols=31 Identities=23% Similarity=0.275 Sum_probs=23.3
Q ss_pred CCCCCCCHHHHHHHhccCCCeeEEEEeecCC
Q 034021 33 GLSYDTNETVLKDAFGQHGEIIEVKVICDRV 63 (106)
Q Consensus 33 ~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~ 63 (106)
|+.....-.-+++.|+++|+|...++...+.
T Consensus 29 ~is~e~alk~vk~afk~~mnI~GSwI~~~pe 59 (103)
T COG5584 29 NISRENALKVVKEAFKQFMNIKGSWIVYEPE 59 (103)
T ss_pred ccChhHHHHHHHHHhcccCCcceeEEEEecc
Confidence 3444455566888899999999988887764
No 174
>PHA01632 hypothetical protein
Probab=36.44 E-value=42 Score=16.73 Aligned_cols=19 Identities=16% Similarity=0.258 Sum_probs=15.9
Q ss_pred EcCCCCCCCHHHHHHHhcc
Q 034021 31 IGGLSYDTNETVLKDAFGQ 49 (106)
Q Consensus 31 v~~lp~~~~~~~l~~~f~~ 49 (106)
|...|...++++|++.+.+
T Consensus 21 ieqvp~kpteeelrkvlpk 39 (64)
T PHA01632 21 IEQVPQKPTEEELRKVLPK 39 (64)
T ss_pred hhhcCCCCCHHHHHHHHHH
Confidence 4688999999999988764
No 175
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=34.52 E-value=28 Score=20.36 Aligned_cols=31 Identities=35% Similarity=0.313 Sum_probs=24.8
Q ss_pred EEEcCCCCC-CCHHHHHHHhccCCCeeEEEEe
Q 034021 29 LFIGGLSYD-TNETVLKDAFGQHGEIIEVKVI 59 (106)
Q Consensus 29 l~v~~lp~~-~~~~~l~~~f~~~g~i~~~~~~ 59 (106)
|.|.|||.. .+++.++.+.+.+|.+..+...
T Consensus 107 Vri~glP~~~~~~~~~~~i~~~iG~~i~vD~~ 138 (153)
T PF14111_consen 107 VRIYGLPLHLWSEEILKAIGSKIGEPIEVDEN 138 (153)
T ss_pred hhhccCCHHHhhhHHHHHHHHhcCCeEEEEcC
Confidence 556799988 6778888888999998887553
No 176
>KOG4008 consensus rRNA processing protein RRP7 [RNA processing and modification]
Probab=34.31 E-value=45 Score=21.92 Aligned_cols=34 Identities=26% Similarity=0.369 Sum_probs=27.7
Q ss_pred CCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeE
Q 034021 22 DANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIE 55 (106)
Q Consensus 22 ~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~ 55 (106)
......++|+-|+|...+++.+.++.+..|-+..
T Consensus 36 ~~~eKd~lfl~Nvp~~~tee~lkr~vsqlg~vq~ 69 (261)
T KOG4008|consen 36 NSNEKDCLFLVNVPLLSTEEHLKRFVSQLGHVQE 69 (261)
T ss_pred ccccccceeeecccccccHHHHHHHHHHhhhhhh
Confidence 4445678999999999999999999998874433
No 177
>smart00457 MACPF membrane-attack complex / perforin.
Probab=32.52 E-value=41 Score=20.84 Aligned_cols=28 Identities=14% Similarity=0.183 Sum_probs=22.2
Q ss_pred EcCCCCCCCHHHHHHHhccCCC--eeEEEE
Q 034021 31 IGGLSYDTNETVLKDAFGQHGE--IIEVKV 58 (106)
Q Consensus 31 v~~lp~~~~~~~l~~~f~~~g~--i~~~~~ 58 (106)
+..||..........||..||. |..+.+
T Consensus 30 l~~Lp~~~~~~~~~~fi~~yGTH~i~s~~~ 59 (194)
T smart00457 30 LRDLPDQYNRGAYARFIDKYGTHYITSATL 59 (194)
T ss_pred HHhCccccCHHHHHHHHHHhCCeEEEeeee
Confidence 4688988999999999999996 555443
No 178
>PF15407 Spo7_2_N: Sporulation protein family 7
Probab=32.17 E-value=8.6 Score=19.90 Aligned_cols=22 Identities=9% Similarity=0.072 Sum_probs=15.9
Q ss_pred cCCCCCceEEEcCCCCCCCHHH
Q 034021 21 SDANSRTKLFIGGLSYDTNETV 42 (106)
Q Consensus 21 ~~~~~~~~l~v~~lp~~~~~~~ 42 (106)
.-....++++||++|..+-.+.
T Consensus 22 ~Ls~tSr~vflG~IP~~W~~~~ 43 (67)
T PF15407_consen 22 ELSLTSRRVFLGPIPEIWLQDH 43 (67)
T ss_pred HHHHcCceEEECCCChHHHHcC
Confidence 3344578899999998876544
No 179
>PF15063 TC1: Thyroid cancer protein 1
Probab=32.12 E-value=30 Score=18.40 Aligned_cols=28 Identities=18% Similarity=0.181 Sum_probs=22.9
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCe
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEI 53 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i 53 (106)
.++--+.||=.++...+|+++|..-|+.
T Consensus 25 ~RKkasaNIFe~vn~~qlqrLF~~sGD~ 52 (79)
T PF15063_consen 25 SRKKASANIFENVNLDQLQRLFQKSGDK 52 (79)
T ss_pred HhhhhhhhhhhccCHHHHHHHHHHccch
Confidence 3444577888999999999999998864
No 180
>COG3254 Uncharacterized conserved protein [Function unknown]
Probab=32.05 E-value=1e+02 Score=17.53 Aligned_cols=42 Identities=14% Similarity=0.225 Sum_probs=27.8
Q ss_pred HHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHH
Q 034021 41 TVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAI 85 (106)
Q Consensus 41 ~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al 85 (106)
.++...++..| |.+..|..+.. ...-||++++.|.+..-+.+
T Consensus 27 PE~~a~lk~ag-i~nYSIfLde~--~n~lFgy~E~~d~~a~m~~~ 68 (105)
T COG3254 27 PELLALLKEAG-IRNYSIFLDEE--ENLLFGYWEYEDFEADMAKM 68 (105)
T ss_pred HHHHHHHHHcC-CceeEEEecCC--cccEEEEEEEcChHHHHHHH
Confidence 45667777776 77777777642 34569999999555444333
No 181
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=30.47 E-value=13 Score=21.46 Aligned_cols=11 Identities=36% Similarity=0.812 Sum_probs=8.9
Q ss_pred CCceEEEcCCC
Q 034021 25 SRTKLFIGGLS 35 (106)
Q Consensus 25 ~~~~l~v~~lp 35 (106)
....+||+++|
T Consensus 91 ~~~~lyvGG~p 101 (131)
T PF00054_consen 91 VDGPLYVGGLP 101 (131)
T ss_dssp ECSEEEESSSS
T ss_pred cccCEEEccCC
Confidence 34569999999
No 182
>cd00027 BRCT Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Probab=30.01 E-value=71 Score=15.05 Aligned_cols=26 Identities=19% Similarity=0.198 Sum_probs=20.8
Q ss_pred ceEEEcCCCCCCCHHHHHHHhccCCC
Q 034021 27 TKLFIGGLSYDTNETVLKDAFGQHGE 52 (106)
Q Consensus 27 ~~l~v~~lp~~~~~~~l~~~f~~~g~ 52 (106)
..+++.+........++.+.+..+|.
T Consensus 2 ~~~~i~g~~~~~~~~~l~~~i~~~Gg 27 (72)
T cd00027 2 LTFVITGDLPSEERDELKELIEKLGG 27 (72)
T ss_pred CEEEEEecCCCcCHHHHHHHHHHcCC
Confidence 35777777667888999999999886
No 183
>PRK01178 rps24e 30S ribosomal protein S24e; Reviewed
Probab=29.46 E-value=1.1e+02 Score=17.13 Aligned_cols=47 Identities=17% Similarity=0.251 Sum_probs=25.1
Q ss_pred CCCHHHHHHHhc-cCCCeeEEEEeecC----CCCCceeEEEEEecCHHHHHHH
Q 034021 37 DTNETVLKDAFG-QHGEIIEVKVICDR----VTGQSRGYGFVKFASEAAAGEA 84 (106)
Q Consensus 37 ~~~~~~l~~~f~-~~g~i~~~~~~~~~----~~~~~~g~~fv~f~~~~~a~~a 84 (106)
+.+..+|++-+. .++.-...-++... ..+...|+|.| |++.+.+.+.
T Consensus 30 tpsr~eirekLa~~~~~~~~~vvv~~~~t~fG~g~s~G~a~I-Yds~e~~kk~ 81 (99)
T PRK01178 30 TPSRKDVRKKLAAMLNADKELVVVRKIKTEYGMGKSKGYAKV-YDDKERARKI 81 (99)
T ss_pred CCCHHHHHHHHHHHHCcCCCEEEEEccCccCCCceEEEEEEE-ECCHHHHHhh
Confidence 567777777765 35532222222222 23445666666 6777766543
No 184
>PRK10905 cell division protein DamX; Validated
Probab=29.34 E-value=2e+02 Score=19.97 Aligned_cols=68 Identities=19% Similarity=0.035 Sum_probs=41.4
Q ss_pred cccCCCCCceEEEcCCCCCCCHHHHHHHhccCCCeeEEEEeecCCCCC-ceeEEEEEecCHHHHHHHHHHhCC
Q 034021 19 LLSDANSRTKLFIGGLSYDTNETVLKDAFGQHGEIIEVKVICDRVTGQ-SRGYGFVKFASEAAAGEAIKEMDG 90 (106)
Q Consensus 19 ~~~~~~~~~~l~v~~lp~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~-~~g~~fv~f~~~~~a~~al~~l~~ 90 (106)
........++|.|..+.. .+.++.|..+.|- ....+.....+|+ -.-.-+-.|.+.++|++|+..|..
T Consensus 240 L~sapa~~YTLQL~A~Ss---~~~l~~fakKlgL-~~y~vy~TtRnGkpWYVV~yG~YaSraeAk~AiakLPa 308 (328)
T PRK10905 240 LKSAPSSHYTLQLSSSSN---YDNLNGWAKKENL-KNYVVYETTRNGQPWYVLVSGVYASKEEAKRAVSTLPA 308 (328)
T ss_pred HhcCCCCceEEEEEecCC---HHHHHHHHHHcCC-CceEEEEeccCCceEEEEEecCCCCHHHHHHHHHHCCH
Confidence 344555678888766554 4667777776653 3332333332444 222345578899999999998754
No 185
>KOG3424 consensus 40S ribosomal protein S24 [Translation, ribosomal structure and biogenesis]
Probab=28.70 E-value=1.3e+02 Score=17.63 Aligned_cols=46 Identities=15% Similarity=0.264 Sum_probs=26.3
Q ss_pred CCCCHHHHHHHhcc-CC---CeeEE-EEeecCCCCCceeEEEEEecCHHHHH
Q 034021 36 YDTNETVLKDAFGQ-HG---EIIEV-KVICDRVTGQSRGYGFVKFASEAAAG 82 (106)
Q Consensus 36 ~~~~~~~l~~~f~~-~g---~i~~~-~~~~~~~~~~~~g~~fv~f~~~~~a~ 82 (106)
..++.++|++-++. |- +...+ .+-.....|+..|+|.| |++.+.+.
T Consensus 33 a~vsK~EirEKla~mYkt~~d~V~vfgfrt~~GggkstgfalI-Ydsve~ak 83 (132)
T KOG3424|consen 33 ANVSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTGFALI-YDSVEYAK 83 (132)
T ss_pred CCCCHHHHHHHHHHHhcCCcceEEEEEeeeccCCcccceeeee-eehHHHHH
Confidence 34677778777654 32 22222 33333345667778877 66666655
No 186
>KOG0156 consensus Cytochrome P450 CYP2 subfamily [Secondary metabolites biosynthesis, transport and catabolism]
Probab=28.11 E-value=1.9e+02 Score=21.12 Aligned_cols=58 Identities=19% Similarity=0.186 Sum_probs=42.3
Q ss_pred EcCCCCCC---CHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCceeCCeEE
Q 034021 31 IGGLSYDT---NETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNI 98 (106)
Q Consensus 31 v~~lp~~~---~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l 98 (106)
||||+.-. ....+.++-.+||++-.+++-. .-.|...+.+.|+.++.. ++..+.+++.
T Consensus 37 IGnl~~l~~~~~h~~~~~ls~~yGpi~tl~lG~---------~~~Vviss~~~akE~l~~-~d~~fa~Rp~ 97 (489)
T KOG0156|consen 37 IGNLHQLGSLPPHRSFRKLSKKYGPVFTLRLGS---------VPVVVISSYEAAKEVLVK-QDLEFADRPD 97 (489)
T ss_pred cccHHHcCCCchhHHHHHHHHHhCCeEEEEecC---------ceEEEECCHHHHHHHHHh-CCccccCCCC
Confidence 67776443 4456677777899999877721 246778899999999995 7888877764
No 187
>PF14893 PNMA: PNMA
Probab=27.78 E-value=52 Score=22.69 Aligned_cols=26 Identities=19% Similarity=0.419 Sum_probs=21.0
Q ss_pred CCCceEEEcCCCCCCCHHHHHHHhcc
Q 034021 24 NSRTKLFIGGLSYDTNETVLKDAFGQ 49 (106)
Q Consensus 24 ~~~~~l~v~~lp~~~~~~~l~~~f~~ 49 (106)
...+.+-|.+||.++++.+|++.+..
T Consensus 16 ~~~r~lLv~giP~dc~~~ei~e~l~~ 41 (331)
T PF14893_consen 16 DPQRALLVLGIPEDCEEAEIEEALQA 41 (331)
T ss_pred ChhhhheeecCCCCCCHHHHHHHHHH
Confidence 34567889999999999998888653
No 188
>PF03108 DBD_Tnp_Mut: MuDR family transposase; InterPro: IPR004332 The plant MuDR transposase domain is present in plant proteins that are presumed to be the transposases for Mutator transposable elements [, ]. The function of these proteins is unknown. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=27.28 E-value=77 Score=15.79 Aligned_cols=15 Identities=33% Similarity=0.503 Sum_probs=12.5
Q ss_pred EecCHHHHHHHHHHh
Q 034021 74 KFASEAAAGEAIKEM 88 (106)
Q Consensus 74 ~f~~~~~a~~al~~l 88 (106)
.|.+.++.+.|+..+
T Consensus 9 ~F~~~~e~k~av~~y 23 (67)
T PF03108_consen 9 TFPSKEEFKEAVREY 23 (67)
T ss_pred EECCHHHHHHHHHHH
Confidence 588999999999854
No 189
>PF13046 DUF3906: Protein of unknown function (DUF3906)
Probab=26.60 E-value=1e+02 Score=15.83 Aligned_cols=33 Identities=27% Similarity=0.349 Sum_probs=22.1
Q ss_pred CHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEE
Q 034021 39 NETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFV 73 (106)
Q Consensus 39 ~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv 73 (106)
-+..+.+.|-+-..+..+.+.-.+.- .+|-|||
T Consensus 31 ~e~eler~fl~~P~v~e~~l~EKKri--~~G~gyV 63 (64)
T PF13046_consen 31 VEVELERHFLPLPEVKEVALYEKKRI--RKGAGYV 63 (64)
T ss_pred HHHHhhhhccCCCCceEEEEEEEEee--eCCceeE
Confidence 34667888877778899888766532 3455555
No 190
>KOG1134 consensus Uncharacterized conserved protein [General function prediction only]
Probab=26.34 E-value=1.1e+02 Score=23.54 Aligned_cols=35 Identities=20% Similarity=0.113 Sum_probs=25.2
Q ss_pred CceeEEEEEecCHHHHHHHHHHhCCceeCCeEEEE
Q 034021 66 QSRGYGFVKFASEAAAGEAIKEMDGLLLDGRNIRV 100 (106)
Q Consensus 66 ~~~g~~fv~f~~~~~a~~al~~l~~~~~~g~~l~v 100 (106)
+..+.|||.|++...|+.|.+..+...........
T Consensus 303 ~~~~~aFVtf~sr~~A~~~aq~~~~~~~~~w~~~~ 337 (728)
T KOG1134|consen 303 KPLPAAFVTFKSRYGAAVAAQTQQSLNPTKWLTEF 337 (728)
T ss_pred CCCceEEEEEEeeHHHHHHHHhhhcCCCCceEEEe
Confidence 34568999999999999999865555444444433
No 191
>COG0030 KsgA Dimethyladenosine transferase (rRNA methylation) [Translation, ribosomal structure and biogenesis]
Probab=25.18 E-value=1.7e+02 Score=19.51 Aligned_cols=28 Identities=14% Similarity=0.230 Sum_probs=21.2
Q ss_pred CceEEEcCCCCCCCHHHHHHHhccCCCe
Q 034021 26 RTKLFIGGLSYDTNETVLKDAFGQHGEI 53 (106)
Q Consensus 26 ~~~l~v~~lp~~~~~~~l~~~f~~~g~i 53 (106)
.....|+|||++++..-+.+++...-.+
T Consensus 95 ~~~~vVaNlPY~Isspii~kll~~~~~~ 122 (259)
T COG0030 95 QPYKVVANLPYNISSPILFKLLEEKFII 122 (259)
T ss_pred CCCEEEEcCCCcccHHHHHHHHhccCcc
Confidence 3456689999999999888888654333
No 192
>PF01823 MACPF: MAC/Perforin domain; InterPro: IPR020864 The membrane attack complex/perforin (MACPF) domain is conserved in bacteria, fungi, mammals and plants. It was originally identified and named as being common to five complement components (C6, C7, C8-alpha, C8-beta, and C9) and perforin. These molecules perform critical functions in innate and adaptive immunity. The MAC family proteins and perforin are known to participate in lytic pore formation. In response to pathogen infection, a sequential and highly specific interaction between the constituent elements occurs to form transmembrane channels which are known as the membrane-attack complex (MAC).Only a few other MACPF proteins have been characterised and several are thought to form pores for invasion or protection [, , ]. Examples are proteins from malarial parasites [], the cytolytic toxins from sea anemones [], and proteins that provide plant immunity [, ]. Functionally uncharacterised MACPF proteins are also evident in pathogenic bacteria such as Chlamydia spp [] and Photorhabdus luminescens (Xenorhabdus luminescens) []. The MACPF domain is commonly found to be associated with other N- and C-terminal domains, such as TSP1 (see PDOC50092 from PROSITEDOC), LDLRA (see PDOC00929 from PROSITEDOC), EGF-like (see PDOC00021 from PROSITEDOC),Sushi/CCP/SCR (see PDOC50923 from PROSITEDOC), FIMAC or C2 (see PDOC00380 from PROSITEDOC). They probably control or target MACPF function [, ]. The MACPF domain oligomerizes, undergoes conformational change, and is required for lytic activity. The MACPF domain consists of a central kinked four-stranded antiparallel beta sheet surrounded by alpha helices and beta strands, forming two structural segments. Overall, the MACPF domain has a thin L-shaped appearance. MACPF domains exhibit limited sequence similarity but contain a signature [YW]-G-[TS]-H-[FY]-x(6)-G-G motif [, , ]. Some proteins known to contain a MACPF domain are listed below: Vertebrate complement proteins C6 to C9. Complement factors C6 to C9 assemble to form a scaffold, the membrane attack complex (MAC), that permits C9 polymerisation into pores that lyse Gram-negative pathogens [, ]. Vertebrate perforin. It is delivered by natural killer cells and cytotoxic T lymphocytes and forms oligomeric pores (12 to 18 monomers) in the plasma membrane of either virus-infected or transformed cells. Arabidopsis thaliana (Mouse-ear cress) constitutively activated cell death 1 (CAD1) protein. It is likely to act as a mediator that recognises plant signals for pathogen infection []. Arabidopsis thaliana (Mouse-ear cress) necrotic spotted lesions 1 (NSL1) protein []. Venomous sea anemone Phyllodiscus semoni (Night anemone) toxins PsTX-60A and PsTX-60B []. Venomous sea anemone Actineria villosa (Okinawan sea anemone) toxin AvTX-60A []. Plasmodium sporozoite microneme protein essential for cell traversal 2 (SPECT2). It is essential for the membrane-wounding activity of the sporozoite and is involved in its traversal of the sinusoidal cell layer prior to hepatocyte-infection []. P. luminescens Plu-MACPF. Although nonlytic, it was shown to bind to cell membranes []. Chlamydial putative uncharacterised protein CT153 []. ; PDB: 2QP2_A 3OJY_B 3NSJ_A 4E0S_B 3T5O_A 4A5W_B 2QQH_A 2RD7_A.
Probab=24.73 E-value=46 Score=20.51 Aligned_cols=23 Identities=17% Similarity=0.212 Sum_probs=16.2
Q ss_pred EcCCCCCCCHHH---HHHHhccCCCe
Q 034021 31 IGGLSYDTNETV---LKDAFGQHGEI 53 (106)
Q Consensus 31 v~~lp~~~~~~~---l~~~f~~~g~i 53 (106)
+..||...+..+ ..++|..||.-
T Consensus 53 l~~L~~~~~~~~~~~y~~f~~~yGTH 78 (212)
T PF01823_consen 53 LNALPAEYNSDNTDEYYRFFDKYGTH 78 (212)
T ss_dssp HHTSHSS--HHHHHHHHHHHHHH-SE
T ss_pred HHhhCcccCccchHHHHHHHHHhCcE
Confidence 567888888777 78899999973
No 193
>PF12007 DUF3501: Protein of unknown function (DUF3501); InterPro: IPR021890 This family of proteins is functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are about 200 amino acids in length. The structure of protein of unknown function (YP_111841.1) from B. pseudomallei has been solved. ; PDB: 3FJV_B.
Probab=24.61 E-value=2e+02 Score=18.34 Aligned_cols=47 Identities=11% Similarity=0.222 Sum_probs=29.9
Q ss_pred CHHHHHHHhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce
Q 034021 39 NETVLKDAFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 39 ~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
.+++|++-...|.+.. +..+.-+..-||+|.+.......+..|.|..
T Consensus 65 ~~~~I~~Ei~aYnpLi-------P~~~~l~ATl~IE~~d~~~r~~~L~~L~Gie 111 (192)
T PF12007_consen 65 DEEGIQEEIDAYNPLI-------PDGGNLKATLMIEIPDEDERRRELARLVGIE 111 (192)
T ss_dssp SHHHHHHHHHHHGGGS---------SSEEEEEEEE--SSHHHHHHHHHHCTTGG
T ss_pred CHHHHHHHHHHhcccC-------CCCCcEEEEEEEEcCCHHHHHHHHHHhcCcc
Confidence 3445666666665432 1123335578999999999999999998884
No 194
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=24.35 E-value=78 Score=13.98 Aligned_cols=16 Identities=25% Similarity=0.289 Sum_probs=13.0
Q ss_pred CCCHHHHHHHhccCCC
Q 034021 37 DTNETVLKDAFGQHGE 52 (106)
Q Consensus 37 ~~~~~~l~~~f~~~g~ 52 (106)
+.+.++|+..+..+|-
T Consensus 3 tWs~~~L~~wL~~~gi 18 (38)
T PF10281_consen 3 TWSDSDLKSWLKSHGI 18 (38)
T ss_pred CCCHHHHHHHHHHcCC
Confidence 4678899999998873
No 195
>PF09902 DUF2129: Uncharacterized protein conserved in bacteria (DUF2129); InterPro: IPR016979 This is a group of uncharacterised conserved proteins.
Probab=23.90 E-value=1.2e+02 Score=15.82 Aligned_cols=38 Identities=21% Similarity=0.353 Sum_probs=25.4
Q ss_pred HhccCCCeeEEEEeecCCCCCceeEEEEEecCHHHHHHHHHHhCCce
Q 034021 46 AFGQHGEIIEVKVICDRVTGQSRGYGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 46 ~f~~~g~i~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
-+..||.|..+. .+ ..|+.+ |-+.++++..+..+....
T Consensus 16 ~L~kfG~i~Y~S-------kk-~kYvvl-Yvn~~~~e~~~~kl~~l~ 53 (71)
T PF09902_consen 16 QLRKFGDIHYVS-------KK-MKYVVL-YVNEEDVEEIIEKLKKLK 53 (71)
T ss_pred hHhhcccEEEEE-------CC-ccEEEE-EECHHHHHHHHHHHhcCC
Confidence 456899887662 11 225544 788999998888776553
No 196
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=23.33 E-value=1.2e+02 Score=15.48 Aligned_cols=23 Identities=26% Similarity=0.180 Sum_probs=18.2
Q ss_pred EEEEEecCHHHHHHHHHHhCCce
Q 034021 70 YGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 70 ~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
+.++.|.+..+|-++-+.|...-
T Consensus 3 ~~~i~F~st~~a~~~ek~lk~~g 25 (73)
T PF11823_consen 3 YYLITFPSTHDAMKAEKLLKKNG 25 (73)
T ss_pred eEEEEECCHHHHHHHHHHHHHCC
Confidence 67899999999988888665443
No 197
>PF12623 Hen1_L: RNA repair, ligase-Pnkp-associating, region of Hen1; InterPro: IPR024740 This entry represents the N-terminal domain of the bacterial Hen1 protein, which is a 3' terminal RNA ribose 2'-O-methyltransferase that is involved in bacterial RNA repair []. Hen1 forms a heterotetramer with Pnkp. The heterotetramer has been shown to repair transfer RNAs cleaved by ribotoxins in vitro []. The N-terminal domain of Hen1 contributes to the ligase activity of the heterotetramer.
Probab=22.47 E-value=2.4e+02 Score=18.63 Aligned_cols=64 Identities=13% Similarity=0.168 Sum_probs=42.1
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhccCC-CeeEEEEeecCC--CCCceeEEEEEecCHHHHHHHHHHh
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFGQHG-EIIEVKVICDRV--TGQSRGYGFVKFASEAAAGEAIKEM 88 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~~~g-~i~~~~~~~~~~--~~~~~g~~fv~f~~~~~a~~al~~l 88 (106)
-.-++.|.-+|-.-.++-++++|++.| .|..-.+..+.. ......|..+...-..-.+.++..|
T Consensus 117 ~pL~v~~p~lp~rGg~~l~~rLFePLGw~V~a~~~~Ld~~fP~WG~S~y~~l~L~g~~rl~daL~HL 183 (245)
T PF12623_consen 117 IPLEVRLPALPCRGGEELVRRLFEPLGWTVTAEPVPLDEQFPEWGDSRYVDLTLTGTVRLADALNHL 183 (245)
T ss_pred CceEEEeeeeecCCcHHHHHHhhcCcCceEEeEeccCCccCccccCCcceEEEEeeeEEHHHHHhhh
Confidence 345677888888888999999999998 454444544442 1112236677777666666666643
No 198
>PF06613 KorB_C: KorB C-terminal beta-barrel domain; InterPro: IPR010575 This domain is found in several KorB transcriptional repressor proteins. The korB gene is a major regulatory element in the replication and maintenance of broad host-range plasmid RK2. It negatively controls the replication gene trfA, the host-lethal determinants kilA and kilB, and the korA-korB operon []. This family is found in conjunction with IPR003115 from INTERPRO.; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 1IGQ_B 1IGU_A.
Probab=22.12 E-value=1.3e+02 Score=15.28 Aligned_cols=12 Identities=25% Similarity=0.706 Sum_probs=9.6
Q ss_pred CceeEEEEEecC
Q 034021 66 QSRGYGFVKFAS 77 (106)
Q Consensus 66 ~~~g~~fv~f~~ 77 (106)
...|++++.|++
T Consensus 29 s~~G~~WiKyED 40 (60)
T PF06613_consen 29 SSEGLAWIKYED 40 (60)
T ss_dssp SSTTEEEEEETT
T ss_pred CcCCeEEEEEcc
Confidence 346899999987
No 199
>PF13773 DUF4170: Domain of unknown function (DUF4170)
Probab=22.11 E-value=1.4e+02 Score=15.63 Aligned_cols=31 Identities=19% Similarity=0.054 Sum_probs=23.6
Q ss_pred EEEEecCHHHHHHHHHHhCCceeCCeEEEEE
Q 034021 71 GFVKFASEAAAGEAIKEMDGLLLDGRNIRVN 101 (106)
Q Consensus 71 ~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~ 101 (106)
-.--|.+.++|.+|++..-+..+++-.++-.
T Consensus 28 iVG~fp~y~~A~~aWrakAq~TVDnA~mRYf 58 (69)
T PF13773_consen 28 IVGIFPDYASAYAAWRAKAQRTVDNAHMRYF 58 (69)
T ss_pred EEecCCChHHHHHHHHHHHhCchhcceeeEE
Confidence 4456888999999999888888877655543
No 200
>PF00398 RrnaAD: Ribosomal RNA adenine dimethylase; InterPro: IPR001737 This family of proteins include rRNA adenine dimethylases (e.g. KsgA) and the Erythromycin resistance methylases (Erm). The bacterial enzyme KsgA catalyses the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases in 16S rRNA. This enzyme and the resulting modified adenosine bases appear to be conserved in all species of eubacteria, eukaryotes, and archaea, and in eukaryotic organelles. Bacterial resistance to the aminoglycoside antibiotic kasugamycin involves inactivation of KsgA and resulting loss of the dimethylations, with modest consequences to the overall fitness of the organism. In contrast, the yeast ortholog, Dim1, is essential. In Saccharomyces cerevisiae (Baker's yeast), and presumably in other eukaryotes, the enzyme performs a vital role in pre-rRNA processing in addition to its methylating activity. The best conserved region in these enzymes is located in the N-terminal section and corresponds to a region that is probably involved in S-adenosyl methionine (SAM) binding domain. The crystal structure of KsgA from Escherichia coli has been solved to a resolution of 2.1A. It bears a strong similarity to the crystal structure of ErmC' from Bacillus stearothermophilus and a lesser similarity to the yeast mitochondrial transcription factor, sc-mtTFB []. The Erm family of RNA methyltransferases, which methylate a single adenosine base in 23S rRNA confer resistance to the MLS-B group of antibiotics. Despite their sequence similarity, the two enzyme families have strikingly different levels of regulation that remain to be elucidated. Other orthologs, of this family include the yeast and Homo sapiens (Human) mitochondrial transcription factors (MTF1 and h-mtTFB respectively), which are nuclear encoded []. Human-mtTFB is able to stimulate transcription in vitro independently of its S-adenosylmethionine binding and rRNA methyltransferase activity [].; GO: 0000179 rRNA (adenine-N6,N6-)-dimethyltransferase activity, 0008649 rRNA methyltransferase activity, 0000154 rRNA modification; PDB: 3FTF_A 3R9X_B 3FTE_A 3FTC_A 3FTD_A 3GRY_A 3FYC_A 3GRU_A 3FYD_A 3GRV_A ....
Probab=21.92 E-value=1.2e+02 Score=19.73 Aligned_cols=24 Identities=21% Similarity=0.555 Sum_probs=20.1
Q ss_pred CCceEEEcCCCCCCCHHHHHHHhc
Q 034021 25 SRTKLFIGGLSYDTNETVLKDAFG 48 (106)
Q Consensus 25 ~~~~l~v~~lp~~~~~~~l~~~f~ 48 (106)
....+.|+|||...+..-|.+++.
T Consensus 96 ~~~~~vv~NlPy~is~~il~~ll~ 119 (262)
T PF00398_consen 96 NQPLLVVGNLPYNISSPILRKLLE 119 (262)
T ss_dssp SSEEEEEEEETGTGHHHHHHHHHH
T ss_pred CCceEEEEEecccchHHHHHHHhh
Confidence 355678999999999988888886
No 201
>COG5594 Uncharacterized integral membrane protein [Function unknown]
Probab=21.45 E-value=1.3e+02 Score=23.64 Aligned_cols=36 Identities=25% Similarity=0.298 Sum_probs=24.2
Q ss_pred eeEEEEEecCHHHHHHHHHHhC-CceeCCeEEEEEec
Q 034021 68 RGYGFVKFASEAAAGEAIKEMD-GLLLDGRNIRVNYA 103 (106)
Q Consensus 68 ~g~~fv~f~~~~~a~~al~~l~-~~~~~g~~l~v~~s 103 (106)
.+.+||+|++.-.|+.+.+..- ....++.++.+.++
T Consensus 357 ~~~~FItFkSq~~Aq~~aQ~~~~sr~~~~~~v~iapa 393 (827)
T COG5594 357 TKSGFITFKSQASAQIAAQSQIYSRVLGKLKVEIAPA 393 (827)
T ss_pred cccEEEEEehhHHHHHHHHhhhhhhhhcceeeeecCC
Confidence 4589999999999999998532 22344444555443
No 202
>TIGR00755 ksgA dimethyladenosine transferase. Alternate name: S-adenosylmethionine--6-N',N'-adenosyl (rRNA) dimethyltransferase
Probab=21.11 E-value=1.8e+02 Score=18.80 Aligned_cols=23 Identities=9% Similarity=0.263 Sum_probs=19.2
Q ss_pred EEEcCCCCCCCHHHHHHHhccCC
Q 034021 29 LFIGGLSYDTNETVLKDAFGQHG 51 (106)
Q Consensus 29 l~v~~lp~~~~~~~l~~~f~~~g 51 (106)
+.|+|+|++++..-+.+++...|
T Consensus 97 ~vvsNlPy~i~~~il~~ll~~~~ 119 (253)
T TIGR00755 97 KVVSNLPYNISSPLIFKLLEKPK 119 (253)
T ss_pred eEEEcCChhhHHHHHHHHhccCC
Confidence 67899999999999999986443
No 203
>PF05573 NosL: NosL; InterPro: IPR008719 NosL is one of the accessory proteins of the nos (nitrous oxide reductase) gene cluster. NosL is a monomeric protein of 18,540 MW that specifically and stoichiometrically binds Cu(I). The copper ion in NosL is ligated by a Cys residue, and one Met and one His are thought to serve as the other ligands. It is possible that NosL is a copper chaperone involved in metallocentre assembly []. This entry also contains HTH-type transcriptional repressors, including YcnK. YcnK may act as a negative transcriptional regulator of YcnJ in the presence of copper and may use copper as a corepressor. The gene, ycnK, is significantly induced under copper-limiting conditions.; PDB: 2HQ3_A 2HPU_A.
Probab=21.03 E-value=2e+02 Score=17.12 Aligned_cols=22 Identities=32% Similarity=0.353 Sum_probs=16.0
Q ss_pred eeEEEEEecCHHHHHHHHHHhC
Q 034021 68 RGYGFVKFASEAAAGEAIKEMD 89 (106)
Q Consensus 68 ~g~~fv~f~~~~~a~~al~~l~ 89 (106)
-|..++-|.+.++|++..+...
T Consensus 114 Mg~~~~aF~~~~~A~~F~~~~G 135 (149)
T PF05573_consen 114 MGPDLIAFASKEDAEAFAKEHG 135 (149)
T ss_dssp TS--EEEES-HHHHHHHHHHTE
T ss_pred CCCcccccCCHHHHHHHHHHcC
Confidence 3578999999999999999743
No 204
>PHA01627 DNA binding protein
Probab=20.77 E-value=1.8e+02 Score=16.60 Aligned_cols=35 Identities=17% Similarity=0.282 Sum_probs=26.6
Q ss_pred EEEEecCHHHHHHHHHHhCCceeCCeEEEEEeccC
Q 034021 71 GFVKFASEAAAGEAIKEMDGLLLDGRNIRVNYAHK 105 (106)
Q Consensus 71 ~fv~f~~~~~a~~al~~l~~~~~~g~~l~v~~s~~ 105 (106)
.||..--.++....+..+.|..+.-+++.+.....
T Consensus 39 ~~vSaIGH~sTA~lls~llg~~ip~NRi~i~~~~G 73 (107)
T PHA01627 39 EFVSAIGHDATANLLSNLCGVNLPKNRIEIKLDKG 73 (107)
T ss_pred CeEEeeccHHHHHHHHHHhCccccccceEEEecCC
Confidence 46666668888889999999988877777766543
No 205
>PRK08559 nusG transcription antitermination protein NusG; Validated
Probab=20.72 E-value=1.5e+02 Score=17.76 Aligned_cols=26 Identities=31% Similarity=0.431 Sum_probs=21.3
Q ss_pred ceeEEEEEecCHHHHHHHHHHhCCce
Q 034021 67 SRGYGFVKFASEAAAGEAIKEMDGLL 92 (106)
Q Consensus 67 ~~g~~fv~f~~~~~a~~al~~l~~~~ 92 (106)
.+||.||+....+++..+++.+.+..
T Consensus 45 fpGYVfVe~~~~~~~~~~i~~v~~v~ 70 (153)
T PRK08559 45 LKGYVLVEAESKGAVEEAIRGIPHVR 70 (153)
T ss_pred CCcEEEEEEEChHHHHHHHhcCCCEe
Confidence 57899999998888888888776654
No 206
>PF05189 RTC_insert: RNA 3'-terminal phosphate cyclase (RTC), insert domain; InterPro: IPR013796 RNA cyclases are a family of RNA-modifying enzymes that are conserved in eukaryotes, bacteria and archaea. RNA 3'-terminal phosphate cyclase (6.5.1.4 from EC) [, ] catalyses the conversion of 3'-phosphate to a 2',3'-cyclic phosphodiester at the end of RNA. ATP + RNA 3'-terminal-phosphate = AMP + diphosphate + RNA terminal-2',3'-cyclic-phosphate These enzymes might be responsible for production of the cyclic phosphate RNA ends that are known to be required by many RNA ligases in both prokaryotes and eukaryotes. RNA cyclase is a protein of from 36 to 42 kDa. The best conserved region is a glycine-rich stretch of residues located in the central part of the sequence and which is reminiscent of various ATP, GTP or AMP glycine-rich loops. The crystal structure of RNA 3'-terminal phosphate cyclase shows that each molecule consists of two domains. The larger domain contains three repeats of a folding unit comprising two parallel alpha helices and a four-stranded beta sheet; this fold was previously identified in translation initiation factor 3 (IF3). The large domain is similar to one of the two domains of 5-enolpyruvylshikimate-3-phosphate synthase and UDP-N-acetylglucosamine enolpyruvyl transferase. The smaller domain uses a similar secondary structure element with different topology, observed in many other proteins such as thioredoxin []. Although the active site of this enzyme could not be unambiguously assigned, it can be mapped to a region surrounding His309, an adenylate acceptor, in which a number of amino acids are highly conserved in the enzyme from different sources []. This entry contains the insert-domain of approximately 100 amino acids.; PDB: 3TV1_B 3KGD_D 1QMI_D 1QMH_B 3TUX_A 3TUT_A 3TW3_A 3PQV_A.
Probab=20.61 E-value=1.7e+02 Score=16.08 Aligned_cols=46 Identities=17% Similarity=0.262 Sum_probs=24.0
Q ss_pred eEEEcCCCCCCCHHHH---HHHhccCCCeeEEEE--eecCCCCCceeEEEE
Q 034021 28 KLFIGGLSYDTNETVL---KDAFGQHGEIIEVKV--ICDRVTGQSRGYGFV 73 (106)
Q Consensus 28 ~l~v~~lp~~~~~~~l---~~~f~~~g~i~~~~~--~~~~~~~~~~g~~fv 73 (106)
..++.+||.++.+.++ ++.+..+++-..+.. ........+.|++.+
T Consensus 12 ~a~~a~lp~~va~R~~~~a~~~L~~~~~~v~i~~d~~~~~~~~~~~G~gi~ 62 (103)
T PF05189_consen 12 IAFVAGLPSSVAERMANAARKRLNWYGPDVEIETDYRESDDSAFGPGSGIS 62 (103)
T ss_dssp EEEEESS-CHHHHHHHHHHHHHHCTTCSEEEEEEEEE-CCCCGCSSEEEEE
T ss_pred EEEEcCCCHHHHHHHHHHHHHHhhhhCCCeEEEEecccCccCCCCCceEEE
Confidence 4578899999887654 444555554344433 112223444555544
No 207
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=20.13 E-value=1.2e+02 Score=14.04 Aligned_cols=19 Identities=21% Similarity=0.366 Sum_probs=15.0
Q ss_pred cCCCCCCCHHHHHHHhccC
Q 034021 32 GGLSYDTNETVLKDAFGQH 50 (106)
Q Consensus 32 ~~lp~~~~~~~l~~~f~~~ 50 (106)
=+++.+.+.++|++.|...
T Consensus 6 Lgl~~~~~~~~ik~~y~~l 24 (55)
T cd06257 6 LGVPPDASDEEIKKAYRKL 24 (55)
T ss_pred cCCCCCCCHHHHHHHHHHH
Confidence 3678889999998888653
Done!