Query psy3346
Match_columns 79
No_of_seqs 166 out of 1624
Neff 9.9
Searched_HMMs 46136
Date Fri Aug 16 22:39:30 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3346.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3346hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03134 glycine-rich RNA-bind 99.8 1.3E-18 2.9E-23 97.0 7.5 62 17-79 31-92 (144)
2 KOG0149|consensus 99.8 1.1E-18 2.4E-23 102.3 6.2 59 20-79 12-70 (247)
3 TIGR01659 sex-lethal sex-letha 99.7 3.3E-17 7.2E-22 102.0 8.1 64 15-79 102-165 (346)
4 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.7 2.6E-17 5.6E-22 102.0 7.5 60 19-79 2-61 (352)
5 PF00076 RRM_1: RNA recognitio 99.7 3.5E-17 7.5E-22 80.6 6.3 55 23-79 1-55 (70)
6 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.7 4.6E-17 1E-21 100.9 8.1 60 19-79 268-327 (352)
7 KOG0122|consensus 99.7 1.3E-16 2.9E-21 94.1 6.7 63 16-79 185-247 (270)
8 PF14259 RRM_6: RNA recognitio 99.7 6.4E-16 1.4E-20 76.4 6.1 55 23-79 1-55 (70)
9 TIGR01659 sex-lethal sex-letha 99.6 7.3E-16 1.6E-20 96.1 7.4 60 19-79 192-251 (346)
10 TIGR01645 half-pint poly-U bin 99.6 7.2E-16 1.6E-20 101.2 6.9 61 18-79 105-165 (612)
11 TIGR01648 hnRNP-R-Q heterogene 99.6 1.3E-15 2.7E-20 99.8 7.7 60 18-79 56-115 (578)
12 TIGR01622 SF-CC1 splicing fact 99.6 1.8E-15 3.9E-20 96.7 8.1 62 17-79 86-147 (457)
13 KOG0121|consensus 99.6 5.3E-16 1.2E-20 84.2 4.8 62 17-79 33-94 (153)
14 PLN03120 nucleic acid binding 99.6 1.9E-15 4.2E-20 90.5 7.0 56 20-79 4-59 (260)
15 KOG0113|consensus 99.6 1.9E-15 4.2E-20 91.3 6.6 61 17-78 98-158 (335)
16 PLN03121 nucleic acid binding 99.6 4.2E-15 9.2E-20 88.1 7.1 58 18-79 3-60 (243)
17 KOG0125|consensus 99.6 2.1E-15 4.6E-20 92.2 5.9 63 14-79 90-152 (376)
18 TIGR01628 PABP-1234 polyadenyl 99.6 3.6E-15 7.7E-20 97.5 7.2 57 22-79 2-58 (562)
19 TIGR01645 half-pint poly-U bin 99.6 5.5E-15 1.2E-19 97.2 7.2 60 19-79 203-262 (612)
20 TIGR01642 U2AF_lg U2 snRNP aux 99.6 1.6E-14 3.5E-19 93.3 8.2 61 18-79 293-353 (509)
21 TIGR01628 PABP-1234 polyadenyl 99.5 2.6E-14 5.6E-19 93.5 7.2 61 17-79 282-342 (562)
22 TIGR01622 SF-CC1 splicing fact 99.5 3.5E-14 7.7E-19 90.8 7.2 59 20-79 186-244 (457)
23 KOG0144|consensus 99.5 2.1E-14 4.5E-19 90.5 5.7 62 17-79 31-92 (510)
24 KOG0117|consensus 99.5 3.1E-14 6.7E-19 89.9 6.3 62 17-79 80-141 (506)
25 KOG0127|consensus 99.5 4.2E-14 9.1E-19 91.3 6.6 61 18-79 290-350 (678)
26 COG0724 RNA-binding proteins ( 99.5 6.2E-14 1.3E-18 82.8 6.9 59 20-79 115-173 (306)
27 smart00362 RRM_2 RNA recogniti 99.5 9.3E-14 2E-18 67.7 6.4 55 22-79 1-55 (72)
28 KOG0114|consensus 99.5 8.2E-14 1.8E-18 73.3 6.2 61 15-79 13-73 (124)
29 KOG0148|consensus 99.5 3E-14 6.5E-19 85.5 4.9 66 13-79 55-120 (321)
30 KOG0107|consensus 99.5 6.5E-14 1.4E-18 79.5 5.0 56 18-79 8-63 (195)
31 KOG0126|consensus 99.5 7.7E-15 1.7E-19 83.8 0.9 57 18-75 33-89 (219)
32 KOG0131|consensus 99.5 8.8E-14 1.9E-18 79.4 4.8 62 16-78 5-66 (203)
33 smart00360 RRM RNA recognition 99.5 2.4E-13 5.2E-18 66.0 5.7 54 25-79 1-54 (71)
34 PLN03213 repressor of silencin 99.4 3.3E-13 7.3E-18 86.7 6.6 52 17-73 7-58 (759)
35 KOG4207|consensus 99.4 2.1E-13 4.6E-18 79.3 5.1 65 14-79 7-71 (256)
36 cd00590 RRM RRM (RNA recogniti 99.4 8.7E-13 1.9E-17 64.5 6.7 56 22-79 1-56 (74)
37 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.4 4.3E-13 9.3E-18 86.7 6.6 53 20-79 2-54 (481)
38 KOG0130|consensus 99.4 2.4E-13 5.3E-18 74.6 4.6 67 12-79 64-130 (170)
39 KOG0148|consensus 99.4 5E-13 1.1E-17 80.3 5.8 56 17-79 161-216 (321)
40 KOG0144|consensus 99.4 8.1E-14 1.8E-18 87.9 2.5 60 18-79 122-181 (510)
41 KOG0105|consensus 99.4 7.4E-13 1.6E-17 76.1 5.1 58 18-79 4-61 (241)
42 KOG0145|consensus 99.4 9E-13 2E-17 79.1 5.3 61 18-79 39-99 (360)
43 KOG0108|consensus 99.4 1.3E-12 2.8E-17 83.4 5.2 58 21-79 19-76 (435)
44 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.3 4.7E-12 1E-16 81.9 7.2 56 18-79 273-329 (481)
45 KOG0111|consensus 99.3 5.4E-13 1.2E-17 78.4 2.4 62 17-79 7-68 (298)
46 KOG0124|consensus 99.3 1.2E-12 2.7E-17 81.6 2.7 58 20-78 113-170 (544)
47 KOG4205|consensus 99.3 6.2E-12 1.4E-16 77.5 4.6 59 19-78 5-63 (311)
48 KOG0127|consensus 99.3 7.3E-12 1.6E-16 81.1 5.0 60 19-79 4-63 (678)
49 KOG0146|consensus 99.3 4.4E-12 9.5E-17 76.5 3.4 61 17-79 16-76 (371)
50 TIGR01648 hnRNP-R-Q heterogene 99.3 2.3E-11 5E-16 80.1 6.9 52 19-79 232-285 (578)
51 TIGR01642 U2AF_lg U2 snRNP aux 99.3 3.4E-11 7.4E-16 77.9 7.5 56 17-79 172-239 (509)
52 KOG0146|consensus 99.2 1.3E-11 2.8E-16 74.5 4.8 64 15-79 280-343 (371)
53 KOG0415|consensus 99.2 1.3E-11 2.8E-16 76.8 4.7 62 17-79 236-297 (479)
54 KOG0145|consensus 99.2 7.7E-12 1.7E-16 75.2 3.5 63 16-79 123-185 (360)
55 KOG0147|consensus 99.2 4.9E-12 1.1E-16 81.5 2.8 57 22-79 280-336 (549)
56 KOG4212|consensus 99.2 6.4E-11 1.4E-15 75.4 6.4 58 20-79 44-102 (608)
57 KOG0123|consensus 99.2 7.8E-11 1.7E-15 74.3 6.2 58 18-79 74-131 (369)
58 KOG0117|consensus 99.2 9.8E-11 2.1E-15 74.4 6.1 62 17-78 161-223 (506)
59 KOG0110|consensus 99.1 4.1E-11 8.9E-16 79.2 3.6 61 18-79 611-671 (725)
60 KOG4208|consensus 99.1 2E-10 4.4E-15 66.7 5.8 59 19-78 48-107 (214)
61 KOG0124|consensus 99.1 2.2E-10 4.8E-15 71.8 5.6 57 20-77 210-266 (544)
62 smart00361 RRM_1 RNA recogniti 99.1 3.2E-10 7E-15 56.2 4.8 45 34-79 2-53 (70)
63 KOG0116|consensus 99.1 3.5E-10 7.6E-15 72.1 5.5 61 18-79 286-346 (419)
64 KOG4205|consensus 99.0 7.6E-10 1.6E-14 68.4 5.5 58 20-78 97-154 (311)
65 KOG0153|consensus 99.0 8E-10 1.7E-14 68.5 4.9 59 14-79 222-280 (377)
66 PF04059 RRM_2: RNA recognitio 99.0 5.6E-09 1.2E-13 54.8 6.7 56 21-77 2-59 (97)
67 KOG0123|consensus 99.0 1.6E-09 3.5E-14 68.4 5.2 61 17-79 267-327 (369)
68 KOG0132|consensus 98.9 1.7E-09 3.7E-14 72.5 4.8 53 20-79 421-473 (894)
69 KOG0109|consensus 98.9 1.1E-09 2.3E-14 66.8 3.5 48 21-77 3-50 (346)
70 KOG4211|consensus 98.9 5E-09 1.1E-13 67.3 6.3 59 17-79 7-65 (510)
71 KOG4206|consensus 98.9 8.5E-09 1.8E-13 60.7 6.0 55 20-78 9-67 (221)
72 KOG0533|consensus 98.9 9.2E-09 2E-13 61.7 5.6 59 19-79 82-140 (243)
73 KOG0131|consensus 98.8 5.3E-09 1.1E-13 60.1 3.7 62 17-79 93-155 (203)
74 KOG4661|consensus 98.8 1.1E-08 2.3E-13 67.4 5.1 60 19-79 404-463 (940)
75 PF13893 RRM_5: RNA recognitio 98.8 1.4E-08 3E-13 48.1 3.9 37 37-79 1-37 (56)
76 KOG0110|consensus 98.8 1.5E-08 3.2E-13 67.5 4.7 60 20-79 515-576 (725)
77 KOG4660|consensus 98.8 1.7E-08 3.8E-13 65.6 4.8 59 15-79 70-128 (549)
78 KOG4211|consensus 98.7 6.1E-08 1.3E-12 62.5 6.4 60 18-79 101-161 (510)
79 KOG4209|consensus 98.7 1.8E-08 3.9E-13 60.2 3.8 64 15-79 96-159 (231)
80 KOG4212|consensus 98.7 2.3E-08 4.9E-13 64.1 4.2 58 16-79 532-589 (608)
81 KOG1548|consensus 98.6 1.8E-07 3.9E-12 58.3 5.5 58 19-78 133-198 (382)
82 KOG0106|consensus 98.6 7E-08 1.5E-12 57.0 3.6 50 21-79 2-51 (216)
83 KOG0226|consensus 98.6 1E-07 2.3E-12 57.3 3.8 62 17-79 187-248 (290)
84 KOG4454|consensus 98.5 3.8E-08 8.3E-13 58.1 1.9 58 17-77 6-63 (267)
85 KOG1457|consensus 98.5 3.5E-07 7.5E-12 54.4 5.6 63 17-79 31-93 (284)
86 KOG0109|consensus 98.5 1.5E-07 3.2E-12 57.7 3.1 52 18-78 76-127 (346)
87 KOG0151|consensus 98.5 2E-07 4.3E-12 62.6 3.9 63 17-79 171-235 (877)
88 KOG1457|consensus 98.5 1.4E-07 3.1E-12 56.0 2.7 59 15-78 205-263 (284)
89 PF08777 RRM_3: RNA binding mo 98.4 6.6E-07 1.4E-11 47.7 4.5 53 20-79 1-53 (105)
90 KOG0147|consensus 98.3 1.6E-07 3.5E-12 61.2 0.4 63 14-77 173-235 (549)
91 KOG4849|consensus 98.2 1.3E-06 2.8E-11 54.9 2.8 59 18-77 78-138 (498)
92 PF14605 Nup35_RRM_2: Nup53/35 98.2 1.2E-05 2.7E-10 37.8 5.6 51 21-79 2-52 (53)
93 KOG0120|consensus 98.1 5.7E-06 1.2E-10 54.2 4.1 59 17-76 286-344 (500)
94 KOG4210|consensus 98.0 2.6E-06 5.5E-11 52.5 1.4 57 20-77 184-241 (285)
95 KOG1190|consensus 98.0 7.6E-06 1.6E-10 52.3 3.5 52 19-77 27-78 (492)
96 KOG4206|consensus 97.9 4.6E-05 1E-09 45.2 5.3 55 17-77 143-197 (221)
97 KOG0129|consensus 97.8 8.1E-05 1.8E-09 48.7 6.0 58 19-77 258-320 (520)
98 KOG1995|consensus 97.8 4.3E-05 9.4E-10 48.0 4.4 62 17-79 63-132 (351)
99 KOG0129|consensus 97.8 0.0001 2.2E-09 48.3 5.6 61 17-78 367-428 (520)
100 KOG1365|consensus 97.8 0.00011 2.4E-09 46.9 5.6 60 18-79 159-222 (508)
101 KOG2314|consensus 97.7 0.00014 3E-09 48.4 6.1 61 17-79 55-121 (698)
102 KOG1365|consensus 97.7 3.5E-05 7.5E-10 49.2 2.9 58 20-79 280-340 (508)
103 KOG1855|consensus 97.5 0.00011 2.5E-09 47.3 3.1 62 18-79 229-302 (484)
104 KOG3152|consensus 97.5 4E-05 8.7E-10 46.4 0.9 59 19-78 73-143 (278)
105 KOG0128|consensus 97.5 7.3E-05 1.6E-09 51.4 2.1 57 20-78 736-792 (881)
106 KOG4307|consensus 97.5 0.00035 7.6E-09 47.7 5.2 57 21-79 868-925 (944)
107 KOG0105|consensus 97.4 0.00095 2E-08 39.1 6.1 56 14-77 109-164 (241)
108 KOG0128|consensus 97.3 9.4E-06 2E-10 55.6 -3.3 59 20-79 667-725 (881)
109 KOG0115|consensus 97.3 0.00015 3.3E-09 43.9 2.0 56 21-78 32-87 (275)
110 PF08675 RNA_bind: RNA binding 97.3 0.0012 2.6E-08 33.8 4.8 49 20-78 9-57 (87)
111 PF11608 Limkain-b1: Limkain b 97.3 0.00096 2.1E-08 34.3 4.5 48 21-79 3-55 (90)
112 COG5175 MOT2 Transcriptional r 97.3 0.00048 1.1E-08 43.6 4.0 61 19-79 113-181 (480)
113 KOG4676|consensus 97.2 0.00039 8.4E-09 44.6 3.2 58 20-77 7-66 (479)
114 KOG0106|consensus 97.1 0.00054 1.2E-08 40.8 2.6 54 17-79 96-149 (216)
115 PF05172 Nup35_RRM: Nup53/35/4 97.0 0.0022 4.8E-08 33.9 4.3 60 19-79 5-70 (100)
116 KOG1456|consensus 96.9 0.0048 1.1E-07 39.7 5.9 56 17-79 28-83 (494)
117 KOG4307|consensus 96.9 0.00094 2E-08 45.7 2.8 60 17-78 431-491 (944)
118 KOG0112|consensus 96.6 0.00058 1.3E-08 47.5 0.1 58 17-76 369-426 (975)
119 KOG0112|consensus 96.4 0.0041 8.8E-08 43.6 3.4 56 17-79 452-507 (975)
120 KOG1190|consensus 96.4 0.0087 1.9E-07 38.9 4.6 55 19-78 413-467 (492)
121 PF10309 DUF2414: Protein of u 96.4 0.034 7.5E-07 26.9 5.8 52 19-79 4-58 (62)
122 PF03467 Smg4_UPF3: Smg-4/UPF3 96.4 0.0051 1.1E-07 35.6 3.1 61 17-77 4-69 (176)
123 KOG2193|consensus 95.7 0.011 2.3E-07 38.7 2.5 51 21-79 2-53 (584)
124 KOG0120|consensus 95.2 0.039 8.4E-07 36.8 3.9 44 36-79 425-470 (500)
125 KOG1548|consensus 94.6 0.08 1.7E-06 33.8 4.0 60 15-79 260-330 (382)
126 KOG1456|consensus 94.5 0.23 5.1E-06 32.3 5.9 59 15-79 282-341 (494)
127 KOG2416|consensus 94.4 0.039 8.5E-07 37.5 2.5 56 15-77 439-495 (718)
128 PF15023 DUF4523: Protein of u 94.2 0.22 4.7E-06 28.3 4.8 54 17-78 83-140 (166)
129 KOG2591|consensus 94.0 0.19 4.2E-06 34.1 5.0 54 18-79 173-228 (684)
130 KOG2068|consensus 93.9 0.034 7.3E-07 35.2 1.3 61 19-79 76-141 (327)
131 KOG4410|consensus 92.4 0.48 1E-05 29.9 4.7 49 20-75 330-379 (396)
132 KOG4210|consensus 92.3 0.051 1.1E-06 33.8 0.5 59 19-78 87-145 (285)
133 PF11767 SET_assoc: Histone ly 91.9 0.61 1.3E-05 22.8 4.0 39 31-79 11-49 (66)
134 KOG4660|consensus 91.8 0.21 4.5E-06 33.7 2.8 55 22-77 390-445 (549)
135 PF10567 Nab6_mRNP_bdg: RNA-re 90.7 0.78 1.7E-05 28.9 4.4 61 16-76 11-77 (309)
136 KOG1996|consensus 90.6 0.75 1.6E-05 29.1 4.3 44 35-79 301-345 (378)
137 KOG2891|consensus 90.6 0.61 1.3E-05 29.4 3.9 54 17-70 146-215 (445)
138 PF07576 BRAP2: BRCA1-associat 90.5 1.7 3.7E-05 23.4 7.3 56 20-78 12-69 (110)
139 PF03468 XS: XS domain; Inter 89.6 1 2.2E-05 24.5 3.9 49 21-73 9-66 (116)
140 PF15513 DUF4651: Domain of un 89.4 1.5 3.3E-05 21.2 4.2 18 35-52 9-26 (62)
141 KOG2202|consensus 88.3 0.12 2.5E-06 31.8 -0.4 35 43-79 92-126 (260)
142 KOG2253|consensus 88.0 0.34 7.5E-06 33.4 1.6 53 15-77 35-87 (668)
143 KOG4285|consensus 86.9 1.4 2.9E-05 28.1 3.6 52 20-79 197-248 (350)
144 KOG4008|consensus 86.8 0.87 1.9E-05 27.9 2.7 36 16-51 36-71 (261)
145 KOG0804|consensus 86.3 5.5 0.00012 26.8 6.2 55 20-77 74-129 (493)
146 KOG4483|consensus 84.4 3 6.5E-05 27.6 4.4 51 20-78 391-442 (528)
147 KOG2135|consensus 84.2 0.8 1.7E-05 30.7 1.8 52 18-76 370-422 (526)
148 PF08952 DUF1866: Domain of un 84.1 2.5 5.4E-05 24.0 3.5 53 18-79 25-86 (146)
149 KOG4676|consensus 82.0 0.14 3E-06 33.4 -2.2 47 21-72 152-198 (479)
150 KOG4454|consensus 73.3 0.63 1.4E-05 28.3 -1.0 52 19-72 79-134 (267)
151 COG0724 RNA-binding proteins ( 71.8 10 0.00022 22.0 3.9 41 15-55 220-260 (306)
152 COG0030 KsgA Dimethyladenosine 70.2 11 0.00025 23.4 3.8 34 20-53 95-128 (259)
153 PF09707 Cas_Cas2CT1978: CRISP 69.5 14 0.0003 19.1 3.9 48 20-71 25-72 (86)
154 PF07292 NID: Nmi/IFP 35 domai 66.9 4.1 8.9E-05 21.1 1.3 25 17-41 49-73 (88)
155 KOG2318|consensus 66.6 14 0.00031 25.7 4.0 38 17-54 171-213 (650)
156 PF04847 Calcipressin: Calcipr 63.0 16 0.00034 21.5 3.4 39 34-79 9-47 (184)
157 PHA01632 hypothetical protein 62.6 13 0.00029 17.6 2.4 21 23-43 19-39 (64)
158 PRK11558 putative ssRNA endonu 61.9 22 0.00049 18.8 3.6 50 20-73 27-76 (97)
159 PF00398 RrnaAD: Ribosomal RNA 60.7 13 0.00027 22.8 2.8 32 20-51 97-130 (262)
160 PF14893 PNMA: PNMA 56.8 12 0.00025 24.2 2.2 25 18-42 16-40 (331)
161 COG0445 GidA Flavin-dependent 55.7 46 0.00099 23.4 4.9 39 17-55 298-336 (621)
162 PF11411 DNA_ligase_IV: DNA li 53.2 13 0.00028 16.0 1.4 16 30-45 19-34 (36)
163 KOG1295|consensus 53.1 17 0.00037 23.9 2.5 58 19-76 6-65 (376)
164 KOG4213|consensus 52.1 50 0.0011 19.7 4.5 45 32-78 118-165 (205)
165 PRK00274 ksgA 16S ribosomal RN 51.9 28 0.0006 21.5 3.3 22 22-43 107-128 (272)
166 PRK01178 rps24e 30S ribosomal 51.9 16 0.00035 19.3 2.0 47 31-78 30-80 (99)
167 PF13046 DUF3906: Protein of u 51.6 24 0.00052 17.2 2.4 32 34-68 32-63 (64)
168 KOG2295|consensus 50.1 1.5 3.2E-05 30.1 -2.5 53 18-71 229-281 (648)
169 COG5193 LHP1 La protein, small 50.0 5 0.00011 26.6 -0.1 58 20-78 174-241 (438)
170 PTZ00338 dimethyladenosine tra 49.1 26 0.00057 22.0 2.9 32 21-52 102-133 (294)
171 TIGR00755 ksgA dimethyladenosi 46.4 38 0.00081 20.5 3.3 24 22-45 96-119 (253)
172 cd00027 BRCT Breast Cancer Sup 45.1 31 0.00067 15.4 3.7 26 21-46 2-27 (72)
173 PF04026 SpoVG: SpoVG; InterP 44.4 45 0.00098 17.0 3.6 25 47-72 3-27 (84)
174 smart00457 MACPF membrane-atta 44.2 28 0.0006 20.3 2.4 22 25-46 30-51 (194)
175 KOG4574|consensus 42.3 9.7 0.00021 27.8 0.3 48 23-77 301-348 (1007)
176 smart00650 rADc Ribosomal RNA 41.1 67 0.0014 18.0 3.6 24 21-44 78-101 (169)
177 PTZ00071 40S ribosomal protein 40.4 30 0.00065 19.4 2.0 46 31-77 35-85 (132)
178 COG5638 Uncharacterized conser 40.2 67 0.0014 21.8 3.8 39 15-53 141-184 (622)
179 PRK13259 regulatory protein Sp 39.0 62 0.0013 17.1 3.4 14 59-72 14-27 (94)
180 PF08206 OB_RNB: Ribonuclease 38.2 27 0.00059 16.2 1.4 11 62-72 7-17 (58)
181 PF01823 MACPF: MAC/Perforin d 38.1 29 0.00064 20.0 1.9 22 25-46 53-77 (212)
182 TIGR02542 B_forsyth_147 Bacter 36.4 24 0.00052 19.5 1.2 44 29-73 83-129 (145)
183 PF14111 DUF4283: Domain of un 36.3 65 0.0014 17.5 3.0 31 23-53 107-138 (153)
184 PF15407 Spo7_2_N: Sporulation 36.0 28 0.00061 17.1 1.3 23 19-41 26-48 (67)
185 PRK15464 cold shock-like prote 35.9 21 0.00046 17.5 0.9 10 62-71 15-24 (70)
186 PRK15463 cold shock-like prote 34.4 24 0.00053 17.2 1.0 10 62-71 15-24 (70)
187 PRK09937 stationary phase/star 34.3 25 0.00055 17.4 1.0 9 62-70 12-20 (74)
188 PRK09507 cspE cold shock prote 33.8 24 0.00052 17.2 0.9 9 62-70 14-22 (69)
189 PF06613 KorB_C: KorB C-termin 33.2 64 0.0014 15.5 2.8 22 49-72 19-40 (60)
190 PRK14998 cold shock-like prote 33.1 27 0.00058 17.3 1.0 10 62-71 12-21 (73)
191 PRK11679 lipoprotein; Provisio 33.0 1.1E+02 0.0025 19.8 4.0 50 17-74 236-285 (346)
192 PRK10943 cold shock-like prote 32.4 25 0.00055 17.1 0.8 9 62-70 14-22 (69)
193 PF00226 DnaJ: DnaJ domain; I 32.1 60 0.0013 14.9 2.2 19 26-44 6-24 (64)
194 PF06014 DUF910: Bacterial pro 31.9 37 0.0008 16.5 1.3 16 34-49 4-19 (62)
195 TIGR02381 cspD cold shock doma 31.5 30 0.00066 16.7 1.0 10 62-71 12-21 (68)
196 smart00703 NRF N-terminal doma 30.7 52 0.0011 17.3 1.9 23 23-45 72-95 (110)
197 PRK10354 RNA chaperone/anti-te 30.3 30 0.00064 16.8 0.9 8 63-70 16-23 (70)
198 PF03439 Spt5-NGN: Early trans 30.1 52 0.0011 16.5 1.8 17 62-78 43-59 (84)
199 PRK09890 cold shock protein Cs 30.1 30 0.00065 16.9 0.9 10 62-71 15-24 (70)
200 PF06804 Lipoprotein_18: NlpB/ 30.1 40 0.00086 21.3 1.6 51 17-75 195-245 (303)
201 PF01984 dsDNA_bind: Double-st 29.6 54 0.0012 17.6 1.8 21 26-46 75-95 (107)
202 KOG0862|consensus 29.4 36 0.00077 20.7 1.2 15 59-73 105-119 (216)
203 cd06498 ACD_alphaB-crystallin_ 29.2 21 0.00046 18.0 0.2 27 27-54 57-83 (84)
204 TIGR01873 cas_CT1978 CRISPR-as 29.1 93 0.002 16.1 3.2 49 20-72 25-74 (87)
205 KOG3424|consensus 28.7 1.1E+02 0.0024 17.0 5.3 46 31-77 34-83 (132)
206 cd06257 DnaJ DnaJ domain or J- 28.7 64 0.0014 14.1 2.4 19 26-44 6-24 (55)
207 PF10281 Ish1: Putative stress 27.6 62 0.0013 13.6 2.0 16 31-46 3-18 (38)
208 PF09702 Cas_Csa5: CRISPR-asso 27.3 1.1E+02 0.0024 16.4 2.9 22 17-41 61-82 (105)
209 COG5310 Homospermidine synthas 27.2 90 0.0019 20.8 2.8 29 22-51 87-115 (481)
210 PF03672 UPF0154: Uncharacteri 26.4 92 0.002 15.2 2.2 18 29-46 29-46 (64)
211 PF00313 CSD: 'Cold-shock' DNA 26.3 59 0.0013 15.2 1.5 12 62-73 11-22 (66)
212 cd04458 CSP_CDS Cold-Shock Pro 26.3 45 0.00097 15.6 1.1 10 63-72 12-21 (65)
213 PF09336 Vps4_C: Vps4 C termin 26.2 45 0.00098 15.9 1.1 15 26-40 40-54 (62)
214 PF12385 Peptidase_C70: Papain 26.1 95 0.0021 18.1 2.5 18 31-48 95-112 (166)
215 PF00220 Hormone_4: Neurohypop 25.9 32 0.00069 10.1 0.3 6 24-29 2-7 (9)
216 PF05189 RTC_insert: RNA 3'-te 25.8 1.1E+02 0.0024 15.8 4.0 47 23-70 13-64 (103)
217 PF14112 DUF4284: Domain of un 25.3 1.1E+02 0.0024 16.6 2.7 18 22-42 3-20 (122)
218 PHA00147 upper collar protein 24.9 1.4E+02 0.0031 19.2 3.3 24 26-49 42-65 (308)
219 PRK11230 glycolate oxidase sub 24.2 2.4E+02 0.0053 19.3 4.6 43 34-78 203-249 (499)
220 KOG0821|consensus 24.1 1.3E+02 0.0029 18.9 3.0 25 19-43 128-152 (326)
221 PF13600 DUF4140: N-terminal d 23.5 23 0.0005 18.4 -0.3 19 20-38 23-41 (104)
222 PRK04239 hypothetical protein; 23.2 1E+02 0.0022 16.7 2.2 20 26-45 80-99 (110)
223 COG2088 SpoVG Uncharacterized 23.1 1.3E+02 0.0028 15.8 3.2 25 47-72 3-27 (95)
224 smart00271 DnaJ DnaJ molecular 22.8 92 0.002 13.9 2.4 19 26-44 7-25 (60)
225 PRK00523 hypothetical protein; 22.0 1.2E+02 0.0027 15.1 2.3 17 30-46 38-54 (72)
226 PRK13916 plasmid segregation p 22.0 36 0.00077 17.6 0.3 21 24-44 23-43 (97)
227 PRK14896 ksgA 16S ribosomal RN 22.0 1.1E+02 0.0024 18.6 2.5 22 21-42 92-113 (258)
228 KOG2855|consensus 21.3 2.5E+02 0.0055 18.4 4.3 47 20-71 61-108 (330)
229 TIGR00006 S-adenosyl-methyltra 20.7 90 0.0019 20.0 1.9 19 31-49 140-158 (305)
230 PF08734 GYD: GYD domain; Int 20.3 1.4E+02 0.0031 15.2 4.5 39 35-78 23-62 (91)
No 1
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.78 E-value=1.3e-18 Score=97.03 Aligned_cols=62 Identities=21% Similarity=0.492 Sum_probs=57.4
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....++|||+|||+.+++++|+++|++||.|..+.++.++ .+++++|||||+|.+.++|++|
T Consensus 31 ~~~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~-~tg~~kGfaFV~F~~~e~A~~A 92 (144)
T PLN03134 31 RLMSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDR-ETGRSRGFGFVNFNDEGAATAA 92 (144)
T ss_pred cCCCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecC-CCCCcceEEEEEECCHHHHHHH
Confidence 3457899999999999999999999999999999999987 7899999999999999999875
No 2
>KOG0149|consensus
Probab=99.77 E-value=1.1e-18 Score=102.30 Aligned_cols=59 Identities=25% Similarity=0.514 Sum_probs=56.3
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
-++|||++|+|.+..+.|+++|++||+|.+..++.|+ .+|+++||+||+|.+.++|.+|
T Consensus 12 ~TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~-~t~rskGyGfVTf~d~~aa~rA 70 (247)
T KOG0149|consen 12 FTKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDK-NTGRSKGYGFVTFRDAEAATRA 70 (247)
T ss_pred EEEEEEcCcccccchHHHHHHHHHhCceEEEEEEecc-CCccccceeeEEeecHHHHHHH
Confidence 4689999999999999999999999999999999998 9999999999999999999875
No 3
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.72 E-value=3.3e-17 Score=102.04 Aligned_cols=64 Identities=22% Similarity=0.541 Sum_probs=58.8
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
......++|||+|||+++++++|+++|+.||.|..|+++.++ .+++++|||||+|.++++|++|
T Consensus 102 ~~~~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~-~tg~srGyaFVeF~~~e~A~~A 165 (346)
T TIGR01659 102 DTNNSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDY-KTGYSFGYAFVDFGSEADSQRA 165 (346)
T ss_pred CCCCCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecC-CCCccCcEEEEEEccHHHHHHH
Confidence 345568999999999999999999999999999999999987 7899999999999999999875
No 4
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.72 E-value=2.6e-17 Score=102.05 Aligned_cols=60 Identities=25% Similarity=0.596 Sum_probs=56.2
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..++|||+|||+.+++++|+++|++||.|..|+++.++ .+|+++|||||+|.+.++|++|
T Consensus 2 ~~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~-~~g~s~g~afV~f~~~~~A~~A 61 (352)
T TIGR01661 2 SKTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDK-VTGQSLGYGFVNYVRPEDAEKA 61 (352)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcC-CCCccceEEEEEECcHHHHHHH
Confidence 36799999999999999999999999999999999987 7899999999999999999875
No 5
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.72 E-value=3.5e-17 Score=80.59 Aligned_cols=55 Identities=27% Similarity=0.688 Sum_probs=51.5
Q ss_pred EEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 23 ILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 23 l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
|||+|||+.+++++|+++|++||.+..+.+..+ .++..+++|||+|.+.++|++|
T Consensus 1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~--~~~~~~~~a~V~F~~~~~a~~a 55 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRN--SSGKSKGYAFVEFESEEDAEKA 55 (70)
T ss_dssp EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEE--TTSSEEEEEEEEESSHHHHHHH
T ss_pred cEEcCCCCcCCHHHHHHHHHHhhhccccccccc--ccccccceEEEEEcCHHHHHHH
Confidence 799999999999999999999999999999887 6788999999999999999864
No 6
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.72 E-value=4.6e-17 Score=100.92 Aligned_cols=60 Identities=25% Similarity=0.485 Sum_probs=55.8
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+.+|||+|||+.+++++|.++|++||.|.+++++.++ .++.++|||||+|.+.++|.+|
T Consensus 268 ~~~~lfV~NL~~~~~e~~L~~~F~~fG~v~~v~i~~d~-~t~~skG~aFV~F~~~~~A~~A 327 (352)
T TIGR01661 268 AGYCIFVYNLSPDTDETVLWQLFGPFGAVQNVKIIRDL-TTNQCKGYGFVSMTNYDEAAMA 327 (352)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHhCCCeEEEEEeEcC-CCCCccceEEEEECCHHHHHHH
Confidence 34579999999999999999999999999999999996 7999999999999999998765
No 7
>KOG0122|consensus
Probab=99.68 E-value=1.3e-16 Score=94.13 Aligned_cols=63 Identities=30% Similarity=0.640 Sum_probs=58.7
Q ss_pred cCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 16 AKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 16 ~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...+..+|.|.||+.++++.+|+++|.+||.|..+.|..++ .+|.++|||||.|.+.++|.+|
T Consensus 185 ~R~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK-~TG~~kGFAFVtF~sRddA~rA 247 (270)
T KOG0122|consen 185 ERDDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDK-ETGLSKGFAFVTFESRDDAARA 247 (270)
T ss_pred cCCccceeEEecCccccChhHHHHHhhccCccceeEEEEcc-ccCcccceEEEEEecHHHHHHH
Confidence 34467889999999999999999999999999999999998 9999999999999999999875
No 8
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.65 E-value=6.4e-16 Score=76.44 Aligned_cols=55 Identities=36% Similarity=0.715 Sum_probs=49.0
Q ss_pred EEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 23 ILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 23 l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
|||+|||+++++++|+++|..+|.|..+.+..++ . +..+|+|||+|.+.++|.+|
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~~~~-~-~~~~~~a~v~f~~~~~a~~a 55 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLIKNK-D-GQSRGFAFVEFSSEEDAKRA 55 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTSSBEEEEEEEEST-T-SSEEEEEEEEESSHHHHHHH
T ss_pred CEEeCCCCCCCHHHHHHHHHhcCCcceEEEEeee-c-cccCCEEEEEeCCHHHHHHH
Confidence 7999999999999999999999999999999885 4 88999999999999999764
No 9
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.65 E-value=7.3e-16 Score=96.10 Aligned_cols=60 Identities=22% Similarity=0.486 Sum_probs=56.0
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..++|||+|||+.+++++|+++|++||.|..+.++.++ .+++++|||||+|.+.++|++|
T Consensus 192 ~~~~lfV~nLp~~vtee~L~~~F~~fG~V~~v~i~~d~-~tg~~kG~aFV~F~~~e~A~~A 251 (346)
T TIGR01659 192 KDTNLYVTNLPRTITDDQLDTIFGKYGQIVQKNILRDK-LTGTPRGVAFVRFNKREEAQEA 251 (346)
T ss_pred ccceeEEeCCCCcccHHHHHHHHHhcCCEEEEEEeecC-CCCccceEEEEEECCHHHHHHH
Confidence 46789999999999999999999999999999999886 7899999999999999999875
No 10
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.64 E-value=7.2e-16 Score=101.24 Aligned_cols=61 Identities=25% Similarity=0.533 Sum_probs=57.1
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...++|||+|||+.+++++|+++|.+||.|.+++++.++ .+++++|||||+|.+.++|++|
T Consensus 105 ~~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~-~TgkskGfAFVeF~s~e~A~~A 165 (612)
T TIGR01645 105 AIMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDP-ATGKHKGFAFVEYEVPEAAQLA 165 (612)
T ss_pred cCCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecC-CCCCcCCeEEEEeCcHHHHHHH
Confidence 456899999999999999999999999999999999987 8899999999999999999865
No 11
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.63 E-value=1.3e-15 Score=99.75 Aligned_cols=60 Identities=35% Similarity=0.527 Sum_probs=56.4
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+++|||+|||+++++++|+++|++||.|..++|+.+ .+++++|||||+|.+.++|++|
T Consensus 56 ~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D--~sG~sRGfaFV~F~~~e~A~~A 115 (578)
T TIGR01648 56 GRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMD--FSGQNRGYAFVTFCGKEEAKEA 115 (578)
T ss_pred CCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEEC--CCCCccceEEEEeCCHHHHHHH
Confidence 45799999999999999999999999999999999999 7899999999999999999875
No 12
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.63 E-value=1.8e-15 Score=96.67 Aligned_cols=62 Identities=19% Similarity=0.407 Sum_probs=57.1
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+.++|||+|||+.+++++|+++|++||.|..|.++.++ .++.++|||||+|.+.++|++|
T Consensus 86 ~~~~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~-~~~~skg~afVeF~~~e~A~~A 147 (457)
T TIGR01622 86 ERDDRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDR-NSRRSKGVAYVEFYDVESVIKA 147 (457)
T ss_pred ccCCcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecC-CCCCcceEEEEEECCHHHHHHH
Confidence 3456899999999999999999999999999999999987 7899999999999999999865
No 13
>KOG0121|consensus
Probab=99.63 E-value=5.3e-16 Score=84.19 Aligned_cols=62 Identities=21% Similarity=0.420 Sum_probs=56.5
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....++|||+||++.++|+.|.++|+++|.|..|.+-.++ .+..+.|||||+|.+.++|+.|
T Consensus 33 ~r~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr-~kktpCGFCFVeyy~~~dA~~A 94 (153)
T KOG0121|consen 33 LRKSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDR-FKKTPCGFCFVEYYSRDDAEDA 94 (153)
T ss_pred HhhcceEEEeeeeeeecHHHHHHHHHhccchheeEecccc-CCcCccceEEEEEecchhHHHH
Confidence 3457999999999999999999999999999999998887 7777899999999999998764
No 14
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.62 E-value=1.9e-15 Score=90.48 Aligned_cols=56 Identities=18% Similarity=0.408 Sum_probs=50.7
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++|||+|||+.+++++|+++|+.||.|.++.++.++ ..+|||||+|.++++|+.|
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~----~~~GfAFVtF~d~eaAe~A 59 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSEN----ERSQIAYVTFKDPQGAETA 59 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecC----CCCCEEEEEeCcHHHHHHH
Confidence 5799999999999999999999999999999998873 2469999999999998764
No 15
>KOG0113|consensus
Probab=99.62 E-value=1.9e-15 Score=91.35 Aligned_cols=61 Identities=20% Similarity=0.495 Sum_probs=56.0
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
..+-+||||+-|+++++|..|+..|+.||.|..|+|+.++ .+|+++|||||+|+...+...
T Consensus 98 gDPy~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~-vTgkskGYAFIeye~erdm~~ 158 (335)
T KOG0113|consen 98 GDPYKTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDK-VTGKSKGYAFIEYEHERDMKA 158 (335)
T ss_pred CCccceeeeeeccccccHHHHHHHHHhcCcceeEEEeeec-ccCCccceEEEEeccHHHHHH
Confidence 3456899999999999999999999999999999999998 899999999999998877554
No 16
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.60 E-value=4.2e-15 Score=88.06 Aligned_cols=58 Identities=19% Similarity=0.299 Sum_probs=51.6
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+.+|||+||++.+++++|+++|+.||.|.+|+|+.+ +...++|||+|.++++++.|
T Consensus 3 ~~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D----~et~gfAfVtF~d~~aaetA 60 (243)
T PLN03121 3 PGGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRS----GEYACTAYVTFKDAYALETA 60 (243)
T ss_pred CCceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecC----CCcceEEEEEECCHHHHHHH
Confidence 35789999999999999999999999999999999887 34458999999999998754
No 17
>KOG0125|consensus
Probab=99.60 E-value=2.1e-15 Score=92.16 Aligned_cols=63 Identities=29% Similarity=0.536 Sum_probs=55.9
Q ss_pred cccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 14 NVAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 14 ~~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
......+++|+|.|||+...+-||+.+|++||.|.+|.++.+ . ..++||+||+|+++++|++|
T Consensus 90 s~s~~~pkRLhVSNIPFrFRdpDL~aMF~kfG~VldVEIIfN--E-RGSKGFGFVTmen~~dadRA 152 (376)
T KOG0125|consen 90 SSSKDTPKRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEIIFN--E-RGSKGFGFVTMENPADADRA 152 (376)
T ss_pred CCCCCCCceeEeecCCccccCccHHHHHHhhCceeeEEEEec--c-CCCCccceEEecChhhHHHH
Confidence 344566789999999999999999999999999999999887 3 34789999999999999875
No 18
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.60 E-value=3.6e-15 Score=97.48 Aligned_cols=57 Identities=25% Similarity=0.480 Sum_probs=54.1
Q ss_pred eEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 22 KILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+|||+|||+++++++|+++|++||.|.+|++.+++ .+++++|||||+|.+.++|++|
T Consensus 2 sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~-~t~~s~G~afV~F~~~~~A~~A 58 (562)
T TIGR01628 2 SLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDS-VTRRSLGYGYVNFQNPADAERA 58 (562)
T ss_pred eEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecC-CCCCcceEEEEEECCHHHHHHH
Confidence 79999999999999999999999999999999986 7789999999999999999875
No 19
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.59 E-value=5.5e-15 Score=97.18 Aligned_cols=60 Identities=22% Similarity=0.469 Sum_probs=55.5
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..++|||+|||+++++++|+++|+.||.|.++++..++ .+++++|||||+|.+.++|.+|
T Consensus 203 ~~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~-~tgksKGfGFVeFe~~e~A~kA 262 (612)
T TIGR01645 203 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAP-TGRGHKGYGFIEYNNLQSQSEA 262 (612)
T ss_pred ccceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecC-CCCCcCCeEEEEECCHHHHHHH
Confidence 35799999999999999999999999999999999987 7788999999999999998764
No 20
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.57 E-value=1.6e-14 Score=93.27 Aligned_cols=61 Identities=26% Similarity=0.605 Sum_probs=55.8
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...++|||+|||+.+++++|+++|+.||.|..+.++.++ .+|.++|||||+|.+.++|..|
T Consensus 293 ~~~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~-~~g~~~g~afv~f~~~~~a~~A 353 (509)
T TIGR01642 293 DSKDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDI-ATGLSKGYAFCEYKDPSVTDVA 353 (509)
T ss_pred CCCCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecC-CCCCcCeEEEEEECCHHHHHHH
Confidence 346799999999999999999999999999999998886 7899999999999999998764
No 21
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.54 E-value=2.6e-14 Score=93.49 Aligned_cols=61 Identities=33% Similarity=0.583 Sum_probs=56.5
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....++|||+||++.+++++|+++|++||.|.+++++.+ .+|.++|||||+|.+.++|.+|
T Consensus 282 ~~~~~~l~V~nl~~~~~~~~L~~~F~~~G~i~~~~i~~d--~~g~~~g~gfV~f~~~~~A~~A 342 (562)
T TIGR01628 282 KAQGVNLYVKNLDDTVTDEKLRELFSECGEITSAKVMLD--EKGVSRGFGFVCFSNPEEANRA 342 (562)
T ss_pred ccCCCEEEEeCCCCccCHHHHHHHHHhcCCeEEEEEEEC--CCCCcCCeEEEEeCCHHHHHHH
Confidence 345678999999999999999999999999999999998 7899999999999999999875
No 22
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.53 E-value=3.5e-14 Score=90.75 Aligned_cols=59 Identities=32% Similarity=0.640 Sum_probs=55.0
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++|||+|||..+++++|+++|++||.|..+.++.++ .+|.++|||||+|.+.++|.+|
T Consensus 186 ~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~-~~g~~~g~afV~f~~~e~A~~A 244 (457)
T TIGR01622 186 FLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDP-ETGRSKGFGFIQFHDAEEAKEA 244 (457)
T ss_pred CCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcC-CCCccceEEEEEECCHHHHHHH
Confidence 5899999999999999999999999999999999886 6789999999999999998764
No 23
>KOG0144|consensus
Probab=99.53 E-value=2.1e-14 Score=90.46 Aligned_cols=62 Identities=29% Similarity=0.551 Sum_probs=57.3
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.+.-++||+.+|..++|.+|+++|++||.|.+|.+++|+ .++.++|||||.|.+.++|.+|
T Consensus 31 d~~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk-~t~~s~gcCFv~~~trk~a~~a 92 (510)
T KOG0144|consen 31 DGSAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDK-STGQSKGCCFVKYYTRKEADEA 92 (510)
T ss_pred CchhhhheeccCCccccHHHHHHHHHHhCceeEEEeeccc-ccCcccceEEEEeccHHHHHHH
Confidence 3445689999999999999999999999999999999998 8999999999999999998764
No 24
>KOG0117|consensus
Probab=99.53 E-value=3.1e-14 Score=89.92 Aligned_cols=62 Identities=34% Similarity=0.523 Sum_probs=58.6
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...++.|||+.||.++.|++|..+|++.|+|-.++|+.++ .+|.++|||||+|.+.++|++|
T Consensus 80 p~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~-~sG~nRGYAFVtf~~Ke~Aq~A 141 (506)
T KOG0117|consen 80 PPRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDP-FSGDNRGYAFVTFCTKEEAQEA 141 (506)
T ss_pred CCCCceEEecCCCccccchhhHHHHHhccceeeEEEeecc-cCCCCcceEEEEeecHHHHHHH
Confidence 3668999999999999999999999999999999999998 8999999999999999999875
No 25
>KOG0127|consensus
Probab=99.52 E-value=4.2e-14 Score=91.25 Aligned_cols=61 Identities=34% Similarity=0.675 Sum_probs=56.9
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+.+|||+|||+++++++|.+.|++||.|.+..++.++ .++.++|.|||.|.+..++.+|
T Consensus 290 ~~~~tVFvRNL~fD~tEEel~~~fskFG~v~ya~iV~~k-~T~~skGtAFv~Fkt~~~~~~c 350 (678)
T KOG0127|consen 290 TEGKTVFVRNLPFDTTEEELKEHFSKFGEVKYAIIVKDK-DTGHSKGTAFVKFKTQIAAQNC 350 (678)
T ss_pred cccceEEEecCCccccHHHHHHHHHhhccceeEEEEecc-CCCCcccceEEEeccHHHHHHH
Confidence 446899999999999999999999999999999999998 9999999999999999988765
No 26
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.52 E-value=6.2e-14 Score=82.79 Aligned_cols=59 Identities=37% Similarity=0.747 Sum_probs=55.6
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++|||+|||+.+++++|+++|.+||.+..+.+..++ .++.++|||||+|.+.+++..|
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~-~~~~~~g~~~v~f~~~~~~~~a 173 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDR-ETGKSRGFAFVEFESEESAEKA 173 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeecc-ccCccCceEEEEecCHHHHHHH
Confidence 6999999999999999999999999999999999987 7899999999999999998764
No 27
>smart00362 RRM_2 RNA recognition motif.
Probab=99.52 E-value=9.3e-14 Score=67.72 Aligned_cols=55 Identities=35% Similarity=0.765 Sum_probs=49.8
Q ss_pred eEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 22 KILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+|||+|||..+++++|+++|.+||.+..+.+..+ . +.++|+|||+|.+.++|++|
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~--~-~~~~~~~~v~f~~~~~a~~a 55 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKD--T-GKSKGFAFVEFESEEDAEKA 55 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecC--C-CCCCceEEEEeCCHHHHHHH
Confidence 5899999999999999999999999999988876 3 77889999999999998754
No 28
>KOG0114|consensus
Probab=99.51 E-value=8.2e-14 Score=73.33 Aligned_cols=61 Identities=25% Similarity=0.521 Sum_probs=52.7
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+....+-|||.|||+++|.+++.++|.+||.|..+++-..+ ..+|.|||.|++..+|.+|
T Consensus 13 lppevnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k----~TrGTAFVVYedi~dAk~A 73 (124)
T KOG0114|consen 13 LPPEVNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTK----ETRGTAFVVYEDIFDAKKA 73 (124)
T ss_pred CChhhheeEEEecCCccccHHHHHHHhhcccceEEEEecCcc----CcCceEEEEehHhhhHHHH
Confidence 334457889999999999999999999999999999987653 4579999999999998765
No 29
>KOG0148|consensus
Probab=99.51 E-value=3e-14 Score=85.49 Aligned_cols=66 Identities=24% Similarity=0.509 Sum_probs=60.1
Q ss_pred CcccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 13 SNVAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 13 ~~~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+....+..+||+-|...++.++|++.|.+||+|..+++++|. .+++++||+||.|.+.++|++|
T Consensus 55 sk~t~~~hfhvfvgdls~eI~~e~lr~aF~pFGevS~akvirD~-~T~KsKGYgFVSf~~k~dAEnA 120 (321)
T KOG0148|consen 55 SKPTSNQHFHVFVGDLSPEIDNEKLREAFAPFGEVSDAKVIRDM-NTGKSKGYGFVSFPNKEDAENA 120 (321)
T ss_pred CCCccccceeEEehhcchhcchHHHHHHhccccccccceEeecc-cCCcccceeEEeccchHHHHHH
Confidence 33444557899999999999999999999999999999999998 8999999999999999999875
No 30
>KOG0107|consensus
Probab=99.49 E-value=6.5e-14 Score=79.47 Aligned_cols=56 Identities=30% Similarity=0.544 Sum_probs=49.6
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.-.++|||+||+..+++.+|+..|..||.+..+.+-.. +.|||||+|+++.+|+.|
T Consensus 8 ~~~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvArn------PPGfAFVEFed~RDA~DA 63 (195)
T KOG0107|consen 8 NGNTKVYVGNLGSRATKRELERAFSKYGPLRSVWVARN------PPGFAFVEFEDPRDAEDA 63 (195)
T ss_pred CCCceEEeccCCCCcchHHHHHHHHhcCcceeEEEeec------CCCceEEeccCcccHHHH
Confidence 44789999999999999999999999999998877554 369999999999998865
No 31
>KOG0126|consensus
Probab=99.48 E-value=7.7e-15 Score=83.80 Aligned_cols=57 Identities=18% Similarity=0.495 Sum_probs=52.7
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHH
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNE 75 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~ 75 (79)
.+..-|||+|||++.|+.+|.-+|++||.|..|.|++|+ .+|+++||||+.|++..+
T Consensus 33 kdsA~Iyiggl~~~LtEgDil~VFSqyGe~vdinLiRDk-~TGKSKGFaFLcYEDQRS 89 (219)
T KOG0126|consen 33 KDSAYIYIGGLPYELTEGDILCVFSQYGEIVDINLIRDK-KTGKSKGFAFLCYEDQRS 89 (219)
T ss_pred ccceEEEECCCcccccCCcEEEEeeccCceEEEEEEecC-CCCcccceEEEEecCccc
Confidence 345689999999999999999999999999999999998 999999999999988654
No 32
>KOG0131|consensus
Probab=99.47 E-value=8.8e-14 Score=79.35 Aligned_cols=62 Identities=26% Similarity=0.454 Sum_probs=57.4
Q ss_pred cCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 16 AKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 16 ~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
....+.+|||+||+..++++.|.++|-+.|+|..++++.++ .+..++||||++|.++++|+-
T Consensus 5 ~rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDr-v~~~~qGygF~Ef~~eedadY 66 (203)
T KOG0131|consen 5 ERNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDR-VTQKHQGYGFAEFRTEEDADY 66 (203)
T ss_pred ccCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhh-hcccccceeEEEEechhhhHH
Confidence 34567899999999999999999999999999999999998 888899999999999999874
No 33
>smart00360 RRM RNA recognition motif.
Probab=99.47 E-value=2.4e-13 Score=66.00 Aligned_cols=54 Identities=37% Similarity=0.727 Sum_probs=48.7
Q ss_pred EeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 25 VRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 25 v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
|+|||..+++++|+++|.+||.+..+.+..++ .++.++|+|||+|.+.++|.+|
T Consensus 1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~-~~~~~~~~a~v~f~~~~~a~~a 54 (71)
T smart00360 1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDK-DTGKSKGFAFVEFESEEDAEKA 54 (71)
T ss_pred CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCC-CCCCCCceEEEEeCCHHHHHHH
Confidence 57999999999999999999999999988875 5688899999999999998764
No 34
>PLN03213 repressor of silencing 3; Provisional
Probab=99.45 E-value=3.3e-13 Score=86.74 Aligned_cols=52 Identities=17% Similarity=0.448 Sum_probs=46.9
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCH
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITK 73 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~ 73 (79)
.....+||||||++.+++++|+..|..||.|..+.|++ .+| +|||||+|.+.
T Consensus 7 ~~~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIpR---ETG--RGFAFVEMssd 58 (759)
T PLN03213 7 GGGGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFVR---TKG--RSFAYIDFSPS 58 (759)
T ss_pred CCcceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEec---ccC--CceEEEEecCC
Confidence 34568999999999999999999999999999999994 466 89999999986
No 35
>KOG4207|consensus
Probab=99.44 E-value=2.1e-13 Score=79.33 Aligned_cols=65 Identities=22% Similarity=0.450 Sum_probs=58.3
Q ss_pred cccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 14 NVAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 14 ~~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+++....+.|.|-||.+.++.++|+.+|++||.|-.|.|+.++ .++.++|||||-|....+|+.|
T Consensus 7 PPdv~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr-~Tr~sRgFaFVrf~~k~daedA 71 (256)
T KOG4207|consen 7 PPDVEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDR-YTRQSRGFAFVRFHDKRDAEDA 71 (256)
T ss_pred CCCcccceeEEecceeccCCHHHHHHHHHHhCcccceeccccc-ccccccceeEEEeeecchHHHH
Confidence 3444556789999999999999999999999999999999998 8999999999999988887764
No 36
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.44 E-value=8.7e-13 Score=64.53 Aligned_cols=56 Identities=30% Similarity=0.652 Sum_probs=50.0
Q ss_pred eEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 22 KILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+|+|+|||+.+++++|+++|..+|.|..+.+..+ ..+..+|+|||+|.+.++|..|
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~~--~~~~~~~~~~v~f~s~~~a~~a 56 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVRD--KDTKSKGFAFVEFEDEEDAEKA 56 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEeeC--CCCCcceEEEEEECCHHHHHHH
Confidence 4899999999999999999999999999999877 3447789999999999998754
No 37
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.43 E-value=4.3e-13 Score=86.72 Aligned_cols=53 Identities=25% Similarity=0.429 Sum_probs=48.5
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++|||+|||+.+++++|+++|++||.|.++.++.+ +|||||+|.+.++|++|
T Consensus 2 s~vv~V~nLp~~~te~~L~~~f~~fG~V~~v~i~~~-------k~~afVef~~~e~A~~A 54 (481)
T TIGR01649 2 SPVVHVRNLPQDVVEADLVEALIPFGPVSYVMMLPG-------KRQALVEFEDEESAKAC 54 (481)
T ss_pred ccEEEEcCCCCCCCHHHHHHHHHhcCCeeEEEEECC-------CCEEEEEeCchHHHHHH
Confidence 468999999999999999999999999999988765 38999999999999875
No 38
>KOG0130|consensus
Probab=99.43 E-value=2.4e-13 Score=74.59 Aligned_cols=67 Identities=24% Similarity=0.427 Sum_probs=60.8
Q ss_pred cCcccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 12 SSNVAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 12 ~~~~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.+....++..|||.++...++++++...|..||+|++++|..++ .+|-.+|||.|+|++.+.|++|
T Consensus 64 pgPqrSVEGwIi~VtgvHeEatEedi~d~F~dyGeiKNihLNLDR-RtGy~KGYaLvEYet~keAq~A 130 (170)
T KOG0130|consen 64 PGPQRSVEGWIIFVTGVHEEATEEDIHDKFADYGEIKNIHLNLDR-RTGYVKGYALVEYETLKEAQAA 130 (170)
T ss_pred CCCccceeeEEEEEeccCcchhHHHHHHHHhhcccccceeecccc-ccccccceeeeehHhHHHHHHH
Confidence 334556678999999999999999999999999999999999998 8999999999999999998865
No 39
>KOG0148|consensus
Probab=99.42 E-value=5e-13 Score=80.29 Aligned_cols=56 Identities=29% Similarity=0.469 Sum_probs=51.7
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..++++||++|++.-++++++++.|++||.|.+|++..++ ||+||.|.+.|+|..|
T Consensus 161 sp~NtsVY~G~I~~~lte~~mr~~Fs~fG~I~EVRvFk~q-------GYaFVrF~tkEaAahA 216 (321)
T KOG0148|consen 161 SPDNTSVYVGNIASGLTEDLMRQTFSPFGPIQEVRVFKDQ-------GYAFVRFETKEAAAHA 216 (321)
T ss_pred CCCCceEEeCCcCccccHHHHHHhcccCCcceEEEEeccc-------ceEEEEecchhhHHHH
Confidence 4668999999999999999999999999999999998884 9999999999998764
No 40
>KOG0144|consensus
Probab=99.42 E-value=8.1e-14 Score=87.85 Aligned_cols=60 Identities=32% Similarity=0.495 Sum_probs=55.3
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+.++|||+.|+..++|.+++++|.+||.|++|.+.++ ..+.+||||||.|.+.+-|..|
T Consensus 122 ~~e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd--~~~~sRGcaFV~fstke~A~~A 181 (510)
T KOG0144|consen 122 VEERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRD--PDGLSRGCAFVKFSTKEMAVAA 181 (510)
T ss_pred ccchhhhhhhccccccHHHHHHHHHhhCccchhhheec--ccccccceeEEEEehHHHHHHH
Confidence 34789999999999999999999999999999999999 7999999999999999877643
No 41
>KOG0105|consensus
Probab=99.39 E-value=7.4e-13 Score=76.10 Aligned_cols=58 Identities=28% Similarity=0.528 Sum_probs=50.6
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...++|||+|||.++.+.+|+.+|-+||.|..|.|...+ ...+||||+|+++.+|+.|
T Consensus 4 r~~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r~----g~ppfafVeFEd~RDAeDA 61 (241)
T KOG0105|consen 4 RNSRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNRP----GPPPFAFVEFEDPRDAEDA 61 (241)
T ss_pred cccceEEecCCCcchhhccHHHHHhhhcceEEEEeccCC----CCCCeeEEEecCccchhhh
Confidence 457899999999999999999999999999999885542 3458999999999998765
No 42
>KOG0145|consensus
Probab=99.38 E-value=9e-13 Score=79.14 Aligned_cols=61 Identities=25% Similarity=0.580 Sum_probs=56.8
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...+.|.|.-||...++++++.+|...|+|++|++++|+ .+|++.||+||.|..+++|++|
T Consensus 39 ~skTNLIvNYLPQ~MTqdE~rSLF~SiGeiEScKLvRDK-itGqSLGYGFVNYv~p~DAe~A 99 (360)
T KOG0145|consen 39 ESKTNLIVNYLPQNMTQDELRSLFGSIGEIESCKLVRDK-ITGQSLGYGFVNYVRPKDAEKA 99 (360)
T ss_pred cccceeeeeecccccCHHHHHHHhhcccceeeeeeeecc-ccccccccceeeecChHHHHHH
Confidence 345678888899999999999999999999999999999 9999999999999999999875
No 43
>KOG0108|consensus
Probab=99.36 E-value=1.3e-12 Score=83.38 Aligned_cols=58 Identities=31% Similarity=0.615 Sum_probs=55.5
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.+||+|+|+++++++|..+|+..|.|.+++++.|+ .+|+.+||||++|.+.+++..|
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~-~tG~~~G~~f~~~~~~~~~~~a 76 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDR-ETGKPKGFGFCEFTDEETAERA 76 (435)
T ss_pred cceEecCCCCcccHHHHHHHHhccCccceeeecccc-cCCCcCceeeEecCchhhHHHH
Confidence 899999999999999999999999999999999999 9999999999999999988754
No 44
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.35 E-value=4.7e-12 Score=81.95 Aligned_cols=56 Identities=18% Similarity=0.325 Sum_probs=50.2
Q ss_pred CCCCeEEEeCCCC-CCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPF-QAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~-~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...++|||+|||+ .+++++|+++|++||.|..++++.++ +|+|||+|.+.++|+.|
T Consensus 273 ~~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~~~------~g~afV~f~~~~~A~~A 329 (481)
T TIGR01649 273 GPGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMKNK------KETALIEMADPYQAQLA 329 (481)
T ss_pred CCCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEeCC------CCEEEEEECCHHHHHHH
Confidence 3567999999998 69999999999999999999988762 59999999999999865
No 45
>KOG0111|consensus
Probab=99.34 E-value=5.4e-13 Score=78.43 Aligned_cols=62 Identities=27% Similarity=0.508 Sum_probs=57.1
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+.++|||++|..++++.-|...|-+||.|..|.++.|. .+++++||+||+|...|+|..|
T Consensus 7 a~~KrtlYVGGladeVtekvLhaAFIPFGDI~dIqiPlDy-esqkHRgFgFVefe~aEDAaaA 68 (298)
T KOG0111|consen 7 ANQKRTLYVGGLADEVTEKVLHAAFIPFGDIKDIQIPLDY-ESQKHRGFGFVEFEEAEDAAAA 68 (298)
T ss_pred cccceeEEeccchHHHHHHHHHhccccccchhhcccccch-hcccccceeEEEeeccchhHHH
Confidence 3457899999999999999999999999999999999997 8999999999999999988654
No 46
>KOG0124|consensus
Probab=99.30 E-value=1.2e-12 Score=81.63 Aligned_cols=58 Identities=26% Similarity=0.564 Sum_probs=55.1
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
-|++||+.+.++..++.|+..|.+||+|+++.+.+|+ .+++++|||||+|+-+|.|+-
T Consensus 113 McRvYVGSIsfEl~EDtiR~AF~PFGPIKSInMSWDp-~T~kHKgFAFVEYEvPEaAqL 170 (544)
T KOG0124|consen 113 MCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDP-ATGKHKGFAFVEYEVPEAAQL 170 (544)
T ss_pred hHheeeeeeEEEechHHHHhhccCCCCcceeeccccc-ccccccceEEEEEeCcHHHHH
Confidence 5899999999999999999999999999999999998 999999999999999998764
No 47
>KOG4205|consensus
Probab=99.28 E-value=6.2e-12 Score=77.48 Aligned_cols=59 Identities=25% Similarity=0.551 Sum_probs=54.5
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
+.+++||++|+|+++++.|+..|.+||.|..+.+++++ .+++++||+||+|.+.+.+.+
T Consensus 5 ~~~KlfiGgisw~ttee~Lr~yf~~~Gev~d~~vm~d~-~t~rsrgFgfv~f~~~~~v~~ 63 (311)
T KOG4205|consen 5 ESGKLFIGGLSWETTEESLREYFSQFGEVTDCVVMRDP-STGRSRGFGFVTFATPEGVDA 63 (311)
T ss_pred CCcceeecCcCccccHHHHHHHhcccCceeeEEEeccC-CCCCcccccceecCCCcchhe
Confidence 78999999999999999999999999999999999998 889999999999997766543
No 48
>KOG0127|consensus
Probab=99.27 E-value=7.3e-12 Score=81.14 Aligned_cols=60 Identities=32% Similarity=0.677 Sum_probs=55.6
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+.||||++||++++.++|.++|+.+|+|..+.++.++ .++.++||+||+|.-.+++++|
T Consensus 4 ~g~TlfV~~lp~~~~~~qL~e~FS~vGPik~~~vVt~~-gs~~~RGfgfVtFam~ED~qrA 63 (678)
T KOG0127|consen 4 SGATLFVSRLPFSSTGEQLEEFFSYVGPIKHAVVVTNK-GSSEKRGFGFVTFAMEEDVQRA 63 (678)
T ss_pred CCceEEEecCCCccchhHHHHhhhcccCcceeEEecCC-CcccccCccceeeehHhHHHHH
Confidence 45899999999999999999999999999999999887 7888999999999999998764
No 49
>KOG0146|consensus
Probab=99.26 E-value=4.4e-12 Score=76.49 Aligned_cols=61 Identities=25% Similarity=0.446 Sum_probs=55.9
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..++++|||+.|...-+|++++.+|.+||.|.+|.+.+. ..|.++||+||.|.+..+|+.|
T Consensus 16 g~~drklfvgml~kqq~e~dvrrlf~pfG~~~e~tvlrg--~dg~sKGCAFVKf~s~~eAqaA 76 (371)
T KOG0146|consen 16 GGDDRKLFVGMLNKQQSEDDVRRLFQPFGNIEECTVLRG--PDGNSKGCAFVKFSSHAEAQAA 76 (371)
T ss_pred CccchhhhhhhhcccccHHHHHHHhcccCCcceeEEecC--CCCCCCCceEEEeccchHHHHH
Confidence 346899999999999999999999999999999999998 7899999999999999888753
No 50
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.26 E-value=2.3e-11 Score=80.08 Aligned_cols=52 Identities=25% Similarity=0.461 Sum_probs=46.1
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhc--CCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAF--GELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~--g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..++|||+|||+++++++|+++|++| |.|..|.++ ++||||+|.+.++|++|
T Consensus 232 ~~k~LfVgNL~~~~tee~L~~~F~~f~~G~I~rV~~~---------rgfAFVeF~s~e~A~kA 285 (578)
T TIGR01648 232 KVKILYVRNLMTTTTEEIIEKSFSEFKPGKVERVKKI---------RDYAFVHFEDREDAVKA 285 (578)
T ss_pred cccEEEEeCCCCCCCHHHHHHHHHhcCCCceEEEEee---------cCeEEEEeCCHHHHHHH
Confidence 35789999999999999999999999 999988653 36999999999999875
No 51
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.26 E-value=3.4e-11 Score=77.95 Aligned_cols=56 Identities=23% Similarity=0.399 Sum_probs=44.0
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcC------------CeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFG------------ELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g------------~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....++|||+|||+.+++++|.++|..++ .|..+.+ +..+|||||+|.+.++|..|
T Consensus 172 ~~~~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~-------~~~kg~afVeF~~~e~A~~A 239 (509)
T TIGR01642 172 TRQARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNI-------NKEKNFAFLEFRTVEEATFA 239 (509)
T ss_pred CccccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEE-------CCCCCEEEEEeCCHHHHhhh
Confidence 34568999999999999999999998752 2333322 34579999999999999875
No 52
>KOG0146|consensus
Probab=99.25 E-value=1.3e-11 Score=74.49 Aligned_cols=64 Identities=19% Similarity=0.387 Sum_probs=59.0
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...+++|.|||-.||.+..+.+|..+|-+||.|.+.++..|+ .++.+++|+||.|.++.+++.|
T Consensus 280 reGPeGCNlFIYHLPQEFgDaEliQmF~PFGhivSaKVFvDR-ATNQSKCFGFVSfDNp~SaQaA 343 (371)
T KOG0146|consen 280 REGPEGCNLFIYHLPQEFGDAELIQMFLPFGHIVSAKVFVDR-ATNQSKCFGFVSFDNPASAQAA 343 (371)
T ss_pred hcCCCcceEEEEeCchhhccHHHHHHhccccceeeeeeeehh-ccccccceeeEecCCchhHHHH
Confidence 456789999999999999999999999999999999988888 9999999999999999988754
No 53
>KOG0415|consensus
Probab=99.24 E-value=1.3e-11 Score=76.75 Aligned_cols=62 Identities=18% Similarity=0.361 Sum_probs=58.0
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++...|||..|.+-++.++|.-+|+.||.|.+|.++++. .+|.+..||||+|.+.+++++|
T Consensus 236 ~PPeNVLFVCKLNPVTtDeDLeiIFSrFG~i~sceVIRD~-ktgdsLqyaFiEFen~escE~A 297 (479)
T KOG0415|consen 236 KPPENVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDR-KTGDSLQYAFIEFENKESCEQA 297 (479)
T ss_pred CCCcceEEEEecCCcccccchhhHHhhcccceeeeEEecc-cccchhheeeeeecchhhHHHH
Confidence 4556899999999999999999999999999999999998 8999999999999999999875
No 54
>KOG0145|consensus
Probab=99.24 E-value=7.7e-12 Score=75.21 Aligned_cols=63 Identities=29% Similarity=0.525 Sum_probs=56.4
Q ss_pred cCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 16 AKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 16 ~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+...+..|||.+||...+..+|+.+|++||.|-.-++..|. .+|.++|.+||.|...++|+.|
T Consensus 123 ~~Ik~aNLYvSGlPktMtqkelE~iFs~fGrIItSRiL~dq-vtg~srGVgFiRFDKr~EAe~A 185 (360)
T KOG0145|consen 123 DSIKDANLYVSGLPKTMTQKELEQIFSPFGRIITSRILVDQ-VTGLSRGVGFIRFDKRIEAEEA 185 (360)
T ss_pred hhhcccceEEecCCccchHHHHHHHHHHhhhhhhhhhhhhc-ccceecceeEEEecchhHHHHH
Confidence 34557899999999999999999999999998888887777 7899999999999999998764
No 55
>KOG0147|consensus
Probab=99.24 E-value=4.9e-12 Score=81.53 Aligned_cols=57 Identities=30% Similarity=0.722 Sum_probs=53.4
Q ss_pred eEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 22 KILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.|||+||.+.+++++|+.+|++||.|..|.++.+. .+|.++||+|++|.+.++|.+|
T Consensus 280 rl~vgnLHfNite~~lr~ifepfg~Ie~v~l~~d~-~tG~skgfGfi~f~~~~~ar~a 336 (549)
T KOG0147|consen 280 RLYVGNLHFNITEDMLRGIFEPFGKIENVQLTKDS-ETGRSKGFGFITFVNKEDARKA 336 (549)
T ss_pred hhhhcccccCchHHHHhhhccCcccceeeeecccc-ccccccCcceEEEecHHHHHHH
Confidence 39999999999999999999999999999999985 5999999999999999998764
No 56
>KOG4212|consensus
Probab=99.20 E-value=6.4e-11 Score=75.39 Aligned_cols=58 Identities=26% Similarity=0.550 Sum_probs=53.8
Q ss_pred CCeEEEeCCCCCCCHHHHHHHh-HhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELF-KAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f-~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+.+||.|||+++.+.+|+.+| ++-|+|++|.|..| .+|+++|||.|+|+++|.+++|
T Consensus 44 ~R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D--~~GK~rGcavVEFk~~E~~qKa 102 (608)
T KOG4212|consen 44 DRSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFD--ESGKARGCAVVEFKDPENVQKA 102 (608)
T ss_pred cceEEEecCcchhhhHhHHHHHHHhcCceEeeeeecc--cCCCcCCceEEEeeCHHHHHHH
Confidence 4669999999999999999999 46799999999999 7999999999999999998875
No 57
>KOG0123|consensus
Probab=99.18 E-value=7.8e-11 Score=74.30 Aligned_cols=58 Identities=21% Similarity=0.511 Sum_probs=52.0
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++..+||.||+++++..+|..+|+.||.|.+|++..+ ..| ++|| ||.|++.++|.+|
T Consensus 74 rd~~~~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~--~~g-~kg~-FV~f~~e~~a~~a 131 (369)
T KOG0123|consen 74 RDPSLVFIKNLDESIDNKSLYDTFSEFGNILSCKVATD--ENG-SKGY-FVQFESEESAKKA 131 (369)
T ss_pred cCCceeeecCCCcccCcHHHHHHHHhhcCeeEEEEEEc--CCC-ceee-EEEeCCHHHHHHH
Confidence 34455999999999999999999999999999999998 566 8999 9999999999864
No 58
>KOG0117|consensus
Probab=99.17 E-value=9.8e-11 Score=74.40 Aligned_cols=62 Identities=26% Similarity=0.341 Sum_probs=51.6
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCC-eeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGE-LKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~-i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
....++|||+|||.+.+++++++.+++.++ |..|.+..++....++||||||+|.++..|.-
T Consensus 161 Svan~RLFiG~IPK~k~keeIlee~~kVteGVvdVivy~~p~dk~KNRGFaFveYe~H~~Aa~ 223 (506)
T KOG0117|consen 161 SVANCRLFIGNIPKTKKKEEILEEMKKVTEGVVDVIVYPSPDDKTKNRGFAFVEYESHRAAAM 223 (506)
T ss_pred eeecceeEeccCCccccHHHHHHHHHhhCCCeeEEEEecCccccccccceEEEEeecchhHHH
Confidence 356799999999999999999999999874 67777766543567899999999999987753
No 59
>KOG0110|consensus
Probab=99.14 E-value=4.1e-11 Score=79.23 Aligned_cols=61 Identities=66% Similarity=1.119 Sum_probs=55.1
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..++.|+|.|||+..+..+++.+|..||.+.+++++... ..+.++|||||+|-++.+|.+|
T Consensus 611 k~~tKIlVRNipFeAt~rEVr~LF~aFGqlksvRlPKK~-~k~a~rGF~Fv~f~t~~ea~nA 671 (725)
T KOG0110|consen 611 KKGTKILVRNIPFEATKREVRKLFTAFGQLKSVRLPKKI-GKGAHRGFGFVDFLTPREAKNA 671 (725)
T ss_pred cccceeeeeccchHHHHHHHHHHHhcccceeeeccchhh-cchhhccceeeeccCcHHHHHH
Confidence 337899999999999999999999999999999998864 6778899999999999998765
No 60
>KOG4208|consensus
Probab=99.13 E-value=2e-10 Score=66.73 Aligned_cols=59 Identities=29% Similarity=0.524 Sum_probs=52.1
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhc-CCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAF-GELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~-g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
...-+|+..+|....+.++..+|.++ |.+...++-+++ .||.++|||||+|++++.|.-
T Consensus 48 ~~g~~~~~~~p~g~~e~~~~~~~~q~~g~v~r~rlsRnk-rTGNSKgYAFVEFEs~eVA~I 107 (214)
T KOG4208|consen 48 IEGVVYVDHIPHGFFETEILNYFRQFGGTVTRFRLSRNK-RTGNSKGYAFVEFESEEVAKI 107 (214)
T ss_pred CccceeecccccchhHHHHhhhhhhcCCeeEEEEeeccc-ccCCcCceEEEEeccHHHHHH
Confidence 34568899999999999999999998 678888887887 899999999999999998864
No 61
>KOG0124|consensus
Probab=99.11 E-value=2.2e-10 Score=71.75 Aligned_cols=57 Identities=23% Similarity=0.490 Sum_probs=51.5
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
-.+|||..+.++.++++|+.+|+.||+|.+|.+-+.+ ..+.++||+|++|.+..+..
T Consensus 210 fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~p-t~~~HkGyGfiEy~n~qs~~ 266 (544)
T KOG0124|consen 210 FNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAP-TGRGHKGYGFIEYNNLQSQS 266 (544)
T ss_pred hheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccC-CCCCccceeeEEeccccchH
Confidence 4689999999999999999999999999999999986 67779999999998876543
No 62
>smart00361 RRM_1 RNA recognition motif.
Probab=99.09 E-value=3.2e-10 Score=56.16 Aligned_cols=45 Identities=22% Similarity=0.355 Sum_probs=36.9
Q ss_pred HHHHHHHhH----hcCCeeEEE-eecccCCC--CCcceEEEEEecCHHHHhcC
Q psy3346 34 QSEVEELFK----AFGELKFVR-LPKKMVGS--GLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 34 ~~~l~~~f~----~~g~i~~~~-~~~~~~~~--~~~~g~~fv~f~~~~~a~~a 79 (79)
+++|+++|+ +||.|.++. ++.++ .+ +.++||+||+|.+.++|.+|
T Consensus 2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~-~~~~~~~rG~~fV~f~~~~dA~~A 53 (70)
T smart00361 2 DEDFEREFSEEEEYFGEVGKINKIYIDN-VGYENHKRGNVYITFERSEDAARA 53 (70)
T ss_pred chhHHHHHHHHHHhcCCeeEEEEEEeCC-CCCCCCCcEEEEEEECCHHHHHHH
Confidence 467888888 999999985 55554 44 88999999999999999865
No 63
>KOG0116|consensus
Probab=99.07 E-value=3.5e-10 Score=72.15 Aligned_cols=61 Identities=34% Similarity=0.562 Sum_probs=49.7
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....+|||.|||.+++..+|+++|.+||.|+...+.... ..++..+||||+|.+.++++.|
T Consensus 286 ~~~~~i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~-~~~~~~~fgFV~f~~~~~~~~~ 346 (419)
T KOG0116|consen 286 ADGLGIFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRS-PGGKNPCFGFVEFENAAAVQNA 346 (419)
T ss_pred ecccceEeecCCCCCCHHHHHHHHhhcccccccceEEec-cCCCcCceEEEEEeecchhhhh
Confidence 335669999999999999999999999999998875542 2345459999999998887653
No 64
>KOG4205|consensus
Probab=99.02 E-value=7.6e-10 Score=68.38 Aligned_cols=58 Identities=26% Similarity=0.521 Sum_probs=53.8
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
..++||++||..+++.+++++|.+||.|..+.++.|. .+.+++||+||.|.+.+++++
T Consensus 97 tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~~~~~~d~-~~~~~rgFgfv~~~~e~sVdk 154 (311)
T KOG4205|consen 97 TKKIFVGGLPPDTTEEDFKDYFEQFGKVADVVIMYDK-TTSRPRGFGFVTFDSEDSVDK 154 (311)
T ss_pred eeEEEecCcCCCCchHHHhhhhhccceeEeeEEeecc-cccccccceeeEeccccccce
Confidence 5589999999999999999999999999999999887 889999999999999988765
No 65
>KOG0153|consensus
Probab=99.00 E-value=8e-10 Score=68.47 Aligned_cols=59 Identities=17% Similarity=0.416 Sum_probs=51.3
Q ss_pred cccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 14 NVAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 14 ~~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
++.+..-.+|||++|-..+++.+|+..|.+||+|.++.+...+ ++|||+|.+.++|+.|
T Consensus 222 pPeD~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~~~~-------~CAFv~ftTR~aAE~A 280 (377)
T KOG0153|consen 222 PPEDTSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRILPRK-------GCAFVTFTTREAAEKA 280 (377)
T ss_pred CCcccceeEEEecccccchhHHHHHHHHhhcCCeeeEEeeccc-------ccceeeehhhHHHHHH
Confidence 3445556899999999999999999999999999999887653 7999999999998865
No 66
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=98.97 E-value=5.6e-09 Score=54.78 Aligned_cols=56 Identities=25% Similarity=0.519 Sum_probs=48.3
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhc--CCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAF--GELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~--g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
+||.|+|||...+.++|.+++... |....+.|+.|. .++.+.|||||.|.+++.+.
T Consensus 2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf-~~~~N~GYAFVNf~~~~~~~ 59 (97)
T PF04059_consen 2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDF-KNKCNLGYAFVNFTSPQAAI 59 (97)
T ss_pred eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeec-cCCCceEEEEEEcCCHHHHH
Confidence 689999999999999999998543 566777888886 78889999999999998875
No 67
>KOG0123|consensus
Probab=98.95 E-value=1.6e-09 Score=68.42 Aligned_cols=61 Identities=30% Similarity=0.613 Sum_probs=56.6
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
......|||.||+..++.+.|+.+|+.+|.|.++++..+ ..+.++||+||.|.+.++|.+|
T Consensus 267 ~~~~~nl~vknld~~~~~e~L~~~f~~~GeI~s~kv~~~--~~g~skG~gfV~fs~~eeA~~A 327 (369)
T KOG0123|consen 267 SLQGANLYVKNLDETLSDEKLRKIFSSFGEITSAKVMVD--ENGKSKGFGFVEFSSPEEAKKA 327 (369)
T ss_pred cccccccccccCccccchhHHHHHHhcccceeeEEEEec--cCCCccceEEEEcCCHHHHHHH
Confidence 556789999999999999999999999999999999998 7899999999999999998764
No 68
>KOG0132|consensus
Probab=98.93 E-value=1.7e-09 Score=72.54 Aligned_cols=53 Identities=25% Similarity=0.584 Sum_probs=48.8
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+||||+.|+..+++.+|..+|+.||.|.+|.++.. +|||||......+|.+|
T Consensus 421 SrTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~~-------R~cAfI~M~~RqdA~ka 473 (894)
T KOG0132|consen 421 SRTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIPP-------RGCAFIKMVRRQDAEKA 473 (894)
T ss_pred eeeeeeccccchhhHHHHHHHHHhcccceeEeeccC-------CceeEEEEeehhHHHHH
Confidence 489999999999999999999999999999988665 59999999999999875
No 69
>KOG0109|consensus
Probab=98.93 E-value=1.1e-09 Score=66.76 Aligned_cols=48 Identities=29% Similarity=0.596 Sum_probs=41.6
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
.+|||+|||..+++.+|+.+|++||+|.+|.++.+ ||||..++...++
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlECDIvKN---------YgFVHiEdktaae 50 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFEQYGKVLECDIVKN---------YGFVHIEDKTAAE 50 (346)
T ss_pred cchhccCCCcccchHHHHHHHHhhCceEeeeeecc---------cceEEeecccccH
Confidence 57999999999999999999999999999998775 6777777665554
No 70
>KOG4211|consensus
Probab=98.91 E-value=5e-09 Score=67.34 Aligned_cols=59 Identities=22% Similarity=0.562 Sum_probs=51.1
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
......|-+.+|||++|+++|..+|+.++ |..+.+++ .+|+..|-|||+|.+.+++++|
T Consensus 7 ~~~~~~vr~rGLPwsat~~ei~~Ff~~~~-I~~~~~~r---~~Gr~sGeA~Ve~~seedv~~A 65 (510)
T KOG4211|consen 7 GSTAFEVRLRGLPWSATEKEILDFFSNCG-IENLEIPR---RNGRPSGEAYVEFTSEEDVEKA 65 (510)
T ss_pred CCcceEEEecCCCccccHHHHHHHHhcCc-eeEEEEec---cCCCcCcceEEEeechHHHHHH
Confidence 45567888999999999999999999996 88866665 4799999999999999998765
No 71
>KOG4206|consensus
Probab=98.88 E-value=8.5e-09 Score=60.68 Aligned_cols=55 Identities=25% Similarity=0.405 Sum_probs=46.5
Q ss_pred CCeEEEeCCCCCCCHHHHHH----HhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQAKQSEVEE----LFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~----~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
..||||.||+..+..++|+. +|++||.|..|.... +.+.+|-|||.|.+.+.|-.
T Consensus 9 n~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~k----t~KmRGQA~VvFk~~~~As~ 67 (221)
T KOG4206|consen 9 NGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAFK----TPKMRGQAFVVFKETEAASA 67 (221)
T ss_pred CceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEecC----CCCccCceEEEecChhHHHH
Confidence 33999999999999998877 999999998886644 46788999999999887654
No 72
>KOG0533|consensus
Probab=98.85 E-value=9.2e-09 Score=61.70 Aligned_cols=59 Identities=32% Similarity=0.541 Sum_probs=53.7
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...+|+|.|||+.+.+++|+++|..||.+..+.+..+ ..|.+.|.|-|.|...++|++|
T Consensus 82 ~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vhy~--~~G~s~Gta~v~~~r~~DA~~a 140 (243)
T KOG0533|consen 82 RSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVHYD--RAGRSLGTADVSFNRRDDAERA 140 (243)
T ss_pred CcceeeeecCCcCcchHHHHHHHHHhccceEEeeccC--CCCCCCccceeeecchHhHHHH
Confidence 3478999999999999999999999998899989898 6999999999999999888753
No 73
>KOG0131|consensus
Probab=98.82 E-value=5.3e-09 Score=60.07 Aligned_cols=62 Identities=21% Similarity=0.449 Sum_probs=51.7
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeE-EEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKF-VRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~-~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..-+..+||+||.+.+++..|...|+.||.+.. -.+.++. .+|.++||+|+.|.+.+.+.+|
T Consensus 93 l~vganlfvgNLd~~vDe~~L~dtFsafG~l~~~P~i~rd~-~tg~~~~~g~i~~~sfeasd~a 155 (203)
T KOG0131|consen 93 LDVGANLFVGNLDPEVDEKLLYDTFSAFGVLISPPKIMRDP-DTGNPKGFGFINYASFEASDAA 155 (203)
T ss_pred ccccccccccccCcchhHHHHHHHHHhccccccCCcccccc-cCCCCCCCeEEechhHHHHHHH
Confidence 344578999999999999999999999998654 3566665 7789999999999999887653
No 74
>KOG4661|consensus
Probab=98.80 E-value=1.1e-08 Score=67.42 Aligned_cols=60 Identities=22% Similarity=0.416 Sum_probs=53.1
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++.|||.+|+..+...+|+.+|++||.|...+++.+. .+...++|+||++.+.++|.+|
T Consensus 404 ~gRNlWVSGLSstTRAtDLKnlFSKyGKVvGAKVVTNa-RsPGaRCYGfVTMSts~eAtkC 463 (940)
T KOG4661|consen 404 LGRNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNA-RSPGARCYGFVTMSTSAEATKC 463 (940)
T ss_pred cccceeeeccccchhhhHHHHHHHHhcceeceeeeecC-CCCCcceeEEEEecchHHHHHH
Confidence 46899999999999999999999999999988888775 5666789999999999988765
No 75
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=98.78 E-value=1.4e-08 Score=48.07 Aligned_cols=37 Identities=43% Similarity=0.802 Sum_probs=30.9
Q ss_pred HHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 37 VEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 37 l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
|.++|++||.|..+.+..+ . +++|||+|.+.++|+.|
T Consensus 1 L~~~f~~fG~V~~i~~~~~--~----~~~a~V~f~~~~~A~~a 37 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKK--K----RGFAFVEFASVEDAQKA 37 (56)
T ss_dssp HHHHHTTTS-EEEEEEETT--S----TTEEEEEESSHHHHHHH
T ss_pred ChHHhCCcccEEEEEEEeC--C----CCEEEEEECCHHHHHHH
Confidence 6789999999999988665 2 58999999999999864
No 76
>KOG0110|consensus
Probab=98.76 E-value=1.5e-08 Score=67.49 Aligned_cols=60 Identities=32% Similarity=0.561 Sum_probs=49.3
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCC--CCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGS--GLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~--~~~~g~~fv~f~~~~~a~~a 79 (79)
.++|||.||+++++.+++...|...|.|.++.|...+... --+.|||||+|.++++|++|
T Consensus 515 ~t~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFVEF~~~e~A~~a 576 (725)
T KOG0110|consen 515 ETKLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFVEFAKPESAQAA 576 (725)
T ss_pred chhhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEEEecCHHHHHHH
Confidence 4559999999999999999999999999999886653111 12459999999999998865
No 77
>KOG4660|consensus
Probab=98.75 E-value=1.7e-08 Score=65.61 Aligned_cols=59 Identities=27% Similarity=0.472 Sum_probs=50.4
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....+..+|+|-|||..+++++|+.+|+.||+|..++. +-..+|.+||+|.+..+|++|
T Consensus 70 ~~~~~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir~------t~~~~~~~~v~FyDvR~A~~A 128 (549)
T KOG4660|consen 70 EKDMNQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIRE------TPNKRGIVFVEFYDVRDAERA 128 (549)
T ss_pred cccCccceEEEEecCCcCCHHHHHHHHHhhcchhhhhc------ccccCceEEEEEeehHhHHHH
Confidence 34556789999999999999999999999999999654 334579999999999998875
No 78
>KOG4211|consensus
Probab=98.72 E-value=6.1e-08 Score=62.50 Aligned_cols=60 Identities=22% Similarity=0.430 Sum_probs=50.4
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeE-EEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKF-VRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~-~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.....|.+++||+.|++++|.++|+..--+.. +.++.+ ..+++.|-|||.|++.+.|++|
T Consensus 101 ~~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d--~rgR~tGEAfVqF~sqe~ae~A 161 (510)
T KOG4211|consen 101 ANDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMD--QRGRPTGEAFVQFESQESAEIA 161 (510)
T ss_pred CCCceEEecCCCccCcHHHHHHHhcCCcccccceeeecc--CCCCcccceEEEecCHHHHHHH
Confidence 45678999999999999999999988754444 556777 5788889999999999999875
No 79
>KOG4209|consensus
Probab=98.71 E-value=1.8e-08 Score=60.22 Aligned_cols=64 Identities=23% Similarity=0.416 Sum_probs=56.8
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....+...+||+|+.+.++.+++...|+.||.+..+.++.++ ..+.++||+||+|.+.+.++.+
T Consensus 96 ~~~~d~~sv~v~nvd~~~t~~~~e~hf~~Cg~i~~~ti~~d~-~~~~~k~~~yvef~~~~~~~~a 159 (231)
T KOG4209|consen 96 QKEVDAPSVWVGNVDFLVTLTKIELHFESCGGINRVTVPKDK-FRGHPKGFAYVEFSSYELVEEA 159 (231)
T ss_pred hhccCCceEEEeccccccccchhhheeeccCCccceeeeccc-cCCCcceeEEEecccHhhhHHH
Confidence 445678899999999999999999999999999999999887 7778999999999999887653
No 80
>KOG4212|consensus
Probab=98.70 E-value=2.3e-08 Score=64.07 Aligned_cols=58 Identities=24% Similarity=0.585 Sum_probs=49.8
Q ss_pred cCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 16 AKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 16 ~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+..|+|+|+|||+++|+..|+.-|..||.+.+..++. .|+++| .|.|.++++|++|
T Consensus 532 aarKa~qIiirNlP~dfTWqmlrDKfre~G~v~yadime----~GkskG--VVrF~s~edAEra 589 (608)
T KOG4212|consen 532 AARKACQIIIRNLPFDFTWQMLRDKFREIGHVLYADIME----NGKSKG--VVRFFSPEDAERA 589 (608)
T ss_pred ccccccEEEEecCCccccHHHHHHHHHhccceehhhhhc----cCCccc--eEEecCHHHHHHH
Confidence 345578899999999999999999999999999888743 477776 8999999999875
No 81
>KOG1548|consensus
Probab=98.58 E-value=1.8e-07 Score=58.27 Aligned_cols=58 Identities=17% Similarity=0.356 Sum_probs=49.9
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCee--------EEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELK--------FVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~--------~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
.++.|||.|||.++|.+++.++|++||-|. .|+|.++ ..|..+|-|.+.|...++++-
T Consensus 133 ~Nt~VYVsgLP~DiT~dE~~~~~sKcGiI~~d~~t~epk~KlYrd--~~G~lKGDaLc~y~K~ESVeL 198 (382)
T KOG1548|consen 133 VNTSVYVSGLPLDITVDEFAEVMSKCGIIMRDPQTGEPKVKLYRD--NQGKLKGDALCCYIKRESVEL 198 (382)
T ss_pred cCceEEecCCCCcccHHHHHHHHHhcceEeccCCCCCeeEEEEec--CCCCccCceEEEeecccHHHH
Confidence 357799999999999999999999999774 3777777 679999999999988887653
No 82
>KOG0106|consensus
Probab=98.58 E-value=7e-08 Score=57.00 Aligned_cols=50 Identities=30% Similarity=0.747 Sum_probs=43.7
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+||++||+.+.+.+|..+|..||.+..+.+.. ||+||+|.+..+|+.|
T Consensus 2 ~rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~mk~---------gf~fv~fed~rda~Da 51 (216)
T KOG0106|consen 2 PRVYIGRLPYRARERDVERFFKGYGKIPDADMKN---------GFGFVEFEDPRDADDA 51 (216)
T ss_pred CceeecccCCccchhHHHHHHhhccccccceeec---------ccceeccCchhhhhcc
Confidence 4689999999999999999999999988876533 7889999999988764
No 83
>KOG0226|consensus
Probab=98.55 E-value=1e-07 Score=57.30 Aligned_cols=62 Identities=18% Similarity=0.397 Sum_probs=55.2
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.++.+||.+-|-.+++++.|-..|.+|-.....++++++ .+|+++||+||.|.++.++.+|
T Consensus 187 ~~~DfRIfcgdlgNevnd~vl~raf~Kfpsf~~akviRdk-RTgKSkgygfVSf~~pad~~rA 248 (290)
T KOG0226|consen 187 DEDDFRIFCGDLGNEVNDDVLARAFKKFPSFQKAKVIRDK-RTGKSKGYGFVSFRDPADYVRA 248 (290)
T ss_pred ccccceeecccccccccHHHHHHHHHhccchhhccccccc-cccccccceeeeecCHHHHHHH
Confidence 4557899999999999999999999999887788888888 8999999999999999987543
No 84
>KOG4454|consensus
Probab=98.54 E-value=3.8e-08 Score=58.14 Aligned_cols=58 Identities=22% Similarity=0.351 Sum_probs=50.3
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
....++|||.|+...++++.|.++|-+-|+|.++.|+.+ ..+..+ ||||.|+++.++.
T Consensus 6 ae~drtl~v~n~~~~v~eelL~ElfiqaGPV~kv~ip~~--~d~~~k-Fa~v~f~~E~sv~ 63 (267)
T KOG4454|consen 6 AEMDRTLLVQNMYSGVSEELLSELFIQAGPVYKVGIPSG--QDQEQK-FAYVFFPNENSVQ 63 (267)
T ss_pred cchhhHHHHHhhhhhhhHHHHHHHhhccCceEEEeCCCC--ccCCCc-eeeeecccccchh
Confidence 456789999999999999999999999999999999888 455555 9999999877654
No 85
>KOG1457|consensus
Probab=98.53 E-value=3.5e-07 Score=54.39 Aligned_cols=63 Identities=19% Similarity=0.263 Sum_probs=49.0
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...-+||||.+||.++...+|..+|..|-..+...+..........+.+||++|.+..+|.+|
T Consensus 31 ~~~VRTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~~~~~~pvaFatF~s~q~A~aa 93 (284)
T KOG1457|consen 31 PGAVRTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKGDQVCKPVAFATFTSHQFALAA 93 (284)
T ss_pred ccccceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCCCccccceEEEEecchHHHHHH
Confidence 344799999999999999999999999976666666443202233568999999999998754
No 86
>KOG0109|consensus
Probab=98.47 E-value=1.5e-07 Score=57.66 Aligned_cols=52 Identities=23% Similarity=0.402 Sum_probs=46.0
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
...++++|+||.+.++..+|+..|.+||.+.+|.++. +|+||.|.-.++|..
T Consensus 76 k~stkl~vgNis~tctn~ElRa~fe~ygpviecdivk---------dy~fvh~d~~eda~~ 127 (346)
T KOG0109|consen 76 KASTKLHVGNISPTCTNQELRAKFEKYGPVIECDIVK---------DYAFVHFDRAEDAVE 127 (346)
T ss_pred CCccccccCCCCccccCHHHhhhhcccCCceeeeeec---------ceeEEEEeeccchHH
Confidence 3467899999999999999999999999999998866 589999998887754
No 87
>KOG0151|consensus
Probab=98.47 E-value=2e-07 Score=62.61 Aligned_cols=63 Identities=22% Similarity=0.358 Sum_probs=51.1
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccC--CCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMV--GSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~--~~~~~~g~~fv~f~~~~~a~~a 79 (79)
++..+.+|++||++.+++..|...|..||++..++++..+. .....+-|+||.|-+..+|++|
T Consensus 171 DP~TTNlyv~Nlnpsv~E~~ll~tfGrfgPlasvKimwpRtEeEk~r~r~cgfvafmnR~D~era 235 (877)
T KOG0151|consen 171 DPQTTNLYVGNLNPSVDENFLLRTFGRFGPLASVKIMWPRTEEEKRRERNCGFVAFMNRADAERA 235 (877)
T ss_pred CCcccceeeecCCccccHHHHHHHhcccCcccceeeecccchhhhccccccceeeehhhhhHHHH
Confidence 34567899999999999999999999999999988776531 1233456899999999988764
No 88
>KOG1457|consensus
Probab=98.45 E-value=1.4e-07 Score=55.97 Aligned_cols=59 Identities=14% Similarity=0.359 Sum_probs=46.4
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
.....+.+|||.||..++++++|+.+|+.|.....+++... .| ...||++|+..+.|..
T Consensus 205 ~~~~acstlfianl~~~~~ed~l~~~~~~~~gf~~l~~~~~---~g--~~vaf~~~~~~~~at~ 263 (284)
T KOG1457|consen 205 SGARACSTLFIANLGPNCTEDELKQLLSRYPGFHILKIRAR---GG--MPVAFADFEEIEQATD 263 (284)
T ss_pred ccchhhhhHhhhccCCCCCHHHHHHHHHhCCCceEEEEecC---CC--cceEeecHHHHHHHHH
Confidence 34456789999999999999999999999987666665432 22 3679999998888764
No 89
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=98.42 E-value=6.6e-07 Score=47.67 Aligned_cols=53 Identities=23% Similarity=0.467 Sum_probs=33.1
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
++.|+|.+++..++.++|+..|++||.|.+|.+.... .-|||.|.+++.|+.|
T Consensus 1 G~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G~-------~~g~VRf~~~~~A~~a 53 (105)
T PF08777_consen 1 GCILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRGD-------TEGYVRFKTPEAAQKA 53 (105)
T ss_dssp --EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT--------SEEEEEESS---HHHH
T ss_pred CeEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCCC-------CEEEEEECCcchHHHH
Confidence 3578899999999999999999999999999886652 4689999999888653
No 90
>KOG0147|consensus
Probab=98.30 E-value=1.6e-07 Score=61.17 Aligned_cols=63 Identities=17% Similarity=0.363 Sum_probs=55.3
Q ss_pred cccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 14 NVAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 14 ~~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
...+.+.+++|+--|+..++..+|.++|+.+|.|..++++.++ .++.++|.+||+|.+.+++.
T Consensus 173 ~~eERd~Rtvf~~qla~r~~pRdL~efFs~~gkVrdVriI~Dr-~s~rskgi~Yvef~D~~sVp 235 (549)
T KOG0147|consen 173 SPEERDQRTVFCMQLARRNPPRDLEEFFSIVGKVRDVRIIGDR-NSRRSKGIAYVEFCDEQSVP 235 (549)
T ss_pred CchHHhHHHHHHHHHhhcCCchhHHHHHHhhcCcceeEeeccc-cchhhcceeEEEEecccchh
Confidence 3445567889999999999999999999999999999999998 88999999999999877653
No 91
>KOG4849|consensus
Probab=98.20 E-value=1.3e-06 Score=54.92 Aligned_cols=59 Identities=10% Similarity=0.234 Sum_probs=48.4
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcC--CeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFG--ELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g--~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
.....+||+||-|.+|+++|.+.+...| .+..+++..++ .+|+++|||.+...+..+++
T Consensus 78 Grk~~~YvGNL~W~TTD~DL~~A~~S~G~~~~~dmKFFENR-~NGQSKG~AL~~~~SdAa~K 138 (498)
T KOG4849|consen 78 GRKYCCYVGNLLWYTTDADLLKALQSTGLAQFADMKFFENR-TNGQSKGYALLVLNSDAAVK 138 (498)
T ss_pred CceEEEEecceeEEeccHHHHHHHHhhhHHHHhhhhhhhcc-cCCcccceEEEEecchHHHH
Confidence 3456799999999999999999988776 46677777777 89999999999987765543
No 92
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=98.20 E-value=1.2e-05 Score=37.75 Aligned_cols=51 Identities=22% Similarity=0.464 Sum_probs=40.0
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.|-|.+.+++..+. +...|..||+|..+.+... .-+.|+.|.+..+|++|
T Consensus 2 ~wI~V~Gf~~~~~~~-vl~~F~~fGeI~~~~~~~~-------~~~~~l~y~~~~~ae~A 52 (53)
T PF14605_consen 2 TWISVSGFPPDLAEE-VLEHFASFGEIVDIYVPES-------TNWMYLKYKSRKDAEKA 52 (53)
T ss_pred cEEEEEeECchHHHH-HHHHHHhcCCEEEEEcCCC-------CcEEEEEECCHHHHHhh
Confidence 457788888776644 5558889999999877532 36899999999999886
No 93
>KOG0120|consensus
Probab=98.08 E-value=5.7e-06 Score=54.15 Aligned_cols=59 Identities=32% Similarity=0.597 Sum_probs=52.3
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHH
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEA 76 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a 76 (79)
......+||++||...++..+++++..||.+....++.+. .+|-++||||.+|-++...
T Consensus 286 ~~~~~ki~v~~lp~~l~~~q~~Ell~~fg~lk~f~lv~d~-~~g~skg~af~ey~dpsvt 344 (500)
T KOG0120|consen 286 PDSPNKIFVGGLPLYLTEDQVKELLDSFGPLKAFRLVKDS-ATGNSKGFAFCEYCDPSVT 344 (500)
T ss_pred ccccchhhhccCcCccCHHHHHHHHHhcccchhheeeccc-ccccccceeeeeeeCCcch
Confidence 3456789999999999999999999999999999998887 7789999999999876554
No 94
>KOG4210|consensus
Probab=98.01 E-value=2.6e-06 Score=52.50 Aligned_cols=57 Identities=33% Similarity=0.605 Sum_probs=50.5
Q ss_pred CCeEE-EeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 20 GSKIL-VRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 20 ~~~l~-v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
..++| +++|+..++.++|+..|..+|.|..++++.++ .++..+|++|+.|.......
T Consensus 184 s~~~~~~~~~~f~~~~d~~~~~~~~~~~i~~~r~~~~~-~s~~~kg~a~~~~~~~~~~~ 241 (285)
T KOG4210|consen 184 SDTIFFVGELDFSLTRDDLKEHFVSSGEITSVRLPTDE-ESGDSKGFAYVDFSAGNSKK 241 (285)
T ss_pred cccceeecccccccchHHHhhhccCcCcceeeccCCCC-CccchhhhhhhhhhhchhHH
Confidence 34555 99999999999999999999999999999887 89999999999998877654
No 95
>KOG1190|consensus
Probab=98.00 E-value=7.6e-06 Score=52.33 Aligned_cols=52 Identities=21% Similarity=0.367 Sum_probs=44.9
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
..+.+.++|||++++|+++..++.+||.|..+.+...+ --||++|.+.++|-
T Consensus 27 pSkV~HlRnlp~e~tE~elI~Lg~pFG~vtn~~~lkGk-------nQAflem~d~~sAv 78 (492)
T KOG1190|consen 27 PSKVVHLRNLPWEVTEEELISLGLPFGKVTNLLMLKGK-------NQAFLEMADEESAV 78 (492)
T ss_pred CcceeEeccCCccccHHHHHHhcccccceeeeeeeccc-------hhhhhhhcchhhhh
Confidence 56789999999999999999999999999998876652 26899999988874
No 96
>KOG4206|consensus
Probab=97.90 E-value=4.6e-05 Score=45.20 Aligned_cols=55 Identities=27% Similarity=0.529 Sum_probs=46.3
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
-.....+|+.|||.+++.+.+..+|.+|.....++++... .+.|||+|.+...+.
T Consensus 143 ~ppn~ilf~~niP~es~~e~l~~lf~qf~g~keir~i~~~------~~iAfve~~~d~~a~ 197 (221)
T KOG4206|consen 143 APPNNILFLTNIPSESESEMLSDLFEQFPGFKEIRLIPPR------SGIAFVEFLSDRQAS 197 (221)
T ss_pred CCCceEEEEecCCcchhHHHHHHHHhhCcccceeEeccCC------CceeEEecchhhhhH
Confidence 3456789999999999999999999999989999887652 488999998876643
No 97
>KOG0129|consensus
Probab=97.83 E-value=8.1e-05 Score=48.74 Aligned_cols=58 Identities=22% Similarity=0.498 Sum_probs=43.6
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeeccc--CCCCCcce---EEEEEecCHHHHh
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKM--VGSGLHRG---FGFVEFITKNEAK 77 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~--~~~~~~~g---~~fv~f~~~~~a~ 77 (79)
-.++|||++||++++|+.|...|..||.+ .+.++... ...--++| |+|+.|+++.++.
T Consensus 258 ~S~KVFvGGlp~dise~~i~~~F~~FGs~-~VdWP~k~~~~~~~ppkGs~~YvflvFe~E~sV~ 320 (520)
T KOG0129|consen 258 YSRKVFVGGLPWDITEAQINASFGQFGSV-KVDWPGKANSRGRAPPKGSYGYVFLVFEDERSVQ 320 (520)
T ss_pred cccceeecCCCccccHHHHHhhcccccce-EeecCCCccccccCCCCCcccEEEEEecchHHHH
Confidence 35789999999999999999999999964 55666321 01112345 9999999988764
No 98
>KOG1995|consensus
Probab=97.81 E-value=4.3e-05 Score=47.97 Aligned_cols=62 Identities=19% Similarity=0.291 Sum_probs=53.2
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCee--------EEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELK--------FVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~--------~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....-+|||.+||..+++.++..+|.+||.|. .|.+..++ .++..+|-|.|.|.++..|+.|
T Consensus 63 ~s~~~ti~v~g~~d~~~~~~~~~~f~qcg~ikrnK~t~kPki~~y~dk-eT~~~KGeatvS~~D~~~akaa 132 (351)
T KOG1995|consen 63 KSDNETIFVWGCPDSVCENDNADFFLQCGVIKRNKRTGKPKIKIYTDK-ETGAPKGEATVSYEDPPAAKAA 132 (351)
T ss_pred ccccccceeeccCccchHHHHHHHHhhcceeccCCCCCCcchhccccc-cccCcCCceeeeecChhhhhhh
Confidence 45567999999999999999999999999774 35566676 8999999999999999988765
No 99
>KOG0129|consensus
Probab=97.77 E-value=0.0001 Score=48.34 Aligned_cols=61 Identities=16% Similarity=0.298 Sum_probs=51.4
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhH-hcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFK-AFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~-~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
-+..+|||||+||.-++..+|-.+|. -||.|..+-|=.|+ .-+-++|-|-|+|.+..+-.+
T Consensus 367 lDprrTVFVGgvprpl~A~eLA~imd~lyGgV~yaGIDtD~-k~KYPkGaGRVtFsnqqsYi~ 428 (520)
T KOG0129|consen 367 IDPRRTVFVGGLPRPLTAEELAMIMEDLFGGVLYVGIDTDP-KLKYPKGAGRVTFSNQQAYIK 428 (520)
T ss_pred cCccceEEecCCCCcchHHHHHHHHHHhcCceEEEEeccCc-ccCCCCCcceeeecccHHHHH
Confidence 34568999999999999999999997 69999999887775 566688999999998776443
No 100
>KOG1365|consensus
Probab=97.76 E-value=0.00011 Score=46.94 Aligned_cols=60 Identities=25% Similarity=0.377 Sum_probs=46.9
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHh-c---CCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKA-F---GELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~-~---g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+.-.|..++||+++++.++.++|.+ | |....+.++.. ..|+..|-|||.|..+++|+.|
T Consensus 159 ~~qvivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~r--pdgrpTGdAFvlfa~ee~aq~a 222 (508)
T KOG1365|consen 159 ENQVIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTR--PDGRPTGDAFVLFACEEDAQFA 222 (508)
T ss_pred ccceEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEEC--CCCCcccceEEEecCHHHHHHH
Confidence 33456778899999999999999963 2 23455655555 6889999999999999998764
No 101
>KOG2314|consensus
Probab=97.74 E-value=0.00014 Score=48.42 Aligned_cols=61 Identities=21% Similarity=0.395 Sum_probs=49.6
Q ss_pred CCCCCeEEEeCCCCCCCH------HHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQ------SEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~------~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.-...|+|.|+|.--.. .-|.++|+++|++....++.+ ..|..+||.|++|.+..+|+.|
T Consensus 55 eg~D~vVvv~g~PvV~~~rl~klk~vl~kvfsk~gk~vn~~~P~~--e~ggtkG~lf~E~~~~~~A~~a 121 (698)
T KOG2314|consen 55 EGFDSVVVVDGAPVVGPARLEKLKKVLTKVFSKAGKIVNMYYPID--EEGGTKGYLFVEYASMRDAKKA 121 (698)
T ss_pred CCcceEEEECCCcccChhHHHHHHHHHHHHHHhhccccceeeccC--ccCCeeeEEEEEecChhhHHHH
Confidence 355678999999875332 346788999999999999988 5677999999999999998764
No 102
>KOG1365|consensus
Probab=97.71 E-value=3.5e-05 Score=49.19 Aligned_cols=58 Identities=19% Similarity=0.488 Sum_probs=48.8
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCC-eeE--EEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGE-LKF--VRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~-i~~--~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...|.+++||++.+.++|..+|..|-. |.. ++++.+ ..|+..|-|||.|.+.++|..|
T Consensus 280 kdcvRLRGLPy~AtvEdIL~FlgdFa~~i~f~gVHmv~N--~qGrPSGeAFIqm~nae~a~aa 340 (508)
T KOG1365|consen 280 KDCVRLRGLPYEATVEDILDFLGDFATDIRFQGVHMVLN--GQGRPSGEAFIQMRNAERARAA 340 (508)
T ss_pred CCeeEecCCChhhhHHHHHHHHHHHhhhcccceeEEEEc--CCCCcChhhhhhhhhhHHHHHH
Confidence 567899999999999999999998852 433 778777 7899999999999999987653
No 103
>KOG1855|consensus
Probab=97.50 E-value=0.00011 Score=47.29 Aligned_cols=62 Identities=19% Similarity=0.390 Sum_probs=48.7
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecc---c-CCCCCc--------ceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKK---M-VGSGLH--------RGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~---~-~~~~~~--------~g~~fv~f~~~~~a~~a 79 (79)
.+.++|.+.|||.+-.-+-|.++|+.+|.|..|++... + ...+.. +-+|+|+|...+.|.+|
T Consensus 229 l~srtivaenLP~Dh~~enl~kiFg~~G~IksIRIckPgaip~d~r~~~~~~~~~~tk~~AlvEye~~~~A~KA 302 (484)
T KOG1855|consen 229 LPSRTIVAENLPLDHSYENLSKIFGTVGSIKSIRICKPGAIPEDVRGFPKKYFELQTKECALVEYEEVEAARKA 302 (484)
T ss_pred cccceEEEecCCcchHHHHHHHHhhcccceeeeeecCCCCCCcccccCCccchhhhhhhhhhhhhhhhHHHHHH
Confidence 45789999999999988999999999999999987433 1 022222 45789999999888765
No 104
>KOG3152|consensus
Probab=97.48 E-value=4e-05 Score=46.40 Aligned_cols=59 Identities=32% Similarity=0.442 Sum_probs=44.2
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCC--------CCcce----EEEEEecCHHHHhc
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGS--------GLHRG----FGFVEFITKNEAKR 78 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~--------~~~~g----~~fv~f~~~~~a~~ 78 (79)
..-.||+++||+......|+++|..||.|-.|.|.... .+ |.+.. -|+|+|.+...|.+
T Consensus 73 k~GVvylS~IPp~m~~~rlReil~~yGeVGRvylqpE~-~s~~~~r~~~~~n~~~~y~EGWvEF~~KrvAK~ 143 (278)
T KOG3152|consen 73 KTGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQPED-DSKRAARKRKGGNYKKLYSEGWVEFISKRVAKR 143 (278)
T ss_pred cceEEEeccCCCccCHHHHHHHHHhccccceEEecchh-hHHHHHHhhcCCCccccchhHHHHHHHHHHHHH
Confidence 34579999999999999999999999999999885432 22 22222 26788887776653
No 105
>KOG0128|consensus
Probab=97.48 E-value=7.3e-05 Score=51.41 Aligned_cols=57 Identities=25% Similarity=0.397 Sum_probs=51.6
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
...++|.|.|+..|.+.++.++..+|.+.+.+++.. ..|+++|.+||.|.++.++.+
T Consensus 736 K~~v~i~g~pf~gt~e~~k~l~~~~gn~~~~~~vt~--r~gkpkg~a~v~y~~ea~~s~ 792 (881)
T KOG0128|consen 736 KISVAISGPPFQGTKEELKSLASKTGNVTSLRLVTV--RAGKPKGKARVDYNTEADASR 792 (881)
T ss_pred hhhhheeCCCCCCchHHHHhhccccCCccccchhhh--hccccccceeccCCCcchhhh
Confidence 467999999999999999999999999999988887 689999999999999887754
No 106
>KOG4307|consensus
Probab=97.47 E-value=0.00035 Score=47.68 Aligned_cols=57 Identities=23% Similarity=0.476 Sum_probs=48.5
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCe-eEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGEL-KFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i-~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.|-+.|+|++++-++|.++|..|-.+ .+|.+.+. ..|...|-|-|.|++.++|.+|
T Consensus 868 ~V~~~~n~Pf~v~l~dI~~FF~dY~~~p~sI~~r~n--d~G~pTGe~mvAfes~~eAr~A 925 (944)
T KOG4307|consen 868 RVLSCNNFPFDVTLEDIVEFFNDYEPDPNSIRIRRN--DDGVPTGECMVAFESQEEARRA 925 (944)
T ss_pred eEEEecCCCccccHHHHHHHhcccccCCCceeEeec--CCCCcccceeEeecCHHHHHhh
Confidence 478899999999999999999998644 34566565 7899999999999999999875
No 107
>KOG0105|consensus
Probab=97.44 E-value=0.00095 Score=39.11 Aligned_cols=56 Identities=18% Similarity=0.497 Sum_probs=47.0
Q ss_pred cccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 14 NVAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 14 ~~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
++.......+.|.+||++.++.+|+.....-|.+....+..+ |++.|+|-..++-+
T Consensus 109 ppsrrSe~RVvVsGLp~SgSWQDLKDHmReaGdvCfadv~rD--------g~GvV~~~r~eDMk 164 (241)
T KOG0105|consen 109 PPSRRSEYRVVVSGLPPSGSWQDLKDHMREAGDVCFADVQRD--------GVGVVEYLRKEDMK 164 (241)
T ss_pred CcccccceeEEEecCCCCCchHHHHHHHHhhCCeeeeeeecc--------cceeeeeeehhhHH
Confidence 344566788999999999999999999999999988887777 67889988877643
No 108
>KOG0128|consensus
Probab=97.34 E-value=9.4e-06 Score=55.57 Aligned_cols=59 Identities=24% Similarity=0.372 Sum_probs=49.7
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
-.++|+.||+..+.+.+|...|.++|.+..+++.... ..++.+|+||++|..++++.+|
T Consensus 667 ~~~~fvsnl~~~~~~~dl~~~~~~~~~~e~vqi~~h~-n~~~~rG~~Y~~F~~~~~~~aa 725 (881)
T KOG0128|consen 667 LIKIFVSNLSPKMSEEDLSERFSPSGTIEVVQIVIHK-NEKRFRGKAYVEFLKPEHAGAA 725 (881)
T ss_pred HHHHHHhhcchhhcCchhhhhcCccchhhhHHHHHHh-hccccccceeeEeecCCchhhh
Confidence 4678999999999999999999999988777665332 5788999999999999887654
No 109
>KOG0115|consensus
Probab=97.32 E-value=0.00015 Score=43.94 Aligned_cols=56 Identities=25% Similarity=0.371 Sum_probs=48.2
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
..|||.||+.-++.+.+.+-|+.||.|..-.+..| ..++..+-++|.|...-.+.+
T Consensus 32 a~l~V~nl~~~~sndll~~~f~~fg~~e~av~~vD--~r~k~t~eg~v~~~~k~~a~~ 87 (275)
T KOG0115|consen 32 AELYVVNLMQGASNDLLEQAFRRFGPIERAVAKVD--DRGKPTREGIVEFAKKPNARK 87 (275)
T ss_pred ceEEEEecchhhhhHHHHHhhhhcCccchheeeec--ccccccccchhhhhcchhHHH
Confidence 67999999999999999999999999988777777 678888889999987666544
No 110
>PF08675 RNA_bind: RNA binding domain; InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=97.30 E-value=0.0012 Score=33.82 Aligned_cols=49 Identities=24% Similarity=0.449 Sum_probs=34.3
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
....+++ .|.+....+|.++|++||.| .|.++.+ .-|||.....+.|..
T Consensus 9 dHVFhlt-FPkeWK~~DI~qlFspfG~I-~VsWi~d--------TSAfV~l~~r~~~~~ 57 (87)
T PF08675_consen 9 DHVFHLT-FPKEWKTSDIYQLFSPFGQI-YVSWIND--------TSAFVALHNRDQAKV 57 (87)
T ss_dssp CCEEEEE---TT--HHHHHHHCCCCCCE-EEEEECT--------TEEEEEECCCHHHHH
T ss_pred ceEEEEe-CchHhhhhhHHHHhccCCcE-EEEEEcC--------CcEEEEeecHHHHHH
Confidence 3455555 99999999999999999975 5666666 358999988887653
No 111
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=97.30 E-value=0.00096 Score=34.28 Aligned_cols=48 Identities=25% Similarity=0.374 Sum_probs=31.2
Q ss_pred CeEEEeCCCCCCCHHHH----HHHhHhcC-CeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 21 SKILVRNIPFQAKQSEV----EELFKAFG-ELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l----~~~f~~~g-~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..|+|.|||...+...+ ++++..|| .|..+ ..+.|++.|.+++.|.+|
T Consensus 3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v-----------~~~tAilrF~~~~~A~RA 55 (90)
T PF11608_consen 3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSV-----------SGGTAILRFPNQEFAERA 55 (90)
T ss_dssp EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE-------------TT-EEEEESSHHHHHHH
T ss_pred cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEE-----------eCCEEEEEeCCHHHHHHH
Confidence 46899999999987665 55555776 34443 126799999999999875
No 112
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=97.29 E-value=0.00048 Score=43.56 Aligned_cols=61 Identities=25% Similarity=0.392 Sum_probs=42.5
Q ss_pred CCCeEEEeCCCCCCCHHHH------HHHhHhcCCeeEEEeecccCCCCCcceE--EEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEV------EELFKAFGELKFVRLPKKMVGSGLHRGF--GFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l------~~~f~~~g~i~~~~~~~~~~~~~~~~g~--~fv~f~~~~~a~~a 79 (79)
+..-+||-+||+.+..+++ .++|.+||.|..|.+-..........+. .||+|.+.++|.+|
T Consensus 113 QKNLvYVigi~pkva~Ee~~~vLk~~eyFGQyGkI~KIvvNkkt~s~nst~~h~gvYITy~~kedAarc 181 (480)
T COG5175 113 QKNLVYVIGIPPKVADEEVAPVLKRHEYFGQYGKIKKIVVNKKTSSLNSTASHAGVYITYSTKEDAARC 181 (480)
T ss_pred ecceeEEecCCCCCCcccccccccchhhhhhccceeEEEecccccccccccccceEEEEecchHHHHHH
Confidence 3456899999999877662 5889999999998764431011112232 39999999998764
No 113
>KOG4676|consensus
Probab=97.23 E-value=0.00039 Score=44.59 Aligned_cols=58 Identities=21% Similarity=0.288 Sum_probs=46.0
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeeccc--CCCCCcceEEEEEecCHHHHh
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKM--VGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~--~~~~~~~g~~fv~f~~~~~a~ 77 (79)
...|-|.||.+.++.++++.+|...|.|..+.|.... ..-......|||.|.+...+.
T Consensus 7 ~~vIqvanispsat~dqm~tlFg~lGkI~elrlyp~~~d~~~pv~sRtcyVkf~d~~sv~ 66 (479)
T KOG4676|consen 7 LGVIQVANISPSATKDQMQTLFGNLGKIPELRLYPNVDDSKIPVISRTCYVKFLDSQSVT 66 (479)
T ss_pred CceeeecccCchhhHHHHHHHHhhccccccccccCCCCCccCcceeeeEEEeccCCccee
Confidence 3489999999999999999999999999999885432 122334578999998877654
No 114
>KOG0106|consensus
Probab=97.08 E-value=0.00054 Score=40.79 Aligned_cols=54 Identities=31% Similarity=0.652 Sum_probs=44.6
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....+.+.+.|++..+.+.+|...|.++|.+....+ ..+++||+|...+++.+|
T Consensus 96 ~~s~~r~~~~~~~~r~~~qdl~d~~~~~g~~~~~~~---------~~~~~~v~Fs~~~da~ra 149 (216)
T KOG0106|consen 96 SRTHFRLIVRNLSLRVSWQDLKDHFRPAGEVTYVDA---------RRNFAFVEFSEQEDAKRA 149 (216)
T ss_pred ccccceeeeccchhhhhHHHHhhhhcccCCCchhhh---------hccccceeehhhhhhhhc
Confidence 455789999999999999999999999998743322 357899999999998764
No 115
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=97.00 E-value=0.0022 Score=33.91 Aligned_cols=60 Identities=13% Similarity=0.118 Sum_probs=36.8
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEE-eecccC-----CCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVR-LPKKMV-----GSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~-~~~~~~-----~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+-|.|-+.|+. ....+.+.|++||.|.+.. +..+.. .......+-.|.|.++.+|++|
T Consensus 5 ~~~wVtVFGfp~~-~~~~Vl~~F~~~G~Ile~~~~~~~~~~~~~~~~~~~~NWi~I~Y~~~~~A~rA 70 (100)
T PF05172_consen 5 SETWVTVFGFPPS-ASNQVLRHFSSFGTILEHFEVLRSSSGINPYPIPSGGNWIHITYDNPLSAQRA 70 (100)
T ss_dssp GCCEEEEE---GG-GHHHHHHHHHCCS-EECEEGGG----------E-CCTTEEEEEESSHHHHHHH
T ss_pred CCeEEEEEccCHH-HHHHHHHHHHhcceEEEeecccccccccccccCCCCCCEEEEECCCHHHHHHH
Confidence 3556778889888 5566788899999987664 111100 0112346899999999999875
No 116
>KOG1456|consensus
Probab=96.92 E-value=0.0048 Score=39.66 Aligned_cols=56 Identities=20% Similarity=0.310 Sum_probs=45.9
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....-.+.|++|-..+++.+|.+-++.||.|..+..+.. +.-+.|+|++.+.|.+|
T Consensus 28 ~~~spvvhvr~l~~~v~eadl~eal~~fG~i~yvt~~P~-------~r~alvefedi~~akn~ 83 (494)
T KOG1456|consen 28 PNPSPVVHVRGLHQGVVEADLVEALSNFGPIAYVTCMPH-------KRQALVEFEDIEGAKNC 83 (494)
T ss_pred CCCCceEEEeccccccchhHHHHHHhcCCceEEEEeccc-------cceeeeeeccccchhhh
Confidence 344567999999999999999999999999988866443 25688999988877654
No 117
>KOG4307|consensus
Probab=96.89 E-value=0.00094 Score=45.71 Aligned_cols=60 Identities=18% Similarity=0.243 Sum_probs=48.9
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeE-EEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKF-VRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~-~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
...+..|||..||..+++.++.++|...-.|++ |.|... .++..++.|||.|..++++..
T Consensus 431 ~~ag~~lyv~~lP~~t~~~~~v~~f~~~~~Ved~I~lt~~--P~~~~~~~afv~F~~~~a~~~ 491 (944)
T KOG4307|consen 431 GGAGGALYVFQLPVMTPIVPPVNKFMGAAAVEDFIELTRL--PTDLLRPAAFVAFIHPTAPLT 491 (944)
T ss_pred CCccceEEeccCCccccccchhhhhhhhhhhhheeEeccC--Ccccccchhhheeccccccch
Confidence 455789999999999999999999987666655 666555 688889999999998776554
No 118
>KOG0112|consensus
Probab=96.57 E-value=0.00058 Score=47.52 Aligned_cols=58 Identities=14% Similarity=0.289 Sum_probs=46.8
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHH
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEA 76 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a 76 (79)
.....+||++||+..+++.+++..|..+|.+..|.+-..+ -+.-..|+||.|.+...+
T Consensus 369 ~~atrTLf~Gnl~~kl~eseiR~af~e~gkve~VDiKtP~--~~~esa~~f~~~~n~dmt 426 (975)
T KOG0112|consen 369 FRATRTLFLGNLDSKLTESEIRPAFDESGKVEEVDIKTPH--IKTESAYAFVSLLNTDMT 426 (975)
T ss_pred hhhhhhhhhcCcccchhhhhhhhhhhhhccccccccccCC--CCcccchhhhhhhccccC
Confidence 4557899999999999999999999999999999886652 344456888888765543
No 119
>KOG0112|consensus
Probab=96.43 E-value=0.0041 Score=43.59 Aligned_cols=56 Identities=9% Similarity=0.291 Sum_probs=46.9
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....+.+++++|..++....+...|..||.|..|.+-.. ..|+||.|++...++.|
T Consensus 452 st~ttr~~sgglg~w~p~~~l~r~fd~fGpir~Idy~hg-------q~yayi~yes~~~aq~a 507 (975)
T KOG0112|consen 452 STPTTRLQSGGLGPWSPVSRLNREFDRFGPIRIIDYRHG-------QPYAYIQYESPPAAQAA 507 (975)
T ss_pred cccceeeccCCCCCCChHHHHHHHhhccCcceeeecccC-------CcceeeecccCccchhh
Confidence 345678999999999999999999999999999877443 37999999998877653
No 120
>KOG1190|consensus
Probab=96.43 E-value=0.0087 Score=38.86 Aligned_cols=55 Identities=16% Similarity=0.190 Sum_probs=41.7
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
+..++.+.|+|.+++++++++.|..-|...+.... -++.+.++.+.+.+.|+|-.
T Consensus 413 psatlHlsnip~svsee~lk~~f~~~g~~vkafkf-----f~kd~kmal~q~~sveeA~~ 467 (492)
T KOG1190|consen 413 PSATLHLSNIPPSVSEEDLKNLFQEPGGQVKAFKF-----FQKDRKMALPQLESVEEAIQ 467 (492)
T ss_pred chhheeeccCCcccchhHHHHhhhcCCceEEeeee-----cCCCcceeecccCChhHhhh
Confidence 34589999999999999999999887765443321 23345789999999988754
No 121
>PF10309 DUF2414: Protein of unknown function (DUF2414); InterPro: IPR019416 This entry contains proteins that have no known function.
Probab=96.42 E-value=0.034 Score=26.92 Aligned_cols=52 Identities=12% Similarity=0.140 Sum_probs=39.9
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhc---CCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAF---GELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~---g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.+..|+|.++.. .+.++++.+|..| .....+.++.| .-|-|.|.+.+.|.+|
T Consensus 4 rpeavhirGvd~-lsT~dI~~y~~~y~~~~~~~~IEWIdD--------tScNvvf~d~~~A~~A 58 (62)
T PF10309_consen 4 RPEAVHIRGVDE-LSTDDIKAYFSEYFDEEGPFRIEWIDD--------TSCNVVFKDEETAARA 58 (62)
T ss_pred eeceEEEEcCCC-CCHHHHHHHHHHhcccCCCceEEEecC--------CcEEEEECCHHHHHHH
Confidence 456799999854 6778899999888 13567888887 2367889999998765
No 122
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=96.36 E-value=0.0051 Score=35.60 Aligned_cols=61 Identities=11% Similarity=0.182 Sum_probs=32.9
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHh-cCCe---eEEEeeccc-CCCCCcceEEEEEecCHHHHh
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKA-FGEL---KFVRLPKKM-VGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~-~g~i---~~~~~~~~~-~~~~~~~g~~fv~f~~~~~a~ 77 (79)
..+..+|.|++||+..+++++.+.+.+ ++.- ..+.-.... ........-|||.|.+.+++.
T Consensus 4 ~~~~~KvVIR~LPP~LteeeF~~~i~~~l~~~~~w~y~~g~~~~~~~~~~~~SRaYi~F~~~~~~~ 69 (176)
T PF03467_consen 4 EKEGTKVVIRRLPPNLTEEEFWEQISPWLPDEWDWYYFQGKYGKKSFKPPTYSRAYINFKNPEDLL 69 (176)
T ss_dssp -----EEEEEEE-TTS-HHHHCCCCSS--SSE---EEEEEEES-SSSTTS--EEEEEEESSCHHHH
T ss_pred cccCceEEEeCCCCCCCHHHHHHHhhhhcccccceEEEecCCCCccCCCCcceEEEEEeCCHHHHH
Confidence 345679999999999999998886665 5543 333311111 011112356999999987753
No 123
>KOG2193|consensus
Probab=95.66 E-value=0.011 Score=38.75 Aligned_cols=51 Identities=24% Similarity=0.300 Sum_probs=36.7
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcC-CeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFG-ELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g-~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
..+|++||.+.++..++..+|..-- ....-.|+. .||+||.+.+...|.+|
T Consensus 2 nklyignL~p~~~psdl~svfg~ak~~~~g~fl~k--------~gyafvd~pdq~wa~ka 53 (584)
T KOG2193|consen 2 NKLYIGNLSPQVTPSDLESVFGDAKIPGSGQFLVK--------SGYAFVDCPDQQWANKA 53 (584)
T ss_pred CcccccccCCCCChHHHHHHhccccCCCCcceeee--------cceeeccCCchhhhhhh
Confidence 4689999999999999999996431 111222333 28999999988877543
No 124
>KOG0120|consensus
Probab=95.19 E-value=0.039 Score=36.81 Aligned_cols=44 Identities=25% Similarity=0.404 Sum_probs=30.9
Q ss_pred HHHHHhHhcCCeeEEEeecc-cC-CCCCcceEEEEEecCHHHHhcC
Q psy3346 36 EVEELFKAFGELKFVRLPKK-MV-GSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 36 ~l~~~f~~~g~i~~~~~~~~-~~-~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+++.-+.+||.|..|.+++. .. ...-..|-.||+|.+.+++++|
T Consensus 425 dvr~ec~k~g~v~~v~ipr~~~~~~~~~G~GkVFVefas~ed~qrA 470 (500)
T KOG0120|consen 425 DVRTECAKFGAVRSVEIPRPYPDENPVPGTGKVFVEFADTEDSQRA 470 (500)
T ss_pred HHHHHhcccCceeEEecCCCCCCCCcCCCcccEEEEecChHHHHHH
Confidence 34455567899999988776 20 1222346789999999998875
No 125
>KOG1548|consensus
Probab=94.61 E-value=0.08 Score=33.84 Aligned_cols=60 Identities=17% Similarity=0.313 Sum_probs=43.1
Q ss_pred ccCCCCCeEEEeCCCC----CCC-------HHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 15 VAKQTGSKILVRNIPF----QAK-------QSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~----~~~-------~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
......++|.+.|+=. ..+ .++|.+-..+||.|.++.+.-. .+.|.+-|.|.+.++|+.|
T Consensus 260 sk~r~~~tVi~kn~Ftp~~~~~~~~l~~dlkedl~eec~K~G~v~~vvv~d~-----hPdGvvtV~f~n~eeA~~c 330 (382)
T KOG1548|consen 260 SKARADRTVILKNMFTPEDFEKNPDLLNDLKEDLTEECEKFGQVRKVVVYDR-----HPDGVVTVSFRNNEEADQC 330 (382)
T ss_pred ccccCCcEEEeeecCCHHHhccCHHHHHHHHHHHHHHHHHhCCcceEEEecc-----CCCceeEEEeCChHHHHHH
Confidence 3445678899998721 223 2456677889999999876432 3568899999999998764
No 126
>KOG1456|consensus
Probab=94.50 E-value=0.23 Score=32.29 Aligned_cols=59 Identities=19% Similarity=0.273 Sum_probs=47.9
Q ss_pred ccCCCCCeEEEeCCCCCC-CHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 15 VAKQTGSKILVRNIPFQA-KQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~-~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
....+++.+.|-+|.-.. .-+.|..+|-.||.|..|+++..+ .|.|.|++.+..+.++|
T Consensus 282 ~g~~~g~VmMVyGLdh~k~N~drlFNl~ClYGNV~rvkFmkTk------~gtamVemgd~~aver~ 341 (494)
T KOG1456|consen 282 GGGAPGCVMMVYGLDHGKMNCDRLFNLFCLYGNVERVKFMKTK------PGTAMVEMGDAYAVERA 341 (494)
T ss_pred CCCCCCcEEEEEeccccccchhhhhhhhhhcCceeeEEEeecc------cceeEEEcCcHHHHHHH
Confidence 345667899999998764 556789999999999999998874 47799999998887764
No 127
>KOG2416|consensus
Probab=94.42 E-value=0.039 Score=37.52 Aligned_cols=56 Identities=14% Similarity=0.188 Sum_probs=41.9
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHh-cCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKA-FGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~-~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
........|||.||-.-.|...|++++.. +|.|... |+- +-+..|||.|.+.++|-
T Consensus 439 sR~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~-WmD------kIKShCyV~yss~eEA~ 495 (718)
T KOG2416|consen 439 SRKEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEF-WMD------KIKSHCYVSYSSVEEAA 495 (718)
T ss_pred CCCCccceEeeecccccchHHHHHHHHhhccCchHHH-HHH------HhhcceeEecccHHHHH
Confidence 33445678999999999999999999985 5555554 322 23567999999988874
No 128
>PF15023 DUF4523: Protein of unknown function (DUF4523)
Probab=94.25 E-value=0.22 Score=28.28 Aligned_cols=54 Identities=24% Similarity=0.231 Sum_probs=36.5
Q ss_pred CCCCCeEEEeCCCCCC----CHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 17 KQTGSKILVRNIPFQA----KQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~----~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
+..-.+|.|+=|.... +-..+...++.||+|.++.+-- +.-|.|.|.+..+|=+
T Consensus 83 epPMsTIVVRWlkknm~~~edl~sV~~~Ls~fGpI~SVT~cG--------rqsavVvF~d~~SAC~ 140 (166)
T PF15023_consen 83 EPPMSTIVVRWLKKNMQPTEDLKSVIQRLSVFGPIQSVTLCG--------RQSAVVVFKDITSACK 140 (166)
T ss_pred CCCceeEEeehhhhcCChHHHHHHHHHHHHhcCCcceeeecC--------CceEEEEehhhHHHHH
Confidence 4556778787555444 3345666678899999986422 2458999998887643
No 129
>KOG2591|consensus
Probab=94.01 E-value=0.19 Score=34.14 Aligned_cols=54 Identities=17% Similarity=0.313 Sum_probs=41.9
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhH--hcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFK--AFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~--~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...|.+.++-||..+-.++++.+|+ .|-.+.+|.+-.+. + =||+|++..+|+.|
T Consensus 173 ~kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa~N~-------n-WyITfesd~DAQqA 228 (684)
T KOG2591|consen 173 HKRCIVILREIPETTPIEVVKALFKGENCPKVISCEFAHND-------N-WYITFESDTDAQQA 228 (684)
T ss_pred cceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeeeecC-------c-eEEEeecchhHHHH
Confidence 3456788889999999999999995 47777777764431 2 38999999999865
No 130
>KOG2068|consensus
Probab=93.86 E-value=0.034 Score=35.15 Aligned_cols=61 Identities=16% Similarity=0.296 Sum_probs=41.4
Q ss_pred CCCeEEEeCCCCCCCHHHH---HHHhHhcCCeeEEEeecccCCCCC--cceEEEEEecCHHHHhcC
Q psy3346 19 TGSKILVRNIPFQAKQSEV---EELFKAFGELKFVRLPKKMVGSGL--HRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l---~~~f~~~g~i~~~~~~~~~~~~~~--~~g~~fv~f~~~~~a~~a 79 (79)
+..-+|+.+|+.....+.+ .+.|.+||.|..+..-.++..... ...-++|+|...++|..|
T Consensus 76 qknlvyvvgl~~~~ade~~l~~~eyfgqygki~ki~~~~~~S~~s~~~~~~s~yITy~~~eda~rc 141 (327)
T KOG2068|consen 76 QKNLVYVVGLPLDLADESVLERTEYFGQYGKINKIVKNKDPSSSSSSGGTCSVYITYEEEEDADRC 141 (327)
T ss_pred hhhhhhhhCCCccccchhhhhCcccccccccceEEeecCCcccccCCCCCCcccccccchHhhhhH
Confidence 3456888899988766654 467888999999877665310111 112379999998887654
No 131
>KOG4410|consensus
Probab=92.37 E-value=0.48 Score=29.86 Aligned_cols=49 Identities=12% Similarity=0.143 Sum_probs=36.3
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCe-eEEEeecccCCCCCcceEEEEEecCHHH
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGEL-KFVRLPKKMVGSGLHRGFGFVEFITKNE 75 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i-~~~~~~~~~~~~~~~~g~~fv~f~~~~~ 75 (79)
...|+++|||.++.-.+|+..+.+.+.+ .++.|. .+.|-||..|.+...
T Consensus 330 ~~di~~~nl~rd~rv~dlk~~lr~~~~~pm~iswk-------g~~~k~flh~~~~~~ 379 (396)
T KOG4410|consen 330 KTDIKLTNLSRDIRVKDLKSELRKRECTPMSISWK-------GHFGKCFLHFGNRKG 379 (396)
T ss_pred ccceeeccCccccchHHHHHHHHhcCCCceeEeee-------cCCcceeEecCCccC
Confidence 4679999999999999999999887642 333331 245779999977543
No 132
>KOG4210|consensus
Probab=92.32 E-value=0.051 Score=33.82 Aligned_cols=59 Identities=19% Similarity=0.068 Sum_probs=46.5
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
..++.|++++.+.+.+.+...++...|......+.... ....++|++++.|...+.+..
T Consensus 87 ~~~~~f~g~~s~~~e~~~~~~~~~~~g~~~~~~~S~~~-~~~~sk~~~s~~f~~ks~~~~ 145 (285)
T KOG4210|consen 87 SSSTFFVGELSENIEESEDDNFSSEAGLRVDARSSSLE-DSLSSKGGLSVHFAGKSQFFA 145 (285)
T ss_pred ccccccccccccchhhccccccchhhcCcccchhhhhc-cccccccceeeccccHHHHHH
Confidence 46789999999999998888888888866555554433 567788999999999887654
No 133
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=91.95 E-value=0.61 Score=22.84 Aligned_cols=39 Identities=15% Similarity=0.342 Sum_probs=27.8
Q ss_pred CCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 31 QAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 31 ~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
.++-.+++..|..|+ ...|. .+ .+ | =||.|.+..+|++|
T Consensus 11 ~~~v~d~K~~Lr~y~-~~~I~--~d--~t----G-fYIvF~~~~Ea~rC 49 (66)
T PF11767_consen 11 GVTVEDFKKRLRKYR-WDRIR--DD--RT----G-FYIVFNDSKEAERC 49 (66)
T ss_pred CccHHHHHHHHhcCC-cceEE--ec--CC----E-EEEEECChHHHHHH
Confidence 467788999999996 45544 33 12 3 37999999999875
No 134
>KOG4660|consensus
Probab=91.79 E-value=0.21 Score=33.68 Aligned_cols=55 Identities=16% Similarity=0.350 Sum_probs=32.3
Q ss_pred eEEEeCCCCCCCHHHHHHHhH-hcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 22 KILVRNIPFQAKQSEVEELFK-AFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~-~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
++.|.|+|...|...|.+... ..|.-..+.|+.|- .+..+.|||||.|-+++++.
T Consensus 390 t~~iknipNK~T~~ml~~~d~~~~gtYDFlYLPiDF-~nkcNvGYAFINm~sp~ai~ 445 (549)
T KOG4660|consen 390 TLMIKNIPNKYTSKMLLAADEKNKGTYDFLYLPIDF-KNKCNVGYAFINMTSPEAII 445 (549)
T ss_pred hhHhhccCchhhHHhhhhhhccccCccceEEecccc-ccccccceeEEeecCHHHHH
Confidence 344444444444433333321 23444556666664 56668899999999988764
No 135
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=90.74 E-value=0.78 Score=28.90 Aligned_cols=61 Identities=13% Similarity=0.177 Sum_probs=43.7
Q ss_pred cCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeeccc------CCCCCcceEEEEEecCHHHH
Q psy3346 16 AKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKM------VGSGLHRGFGFVEFITKNEA 76 (79)
Q Consensus 16 ~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~------~~~~~~~g~~fv~f~~~~~a 76 (79)
+....+.|.+.|+..+++--.+..-|.+||+|+++.++.+. ....+......+.|-+.+.+
T Consensus 11 D~YrTRSLLfeNv~~sidLh~Fl~~fv~~~pIESiYL~~~~~~~~d~~~~d~~~~SilLSFlsr~~C 77 (309)
T PF10567_consen 11 DEYRTRSLLFENVNNSIDLHSFLTKFVKFGPIESIYLIKSNDKPSDDYNDDKNNQSILLSFLSREIC 77 (309)
T ss_pred ccceeHHHHHhhccccccHHHHHHHhhccCceeEEEEecCCCcccccccccccceEEEEeeechHHH
Confidence 34456778899999999888888889999999999987652 01222234556777666654
No 136
>KOG1996|consensus
Probab=90.58 E-value=0.75 Score=29.13 Aligned_cols=44 Identities=20% Similarity=0.153 Sum_probs=31.2
Q ss_pred HHHHHHhHhcCCeeEEEeecccCCCCCcc-eEEEEEecCHHHHhcC
Q psy3346 35 SEVEELFKAFGELKFVRLPKKMVGSGLHR-GFGFVEFITKNEAKRV 79 (79)
Q Consensus 35 ~~l~~~f~~~g~i~~~~~~~~~~~~~~~~-g~~fv~f~~~~~a~~a 79 (79)
+++++-+++||.|..|.+...+ ...... ---||+|...++|.+|
T Consensus 301 de~keEceKyg~V~~viifeip-~~p~deavRiFveF~r~e~aiKA 345 (378)
T KOG1996|consen 301 DETKEECEKYGKVGNVIIFEIP-SQPEDEAVRIFVEFERVESAIKA 345 (378)
T ss_pred HHHHHHHHhhcceeeEEEEecC-CCccchhheeeeeeccHHHHHHH
Confidence 4677888999999988776654 222222 2369999999988764
No 137
>KOG2891|consensus
Probab=90.55 E-value=0.61 Score=29.39 Aligned_cols=54 Identities=26% Similarity=0.468 Sum_probs=36.7
Q ss_pred CCCCCeEEEeCCCCC------------CCHHHHHHHhHhcCCeeEEEeec-cc---CCCCCcceEEEEEe
Q psy3346 17 KQTGSKILVRNIPFQ------------AKQSEVEELFKAFGELKFVRLPK-KM---VGSGLHRGFGFVEF 70 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~------------~~~~~l~~~f~~~g~i~~~~~~~-~~---~~~~~~~g~~fv~f 70 (79)
...+.+||+.+||-. .+++.|+..|..||.|..+.++. ++ ..+|+..|..|-.|
T Consensus 146 gerpdti~la~ip~kwf~lkedg~~dlpse~rlr~a~eafg~ir~vdipicdplr~~mn~kisgiq~~gf 215 (445)
T KOG2891|consen 146 GERPDTIHLAGIPCKWFALKEDGSEDLPSEDRLRKAFEAFGEIRNVDIPICDPLREEMNGKISGIQFHGF 215 (445)
T ss_pred CCCCCceeecCCcceeeeecccccccCChHHHHHHHHHHhccceecCCcccchhHHHhcCccccceeecc
Confidence 345678888888753 46778999999999999988853 22 13555545444333
No 138
>PF07576 BRAP2: BRCA1-associated protein 2; InterPro: IPR011422 These proteins include BRCA1-associated protein 2 (BRAP2), which binds nuclear localisation signals (NLSs) in vitro and in yeast two-hybrid screening []. These proteins share a region of sequence similarity at their N terminus. They also have IPR001607 from INTERPRO at the C terminus.
Probab=90.48 E-value=1.7 Score=23.40 Aligned_cols=56 Identities=13% Similarity=0.167 Sum_probs=35.7
Q ss_pred CCeEEEeCCCCC-CCHHHHHHHhHhcC-CeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQ-AKQSEVEELFKAFG-ELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~-~~~~~l~~~f~~~g-~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
.+.+.+--.|+. ++.+.|..+..++- .|..++++++. ..++-.+.+.|.+.++|+.
T Consensus 12 ~~~~~~l~vp~~~~~~d~l~~f~~~~~~~i~~~riird~---~pnrymVLikF~~~~~Ad~ 69 (110)
T PF07576_consen 12 RSTLCCLAVPPYMTPSDFLLFFGAPFREDIEHIRIIRDG---TPNRYMVLIKFRDQESADE 69 (110)
T ss_pred CceEEEEEeCcccccHHHHHHhhhcccccEEEEEEeeCC---CCceEEEEEEECCHHHHHH
Confidence 344444444554 45555655555553 57788888872 2356778999999998863
No 139
>PF03468 XS: XS domain; InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important. The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=89.62 E-value=1 Score=24.46 Aligned_cols=49 Identities=29% Similarity=0.362 Sum_probs=25.4
Q ss_pred CeEEEeCCCCC---------CCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCH
Q psy3346 21 SKILVRNIPFQ---------AKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITK 73 (79)
Q Consensus 21 ~~l~v~nl~~~---------~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~ 73 (79)
-++.|.|++.+ .+.+.|++.|..|.++. ++...++ ..+.|++.|.|.+.
T Consensus 9 wmgIi~N~~~~~~~~g~~~g~~~~~l~~~l~~f~p~k-v~~l~~~---~gh~g~aiv~F~~~ 66 (116)
T PF03468_consen 9 WMGIIVNIPTEKDDDGRWVGMSNEELLDKLAEFNPLK-VKPLYGK---QGHTGFAIVEFNKD 66 (116)
T ss_dssp -EEEEE----EE-TTS-EE---SHHHHHHHHH---SE-EEEEEET---TEEEEEEEEE--SS
T ss_pred CEEEEEcCccccCCCCceeccCHHHHHHHHHhcCCce-eEECcCC---CCCcEEEEEEECCC
Confidence 34566677554 34578999999998764 4444442 35789999999764
No 140
>PF15513 DUF4651: Domain of unknown function (DUF4651)
Probab=89.36 E-value=1.5 Score=21.21 Aligned_cols=18 Identities=22% Similarity=0.654 Sum_probs=14.9
Q ss_pred HHHHHHhHhcCCeeEEEe
Q psy3346 35 SEVEELFKAFGELKFVRL 52 (79)
Q Consensus 35 ~~l~~~f~~~g~i~~~~~ 52 (79)
.+|+++|+..|.|.-+.+
T Consensus 9 ~~iR~~fs~lG~I~vLYv 26 (62)
T PF15513_consen 9 AEIRQFFSQLGEIAVLYV 26 (62)
T ss_pred HHHHHHHHhcCcEEEEEE
Confidence 578999999999876654
No 141
>KOG2202|consensus
Probab=88.30 E-value=0.12 Score=31.76 Aligned_cols=35 Identities=17% Similarity=0.300 Sum_probs=25.9
Q ss_pred hcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 43 AFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 43 ~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+||+|..+.+-.+ ......|=+||.|...++|++|
T Consensus 92 kygEiee~~Vc~N--l~~hl~GNVYV~f~~Ee~ae~a 126 (260)
T KOG2202|consen 92 KYGEIEELNVCDN--LGDHLVGNVYVKFRSEEDAEAA 126 (260)
T ss_pred Hhhhhhhhhhhcc--cchhhhhhhhhhcccHHHHHHH
Confidence 8899988754333 3334568899999999999875
No 142
>KOG2253|consensus
Probab=88.01 E-value=0.34 Score=33.39 Aligned_cols=53 Identities=19% Similarity=0.295 Sum_probs=40.4
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
...+...++||+|+...+..+-++.++..||-|.++. +. -|||..|..+....
T Consensus 35 ~~~~~~~~vfv~~~~~~~s~~~~~~il~~~g~v~s~k--r~--------~fgf~~f~~~~~~~ 87 (668)
T KOG2253|consen 35 QPLPPRDTVFVGNISYLVSQEFWKSILAKSGFVPSWK--RD--------KFGFCEFLKHIGDL 87 (668)
T ss_pred cCCCCCceeEecchhhhhhHHHHHHHHhhCCcchhhh--hh--------hhcccchhhHHHHH
Confidence 3445678999999999999999999999999775542 22 27888887766543
No 143
>KOG4285|consensus
Probab=86.94 E-value=1.4 Score=28.07 Aligned_cols=52 Identities=17% Similarity=0.196 Sum_probs=36.0
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+.=|-|-++|+... ..|...|.+||.|.+... +.+--|-+|.|.+..+|++|
T Consensus 197 D~WVTVfGFppg~~-s~vL~~F~~cG~Vvkhv~-------~~ngNwMhirYssr~~A~KA 248 (350)
T KOG4285|consen 197 DTWVTVFGFPPGQV-SIVLNLFSRCGEVVKHVT-------PSNGNWMHIRYSSRTHAQKA 248 (350)
T ss_pred cceEEEeccCccch-hHHHHHHHhhCeeeeeec-------CCCCceEEEEecchhHHHHh
Confidence 34455556776644 456778999999866532 23345889999999998875
No 144
>KOG4008|consensus
Probab=86.82 E-value=0.87 Score=27.87 Aligned_cols=36 Identities=8% Similarity=0.311 Sum_probs=29.6
Q ss_pred cCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEE
Q psy3346 16 AKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVR 51 (79)
Q Consensus 16 ~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~ 51 (79)
...+..++|+.|+|..++++.|..+.+++|-+..+.
T Consensus 36 ~~~eKd~lfl~Nvp~~~tee~lkr~vsqlg~vq~~~ 71 (261)
T KOG4008|consen 36 NSNEKDCLFLVNVPLLSTEEHLKRFVSQLGHVQELL 71 (261)
T ss_pred ccccccceeeecccccccHHHHHHHHHHhhhhhhee
Confidence 345678999999999999999999999998654443
No 145
>KOG0804|consensus
Probab=86.26 E-value=5.5 Score=26.77 Aligned_cols=55 Identities=11% Similarity=0.235 Sum_probs=44.8
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcC-CeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFG-ELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g-~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
.+.|.|-.+|...+-.+|..|...+- .|..++++++. .+ ++-...+.|.+.++|.
T Consensus 74 ~~mLcilaVP~~mt~~Dll~F~~~~~~~I~~irivRd~--~p-nrymvLIkFr~q~da~ 129 (493)
T KOG0804|consen 74 STMLCILAVPAYMTSHDLLRFCASFIKQISDIRIVRDG--MP-NRYMVLIKFRDQADAD 129 (493)
T ss_pred CcEEEEEeccccccHHHHHHHHHHHhhhhheeEEeecC--CC-ceEEEEEEeccchhHH
Confidence 78899999999999999999998764 58899998872 22 3455789999998875
No 146
>KOG4483|consensus
Probab=84.42 E-value=3 Score=27.64 Aligned_cols=51 Identities=12% Similarity=0.169 Sum_probs=39.6
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCC-eeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGE-LKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~-i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
...|-|-++|.....++|...|..|+. --.|.|+.+ -.+|..|.+...|..
T Consensus 391 pHVlEIydfp~efkteDll~~f~~yq~kgfdIkWvDd--------thalaVFss~~~Aae 442 (528)
T KOG4483|consen 391 PHVLEIYDFPDEFKTEDLLKAFETYQNKGFDIKWVDD--------THALAVFSSVNRAAE 442 (528)
T ss_pred cceeEeccCchhhccHHHHHHHHHhhcCCceeEEeec--------ceeEEeecchHHHHH
Confidence 467888899999999999999999863 456677766 457888888777654
No 147
>KOG2135|consensus
Probab=84.24 E-value=0.8 Score=30.65 Aligned_cols=52 Identities=23% Similarity=0.382 Sum_probs=36.8
Q ss_pred CCCCeEEEeCCCCCC-CHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHH
Q psy3346 18 QTGSKILVRNIPFQA-KQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEA 76 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~-~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a 76 (79)
.+.+.+-+.-.|+.. +..+|...|.+||.|..|.+-.. --.|.|+|.+..+|
T Consensus 370 ~dhs~l~lek~~~glnt~a~ln~hfA~fG~i~n~qv~~~-------~~~a~vTF~t~aea 422 (526)
T KOG2135|consen 370 VDHSPLALEKSPFGLNTIADLNPHFAQFGEIENIQVDYS-------SLHAVVTFKTRAEA 422 (526)
T ss_pred cccchhhhhccCCCCchHhhhhhhhhhcCccccccccCc-------hhhheeeeeccccc
Confidence 445666666666664 56789999999999999876433 13467888777665
No 148
>PF08952 DUF1866: Domain of unknown function (DUF1866) ; InterPro: IPR015047 This domain, found in synaptojanin, has no known function. ; PDB: 1UFW_A 2DNR_A.
Probab=84.08 E-value=2.5 Score=24.00 Aligned_cols=53 Identities=19% Similarity=0.246 Sum_probs=31.9
Q ss_pred CCCCeEEEeCCCC------CCCH---HHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 18 QTGSKILVRNIPF------QAKQ---SEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 18 ~~~~~l~v~nl~~------~~~~---~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
+..-||.|.-+.+ ...+ .+|.+.|..||.+.-+++..+ .-+|+|.+-++|.+|
T Consensus 25 PpDaTVvVsv~~~~~~e~~~Fdd~l~~~ll~~~~~~GevvLvRfv~~---------~mwVTF~dg~sALaa 86 (146)
T PF08952_consen 25 PPDATVVVSVDSPSEDEDSSFDDNLMDELLQKFAQYGEVVLVRFVGD---------TMWVTFRDGQSALAA 86 (146)
T ss_dssp -TT-EEEEEECS-SCCCCHS--HHHHHHHHHHHHCCS-ECEEEEETT---------CEEEEESSCHHHHHH
T ss_pred CCCceEEEEecCCCccccCcCCHHHHHHHHHHHHhCCceEEEEEeCC---------eEEEEECccHHHHHH
Confidence 4456666664441 1222 357778888998777766554 368999988887654
No 149
>KOG4676|consensus
Probab=81.97 E-value=0.14 Score=33.43 Aligned_cols=47 Identities=17% Similarity=0.104 Sum_probs=35.5
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecC
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFIT 72 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~ 72 (79)
+++++++|+..+...++.+.|..+|.+...++-. +....+|-++|..
T Consensus 152 Rt~~v~sl~~~~~l~e~~e~f~r~Gev~ya~~as-----k~~s~~c~~sf~~ 198 (479)
T KOG4676|consen 152 RTREVQSLISAAILPESGESFERKGEVSYAHTAS-----KSRSSSCSHSFRK 198 (479)
T ss_pred hhhhhhcchhhhcchhhhhhhhhcchhhhhhhhc-----cCCCcchhhhHhh
Confidence 7899999999999999999999999987766633 3333455566643
No 150
>KOG4454|consensus
Probab=73.34 E-value=0.63 Score=28.31 Aligned_cols=52 Identities=25% Similarity=0.468 Sum_probs=41.0
Q ss_pred CCCeEEEeC----CCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecC
Q psy3346 19 TGSKILVRN----IPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFIT 72 (79)
Q Consensus 19 ~~~~l~v~n----l~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~ 72 (79)
...+++.|+ |...++++.+...|+.-|.+..+++..+ ..|+++-++|+++..
T Consensus 79 ~q~~~r~G~shapld~r~~~ei~~~v~s~a~p~~~~R~~~~--~d~rnrn~~~~~~qr 134 (267)
T KOG4454|consen 79 EQRTLRCGNSHAPLDERVTEEILYEVFSQAGPIEGVRIPTD--NDGRNRNFGFVTYQR 134 (267)
T ss_pred hhcccccCCCcchhhhhcchhhheeeecccCCCCCcccccc--ccCCccCccchhhhh
Confidence 346677777 7777888888999999999999999888 467888888887643
No 151
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=71.77 E-value=10 Score=22.05 Aligned_cols=41 Identities=24% Similarity=0.532 Sum_probs=33.5
Q ss_pred ccCCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecc
Q psy3346 15 VAKQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKK 55 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~ 55 (79)
........+++.+++.......+...|..+|.+....+...
T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 260 (306)
T COG0724 220 LLLEKSDNLYVGNLPLKTAEEELADLFKSRGDIVRASLPPS 260 (306)
T ss_pred ccccccceeeccccccccchhHHHHhccccccceeeeccCC
Confidence 33455788999999999999999999999998866665544
No 152
>COG0030 KsgA Dimethyladenosine transferase (rRNA methylation) [Translation, ribosomal structure and biogenesis]
Probab=70.17 E-value=11 Score=23.38 Aligned_cols=34 Identities=15% Similarity=0.130 Sum_probs=25.2
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEee
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLP 53 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~ 53 (79)
.....|+|||+.++..-+..++...-.+....++
T Consensus 95 ~~~~vVaNlPY~Isspii~kll~~~~~~~~~v~M 128 (259)
T COG0030 95 QPYKVVANLPYNISSPILFKLLEEKFIIQDMVLM 128 (259)
T ss_pred CCCEEEEcCCCcccHHHHHHHHhccCccceEEEE
Confidence 4466799999999999999988765554444443
No 153
>PF09707 Cas_Cas2CT1978: CRISPR-associated protein (Cas_Cas2CT1978); InterPro: IPR010152 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799 from INTERPRO. Cas2 is one of four protein families (Cas1 to Cas4) that are associated with CRISPR elements and always occur near a repeat cluster, usually in the order cas3-cas4-cas1-cas2. The function of Cas2 (and Cas1) is unknown. Cas3 proteins appear to be helicases while Cas4 proteins resemble RecB-type exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression [].
Probab=69.52 E-value=14 Score=19.08 Aligned_cols=48 Identities=17% Similarity=0.241 Sum_probs=31.0
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEec
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFI 71 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~ 71 (79)
..-+||+|++..+-+.--..+.+..+.-.-+ +... ..+ ..||.|-++-
T Consensus 25 ~~GVyVg~~s~rVRe~lW~~v~~~~~~G~a~-m~~~--~~n-eqG~~~~t~G 72 (86)
T PF09707_consen 25 RPGVYVGNVSARVRERLWERVTEWIGDGSAV-MVWS--DNN-EQGFDFRTLG 72 (86)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHhhCCCccEE-EEEc--cCC-CCCEEEEEeC
Confidence 3568999999888877666666655543333 3332 223 6799998773
No 154
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=66.90 E-value=4.1 Score=21.11 Aligned_cols=25 Identities=20% Similarity=0.316 Sum_probs=20.6
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHh
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELF 41 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f 41 (79)
....++|.+.|||....+++|+..+
T Consensus 49 ~vs~rtVlvsgip~~l~ee~l~D~L 73 (88)
T PF07292_consen 49 GVSKRTVLVSGIPDVLDEEELRDKL 73 (88)
T ss_pred cccCCEEEEeCCCCCCChhhheeeE
Confidence 4567899999999999999888543
No 155
>KOG2318|consensus
Probab=66.63 E-value=14 Score=25.74 Aligned_cols=38 Identities=21% Similarity=0.413 Sum_probs=29.3
Q ss_pred CCCCCeEEEeCCCCC-CCHHHHHHHhHhc----CCeeEEEeec
Q psy3346 17 KQTGSKILVRNIPFQ-AKQSEVEELFKAF----GELKFVRLPK 54 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~-~~~~~l~~~f~~~----g~i~~~~~~~ 54 (79)
.....+|-|.||.|. +...+|.-+|..| |.|.+|.|..
T Consensus 171 ~~~T~RLAVvNMDWd~v~AkDL~v~~nSFlP~gGsilSV~IYp 213 (650)
T KOG2318|consen 171 GEETKRLAVVNMDWDRVKAKDLFVLFNSFLPKGGSILSVKIYP 213 (650)
T ss_pred ccccceeeEeccccccccHHHHHHHHHhhcCCCCceeEEEech
Confidence 455678999999997 5778888888765 5788887643
No 156
>PF04847 Calcipressin: Calcipressin; InterPro: IPR006931 Calcipressin 1 negatively regulates calcineurin (IPR015757 from INTERPRO) by direct binding and is essential for the survival of T helper type 1 cells. Calcipressin 1 is a phosphoprotein that increases its capacity to inhibit calcineurin when phosphorylated at the FLISPP motif, and this phosphorylation also controls the half-life of calcipressin 1 by accelerating its degradation []. Calcineurin is a calcium-responsive enzyme that dephosphorylates the nuclear factor of activated T cells (NFAT). In so doing it promotes its nuclear translocation and uniquely links calcium signalling to transcriptional regulation []. Calcipressins are a family of proteins derived from three genes. Calcipressin 1 is also known as modulatory calcineurin-interacting protein 1 (MCIP1), Adapt78 and Down syndrome critical region 1 (DSCR1). Calcipressin 2 is variously known as MCIP2, ZAKI-4 and DSCR1-like 1. Calcipressin 3 is also called MCIP3 and DSCR1-like 2 []. DSCR1 (Adapt78) is associated with successful adaptation to oxidative stress and calcium stress as well as with diseases like Alzheimer's and Down syndrome. The DSCR1 (Adapt78) isoform 1 protein, calcipressin 1, inhibits calcineurin and protects against acute calcium-mediated stress damage, including transient oxidative stress []. Calcipressin 1 is encoded by DSCR1, a gene on human chromosome 21. Calcipressin 1 isoform 1 has an N-terminal coding region, which generates a new polypeptide of 252 amino acids. Endogenous calcipressin 1 exists as a complex together with the calcineurin A and B heterodimer []. ; GO: 0019722 calcium-mediated signaling; PDB: 1WEY_A.
Probab=63.02 E-value=16 Score=21.51 Aligned_cols=39 Identities=26% Similarity=0.189 Sum_probs=25.3
Q ss_pred HHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhcC
Q psy3346 34 QSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKRV 79 (79)
Q Consensus 34 ~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~a 79 (79)
...|+++|..++.+........ -+-..|.|.+.++|.+|
T Consensus 9 ~~~l~~l~~~~~~~~~~~~L~s-------FrRi~v~f~~~~~A~~~ 47 (184)
T PF04847_consen 9 LAELEELFSTYDPPVQFSPLKS-------FRRIRVVFESPESAQRA 47 (184)
T ss_dssp HHHHHHHHHTT-SS-EEEEETT-------TTEEEEE-SSTTHHHHH
T ss_pred HHHHHHHHHhcCCceEEEEcCC-------CCEEEEEeCCHHHHHHH
Confidence 4678999999997766654443 24467889988888653
No 157
>PHA01632 hypothetical protein
Probab=62.58 E-value=13 Score=17.62 Aligned_cols=21 Identities=19% Similarity=0.539 Sum_probs=16.6
Q ss_pred EEEeCCCCCCCHHHHHHHhHh
Q psy3346 23 ILVRNIPFQAKQSEVEELFKA 43 (79)
Q Consensus 23 l~v~nl~~~~~~~~l~~~f~~ 43 (79)
+.|..+|..-++++|+..+.+
T Consensus 19 ilieqvp~kpteeelrkvlpk 39 (64)
T PHA01632 19 ILIEQVPQKPTEEELRKVLPK 39 (64)
T ss_pred EehhhcCCCCCHHHHHHHHHH
Confidence 345578999999999888754
No 158
>PRK11558 putative ssRNA endonuclease; Provisional
Probab=61.88 E-value=22 Score=18.78 Aligned_cols=50 Identities=14% Similarity=0.179 Sum_probs=29.5
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCH
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITK 73 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~ 73 (79)
..-+||++++..+-+.--..+-+.++.-. +.+... .+. -.||.|-++...
T Consensus 27 ~~GVyVg~~S~rVRd~lW~~v~~~~~~G~-avmv~~--~~~-eqG~~~~t~G~~ 76 (97)
T PRK11558 27 RAGVYVGDVSRRIREMIWQQVTQLAEEGN-VVMAWA--TNT-ESGFEFQTFGEN 76 (97)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHHhCCCCc-EEEEEc--CCC-CCCcEEEecCCC
Confidence 34689999888877765555555555422 223332 222 338888877543
No 159
>PF00398 RrnaAD: Ribosomal RNA adenine dimethylase; InterPro: IPR001737 This family of proteins include rRNA adenine dimethylases (e.g. KsgA) and the Erythromycin resistance methylases (Erm). The bacterial enzyme KsgA catalyses the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases in 16S rRNA. This enzyme and the resulting modified adenosine bases appear to be conserved in all species of eubacteria, eukaryotes, and archaea, and in eukaryotic organelles. Bacterial resistance to the aminoglycoside antibiotic kasugamycin involves inactivation of KsgA and resulting loss of the dimethylations, with modest consequences to the overall fitness of the organism. In contrast, the yeast ortholog, Dim1, is essential. In Saccharomyces cerevisiae (Baker's yeast), and presumably in other eukaryotes, the enzyme performs a vital role in pre-rRNA processing in addition to its methylating activity. The best conserved region in these enzymes is located in the N-terminal section and corresponds to a region that is probably involved in S-adenosyl methionine (SAM) binding domain. The crystal structure of KsgA from Escherichia coli has been solved to a resolution of 2.1A. It bears a strong similarity to the crystal structure of ErmC' from Bacillus stearothermophilus and a lesser similarity to the yeast mitochondrial transcription factor, sc-mtTFB []. The Erm family of RNA methyltransferases, which methylate a single adenosine base in 23S rRNA confer resistance to the MLS-B group of antibiotics. Despite their sequence similarity, the two enzyme families have strikingly different levels of regulation that remain to be elucidated. Other orthologs, of this family include the yeast and Homo sapiens (Human) mitochondrial transcription factors (MTF1 and h-mtTFB respectively), which are nuclear encoded []. Human-mtTFB is able to stimulate transcription in vitro independently of its S-adenosylmethionine binding and rRNA methyltransferase activity [].; GO: 0000179 rRNA (adenine-N6,N6-)-dimethyltransferase activity, 0008649 rRNA methyltransferase activity, 0000154 rRNA modification; PDB: 3FTF_A 3R9X_B 3FTE_A 3FTC_A 3FTD_A 3GRY_A 3FYC_A 3GRU_A 3FYD_A 3GRV_A ....
Probab=60.68 E-value=13 Score=22.78 Aligned_cols=32 Identities=22% Similarity=0.551 Sum_probs=24.1
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhH--hcCCeeEEE
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFK--AFGELKFVR 51 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~--~~g~i~~~~ 51 (79)
...+.|+|||+.++..-+..++. .+|...-+.
T Consensus 97 ~~~~vv~NlPy~is~~il~~ll~~~~~g~~~~~l 130 (262)
T PF00398_consen 97 QPLLVVGNLPYNISSPILRKLLELYRFGRVRMVL 130 (262)
T ss_dssp SEEEEEEEETGTGHHHHHHHHHHHGGGCEEEEEE
T ss_pred CceEEEEEecccchHHHHHHHhhcccccccceEE
Confidence 56788999999999998888886 455444433
No 160
>PF14893 PNMA: PNMA
Probab=56.78 E-value=12 Score=24.19 Aligned_cols=25 Identities=28% Similarity=0.496 Sum_probs=21.0
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhH
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFK 42 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~ 42 (79)
+..+.|.|.+||.++++.++++.+.
T Consensus 16 ~~~r~lLv~giP~dc~~~ei~e~l~ 40 (331)
T PF14893_consen 16 DPQRALLVLGIPEDCEEAEIEEALQ 40 (331)
T ss_pred ChhhhheeecCCCCCCHHHHHHHHH
Confidence 4467889999999999999887774
No 161
>COG0445 GidA Flavin-dependent tRNA uridine 5-carboxymethylaminomethyl modification enzyme GidA [Cell cycle control, cell division, chromosome partitioning]
Probab=55.71 E-value=46 Score=23.44 Aligned_cols=39 Identities=5% Similarity=0.129 Sum_probs=28.1
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecc
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKK 55 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~ 55 (79)
..+...+|+.+|+.+.-++-=.++....-..+.+.+.+.
T Consensus 298 Gl~~~evY~nGlSTSlP~dVQ~~~irsipGlEna~i~rp 336 (621)
T COG0445 298 GLDTDEVYPNGLSTSLPEDVQEQIIRSIPGLENAEILRP 336 (621)
T ss_pred CCCCceEecCcccccCCHHHHHHHHHhCcccccceeecc
Confidence 445678999999888887766666666655677766554
No 162
>PF11411 DNA_ligase_IV: DNA ligase IV; InterPro: IPR021536 DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface []. ; GO: 0003910 DNA ligase (ATP) activity; PDB: 3II6_Y 2E2W_A 1IK9_C.
Probab=53.16 E-value=13 Score=16.01 Aligned_cols=16 Identities=13% Similarity=0.449 Sum_probs=9.8
Q ss_pred CCCCHHHHHHHhHhcC
Q psy3346 30 FQAKQSEVEELFKAFG 45 (79)
Q Consensus 30 ~~~~~~~l~~~f~~~g 45 (79)
..+++.+|+++|.+..
T Consensus 19 ~Dtd~~~Lk~vF~~i~ 34 (36)
T PF11411_consen 19 VDTDEDQLKEVFNRIK 34 (36)
T ss_dssp S---HHHHHHHHHCS-
T ss_pred ccCCHHHHHHHHHHhc
Confidence 3678899999997754
No 163
>KOG1295|consensus
Probab=53.14 E-value=17 Score=23.91 Aligned_cols=58 Identities=14% Similarity=0.267 Sum_probs=37.0
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHhcCC-eeEEEee-cccCCCCCcceEEEEEecCHHHH
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKAFGE-LKFVRLP-KKMVGSGLHRGFGFVEFITKNEA 76 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~~g~-i~~~~~~-~~~~~~~~~~g~~fv~f~~~~~a 76 (79)
....+.|.+||+..++.++.+...++-. +....+. .+........+.+||.|...++.
T Consensus 6 ~~~Kvv~rrlpp~l~~~~~~eqi~p~~~~v~~~~F~~a~~s~~~~~ysrayinFk~~~dv 65 (376)
T KOG1295|consen 6 AKVKVVVRRLPPKLTEEQLLEQINPFPEHVNWEFFAKADESLRNHKYSRAYINFKNPEDV 65 (376)
T ss_pred cceeeeeecCCCcccHHHHhhhcCCCccccchheeccccccchhhhhhhhhhccccHHHH
Confidence 4567889999999999999888877632 2222222 11101122246789999888764
No 164
>KOG4213|consensus
Probab=52.07 E-value=50 Score=19.74 Aligned_cols=45 Identities=22% Similarity=0.279 Sum_probs=29.2
Q ss_pred CCHHHHHHHhHhc-CCeeEEEeecccCCCC--CcceEEEEEecCHHHHhc
Q psy3346 32 AKQSEVEELFKAF-GELKFVRLPKKMVGSG--LHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 32 ~~~~~l~~~f~~~-g~i~~~~~~~~~~~~~--~~~g~~fv~f~~~~~a~~ 78 (79)
.+++.+..+...- |.+..+.+-.. ..+ ..+|-.||+|.+.+.+.+
T Consensus 118 ~td~ql~~l~qw~~~k~~nv~mr~~--~~k~~~fkGsvkv~f~tk~qa~a 165 (205)
T KOG4213|consen 118 ITDDQLDDLNQWASGKGHNVKMRRH--GNKAHPFKGSVKVTFQTKEQAFA 165 (205)
T ss_pred CCHHHHHHHHHHhcccceEeecccc--CCCCCCCCCceEEEeecHHHHHh
Confidence 5566665555332 57777766443 333 567889999999888764
No 165
>PRK00274 ksgA 16S ribosomal RNA methyltransferase KsgA/Dim1 family protein; Reviewed
Probab=51.90 E-value=28 Score=21.46 Aligned_cols=22 Identities=18% Similarity=0.347 Sum_probs=18.6
Q ss_pred eEEEeCCCCCCCHHHHHHHhHh
Q psy3346 22 KILVRNIPFQAKQSEVEELFKA 43 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~~ 43 (79)
.+.|+|+|+.++..-|..++..
T Consensus 107 ~~vv~NlPY~iss~ii~~~l~~ 128 (272)
T PRK00274 107 LKVVANLPYNITTPLLFHLLEE 128 (272)
T ss_pred ceEEEeCCccchHHHHHHHHhc
Confidence 5779999999998888888854
No 166
>PRK01178 rps24e 30S ribosomal protein S24e; Reviewed
Probab=51.86 E-value=16 Score=19.32 Aligned_cols=47 Identities=19% Similarity=0.316 Sum_probs=26.4
Q ss_pred CCCHHHHHHHhH-hcCCeeEEEee---cccCCCCCcceEEEEEecCHHHHhc
Q psy3346 31 QAKQSEVEELFK-AFGELKFVRLP---KKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 31 ~~~~~~l~~~f~-~~g~i~~~~~~---~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
+.+..+|++-+. .++.=....++ ...-..|.+.|||.| |.+.+.+.+
T Consensus 30 tpsr~eirekLa~~~~~~~~~vvv~~~~t~fG~g~s~G~a~I-Yds~e~~kk 80 (99)
T PRK01178 30 TPSRKDVRKKLAAMLNADKELVVVRKIKTEYGMGKSKGYAKV-YDDKERARK 80 (99)
T ss_pred CCCHHHHHHHHHHHHCcCCCEEEEEccCccCCCceEEEEEEE-ECCHHHHHh
Confidence 567788877774 45532222221 221134567788877 788777653
No 167
>PF13046 DUF3906: Protein of unknown function (DUF3906)
Probab=51.55 E-value=24 Score=17.21 Aligned_cols=32 Identities=34% Similarity=0.518 Sum_probs=20.8
Q ss_pred HHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEE
Q psy3346 34 QSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFV 68 (79)
Q Consensus 34 ~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv 68 (79)
+.+|...|-+-..|..+.+...+ . ..+|-|||
T Consensus 32 e~eler~fl~~P~v~e~~l~EKK-r--i~~G~gyV 63 (64)
T PF13046_consen 32 EVELERHFLPLPEVKEVALYEKK-R--IRKGAGYV 63 (64)
T ss_pred HHHhhhhccCCCCceEEEEEEEE-e--eeCCceeE
Confidence 45677777777788888887654 2 23355555
No 168
>KOG2295|consensus
Probab=50.07 E-value=1.5 Score=30.09 Aligned_cols=53 Identities=23% Similarity=0.191 Sum_probs=36.2
Q ss_pred CCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEec
Q psy3346 18 QTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFI 71 (79)
Q Consensus 18 ~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~ 71 (79)
...+++++.|+++.++-.+|...++.+..+..+.+.... .......+++|+|.
T Consensus 229 hke~sll~rni~Pnis~aeIe~~ck~i~~~lrfals~~~-aek~~~r~lwv~fk 281 (648)
T KOG2295|consen 229 HKECSLLVRNILPNISVAEIENLCKGIPGFLRFALSTIN-AEKNFERRLWVTFK 281 (648)
T ss_pred hHHHHHHHhccCCcccHHHHHHHhccCchheeeeccCch-HHHHHHHHhhHhhc
Confidence 345789999999999999999999887666665554332 22233345555554
No 169
>COG5193 LHP1 La protein, small RNA-binding pol III transcript stabilizing protein and related La-motif-containing proteins involved in translation [Posttranslational modification, protein turnover, chaperones / Translation, ribosomal structure and biogenesis]
Probab=50.02 E-value=5 Score=26.56 Aligned_cols=58 Identities=21% Similarity=0.196 Sum_probs=40.5
Q ss_pred CCeEEEeCCCCCCCH--------HHHHHHhHh--cCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 20 GSKILVRNIPFQAKQ--------SEVEELFKA--FGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~--------~~l~~~f~~--~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
.+.+|+.+....... +++..+|.+ .+.+..+.+.++. ....++|-.|++|...+.+++
T Consensus 174 qr~~y~n~fG~e~~~~a~~~e~~~d~~~~~p~h~h~~~~~i~~rrd~-~nkn~~gSv~~efk~~~~~q~ 241 (438)
T COG5193 174 QRDVYQNGFGKEDVNNASRPEQQEDLEIQFPPHYHAPPSQIRNRRDW-LNKNFRGSVFVEFKYFREAQR 241 (438)
T ss_pred hhhHHhhcCCcccccccccchhhhhHHhhCCCcccCChhhccchhhh-hhccccCcccccccChHHHHH
Confidence 456677666665444 378888888 4566666666653 356678889999999888765
No 170
>PTZ00338 dimethyladenosine transferase-like protein; Provisional
Probab=49.07 E-value=26 Score=22.05 Aligned_cols=32 Identities=22% Similarity=0.266 Sum_probs=22.6
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEe
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGELKFVRL 52 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~ 52 (79)
..+.|.|||+.++...+..++.....+....+
T Consensus 102 ~d~VvaNlPY~Istpil~~ll~~~~~~~~~vl 133 (294)
T PTZ00338 102 FDVCVANVPYQISSPLVFKLLAHRPLFRCAVL 133 (294)
T ss_pred cCEEEecCCcccCcHHHHHHHhcCCCCceeee
Confidence 35778999999999888888864333444333
No 171
>TIGR00755 ksgA dimethyladenosine transferase. Alternate name: S-adenosylmethionine--6-N',N'-adenosyl (rRNA) dimethyltransferase
Probab=46.44 E-value=38 Score=20.54 Aligned_cols=24 Identities=17% Similarity=0.282 Sum_probs=20.0
Q ss_pred eEEEeCCCCCCCHHHHHHHhHhcC
Q psy3346 22 KILVRNIPFQAKQSEVEELFKAFG 45 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~~~g 45 (79)
-+.|+|||+.++...+.+++...|
T Consensus 96 ~~vvsNlPy~i~~~il~~ll~~~~ 119 (253)
T TIGR00755 96 LKVVSNLPYNISSPLIFKLLEKPK 119 (253)
T ss_pred ceEEEcCChhhHHHHHHHHhccCC
Confidence 377999999999999999886544
No 172
>cd00027 BRCT Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Probab=45.14 E-value=31 Score=15.38 Aligned_cols=26 Identities=15% Similarity=0.276 Sum_probs=21.2
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhcCC
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAFGE 46 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~g~ 46 (79)
..+++.+.........|.++...+|.
T Consensus 2 ~~~~i~g~~~~~~~~~l~~~i~~~Gg 27 (72)
T cd00027 2 LTFVITGDLPSEERDELKELIEKLGG 27 (72)
T ss_pred CEEEEEecCCCcCHHHHHHHHHHcCC
Confidence 46778777667888999999999885
No 173
>PF04026 SpoVG: SpoVG; InterPro: IPR007170 This is a stage V sporulation protein G. It is essential for sporulation and specific to stage V sporulation in Bacillus megaterium and Bacillus subtilis []. In B. subtilis, expression decreases after 30-60 minutes of cold shock [].; GO: 0030435 sporulation resulting in formation of a cellular spore; PDB: 2IA9_F 2I9X_B 2I9Z_A.
Probab=44.38 E-value=45 Score=17.05 Aligned_cols=25 Identities=28% Similarity=0.309 Sum_probs=15.2
Q ss_pred eeEEEeecccCCCCCcceEEEEEecC
Q psy3346 47 LKFVRLPKKMVGSGLHRGFGFVEFIT 72 (79)
Q Consensus 47 i~~~~~~~~~~~~~~~~g~~fv~f~~ 72 (79)
|..+++..-. ..++.+++|-|+|.+
T Consensus 3 itdVri~~~~-~~~~lka~asV~~dd 27 (84)
T PF04026_consen 3 ITDVRIRKIE-PEGKLKAFASVTFDD 27 (84)
T ss_dssp EEEEEEEETT-SSSSEEEEEEEEETT
T ss_pred cEEEEEEEec-CCCCEEEEEEEEECC
Confidence 4555553332 347777888887764
No 174
>smart00457 MACPF membrane-attack complex / perforin.
Probab=44.24 E-value=28 Score=20.26 Aligned_cols=22 Identities=18% Similarity=0.483 Sum_probs=18.7
Q ss_pred EeCCCCCCCHHHHHHHhHhcCC
Q psy3346 25 VRNIPFQAKQSEVEELFKAFGE 46 (79)
Q Consensus 25 v~nl~~~~~~~~l~~~f~~~g~ 46 (79)
+.+||...+......+|..||.
T Consensus 30 l~~Lp~~~~~~~~~~fi~~yGT 51 (194)
T smart00457 30 LRDLPDQYNRGAYARFIDKYGT 51 (194)
T ss_pred HHhCccccCHHHHHHHHHHhCC
Confidence 4478888889999999999984
No 175
>KOG4574|consensus
Probab=42.33 E-value=9.7 Score=27.76 Aligned_cols=48 Identities=15% Similarity=0.227 Sum_probs=35.0
Q ss_pred EEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHHHh
Q psy3346 23 ILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 23 l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
.++.|.+-..+...|-.++..||.+.+.+..++- -.|.|+|.+.+.|-
T Consensus 301 ~~~~nn~v~~tSssL~~l~s~yg~v~s~wtlr~~-------N~alvs~~s~~sai 348 (1007)
T KOG4574|consen 301 QSLENNAVNLTSSSLATLCSDYGSVASAWTLRDL-------NMALVSFSSVESAI 348 (1007)
T ss_pred hhhhcccccchHHHHHHHHHhhcchhhheecccc-------cchhhhhHHHHHHH
Confidence 3444555567778899999999999888766652 45788888877764
No 176
>smart00650 rADc Ribosomal RNA adenine dimethylases.
Probab=41.13 E-value=67 Score=18.05 Aligned_cols=24 Identities=17% Similarity=0.278 Sum_probs=19.4
Q ss_pred CeEEEeCCCCCCCHHHHHHHhHhc
Q psy3346 21 SKILVRNIPFQAKQSEVEELFKAF 44 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~~~ 44 (79)
..+.++|+|+.++...+..++...
T Consensus 78 ~d~vi~n~Py~~~~~~i~~~l~~~ 101 (169)
T smart00650 78 PYKVVGNLPYNISTPILFKLLEEP 101 (169)
T ss_pred CCEEEECCCcccHHHHHHHHHhcC
Confidence 457789999999988888888653
No 177
>PTZ00071 40S ribosomal protein S24; Provisional
Probab=40.39 E-value=30 Score=19.38 Aligned_cols=46 Identities=20% Similarity=0.405 Sum_probs=26.5
Q ss_pred CCCHHHHHHHhHh-cC-C-eeEEEe--ecccCCCCCcceEEEEEecCHHHHh
Q psy3346 31 QAKQSEVEELFKA-FG-E-LKFVRL--PKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 31 ~~~~~~l~~~f~~-~g-~-i~~~~~--~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
+.+..+|++-+.. |+ . ...+.+ ....-..|.+.|||.| |.+.+.+.
T Consensus 35 TpSr~eirekLA~~~~v~d~~~Vvv~~~~T~fG~g~StG~a~I-Yds~e~~k 85 (132)
T PTZ00071 35 TVSKKDIKEKLAKQYKVADARTIVLFGFKTKFGGGKTTGFGLI-YDNLAALK 85 (132)
T ss_pred CCCHHHHHHHHHHHhCCCCCCEEEEEccEecCCCceEEEEEEE-ECCHHHHH
Confidence 5677888887754 55 1 122222 1221135667788877 78877764
No 178
>COG5638 Uncharacterized conserved protein [Function unknown]
Probab=40.16 E-value=67 Score=21.83 Aligned_cols=39 Identities=18% Similarity=0.405 Sum_probs=28.1
Q ss_pred ccCCCCCeEEEeCCCCC-CCHHHHHHHhHhc----CCeeEEEee
Q psy3346 15 VAKQTGSKILVRNIPFQ-AKQSEVEELFKAF----GELKFVRLP 53 (79)
Q Consensus 15 ~~~~~~~~l~v~nl~~~-~~~~~l~~~f~~~----g~i~~~~~~ 53 (79)
.......+|-|.||.|. +...+|...|+.| |.+..+.+.
T Consensus 141 e~G~~tkrLAvVnmDWd~v~a~DLf~~fsSf~P~ggkl~kV~iy 184 (622)
T COG5638 141 EEGNPTKRLAVVNMDWDRVDAKDLFKIFSSFLPYGGKLSKVKIY 184 (622)
T ss_pred CCCCcccceeEeecccccchHHHHHHHHHhhCCCCCccceeEec
Confidence 33455678999999996 6677888888765 466666653
No 179
>PRK13259 regulatory protein SpoVG; Reviewed
Probab=39.03 E-value=62 Score=17.05 Aligned_cols=14 Identities=14% Similarity=0.195 Sum_probs=7.6
Q ss_pred CCCcceEEEEEecC
Q psy3346 59 SGLHRGFGFVEFIT 72 (79)
Q Consensus 59 ~~~~~g~~fv~f~~ 72 (79)
.|+-++||-|+|.+
T Consensus 14 ~g~lka~asit~dd 27 (94)
T PRK13259 14 EGRMKAIVSITFDN 27 (94)
T ss_pred CCcEEEEEEEEECC
Confidence 35555666555543
No 180
>PF08206 OB_RNB: Ribonuclease B OB domain; InterPro: IPR013223 This domain includes the N-terminal OB domain found in ribonuclease B proteins in one or two copies.; PDB: 2ID0_D 2IX1_A 2IX0_A.
Probab=38.15 E-value=27 Score=16.21 Aligned_cols=11 Identities=45% Similarity=0.664 Sum_probs=7.7
Q ss_pred cceEEEEEecC
Q psy3346 62 HRGFGFVEFIT 72 (79)
Q Consensus 62 ~~g~~fv~f~~ 72 (79)
.+|||||...+
T Consensus 7 ~~GfGFv~~~~ 17 (58)
T PF08206_consen 7 PKGFGFVIPDD 17 (58)
T ss_dssp SSS-EEEEECT
T ss_pred cCCCEEEEECC
Confidence 46999998765
No 181
>PF01823 MACPF: MAC/Perforin domain; InterPro: IPR020864 The membrane attack complex/perforin (MACPF) domain is conserved in bacteria, fungi, mammals and plants. It was originally identified and named as being common to five complement components (C6, C7, C8-alpha, C8-beta, and C9) and perforin. These molecules perform critical functions in innate and adaptive immunity. The MAC family proteins and perforin are known to participate in lytic pore formation. In response to pathogen infection, a sequential and highly specific interaction between the constituent elements occurs to form transmembrane channels which are known as the membrane-attack complex (MAC).Only a few other MACPF proteins have been characterised and several are thought to form pores for invasion or protection [, , ]. Examples are proteins from malarial parasites [], the cytolytic toxins from sea anemones [], and proteins that provide plant immunity [, ]. Functionally uncharacterised MACPF proteins are also evident in pathogenic bacteria such as Chlamydia spp [] and Photorhabdus luminescens (Xenorhabdus luminescens) []. The MACPF domain is commonly found to be associated with other N- and C-terminal domains, such as TSP1 (see PDOC50092 from PROSITEDOC), LDLRA (see PDOC00929 from PROSITEDOC), EGF-like (see PDOC00021 from PROSITEDOC),Sushi/CCP/SCR (see PDOC50923 from PROSITEDOC), FIMAC or C2 (see PDOC00380 from PROSITEDOC). They probably control or target MACPF function [, ]. The MACPF domain oligomerizes, undergoes conformational change, and is required for lytic activity. The MACPF domain consists of a central kinked four-stranded antiparallel beta sheet surrounded by alpha helices and beta strands, forming two structural segments. Overall, the MACPF domain has a thin L-shaped appearance. MACPF domains exhibit limited sequence similarity but contain a signature [YW]-G-[TS]-H-[FY]-x(6)-G-G motif [, , ]. Some proteins known to contain a MACPF domain are listed below: Vertebrate complement proteins C6 to C9. Complement factors C6 to C9 assemble to form a scaffold, the membrane attack complex (MAC), that permits C9 polymerisation into pores that lyse Gram-negative pathogens [, ]. Vertebrate perforin. It is delivered by natural killer cells and cytotoxic T lymphocytes and forms oligomeric pores (12 to 18 monomers) in the plasma membrane of either virus-infected or transformed cells. Arabidopsis thaliana (Mouse-ear cress) constitutively activated cell death 1 (CAD1) protein. It is likely to act as a mediator that recognises plant signals for pathogen infection []. Arabidopsis thaliana (Mouse-ear cress) necrotic spotted lesions 1 (NSL1) protein []. Venomous sea anemone Phyllodiscus semoni (Night anemone) toxins PsTX-60A and PsTX-60B []. Venomous sea anemone Actineria villosa (Okinawan sea anemone) toxin AvTX-60A []. Plasmodium sporozoite microneme protein essential for cell traversal 2 (SPECT2). It is essential for the membrane-wounding activity of the sporozoite and is involved in its traversal of the sinusoidal cell layer prior to hepatocyte-infection []. P. luminescens Plu-MACPF. Although nonlytic, it was shown to bind to cell membranes []. Chlamydial putative uncharacterised protein CT153 []. ; PDB: 2QP2_A 3OJY_B 3NSJ_A 4E0S_B 3T5O_A 4A5W_B 2QQH_A 2RD7_A.
Probab=38.10 E-value=29 Score=19.98 Aligned_cols=22 Identities=14% Similarity=0.442 Sum_probs=15.3
Q ss_pred EeCCCCCCCHHH---HHHHhHhcCC
Q psy3346 25 VRNIPFQAKQSE---VEELFKAFGE 46 (79)
Q Consensus 25 v~nl~~~~~~~~---l~~~f~~~g~ 46 (79)
+.+||...+... +.++|..||.
T Consensus 53 l~~L~~~~~~~~~~~y~~f~~~yGT 77 (212)
T PF01823_consen 53 LNALPAEYNSDNTDEYYRFFDKYGT 77 (212)
T ss_dssp HHTSHSS--HHHHHHHHHHHHHH-S
T ss_pred HHhhCcccCccchHHHHHHHHHhCc
Confidence 557888887777 7889999995
No 182
>TIGR02542 B_forsyth_147 Bacteroides forsythus 147-residue repeat. The longest predicted protein in Bacteroides forsythus ATCC 43037 is over 3000 residues long and lacks homology to other known proteins. Immediately after the signal sequence are four tandem repeats, approximately 147 residues long. This model describes that repeat. This model describes that repeat.
Probab=36.39 E-value=24 Score=19.48 Aligned_cols=44 Identities=11% Similarity=0.218 Sum_probs=26.9
Q ss_pred CCCCCHHHHHHHhHh---cCCeeEEEeecccCCCCCcceEEEEEecCH
Q psy3346 29 PFQAKQSEVEELFKA---FGELKFVRLPKKMVGSGLHRGFGFVEFITK 73 (79)
Q Consensus 29 ~~~~~~~~l~~~f~~---~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~ 73 (79)
|+..+-.+++++|+. |..|..-.+.++- ....+-..||..|...
T Consensus 83 PYTlT~~e~r~iF~Epm~YQGITReQV~rdG-LP~GsYRiCFrL~~~~ 129 (145)
T TIGR02542 83 PYTLTYNELRQIFREPMVYQGITREQVQRDG-LPEGSYRICFRLFNAT 129 (145)
T ss_pred ceeeeHHHHHHHHhhhhhhccccHHHHhhcC-CCCCceEEEEEEeccc
Confidence 667899999999964 4445444444442 2222345788877654
No 183
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=36.28 E-value=65 Score=17.55 Aligned_cols=31 Identities=13% Similarity=0.184 Sum_probs=24.1
Q ss_pred EEEeCCCCCC-CHHHHHHHhHhcCCeeEEEee
Q psy3346 23 ILVRNIPFQA-KQSEVEELFKAFGELKFVRLP 53 (79)
Q Consensus 23 l~v~nl~~~~-~~~~l~~~f~~~g~i~~~~~~ 53 (79)
|.|.|||... +++.++.+-+.+|.+..+...
T Consensus 107 Vri~glP~~~~~~~~~~~i~~~iG~~i~vD~~ 138 (153)
T PF14111_consen 107 VRIYGLPLHLWSEEILKAIGSKIGEPIEVDEN 138 (153)
T ss_pred hhhccCCHHHhhhHHHHHHHHhcCCeEEEEcC
Confidence 5566999984 667788888999998887653
No 184
>PF15407 Spo7_2_N: Sporulation protein family 7
Probab=35.98 E-value=28 Score=17.06 Aligned_cols=23 Identities=17% Similarity=0.261 Sum_probs=16.0
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHh
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELF 41 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f 41 (79)
..+++|||++|...-.+.=..++
T Consensus 26 tSr~vflG~IP~~W~~~~~~~~~ 48 (67)
T PF15407_consen 26 TSRRVFLGPIPEIWLQDHRKSWY 48 (67)
T ss_pred cCceEEECCCChHHHHcCcchHH
Confidence 46799999999976655433333
No 185
>PRK15464 cold shock-like protein CspH; Provisional
Probab=35.86 E-value=21 Score=17.52 Aligned_cols=10 Identities=30% Similarity=0.404 Sum_probs=7.2
Q ss_pred cceEEEEEec
Q psy3346 62 HRGFGFVEFI 71 (79)
Q Consensus 62 ~~g~~fv~f~ 71 (79)
.+||+||.=.
T Consensus 15 ~KGfGFI~~~ 24 (70)
T PRK15464 15 KSGKGFIIPS 24 (70)
T ss_pred CCCeEEEccC
Confidence 3699999643
No 186
>PRK15463 cold shock-like protein CspF; Provisional
Probab=34.39 E-value=24 Score=17.25 Aligned_cols=10 Identities=20% Similarity=0.325 Sum_probs=7.3
Q ss_pred cceEEEEEec
Q psy3346 62 HRGFGFVEFI 71 (79)
Q Consensus 62 ~~g~~fv~f~ 71 (79)
.+||+||+=.
T Consensus 15 ~kGfGFI~~~ 24 (70)
T PRK15463 15 KSGKGLITPS 24 (70)
T ss_pred CCceEEEecC
Confidence 3699999643
No 187
>PRK09937 stationary phase/starvation inducible regulatory protein CspD; Provisional
Probab=34.34 E-value=25 Score=17.43 Aligned_cols=9 Identities=44% Similarity=0.940 Sum_probs=6.7
Q ss_pred cceEEEEEe
Q psy3346 62 HRGFGFVEF 70 (79)
Q Consensus 62 ~~g~~fv~f 70 (79)
.+||+||.=
T Consensus 12 ~KGfGFI~~ 20 (74)
T PRK09937 12 AKGFGFICP 20 (74)
T ss_pred CCCeEEEee
Confidence 369999954
No 188
>PRK09507 cspE cold shock protein CspE; Reviewed
Probab=33.75 E-value=24 Score=17.15 Aligned_cols=9 Identities=44% Similarity=1.069 Sum_probs=6.8
Q ss_pred cceEEEEEe
Q psy3346 62 HRGFGFVEF 70 (79)
Q Consensus 62 ~~g~~fv~f 70 (79)
.+||+||.=
T Consensus 14 ~kGyGFI~~ 22 (69)
T PRK09507 14 SKGFGFITP 22 (69)
T ss_pred CCCcEEEec
Confidence 369999864
No 189
>PF06613 KorB_C: KorB C-terminal beta-barrel domain; InterPro: IPR010575 This domain is found in several KorB transcriptional repressor proteins. The korB gene is a major regulatory element in the replication and maintenance of broad host-range plasmid RK2. It negatively controls the replication gene trfA, the host-lethal determinants kilA and kilB, and the korA-korB operon []. This family is found in conjunction with IPR003115 from INTERPRO.; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 1IGQ_B 1IGU_A.
Probab=33.22 E-value=64 Score=15.53 Aligned_cols=22 Identities=14% Similarity=0.235 Sum_probs=13.3
Q ss_pred EEEeecccCCCCCcceEEEEEecC
Q psy3346 49 FVRLPKKMVGSGLHRGFGFVEFIT 72 (79)
Q Consensus 49 ~~~~~~~~~~~~~~~g~~fv~f~~ 72 (79)
..+++.++ ...+.|++++.|++
T Consensus 19 ~arllLnr--Rps~~G~~WiKyED 40 (60)
T PF06613_consen 19 PARLLLNR--RPSSEGLAWIKYED 40 (60)
T ss_dssp EEEE-TTB----SSTTEEEEEETT
T ss_pred hhhhhhcc--CCCcCCeEEEEEcc
Confidence 34555552 44567999999976
No 190
>PRK14998 cold shock-like protein CspD; Provisional
Probab=33.13 E-value=27 Score=17.26 Aligned_cols=10 Identities=40% Similarity=0.756 Sum_probs=7.2
Q ss_pred cceEEEEEec
Q psy3346 62 HRGFGFVEFI 71 (79)
Q Consensus 62 ~~g~~fv~f~ 71 (79)
.+||+||.=.
T Consensus 12 ~kGfGFI~~~ 21 (73)
T PRK14998 12 AKGFGFICPE 21 (73)
T ss_pred CCceEEEecC
Confidence 4699999643
No 191
>PRK11679 lipoprotein; Provisional
Probab=33.05 E-value=1.1e+02 Score=19.80 Aligned_cols=50 Identities=14% Similarity=0.218 Sum_probs=38.4
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHH
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKN 74 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~ 74 (79)
..++..+++-+.|++..+..|-..+.+.|-- +. ...+++|.-||.|..++
T Consensus 236 d~~g~~~l~~~~~fdraW~rlg~aLdr~Gf~-----V~---Drdrs~G~y~V~y~~~~ 285 (346)
T PRK11679 236 DDTGLPVLIVRAPYNVVWQRLPAALEKVGMK-----VT---DRTRSQGTVAVTYKPPS 285 (346)
T ss_pred cCCCCeEEEeCCcHHHHHHHHHHHHHhcCCe-----Ee---eccccCcEEEEEecCCC
Confidence 4557778888999999999999999988831 11 34567899999998654
No 192
>PRK10943 cold shock-like protein CspC; Provisional
Probab=32.42 E-value=25 Score=17.07 Aligned_cols=9 Identities=44% Similarity=1.069 Sum_probs=6.9
Q ss_pred cceEEEEEe
Q psy3346 62 HRGFGFVEF 70 (79)
Q Consensus 62 ~~g~~fv~f 70 (79)
.+||+||.=
T Consensus 14 ~kGfGFI~~ 22 (69)
T PRK10943 14 SKGFGFITP 22 (69)
T ss_pred CCCcEEEec
Confidence 369999964
No 193
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=32.07 E-value=60 Score=14.89 Aligned_cols=19 Identities=16% Similarity=0.464 Sum_probs=15.6
Q ss_pred eCCCCCCCHHHHHHHhHhc
Q psy3346 26 RNIPFQAKQSEVEELFKAF 44 (79)
Q Consensus 26 ~nl~~~~~~~~l~~~f~~~ 44 (79)
-+|+..++.++|+..|...
T Consensus 6 Lgl~~~~~~~eik~~y~~l 24 (64)
T PF00226_consen 6 LGLPPDASDEEIKKAYRRL 24 (64)
T ss_dssp CTSTTTSSHHHHHHHHHHH
T ss_pred CCCCCCCCHHHHHHHHHhh
Confidence 4788999999999888654
No 194
>PF06014 DUF910: Bacterial protein of unknown function (DUF910); InterPro: IPR009256 This family consists of several short bacterial proteins of unknown function.; PDB: 2NN4_A.
Probab=31.89 E-value=37 Score=16.46 Aligned_cols=16 Identities=31% Similarity=0.656 Sum_probs=10.1
Q ss_pred HHHHHHHhHhcCCeeE
Q psy3346 34 QSEVEELFKAFGELKF 49 (79)
Q Consensus 34 ~~~l~~~f~~~g~i~~ 49 (79)
--++++++++||-+-.
T Consensus 4 lyDVqQLLK~fG~~IY 19 (62)
T PF06014_consen 4 LYDVQQLLKKFGIIIY 19 (62)
T ss_dssp HHHHHHHHHTTS----
T ss_pred HHHHHHHHHHCCEEEE
Confidence 3578999999996533
No 195
>TIGR02381 cspD cold shock domain protein CspD. This model represents what appears to be a phylogenetically distinct clade, containing E. coli CspD and related proteobacterial proteins within the larger family of cold shock domain proteins described by pfam model pfam00313. The gene symbol cspD may have been used idependently for other subfamilies of cold shock domain proteins, such as for B. subtilis CspD. These proteins typically are shorter than 70 amino acids. In E. coli, CspD is a stress response protein induced in stationary phase. This homodimer binds single-stranded DNA and appears to inhibit DNA replication.
Probab=31.54 E-value=30 Score=16.69 Aligned_cols=10 Identities=40% Similarity=0.756 Sum_probs=7.3
Q ss_pred cceEEEEEec
Q psy3346 62 HRGFGFVEFI 71 (79)
Q Consensus 62 ~~g~~fv~f~ 71 (79)
.+||+||.=.
T Consensus 12 ~kGfGFI~~~ 21 (68)
T TIGR02381 12 AKGFGFICPE 21 (68)
T ss_pred CCCeEEEecC
Confidence 3699999644
No 196
>smart00703 NRF N-terminal domain in C. elegans NRF-6 (Nose Resistant to Fluoxetine-4) and NDG-4 (resistant to nordihydroguaiaretic acid-4). Also present in several other worm and fly proteins.
Probab=30.68 E-value=52 Score=17.34 Aligned_cols=23 Identities=9% Similarity=0.316 Sum_probs=17.7
Q ss_pred EEEe-CCCCCCCHHHHHHHhHhcC
Q psy3346 23 ILVR-NIPFQAKQSEVEELFKAFG 45 (79)
Q Consensus 23 l~v~-nl~~~~~~~~l~~~f~~~g 45 (79)
+.++ -+|.+++..++..++..+.
T Consensus 72 ~~~gvC~P~sCs~~dv~~l~~~~~ 95 (110)
T smart00703 72 LRTGVCLPSSCSAADLKTLLNQLS 95 (110)
T ss_pred eEEEEEeCCCCCHHHHHHHHHHHH
Confidence 4444 4899999999999997764
No 197
>PRK10354 RNA chaperone/anti-terminator; Provisional
Probab=30.28 E-value=30 Score=16.83 Aligned_cols=8 Identities=50% Similarity=1.211 Sum_probs=6.4
Q ss_pred ceEEEEEe
Q psy3346 63 RGFGFVEF 70 (79)
Q Consensus 63 ~g~~fv~f 70 (79)
+||+||.=
T Consensus 16 kGfGFI~~ 23 (70)
T PRK10354 16 KGFGFITP 23 (70)
T ss_pred CCcEEEec
Confidence 69999964
No 198
>PF03439 Spt5-NGN: Early transcription elongation factor of RNA pol II, NGN section; InterPro: IPR005100 Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4 []. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit []. ; PDB: 3H7H_B 3QQC_D 3P8B_B 2EXU_A.
Probab=30.13 E-value=52 Score=16.55 Aligned_cols=17 Identities=24% Similarity=0.460 Sum_probs=14.0
Q ss_pred cceEEEEEecCHHHHhc
Q psy3346 62 HRGFGFVEFITKNEAKR 78 (79)
Q Consensus 62 ~~g~~fv~f~~~~~a~~ 78 (79)
.+||-||+=.+..++..
T Consensus 43 lkGyIyVEA~~~~~V~~ 59 (84)
T PF03439_consen 43 LKGYIYVEAERESDVKE 59 (84)
T ss_dssp STSEEEEEESSHHHHHH
T ss_pred CceEEEEEeCCHHHHHH
Confidence 67999999988877654
No 199
>PRK09890 cold shock protein CspG; Provisional
Probab=30.11 E-value=30 Score=16.86 Aligned_cols=10 Identities=40% Similarity=0.856 Sum_probs=7.2
Q ss_pred cceEEEEEec
Q psy3346 62 HRGFGFVEFI 71 (79)
Q Consensus 62 ~~g~~fv~f~ 71 (79)
.+||+||+=.
T Consensus 15 ~kGfGFI~~~ 24 (70)
T PRK09890 15 DKGFGFITPD 24 (70)
T ss_pred CCCcEEEecC
Confidence 3699999643
No 200
>PF06804 Lipoprotein_18: NlpB/DapX lipoprotein; InterPro: IPR010653 This entry consists of a number of bacterial lipoproteins often known as NlpB or DapX. This lipoprotein is detected in outer membrane vesicles in Escherichia coli and appears to be non-essential [].; PDB: 2YH6_A 3TGO_D 2YH5_A 2LAF_A 2LAE_A 3SNS_A.
Probab=30.05 E-value=40 Score=21.32 Aligned_cols=51 Identities=14% Similarity=0.257 Sum_probs=33.9
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHhHhcCCeeEEEeecccCCCCCcceEEEEEecCHHH
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELFKAFGELKFVRLPKKMVGSGLHRGFGFVEFITKNE 75 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~ 75 (79)
+.++..+++-..|++..+..|...+.+.|-. +. ...++.|.-||.|..+++
T Consensus 195 d~~g~~~l~~~~~fd~aW~rl~~aL~~~gf~-----V~---d~drs~G~~~v~y~~~~~ 245 (303)
T PF06804_consen 195 DGNGQPALILRAPFDRAWRRLGLALDRLGFT-----VE---DRDRSQGVYYVRYKPPDS 245 (303)
T ss_dssp -TTS-EEEEEES-HHHHHHHHHHHHHHTTEE-----EE---EEETTTTEEEEEE----H
T ss_pred cCCCceEEEECCcHHHHHHHHHHHHHhCCCE-----EE---ecccccEEEEEEEcCCCh
Confidence 3567788888999999999999999998821 11 234567999999987654
No 201
>PF01984 dsDNA_bind: Double-stranded DNA-binding domain; InterPro: IPR002836 This protein family is found in archaea and eukaryota. The human TFAR19 (TF-1 cell apoptosis-related protein 19) encodes a protein which shares significant homology to the corresponding proteins of species ranging from yeast to mice. TFAR19 exhibits a ubiquitous expression pattern and its expression is up-regulated in the tumour cells undergoing apoptosis. TFAR19 may play a general role in the apoptotic process []. Also included in this family is a DNA-binding protein from the archaea, Methanobacterium thermoautotrophicum.; GO: 0003677 DNA binding; PDB: 1EIJ_A 2K6B_A 2CRU_A 1YYB_A 2JXN_A 2FH0_A.
Probab=29.60 E-value=54 Score=17.56 Aligned_cols=21 Identities=10% Similarity=0.284 Sum_probs=15.6
Q ss_pred eCCCCCCCHHHHHHHhHhcCC
Q psy3346 26 RNIPFQAKQSEVEELFKAFGE 46 (79)
Q Consensus 26 ~nl~~~~~~~~l~~~f~~~g~ 46 (79)
|.|...+++++|..++.....
T Consensus 75 G~l~~kI~d~~L~~iL~~i~~ 95 (107)
T PF01984_consen 75 GQLRGKIDDEQLKEILEQISE 95 (107)
T ss_dssp TSSSS-B-HHHHHHHHHHHCC
T ss_pred CCCCCCcCHHHHHHHHHHHhh
Confidence 567888999999999988753
No 202
>KOG0862|consensus
Probab=29.43 E-value=36 Score=20.74 Aligned_cols=15 Identities=33% Similarity=0.490 Sum_probs=11.6
Q ss_pred CCCcceEEEEEecCH
Q psy3346 59 SGLHRGFGFVEFITK 73 (79)
Q Consensus 59 ~~~~~g~~fv~f~~~ 73 (79)
+...|.|+|++|.+-
T Consensus 105 ~~~~RPY~FieFD~~ 119 (216)
T KOG0862|consen 105 QPASRPYAFIEFDTF 119 (216)
T ss_pred CccCCCeeEEehhHH
Confidence 346789999999764
No 203
>cd06498 ACD_alphaB-crystallin_HspB5 Alpha-crystallin domain found in the small heat shock protein (sHsp) alphaB-crystallin (HspB5, 20kDa). sHsps are molecular chaperones that suppress protein aggregation and protect against cell stress, and are generally active as large oligomers consisting of multiple subunits. Alpha crystallin, an abundant protein in the mammalian lens, is a large (700 kDa) heteropolymer composed of HspB4 and HspB5, generally in a molar ratio of HspB4:HspB5 of 3:1. HspB4 does not belong to this group. HspB5 shows increased synthesis in response to stress. HspB5 is also expressed constitutively in other tissues including brain, heart, and type I and type IIa skeletal muscle fibers, and in several cancers including gliomas, renal cell carcinomas, basal-like and metaplastic breast carcinomas, and head and neck cancer. Its functions include effects on the apoptotic pathway and on metastasis. Phosphorylation of HspB5 reduces its oligomerization and anti-apoptotic activ
Probab=29.21 E-value=21 Score=18.00 Aligned_cols=27 Identities=19% Similarity=0.235 Sum_probs=18.9
Q ss_pred CCCCCCCHHHHHHHhHhcCCeeEEEeec
Q psy3346 27 NIPFQAKQSEVEELFKAFGELKFVRLPK 54 (79)
Q Consensus 27 nl~~~~~~~~l~~~f~~~g~i~~~~~~~ 54 (79)
.||..+..+.+..-|..-| +..+.+++
T Consensus 57 ~LP~~vd~~~i~A~~~~dG-vL~I~lPk 83 (84)
T cd06498 57 RIPADVDPLTITSSLSPDG-VLTVCGPR 83 (84)
T ss_pred ECCCCCChHHcEEEeCCCC-EEEEEEeC
Confidence 5888888888887775555 66666653
No 204
>TIGR01873 cas_CT1978 CRISPR-associated endoribonuclease Cas2, E. coli subfamily. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This model represents a minor branch of the Cas2 family of CRISPR-associated endonuclease, whereas most Cas2 proteins are modeled instead by TIGR01573. This form of Cas2 is characteristic for the Ecoli subtype of CRISPR/Cas locus.
Probab=29.15 E-value=93 Score=16.13 Aligned_cols=49 Identities=12% Similarity=0.106 Sum_probs=27.7
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHh-cCCeeEEEeecccCCCCCcceEEEEEecC
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKA-FGELKFVRLPKKMVGSGLHRGFGFVEFIT 72 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~-~g~i~~~~~~~~~~~~~~~~g~~fv~f~~ 72 (79)
..-+||++++..+-+.--..+-+. .+. -.+.+... +....||.|-++..
T Consensus 25 ~~GVyVg~~s~rVRe~lW~~v~~~~~~~-G~avm~~~---~~~e~G~~~~t~G~ 74 (87)
T TIGR01873 25 RAGVYVGGVSASVRERIWDYLAQHCPPK-GSLVITWS---SNTCPGFEFFTLGE 74 (87)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHHhCCCC-ccEEEEEe---CCCCCCcEEEecCC
Confidence 346899988888776554444444 232 22333332 22345788887754
No 205
>KOG3424|consensus
Probab=28.69 E-value=1.1e+02 Score=16.95 Aligned_cols=46 Identities=20% Similarity=0.323 Sum_probs=26.4
Q ss_pred CCCHHHHHHHhHhcCCe--eEEEe--ecccCCCCCcceEEEEEecCHHHHh
Q psy3346 31 QAKQSEVEELFKAFGEL--KFVRL--PKKMVGSGLHRGFGFVEFITKNEAK 77 (79)
Q Consensus 31 ~~~~~~l~~~f~~~g~i--~~~~~--~~~~~~~~~~~g~~fv~f~~~~~a~ 77 (79)
.++.++|++-+...-.. ..+.+ .+..-..|++.|||.| |.+.+.|.
T Consensus 34 ~vsK~EirEKla~mYkt~~d~V~vfgfrt~~GggkstgfalI-Ydsve~ak 83 (132)
T KOG3424|consen 34 NVSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTGFALI-YDSVEYAK 83 (132)
T ss_pred CCCHHHHHHHHHHHhcCCcceEEEEEeeeccCCcccceeeee-eehHHHHH
Confidence 57788888877653221 11111 1221146778888877 77777654
No 206
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=28.68 E-value=64 Score=14.11 Aligned_cols=19 Identities=16% Similarity=0.473 Sum_probs=15.3
Q ss_pred eCCCCCCCHHHHHHHhHhc
Q psy3346 26 RNIPFQAKQSEVEELFKAF 44 (79)
Q Consensus 26 ~nl~~~~~~~~l~~~f~~~ 44 (79)
-+|+..++.++|+..|...
T Consensus 6 Lgl~~~~~~~~ik~~y~~l 24 (55)
T cd06257 6 LGVPPDASDEEIKKAYRKL 24 (55)
T ss_pred cCCCCCCCHHHHHHHHHHH
Confidence 3688899999998888654
No 207
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=27.56 E-value=62 Score=13.58 Aligned_cols=16 Identities=19% Similarity=0.312 Sum_probs=13.1
Q ss_pred CCCHHHHHHHhHhcCC
Q psy3346 31 QAKQSEVEELFKAFGE 46 (79)
Q Consensus 31 ~~~~~~l~~~f~~~g~ 46 (79)
+.+.++|++.+..+|-
T Consensus 3 tWs~~~L~~wL~~~gi 18 (38)
T PF10281_consen 3 TWSDSDLKSWLKSHGI 18 (38)
T ss_pred CCCHHHHHHHHHHcCC
Confidence 3578899999999884
No 208
>PF09702 Cas_Csa5: CRISPR-associated protein (Cas_Csa5); InterPro: IPR010157 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a minor family of Cas protein found in various species of Sulfolobus and Pyrococcus (all archaeal). It is found with two different CRISPR loci in Sulfolobus solfataricus.
Probab=27.28 E-value=1.1e+02 Score=16.45 Aligned_cols=22 Identities=32% Similarity=0.643 Sum_probs=15.1
Q ss_pred CCCCCeEEEeCCCCCCCHHHHHHHh
Q psy3346 17 KQTGSKILVRNIPFQAKQSEVEELF 41 (79)
Q Consensus 17 ~~~~~~l~v~nl~~~~~~~~l~~~f 41 (79)
+.++..++++.||.+ +++..|+
T Consensus 61 ekeg~~i~~g~lPt~---~eVe~Fl 82 (105)
T PF09702_consen 61 EKEGNYIIVGYLPTD---EEVEDFL 82 (105)
T ss_pred cCCCCEEecCCCCCh---HHHHHHH
Confidence 556788999888854 4555554
No 209
>COG5310 Homospermidine synthase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=27.19 E-value=90 Score=20.77 Aligned_cols=29 Identities=10% Similarity=0.304 Sum_probs=23.4
Q ss_pred eEEEeCCCCCCCHHHHHHHhHhcCCeeEEE
Q psy3346 22 KILVRNIPFQAKQSEVEELFKAFGELKFVR 51 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~~~g~i~~~~ 51 (79)
.=|+-||+.+++.-+|..++.++| +.++.
T Consensus 87 qgf~vnLSvd~~s~Dlmr~crk~~-vLYid 115 (481)
T COG5310 87 QGFCVNLSVDTSSLDLMRLCRKHG-VLYID 115 (481)
T ss_pred ceEEEEeEeccchhHHHHHHHHcC-eEEEe
Confidence 456779999999999999999998 45553
No 210
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=26.43 E-value=92 Score=15.19 Aligned_cols=18 Identities=6% Similarity=0.183 Sum_probs=14.5
Q ss_pred CCCCCHHHHHHHhHhcCC
Q psy3346 29 PFQAKQSEVEELFKAFGE 46 (79)
Q Consensus 29 ~~~~~~~~l~~~f~~~g~ 46 (79)
.+.++++.|+.++.+-|.
T Consensus 29 NPpine~mir~M~~QMG~ 46 (64)
T PF03672_consen 29 NPPINEKMIRAMMMQMGR 46 (64)
T ss_pred CCCCCHHHHHHHHHHhCC
Confidence 345789999999998884
No 211
>PF00313 CSD: 'Cold-shock' DNA-binding domain; InterPro: IPR002059 When Escherichia coli is exposed to a temperature drop from 37 to 10 degrees centigrade, a 4-5 hour lag phase occurs, after which growth is resumed at a reduced rate []. During the lag phase, the expression of around 13 proteins, which contain specific DNA-binding regions [], is increased 2-10 fold. These so-called 'cold shock' proteins are thought to help the cell to survive in temperatures lower than optimum growth temperature, by contrast with heat shock proteins, which help the cell to survive in temperatures greater than the optimum, possibly by condensation of the chromosome and organisation of the prokaryotic nucleoid []. A conserved domain of about 70 amino acids has been found in prokaryotic and eukaryotic DNA-binding proteins [, , ]. This domain is known as the 'cold-shock domain' (CSD), part of which is highly similar [] to the RNP-1 RNA-binding motif.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1HZC_A 1I5F_A 1HZ9_B 1C9O_B 1HZB_B 1HZA_A 2HAX_B 2L15_A 2LSS_A 3I2Z_B ....
Probab=26.31 E-value=59 Score=15.18 Aligned_cols=12 Identities=25% Similarity=0.659 Sum_probs=8.5
Q ss_pred cceEEEEEecCH
Q psy3346 62 HRGFGFVEFITK 73 (79)
Q Consensus 62 ~~g~~fv~f~~~ 73 (79)
.+||+||.-...
T Consensus 11 ~kgyGFI~~~~~ 22 (66)
T PF00313_consen 11 EKGYGFITSDDG 22 (66)
T ss_dssp TTTEEEEEETTS
T ss_pred CCCceEEEEccc
Confidence 358999986543
No 212
>cd04458 CSP_CDS Cold-Shock Protein (CSP) contains an S1-like cold-shock domain (CSD) that is found in eukaryotes, prokaryotes, and archaea. CSP's include the major cold-shock proteins CspA and CspB in bacteria and the eukaryotic gene regulatory factor Y-box protein. CSP expression is up-regulated by an abrupt drop in growth temperature. CSP's are also expressed under normal condition at lower level. The function of cold-shock proteins is not fully understood. They preferentially bind poly-pyrimidine region of single-stranded RNA and DNA. CSP's are thought to bind mRNA and regulate ribosomal translation, mRNA degradation, and the rate of transcription termination. The human Y-box protein, which contains a CSD, regulates transcription and translation of genes that contain the Y-box sequence in their promoters. This specific ssDNA-binding properties of CSD are required for the binding of Y-box protein to the promoter's Y-box sequence, thereby regulating transcription.
Probab=26.26 E-value=45 Score=15.56 Aligned_cols=10 Identities=40% Similarity=0.843 Sum_probs=7.5
Q ss_pred ceEEEEEecC
Q psy3346 63 RGFGFVEFIT 72 (79)
Q Consensus 63 ~g~~fv~f~~ 72 (79)
+||+||.=..
T Consensus 12 kGfGFI~~~~ 21 (65)
T cd04458 12 KGFGFITPDD 21 (65)
T ss_pred CCeEEEecCC
Confidence 6999997544
No 213
>PF09336 Vps4_C: Vps4 C terminal oligomerisation domain; InterPro: IPR015415 This domain is found at the C-terminal of ATPase proteins involved in vacuolar sorting. It forms an alpha helix structure and is required for oligomerisation []. ; PDB: 1XWI_A 3EIH_C 2QPA_C 3EIE_A 2RKO_A 2QP9_X 3MHV_C 3CF3_C 3CF1_A 3CF2_A ....
Probab=26.16 E-value=45 Score=15.89 Aligned_cols=15 Identities=7% Similarity=0.277 Sum_probs=7.4
Q ss_pred eCCCCCCCHHHHHHH
Q psy3346 26 RNIPFQAKQSEVEEL 40 (79)
Q Consensus 26 ~nl~~~~~~~~l~~~ 40 (79)
.++.++++.++|..+
T Consensus 40 ~~~kpSVs~~dl~~y 54 (62)
T PF09336_consen 40 KKVKPSVSQEDLKKY 54 (62)
T ss_dssp HTCGGSS-HHHHHHH
T ss_pred HHcCCCCCHHHHHHH
Confidence 344555666665443
No 214
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=26.05 E-value=95 Score=18.14 Aligned_cols=18 Identities=17% Similarity=0.379 Sum_probs=15.7
Q ss_pred CCCHHHHHHHhHhcCCee
Q psy3346 31 QAKQSEVEELFKAFGELK 48 (79)
Q Consensus 31 ~~~~~~l~~~f~~~g~i~ 48 (79)
..+.+.+..++.+||+|-
T Consensus 95 ~~t~e~~~~LL~~yGPLw 112 (166)
T PF12385_consen 95 SYTAEGLANLLREYGPLW 112 (166)
T ss_pred ccCHHHHHHHHHHcCCeE
Confidence 678899999999999863
No 215
>PF00220 Hormone_4: Neurohypophysial hormones, N-terminal Domain; InterPro: IPR022423 Oxytocin (or ocytocin) and vasopressin [] are small (nine amino acid residues), structurally and functionally related neurohypophysial peptide hormones. Oxytocin causes contraction of the smooth muscle of the uterus and of the mammary gland while vasopressin has a direct antidiuretic action on the kidney and also causes vasoconstriction of the peripheral vessels. Like the majority of active peptides, both hormones are synthesized as larger protein precursors that are enzymatically converted to their mature forms. Peptides belonging to this family are also found in birds, fish, reptiles and amphibians (mesotocin, isotocin, valitocin, glumitocin, aspargtocin, vasotocin, seritocin, asvatocin, phasvatocin), in worms (annetocin), octopi (cephalotocin), locust (locupressin or neuropeptide F1/F2) and in molluscs (conopressins G and S) []. The pattern developed to detect this category of peptides spans their entire sequence and includes four invariant amino acid residues. .; GO: 0005185 neurohypophyseal hormone activity, 0005576 extracellular region
Probab=25.87 E-value=32 Score=10.08 Aligned_cols=6 Identities=50% Similarity=1.005 Sum_probs=2.8
Q ss_pred EEeCCC
Q psy3346 24 LVRNIP 29 (79)
Q Consensus 24 ~v~nl~ 29 (79)
||.|-|
T Consensus 2 ~i~nCP 7 (9)
T PF00220_consen 2 YIRNCP 7 (9)
T ss_pred ccccCC
Confidence 444544
No 216
>PF05189 RTC_insert: RNA 3'-terminal phosphate cyclase (RTC), insert domain; InterPro: IPR013796 RNA cyclases are a family of RNA-modifying enzymes that are conserved in eukaryotes, bacteria and archaea. RNA 3'-terminal phosphate cyclase (6.5.1.4 from EC) [, ] catalyses the conversion of 3'-phosphate to a 2',3'-cyclic phosphodiester at the end of RNA. ATP + RNA 3'-terminal-phosphate = AMP + diphosphate + RNA terminal-2',3'-cyclic-phosphate These enzymes might be responsible for production of the cyclic phosphate RNA ends that are known to be required by many RNA ligases in both prokaryotes and eukaryotes. RNA cyclase is a protein of from 36 to 42 kDa. The best conserved region is a glycine-rich stretch of residues located in the central part of the sequence and which is reminiscent of various ATP, GTP or AMP glycine-rich loops. The crystal structure of RNA 3'-terminal phosphate cyclase shows that each molecule consists of two domains. The larger domain contains three repeats of a folding unit comprising two parallel alpha helices and a four-stranded beta sheet; this fold was previously identified in translation initiation factor 3 (IF3). The large domain is similar to one of the two domains of 5-enolpyruvylshikimate-3-phosphate synthase and UDP-N-acetylglucosamine enolpyruvyl transferase. The smaller domain uses a similar secondary structure element with different topology, observed in many other proteins such as thioredoxin []. Although the active site of this enzyme could not be unambiguously assigned, it can be mapped to a region surrounding His309, an adenylate acceptor, in which a number of amino acids are highly conserved in the enzyme from different sources []. This entry contains the insert-domain of approximately 100 amino acids.; PDB: 3TV1_B 3KGD_D 1QMI_D 1QMH_B 3TUX_A 3TUT_A 3TW3_A 3PQV_A.
Probab=25.80 E-value=1.1e+02 Score=15.84 Aligned_cols=47 Identities=13% Similarity=0.158 Sum_probs=25.5
Q ss_pred EEEeCCCCCCCHHHHH---HHhHhcCCeeEEEe--ecccCCCCCcceEEEEEe
Q psy3346 23 ILVRNIPFQAKQSEVE---ELFKAFGELKFVRL--PKKMVGSGLHRGFGFVEF 70 (79)
Q Consensus 23 l~v~nl~~~~~~~~l~---~~f~~~g~i~~~~~--~~~~~~~~~~~g~~fv~f 70 (79)
.|+.+||..+.+.++. ..|..+..-..+.. .... ..+.+.|++.+.+
T Consensus 13 a~~a~lp~~va~R~~~~a~~~L~~~~~~v~i~~d~~~~~-~~~~~~G~gi~l~ 64 (103)
T PF05189_consen 13 AFVAGLPSSVAERMANAARKRLNWYGPDVEIETDYRESD-DSAFGPGSGISLV 64 (103)
T ss_dssp EEEESS-CHHHHHHHHHHHHHHCTTCSEEEEEEEEE-CC-CCGCSSEEEEEEE
T ss_pred EEEcCCCHHHHHHHHHHHHHHhhhhCCCeEEEEecccCc-cCCCCCceEEEEE
Confidence 5788999999887654 44444443333332 1112 4555667765444
No 217
>PF14112 DUF4284: Domain of unknown function (DUF4284)
Probab=25.32 E-value=1.1e+02 Score=16.58 Aligned_cols=18 Identities=33% Similarity=0.608 Sum_probs=14.0
Q ss_pred eEEEeCCCCCCCHHHHHHHhH
Q psy3346 22 KILVRNIPFQAKQSEVEELFK 42 (79)
Q Consensus 22 ~l~v~nl~~~~~~~~l~~~f~ 42 (79)
.|||||.+ +++++.++|.
T Consensus 3 siWiG~f~---s~~el~~Y~e 20 (122)
T PF14112_consen 3 SIWIGNFK---SEDELEEYFE 20 (122)
T ss_pred EEEEecCC---CHHHHHHHhC
Confidence 58999775 7888888874
No 218
>PHA00147 upper collar protein
Probab=24.93 E-value=1.4e+02 Score=19.17 Aligned_cols=24 Identities=21% Similarity=0.293 Sum_probs=20.2
Q ss_pred eCCCCCCCHHHHHHHhHhcCCeeE
Q psy3346 26 RNIPFQAKQSEVEELFKAFGELKF 49 (79)
Q Consensus 26 ~nl~~~~~~~~l~~~f~~~g~i~~ 49 (79)
.|||++++..-|++.+-++|-+..
T Consensus 42 eglP~~idp~flEk~i~q~G~v~f 65 (308)
T PHA00147 42 EGLPNTIDPSFLEKSIHQNGYVAF 65 (308)
T ss_pred cCCCCCCCHHHHHHHHHHcCceEE
Confidence 489999999999999988886543
No 219
>PRK11230 glycolate oxidase subunit GlcD; Provisional
Probab=24.25 E-value=2.4e+02 Score=19.30 Aligned_cols=43 Identities=14% Similarity=0.046 Sum_probs=27.2
Q ss_pred HHHHHHHhH----hcCCeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 34 QSEVEELFK----AFGELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 34 ~~~l~~~f~----~~g~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
.-+|..+|. .+|-|.++.+...+ ........++.|.+.+++..
T Consensus 203 g~dl~~l~~Gs~GtlGIIt~atlkl~p--~p~~~~~~~~~f~~~~~a~~ 249 (499)
T PRK11230 203 GFDLLALFTGSEGMLGVVTEVTVKLLP--KPPVARVLLASFDSVEKAGL 249 (499)
T ss_pred ccchHhhhccCCCccEEEEEEEEEEEc--CCcceEEEEEECCCHHHHHH
Confidence 345666553 46778887775542 23345667788988887654
No 220
>KOG0821|consensus
Probab=24.09 E-value=1.3e+02 Score=18.87 Aligned_cols=25 Identities=12% Similarity=0.313 Sum_probs=18.9
Q ss_pred CCCeEEEeCCCCCCCHHHHHHHhHh
Q psy3346 19 TGSKILVRNIPFQAKQSEVEELFKA 43 (79)
Q Consensus 19 ~~~~l~v~nl~~~~~~~~l~~~f~~ 43 (79)
.+..-.|||||++++.--+-+.+..
T Consensus 128 ~p~~H~IGNLPf~i~~pliik~l~~ 152 (326)
T KOG0821|consen 128 PPNVHIIGNLPFSVSTPLIIKWLEN 152 (326)
T ss_pred CCceEEeccCCccccchHHHHHHhh
Confidence 4456679999999988777776654
No 221
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=23.51 E-value=23 Score=18.37 Aligned_cols=19 Identities=11% Similarity=0.314 Sum_probs=15.9
Q ss_pred CCeEEEeCCCCCCCHHHHH
Q psy3346 20 GSKILVRNIPFQAKQSEVE 38 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~ 38 (79)
...|.|.+||..+..+.|+
T Consensus 23 ~~~i~~~~Lp~~~d~~Sl~ 41 (104)
T PF13600_consen 23 ENEIIFEGLPPSLDPDSLR 41 (104)
T ss_pred ceEEEEeCCCcccCCCcEE
Confidence 3589999999999887764
No 222
>PRK04239 hypothetical protein; Provisional
Probab=23.16 E-value=1e+02 Score=16.72 Aligned_cols=20 Identities=10% Similarity=0.227 Sum_probs=16.8
Q ss_pred eCCCCCCCHHHHHHHhHhcC
Q psy3346 26 RNIPFQAKQSEVEELFKAFG 45 (79)
Q Consensus 26 ~nl~~~~~~~~l~~~f~~~g 45 (79)
|.|+..+++++|..++....
T Consensus 80 G~i~~ki~e~~L~~lL~~v~ 99 (110)
T PRK04239 80 GRIQGPIDDEQLKEILEQLT 99 (110)
T ss_pred CCCCCCcCHHHHHHHHHHHh
Confidence 56788999999999998764
No 223
>COG2088 SpoVG Uncharacterized protein, involved in the regulation of septum location [Cell envelope biogenesis, outer membrane]
Probab=23.14 E-value=1.3e+02 Score=15.76 Aligned_cols=25 Identities=20% Similarity=0.183 Sum_probs=14.6
Q ss_pred eeEEEeecccCCCCCcceEEEEEecC
Q psy3346 47 LKFVRLPKKMVGSGLHRGFGFVEFIT 72 (79)
Q Consensus 47 i~~~~~~~~~~~~~~~~g~~fv~f~~ 72 (79)
|..+++..-. ..|+-+.|+-|+|.+
T Consensus 3 iTdVRirkv~-~dgrmkA~vsvT~D~ 27 (95)
T COG2088 3 ITDVRIRKVD-TDGRMKAYVSVTLDN 27 (95)
T ss_pred ceeEEEEEec-CCCcEEEEEEEEecc
Confidence 4556654433 456666777777654
No 224
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=22.77 E-value=92 Score=13.90 Aligned_cols=19 Identities=16% Similarity=0.445 Sum_probs=15.2
Q ss_pred eCCCCCCCHHHHHHHhHhc
Q psy3346 26 RNIPFQAKQSEVEELFKAF 44 (79)
Q Consensus 26 ~nl~~~~~~~~l~~~f~~~ 44 (79)
-+|+..++.++|+.-|...
T Consensus 7 Lgl~~~~~~~~ik~ay~~l 25 (60)
T smart00271 7 LGVPRDASLDEIKKAYRKL 25 (60)
T ss_pred cCCCCCCCHHHHHHHHHHH
Confidence 3688889999998888654
No 225
>PRK00523 hypothetical protein; Provisional
Probab=22.00 E-value=1.2e+02 Score=15.13 Aligned_cols=17 Identities=6% Similarity=0.282 Sum_probs=14.2
Q ss_pred CCCCHHHHHHHhHhcCC
Q psy3346 30 FQAKQSEVEELFKAFGE 46 (79)
Q Consensus 30 ~~~~~~~l~~~f~~~g~ 46 (79)
+.++|+.++..+.+-|.
T Consensus 38 Ppine~mir~M~~QMGq 54 (72)
T PRK00523 38 PPITENMIRAMYMQMGR 54 (72)
T ss_pred cCCCHHHHHHHHHHhCC
Confidence 56789999999998884
No 226
>PRK13916 plasmid segregation protein ParR; Provisional
Probab=21.99 E-value=36 Score=17.63 Aligned_cols=21 Identities=19% Similarity=0.554 Sum_probs=16.2
Q ss_pred EEeCCCCCCCHHHHHHHhHhc
Q psy3346 24 LVRNIPFQAKQSEVEELFKAF 44 (79)
Q Consensus 24 ~v~nl~~~~~~~~l~~~f~~~ 44 (79)
|+.|||..+-...+++.+.+|
T Consensus 23 FL~~~P~GT~~~~iR~~L~rY 43 (97)
T PRK13916 23 FLENVPRGTKTAHIREALRRY 43 (97)
T ss_pred HHHHCCCCCccHHHHHHHHHH
Confidence 466888888887788777766
No 227
>PRK14896 ksgA 16S ribosomal RNA methyltransferase KsgA/Dim1 family protein; Provisional
Probab=21.98 E-value=1.1e+02 Score=18.64 Aligned_cols=22 Identities=27% Similarity=0.404 Sum_probs=17.1
Q ss_pred CeEEEeCCCCCCCHHHHHHHhH
Q psy3346 21 SKILVRNIPFQAKQSEVEELFK 42 (79)
Q Consensus 21 ~~l~v~nl~~~~~~~~l~~~f~ 42 (79)
..+.|+|+|+.++...+..++.
T Consensus 92 ~d~Vv~NlPy~i~s~~~~~l~~ 113 (258)
T PRK14896 92 FNKVVSNLPYQISSPITFKLLK 113 (258)
T ss_pred ceEEEEcCCcccCcHHHHHHHh
Confidence 3578999999998777776664
No 228
>KOG2855|consensus
Probab=21.27 E-value=2.5e+02 Score=18.40 Aligned_cols=47 Identities=9% Similarity=0.127 Sum_probs=33.0
Q ss_pred CCeEEEeCCCCCCCHHHHHHHhHhcCC-eeEEEeecccCCCCCcceEEEEEec
Q psy3346 20 GSKILVRNIPFQAKQSEVEELFKAFGE-LKFVRLPKKMVGSGLHRGFGFVEFI 71 (79)
Q Consensus 20 ~~~l~v~nl~~~~~~~~l~~~f~~~g~-i~~~~~~~~~~~~~~~~g~~fv~f~ 71 (79)
.++.||+++=.+.--++|...+.++|- ...+.. ..+.+.++++|...
T Consensus 61 ~~~afiGkvGdD~fG~~l~~~L~~~~V~~~~v~~-----~~~~~T~~a~i~v~ 108 (330)
T KOG2855|consen 61 GRVAFIGKVGDDEFGDDLLDILKQNGVDTSGVKF-----DENARTACATITVS 108 (330)
T ss_pred cceeeeecccchhhHHHHHHHHhhCCccccccee-----cCCCceEEEEEEEc
Confidence 688999999988888999999999762 223332 33455677776543
No 229
>TIGR00006 S-adenosyl-methyltransferase MraW. Genetics paper in 1972 links mra cluster to peptidoglycan biosynthesis in E. coli. Seems to be common in proteobacteria.wn.
Probab=20.65 E-value=90 Score=20.01 Aligned_cols=19 Identities=32% Similarity=0.677 Sum_probs=16.0
Q ss_pred CCCHHHHHHHhHhcCCeeE
Q psy3346 31 QAKQSEVEELFKAFGELKF 49 (79)
Q Consensus 31 ~~~~~~l~~~f~~~g~i~~ 49 (79)
.+++++|..+|..||+...
T Consensus 140 ~~~e~~L~~i~~~yGEe~~ 158 (305)
T TIGR00006 140 TYSEEDLERILKKYGEEKF 158 (305)
T ss_pred hCCHHHHHHHHHHhcCcch
Confidence 5788999999999997643
No 230
>PF08734 GYD: GYD domain; InterPro: IPR014845 These proteins of unknown function are usually less than 100 amino acids in length. They may belong to the dimeric alpha/beta barrel superfamily.
Probab=20.30 E-value=1.4e+02 Score=15.21 Aligned_cols=39 Identities=26% Similarity=0.182 Sum_probs=25.5
Q ss_pred HHHHHHhHhcC-CeeEEEeecccCCCCCcceEEEEEecCHHHHhc
Q psy3346 35 SEVEELFKAFG-ELKFVRLPKKMVGSGLHRGFGFVEFITKNEAKR 78 (79)
Q Consensus 35 ~~l~~~f~~~g-~i~~~~~~~~~~~~~~~~g~~fv~f~~~~~a~~ 78 (79)
+.+++..+..| .+.+..+.. |..-....+++.+.+.+.+
T Consensus 23 ~a~~~~~e~~Gg~l~~~y~t~-----G~yD~v~i~eaPD~~~a~~ 62 (91)
T PF08734_consen 23 EAVRALIEALGGKLKSFYWTL-----GEYDFVVIVEAPDDETAAA 62 (91)
T ss_pred HHHHHHHHHcCCEEEEEEEec-----CCCCEEEEEEcCCHHHHHH
Confidence 45677777776 566666644 4444667788888776643
Done!