Query psy7050
Match_columns 61
No_of_seqs 137 out of 1598
Neff 10.0
Searched_HMMs 46136
Date Fri Aug 16 16:52:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7050.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7050hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN03134 glycine-rich RNA-bind 99.7 1.7E-16 3.7E-21 81.4 6.8 52 10-61 31-82 (144)
2 KOG0149|consensus 99.7 1.6E-16 3.4E-21 85.9 6.1 50 12-61 11-60 (247)
3 KOG4207|consensus 99.7 3.6E-16 7.8E-21 83.7 5.8 61 1-61 1-61 (256)
4 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.6 2.7E-15 5.8E-20 84.8 7.7 51 11-61 267-317 (352)
5 TIGR01661 ELAV_HUD_SF ELAV/HuD 99.6 1.6E-15 3.4E-20 85.8 6.8 49 13-61 3-51 (352)
6 TIGR01659 sex-lethal sex-letha 99.6 2.7E-15 5.8E-20 85.6 7.5 53 9-61 103-155 (346)
7 TIGR01648 hnRNP-R-Q heterogene 99.6 2E-14 4.2E-19 86.1 7.4 58 3-61 43-105 (578)
8 TIGR01659 sex-lethal sex-letha 99.6 1.6E-14 3.4E-19 82.5 6.6 52 10-61 190-241 (346)
9 KOG0113|consensus 99.5 1.8E-14 3.9E-19 80.3 5.8 51 11-61 99-149 (335)
10 KOG0122|consensus 99.5 2.1E-14 4.6E-19 78.2 5.4 53 9-61 185-237 (270)
11 TIGR01645 half-pint poly-U bin 99.5 3.7E-14 8.1E-19 85.3 6.5 51 11-61 105-155 (612)
12 KOG0126|consensus 99.5 3E-15 6.5E-20 78.9 1.3 52 9-60 31-82 (219)
13 KOG0121|consensus 99.5 2.9E-14 6.2E-19 71.7 4.4 51 11-61 34-84 (153)
14 KOG0117|consensus 99.5 7.1E-14 1.5E-18 81.2 6.0 51 11-61 81-131 (506)
15 PF00076 RRM_1: RNA recognitio 99.5 1.2E-13 2.7E-18 62.4 5.5 45 16-61 1-45 (70)
16 KOG0144|consensus 99.5 1.1E-13 2.3E-18 80.3 5.3 51 11-61 32-82 (510)
17 TIGR01622 SF-CC1 splicing fact 99.5 3.1E-13 6.7E-18 78.7 7.0 50 12-61 88-137 (457)
18 TIGR01642 U2AF_lg U2 snRNP aux 99.5 4.3E-13 9.2E-18 78.9 7.3 50 12-61 294-343 (509)
19 TIGR01645 half-pint poly-U bin 99.5 2.8E-13 6E-18 81.6 6.3 50 12-61 203-252 (612)
20 KOG0125|consensus 99.4 2.3E-13 5E-18 76.8 5.1 51 9-61 92-142 (376)
21 TIGR01628 PABP-1234 polyadenyl 99.4 4.6E-13 1E-17 79.8 6.5 48 14-61 1-48 (562)
22 PF14259 RRM_6: RNA recognitio 99.4 8E-13 1.7E-17 60.1 5.7 45 16-61 1-45 (70)
23 PLN03213 repressor of silencin 99.4 5.2E-13 1.1E-17 79.0 6.2 46 12-61 9-54 (759)
24 KOG0131|consensus 99.4 2.8E-13 6.2E-18 71.3 4.3 54 8-61 4-57 (203)
25 PLN03120 nucleic acid binding 99.4 7.7E-13 1.7E-17 73.0 6.1 46 13-61 4-49 (260)
26 PLN03121 nucleic acid binding 99.4 2E-12 4.3E-17 70.7 6.4 47 12-61 4-50 (243)
27 TIGR01622 SF-CC1 splicing fact 99.4 2.1E-12 4.5E-17 75.3 6.4 49 13-61 186-234 (457)
28 KOG0127|consensus 99.4 1.8E-12 3.9E-17 77.1 6.0 51 11-61 290-340 (678)
29 KOG0145|consensus 99.4 2.5E-13 5.5E-18 75.1 2.2 60 2-61 116-175 (360)
30 COG0724 RNA-binding proteins ( 99.4 3.8E-12 8.2E-17 68.5 6.0 49 13-61 115-163 (306)
31 KOG0130|consensus 99.3 2.5E-12 5.4E-17 65.4 4.6 55 7-61 66-120 (170)
32 KOG0107|consensus 99.3 5.5E-12 1.2E-16 66.2 5.1 46 11-61 8-53 (195)
33 TIGR01628 PABP-1234 polyadenyl 99.3 8.2E-12 1.8E-16 74.5 6.5 50 11-61 283-332 (562)
34 KOG0148|consensus 99.3 6.6E-12 1.4E-16 69.6 4.7 51 10-60 59-109 (321)
35 KOG4205|consensus 99.3 8.6E-12 1.9E-16 70.5 5.0 50 12-61 5-54 (311)
36 KOG0111|consensus 99.3 2.2E-12 4.7E-17 70.1 1.9 52 10-61 7-58 (298)
37 KOG0124|consensus 99.2 3.9E-12 8.4E-17 73.2 2.4 49 13-61 113-161 (544)
38 KOG0108|consensus 99.2 1.8E-11 3.9E-16 71.7 4.9 48 14-61 19-66 (435)
39 smart00362 RRM_2 RNA recogniti 99.2 5.9E-11 1.3E-15 53.0 5.5 45 15-61 1-45 (72)
40 KOG0144|consensus 99.2 5.1E-12 1.1E-16 73.5 1.7 48 13-61 124-171 (510)
41 KOG0145|consensus 99.2 6.9E-11 1.5E-15 65.6 5.9 53 9-61 274-326 (360)
42 smart00360 RRM RNA recognition 99.2 9.6E-11 2.1E-15 52.1 4.8 44 18-61 1-44 (71)
43 KOG0114|consensus 99.2 2.5E-10 5.4E-15 55.9 6.1 48 10-60 15-62 (124)
44 KOG0148|consensus 99.1 1.6E-10 3.4E-15 64.3 5.3 46 10-61 161-206 (321)
45 KOG0147|consensus 99.1 2.6E-11 5.7E-16 71.8 2.3 46 16-61 281-326 (549)
46 KOG0127|consensus 99.1 1.8E-10 3.9E-15 68.8 4.7 49 13-61 5-53 (678)
47 KOG0124|consensus 99.1 2.9E-10 6.2E-15 65.7 5.3 50 12-61 209-258 (544)
48 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.1 3.4E-10 7.4E-15 67.0 5.7 43 13-61 2-44 (481)
49 KOG0105|consensus 99.1 4.2E-10 9.2E-15 60.0 4.9 48 11-61 4-51 (241)
50 cd00590 RRM RRM (RNA recogniti 99.1 1.1E-09 2.5E-14 49.0 5.8 46 15-61 1-46 (74)
51 TIGR01649 hnRNP-L_PTB hnRNP-L/ 99.1 1.1E-09 2.4E-14 64.8 6.7 45 12-61 274-319 (481)
52 KOG0109|consensus 99.0 7E-10 1.5E-14 62.3 3.4 35 14-48 3-37 (346)
53 KOG0146|consensus 99.0 1.7E-09 3.7E-14 60.4 4.8 53 9-61 281-333 (371)
54 KOG0415|consensus 98.9 2.7E-09 5.8E-14 61.5 3.8 48 14-61 240-287 (479)
55 KOG4205|consensus 98.9 4.2E-09 9.2E-14 59.8 4.6 49 13-61 97-145 (311)
56 KOG0146|consensus 98.8 3.1E-09 6.8E-14 59.4 3.2 49 12-61 18-66 (371)
57 KOG0131|consensus 98.8 4.7E-09 1E-13 55.6 3.2 51 10-60 93-144 (203)
58 KOG4208|consensus 98.8 2.5E-08 5.4E-13 53.7 5.4 53 9-61 45-98 (214)
59 TIGR01642 U2AF_lg U2 snRNP aux 98.8 3.6E-08 7.7E-13 58.4 6.4 46 10-61 172-229 (509)
60 KOG0110|consensus 98.8 6.2E-09 1.3E-13 63.6 3.2 49 13-61 613-661 (725)
61 TIGR01648 hnRNP-R-Q heterogene 98.7 4.1E-08 8.9E-13 59.5 6.0 42 12-61 232-275 (578)
62 KOG0153|consensus 98.7 3.1E-08 6.8E-13 56.7 4.7 48 8-61 223-270 (377)
63 KOG0116|consensus 98.7 4.5E-08 9.8E-13 57.5 4.5 49 13-61 288-336 (419)
64 PF04059 RRM_2: RNA recognitio 98.7 2.1E-07 4.6E-12 45.1 6.1 47 14-60 2-50 (97)
65 KOG4212|consensus 98.6 1.3E-07 2.8E-12 56.0 5.1 48 13-61 44-92 (608)
66 KOG0117|consensus 98.6 1E-07 2.2E-12 56.2 4.0 40 13-60 259-298 (506)
67 KOG0123|consensus 98.6 1.6E-07 3.5E-12 54.5 4.6 45 14-61 77-121 (369)
68 KOG4661|consensus 98.5 1.8E-07 4E-12 56.9 4.5 49 13-61 405-453 (940)
69 KOG4206|consensus 98.5 5.2E-07 1.1E-11 49.1 5.8 47 11-60 7-57 (221)
70 KOG4209|consensus 98.5 1.2E-07 2.7E-12 52.0 3.3 51 11-61 99-149 (231)
71 smart00361 RRM_1 RNA recogniti 98.5 3.2E-07 7E-12 41.8 3.9 35 27-61 2-43 (70)
72 KOG0109|consensus 98.5 1.9E-07 4.2E-12 52.7 3.3 38 11-48 76-113 (346)
73 KOG0132|consensus 98.5 2.7E-07 5.9E-12 57.3 4.1 43 13-61 421-463 (894)
74 KOG0123|consensus 98.3 1.7E-06 3.6E-11 50.4 4.8 51 10-61 267-317 (369)
75 KOG4454|consensus 98.2 7.8E-07 1.7E-11 48.7 2.1 50 10-61 6-55 (267)
76 KOG4849|consensus 98.2 4.8E-07 1E-11 52.4 1.1 47 14-60 81-129 (498)
77 KOG0533|consensus 98.2 5.9E-06 1.3E-10 45.8 5.1 49 11-60 81-129 (243)
78 KOG4212|consensus 98.2 2E-06 4.4E-11 51.1 3.3 45 9-56 532-576 (608)
79 KOG0226|consensus 98.1 2.9E-06 6.2E-11 47.3 3.2 50 11-60 188-237 (290)
80 KOG4211|consensus 98.1 1.7E-05 3.7E-10 47.5 5.7 50 9-61 6-55 (510)
81 KOG0120|consensus 98.1 6.1E-06 1.3E-10 49.6 3.7 53 9-61 285-337 (500)
82 KOG1548|consensus 98.0 3.8E-05 8.3E-10 44.4 5.8 49 10-59 131-187 (382)
83 KOG0110|consensus 98.0 1.4E-05 3E-10 49.5 4.0 48 14-61 516-566 (725)
84 KOG4210|consensus 97.9 8.6E-06 1.9E-10 46.1 2.3 50 12-61 183-233 (285)
85 KOG0151|consensus 97.8 4E-05 8.7E-10 47.9 4.5 39 11-49 172-210 (877)
86 KOG0147|consensus 97.8 3.5E-06 7.5E-11 50.7 -0.3 51 11-61 177-227 (549)
87 KOG0106|consensus 97.8 2.7E-05 5.9E-10 42.6 2.9 32 14-45 2-33 (216)
88 KOG4660|consensus 97.8 4.9E-05 1.1E-09 46.0 4.0 45 12-61 74-118 (549)
89 PF13893 RRM_5: RNA recognitio 97.7 7.7E-05 1.7E-09 32.3 3.1 27 30-61 1-27 (56)
90 KOG0129|consensus 97.7 0.00026 5.6E-09 42.8 5.8 52 9-60 366-418 (520)
91 KOG0129|consensus 97.3 0.00059 1.3E-08 41.3 4.3 33 13-46 259-291 (520)
92 KOG4211|consensus 97.3 0.0012 2.6E-08 39.9 5.2 49 12-61 102-151 (510)
93 PF08777 RRM_3: RNA binding mo 97.2 0.00049 1.1E-08 33.8 2.9 36 13-48 1-36 (105)
94 KOG1995|consensus 97.2 0.00076 1.6E-08 39.2 3.9 52 9-60 62-121 (351)
95 KOG1190|consensus 97.1 0.0007 1.5E-08 40.3 3.3 37 12-48 27-63 (492)
96 KOG1457|consensus 97.1 0.0026 5.5E-08 35.5 5.2 51 11-61 32-83 (284)
97 KOG3152|consensus 97.0 0.002 4.4E-08 36.3 3.9 39 12-50 73-111 (278)
98 KOG1457|consensus 96.9 0.002 4.4E-08 35.8 3.5 33 11-43 208-240 (284)
99 PF08675 RNA_bind: RNA binding 96.8 0.01 2.2E-07 28.3 5.2 33 14-48 10-42 (87)
100 KOG0105|consensus 96.8 0.0061 1.3E-07 33.2 4.7 40 9-48 111-150 (241)
101 KOG4206|consensus 96.6 0.012 2.5E-07 32.5 5.1 46 11-61 144-189 (221)
102 KOG0115|consensus 96.3 0.0048 1E-07 34.8 2.6 35 14-48 32-66 (275)
103 KOG0128|consensus 96.3 0.0032 7E-08 40.3 2.1 46 14-60 737-782 (881)
104 KOG0128|consensus 95.8 0.00015 3.2E-09 46.0 -5.2 50 12-61 666-715 (881)
105 KOG1855|consensus 95.4 0.038 8.2E-07 33.4 3.8 37 12-48 230-266 (484)
106 KOG1365|consensus 95.3 0.024 5.2E-07 33.9 2.9 48 13-61 280-330 (508)
107 KOG2314|consensus 95.1 0.14 3.1E-06 32.2 5.8 50 11-61 56-111 (698)
108 PF14605 Nup35_RRM_2: Nup53/35 94.6 0.1 2.2E-06 22.5 3.3 32 14-46 2-33 (53)
109 KOG1365|consensus 93.7 0.32 6.8E-06 29.4 4.9 46 14-60 162-211 (508)
110 KOG4676|consensus 93.1 0.12 2.7E-06 31.1 2.7 35 13-47 7-41 (479)
111 KOG0112|consensus 92.8 0.043 9.2E-07 35.8 0.6 37 12-48 371-407 (975)
112 KOG0106|consensus 92.3 0.13 2.8E-06 28.5 2.0 33 11-43 97-129 (216)
113 KOG1190|consensus 92.2 0.47 1E-05 28.9 4.2 38 13-50 297-335 (492)
114 KOG1456|consensus 91.0 1.2 2.6E-05 27.0 5.1 37 10-46 28-64 (494)
115 KOG4307|consensus 90.9 0.35 7.5E-06 31.3 3.0 51 10-61 431-482 (944)
116 KOG1456|consensus 90.8 1.7 3.6E-05 26.4 5.5 41 9-49 283-324 (494)
117 KOG0112|consensus 90.6 0.86 1.9E-05 30.2 4.5 36 10-45 452-487 (975)
118 KOG4307|consensus 90.0 1 2.2E-05 29.4 4.4 46 13-58 867-912 (944)
119 PF03467 Smg4_UPF3: Smg-4/UPF3 89.6 0.74 1.6E-05 24.6 3.2 30 11-40 5-35 (176)
120 KOG2193|consensus 89.5 0.33 7.2E-06 29.7 2.0 24 14-37 2-25 (584)
121 PF09707 Cas_Cas2CT1978: CRISP 87.2 1.5 3.3E-05 20.9 3.1 45 13-60 25-69 (86)
122 KOG4660|consensus 85.7 0.92 2E-05 28.4 2.4 47 14-60 389-436 (549)
123 COG5175 MOT2 Transcriptional r 85.6 1.1 2.4E-05 26.8 2.6 35 13-47 114-154 (480)
124 KOG2253|consensus 82.4 1.2 2.7E-05 28.5 2.0 35 10-44 37-71 (668)
125 KOG2416|consensus 80.8 1 2.2E-05 28.8 1.3 37 7-43 438-475 (718)
126 KOG4008|consensus 80.6 3 6.6E-05 23.7 2.9 32 9-40 36-67 (261)
127 PRK11558 putative ssRNA endonu 80.3 4.1 8.8E-05 20.0 3.0 45 13-60 27-71 (97)
128 KOG4410|consensus 79.3 11 0.00023 22.4 5.0 26 14-39 331-356 (396)
129 PF07292 NID: Nmi/IFP 35 domai 78.4 6 0.00013 19.0 3.4 22 13-34 52-73 (88)
130 COG0030 KsgA Dimethyladenosine 78.1 2.6 5.6E-05 24.1 2.2 35 14-48 96-130 (259)
131 KOG4454|consensus 76.8 0.52 1.1E-05 26.6 -0.7 49 11-60 78-130 (267)
132 PF05172 Nup35_RRM: Nup53/35/4 75.0 4.8 0.0001 19.7 2.5 30 14-44 7-36 (100)
133 PF11608 Limkain-b1: Limkain b 74.9 6.1 0.00013 19.1 2.7 30 14-43 3-37 (90)
134 KOG4210|consensus 74.5 1.7 3.7E-05 25.1 1.0 49 12-60 87-135 (285)
135 PF10567 Nab6_mRNP_bdg: RNA-re 73.4 3.2 6.9E-05 24.3 1.8 37 13-49 15-51 (309)
136 COG5584 Predicted small secret 69.5 6 0.00013 19.5 2.0 29 21-49 30-58 (103)
137 KOG2891|consensus 67.6 9.8 0.00021 22.5 3.0 35 12-46 148-194 (445)
138 PTZ00338 dimethyladenosine tra 66.1 7 0.00015 22.6 2.2 34 15-48 103-136 (294)
139 TIGR01873 cas_CT1978 CRISPR-as 65.6 14 0.0003 17.8 2.8 24 13-36 25-48 (87)
140 PRK15464 cold shock-like prote 64.3 4 8.6E-05 18.6 0.8 8 54-61 15-22 (70)
141 PF14893 PNMA: PNMA 64.1 8.7 0.00019 22.8 2.4 26 10-35 15-40 (331)
142 PRK00274 ksgA 16S ribosomal RN 63.7 9.6 0.00021 21.6 2.4 34 15-48 107-140 (272)
143 PF10309 DUF2414: Protein of u 63.2 13 0.00029 16.6 4.1 35 13-48 5-42 (62)
144 PF15513 DUF4651: Domain of un 62.6 14 0.0003 16.7 4.0 17 28-44 9-25 (62)
145 PRK15463 cold shock-like prote 61.6 4.5 9.8E-05 18.4 0.8 8 54-61 15-22 (70)
146 COG0481 LepA Membrane GTPase L 59.5 15 0.00032 23.5 2.8 45 14-59 298-343 (603)
147 KOG2318|consensus 59.3 38 0.00082 22.1 4.5 36 10-45 171-211 (650)
148 PRK09507 cspE cold shock prote 59.1 4.4 9.6E-05 18.3 0.5 8 54-61 14-21 (69)
149 PRK10943 cold shock-like prote 58.2 4.6 0.0001 18.3 0.5 8 54-61 14-21 (69)
150 PRK10354 RNA chaperone/anti-te 55.4 5.7 0.00012 18.0 0.5 8 54-61 15-22 (70)
151 PRK09890 cold shock protein Cs 55.3 5.6 0.00012 18.0 0.5 8 54-61 15-22 (70)
152 cd00027 BRCT Breast Cancer Sup 53.3 18 0.00038 15.0 3.6 26 14-39 2-27 (72)
153 PF00398 RrnaAD: Ribosomal RNA 52.9 25 0.00053 19.8 2.9 31 13-43 97-129 (262)
154 PF03468 XS: XS domain; Inter 50.4 33 0.00072 17.2 2.9 33 26-61 30-62 (116)
155 PF12157 DUF3591: Protein of u 48.8 32 0.00069 21.6 3.1 40 19-61 359-398 (457)
156 PF11411 DNA_ligase_IV: DNA li 48.7 19 0.0004 14.4 1.5 15 24-38 20-34 (36)
157 TIGR00755 ksgA dimethyladenosi 47.2 43 0.00094 18.7 3.3 25 15-39 96-120 (253)
158 PF03672 UPF0154: Uncharacteri 46.7 30 0.00064 15.7 2.1 18 22-39 29-46 (64)
159 PHA01632 hypothetical protein 46.3 29 0.00063 15.4 2.3 18 18-35 21-38 (64)
160 PF07230 Peptidase_S80: Bacter 46.3 20 0.00043 22.7 2.0 36 15-54 280-315 (501)
161 PF13046 DUF3906: Protein of u 46.2 31 0.00067 15.7 2.2 32 27-60 32-63 (64)
162 PHA02531 20 portal vertex prot 45.4 15 0.00033 23.2 1.4 36 14-53 282-317 (514)
163 PRK00523 hypothetical protein; 44.0 34 0.00074 15.9 2.1 18 22-39 37-54 (72)
164 PF15407 Spo7_2_N: Sporulation 43.9 7.6 0.00017 17.7 0.0 18 11-28 25-42 (67)
165 smart00650 rADc Ribosomal RNA 41.4 53 0.0012 17.0 3.4 23 14-36 78-100 (169)
166 PF15063 TC1: Thyroid cancer p 41.0 23 0.0005 16.7 1.3 26 14-39 26-51 (79)
167 PF01823 MACPF: MAC/Perforin d 40.0 26 0.00056 18.6 1.7 22 18-39 53-77 (212)
168 COG5638 Uncharacterized conser 39.8 98 0.0021 19.6 4.3 37 9-45 142-183 (622)
169 KOG2135|consensus 39.4 18 0.00038 22.9 1.0 35 14-48 373-408 (526)
170 PRK01844 hypothetical protein; 39.2 45 0.00097 15.5 2.1 18 22-39 36-53 (72)
171 PF11004 Kdo_hydroxy: 3-deoxy- 38.0 55 0.0012 19.3 2.8 47 10-57 18-65 (281)
172 smart00457 MACPF membrane-atta 37.9 24 0.00053 18.9 1.4 21 19-39 31-51 (194)
173 PF09176 Mpt_N: Methylene-tetr 37.4 9.2 0.0002 18.2 -0.3 30 3-33 35-64 (81)
174 PF00220 Hormone_4: Neurohypop 36.9 16 0.00035 10.0 0.3 6 17-22 2-7 (9)
175 PF14111 DUF4283: Domain of un 36.1 62 0.0013 16.3 3.5 31 16-46 107-138 (153)
176 smart00596 PRE_C2HC PRE_C2HC d 33.7 56 0.0012 15.1 3.9 26 29-54 3-29 (69)
177 KOG4365|consensus 33.3 28 0.0006 22.0 1.2 45 16-60 6-50 (572)
178 PF10281 Ish1: Putative stress 32.4 41 0.0009 13.1 1.9 16 24-39 3-18 (38)
179 PF05189 RTC_insert: RNA 3'-te 32.2 66 0.0014 15.4 4.3 45 15-59 12-61 (103)
180 KOG2591|consensus 30.9 1.5E+02 0.0032 19.6 4.0 35 12-46 174-210 (684)
181 KOG3424|consensus 30.6 86 0.0019 16.2 3.3 36 25-60 35-75 (132)
182 cd00874 RNA_Cyclase_Class_II R 30.2 88 0.0019 18.7 2.9 43 15-59 188-234 (326)
183 cd06257 DnaJ DnaJ domain or J- 29.8 51 0.0011 13.3 2.2 18 19-36 6-23 (55)
184 PF12385 Peptidase_C70: Papain 29.5 68 0.0015 17.4 2.1 20 24-43 95-114 (166)
185 PF06014 DUF910: Bacterial pro 28.6 65 0.0014 14.5 1.7 16 27-42 4-19 (62)
186 COG3763 Uncharacterized protei 28.4 74 0.0016 14.8 2.1 18 22-39 36-53 (71)
187 COG4647 AcxC Acetone carboxyla 28.3 69 0.0015 16.8 2.0 31 13-43 89-132 (165)
188 PF00226 DnaJ: DnaJ domain; I 28.3 61 0.0013 13.7 2.2 18 19-36 6-23 (64)
189 smart00703 NRF N-terminal doma 28.2 56 0.0012 15.9 1.6 23 15-37 71-94 (110)
190 KOG0008|consensus 27.3 1E+02 0.0022 22.6 3.0 39 20-61 820-859 (1563)
191 KOG0820|consensus 27.0 78 0.0017 18.9 2.2 22 14-35 124-145 (315)
192 PF04008 Adenosine_kin: Adenos 26.6 1E+02 0.0023 16.5 2.5 27 11-37 6-32 (155)
193 PF13600 DUF4140: N-terminal d 25.7 26 0.00057 16.8 0.2 35 13-47 23-57 (104)
194 smart00540 LEM in nuclear memb 24.8 70 0.0015 13.3 1.8 15 25-39 6-20 (44)
195 KOG2068|consensus 24.7 48 0.001 19.9 1.2 36 14-49 78-116 (327)
196 smart00195 DSPc Dual specifici 24.5 61 0.0013 16.0 1.4 18 15-32 7-24 (138)
197 PF11249 DUF3047: Protein of u 23.6 1.4E+02 0.003 16.2 4.3 33 28-60 145-178 (183)
198 PRK04239 hypothetical protein; 23.3 1.1E+02 0.0024 15.4 2.1 19 19-37 80-98 (110)
199 PF15051 FAM198: FAM198 protei 23.3 88 0.0019 18.8 2.0 30 14-43 3-33 (326)
200 PF01984 dsDNA_bind: Double-st 23.2 76 0.0016 15.7 1.6 20 19-38 75-94 (107)
201 PF05023 Phytochelatin: Phytoc 23.1 43 0.00092 18.7 0.7 14 48-61 199-212 (212)
202 TIGR00006 S-adenosyl-methyltra 22.3 76 0.0016 18.8 1.6 19 24-42 140-158 (305)
203 PF00054 Laminin_G_1: Laminin 22.2 72 0.0016 15.8 1.4 11 12-22 91-101 (131)
204 smart00271 DnaJ DnaJ molecular 22.2 80 0.0017 13.0 2.2 18 19-36 7-24 (60)
205 PF15023 DUF4523: Protein of u 22.1 1.5E+02 0.0032 16.0 3.5 33 13-45 86-122 (166)
206 PRK00050 16S rRNA m(4)C1402 me 22.1 78 0.0017 18.6 1.7 19 24-42 138-156 (296)
207 PF00533 BRCT: BRCA1 C Terminu 21.8 89 0.0019 13.4 2.8 29 12-41 7-35 (78)
208 KOG0984|consensus 21.4 1.9E+02 0.004 16.9 3.6 30 6-35 24-53 (282)
209 KOG0718|consensus 20.7 1E+02 0.0023 19.8 2.0 19 19-37 15-33 (546)
210 COG2118 DNA-binding protein [G 20.7 1.4E+02 0.0029 15.3 2.1 20 18-37 82-101 (116)
211 KOG4285|consensus 20.2 98 0.0021 18.7 1.8 29 16-45 200-228 (350)
No 1
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=99.69 E-value=1.7e-16 Score=81.35 Aligned_cols=52 Identities=35% Similarity=0.443 Sum_probs=48.6
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
....++|||+|||+.+++++|+++|++||.|.++.++.|+.+++++|||||+
T Consensus 31 ~~~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV~ 82 (144)
T PLN03134 31 RLMSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFVN 82 (144)
T ss_pred cCCCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEEE
Confidence 3446899999999999999999999999999999999999999999999995
No 2
>KOG0149|consensus
Probab=99.68 E-value=1.6e-16 Score=85.95 Aligned_cols=50 Identities=32% Similarity=0.571 Sum_probs=47.7
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.-.+||||+|+|.+..+.|++.|++||+|.+..++.|+.||+++|||||+
T Consensus 11 ~~TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~~t~rskGyGfVT 60 (247)
T KOG0149|consen 11 TFTKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDKNTGRSKGYGFVT 60 (247)
T ss_pred eEEEEEEcCcccccchHHHHHHHHHhCceEEEEEEeccCCccccceeeEE
Confidence 34789999999999999999999999999999999999999999999996
No 3
>KOG4207|consensus
Probab=99.66 E-value=3.6e-16 Score=83.68 Aligned_cols=61 Identities=84% Similarity=1.456 Sum_probs=59.3
Q ss_pred CCCCCCCCCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 1 MSYGRPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 1 m~~~~~~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
|+|+.+|++.+....|.|-||.+.++.++|+.+|++||.|-++.|+.|+.|++.+|||||.
T Consensus 1 MS~g~~PPdv~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr~Tr~sRgFaFVr 61 (256)
T KOG4207|consen 1 MSYGRPPPDVEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDRYTRQSRGFAFVR 61 (256)
T ss_pred CCCCCCCCCcccceeEEecceeccCCHHHHHHHHHHhCcccceecccccccccccceeEEE
Confidence 9999999999999999999999999999999999999999999999999999999999983
No 4
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.63 E-value=2.7e-15 Score=84.84 Aligned_cols=51 Identities=29% Similarity=0.386 Sum_probs=47.5
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+.+|||+|||+.+++++|+++|++||.|.+++++.|+.|+.++|||||+
T Consensus 267 ~~~~~lfV~NL~~~~~e~~L~~~F~~fG~v~~v~i~~d~~t~~skG~aFV~ 317 (352)
T TIGR01661 267 GAGYCIFVYNLSPDTDETVLWQLFGPFGAVQNVKIIRDLTTNQCKGYGFVS 317 (352)
T ss_pred CCCcEEEEeCCCCCCCHHHHHHHHHhCCCeEEEEEeEcCCCCCccceEEEE
Confidence 334579999999999999999999999999999999999999999999995
No 5
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=99.63 E-value=1.6e-15 Score=85.78 Aligned_cols=49 Identities=31% Similarity=0.536 Sum_probs=47.1
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+|||+|||..+++++|+++|+.||+|.++.+++|+.+|+++|||||+
T Consensus 3 ~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~~~g~s~g~afV~ 51 (352)
T TIGR01661 3 KTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDKVTGQSLGYGFVN 51 (352)
T ss_pred CcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcCCCCccceEEEEE
Confidence 4899999999999999999999999999999999999999999999995
No 6
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.63 E-value=2.7e-15 Score=85.58 Aligned_cols=53 Identities=30% Similarity=0.326 Sum_probs=49.3
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+....++|||+|||+++++++|+++|+.+|.|.+++++.|+.+++++|||||+
T Consensus 103 ~~~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~~tg~srGyaFVe 155 (346)
T TIGR01659 103 TNNSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDYKTGYSFGYAFVD 155 (346)
T ss_pred CCCCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCccCcEEEEE
Confidence 34456899999999999999999999999999999999999999999999995
No 7
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=99.56 E-value=2e-14 Score=86.13 Aligned_cols=58 Identities=31% Similarity=0.592 Sum_probs=50.8
Q ss_pred CCCCCCC-----CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 3 YGRPPPR-----IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 3 ~~~~~~~-----~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
|+.||+. ....++|||+|||+++++++|+++|+++|.|.++++++| .+++++|||||+
T Consensus 43 ~g~Pp~~~~~~~p~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D-~sG~sRGfaFV~ 105 (578)
T TIGR01648 43 YGGPPPGWSGVQPGRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMD-FSGQNRGYAFVT 105 (578)
T ss_pred CCCCCCcccCCCCCCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEEC-CCCCccceEEEE
Confidence 5555542 344689999999999999999999999999999999999 789999999995
No 8
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=99.56 E-value=1.6e-14 Score=82.49 Aligned_cols=52 Identities=37% Similarity=0.496 Sum_probs=48.2
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.....+|||+|||+.+++++|+++|++||.|..+.++.++.+++.+|||||+
T Consensus 190 ~~~~~~lfV~nLp~~vtee~L~~~F~~fG~V~~v~i~~d~~tg~~kG~aFV~ 241 (346)
T TIGR01659 190 SIKDTNLYVTNLPRTITDDQLDTIFGKYGQIVQKNILRDKLTGTPRGVAFVR 241 (346)
T ss_pred ccccceeEEeCCCCcccHHHHHHHHHhcCCEEEEEEeecCCCCccceEEEEE
Confidence 3445789999999999999999999999999999999999999999999995
No 9
>KOG0113|consensus
Probab=99.55 E-value=1.8e-14 Score=80.29 Aligned_cols=51 Identities=37% Similarity=0.660 Sum_probs=48.3
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.=+||||+-|+.++++..|+..|+.||.|+.++++.|+.||+++|||||+
T Consensus 99 DPy~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~vTgkskGYAFIe 149 (335)
T KOG0113|consen 99 DPYKTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDKVTGKSKGYAFIE 149 (335)
T ss_pred CccceeeeeeccccccHHHHHHHHHhcCcceeEEEeeecccCCccceEEEE
Confidence 334899999999999999999999999999999999999999999999995
No 10
>KOG0122|consensus
Probab=99.54 E-value=2.1e-14 Score=78.25 Aligned_cols=53 Identities=34% Similarity=0.597 Sum_probs=49.8
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.++.++|.|.||+.++++++|+++|.+||.+..+.+.+|+.||.++|||||+
T Consensus 185 ~R~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK~TG~~kGFAFVt 237 (270)
T KOG0122|consen 185 ERDDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDKETGLSKGFAFVT 237 (270)
T ss_pred cCCccceeEEecCccccChhHHHHHhhccCccceeEEEEccccCcccceEEEE
Confidence 45567899999999999999999999999999999999999999999999984
No 11
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.53 E-value=3.7e-14 Score=85.29 Aligned_cols=51 Identities=29% Similarity=0.540 Sum_probs=48.3
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
...++|||+||++.+++++|+++|.+||.|.+++++.|+.+++++|||||+
T Consensus 105 ~~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~~TgkskGfAFVe 155 (612)
T TIGR01645 105 AIMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVE 155 (612)
T ss_pred cCCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecCCCCCcCCeEEEE
Confidence 446899999999999999999999999999999999999999999999995
No 12
>KOG0126|consensus
Probab=99.52 E-value=3e-15 Score=78.86 Aligned_cols=52 Identities=35% Similarity=0.569 Sum_probs=48.8
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+-.+.+-|||||||+..++.+|.-.|++||.+..+.+++|+.||+++||||+
T Consensus 31 ~YkdsA~Iyiggl~~~LtEgDil~VFSqyGe~vdinLiRDk~TGKSKGFaFL 82 (219)
T KOG0126|consen 31 EYKDSAYIYIGGLPYELTEGDILCVFSQYGEIVDINLIRDKKTGKSKGFAFL 82 (219)
T ss_pred hcccceEEEECCCcccccCCcEEEEeeccCceEEEEEEecCCCCcccceEEE
Confidence 3456688999999999999999999999999999999999999999999996
No 13
>KOG0121|consensus
Probab=99.51 E-value=2.9e-14 Score=71.72 Aligned_cols=51 Identities=31% Similarity=0.675 Sum_probs=48.1
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+.++|||+||++.++++.|.++|+.+|.|+.+.+-.|+.+..+.|||||+
T Consensus 34 r~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpCGFCFVe 84 (153)
T KOG0121|consen 34 RKSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPCGFCFVE 84 (153)
T ss_pred hhcceEEEeeeeeeecHHHHHHHHHhccchheeEeccccCCcCccceEEEE
Confidence 457999999999999999999999999999999999999999999999995
No 14
>KOG0117|consensus
Probab=99.50 E-value=7.1e-14 Score=81.16 Aligned_cols=51 Identities=25% Similarity=0.470 Sum_probs=48.9
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..++.||||.||.++.+++|..+|++.|+|-+++|++|+.+|.+||||||+
T Consensus 81 ~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~~sG~nRGYAFVt 131 (506)
T KOG0117|consen 81 PRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDPFSGDNRGYAFVT 131 (506)
T ss_pred CCCceEEecCCCccccchhhHHHHHhccceeeEEEeecccCCCCcceEEEE
Confidence 457999999999999999999999999999999999999999999999995
No 15
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=99.50 E-value=1.2e-13 Score=62.36 Aligned_cols=45 Identities=36% Similarity=0.612 Sum_probs=42.0
Q ss_pred EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
|||+|||+.+++++|+++|++||.+..+.+..+ .++..+|+|||+
T Consensus 1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~-~~~~~~~~a~V~ 45 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRN-SSGKSKGYAFVE 45 (70)
T ss_dssp EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEE-TTSSEEEEEEEE
T ss_pred cEEcCCCCcCCHHHHHHHHHHhhhccccccccc-ccccccceEEEE
Confidence 799999999999999999999999999999887 678899999985
No 16
>KOG0144|consensus
Probab=99.47 E-value=1.1e-13 Score=80.28 Aligned_cols=51 Identities=33% Similarity=0.562 Sum_probs=47.9
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+.-++||+.+|..++|.+|+++|++||.|.+|.+++|+.|+..+|||||+
T Consensus 32 ~~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk~t~~s~gcCFv~ 82 (510)
T KOG0144|consen 32 GSAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDKSTGQSKGCCFVK 82 (510)
T ss_pred chhhhheeccCCccccHHHHHHHHHHhCceeEEEeecccccCcccceEEEE
Confidence 344689999999999999999999999999999999999999999999985
No 17
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.47 E-value=3.1e-13 Score=78.73 Aligned_cols=50 Identities=30% Similarity=0.471 Sum_probs=47.4
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..++|||+|||..+++++|+++|+.+|.|..+.++.++.+++++|||||+
T Consensus 88 ~~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~~~~~skg~afVe 137 (457)
T TIGR01622 88 DDRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDRNSRRSKGVAYVE 137 (457)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecCCCCCcceEEEEE
Confidence 35799999999999999999999999999999999999999999999995
No 18
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.46 E-value=4.3e-13 Score=78.93 Aligned_cols=50 Identities=22% Similarity=0.389 Sum_probs=47.2
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
...+|||+|||..+++++|+++|+.||.|..+.++.++.+|.++|||||+
T Consensus 294 ~~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~~~g~~~g~afv~ 343 (509)
T TIGR01642 294 SKDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDIATGLSKGYAFCE 343 (509)
T ss_pred CCCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecCCCCCcCeEEEEE
Confidence 45799999999999999999999999999999999999999999999985
No 19
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.45 E-value=2.8e-13 Score=81.61 Aligned_cols=50 Identities=22% Similarity=0.423 Sum_probs=47.1
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..++|||+||++.+++++|+++|+.||.|.++.++.++.+++++|||||+
T Consensus 203 ~~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~~tgksKGfGFVe 252 (612)
T TIGR01645 203 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIE 252 (612)
T ss_pred ccceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecCCCCCcCCeEEEE
Confidence 34799999999999999999999999999999999999999999999985
No 20
>KOG0125|consensus
Probab=99.44 E-value=2.3e-13 Score=76.77 Aligned_cols=51 Identities=35% Similarity=0.548 Sum_probs=45.0
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+..++|+|+|+|+...+.||+.+|.+||.|.++.|+.+. .| +||||||+
T Consensus 92 s~~~pkRLhVSNIPFrFRdpDL~aMF~kfG~VldVEIIfNE-RG-SKGFGFVT 142 (376)
T KOG0125|consen 92 SKDTPKRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEIIFNE-RG-SKGFGFVT 142 (376)
T ss_pred CCCCCceeEeecCCccccCccHHHHHHhhCceeeEEEEecc-CC-CCccceEE
Confidence 34567899999999999999999999999999999999873 34 79999996
No 21
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.44 E-value=4.6e-13 Score=79.80 Aligned_cols=48 Identities=33% Similarity=0.471 Sum_probs=45.8
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+|||+|||.++++++|+++|+++|.|.++++.+|..|++++|||||+
T Consensus 1 ~sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~~t~~s~G~afV~ 48 (562)
T TIGR01628 1 ASLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDSVTRRSLGYGYVN 48 (562)
T ss_pred CeEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCCcceEEEEE
Confidence 379999999999999999999999999999999999999999999985
No 22
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=99.43 E-value=8e-13 Score=60.09 Aligned_cols=45 Identities=40% Similarity=0.654 Sum_probs=40.4
Q ss_pred EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
|||+|||+.+++++|+++|+.+|.|..+.+..++. +..+|+|||+
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~~~~~-~~~~~~a~v~ 45 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLIKNKD-GQSRGFAFVE 45 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTSSBEEEEEEEESTT-SSEEEEEEEE
T ss_pred CEEeCCCCCCCHHHHHHHHHhcCCcceEEEEeeec-cccCCEEEEE
Confidence 79999999999999999999999999999999876 8899999985
No 23
>PLN03213 repressor of silencing 3; Provisional
Probab=99.43 E-value=5.2e-13 Score=78.97 Aligned_cols=46 Identities=33% Similarity=0.497 Sum_probs=42.0
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
...+||||||++.+++++|+.+|..||.|..+.++ +.+| ||||||+
T Consensus 9 ~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIp--RETG--RGFAFVE 54 (759)
T PLN03213 9 GGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFV--RTKG--RSFAYID 54 (759)
T ss_pred cceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEe--cccC--CceEEEE
Confidence 34789999999999999999999999999999998 4567 9999995
No 24
>KOG0131|consensus
Probab=99.42 E-value=2.8e-13 Score=71.34 Aligned_cols=54 Identities=28% Similarity=0.473 Sum_probs=51.1
Q ss_pred CCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 8 PRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 8 ~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+.+++.+|||+||+..+++..|.++|.+.|+|..+++++|+.+..++||||++
T Consensus 4 ~~rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~E 57 (203)
T KOG0131|consen 4 IERNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFAE 57 (203)
T ss_pred cccCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhhhcccccceeEEE
Confidence 467788999999999999999999999999999999999999999999999986
No 25
>PLN03120 nucleic acid binding protein; Provisional
Probab=99.42 E-value=7.7e-13 Score=72.97 Aligned_cols=46 Identities=20% Similarity=0.418 Sum_probs=42.0
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.++|||+|||+.+++++|+++|+.||.|.++.++.++. .+|||||+
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~~---~~GfAFVt 49 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSENE---RSQIAYVT 49 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecCC---CCCEEEEE
Confidence 47999999999999999999999999999999998853 46899985
No 26
>PLN03121 nucleic acid binding protein; Provisional
Probab=99.39 E-value=2e-12 Score=70.68 Aligned_cols=47 Identities=28% Similarity=0.413 Sum_probs=42.2
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+.+|||+||++.+++++|+++|+.||.|.++.++++.. .+|+|||+
T Consensus 4 ~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D~e---t~gfAfVt 50 (243)
T PLN03121 4 GGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRSGE---YACTAYVT 50 (243)
T ss_pred CceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecCCC---cceEEEEE
Confidence 458999999999999999999999999999999998843 45799985
No 27
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.38 E-value=2.1e-12 Score=75.33 Aligned_cols=49 Identities=37% Similarity=0.718 Sum_probs=46.7
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+|||+|||..+++++|+++|+.||.|..+.++.++.+|+++|||||+
T Consensus 186 ~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~~~g~~~g~afV~ 234 (457)
T TIGR01622 186 FLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDPETGRSKGFGFIQ 234 (457)
T ss_pred CCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcCCCCccceEEEEE
Confidence 5899999999999999999999999999999999999999999999985
No 28
>KOG0127|consensus
Probab=99.38 E-value=1.8e-12 Score=77.06 Aligned_cols=51 Identities=35% Similarity=0.501 Sum_probs=48.1
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+.+|||+|||++++++.|.++|++||.|....++.++.|+.++|.|||.
T Consensus 290 ~~~~tVFvRNL~fD~tEEel~~~fskFG~v~ya~iV~~k~T~~skGtAFv~ 340 (678)
T KOG0127|consen 290 TEGKTVFVRNLPFDTTEEELKEHFSKFGEVKYAIIVKDKDTGHSKGTAFVK 340 (678)
T ss_pred cccceEEEecCCccccHHHHHHHHHhhccceeEEEEeccCCCCcccceEEE
Confidence 345899999999999999999999999999999999999999999999984
No 29
>KOG0145|consensus
Probab=99.37 E-value=2.5e-13 Score=75.12 Aligned_cols=60 Identities=32% Similarity=0.438 Sum_probs=56.6
Q ss_pred CCCCCCCCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 2 SYGRPPPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 2 ~~~~~~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
||.+|+.+.-++.+|||++||..++..+|+.+|++||.|...+++.|..||.++|.+||.
T Consensus 116 SyARPSs~~Ik~aNLYvSGlPktMtqkelE~iFs~fGrIItSRiL~dqvtg~srGVgFiR 175 (360)
T KOG0145|consen 116 SYARPSSDSIKDANLYVSGLPKTMTQKELEQIFSPFGRIITSRILVDQVTGLSRGVGFIR 175 (360)
T ss_pred EeccCChhhhcccceEEecCCccchHHHHHHHHHHhhhhhhhhhhhhcccceecceeEEE
Confidence 788999888899999999999999999999999999999999999999999999999984
No 30
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=99.35 E-value=3.8e-12 Score=68.48 Aligned_cols=49 Identities=47% Similarity=0.708 Sum_probs=46.8
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+|||+|||..+++++|+++|..+|.+..+.+..++.++..+|+|||+
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~ 163 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVE 163 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeeccccCccCceEEEE
Confidence 4899999999999999999999999999999999998899999999985
No 31
>KOG0130|consensus
Probab=99.35 E-value=2.5e-12 Score=65.38 Aligned_cols=55 Identities=27% Similarity=0.379 Sum_probs=50.9
Q ss_pred CCCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 7 PPRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 7 ~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
|+..-++..|||.+++..++++++.+.|..||+|+.+++-.|+.||..+|||+|+
T Consensus 66 PqrSVEGwIi~VtgvHeEatEedi~d~F~dyGeiKNihLNLDRRtGy~KGYaLvE 120 (170)
T KOG0130|consen 66 PQRSVEGWIIFVTGVHEEATEEDIHDKFADYGEIKNIHLNLDRRTGYVKGYALVE 120 (170)
T ss_pred CccceeeEEEEEeccCcchhHHHHHHHHhhcccccceeeccccccccccceeeee
Confidence 4455667999999999999999999999999999999999999999999999985
No 32
>KOG0107|consensus
Probab=99.32 E-value=5.5e-12 Score=66.22 Aligned_cols=46 Identities=35% Similarity=0.673 Sum_probs=41.4
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+..++|||+||+..+++.+|+..|..||.+.++++..+ +.||||||
T Consensus 8 ~~~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvArn-----PPGfAFVE 53 (195)
T KOG0107|consen 8 NGNTKVYVGNLGSRATKRELERAFSKYGPLRSVWVARN-----PPGFAFVE 53 (195)
T ss_pred CCCceEEeccCCCCcchHHHHHHHHhcCcceeEEEeec-----CCCceEEe
Confidence 34699999999999999999999999999999988765 57899996
No 33
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=99.32 E-value=8.2e-12 Score=74.54 Aligned_cols=50 Identities=38% Similarity=0.540 Sum_probs=46.2
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
....+|||+||+..+++++|+++|+.||.|.++.++.+ .+|.++|||||+
T Consensus 283 ~~~~~l~V~nl~~~~~~~~L~~~F~~~G~i~~~~i~~d-~~g~~~g~gfV~ 332 (562)
T TIGR01628 283 AQGVNLYVKNLDDTVTDEKLRELFSECGEITSAKVMLD-EKGVSRGFGFVC 332 (562)
T ss_pred cCCCEEEEeCCCCccCHHHHHHHHHhcCCeEEEEEEEC-CCCCcCCeEEEE
Confidence 34578999999999999999999999999999999998 689999999985
No 34
>KOG0148|consensus
Probab=99.29 E-value=6.6e-12 Score=69.62 Aligned_cols=51 Identities=31% Similarity=0.465 Sum_probs=47.6
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
.....-+||+-|...++.++|++.|.+||+|.++++++|..|++++||+||
T Consensus 59 ~~~hfhvfvgdls~eI~~e~lr~aF~pFGevS~akvirD~~T~KsKGYgFV 109 (321)
T KOG0148|consen 59 SNQHFHVFVGDLSPEIDNEKLREAFAPFGEVSDAKVIRDMNTGKSKGYGFV 109 (321)
T ss_pred cccceeEEehhcchhcchHHHHHHhccccccccceEeecccCCcccceeEE
Confidence 344567999999999999999999999999999999999999999999998
No 35
>KOG4205|consensus
Probab=99.28 E-value=8.6e-12 Score=70.49 Aligned_cols=50 Identities=40% Similarity=0.581 Sum_probs=48.2
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.+++||++|+|.++++.|++.|.++|.+.++.+++|+.++..+||+||+
T Consensus 5 ~~~KlfiGgisw~ttee~Lr~yf~~~Gev~d~~vm~d~~t~rsrgFgfv~ 54 (311)
T KOG4205|consen 5 ESGKLFIGGLSWETTEESLREYFSQFGEVTDCVVMRDPSTGRSRGFGFVT 54 (311)
T ss_pred CCcceeecCcCccccHHHHHHHhcccCceeeEEEeccCCCCCccccccee
Confidence 67999999999999999999999999999999999999999999999985
No 36
>KOG0111|consensus
Probab=99.27 E-value=2.2e-12 Score=70.06 Aligned_cols=52 Identities=33% Similarity=0.461 Sum_probs=48.9
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+.++||||+|...+++.-|...|.+||.|..+.++.|-+++++||||||+
T Consensus 7 a~~KrtlYVGGladeVtekvLhaAFIPFGDI~dIqiPlDyesqkHRgFgFVe 58 (298)
T KOG0111|consen 7 ANQKRTLYVGGLADEVTEKVLHAAFIPFGDIKDIQIPLDYESQKHRGFGFVE 58 (298)
T ss_pred cccceeEEeccchHHHHHHHHHhccccccchhhcccccchhcccccceeEEE
Confidence 3456899999999999999999999999999999999999999999999996
No 37
>KOG0124|consensus
Probab=99.25 E-value=3.9e-12 Score=73.18 Aligned_cols=49 Identities=31% Similarity=0.592 Sum_probs=47.2
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
-|++|||.+.+...++.|+..|.+||+|+++.+..|+.|++++|||||+
T Consensus 113 McRvYVGSIsfEl~EDtiR~AF~PFGPIKSInMSWDp~T~kHKgFAFVE 161 (544)
T KOG0124|consen 113 MCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGKHKGFAFVE 161 (544)
T ss_pred hHheeeeeeEEEechHHHHhhccCCCCcceeecccccccccccceEEEE
Confidence 3789999999999999999999999999999999999999999999996
No 38
>KOG0108|consensus
Probab=99.24 E-value=1.8e-11 Score=71.67 Aligned_cols=48 Identities=29% Similarity=0.482 Sum_probs=46.8
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.+||||+|++++++.|..+|+..|.|.+++++.|++||+.+||||++
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~ 66 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCE 66 (435)
T ss_pred cceEecCCCCcccHHHHHHHHhccCccceeeecccccCCCcCceeeEe
Confidence 899999999999999999999999999999999999999999999985
No 39
>smart00362 RRM_2 RNA recognition motif.
Probab=99.23 E-value=5.9e-11 Score=53.00 Aligned_cols=45 Identities=40% Similarity=0.727 Sum_probs=40.6
Q ss_pred EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+|+|+|||..+++++|+++|.++|.+..+.+..++ +.++|+|||+
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~~--~~~~~~~~v~ 45 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKDT--GKSKGFAFVE 45 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecCC--CCCCceEEEE
Confidence 58999999999999999999999999999888775 6788999984
No 40
>KOG0144|consensus
Probab=99.21 E-value=5.1e-12 Score=73.49 Aligned_cols=48 Identities=40% Similarity=0.596 Sum_probs=45.1
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+||||.|+..++|.+++++|++||.|++|.+++|. -+.+||||||+
T Consensus 124 e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd~-~~~sRGcaFV~ 171 (510)
T KOG0144|consen 124 ERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRDP-DGLSRGCAFVK 171 (510)
T ss_pred chhhhhhhccccccHHHHHHHHHhhCccchhhheecc-cccccceeEEE
Confidence 5899999999999999999999999999999999995 48899999985
No 41
>KOG0145|consensus
Probab=99.20 E-value=6.9e-11 Score=65.64 Aligned_cols=53 Identities=28% Similarity=0.359 Sum_probs=49.0
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
....+.-|||-||.+++++.-|.++|.+||.|..+.+++|..|.+.+||+||+
T Consensus 274 ~~~~g~ciFvYNLspd~de~~LWQlFgpFGAv~nVKvirD~ttnkCKGfgFVt 326 (360)
T KOG0145|consen 274 GPGGGWCIFVYNLSPDADESILWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVT 326 (360)
T ss_pred CCCCeeEEEEEecCCCchHhHHHHHhCcccceeeEEEEecCCcccccceeEEE
Confidence 34456899999999999999999999999999999999999999999999985
No 42
>smart00360 RRM RNA recognition motif.
Probab=99.18 E-value=9.6e-11 Score=52.09 Aligned_cols=44 Identities=39% Similarity=0.684 Sum_probs=40.4
Q ss_pred EecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 18 VDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 18 v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
|+|||..+++++|+++|.++|.+..+.+..++.++.++|+|||+
T Consensus 1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~~~~~~~~~a~v~ 44 (71)
T smart00360 1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDKDTGKSKGFAFVE 44 (71)
T ss_pred CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCCCCCCCCceEEEE
Confidence 57899999999999999999999999998887788999999985
No 43
>KOG0114|consensus
Probab=99.17 E-value=2.5e-10 Score=55.87 Aligned_cols=48 Identities=31% Similarity=0.520 Sum_probs=41.7
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
.+..+.|||+|||+.++.+++.++|.+||.|.++++-..+ ..+|-|||
T Consensus 15 pevnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k---~TrGTAFV 62 (124)
T KOG0114|consen 15 PEVNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTK---ETRGTAFV 62 (124)
T ss_pred hhhheeEEEecCCccccHHHHHHHhhcccceEEEEecCcc---CcCceEEE
Confidence 3456789999999999999999999999999999987654 35788987
No 44
>KOG0148|consensus
Probab=99.14 E-value=1.6e-10 Score=64.30 Aligned_cols=46 Identities=30% Similarity=0.514 Sum_probs=41.1
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+.++|||||++..++++.|++.|++||.|.++++..+ +||+||+
T Consensus 161 sp~NtsVY~G~I~~~lte~~mr~~Fs~fG~I~EVRvFk~------qGYaFVr 206 (321)
T KOG0148|consen 161 SPDNTSVYVGNIASGLTEDLMRQTFSPFGPIQEVRVFKD------QGYAFVR 206 (321)
T ss_pred CCCCceEEeCCcCccccHHHHHHhcccCCcceEEEEecc------cceEEEE
Confidence 345699999999999999999999999999999998766 6799984
No 45
>KOG0147|consensus
Probab=99.13 E-value=2.6e-11 Score=71.84 Aligned_cols=46 Identities=37% Similarity=0.665 Sum_probs=44.7
Q ss_pred EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+|||||++++++++|+.+|++||.|..+++.+|.+||.++||+|++
T Consensus 281 l~vgnLHfNite~~lr~ifepfg~Ie~v~l~~d~~tG~skgfGfi~ 326 (549)
T KOG0147|consen 281 LYVGNLHFNITEDMLRGIFEPFGKIENVQLTKDSETGRSKGFGFIT 326 (549)
T ss_pred hhhcccccCchHHHHhhhccCcccceeeeeccccccccccCcceEE
Confidence 8999999999999999999999999999999998899999999985
No 46
>KOG0127|consensus
Probab=99.10 E-value=1.8e-10 Score=68.82 Aligned_cols=49 Identities=27% Similarity=0.475 Sum_probs=46.8
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.||||++||+.++.++|.++|+..|+|..+.++.++.++.+|||+||+
T Consensus 5 g~TlfV~~lp~~~~~~qL~e~FS~vGPik~~~vVt~~gs~~~RGfgfVt 53 (678)
T KOG0127|consen 5 GATLFVSRLPFSSTGEQLEEFFSYVGPIKHAVVVTNKGSSEKRGFGFVT 53 (678)
T ss_pred CceEEEecCCCccchhHHHHhhhcccCcceeEEecCCCcccccCcccee
Confidence 4899999999999999999999999999999999999999999999984
No 47
>KOG0124|consensus
Probab=99.10 E-value=2.9e-10 Score=65.67 Aligned_cols=50 Identities=22% Similarity=0.423 Sum_probs=46.9
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.-.+|||..++++.+++||+..|+-||+|.+|.+.+++.++.++||+|++
T Consensus 209 ~fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~pt~~~HkGyGfiE 258 (544)
T KOG0124|consen 209 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIE 258 (544)
T ss_pred hhheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccCCCCCccceeeEE
Confidence 34689999999999999999999999999999999999888999999986
No 48
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.09 E-value=3.4e-10 Score=66.97 Aligned_cols=43 Identities=21% Similarity=0.218 Sum_probs=38.7
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.++|||+|||+.+++++|+++|++||.|.++.++.+ +|+|||+
T Consensus 2 s~vv~V~nLp~~~te~~L~~~f~~fG~V~~v~i~~~------k~~afVe 44 (481)
T TIGR01649 2 SPVVHVRNLPQDVVEADLVEALIPFGPVSYVMMLPG------KRQALVE 44 (481)
T ss_pred ccEEEEcCCCCCCCHHHHHHHHHhcCCeeEEEEECC------CCEEEEE
Confidence 378999999999999999999999999999988753 5799985
No 49
>KOG0105|consensus
Probab=99.07 E-value=4.2e-10 Score=59.95 Aligned_cols=48 Identities=21% Similarity=0.347 Sum_probs=40.6
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
...++|||+|||.++.+.+|+++|.+||.|..+.|...+ ...+||||+
T Consensus 4 r~~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r~---g~ppfafVe 51 (241)
T KOG0105|consen 4 RNSRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNRP---GPPPFAFVE 51 (241)
T ss_pred cccceEEecCCCcchhhccHHHHHhhhcceEEEEeccCC---CCCCeeEEE
Confidence 345899999999999999999999999999999875432 246799985
No 50
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=99.07 E-value=1.1e-09 Score=48.96 Aligned_cols=46 Identities=46% Similarity=0.703 Sum_probs=40.6
Q ss_pred EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+|+|+|||..+++++++++|+.+|.+..+.+..++.+ ..+|+|||+
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~~~~~-~~~~~~~v~ 46 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVRDKDT-KSKGFAFVE 46 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEeeCCCC-CcceEEEEE
Confidence 4899999999999999999999999999998887654 678899874
No 51
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=99.05 E-value=1.1e-09 Score=64.85 Aligned_cols=45 Identities=20% Similarity=0.244 Sum_probs=40.2
Q ss_pred CCcEEEEecCCC-CCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTY-RTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~-~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+++|||+||++ .+++++|+++|+.||.|.++.++++ .+|+|||+
T Consensus 274 ~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~~-----~~g~afV~ 319 (481)
T TIGR01649 274 PGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMKN-----KKETALIE 319 (481)
T ss_pred CCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEeC-----CCCEEEEE
Confidence 457999999998 6999999999999999999999876 26899985
No 52
>KOG0109|consensus
Probab=98.97 E-value=7e-10 Score=62.25 Aligned_cols=35 Identities=34% Similarity=0.522 Sum_probs=33.1
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
.+|||||||..+++.+|+.+|++||+|.+|+++++
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlECDIvKN 37 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFEQYGKVLECDIVKN 37 (346)
T ss_pred cchhccCCCcccchHHHHHHHHhhCceEeeeeecc
Confidence 57999999999999999999999999999999876
No 53
>KOG0146|consensus
Probab=98.96 E-value=1.7e-09 Score=60.42 Aligned_cols=53 Identities=23% Similarity=0.231 Sum_probs=49.6
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+..++|+|||-.||.+..+.+|..+|.+||.|.+.++..|+.|..+++|+||.
T Consensus 281 eGPeGCNlFIYHLPQEFgDaEliQmF~PFGhivSaKVFvDRATNQSKCFGFVS 333 (371)
T KOG0146|consen 281 EGPEGCNLFIYHLPQEFGDAELIQMFLPFGHIVSAKVFVDRATNQSKCFGFVS 333 (371)
T ss_pred cCCCcceEEEEeCchhhccHHHHHHhccccceeeeeeeehhccccccceeeEe
Confidence 45678999999999999999999999999999999999999999999999983
No 54
>KOG0415|consensus
Probab=98.88 E-value=2.7e-09 Score=61.45 Aligned_cols=48 Identities=35% Similarity=0.436 Sum_probs=46.6
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..|||..|++.++.++|.-+|+.||.|.+|.+++|..||.+-.||||+
T Consensus 240 NVLFVCKLNPVTtDeDLeiIFSrFG~i~sceVIRD~ktgdsLqyaFiE 287 (479)
T KOG0415|consen 240 NVLFVCKLNPVTTDEDLEIIFSRFGKIVSCEVIRDRKTGDSLQYAFIE 287 (479)
T ss_pred ceEEEEecCCcccccchhhHHhhcccceeeeEEecccccchhheeeee
Confidence 789999999999999999999999999999999999999999999986
No 55
>KOG4205|consensus
Probab=98.88 E-value=4.2e-09 Score=59.77 Aligned_cols=49 Identities=37% Similarity=0.571 Sum_probs=46.5
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..++||++||..+++.++++.|.++|.|..+.++.|..+.+++||+||+
T Consensus 97 tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~~~~~~d~~~~~~rgFgfv~ 145 (311)
T KOG4205|consen 97 TKKIFVGGLPPDTTEEDFKDYFEQFGKVADVVIMYDKTTSRPRGFGFVT 145 (311)
T ss_pred eeEEEecCcCCCCchHHHhhhhhccceeEeeEEeecccccccccceeeE
Confidence 5699999999999999999999999999999999999999999999984
No 56
>KOG0146|consensus
Probab=98.85 E-value=3.1e-09 Score=59.39 Aligned_cols=49 Identities=31% Similarity=0.459 Sum_probs=45.0
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
++++||||-|...-.|++++.+|..||.+.+|.+.+.+. |.++|+|||+
T Consensus 18 ~drklfvgml~kqq~e~dvrrlf~pfG~~~e~tvlrg~d-g~sKGCAFVK 66 (371)
T KOG0146|consen 18 DDRKLFVGMLNKQQSEDDVRRLFQPFGNIEECTVLRGPD-GNSKGCAFVK 66 (371)
T ss_pred cchhhhhhhhcccccHHHHHHHhcccCCcceeEEecCCC-CCCCCceEEE
Confidence 568999999999999999999999999999999988864 8899999985
No 57
>KOG0131|consensus
Probab=98.82 E-value=4.7e-09 Score=55.65 Aligned_cols=51 Identities=27% Similarity=0.301 Sum_probs=45.3
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEE-EEccCCCCCCceeEEEE
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI-YIPRDRFTRESRGFAFV 60 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~-~~~~~~~t~~~~g~~fv 60 (61)
.+.+.++||+||.+.+++..|.+.|+.||.+.+. .+++++.||.++||||+
T Consensus 93 l~vganlfvgNLd~~vDe~~L~dtFsafG~l~~~P~i~rd~~tg~~~~~g~i 144 (203)
T KOG0131|consen 93 LDVGANLFVGNLDPEVDEKLLYDTFSAFGVLISPPKIMRDPDTGNPKGFGFI 144 (203)
T ss_pred ccccccccccccCcchhHHHHHHHHHhccccccCCcccccccCCCCCCCeEE
Confidence 4456899999999999999999999999987664 68889999999999987
No 58
>KOG4208|consensus
Probab=98.79 E-value=2.5e-08 Score=53.66 Aligned_cols=53 Identities=25% Similarity=0.348 Sum_probs=46.3
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhC-CCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERC-GEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~-G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.....-+|+..+|....+..+...|.++ |.+...++.+++.||.++|||||+
T Consensus 45 ~~~~~g~~~~~~~p~g~~e~~~~~~~~q~~g~v~r~rlsRnkrTGNSKgYAFVE 98 (214)
T KOG4208|consen 45 EQEIEGVVYVDHIPHGFFETEILNYFRQFGGTVTRFRLSRNKRTGNSKGYAFVE 98 (214)
T ss_pred ccCCccceeecccccchhHHHHhhhhhhcCCeeEEEEeecccccCCcCceEEEE
Confidence 44445678999999999999999999887 788888888999999999999996
No 59
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=98.78 E-value=3.6e-08 Score=58.40 Aligned_cols=46 Identities=17% Similarity=0.291 Sum_probs=34.7
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhC------------CCeEEEEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERC------------GEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~------------G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
....++|||+|||+.+++++|.++|..+ +.|..+.+ .+.+|||||+
T Consensus 172 ~~~~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~------~~~kg~afVe 229 (509)
T TIGR01642 172 TRQARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNI------NKEKNFAFLE 229 (509)
T ss_pred CccccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEE------CCCCCEEEEE
Confidence 3446899999999999999999999874 23333333 3457899995
No 60
>KOG0110|consensus
Probab=98.78 E-value=6.2e-09 Score=63.55 Aligned_cols=49 Identities=24% Similarity=0.535 Sum_probs=45.0
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+|.|+|+|+.++-.+++.+|..||.+.+++++.-...+.++|||||+
T Consensus 613 ~tKIlVRNipFeAt~rEVr~LF~aFGqlksvRlPKK~~k~a~rGF~Fv~ 661 (725)
T KOG0110|consen 613 GTKILVRNIPFEATKREVRKLFTAFGQLKSVRLPKKIGKGAHRGFGFVD 661 (725)
T ss_pred cceeeeeccchHHHHHHHHHHHhcccceeeeccchhhcchhhccceeee
Confidence 5789999999999999999999999999999999876667889999985
No 61
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=98.75 E-value=4.1e-08 Score=59.55 Aligned_cols=42 Identities=31% Similarity=0.413 Sum_probs=36.6
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhC--CCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERC--GEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~--G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..++|||+||++.+++++|+++|+.| |.|.++.++ ++||||+
T Consensus 232 ~~k~LfVgNL~~~~tee~L~~~F~~f~~G~I~rV~~~--------rgfAFVe 275 (578)
T TIGR01648 232 KVKILYVRNLMTTTTEEIIEKSFSEFKPGKVERVKKI--------RDYAFVH 275 (578)
T ss_pred cccEEEEeCCCCCCCHHHHHHHHHhcCCCceEEEEee--------cCeEEEE
Confidence 34789999999999999999999999 999998654 4589985
No 62
>KOG0153|consensus
Probab=98.72 E-value=3.1e-08 Score=56.69 Aligned_cols=48 Identities=29% Similarity=0.454 Sum_probs=40.0
Q ss_pred CCCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 8 PRIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 8 ~~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
++...-.+|||++|...+++.+|+++|.+||+|+++.+.-. +++|||+
T Consensus 223 PeD~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~~~------~~CAFv~ 270 (377)
T KOG0153|consen 223 PEDTSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRILPR------KGCAFVT 270 (377)
T ss_pred CcccceeEEEecccccchhHHHHHHHHhhcCCeeeEEeecc------cccceee
Confidence 34555689999999999999999999999999999987643 4478874
No 63
>KOG0116|consensus
Probab=98.67 E-value=4.5e-08 Score=57.48 Aligned_cols=49 Identities=27% Similarity=0.355 Sum_probs=40.0
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+|||+|||++++.+.|++.|..||.|+...+..-...++..+||||+
T Consensus 288 ~~~i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~~~~~~~~fgFV~ 336 (419)
T KOG0116|consen 288 GLGIFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRSPGGKNPCFGFVE 336 (419)
T ss_pred ccceEeecCCCCCCHHHHHHHHhhcccccccceEEeccCCCcCceEEEE
Confidence 4569999999999999999999999999998775543334445899984
No 64
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=98.67 E-value=2.1e-07 Score=45.14 Aligned_cols=47 Identities=23% Similarity=0.441 Sum_probs=41.7
Q ss_pred cEEEEecCCCCCCHHHHHHHhhh--CCCeEEEEEccCCCCCCceeEEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFER--CGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~--~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+||.++|+|...+.++|.+++.. .|...-+.++.|..++.+.|||||
T Consensus 2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~~N~GYAFV 50 (97)
T PF04059_consen 2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNKCNLGYAFV 50 (97)
T ss_pred eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCCCceEEEEE
Confidence 68999999999999999888865 367777889999999999999998
No 65
>KOG4212|consensus
Probab=98.61 E-value=1.3e-07 Score=55.95 Aligned_cols=48 Identities=27% Similarity=0.340 Sum_probs=43.5
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+.+||.|+|+++.|.+|++++.. -|+|..+.+..| .+|+.+|+|.||
T Consensus 44 ~R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D-~~GK~rGcavVE 92 (608)
T KOG4212|consen 44 DRSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFD-ESGKARGCAVVE 92 (608)
T ss_pred cceEEEecCcchhhhHhHHHHHHHhcCceEeeeeecc-cCCCcCCceEEE
Confidence 356999999999999999999975 799999999999 579999999986
No 66
>KOG0117|consensus
Probab=98.57 E-value=1e-07 Score=56.20 Aligned_cols=40 Identities=40% Similarity=0.650 Sum_probs=36.1
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
-+.|||+||+.++|++.|+++|..||.|..+.-++| ||||
T Consensus 259 VKvLYVRNL~~~tTeE~lk~~F~~~G~veRVkk~rD--------YaFV 298 (506)
T KOG0117|consen 259 VKVLYVRNLMESTTEETLKKLFNEFGKVERVKKPRD--------YAFV 298 (506)
T ss_pred eeeeeeeccchhhhHHHHHHHHHhccceEEeecccc--------eeEE
Confidence 467999999999999999999999999999877655 8887
No 67
>KOG0123|consensus
Probab=98.56 E-value=1.6e-07 Score=54.52 Aligned_cols=45 Identities=22% Similarity=0.359 Sum_probs=40.8
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
..+||.||++.++.++|.++|+.||.|.+|.+..+.. | .+|| ||+
T Consensus 77 ~~~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~~~-g-~kg~-FV~ 121 (369)
T KOG0123|consen 77 SLVFIKNLDESIDNKSLYDTFSEFGNILSCKVATDEN-G-SKGY-FVQ 121 (369)
T ss_pred ceeeecCCCcccCcHHHHHHHHhhcCeeEEEEEEcCC-C-ceee-EEE
Confidence 4499999999999999999999999999999999854 5 8999 984
No 68
>KOG4661|consensus
Probab=98.54 E-value=1.8e-07 Score=56.93 Aligned_cols=49 Identities=24% Similarity=0.363 Sum_probs=44.8
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+++|+|.+|+..+...+|+.+|++||.|.-..++.+..+.-.++|+||+
T Consensus 405 gRNlWVSGLSstTRAtDLKnlFSKyGKVvGAKVVTNaRsPGaRCYGfVT 453 (940)
T KOG4661|consen 405 GRNLWVSGLSSTTRATDLKNLFSKYGKVVGAKVVTNARSPGARCYGFVT 453 (940)
T ss_pred ccceeeeccccchhhhHHHHHHHHhcceeceeeeecCCCCCcceeEEEE
Confidence 5789999999999999999999999999999999887777789999996
No 69
>KOG4206|consensus
Probab=98.53 E-value=5.2e-07 Score=49.12 Aligned_cols=47 Identities=34% Similarity=0.473 Sum_probs=39.2
Q ss_pred CCCcEEEEecCCCCCCHHHHHH----HhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRR----VFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~----~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+...||||.||+..+..++|+. +|++||.|.++... .+.+.+|-|||
T Consensus 7 ~pn~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~---kt~KmRGQA~V 57 (221)
T KOG4206|consen 7 NPNGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAF---KTPKMRGQAFV 57 (221)
T ss_pred CCCceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEec---CCCCccCceEE
Confidence 3335999999999999888776 99999999988765 35778999987
No 70
>KOG4209|consensus
Probab=98.52 E-value=1.2e-07 Score=52.04 Aligned_cols=51 Identities=29% Similarity=0.621 Sum_probs=46.8
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
-+...+|++|+.+.++...+..+|+.||.+..+.++.|+.+++++||+||+
T Consensus 99 ~d~~sv~v~nvd~~~t~~~~e~hf~~Cg~i~~~ti~~d~~~~~~k~~~yve 149 (231)
T KOG4209|consen 99 VDAPSVWVGNVDFLVTLTKIELHFESCGGINRVTVPKDKFRGHPKGFAYVE 149 (231)
T ss_pred cCCceEEEeccccccccchhhheeeccCCccceeeeccccCCCcceeEEEe
Confidence 345789999999999998899999999999999999999999999999985
No 71
>smart00361 RRM_1 RNA recognition motif.
Probab=98.50 E-value=3.2e-07 Score=41.80 Aligned_cols=35 Identities=34% Similarity=0.566 Sum_probs=28.7
Q ss_pred HHHHHHHhh----hCCCeEEEE-EccCCCC--CCceeEEEEC
Q psy7050 27 PDDLRRVFE----RCGEVGDIY-IPRDRFT--RESRGFAFVR 61 (61)
Q Consensus 27 ~~~l~~~f~----~~G~i~~~~-~~~~~~t--~~~~g~~fv~ 61 (61)
+++|++.|+ +||.+.++. +..++.+ +.++|||||+
T Consensus 2 ~~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~ 43 (70)
T smart00361 2 DEDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYIT 43 (70)
T ss_pred chhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEE
Confidence 467788887 999999985 7777666 8899999985
No 72
>KOG0109|consensus
Probab=98.47 E-value=1.9e-07 Score=52.70 Aligned_cols=38 Identities=32% Similarity=0.445 Sum_probs=34.8
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
....+++|+|+.+.++.+++++.|++||.+.++++++|
T Consensus 76 k~stkl~vgNis~tctn~ElRa~fe~ygpviecdivkd 113 (346)
T KOG0109|consen 76 KASTKLHVGNISPTCTNQELRAKFEKYGPVIECDIVKD 113 (346)
T ss_pred CCccccccCCCCccccCHHHhhhhcccCCceeeeeecc
Confidence 34579999999999999999999999999999998866
No 73
>KOG0132|consensus
Probab=98.47 E-value=2.7e-07 Score=57.33 Aligned_cols=43 Identities=33% Similarity=0.531 Sum_probs=38.0
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+|||||+|+..+++.+|..+|+.||.|.++.+.- ++|+|||+
T Consensus 421 SrTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~------~R~cAfI~ 463 (894)
T KOG0132|consen 421 SRTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIP------PRGCAFIK 463 (894)
T ss_pred eeeeeeccccchhhHHHHHHHHHhcccceeEeecc------CCceeEEE
Confidence 47999999999999999999999999999987653 57789974
No 74
>KOG0123|consensus
Probab=98.32 E-value=1.7e-06 Score=50.40 Aligned_cols=51 Identities=29% Similarity=0.431 Sum_probs=45.5
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.-...+|||.|++..++.+.|+++|+.+|.|.++.++.+ ..+.++||+||+
T Consensus 267 ~~~~~nl~vknld~~~~~e~L~~~f~~~GeI~s~kv~~~-~~g~skG~gfV~ 317 (369)
T KOG0123|consen 267 SLQGANLYVKNLDETLSDEKLRKIFSSFGEITSAKVMVD-ENGKSKGFGFVE 317 (369)
T ss_pred cccccccccccCccccchhHHHHHHhcccceeeEEEEec-cCCCccceEEEE
Confidence 344578999999999999999999999999999999888 568999999984
No 75
>KOG4454|consensus
Probab=98.23 E-value=7.8e-07 Score=48.71 Aligned_cols=50 Identities=28% Similarity=0.349 Sum_probs=42.9
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.+..++|||+|+...++++-|.++|-+.|+|.++.|+.++. ++.+ ||||.
T Consensus 6 ae~drtl~v~n~~~~v~eelL~ElfiqaGPV~kv~ip~~~d-~~~k-Fa~v~ 55 (267)
T KOG4454|consen 6 AEMDRTLLVQNMYSGVSEELLSELFIQAGPVYKVGIPSGQD-QEQK-FAYVF 55 (267)
T ss_pred cchhhHHHHHhhhhhhhHHHHHHHhhccCceEEEeCCCCcc-CCCc-eeeee
Confidence 45568999999999999999999999999999999988865 4445 88874
No 76
>KOG4849|consensus
Probab=98.22 E-value=4.8e-07 Score=52.40 Aligned_cols=47 Identities=28% Similarity=0.457 Sum_probs=40.2
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCC--CeEEEEEccCCCCCCceeEEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCG--EVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G--~i~~~~~~~~~~t~~~~g~~fv 60 (61)
..+|||||-|++++++|.+.+...| .+.++.+..++.+|+++|||+|
T Consensus 81 ~~~YvGNL~W~TTD~DL~~A~~S~G~~~~~dmKFFENR~NGQSKG~AL~ 129 (498)
T KOG4849|consen 81 YCCYVGNLLWYTTDADLLKALQSTGLAQFADMKFFENRTNGQSKGYALL 129 (498)
T ss_pred EEEEecceeEEeccHHHHHHHHhhhHHHHhhhhhhhcccCCcccceEEE
Confidence 4689999999999999998887765 4566778888999999999986
No 77
>KOG0533|consensus
Probab=98.20 E-value=5.9e-06 Score=45.81 Aligned_cols=49 Identities=31% Similarity=0.438 Sum_probs=41.4
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
....+|+|.|||..+.+++|+++|..+|.+..+.+.+++ .|.+.|.|-|
T Consensus 81 ~~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vhy~~-~G~s~Gta~v 129 (243)
T KOG0533|consen 81 TRSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVHYDR-AGRSLGTADV 129 (243)
T ss_pred CCcceeeeecCCcCcchHHHHHHHHHhccceEEeeccCC-CCCCCcccee
Confidence 334789999999999999999999999988888888885 5777776644
No 78
>KOG4212|consensus
Probab=98.19 E-value=2e-06 Score=51.08 Aligned_cols=45 Identities=24% Similarity=0.253 Sum_probs=38.7
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCcee
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRG 56 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g 56 (61)
...+.|+|+|+|||+++||+.|++-|..+|.+....++. .|+.+|
T Consensus 532 aarKa~qIiirNlP~dfTWqmlrDKfre~G~v~yadime---~GkskG 576 (608)
T KOG4212|consen 532 AARKACQIIIRNLPFDFTWQMLRDKFREIGHVLYADIME---NGKSKG 576 (608)
T ss_pred ccccccEEEEecCCccccHHHHHHHHHhccceehhhhhc---cCCccc
Confidence 356679999999999999999999999999999988853 366776
No 79
>KOG0226|consensus
Probab=98.15 E-value=2.9e-06 Score=47.30 Aligned_cols=50 Identities=26% Similarity=0.395 Sum_probs=44.1
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+.+.+||+|.|..+++++.|...|.+|-.-....+++|+.||+++||+||
T Consensus 188 ~~DfRIfcgdlgNevnd~vl~raf~Kfpsf~~akviRdkRTgKSkgygfV 237 (290)
T KOG0226|consen 188 EDDFRIFCGDLGNEVNDDVLARAFKKFPSFQKAKVIRDKRTGKSKGYGFV 237 (290)
T ss_pred cccceeecccccccccHHHHHHHHHhccchhhccccccccccccccceee
Confidence 34589999999999999999999999866556678899999999999998
No 80
>KOG4211|consensus
Probab=98.08 E-value=1.7e-05 Score=47.47 Aligned_cols=50 Identities=22% Similarity=0.485 Sum_probs=40.6
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.+...-|.+++|||.+|+++|.++|+.++ |..+.+. +.+|+..|=|||+
T Consensus 6 e~~~~~~vr~rGLPwsat~~ei~~Ff~~~~-I~~~~~~--r~~Gr~sGeA~Ve 55 (510)
T KOG4211|consen 6 EGSTAFEVRLRGLPWSATEKEILDFFSNCG-IENLEIP--RRNGRPSGEAYVE 55 (510)
T ss_pred CCCcceEEEecCCCccccHHHHHHHHhcCc-eeEEEEe--ccCCCcCcceEEE
Confidence 445567888999999999999999999985 6665444 4579999999985
No 81
>KOG0120|consensus
Probab=98.07 E-value=6.1e-06 Score=49.61 Aligned_cols=53 Identities=23% Similarity=0.370 Sum_probs=48.2
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
......+++|++||...++..+.++...||.+....++.+..+|.++||||.+
T Consensus 285 ~~~~~~ki~v~~lp~~l~~~q~~Ell~~fg~lk~f~lv~d~~~g~skg~af~e 337 (500)
T KOG0120|consen 285 VPDSPNKIFVGGLPLYLTEDQVKELLDSFGPLKAFRLVKDSATGNSKGFAFCE 337 (500)
T ss_pred cccccchhhhccCcCccCHHHHHHHHHhcccchhheeecccccccccceeeee
Confidence 34556899999999999999999999999999999999999999999999964
No 82
>KOG1548|consensus
Probab=97.99 E-value=3.8e-05 Score=44.44 Aligned_cols=49 Identities=24% Similarity=0.360 Sum_probs=36.3
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEE--------EEEccCCCCCCceeEEE
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGD--------IYIPRDRFTRESRGFAF 59 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~--------~~~~~~~~t~~~~g~~f 59 (61)
......|||.|||.++|-+.+.++|++||-|.. |.+-++. .|+.+|=|.
T Consensus 131 ~~~Nt~VYVsgLP~DiT~dE~~~~~sKcGiI~~d~~t~epk~KlYrd~-~G~lKGDaL 187 (382)
T KOG1548|consen 131 PKVNTSVYVSGLPLDITVDEFAEVMSKCGIIMRDPQTGEPKVKLYRDN-QGKLKGDAL 187 (382)
T ss_pred cccCceEEecCCCCcccHHHHHHHHHhcceEeccCCCCCeeEEEEecC-CCCccCceE
Confidence 445677999999999999999999999997643 4455553 366666443
No 83
>KOG0110|consensus
Probab=97.97 E-value=1.4e-05 Score=49.52 Aligned_cols=48 Identities=38% Similarity=0.532 Sum_probs=39.3
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCC---CceeEEEEC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTR---ESRGFAFVR 61 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~---~~~g~~fv~ 61 (61)
++||+.||++.++.+++..+|...|.|.++.+...+... -+.|||||+
T Consensus 516 t~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFVE 566 (725)
T KOG0110|consen 516 TKLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFVE 566 (725)
T ss_pred hhhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEEE
Confidence 459999999999999999999999999998776554321 234999996
No 84
>KOG4210|consensus
Probab=97.91 E-value=8.6e-06 Score=46.09 Aligned_cols=50 Identities=32% Similarity=0.573 Sum_probs=45.0
Q ss_pred CCcEEE-EecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLK-VDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~-v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
...++| ++++++.+++++|+.+|..+|.+..++++.++.++..+|||+++
T Consensus 183 ~s~~~~~~~~~~f~~~~d~~~~~~~~~~~i~~~r~~~~~~s~~~kg~a~~~ 233 (285)
T KOG4210|consen 183 PSDTIFFVGELDFSLTRDDLKEHFVSSGEITSVRLPTDEESGDSKGFAYVD 233 (285)
T ss_pred ccccceeecccccccchHHHhhhccCcCcceeeccCCCCCccchhhhhhhh
Confidence 345666 99999999999999999999999999999999999999999863
No 85
>KOG0151|consensus
Probab=97.85 E-value=4e-05 Score=47.86 Aligned_cols=39 Identities=28% Similarity=0.317 Sum_probs=34.1
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR 49 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~ 49 (61)
...+++|++||++.++++.|...|..||+|.++.++..+
T Consensus 172 P~TTNlyv~Nlnpsv~E~~ll~tfGrfgPlasvKimwpR 210 (877)
T KOG0151|consen 172 PQTTNLYVGNLNPSVDENFLLRTFGRFGPLASVKIMWPR 210 (877)
T ss_pred CcccceeeecCCccccHHHHHHHhcccCcccceeeeccc
Confidence 445789999999999999999999999999988776543
No 86
>KOG0147|consensus
Probab=97.81 E-value=3.5e-06 Score=50.74 Aligned_cols=51 Identities=33% Similarity=0.513 Sum_probs=46.7
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
...+++|+-.++...+.-+|.++|+..|+|..+.++.|+.++..+|.+||+
T Consensus 177 Rd~Rtvf~~qla~r~~pRdL~efFs~~gkVrdVriI~Dr~s~rskgi~Yve 227 (549)
T KOG0147|consen 177 RDQRTVFCMQLARRNPPRDLEEFFSIVGKVRDVRIIGDRNSRRSKGIAYVE 227 (549)
T ss_pred HhHHHHHHHHHhhcCCchhHHHHHHhhcCcceeEeeccccchhhcceeEEE
Confidence 345788998999999999999999999999999999999999999999985
No 87
>KOG0106|consensus
Probab=97.79 E-value=2.7e-05 Score=42.56 Aligned_cols=32 Identities=25% Similarity=0.508 Sum_probs=29.2
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYI 45 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~ 45 (61)
..+||++||+.+.+.++..+|..||.+..+.+
T Consensus 2 ~rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~m 33 (216)
T KOG0106|consen 2 PRVYIGRLPYRARERDVERFFKGYGKIPDADM 33 (216)
T ss_pred CceeecccCCccchhHHHHHHhhcccccccee
Confidence 46899999999999999999999999888754
No 88
>KOG4660|consensus
Probab=97.78 E-value=4.9e-05 Score=46.03 Aligned_cols=45 Identities=36% Similarity=0.473 Sum_probs=36.9
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
...+|+|-|||..+++++|+++|+.||+|+.++ .|...+|..||+
T Consensus 74 ~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir-----~t~~~~~~~~v~ 118 (549)
T KOG4660|consen 74 NQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIR-----ETPNKRGIVFVE 118 (549)
T ss_pred ccceEEEEecCCcCCHHHHHHHHHhhcchhhhh-----cccccCceEEEE
Confidence 347999999999999999999999999999854 344456777764
No 89
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=97.69 E-value=7.7e-05 Score=32.28 Aligned_cols=27 Identities=48% Similarity=0.843 Sum_probs=20.9
Q ss_pred HHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 30 LRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 30 l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
|.++|++||.|.++.+..+. +++|||+
T Consensus 1 L~~~f~~fG~V~~i~~~~~~-----~~~a~V~ 27 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKKK-----RGFAFVE 27 (56)
T ss_dssp HHHHHTTTS-EEEEEEETTS-----TTEEEEE
T ss_pred ChHHhCCcccEEEEEEEeCC-----CCEEEEE
Confidence 67899999999999886553 5788885
No 90
>KOG0129|consensus
Probab=97.66 E-value=0.00026 Score=42.80 Aligned_cols=52 Identities=23% Similarity=0.214 Sum_probs=43.5
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEE
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
..+..+|||||+||.-++..+|..+|.. ||.|..+.|=.|++=+.++|-|-|
T Consensus 366 ~lDprrTVFVGgvprpl~A~eLA~imd~lyGgV~yaGIDtD~k~KYPkGaGRV 418 (520)
T KOG0129|consen 366 PIDPRRTVFVGGLPRPLTAEELAMIMEDLFGGVLYVGIDTDPKLKYPKGAGRV 418 (520)
T ss_pred ccCccceEEecCCCCcchHHHHHHHHHHhcCceEEEEeccCcccCCCCCccee
Confidence 4566689999999999999999999995 999999988778666667776655
No 91
>KOG0129|consensus
Probab=97.31 E-value=0.00059 Score=41.32 Aligned_cols=33 Identities=21% Similarity=0.347 Sum_probs=29.4
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEc
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIP 46 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~ 46 (61)
.++||||+||++++|+.|...|..||.+ .+.|+
T Consensus 259 S~KVFvGGlp~dise~~i~~~F~~FGs~-~VdWP 291 (520)
T KOG0129|consen 259 SRKVFVGGLPWDITEAQINASFGQFGSV-KVDWP 291 (520)
T ss_pred ccceeecCCCccccHHHHHhhcccccce-EeecC
Confidence 5799999999999999999999999985 46666
No 92
>KOG4211|consensus
Probab=97.27 E-value=0.0012 Score=39.88 Aligned_cols=49 Identities=27% Similarity=0.419 Sum_probs=38.5
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEE-EEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGD-IYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~-~~~~~~~~t~~~~g~~fv~ 61 (61)
..-.|.+++||+.|++++|.++|+..--+.. +.++.++ .++..|-|||+
T Consensus 102 ~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d~-rgR~tGEAfVq 151 (510)
T KOG4211|consen 102 NDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMDQ-RGRPTGEAFVQ 151 (510)
T ss_pred CCceEEecCCCccCcHHHHHHHhcCCcccccceeeeccC-CCCcccceEEE
Confidence 4568999999999999999999987644444 5567775 46688888884
No 93
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=97.22 E-value=0.00049 Score=33.81 Aligned_cols=36 Identities=28% Similarity=0.412 Sum_probs=22.4
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
++.|++.+++..++.++|++.|+.+|.|..+.+...
T Consensus 1 G~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G 36 (105)
T PF08777_consen 1 GCILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRG 36 (105)
T ss_dssp --EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT
T ss_pred CeEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCC
Confidence 367889999999999999999999999998887653
No 94
>KOG1995|consensus
Probab=97.21 E-value=0.00076 Score=39.24 Aligned_cols=52 Identities=23% Similarity=0.210 Sum_probs=43.1
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeE--------EEEEccCCCCCCceeEEEE
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVG--------DIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~--------~~~~~~~~~t~~~~g~~fv 60 (61)
......+|||.+|+..++++++.++|.++|.|. .+++-+|++|++.+|-|.|
T Consensus 62 ~~s~~~ti~v~g~~d~~~~~~~~~~f~qcg~ikrnK~t~kPki~~y~dkeT~~~KGeatv 121 (351)
T KOG1995|consen 62 DKSDNETIFVWGCPDSVCENDNADFFLQCGVIKRNKRTGKPKIKIYTDKETGAPKGEATV 121 (351)
T ss_pred cccccccceeeccCccchHHHHHHHHhhcceeccCCCCCCcchhccccccccCcCCceee
Confidence 445567999999999999999999999999763 3567788888988887765
No 95
>KOG1190|consensus
Probab=97.12 E-value=0.0007 Score=40.30 Aligned_cols=37 Identities=16% Similarity=0.268 Sum_probs=32.6
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
..+.++++|||++++|.++..++.+||.|..+.+.+.
T Consensus 27 pSkV~HlRnlp~e~tE~elI~Lg~pFG~vtn~~~lkG 63 (492)
T KOG1190|consen 27 PSKVVHLRNLPWEVTEEELISLGLPFGKVTNLLMLKG 63 (492)
T ss_pred CcceeEeccCCccccHHHHHHhcccccceeeeeeecc
Confidence 4578999999999999999999999999998876543
No 96
>KOG1457|consensus
Probab=97.11 E-value=0.0026 Score=35.45 Aligned_cols=51 Identities=16% Similarity=0.205 Sum_probs=34.9
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCC-CceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTR-ESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~-~~~g~~fv~ 61 (61)
..-+||||.+||.++-..+|..+|..|-.-+.+.+-.....+ -.+-+||++
T Consensus 32 ~~VRTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~~~~~~pvaFat 83 (284)
T KOG1457|consen 32 GAVRTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKGDQVCKPVAFAT 83 (284)
T ss_pred cccceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCCCccccceEEEE
Confidence 346899999999999999999999987554555443322222 233566653
No 97
>KOG3152|consensus
Probab=96.96 E-value=0.002 Score=36.26 Aligned_cols=39 Identities=23% Similarity=0.406 Sum_probs=33.9
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRF 50 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~ 50 (61)
..-.||++++|+...-..|+++|+.||.|-.+.+.....
T Consensus 73 k~GVvylS~IPp~m~~~rlReil~~yGeVGRvylqpE~~ 111 (278)
T KOG3152|consen 73 KTGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQPEDD 111 (278)
T ss_pred cceEEEeccCCCccCHHHHHHHHHhccccceEEecchhh
Confidence 446899999999999999999999999999998765443
No 98
>KOG1457|consensus
Probab=96.88 E-value=0.002 Score=35.83 Aligned_cols=33 Identities=21% Similarity=0.289 Sum_probs=26.8
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEE
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI 43 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~ 43 (61)
....+|||.||..+++++.|+.+|+.|..-..+
T Consensus 208 ~acstlfianl~~~~~ed~l~~~~~~~~gf~~l 240 (284)
T KOG1457|consen 208 RACSTLFIANLGPNCTEDELKQLLSRYPGFHIL 240 (284)
T ss_pred hhhhhHhhhccCCCCCHHHHHHHHHhCCCceEE
Confidence 344689999999999999999999998543333
No 99
>PF08675 RNA_bind: RNA binding domain; InterPro: IPR014789 This domain corresponds to the RNA binding domain of Poly(A)-specific ribonuclease (PARN). ; GO: 0003723 RNA binding, 0004535 poly(A)-specific ribonuclease activity, 0046872 metal ion binding, 0006402 mRNA catabolic process, 0005634 nucleus, 0005737 cytoplasm; PDB: 3CTR_A 2ROK_A 3D45_B 1WHV_A.
Probab=96.82 E-value=0.01 Score=28.27 Aligned_cols=33 Identities=12% Similarity=0.231 Sum_probs=22.1
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
...++. .|..+-..+|.++|++||.|. +.|+.|
T Consensus 10 HVFhlt-FPkeWK~~DI~qlFspfG~I~-VsWi~d 42 (87)
T PF08675_consen 10 HVFHLT-FPKEWKTSDIYQLFSPFGQIY-VSWIND 42 (87)
T ss_dssp CEEEEE---TT--HHHHHHHCCCCCCEE-EEEECT
T ss_pred eEEEEe-CchHhhhhhHHHHhccCCcEE-EEEEcC
Confidence 445555 999999999999999999874 555544
No 100
>KOG0105|consensus
Probab=96.77 E-value=0.0061 Score=33.16 Aligned_cols=40 Identities=20% Similarity=0.196 Sum_probs=34.6
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
.......|.|.+||+..++++|+++....|.++...+.+|
T Consensus 111 srrSe~RVvVsGLp~SgSWQDLKDHmReaGdvCfadv~rD 150 (241)
T KOG0105|consen 111 SRRSEYRVVVSGLPPSGSWQDLKDHMREAGDVCFADVQRD 150 (241)
T ss_pred ccccceeEEEecCCCCCchHHHHHHHHhhCCeeeeeeecc
Confidence 3444578999999999999999999999999988887766
No 101
>KOG4206|consensus
Probab=96.58 E-value=0.012 Score=32.54 Aligned_cols=46 Identities=24% Similarity=0.312 Sum_probs=37.5
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.....+++.|+|..++.+.+..+|.+|..-..++++... .+.|||+
T Consensus 144 ppn~ilf~~niP~es~~e~l~~lf~qf~g~keir~i~~~-----~~iAfve 189 (221)
T KOG4206|consen 144 PPNNILFLTNIPSESESEMLSDLFEQFPGFKEIRLIPPR-----SGIAFVE 189 (221)
T ss_pred CCceEEEEecCCcchhHHHHHHHHhhCcccceeEeccCC-----CceeEEe
Confidence 445789999999999999999999999888888876542 4577774
No 102
>KOG0115|consensus
Probab=96.30 E-value=0.0048 Score=34.78 Aligned_cols=35 Identities=29% Similarity=0.324 Sum_probs=30.7
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
+.|||.||+..++.+.+...|+.||.+....+..|
T Consensus 32 a~l~V~nl~~~~sndll~~~f~~fg~~e~av~~vD 66 (275)
T KOG0115|consen 32 AELYVVNLMQGASNDLLEQAFRRFGPIERAVAKVD 66 (275)
T ss_pred ceEEEEecchhhhhHHHHHhhhhcCccchheeeec
Confidence 78999999999999999999999999877655544
No 103
>KOG0128|consensus
Probab=96.29 E-value=0.0032 Score=40.29 Aligned_cols=46 Identities=20% Similarity=0.388 Sum_probs=41.2
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
..++|+|.|+..+.+.++.++...|.+.+..++..+ .|+++|.|+|
T Consensus 737 ~~v~i~g~pf~gt~e~~k~l~~~~gn~~~~~~vt~r-~gkpkg~a~v 782 (881)
T KOG0128|consen 737 ISVAISGPPFQGTKEELKSLASKTGNVTSLRLVTVR-AGKPKGKARV 782 (881)
T ss_pred hhhheeCCCCCCchHHHHhhccccCCccccchhhhh-ccccccceec
Confidence 578999999999999999999999999999887775 5889998886
No 104
>KOG0128|consensus
Probab=95.80 E-value=0.00015 Score=46.00 Aligned_cols=50 Identities=24% Similarity=0.410 Sum_probs=41.5
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.+++|++||+..+.+.+|...|..+|.+..+.+.-....++.+|+||++
T Consensus 666 ~~~~~fvsnl~~~~~~~dl~~~~~~~~~~e~vqi~~h~n~~~~rG~~Y~~ 715 (881)
T KOG0128|consen 666 DLIKIFVSNLSPKMSEEDLSERFSPSGTIEVVQIVIHKNEKRFRGKAYVE 715 (881)
T ss_pred HHHHHHHhhcchhhcCchhhhhcCccchhhhHHHHHHhhccccccceeeE
Confidence 34689999999999999999999999887777665455678899999873
No 105
>KOG1855|consensus
Probab=95.36 E-value=0.038 Score=33.40 Aligned_cols=37 Identities=19% Similarity=0.371 Sum_probs=33.6
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
+.++|.+-|||.+-.-+.|.++|..+|.|..+++...
T Consensus 230 ~srtivaenLP~Dh~~enl~kiFg~~G~IksIRIckP 266 (484)
T KOG1855|consen 230 PSRTIVAENLPLDHSYENLSKIFGTVGSIKSIRICKP 266 (484)
T ss_pred ccceEEEecCCcchHHHHHHHHhhcccceeeeeecCC
Confidence 5689999999999998999999999999999998755
No 106
>KOG1365|consensus
Probab=95.31 E-value=0.024 Score=33.94 Aligned_cols=48 Identities=15% Similarity=0.241 Sum_probs=37.9
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCC-eEE--EEEccCCCCCCceeEEEEC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGE-VGD--IYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~-i~~--~~~~~~~~t~~~~g~~fv~ 61 (61)
...|..++||+.++-++|..+|..|.. +.. ++++.+ ..|+..|-|||+
T Consensus 280 kdcvRLRGLPy~AtvEdIL~FlgdFa~~i~f~gVHmv~N-~qGrPSGeAFIq 330 (508)
T KOG1365|consen 280 KDCVRLRGLPYEATVEDILDFLGDFATDIRFQGVHMVLN-GQGRPSGEAFIQ 330 (508)
T ss_pred CCeeEecCCChhhhHHHHHHHHHHHhhhcccceeEEEEc-CCCCcChhhhhh
Confidence 456889999999999999999988753 333 667766 468888888875
No 107
>KOG2314|consensus
Probab=95.14 E-value=0.14 Score=32.17 Aligned_cols=50 Identities=24% Similarity=0.400 Sum_probs=38.4
Q ss_pred CCCcEEEEecCCCCCCH------HHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 11 EGMVSLKVDNLTYRTTP------DDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~------~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.-...|.|.|+|-.-.. .-|..+|+++|++....++.+.++| .+||.|++
T Consensus 56 g~D~vVvv~g~PvV~~~rl~klk~vl~kvfsk~gk~vn~~~P~~e~gg-tkG~lf~E 111 (698)
T KOG2314|consen 56 GFDSVVVVDGAPVVGPARLEKLKKVLTKVFSKAGKIVNMYYPIDEEGG-TKGYLFVE 111 (698)
T ss_pred CcceEEEECCCcccChhHHHHHHHHHHHHHHhhccccceeeccCccCC-eeeEEEEE
Confidence 33467888888865432 2456889999999999999887766 89999975
No 108
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=94.59 E-value=0.1 Score=22.53 Aligned_cols=32 Identities=25% Similarity=0.506 Sum_probs=24.2
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEc
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIP 46 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~ 46 (61)
+.|-|.+.+.... +.+..+|..+|+|....+.
T Consensus 2 ~wI~V~Gf~~~~~-~~vl~~F~~fGeI~~~~~~ 33 (53)
T PF14605_consen 2 TWISVSGFPPDLA-EEVLEHFASFGEIVDIYVP 33 (53)
T ss_pred cEEEEEeECchHH-HHHHHHHHhcCCEEEEEcC
Confidence 4577888887655 4556689999999998775
No 109
>KOG1365|consensus
Probab=93.67 E-value=0.32 Score=29.44 Aligned_cols=46 Identities=22% Similarity=0.319 Sum_probs=33.0
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhC----CCeEEEEEccCCCCCCceeEEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERC----GEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~----G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
-.|.+++||+++++.++.++|.+- |....+.++..+ .|+..|=|||
T Consensus 162 vivRmRGLPfdat~~dVv~FF~~~cpv~~g~egvLFV~rp-dgrpTGdAFv 211 (508)
T KOG1365|consen 162 VIVRMRGLPFDATALDVVEFFGPPCPVTGGTEGVLFVTRP-DGRPTGDAFV 211 (508)
T ss_pred eEEEecCCCCCcchHHHHHhcCCCCcccCCccceEEEECC-CCCcccceEE
Confidence 567889999999999999999631 234455555443 4777777776
No 110
>KOG4676|consensus
Probab=93.07 E-value=0.12 Score=31.09 Aligned_cols=35 Identities=17% Similarity=0.396 Sum_probs=30.9
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEcc
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPR 47 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~ 47 (61)
...|.|.||.+.++.++++.+|...|.|..+.|.-
T Consensus 7 ~~vIqvanispsat~dqm~tlFg~lGkI~elrlyp 41 (479)
T KOG4676|consen 7 LGVIQVANISPSATKDQMQTLFGNLGKIPELRLYP 41 (479)
T ss_pred CceeeecccCchhhHHHHHHHHhhccccccccccC
Confidence 35899999999999999999999999998887643
No 111
>KOG0112|consensus
Probab=92.81 E-value=0.043 Score=35.80 Aligned_cols=37 Identities=24% Similarity=0.427 Sum_probs=33.0
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
...+||++||+..+++.+++..|..+|.+.++.+-..
T Consensus 371 atrTLf~Gnl~~kl~eseiR~af~e~gkve~VDiKtP 407 (975)
T KOG0112|consen 371 ATRTLFLGNLDSKLTESEIRPAFDESGKVEEVDIKTP 407 (975)
T ss_pred hhhhhhhcCcccchhhhhhhhhhhhhccccccccccC
Confidence 3579999999999999999999999999999877544
No 112
>KOG0106|consensus
Probab=92.30 E-value=0.13 Score=28.51 Aligned_cols=33 Identities=33% Similarity=0.437 Sum_probs=28.2
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEE
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDI 43 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~ 43 (61)
...+.+.+.+++..+.+.+|.++|.++|.+...
T Consensus 97 ~s~~r~~~~~~~~r~~~qdl~d~~~~~g~~~~~ 129 (216)
T KOG0106|consen 97 RTHFRLIVRNLSLRVSWQDLKDHFRPAGEVTYV 129 (216)
T ss_pred cccceeeeccchhhhhHHHHhhhhcccCCCchh
Confidence 345788999999999999999999999988443
No 113
>KOG1190|consensus
Probab=92.18 E-value=0.47 Score=28.90 Aligned_cols=38 Identities=34% Similarity=0.386 Sum_probs=32.8
Q ss_pred CcEEEEecCCCC-CCHHHHHHHhhhCCCeEEEEEccCCC
Q psy7050 13 MVSLKVDNLTYR-TTPDDLRRVFERCGEVGDIYIPRDRF 50 (61)
Q Consensus 13 ~~~l~v~~l~~~-~~~~~l~~~f~~~G~i~~~~~~~~~~ 50 (61)
...|.|.||... ++.+.|..+|..||.|.++.+..++.
T Consensus 297 n~vllvsnln~~~VT~d~LftlFgvYGdVqRVkil~nkk 335 (492)
T KOG1190|consen 297 NVVLLVSNLNEEAVTPDVLFTLFGVYGDVQRVKILYNKK 335 (492)
T ss_pred ceEEEEecCchhccchhHHHHHHhhhcceEEEEeeecCC
Confidence 477889999776 68899999999999999999988753
No 114
>KOG1456|consensus
Probab=90.99 E-value=1.2 Score=27.00 Aligned_cols=37 Identities=14% Similarity=0.111 Sum_probs=31.7
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEc
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIP 46 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~ 46 (61)
.+..-.+.|++|...+.+.+|-+.++.||.|..+..+
T Consensus 28 ~~~spvvhvr~l~~~v~eadl~eal~~fG~i~yvt~~ 64 (494)
T KOG1456|consen 28 PNPSPVVHVRGLHQGVVEADLVEALSNFGPIAYVTCM 64 (494)
T ss_pred CCCCceEEEeccccccchhHHHHHHhcCCceEEEEec
Confidence 3445689999999999999999999999999887544
No 115
>KOG4307|consensus
Probab=90.88 E-value=0.35 Score=31.34 Aligned_cols=51 Identities=27% Similarity=0.046 Sum_probs=38.8
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEE-EEEccCCCCCCceeEEEEC
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGD-IYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~-~~~~~~~~t~~~~g~~fv~ 61 (61)
...+..|||-.||..+++.++-++|...-.|++ +.|..- .+++.++-|||+
T Consensus 431 ~~ag~~lyv~~lP~~t~~~~~v~~f~~~~~Ved~I~lt~~-P~~~~~~~afv~ 482 (944)
T KOG4307|consen 431 GGAGGALYVFQLPVMTPIVPPVNKFMGAAAVEDFIELTRL-PTDLLRPAAFVA 482 (944)
T ss_pred CCccceEEeccCCccccccchhhhhhhhhhhhheeEeccC-Ccccccchhhhe
Confidence 345678999999999999999999988766666 555444 356677777763
No 116
>KOG1456|consensus
Probab=90.77 E-value=1.7 Score=26.44 Aligned_cols=41 Identities=17% Similarity=0.104 Sum_probs=33.5
Q ss_pred CCCCCcEEEEecCCCCC-CHHHHHHHhhhCCCeEEEEEccCC
Q psy7050 9 RIEGMVSLKVDNLTYRT-TPDDLRRVFERCGEVGDIYIPRDR 49 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~-~~~~l~~~f~~~G~i~~~~~~~~~ 49 (61)
....++.+.|-+|.... +-+.|..+|-.||.|..+.+++.+
T Consensus 283 g~~~g~VmMVyGLdh~k~N~drlFNl~ClYGNV~rvkFmkTk 324 (494)
T KOG1456|consen 283 GGAPGCVMMVYGLDHGKMNCDRLFNLFCLYGNVERVKFMKTK 324 (494)
T ss_pred CCCCCcEEEEEeccccccchhhhhhhhhhcCceeeEEEeecc
Confidence 34556899999998875 557789999999999999998764
No 117
>KOG0112|consensus
Probab=90.55 E-value=0.86 Score=30.24 Aligned_cols=36 Identities=22% Similarity=0.229 Sum_probs=31.4
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEE
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYI 45 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~ 45 (61)
....+.+++++|..++.-..|...|..||.+..+.+
T Consensus 452 st~ttr~~sgglg~w~p~~~l~r~fd~fGpir~Idy 487 (975)
T KOG0112|consen 452 STPTTRLQSGGLGPWSPVSRLNREFDRFGPIRIIDY 487 (975)
T ss_pred cccceeeccCCCCCCChHHHHHHHhhccCcceeeec
Confidence 344578999999999999999999999999988764
No 118
>KOG4307|consensus
Probab=89.95 E-value=1 Score=29.37 Aligned_cols=46 Identities=15% Similarity=0.127 Sum_probs=31.8
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEE
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFA 58 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~ 58 (61)
.+.|-+.|+|++++-++|-++|..|-.+......+-.+-|...|-|
T Consensus 867 p~V~~~~n~Pf~v~l~dI~~FF~dY~~~p~sI~~r~nd~G~pTGe~ 912 (944)
T KOG4307|consen 867 PRVLSCNNFPFDVTLEDIVEFFNDYEPDPNSIRIRRNDDGVPTGEC 912 (944)
T ss_pred CeEEEecCCCccccHHHHHHHhcccccCCCceeEeecCCCCcccce
Confidence 3478899999999999999999998654332223223446555544
No 119
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=89.58 E-value=0.74 Score=24.64 Aligned_cols=30 Identities=7% Similarity=0.057 Sum_probs=19.4
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhh-CCCe
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFER-CGEV 40 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i 40 (61)
....+|-|++||+..+++.+.+.+.. ++.-
T Consensus 5 ~~~~KvVIR~LPP~LteeeF~~~i~~~l~~~ 35 (176)
T PF03467_consen 5 KEGTKVVIRRLPPNLTEEEFWEQISPWLPDE 35 (176)
T ss_dssp ----EEEEEEE-TTS-HHHHCCCCSS--SSE
T ss_pred ccCceEEEeCCCCCCCHHHHHHHhhhhcccc
Confidence 34579999999999999999887766 5543
No 120
>KOG2193|consensus
Probab=89.49 E-value=0.33 Score=29.73 Aligned_cols=24 Identities=38% Similarity=0.556 Sum_probs=21.3
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERC 37 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~ 37 (61)
.++|++||.+..+..+++.+|...
T Consensus 2 nklyignL~p~~~psdl~svfg~a 25 (584)
T KOG2193|consen 2 NKLYIGNLSPQVTPSDLESVFGDA 25 (584)
T ss_pred CcccccccCCCCChHHHHHHhccc
Confidence 478999999999999999998754
No 121
>PF09707 Cas_Cas2CT1978: CRISPR-associated protein (Cas_Cas2CT1978); InterPro: IPR010152 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. This entry represents a minor branch of the Cas2 family of CRISPR-associated protein which are found in IPR003799 from INTERPRO. Cas2 is one of four protein families (Cas1 to Cas4) that are associated with CRISPR elements and always occur near a repeat cluster, usually in the order cas3-cas4-cas1-cas2. The function of Cas2 (and Cas1) is unknown. Cas3 proteins appear to be helicases while Cas4 proteins resemble RecB-type exonucleases, suggesting that these genes are involved in DNA metabolism or gene expression [].
Probab=87.18 E-value=1.5 Score=20.94 Aligned_cols=45 Identities=24% Similarity=0.327 Sum_probs=28.9
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
..-+||++++..+.+.-...+.+..+....+-+..+. ...||+|-
T Consensus 25 ~~GVyVg~~s~rVRe~lW~~v~~~~~~G~a~m~~~~~---neqG~~~~ 69 (86)
T PF09707_consen 25 RPGVYVGNVSARVRERLWERVTEWIGDGSAVMVWSDN---NEQGFDFR 69 (86)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHhhCCCccEEEEEccC---CCCCEEEE
Confidence 3569999999988887777776655544444333332 15677763
No 122
>KOG4660|consensus
Probab=85.73 E-value=0.92 Score=28.42 Aligned_cols=47 Identities=26% Similarity=0.395 Sum_probs=30.0
Q ss_pred cEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+++.|.|++...|...|.+.-.+ .|.-..+.++.|-.+..+.|||||
T Consensus 389 tt~~iknipNK~T~~ml~~~d~~~~gtYDFlYLPiDF~nkcNvGYAFI 436 (549)
T KOG4660|consen 389 TTLMIKNIPNKYTSKMLLAADEKNKGTYDFLYLPIDFKNKCNVGYAFI 436 (549)
T ss_pred hhhHhhccCchhhHHhhhhhhccccCccceEEeccccccccccceeEE
Confidence 44555555555555544444322 455566778888888889999997
No 123
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=85.62 E-value=1.1 Score=26.84 Aligned_cols=35 Identities=14% Similarity=0.297 Sum_probs=26.8
Q ss_pred CcEEEEecCCCCCCHHH----H--HHHhhhCCCeEEEEEcc
Q psy7050 13 MVSLKVDNLTYRTTPDD----L--RRVFERCGEVGDIYIPR 47 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~----l--~~~f~~~G~i~~~~~~~ 47 (61)
..-+||-+|++.+..+. | .+.|.+||.|..+.+-+
T Consensus 114 KNLvYVigi~pkva~Ee~~~vLk~~eyFGQyGkI~KIvvNk 154 (480)
T COG5175 114 KNLVYVIGIPPKVADEEVAPVLKRHEYFGQYGKIKKIVVNK 154 (480)
T ss_pred cceeEEecCCCCCCcccccccccchhhhhhccceeEEEecc
Confidence 35689999999886654 2 47899999999986643
No 124
>KOG2253|consensus
Probab=82.42 E-value=1.2 Score=28.49 Aligned_cols=35 Identities=14% Similarity=0.226 Sum_probs=29.8
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEE
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIY 44 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~ 44 (61)
.....++||+|+...+....+..++..+|.|.++.
T Consensus 37 ~~~~~~vfv~~~~~~~s~~~~~~il~~~g~v~s~k 71 (668)
T KOG2253|consen 37 LPPRDTVFVGNISYLVSQEFWKSILAKSGFVPSWK 71 (668)
T ss_pred CCCCceeEecchhhhhhHHHHHHHHhhCCcchhhh
Confidence 34557999999999999999999999999877653
No 125
>KOG2416|consensus
Probab=80.85 E-value=1 Score=28.77 Aligned_cols=37 Identities=32% Similarity=0.461 Sum_probs=28.7
Q ss_pred CCCCCCCcEEEEecCCCCCCHHHHHHHhhh-CCCeEEE
Q psy7050 7 PPRIEGMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDI 43 (61)
Q Consensus 7 ~~~~~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~ 43 (61)
|+.......|||.||-.-.|.-.|++++.. +|.|...
T Consensus 438 PsR~~~SnvlhI~nLvRPFTlgQLkelL~rtgg~Vee~ 475 (718)
T KOG2416|consen 438 PSRKEPSNVLHIDNLVRPFTLGQLKELLGRTGGNVEEF 475 (718)
T ss_pred CCCCCccceEeeecccccchHHHHHHHHhhccCchHHH
Confidence 344455688999999999999999999985 5555544
No 126
>KOG4008|consensus
Probab=80.61 E-value=3 Score=23.73 Aligned_cols=32 Identities=22% Similarity=0.275 Sum_probs=27.3
Q ss_pred CCCCCcEEEEecCCCCCCHHHHHHHhhhCCCe
Q psy7050 9 RIEGMVSLKVDNLTYRTTPDDLRRVFERCGEV 40 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~~~~~~l~~~f~~~G~i 40 (61)
...+...+|+-|+|..++++.|.+..+.+|-+
T Consensus 36 ~~~eKd~lfl~Nvp~~~tee~lkr~vsqlg~v 67 (261)
T KOG4008|consen 36 NSNEKDCLFLVNVPLLSTEEHLKRFVSQLGHV 67 (261)
T ss_pred ccccccceeeecccccccHHHHHHHHHHhhhh
Confidence 34556889999999999999999999998854
No 127
>PRK11558 putative ssRNA endonuclease; Provisional
Probab=80.30 E-value=4.1 Score=19.97 Aligned_cols=45 Identities=16% Similarity=0.140 Sum_probs=26.4
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
..-+|||+++..+.+.-...+-+.++.-..+-+..+. .-.||+|-
T Consensus 27 ~~GVyVg~~S~rVRd~lW~~v~~~~~~G~avmv~~~~---~eqG~~~~ 71 (97)
T PRK11558 27 RAGVYVGDVSRRIREMIWQQVTQLAEEGNVVMAWATN---TESGFEFQ 71 (97)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHHhCCCCcEEEEEcCC---CCCCcEEE
Confidence 3569999988888776666665555443333222221 23377764
No 128
>KOG4410|consensus
Probab=79.33 E-value=11 Score=22.38 Aligned_cols=26 Identities=19% Similarity=0.281 Sum_probs=22.4
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~ 39 (61)
.-|+++||+.++.-.+|+..+.+.+.
T Consensus 331 ~di~~~nl~rd~rv~dlk~~lr~~~~ 356 (396)
T KOG4410|consen 331 TDIKLTNLSRDIRVKDLKSELRKREC 356 (396)
T ss_pred cceeeccCccccchHHHHHHHHhcCC
Confidence 56999999999999999988877653
No 129
>PF07292 NID: Nmi/IFP 35 domain (NID); InterPro: IPR009909 This entry represents a domain of approximately 90 residues that is tandemly repeated within interferon-induced 35 kDa protein (IFP 35) and the homologous N-myc-interactor (Nmi). This domain mediates Nmi-Nmi protein interactions and subcellular localisation [].
Probab=78.37 E-value=6 Score=18.99 Aligned_cols=22 Identities=14% Similarity=0.191 Sum_probs=19.1
Q ss_pred CcEEEEecCCCCCCHHHHHHHh
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVF 34 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f 34 (61)
.++|.+.|+|....++.|++.+
T Consensus 52 ~rtVlvsgip~~l~ee~l~D~L 73 (88)
T PF07292_consen 52 KRTVLVSGIPDVLDEEELRDKL 73 (88)
T ss_pred CCEEEEeCCCCCCChhhheeeE
Confidence 3789999999999999998764
No 130
>COG0030 KsgA Dimethyladenosine transferase (rRNA methylation) [Translation, ribosomal structure and biogenesis]
Probab=78.07 E-value=2.6 Score=24.12 Aligned_cols=35 Identities=20% Similarity=0.159 Sum_probs=25.6
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
....|+|||+.++..-+..++...-.+....++.+
T Consensus 96 ~~~vVaNlPY~Isspii~kll~~~~~~~~~v~M~Q 130 (259)
T COG0030 96 PYKVVANLPYNISSPILFKLLEEKFIIQDMVLMVQ 130 (259)
T ss_pred CCEEEEcCCCcccHHHHHHHHhccCccceEEEEeH
Confidence 45678999999999999988876544444555443
No 131
>KOG4454|consensus
Probab=76.79 E-value=0.52 Score=26.55 Aligned_cols=49 Identities=35% Similarity=0.538 Sum_probs=36.4
Q ss_pred CCCcEEEEec----CCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 11 EGMVSLKVDN----LTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 11 ~~~~~l~v~~----l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+...+++.|+ |....+.+.+.+.|+..|.+..+++..+.+ +.++-++|+
T Consensus 78 e~q~~~r~G~shapld~r~~~ei~~~v~s~a~p~~~~R~~~~~d-~rnrn~~~~ 130 (267)
T KOG4454|consen 78 EEQRTLRCGNSHAPLDERVTEEILYEVFSQAGPIEGVRIPTDND-GRNRNFGFV 130 (267)
T ss_pred hhhcccccCCCcchhhhhcchhhheeeecccCCCCCcccccccc-CCccCccch
Confidence 3346777777 777778888888999999988888877755 566666654
No 132
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=74.96 E-value=4.8 Score=19.72 Aligned_cols=30 Identities=13% Similarity=0.225 Sum_probs=19.8
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIY 44 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~ 44 (61)
.-|.|=+.|+. ....+.+.|++||.|.+..
T Consensus 7 ~wVtVFGfp~~-~~~~Vl~~F~~~G~Ile~~ 36 (100)
T PF05172_consen 7 TWVTVFGFPPS-ASNQVLRHFSSFGTILEHF 36 (100)
T ss_dssp CEEEEE---GG-GHHHHHHHHHCCS-EECEE
T ss_pred eEEEEEccCHH-HHHHHHHHHHhcceEEEee
Confidence 55777788887 5567788999999987764
No 133
>PF11608 Limkain-b1: Limkain b1; InterPro: IPR024582 This entry represents a conserved domain found in limkain b1, which is a novel human autoantigen, localised to a subset of ABCD3 and PXF marked peroxisomes. Limkain b1 may be a relatively common target of human autoantibodies reactive to cytoplasmic vesicle-like structures [].; PDB: 2DIU_A.
Probab=74.91 E-value=6.1 Score=19.07 Aligned_cols=30 Identities=33% Similarity=0.576 Sum_probs=17.3
Q ss_pred cEEEEecCCCCCCHHH----HHHHhhhCC-CeEEE
Q psy7050 14 VSLKVDNLTYRTTPDD----LRRVFERCG-EVGDI 43 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~----l~~~f~~~G-~i~~~ 43 (61)
..|+|.|||.+..... |+.++.-|| .+..+
T Consensus 3 s~L~V~NLP~~~d~~~I~~RL~qLsdNCGGkVl~v 37 (90)
T PF11608_consen 3 SLLYVSNLPTNKDPSSIKNRLRQLSDNCGGKVLSV 37 (90)
T ss_dssp EEEEEES--TTS-HHHHHHHHHHHHHTTT--EEE-
T ss_pred cEEEEecCCCCCCHHHHHHHHHHHhhccCCEEEEE
Confidence 4689999999988655 556666665 45443
No 134
>KOG4210|consensus
Probab=74.51 E-value=1.7 Score=25.05 Aligned_cols=49 Identities=14% Similarity=-0.010 Sum_probs=35.9
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
..+++|++++.+.+.+.+...++..+|......+.........+|++++
T Consensus 87 ~~~~~f~g~~s~~~e~~~~~~~~~~~g~~~~~~~S~~~~~~~sk~~~s~ 135 (285)
T KOG4210|consen 87 SSSTFFVGELSENIEESEDDNFSSEAGLRVDARSSSLEDSLSSKGGLSV 135 (285)
T ss_pred ccccccccccccchhhccccccchhhcCcccchhhhhccccccccceee
Confidence 4688999999999988877888888887666655554455566666654
No 135
>PF10567 Nab6_mRNP_bdg: RNA-recognition motif; InterPro: IPR018885 This conserved domain is found in fungal proteins and appears to be involved in RNA-processing. It binds to poly-adenylated RNA, interacts genetically with mRNA 3'-end processing factors, co-purifies with the nuclear cap-binding protein Cbp20p, and is found in complexes containing other translation factors, such as EIF4G as in P39935 from SWISSPROT and P39936 from SWISSPROT.
Probab=73.37 E-value=3.2 Score=24.34 Aligned_cols=37 Identities=19% Similarity=0.327 Sum_probs=31.5
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDR 49 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~ 49 (61)
.+.|...|+...++-..+...|.+||+|+++.++.+.
T Consensus 15 TRSLLfeNv~~sidLh~Fl~~fv~~~pIESiYL~~~~ 51 (309)
T PF10567_consen 15 TRSLLFENVNNSIDLHSFLTKFVKFGPIESIYLIKSN 51 (309)
T ss_pred eHHHHHhhccccccHHHHHHHhhccCceeEEEEecCC
Confidence 4568888999888888888889999999999988764
No 136
>COG5584 Predicted small secreted protein [Function unknown]
Probab=69.49 E-value=6 Score=19.47 Aligned_cols=29 Identities=7% Similarity=0.200 Sum_probs=20.5
Q ss_pred CCCCCCHHHHHHHhhhCCCeEEEEEccCC
Q psy7050 21 LTYRTTPDDLRRVFERCGEVGDIYIPRDR 49 (61)
Q Consensus 21 l~~~~~~~~l~~~f~~~G~i~~~~~~~~~ 49 (61)
++....-+-+++.|+++|.|.-.++...+
T Consensus 30 is~e~alk~vk~afk~~mnI~GSwI~~~p 58 (103)
T COG5584 30 ISRENALKVVKEAFKQFMNIKGSWIVYEP 58 (103)
T ss_pred cChhHHHHHHHHHhcccCCcceeEEEEec
Confidence 33444445678999999999888776655
No 137
>KOG2891|consensus
Probab=67.57 E-value=9.8 Score=22.47 Aligned_cols=35 Identities=26% Similarity=0.555 Sum_probs=26.7
Q ss_pred CCcEEEEecCCCC------------CCHHHHHHHhhhCCCeEEEEEc
Q psy7050 12 GMVSLKVDNLTYR------------TTPDDLRRVFERCGEVGDIYIP 46 (61)
Q Consensus 12 ~~~~l~v~~l~~~------------~~~~~l~~~f~~~G~i~~~~~~ 46 (61)
.+.+||+.++|-. .+++.|+..|..||.|..+.++
T Consensus 148 rpdti~la~ip~kwf~lkedg~~dlpse~rlr~a~eafg~ir~vdip 194 (445)
T KOG2891|consen 148 RPDTIHLAGIPCKWFALKEDGSEDLPSEDRLRKAFEAFGEIRNVDIP 194 (445)
T ss_pred CCCceeecCCcceeeeecccccccCChHHHHHHHHHHhccceecCCc
Confidence 3467888877643 3467799999999999988765
No 138
>PTZ00338 dimethyladenosine transferase-like protein; Provisional
Probab=66.08 E-value=7 Score=22.63 Aligned_cols=34 Identities=9% Similarity=0.055 Sum_probs=24.2
Q ss_pred EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
.+.|+|+|+.++...+..++.....+....++.+
T Consensus 103 d~VvaNlPY~Istpil~~ll~~~~~~~~~vlm~Q 136 (294)
T PTZ00338 103 DVCVANVPYQISSPLVFKLLAHRPLFRCAVLMFQ 136 (294)
T ss_pred CEEEecCCcccCcHHHHHHHhcCCCCceeeeeeh
Confidence 4678999999999888888865444455555444
No 139
>TIGR01873 cas_CT1978 CRISPR-associated endoribonuclease Cas2, E. coli subfamily. CRISPR is a term for Clustered, Regularly Interspaced Short Palidromic Repeats. A number of protein families appear only in association with these repeats and are designated Cas (CRISPR-Associated) proteins. This model represents a minor branch of the Cas2 family of CRISPR-associated endonuclease, whereas most Cas2 proteins are modeled instead by TIGR01573. This form of Cas2 is characteristic for the Ecoli subtype of CRISPR/Cas locus.
Probab=65.60 E-value=14 Score=17.78 Aligned_cols=24 Identities=4% Similarity=-0.032 Sum_probs=16.4
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhh
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFER 36 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~ 36 (61)
..-+||++++..+.+.-...+-+.
T Consensus 25 ~~GVyVg~~s~rVRe~lW~~v~~~ 48 (87)
T TIGR01873 25 RAGVYVGGVSASVRERIWDYLAQH 48 (87)
T ss_pred CCCcEEcCCCHHHHHHHHHHHHHh
Confidence 356999998888776555544444
No 140
>PRK15464 cold shock-like protein CspH; Provisional
Probab=64.33 E-value=4 Score=18.65 Aligned_cols=8 Identities=25% Similarity=0.476 Sum_probs=6.3
Q ss_pred ceeEEEEC
Q psy7050 54 SRGFAFVR 61 (61)
Q Consensus 54 ~~g~~fv~ 61 (61)
.+||+||+
T Consensus 15 ~KGfGFI~ 22 (70)
T PRK15464 15 KSGKGFII 22 (70)
T ss_pred CCCeEEEc
Confidence 47899984
No 141
>PF14893 PNMA: PNMA
Probab=64.09 E-value=8.7 Score=22.84 Aligned_cols=26 Identities=8% Similarity=0.164 Sum_probs=21.3
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhh
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFE 35 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~ 35 (61)
.+..+.+.|.++|.+++++++++.+.
T Consensus 15 ~~~~r~lLv~giP~dc~~~ei~e~l~ 40 (331)
T PF14893_consen 15 VDPQRALLVLGIPEDCEEAEIEEALQ 40 (331)
T ss_pred cChhhhheeecCCCCCCHHHHHHHHH
Confidence 44557899999999999998887664
No 142
>PRK00274 ksgA 16S ribosomal RNA methyltransferase KsgA/Dim1 family protein; Reviewed
Probab=63.69 E-value=9.6 Score=21.61 Aligned_cols=34 Identities=24% Similarity=0.246 Sum_probs=23.1
Q ss_pred EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccC
Q psy7050 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
-+.|+|+|+.++..-|..++.....+....++.+
T Consensus 107 ~~vv~NlPY~iss~ii~~~l~~~~~~~~~~l~~Q 140 (272)
T PRK00274 107 LKVVANLPYNITTPLLFHLLEERDPIRDMVVMVQ 140 (272)
T ss_pred ceEEEeCCccchHHHHHHHHhcCCCCCeeEEEeH
Confidence 4678999999998888888754222344444443
No 143
>PF10309 DUF2414: Protein of unknown function (DUF2414); InterPro: IPR019416 This entry contains proteins that have no known function.
Probab=63.23 E-value=13 Score=16.64 Aligned_cols=35 Identities=14% Similarity=0.140 Sum_probs=25.3
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhC---CCeEEEEEccC
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERC---GEVGDIYIPRD 48 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~---G~i~~~~~~~~ 48 (61)
..+|+|+++.. .+.++++..|..| .....+.|+-|
T Consensus 5 peavhirGvd~-lsT~dI~~y~~~y~~~~~~~~IEWIdD 42 (62)
T PF10309_consen 5 PEAVHIRGVDE-LSTDDIKAYFSEYFDEEGPFRIEWIDD 42 (62)
T ss_pred eceEEEEcCCC-CCHHHHHHHHHHhcccCCCceEEEecC
Confidence 46899999865 5677888888877 13456777665
No 144
>PF15513 DUF4651: Domain of unknown function (DUF4651)
Probab=62.63 E-value=14 Score=16.65 Aligned_cols=17 Identities=29% Similarity=0.788 Sum_probs=14.0
Q ss_pred HHHHHHhhhCCCeEEEE
Q psy7050 28 DDLRRVFERCGEVGDIY 44 (61)
Q Consensus 28 ~~l~~~f~~~G~i~~~~ 44 (61)
++|++.|+..|.|.-+.
T Consensus 9 ~~iR~~fs~lG~I~vLY 25 (62)
T PF15513_consen 9 AEIRQFFSQLGEIAVLY 25 (62)
T ss_pred HHHHHHHHhcCcEEEEE
Confidence 57899999999986554
No 145
>PRK15463 cold shock-like protein CspF; Provisional
Probab=61.65 E-value=4.5 Score=18.40 Aligned_cols=8 Identities=13% Similarity=0.360 Sum_probs=6.3
Q ss_pred ceeEEEEC
Q psy7050 54 SRGFAFVR 61 (61)
Q Consensus 54 ~~g~~fv~ 61 (61)
.+||+||+
T Consensus 15 ~kGfGFI~ 22 (70)
T PRK15463 15 KSGKGLIT 22 (70)
T ss_pred CCceEEEe
Confidence 48899984
No 146
>COG0481 LepA Membrane GTPase LepA [Cell envelope biogenesis, outer membrane]
Probab=59.53 E-value=15 Score=23.52 Aligned_cols=45 Identities=20% Similarity=0.324 Sum_probs=35.0
Q ss_pred cEEEEecCCCCCCH-HHHHHHhhhCCCeEEEEEccCCCCCCceeEEE
Q psy7050 14 VSLKVDNLTYRTTP-DDLRRVFERCGEVGDIYIPRDRFTRESRGFAF 59 (61)
Q Consensus 14 ~~l~v~~l~~~~~~-~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~f 59 (61)
-.+|.|-.|-+.++ ++|++.+++. .+.+..+...++|...-||||
T Consensus 298 P~Vf~GlyPid~~dye~LrdAleKL-~LNDasl~~E~EtS~ALGfGf 343 (603)
T COG0481 298 PMVFAGLYPVDSDDYEDLRDALEKL-QLNDASLTYEPETSQALGFGF 343 (603)
T ss_pred ceEEEeecccChhHHHHHHHHHHhc-ccccceeeeccccchhccCce
Confidence 46899988888776 7899998876 456666777778887778877
No 147
>KOG2318|consensus
Probab=59.27 E-value=38 Score=22.09 Aligned_cols=36 Identities=25% Similarity=0.318 Sum_probs=27.2
Q ss_pred CCCCcEEEEecCCCCC-CHHHHHHHhhhC----CCeEEEEE
Q psy7050 10 IEGMVSLKVDNLTYRT-TPDDLRRVFERC----GEVGDIYI 45 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~-~~~~l~~~f~~~----G~i~~~~~ 45 (61)
.+..++|-|-|+.|.. ...+|.-+|..| |.|.++.|
T Consensus 171 ~~~T~RLAVvNMDWd~v~AkDL~v~~nSFlP~gGsilSV~I 211 (650)
T KOG2318|consen 171 GEETKRLAVVNMDWDRVKAKDLFVLFNSFLPKGGSILSVKI 211 (650)
T ss_pred ccccceeeEeccccccccHHHHHHHHHhhcCCCCceeEEEe
Confidence 3556889999999984 668888888754 57777765
No 148
>PRK09507 cspE cold shock protein CspE; Reviewed
Probab=59.07 E-value=4.4 Score=18.32 Aligned_cols=8 Identities=50% Similarity=1.202 Sum_probs=6.2
Q ss_pred ceeEEEEC
Q psy7050 54 SRGFAFVR 61 (61)
Q Consensus 54 ~~g~~fv~ 61 (61)
.+||+||+
T Consensus 14 ~kGyGFI~ 21 (69)
T PRK09507 14 SKGFGFIT 21 (69)
T ss_pred CCCcEEEe
Confidence 47899984
No 149
>PRK10943 cold shock-like protein CspC; Provisional
Probab=58.18 E-value=4.6 Score=18.25 Aligned_cols=8 Identities=50% Similarity=1.202 Sum_probs=6.2
Q ss_pred ceeEEEEC
Q psy7050 54 SRGFAFVR 61 (61)
Q Consensus 54 ~~g~~fv~ 61 (61)
.+||+||+
T Consensus 14 ~kGfGFI~ 21 (69)
T PRK10943 14 SKGFGFIT 21 (69)
T ss_pred CCCcEEEe
Confidence 47899984
No 150
>PRK10354 RNA chaperone/anti-terminator; Provisional
Probab=55.41 E-value=5.7 Score=17.97 Aligned_cols=8 Identities=38% Similarity=1.132 Sum_probs=6.1
Q ss_pred ceeEEEEC
Q psy7050 54 SRGFAFVR 61 (61)
Q Consensus 54 ~~g~~fv~ 61 (61)
.+||+||+
T Consensus 15 ~kGfGFI~ 22 (70)
T PRK10354 15 DKGFGFIT 22 (70)
T ss_pred CCCcEEEe
Confidence 47899984
No 151
>PRK09890 cold shock protein CspG; Provisional
Probab=55.26 E-value=5.6 Score=18.04 Aligned_cols=8 Identities=38% Similarity=1.132 Sum_probs=6.1
Q ss_pred ceeEEEEC
Q psy7050 54 SRGFAFVR 61 (61)
Q Consensus 54 ~~g~~fv~ 61 (61)
.+||+||+
T Consensus 15 ~kGfGFI~ 22 (70)
T PRK09890 15 DKGFGFIT 22 (70)
T ss_pred CCCcEEEe
Confidence 47899984
No 152
>cd00027 BRCT Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Probab=53.32 E-value=18 Score=14.95 Aligned_cols=26 Identities=15% Similarity=0.326 Sum_probs=20.4
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~ 39 (61)
..+++.+.........|.++...+|.
T Consensus 2 ~~~~i~g~~~~~~~~~l~~~i~~~Gg 27 (72)
T cd00027 2 LTFVITGDLPSEERDELKELIEKLGG 27 (72)
T ss_pred CEEEEEecCCCcCHHHHHHHHHHcCC
Confidence 46777777667788899999988875
No 153
>PF00398 RrnaAD: Ribosomal RNA adenine dimethylase; InterPro: IPR001737 This family of proteins include rRNA adenine dimethylases (e.g. KsgA) and the Erythromycin resistance methylases (Erm). The bacterial enzyme KsgA catalyses the transfer of a total of four methyl groups from S-adenosyl-l-methionine (S-AdoMet) to two adjacent adenosine bases in 16S rRNA. This enzyme and the resulting modified adenosine bases appear to be conserved in all species of eubacteria, eukaryotes, and archaea, and in eukaryotic organelles. Bacterial resistance to the aminoglycoside antibiotic kasugamycin involves inactivation of KsgA and resulting loss of the dimethylations, with modest consequences to the overall fitness of the organism. In contrast, the yeast ortholog, Dim1, is essential. In Saccharomyces cerevisiae (Baker's yeast), and presumably in other eukaryotes, the enzyme performs a vital role in pre-rRNA processing in addition to its methylating activity. The best conserved region in these enzymes is located in the N-terminal section and corresponds to a region that is probably involved in S-adenosyl methionine (SAM) binding domain. The crystal structure of KsgA from Escherichia coli has been solved to a resolution of 2.1A. It bears a strong similarity to the crystal structure of ErmC' from Bacillus stearothermophilus and a lesser similarity to the yeast mitochondrial transcription factor, sc-mtTFB []. The Erm family of RNA methyltransferases, which methylate a single adenosine base in 23S rRNA confer resistance to the MLS-B group of antibiotics. Despite their sequence similarity, the two enzyme families have strikingly different levels of regulation that remain to be elucidated. Other orthologs, of this family include the yeast and Homo sapiens (Human) mitochondrial transcription factors (MTF1 and h-mtTFB respectively), which are nuclear encoded []. Human-mtTFB is able to stimulate transcription in vitro independently of its S-adenosylmethionine binding and rRNA methyltransferase activity [].; GO: 0000179 rRNA (adenine-N6,N6-)-dimethyltransferase activity, 0008649 rRNA methyltransferase activity, 0000154 rRNA modification; PDB: 3FTF_A 3R9X_B 3FTE_A 3FTC_A 3FTD_A 3GRY_A 3FYC_A 3GRU_A 3FYD_A 3GRV_A ....
Probab=52.94 E-value=25 Score=19.85 Aligned_cols=31 Identities=35% Similarity=0.494 Sum_probs=23.1
Q ss_pred CcEEEEecCCCCCCHHHHHHHhh--hCCCeEEE
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFE--RCGEVGDI 43 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~--~~G~i~~~ 43 (61)
..-+.|+|||...+..-+..++. .+|...-+
T Consensus 97 ~~~~vv~NlPy~is~~il~~ll~~~~~g~~~~~ 129 (262)
T PF00398_consen 97 QPLLVVGNLPYNISSPILRKLLELYRFGRVRMV 129 (262)
T ss_dssp SEEEEEEEETGTGHHHHHHHHHHHGGGCEEEEE
T ss_pred CceEEEEEecccchHHHHHHHhhcccccccceE
Confidence 35688999999999888888876 45544433
No 154
>PF03468 XS: XS domain; InterPro: IPR005380 The XS (rice gene X and SGS3) domain is found in a family of plant proteins including gene X Q9SBW2 from SWISSPROT and SGS3 Q9LDX1 from SWISSPROT. SGS3 is thought to be involved in post-transcriptional gene silencing (PTGS). This domain contains a conserved aspartate residue that may be functionally important. The XS domain containing proteins contain coiled-coils, which suggests that they will oligomerise. Most coiled-coil proteins form either a dimeric or a trimeric structure. It is possible that different members of the XS domain family could oligomerise via their coiled-coils forming a variety of complexes [].; PDB: 4E8U_C.
Probab=50.43 E-value=33 Score=17.21 Aligned_cols=33 Identities=15% Similarity=0.225 Sum_probs=18.7
Q ss_pred CHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 26 TPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 26 ~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
+.+.|.+.|..|..++ +..+.+.. -+.|++.|+
T Consensus 30 ~~~~l~~~l~~f~p~k-v~~l~~~~--gh~g~aiv~ 62 (116)
T PF03468_consen 30 SNEELLDKLAEFNPLK-VKPLYGKQ--GHTGFAIVE 62 (116)
T ss_dssp -SHHHHHHHHH---SE-EEEEEETT--EEEEEEEEE
T ss_pred CHHHHHHHHHhcCCce-eEECcCCC--CCcEEEEEE
Confidence 4578889998898765 43444432 367888763
No 155
>PF12157 DUF3591: Protein of unknown function (DUF3591); InterPro: IPR022591 This functionally uncharacterised domain is found centrally in the eukaryotic transcription initiation factor TFIID subunit 1.
Probab=48.85 E-value=32 Score=21.57 Aligned_cols=40 Identities=20% Similarity=0.283 Sum_probs=24.6
Q ss_pred ecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 19 DNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
.-.||.+|...+...=.+ ....+.=.-|+ ||...||.||.
T Consensus 359 ~laPWn~T~nFi~A~qgk--a~L~l~G~gDP-TG~GeGfSfir 398 (457)
T PF12157_consen 359 QLAPWNTTKNFINATQGK--AMLQLHGEGDP-TGRGEGFSFIR 398 (457)
T ss_pred hCCChHhHHHHHHHhccC--cEEEEecCCCC-CCCCCceEEEE
Confidence 345777787777765221 13344334564 68889999974
No 156
>PF11411 DNA_ligase_IV: DNA ligase IV; InterPro: IPR021536 DNA ligase IV along with Xrcc4 functions in DNA non-homologous end joining. This process is required to mend double-strand breaks. Upon ligase binding to an Xrcc4 dimer, the helical tails unwind leading to a flat interaction surface []. ; GO: 0003910 DNA ligase (ATP) activity; PDB: 3II6_Y 2E2W_A 1IK9_C.
Probab=48.65 E-value=19 Score=14.45 Aligned_cols=15 Identities=40% Similarity=0.627 Sum_probs=9.1
Q ss_pred CCCHHHHHHHhhhCC
Q psy7050 24 RTTPDDLRRVFERCG 38 (61)
Q Consensus 24 ~~~~~~l~~~f~~~G 38 (61)
+++++.|++.|.+..
T Consensus 20 Dtd~~~Lk~vF~~i~ 34 (36)
T PF11411_consen 20 DTDEDQLKEVFNRIK 34 (36)
T ss_dssp ---HHHHHHHHHCS-
T ss_pred cCCHHHHHHHHHHhc
Confidence 567888999987653
No 157
>TIGR00755 ksgA dimethyladenosine transferase. Alternate name: S-adenosylmethionine--6-N',N'-adenosyl (rRNA) dimethyltransferase
Probab=47.25 E-value=43 Score=18.69 Aligned_cols=25 Identities=20% Similarity=0.200 Sum_probs=20.1
Q ss_pred EEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050 15 SLKVDNLTYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l~~~f~~~G~ 39 (61)
-+.|+|+|+.++...+..++...|.
T Consensus 96 ~~vvsNlPy~i~~~il~~ll~~~~~ 120 (253)
T TIGR00755 96 LKVVSNLPYNISSPLIFKLLEKPKF 120 (253)
T ss_pred ceEEEcCChhhHHHHHHHHhccCCC
Confidence 3778999999999999988864443
No 158
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=46.75 E-value=30 Score=15.70 Aligned_cols=18 Identities=11% Similarity=0.170 Sum_probs=14.8
Q ss_pred CCCCCHHHHHHHhhhCCC
Q psy7050 22 TYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 22 ~~~~~~~~l~~~f~~~G~ 39 (61)
++.++++.++.++.+.|.
T Consensus 29 NPpine~mir~M~~QMG~ 46 (64)
T PF03672_consen 29 NPPINEKMIRAMMMQMGR 46 (64)
T ss_pred CCCCCHHHHHHHHHHhCC
Confidence 456789999999998874
No 159
>PHA01632 hypothetical protein
Probab=46.34 E-value=29 Score=15.37 Aligned_cols=18 Identities=22% Similarity=0.615 Sum_probs=14.3
Q ss_pred EecCCCCCCHHHHHHHhh
Q psy7050 18 VDNLTYRTTPDDLRRVFE 35 (61)
Q Consensus 18 v~~l~~~~~~~~l~~~f~ 35 (61)
|..+|..-++++|+..+.
T Consensus 21 ieqvp~kpteeelrkvlp 38 (64)
T PHA01632 21 IEQVPQKPTEEELRKVLP 38 (64)
T ss_pred hhhcCCCCCHHHHHHHHH
Confidence 357899999999987664
No 160
>PF07230 Peptidase_S80: Bacteriophage T4-like capsid assembly protein (Gp20); InterPro: IPR010823 This family of proteins is essential for capsid assembly in the T4-like bacteriophages []. Gp20 forms a unique 12 subunit portal vertex through which DNA enters during packaging and exits during infection. The Gp20 vertex acts as an initiator for the assembly of the major capsid protein and the scaffolding proteins into a prolate icosahedron of precise dimensions. The regulation of portal protein gene expression is an important regulator of prohead assembly in bacteriophage T4 [].
Probab=46.33 E-value=20 Score=22.68 Aligned_cols=36 Identities=22% Similarity=0.251 Sum_probs=26.3
Q ss_pred EEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCc
Q psy7050 15 SLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRES 54 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~ 54 (61)
.|=|||||..-.++-|+++..+| .-+++.|..||.-
T Consensus 280 yIDvGnlpk~KAeqyl~~iM~k~----knklvYDa~TGev 315 (501)
T PF07230_consen 280 YIDVGNLPKQKAEQYLRDIMNKY----KNKLVYDASTGEV 315 (501)
T ss_pred EEECCCCChHhHHHHHHHHHHHh----cceeeecCCCCee
Confidence 44569999999999999998887 2345556666643
No 161
>PF13046 DUF3906: Protein of unknown function (DUF3906)
Probab=46.16 E-value=31 Score=15.66 Aligned_cols=32 Identities=22% Similarity=0.328 Sum_probs=20.3
Q ss_pred HHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 27 PDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 27 ~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+..|...|-+--.+.++.+...+.- .+|-|||
T Consensus 32 e~eler~fl~~P~v~e~~l~EKKri--~~G~gyV 63 (64)
T PF13046_consen 32 EVELERHFLPLPEVKEVALYEKKRI--RKGAGYV 63 (64)
T ss_pred HHHhhhhccCCCCceEEEEEEEEee--eCCceeE
Confidence 4567777777778888877755433 3445554
No 162
>PHA02531 20 portal vertex protein; Provisional
Probab=45.43 E-value=15 Score=23.23 Aligned_cols=36 Identities=25% Similarity=0.266 Sum_probs=25.6
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRE 53 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~ 53 (61)
.-|=|||||..-.++-|+++..+|- -+++.|..||.
T Consensus 282 FYiDVGNlPk~KAeqYlr~vm~~yk----NklvYDa~TGe 317 (514)
T PHA02531 282 FYIDVGNLPKRKAEEYLNNVMQRYK----NRVVYDANTGK 317 (514)
T ss_pred EEEEcCCCChhhHHHHHHHHHHHhh----hhEEEeCCCCe
Confidence 3455699999999999999988772 23444555554
No 163
>PRK00523 hypothetical protein; Provisional
Probab=44.00 E-value=34 Score=15.90 Aligned_cols=18 Identities=17% Similarity=0.303 Sum_probs=14.8
Q ss_pred CCCCCHHHHHHHhhhCCC
Q psy7050 22 TYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 22 ~~~~~~~~l~~~f~~~G~ 39 (61)
++-++|+.++.++.+.|.
T Consensus 37 NPpine~mir~M~~QMGq 54 (72)
T PRK00523 37 NPPITENMIRAMYMQMGR 54 (72)
T ss_pred CcCCCHHHHHHHHHHhCC
Confidence 456789999999998885
No 164
>PF15407 Spo7_2_N: Sporulation protein family 7
Probab=43.94 E-value=7.6 Score=17.67 Aligned_cols=18 Identities=6% Similarity=-0.191 Sum_probs=13.9
Q ss_pred CCCcEEEEecCCCCCCHH
Q psy7050 11 EGMVSLKVDNLTYRTTPD 28 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~ 28 (61)
.-.+.+|+|++|..+-++
T Consensus 25 ~tSr~vflG~IP~~W~~~ 42 (67)
T PF15407_consen 25 LTSRRVFLGPIPEIWLQD 42 (67)
T ss_pred HcCceEEECCCChHHHHc
Confidence 345889999999987554
No 165
>smart00650 rADc Ribosomal RNA adenine dimethylases.
Probab=41.40 E-value=53 Score=17.01 Aligned_cols=23 Identities=26% Similarity=0.288 Sum_probs=18.5
Q ss_pred cEEEEecCCCCCCHHHHHHHhhh
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFER 36 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~ 36 (61)
.-+.++|+|+..+.+.+..++..
T Consensus 78 ~d~vi~n~Py~~~~~~i~~~l~~ 100 (169)
T smart00650 78 PYKVVGNLPYNISTPILFKLLEE 100 (169)
T ss_pred CCEEEECCCcccHHHHHHHHHhc
Confidence 34668999999988888888764
No 166
>PF15063 TC1: Thyroid cancer protein 1
Probab=41.02 E-value=23 Score=16.68 Aligned_cols=26 Identities=23% Similarity=0.423 Sum_probs=20.8
Q ss_pred cEEEEecCCCCCCHHHHHHHhhhCCC
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~~~G~ 39 (61)
++--+.|+=.+++...|+.+|...|.
T Consensus 26 RKkasaNIFe~vn~~qlqrLF~~sGD 51 (79)
T PF15063_consen 26 RKKASANIFENVNLDQLQRLFQKSGD 51 (79)
T ss_pred hhhhhhhhhhccCHHHHHHHHHHccc
Confidence 34456677788899999999998885
No 167
>PF01823 MACPF: MAC/Perforin domain; InterPro: IPR020864 The membrane attack complex/perforin (MACPF) domain is conserved in bacteria, fungi, mammals and plants. It was originally identified and named as being common to five complement components (C6, C7, C8-alpha, C8-beta, and C9) and perforin. These molecules perform critical functions in innate and adaptive immunity. The MAC family proteins and perforin are known to participate in lytic pore formation. In response to pathogen infection, a sequential and highly specific interaction between the constituent elements occurs to form transmembrane channels which are known as the membrane-attack complex (MAC).Only a few other MACPF proteins have been characterised and several are thought to form pores for invasion or protection [, , ]. Examples are proteins from malarial parasites [], the cytolytic toxins from sea anemones [], and proteins that provide plant immunity [, ]. Functionally uncharacterised MACPF proteins are also evident in pathogenic bacteria such as Chlamydia spp [] and Photorhabdus luminescens (Xenorhabdus luminescens) []. The MACPF domain is commonly found to be associated with other N- and C-terminal domains, such as TSP1 (see PDOC50092 from PROSITEDOC), LDLRA (see PDOC00929 from PROSITEDOC), EGF-like (see PDOC00021 from PROSITEDOC),Sushi/CCP/SCR (see PDOC50923 from PROSITEDOC), FIMAC or C2 (see PDOC00380 from PROSITEDOC). They probably control or target MACPF function [, ]. The MACPF domain oligomerizes, undergoes conformational change, and is required for lytic activity. The MACPF domain consists of a central kinked four-stranded antiparallel beta sheet surrounded by alpha helices and beta strands, forming two structural segments. Overall, the MACPF domain has a thin L-shaped appearance. MACPF domains exhibit limited sequence similarity but contain a signature [YW]-G-[TS]-H-[FY]-x(6)-G-G motif [, , ]. Some proteins known to contain a MACPF domain are listed below: Vertebrate complement proteins C6 to C9. Complement factors C6 to C9 assemble to form a scaffold, the membrane attack complex (MAC), that permits C9 polymerisation into pores that lyse Gram-negative pathogens [, ]. Vertebrate perforin. It is delivered by natural killer cells and cytotoxic T lymphocytes and forms oligomeric pores (12 to 18 monomers) in the plasma membrane of either virus-infected or transformed cells. Arabidopsis thaliana (Mouse-ear cress) constitutively activated cell death 1 (CAD1) protein. It is likely to act as a mediator that recognises plant signals for pathogen infection []. Arabidopsis thaliana (Mouse-ear cress) necrotic spotted lesions 1 (NSL1) protein []. Venomous sea anemone Phyllodiscus semoni (Night anemone) toxins PsTX-60A and PsTX-60B []. Venomous sea anemone Actineria villosa (Okinawan sea anemone) toxin AvTX-60A []. Plasmodium sporozoite microneme protein essential for cell traversal 2 (SPECT2). It is essential for the membrane-wounding activity of the sporozoite and is involved in its traversal of the sinusoidal cell layer prior to hepatocyte-infection []. P. luminescens Plu-MACPF. Although nonlytic, it was shown to bind to cell membranes []. Chlamydial putative uncharacterised protein CT153 []. ; PDB: 2QP2_A 3OJY_B 3NSJ_A 4E0S_B 3T5O_A 4A5W_B 2QQH_A 2RD7_A.
Probab=40.05 E-value=26 Score=18.60 Aligned_cols=22 Identities=23% Similarity=0.505 Sum_probs=14.1
Q ss_pred EecCCCCCCHHH---HHHHhhhCCC
Q psy7050 18 VDNLTYRTTPDD---LRRVFERCGE 39 (61)
Q Consensus 18 v~~l~~~~~~~~---l~~~f~~~G~ 39 (61)
+.+||...+... +.++|..||.
T Consensus 53 l~~L~~~~~~~~~~~y~~f~~~yGT 77 (212)
T PF01823_consen 53 LNALPAEYNSDNTDEYYRFFDKYGT 77 (212)
T ss_dssp HHTSHSS--HHHHHHHHHHHHHH-S
T ss_pred HHhhCcccCccchHHHHHHHHHhCc
Confidence 345777777666 7789999996
No 168
>COG5638 Uncharacterized conserved protein [Function unknown]
Probab=39.85 E-value=98 Score=19.61 Aligned_cols=37 Identities=22% Similarity=0.371 Sum_probs=25.9
Q ss_pred CCCCCcEEEEecCCCC-CCHHHHHHHhhhC----CCeEEEEE
Q psy7050 9 RIEGMVSLKVDNLTYR-TTPDDLRRVFERC----GEVGDIYI 45 (61)
Q Consensus 9 ~~~~~~~l~v~~l~~~-~~~~~l~~~f~~~----G~i~~~~~ 45 (61)
+.+...+|-|-|+.|+ +...+|..+|+.| |.+..+.+
T Consensus 142 ~G~~tkrLAvVnmDWd~v~a~DLf~~fsSf~P~ggkl~kV~i 183 (622)
T COG5638 142 EGNPTKRLAVVNMDWDRVDAKDLFKIFSSFLPYGGKLSKVKI 183 (622)
T ss_pred CCCcccceeEeecccccchHHHHHHHHHhhCCCCCccceeEe
Confidence 3455678999999997 4567888777753 55665554
No 169
>KOG2135|consensus
Probab=39.42 E-value=18 Score=22.88 Aligned_cols=35 Identities=23% Similarity=0.388 Sum_probs=25.0
Q ss_pred cEEEEecCCCCC-CHHHHHHHhhhCCCeEEEEEccC
Q psy7050 14 VSLKVDNLTYRT-TPDDLRRVFERCGEVGDIYIPRD 48 (61)
Q Consensus 14 ~~l~v~~l~~~~-~~~~l~~~f~~~G~i~~~~~~~~ 48 (61)
+.+-+.-.++.. +.++|..+|.+||.|..|.+-..
T Consensus 373 s~l~lek~~~glnt~a~ln~hfA~fG~i~n~qv~~~ 408 (526)
T KOG2135|consen 373 SPLALEKSPFGLNTIADLNPHFAQFGEIENIQVDYS 408 (526)
T ss_pred chhhhhccCCCCchHhhhhhhhhhcCccccccccCc
Confidence 444455556665 46899999999999988865433
No 170
>PRK01844 hypothetical protein; Provisional
Probab=39.17 E-value=45 Score=15.51 Aligned_cols=18 Identities=11% Similarity=0.167 Sum_probs=14.6
Q ss_pred CCCCCHHHHHHHhhhCCC
Q psy7050 22 TYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 22 ~~~~~~~~l~~~f~~~G~ 39 (61)
++-++|+.|+.++.+-|.
T Consensus 36 NPpine~mir~Mm~QMGq 53 (72)
T PRK01844 36 NPPINEQMLKMMMMQMGQ 53 (72)
T ss_pred CCCCCHHHHHHHHHHhCC
Confidence 456789999999998874
No 171
>PF11004 Kdo_hydroxy: 3-deoxy-D-manno-oct-2-ulosonic acid (Kdo) hydroxylase; InterPro: IPR021266 This bacterial family of proteins has no known function.
Probab=38.00 E-value=55 Score=19.27 Aligned_cols=47 Identities=17% Similarity=0.158 Sum_probs=28.7
Q ss_pred CCCCcEEEEecCCCCCCHHHHHHHhhh-CCCeEEEEEccCCCCCCceeE
Q psy7050 10 IEGMVSLKVDNLTYRTTPDDLRRVFER-CGEVGDIYIPRDRFTRESRGF 57 (61)
Q Consensus 10 ~~~~~~l~v~~l~~~~~~~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~ 57 (61)
.+.+..||.-+|++.....+. .++.+ .-....-.+..++.++..+|.
T Consensus 18 LE~G~VL~fP~l~F~l~~~E~-~fL~p~~~d~k~KNIs~~p~~~~l~G~ 65 (281)
T PF11004_consen 18 LEQGKVLYFPQLRFPLSPEER-AFLDPAIVDPKRKNISYRPNTGSLRGA 65 (281)
T ss_pred hhCCCEEEeCCCCCCCChhHH-hhcChhhhCCCCCceeecCCCCceecc
Confidence 467899999999998887665 33332 211122234446666666664
No 172
>smart00457 MACPF membrane-attack complex / perforin.
Probab=37.90 E-value=24 Score=18.90 Aligned_cols=21 Identities=14% Similarity=0.300 Sum_probs=17.5
Q ss_pred ecCCCCCCHHHHHHHhhhCCC
Q psy7050 19 DNLTYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~~G~ 39 (61)
.+||..........++..||.
T Consensus 31 ~~Lp~~~~~~~~~~fi~~yGT 51 (194)
T smart00457 31 RDLPDQYNRGAYARFIDKYGT 51 (194)
T ss_pred HhCccccCHHHHHHHHHHhCC
Confidence 467888888888999999995
No 173
>PF09176 Mpt_N: Methylene-tetrahydromethanopterin dehydrogenase, N-terminal; InterPro: IPR015259 Prokaryotic methylene-tetrahydromethanopterin dehydrogenase catalyses the dehydrogenation of methylene-tetrahydromethanopterin during growth on one-carbon compounds such as methanol. It can also catalyse the reversible dehydrogenation of methylene-tetrahydrofolate, though at much lower efficiency []. The pterin domain of this protein is composed of two alpha-beta segments found at the N- and C-terminal ends of the polypeptide respectivly. This entry represents the N-terminal segment of the pterin domain, with a core comprising three alpha/beta/alpha layers in which each sheet contains four strands. ; PDB: 1LUA_A 1LU9_C.
Probab=37.38 E-value=9.2 Score=18.15 Aligned_cols=30 Identities=13% Similarity=0.123 Sum_probs=15.8
Q ss_pred CCCCCCCCCCCcEEEEecCCCCCCHHHHHHH
Q psy7050 3 YGRPPPRIEGMVSLKVDNLTYRTTPDDLRRV 33 (61)
Q Consensus 3 ~~~~~~~~~~~~~l~v~~l~~~~~~~~l~~~ 33 (61)
|+|+|. .-+.+-||||+-+-...++.|+..
T Consensus 35 FsR~P~-~~~~TaiFIGG~d~~~a~dml~~a 64 (81)
T PF09176_consen 35 FSRGPK-DLKRTAIFIGGRDVALAEDMLEAA 64 (81)
T ss_dssp SSS-GG-GGGGEEEEEE-S-HHHHHHHHHHH
T ss_pred EccCCc-CCceeEEEECCccHHHHHHHHHHH
Confidence 455553 334467999986655555555443
No 174
>PF00220 Hormone_4: Neurohypophysial hormones, N-terminal Domain; InterPro: IPR022423 Oxytocin (or ocytocin) and vasopressin [] are small (nine amino acid residues), structurally and functionally related neurohypophysial peptide hormones. Oxytocin causes contraction of the smooth muscle of the uterus and of the mammary gland while vasopressin has a direct antidiuretic action on the kidney and also causes vasoconstriction of the peripheral vessels. Like the majority of active peptides, both hormones are synthesized as larger protein precursors that are enzymatically converted to their mature forms. Peptides belonging to this family are also found in birds, fish, reptiles and amphibians (mesotocin, isotocin, valitocin, glumitocin, aspargtocin, vasotocin, seritocin, asvatocin, phasvatocin), in worms (annetocin), octopi (cephalotocin), locust (locupressin or neuropeptide F1/F2) and in molluscs (conopressins G and S) []. The pattern developed to detect this category of peptides spans their entire sequence and includes four invariant amino acid residues. .; GO: 0005185 neurohypophyseal hormone activity, 0005576 extracellular region
Probab=36.86 E-value=16 Score=10.04 Aligned_cols=6 Identities=17% Similarity=0.174 Sum_probs=2.8
Q ss_pred EEecCC
Q psy7050 17 KVDNLT 22 (61)
Q Consensus 17 ~v~~l~ 22 (61)
||.|.|
T Consensus 2 ~i~nCP 7 (9)
T PF00220_consen 2 YIRNCP 7 (9)
T ss_pred ccccCC
Confidence 445544
No 175
>PF14111 DUF4283: Domain of unknown function (DUF4283)
Probab=36.15 E-value=62 Score=16.26 Aligned_cols=31 Identities=13% Similarity=0.191 Sum_probs=24.5
Q ss_pred EEEecCCCCC-CHHHHHHHhhhCCCeEEEEEc
Q psy7050 16 LKVDNLTYRT-TPDDLRRVFERCGEVGDIYIP 46 (61)
Q Consensus 16 l~v~~l~~~~-~~~~l~~~f~~~G~i~~~~~~ 46 (61)
|.+.|||... +++-++.+-+..|.+..++..
T Consensus 107 Vri~glP~~~~~~~~~~~i~~~iG~~i~vD~~ 138 (153)
T PF14111_consen 107 VRIYGLPLHLWSEEILKAIGSKIGEPIEVDEN 138 (153)
T ss_pred hhhccCCHHHhhhHHHHHHHHhcCCeEEEEcC
Confidence 5667999884 567788888899999888654
No 176
>smart00596 PRE_C2HC PRE_C2HC domain.
Probab=33.74 E-value=56 Score=15.05 Aligned_cols=26 Identities=19% Similarity=0.134 Sum_probs=15.5
Q ss_pred HHHHHhhhCC-CeEEEEEccCCCCCCc
Q psy7050 29 DLRRVFERCG-EVGDIYIPRDRFTRES 54 (61)
Q Consensus 29 ~l~~~f~~~G-~i~~~~~~~~~~t~~~ 54 (61)
.+.+.|...| ++..++-+..+.++.+
T Consensus 3 ~I~~~L~~~G~~v~~i~~m~~~~~r~P 29 (69)
T smart00596 3 QIEEALKDIGFPVLFIHNMLNRDTKNP 29 (69)
T ss_pred HHHHHHHHcCCceeEEEcccccCCCCc
Confidence 4566666666 4666666666554433
No 177
>KOG4365|consensus
Probab=33.27 E-value=28 Score=22.00 Aligned_cols=45 Identities=7% Similarity=-0.187 Sum_probs=30.7
Q ss_pred EEEecCCCCCCHHHHHHHhhhCCCeEEEEEccCCCCCCceeEEEE
Q psy7050 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
.++..++-..++.++.-+|..+|-|..+.+-..-..+...-.+|+
T Consensus 6 ~~l~d~~~~~~~~~~~~~~~d~~~i~~~d~~~~~~~~~~~v~~f~ 50 (572)
T KOG4365|consen 6 KSLKDSVASNNKDQNSMKHEDPSIISMEDGSPYVNGSLGEVTPFQ 50 (572)
T ss_pred hhHhhcccccccchhhhhccCCcceeeccCCccccCCcceeeeee
Confidence 445567777788889889999998887765544344444555554
No 178
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=32.38 E-value=41 Score=13.11 Aligned_cols=16 Identities=19% Similarity=0.248 Sum_probs=12.9
Q ss_pred CCCHHHHHHHhhhCCC
Q psy7050 24 RTTPDDLRRVFERCGE 39 (61)
Q Consensus 24 ~~~~~~l~~~f~~~G~ 39 (61)
.++.++|++-+..+|-
T Consensus 3 tWs~~~L~~wL~~~gi 18 (38)
T PF10281_consen 3 TWSDSDLKSWLKSHGI 18 (38)
T ss_pred CCCHHHHHHHHHHcCC
Confidence 4678899999988874
No 179
>PF05189 RTC_insert: RNA 3'-terminal phosphate cyclase (RTC), insert domain; InterPro: IPR013796 RNA cyclases are a family of RNA-modifying enzymes that are conserved in eukaryotes, bacteria and archaea. RNA 3'-terminal phosphate cyclase (6.5.1.4 from EC) [, ] catalyses the conversion of 3'-phosphate to a 2',3'-cyclic phosphodiester at the end of RNA. ATP + RNA 3'-terminal-phosphate = AMP + diphosphate + RNA terminal-2',3'-cyclic-phosphate These enzymes might be responsible for production of the cyclic phosphate RNA ends that are known to be required by many RNA ligases in both prokaryotes and eukaryotes. RNA cyclase is a protein of from 36 to 42 kDa. The best conserved region is a glycine-rich stretch of residues located in the central part of the sequence and which is reminiscent of various ATP, GTP or AMP glycine-rich loops. The crystal structure of RNA 3'-terminal phosphate cyclase shows that each molecule consists of two domains. The larger domain contains three repeats of a folding unit comprising two parallel alpha helices and a four-stranded beta sheet; this fold was previously identified in translation initiation factor 3 (IF3). The large domain is similar to one of the two domains of 5-enolpyruvylshikimate-3-phosphate synthase and UDP-N-acetylglucosamine enolpyruvyl transferase. The smaller domain uses a similar secondary structure element with different topology, observed in many other proteins such as thioredoxin []. Although the active site of this enzyme could not be unambiguously assigned, it can be mapped to a region surrounding His309, an adenylate acceptor, in which a number of amino acids are highly conserved in the enzyme from different sources []. This entry contains the insert-domain of approximately 100 amino acids.; PDB: 3TV1_B 3KGD_D 1QMI_D 1QMH_B 3TUX_A 3TUT_A 3TW3_A 3PQV_A.
Probab=32.22 E-value=66 Score=15.41 Aligned_cols=45 Identities=13% Similarity=0.012 Sum_probs=24.3
Q ss_pred EEEEecCCCCCCHHHH---HHHhhhCCCeEEEEE--ccCCCCCCceeEEE
Q psy7050 15 SLKVDNLTYRTTPDDL---RRVFERCGEVGDIYI--PRDRFTRESRGFAF 59 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l---~~~f~~~G~i~~~~~--~~~~~t~~~~g~~f 59 (61)
..|+.+||..+.+..+ +..+..+..-..+.. ......+...|++.
T Consensus 12 ~a~~a~lp~~va~R~~~~a~~~L~~~~~~v~i~~d~~~~~~~~~~~G~gi 61 (103)
T PF05189_consen 12 IAFVAGLPSSVAERMANAARKRLNWYGPDVEIETDYRESDDSAFGPGSGI 61 (103)
T ss_dssp EEEEESS-CHHHHHHHHHHHHHHCTTCSEEEEEEEEE-CCCCGCSSEEEE
T ss_pred EEEEcCCCHHHHHHHHHHHHHHhhhhCCCeEEEEecccCccCCCCCceEE
Confidence 4578899998877544 455555554334332 12234455666654
No 180
>KOG2591|consensus
Probab=30.87 E-value=1.5e+02 Score=19.61 Aligned_cols=35 Identities=11% Similarity=0.175 Sum_probs=27.5
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhh--CCCeEEEEEc
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFER--CGEVGDIYIP 46 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~--~G~i~~~~~~ 46 (61)
+.|.+.++-||..+..+.++.+|+. |-.+.+|.+.
T Consensus 174 kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa 210 (684)
T KOG2591|consen 174 KRCIVILREIPETTPIEVVKALFKGENCPKVISCEFA 210 (684)
T ss_pred ceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeee
Confidence 3477888999999999999999975 5566666543
No 181
>KOG3424|consensus
Probab=30.62 E-value=86 Score=16.19 Aligned_cols=36 Identities=17% Similarity=0.235 Sum_probs=20.1
Q ss_pred CCHHHHHHHhhh-CCCeEEE----EEccCCCCCCceeEEEE
Q psy7050 25 TTPDDLRRVFER-CGEVGDI----YIPRDRFTRESRGFAFV 60 (61)
Q Consensus 25 ~~~~~l~~~f~~-~G~i~~~----~~~~~~~t~~~~g~~fv 60 (61)
++.++|++-++. |-.-.+. .+-..-.+|++.|||.|
T Consensus 35 vsK~EirEKla~mYkt~~d~V~vfgfrt~~GggkstgfalI 75 (132)
T KOG3424|consen 35 VSKTEIREKLAKMYKTTPDAVFVFGFRTHFGGGKSTGFALI 75 (132)
T ss_pred CCHHHHHHHHHHHhcCCcceEEEEEeeeccCCcccceeeee
Confidence 466777776654 4322222 23333356778888875
No 182
>cd00874 RNA_Cyclase_Class_II RNA 3' phosphate cyclase domain (class II). These proteins function as RNA cyclase to catalyze the ATP-dependent conversion of 3'-phosphate to a 2'.3'-cyclic phosphodiester at the end of RNA molecule. A conserved catalytic histidine residue is found in all members of this subfamily.
Probab=30.23 E-value=88 Score=18.66 Aligned_cols=43 Identities=14% Similarity=0.121 Sum_probs=24.2
Q ss_pred EEEEecCCCCCCHHH---HHHHhhh-CCCeEEEEEccCCCCCCceeEEE
Q psy7050 15 SLKVDNLTYRTTPDD---LRRVFER-CGEVGDIYIPRDRFTRESRGFAF 59 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~---l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~f 59 (61)
..++.++|..+.+.. .++.+.+ ++ .++.+..+...+.+.|++.
T Consensus 188 ~~~~~~l~~~va~r~~~~a~~~L~~~~~--~dv~i~~~~~~~~s~G~~i 234 (326)
T cd00874 188 ISHAANLPPHVAERQAEAAAALLRKALG--LQIEIEPEDQSALGPGSGI 234 (326)
T ss_pred EEEEccCCHHHHHHHHHHHHHHHhhccC--CCeEEEEEecCCCCCCEEE
Confidence 455678888776644 3455555 43 2334443434466777765
No 183
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=29.76 E-value=51 Score=13.34 Aligned_cols=18 Identities=0% Similarity=0.338 Sum_probs=13.5
Q ss_pred ecCCCCCCHHHHHHHhhh
Q psy7050 19 DNLTYRTTPDDLRRVFER 36 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~ 36 (61)
-+++..++.++|+..|..
T Consensus 6 Lgl~~~~~~~~ik~~y~~ 23 (55)
T cd06257 6 LGVPPDASDEEIKKAYRK 23 (55)
T ss_pred cCCCCCCCHHHHHHHHHH
Confidence 367788888888877754
No 184
>PF12385 Peptidase_C70: Papain-like cysteine protease AvrRpt2; InterPro: IPR022118 This is a family of cysteine proteases, found in actinobacteria, protobacteria and firmicutes. Papain-like cysteine proteases play a crucial role in plant-pathogen/pest interactions. On entering the host they act on non-self substrates, thereby manipulating the host to evade proteolysis []. AvrRpt2 from Pseudomonas syringae pv tomato DC3000 triggers resistance to P. syringae-2-dependent defence responses, including hypersensitive cell death, by cleaving the Arabidopsis RIN4 protein which is monitored by the cognate resistance protein RPS2 [].
Probab=29.46 E-value=68 Score=17.38 Aligned_cols=20 Identities=15% Similarity=0.193 Sum_probs=16.5
Q ss_pred CCCHHHHHHHhhhCCCeEEE
Q psy7050 24 RTTPDDLRRVFERCGEVGDI 43 (61)
Q Consensus 24 ~~~~~~l~~~f~~~G~i~~~ 43 (61)
..+.+.+..++..||+|.-.
T Consensus 95 ~~t~e~~~~LL~~yGPLwv~ 114 (166)
T PF12385_consen 95 SYTAEGLANLLREYGPLWVA 114 (166)
T ss_pred ccCHHHHHHHHHHcCCeEEE
Confidence 56789999999999987543
No 185
>PF06014 DUF910: Bacterial protein of unknown function (DUF910); InterPro: IPR009256 This family consists of several short bacterial proteins of unknown function.; PDB: 2NN4_A.
Probab=28.60 E-value=65 Score=14.51 Aligned_cols=16 Identities=13% Similarity=0.231 Sum_probs=10.1
Q ss_pred HHHHHHHhhhCCCeEE
Q psy7050 27 PDDLRRVFERCGEVGD 42 (61)
Q Consensus 27 ~~~l~~~f~~~G~i~~ 42 (61)
-.|+++++.+||.+..
T Consensus 4 lyDVqQLLK~fG~~IY 19 (62)
T PF06014_consen 4 LYDVQQLLKKFGIIIY 19 (62)
T ss_dssp HHHHHHHHHTTS----
T ss_pred HHHHHHHHHHCCEEEE
Confidence 4678899999997644
No 186
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.39 E-value=74 Score=14.75 Aligned_cols=18 Identities=11% Similarity=0.252 Sum_probs=14.5
Q ss_pred CCCCCHHHHHHHhhhCCC
Q psy7050 22 TYRTTPDDLRRVFERCGE 39 (61)
Q Consensus 22 ~~~~~~~~l~~~f~~~G~ 39 (61)
++.++++.++.++.+-|.
T Consensus 36 NPpine~~iR~M~~qmGq 53 (71)
T COG3763 36 NPPINEEMIRMMMAQMGQ 53 (71)
T ss_pred CCCCCHHHHHHHHHHhCC
Confidence 456789999999988874
No 187
>COG4647 AcxC Acetone carboxylase, gamma subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=28.33 E-value=69 Score=16.78 Aligned_cols=31 Identities=13% Similarity=0.208 Sum_probs=17.3
Q ss_pred CcEEEEec------------CCCCCCHHHHHHHhhh-CCCeEEE
Q psy7050 13 MVSLKVDN------------LTYRTTPDDLRRVFER-CGEVGDI 43 (61)
Q Consensus 13 ~~~l~v~~------------l~~~~~~~~l~~~f~~-~G~i~~~ 43 (61)
..+|+|++ .-++..+..+++++-+ +|.+..+
T Consensus 89 ~a~i~vrdtee~lreiyp~s~ipdp~wme~reficpecg~l~ev 132 (165)
T COG4647 89 HANIYVRDTEEKLREIYPKSDIPDPQWMEIREFICPECGILHEV 132 (165)
T ss_pred eeEEEEcchHHHHHHhCcccCCCCchHHHHHHhhCccccceeee
Confidence 45777775 2333445566666654 6655444
No 188
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=28.26 E-value=61 Score=13.73 Aligned_cols=18 Identities=11% Similarity=0.416 Sum_probs=13.7
Q ss_pred ecCCCCCCHHHHHHHhhh
Q psy7050 19 DNLTYRTTPDDLRRVFER 36 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~ 36 (61)
-+|+.+++.++|+..|..
T Consensus 6 Lgl~~~~~~~eik~~y~~ 23 (64)
T PF00226_consen 6 LGLPPDASDEEIKKAYRR 23 (64)
T ss_dssp CTSTTTSSHHHHHHHHHH
T ss_pred CCCCCCCCHHHHHHHHHh
Confidence 367888888888877754
No 189
>smart00703 NRF N-terminal domain in C. elegans NRF-6 (Nose Resistant to Fluoxetine-4) and NDG-4 (resistant to nordihydroguaiaretic acid-4). Also present in several other worm and fly proteins.
Probab=28.23 E-value=56 Score=15.92 Aligned_cols=23 Identities=17% Similarity=0.333 Sum_probs=17.7
Q ss_pred EEEEe-cCCCCCCHHHHHHHhhhC
Q psy7050 15 SLKVD-NLTYRTTPDDLRRVFERC 37 (61)
Q Consensus 15 ~l~v~-~l~~~~~~~~l~~~f~~~ 37 (61)
.+.+| .+|..++.+++..++..+
T Consensus 71 ~~~~gvC~P~sCs~~dv~~l~~~~ 94 (110)
T smart00703 71 VLRTGVCLPSSCSAADLKTLLNQL 94 (110)
T ss_pred ceEEEEEeCCCCCHHHHHHHHHHH
Confidence 34444 579999999999988765
No 190
>KOG0008|consensus
Probab=27.27 E-value=1e+02 Score=22.56 Aligned_cols=39 Identities=21% Similarity=0.221 Sum_probs=23.2
Q ss_pred cCCCCCCHHHHH-HHhhhCCCeEEEEEccCCCCCCceeEEEEC
Q psy7050 20 NLTYRTTPDDLR-RVFERCGEVGDIYIPRDRFTRESRGFAFVR 61 (61)
Q Consensus 20 ~l~~~~~~~~l~-~~f~~~G~i~~~~~~~~~~t~~~~g~~fv~ 61 (61)
-.||.++...|. ..=.. ....+.=+-|+ ||...||.||.
T Consensus 820 ~aPWNtTrnFInqa~~gk--~lLeltGvgDP-TGcGeGFSyvR 859 (1563)
T KOG0008|consen 820 LAPWNTTRNFINQATQGK--CLLELTGVGDP-TGCGEGFSYVR 859 (1563)
T ss_pred cCcchhhHHHHHHHhccc--eeEEeecCCCC-CcccCcceeEe
Confidence 357777877774 22111 13333335564 78899999984
No 191
>KOG0820|consensus
Probab=26.98 E-value=78 Score=18.91 Aligned_cols=22 Identities=18% Similarity=0.205 Sum_probs=17.9
Q ss_pred cEEEEecCCCCCCHHHHHHHhh
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFE 35 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~ 35 (61)
.-+.|.|+|+.++..-+..++.
T Consensus 124 fd~cVsNlPyqISSp~vfKLL~ 145 (315)
T KOG0820|consen 124 FDGCVSNLPYQISSPLVFKLLL 145 (315)
T ss_pred cceeeccCCccccCHHHHHhcC
Confidence 4578999999999988877653
No 192
>PF04008 Adenosine_kin: Adenosine specific kinase; InterPro: IPR007153 The structure of a member of this family from the hyperthermophilic archaeon Pyrobaculum aerophilum contains a modified histidine residue which is interpreted as stable phosphorylation. In vitro binding studies confirmed that adenosine and AMP but not ADP or ATP bind to the protein [].; PDB: 1VGG_A 1RLH_A 1WVQ_A 2GL0_F 2JB7_B 2EKM_C 2D16_D.
Probab=26.63 E-value=1e+02 Score=16.47 Aligned_cols=27 Identities=11% Similarity=0.121 Sum_probs=21.4
Q ss_pred CCCcEEEEecCCCCCCHHHHHHHhhhC
Q psy7050 11 EGMVSLKVDNLTYRTTPDDLRRVFERC 37 (61)
Q Consensus 11 ~~~~~l~v~~l~~~~~~~~l~~~f~~~ 37 (61)
+++.++.+|.-++--|-+||.+.+...
T Consensus 6 Pe~~N~IlGqsHFIKTvEDl~Ealv~s 32 (155)
T PF04008_consen 6 PEGCNVILGQSHFIKTVEDLYEALVTS 32 (155)
T ss_dssp -TTEEEEEEEESSTTHHHHHHHHHHCC
T ss_pred CCCceEEEeccchhhhHHHHHHHHhhc
Confidence 456899999999977889998887654
No 193
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=25.68 E-value=26 Score=16.76 Aligned_cols=35 Identities=20% Similarity=0.147 Sum_probs=22.0
Q ss_pred CcEEEEecCCCCCCHHHHHHHhhhCCCeEEEEEcc
Q psy7050 13 MVSLKVDNLTYRTTPDDLRRVFERCGEVGDIYIPR 47 (61)
Q Consensus 13 ~~~l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~~~ 47 (61)
...|.+.+||..+..+.|+--...-..|..+.+..
T Consensus 23 ~~~i~~~~Lp~~~d~~Sl~V~~~g~~~i~~v~~~~ 57 (104)
T PF13600_consen 23 ENEIIFEGLPPSLDPDSLRVSGEGGVTILSVRFRR 57 (104)
T ss_pred ceEEEEeCCCcccCCCcEEEEecCCEEEEEEEEEE
Confidence 36899999999998887753211112455555443
No 194
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=24.83 E-value=70 Score=13.28 Aligned_cols=15 Identities=20% Similarity=0.359 Sum_probs=12.0
Q ss_pred CCHHHHHHHhhhCCC
Q psy7050 25 TTPDDLRRVFERCGE 39 (61)
Q Consensus 25 ~~~~~l~~~f~~~G~ 39 (61)
.++++|++.+..+|-
T Consensus 6 LSd~eL~~~L~~~G~ 20 (44)
T smart00540 6 LSDAELRAELKQYGL 20 (44)
T ss_pred cCHHHHHHHHHHcCC
Confidence 577889999888873
No 195
>KOG2068|consensus
Probab=24.75 E-value=48 Score=19.94 Aligned_cols=36 Identities=17% Similarity=0.244 Sum_probs=25.6
Q ss_pred cEEEEecCCCCCCHHHHH---HHhhhCCCeEEEEEccCC
Q psy7050 14 VSLKVDNLTYRTTPDDLR---RVFERCGEVGDIYIPRDR 49 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~---~~f~~~G~i~~~~~~~~~ 49 (61)
.-+|+-+|+.....+.+. +.|.+||.+.++..-.++
T Consensus 78 nlvyvvgl~~~~ade~~l~~~eyfgqygki~ki~~~~~~ 116 (327)
T KOG2068|consen 78 NLVYVVGLPLDLADESVLERTEYFGQYGKINKIVKNKDP 116 (327)
T ss_pred hhhhhhCCCccccchhhhhCcccccccccceEEeecCCc
Confidence 457888888887655443 567788899888776654
No 196
>smart00195 DSPc Dual specificity phosphatase, catalytic domain.
Probab=24.49 E-value=61 Score=15.99 Aligned_cols=18 Identities=11% Similarity=0.104 Sum_probs=12.7
Q ss_pred EEEEecCCCCCCHHHHHH
Q psy7050 15 SLKVDNLTYRTTPDDLRR 32 (61)
Q Consensus 15 ~l~v~~l~~~~~~~~l~~ 32 (61)
.||+|++|.....+.|++
T Consensus 7 ~l~~G~~~~~~~~~~l~~ 24 (138)
T smart00195 7 HLYLGSYSSALNLALLKK 24 (138)
T ss_pred CeEECChhHcCCHHHHHH
Confidence 599999987765554443
No 197
>PF11249 DUF3047: Protein of unknown function (DUF3047); InterPro: IPR021409 This bacterial family of proteins has no known function.
Probab=23.59 E-value=1.4e+02 Score=16.25 Aligned_cols=33 Identities=21% Similarity=0.222 Sum_probs=24.4
Q ss_pred HHHHHHhhh-CCCeEEEEEccCCCCCCceeEEEE
Q psy7050 28 DDLRRVFER-CGEVGDIYIPRDRFTRESRGFAFV 60 (61)
Q Consensus 28 ~~l~~~f~~-~G~i~~~~~~~~~~t~~~~g~~fv 60 (61)
+|.+++|.. .+.|..+-++.|..+...++-|++
T Consensus 145 ~Dy~~~FG~~p~~i~~vai~tDsDnT~~~a~A~y 178 (183)
T PF11249_consen 145 ADYRRAFGEEPPRIVGVAIMTDSDNTGGSARAYY 178 (183)
T ss_pred HHHHHHhCCCCCceeEEEEEEEcCCCCCEEEEEE
Confidence 677777754 678888999988777666666653
No 198
>PRK04239 hypothetical protein; Provisional
Probab=23.31 E-value=1.1e+02 Score=15.41 Aligned_cols=19 Identities=11% Similarity=0.289 Sum_probs=16.1
Q ss_pred ecCCCCCCHHHHHHHhhhC
Q psy7050 19 DNLTYRTTPDDLRRVFERC 37 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~~ 37 (61)
|.|...++++.|..++...
T Consensus 80 G~i~~ki~e~~L~~lL~~v 98 (110)
T PRK04239 80 GRIQGPIDDEQLKEILEQL 98 (110)
T ss_pred CCCCCCcCHHHHHHHHHHH
Confidence 5678889999999999874
No 199
>PF15051 FAM198: FAM198 protein
Probab=23.29 E-value=88 Score=18.83 Aligned_cols=30 Identities=20% Similarity=0.216 Sum_probs=21.0
Q ss_pred cEEEEecCCCCCCHHHHHHHhh-hCCCeEEE
Q psy7050 14 VSLKVDNLTYRTTPDDLRRVFE-RCGEVGDI 43 (61)
Q Consensus 14 ~~l~v~~l~~~~~~~~l~~~f~-~~G~i~~~ 43 (61)
-.||-...|++.+++|++.+-- .-|+|..+
T Consensus 3 iRiyse~aPpWfS~~Di~~MrlLAd~~V~~~ 33 (326)
T PF15051_consen 3 IRIYSESAPPWFSEDDIRKMRLLADGEVVSK 33 (326)
T ss_pred ccccCCCCCCcCCHHHHHHHHHhcCCceeee
Confidence 4577788999999999987632 34555443
No 200
>PF01984 dsDNA_bind: Double-stranded DNA-binding domain; InterPro: IPR002836 This protein family is found in archaea and eukaryota. The human TFAR19 (TF-1 cell apoptosis-related protein 19) encodes a protein which shares significant homology to the corresponding proteins of species ranging from yeast to mice. TFAR19 exhibits a ubiquitous expression pattern and its expression is up-regulated in the tumour cells undergoing apoptosis. TFAR19 may play a general role in the apoptotic process []. Also included in this family is a DNA-binding protein from the archaea, Methanobacterium thermoautotrophicum.; GO: 0003677 DNA binding; PDB: 1EIJ_A 2K6B_A 2CRU_A 1YYB_A 2JXN_A 2FH0_A.
Probab=23.22 E-value=76 Score=15.75 Aligned_cols=20 Identities=15% Similarity=0.371 Sum_probs=14.8
Q ss_pred ecCCCCCCHHHHHHHhhhCC
Q psy7050 19 DNLTYRTTPDDLRRVFERCG 38 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~~G 38 (61)
|.|...++++.|.+++....
T Consensus 75 G~l~~kI~d~~L~~iL~~i~ 94 (107)
T PF01984_consen 75 GQLRGKIDDEQLKEILEQIS 94 (107)
T ss_dssp TSSSS-B-HHHHHHHHHHHC
T ss_pred CCCCCCcCHHHHHHHHHHHh
Confidence 56788899999999998754
No 201
>PF05023 Phytochelatin: Phytochelatin synthase; InterPro: IPR007719 This entry represents plant phytochelatin synthases (also known as glutathione gamma-glutamylcysteinyltransferase; 2.3.2.15 from EC), which is involved in the synthesis of phytochelatins (PC) and homophytochelatins (hPC), the heavy-metal-binding peptides of plants. This enzyme is required for detoxification of heavy metals such as cadmium and arsenate. The N-terminal region of phytochelatin synthase contains the active site, as well as four highly conserved cysteine residues that appear to play an important role in heavy-metal-induced phytochelatin catalysis. The C-terminal region is rich in cysteines, and may act as a metal sensor, whereby the Cys residues bind cadmium ions to bring them into closer proximity and transferring them to the activation site in the N-terminal catalytic domain []. The C-terminal region displays homology to the functional domains of metallothionein and metallochaperone.; GO: 0016756 glutathione gamma-glutamylcysteinyltransferase activity, 0046872 metal ion binding, 0010038 response to metal ion, 0046938 phytochelatin biosynthetic process; PDB: 2BTW_A 2BU3_B.
Probab=23.15 E-value=43 Score=18.72 Aligned_cols=14 Identities=29% Similarity=0.555 Sum_probs=9.3
Q ss_pred CCCCCCceeEEEEC
Q psy7050 48 DRFTRESRGFAFVR 61 (61)
Q Consensus 48 ~~~t~~~~g~~fv~ 61 (61)
|+.++++|||-.|+
T Consensus 199 D~~s~~~RG~~~is 212 (212)
T PF05023_consen 199 DPDSGKSRGYLLIS 212 (212)
T ss_dssp ETTTTEE-EEEEEE
T ss_pred CCCCCCcceEEEeC
Confidence 56778888887663
No 202
>TIGR00006 S-adenosyl-methyltransferase MraW. Genetics paper in 1972 links mra cluster to peptidoglycan biosynthesis in E. coli. Seems to be common in proteobacteria.wn.
Probab=22.28 E-value=76 Score=18.81 Aligned_cols=19 Identities=26% Similarity=0.468 Sum_probs=15.7
Q ss_pred CCCHHHHHHHhhhCCCeEE
Q psy7050 24 RTTPDDLRRVFERCGEVGD 42 (61)
Q Consensus 24 ~~~~~~l~~~f~~~G~i~~ 42 (61)
.+++++|.++|..||+...
T Consensus 140 ~~~e~~L~~i~~~yGEe~~ 158 (305)
T TIGR00006 140 TYSEEDLERILKKYGEEKF 158 (305)
T ss_pred hCCHHHHHHHHHHhcCcch
Confidence 4678999999999997654
No 203
>PF00054 Laminin_G_1: Laminin G domain; InterPro: IPR012679 Laminins are large heterotrimeric glycoproteins involved in basement membrane function []. The laminin globular (G) domain can be found in one to several copies in various laminin family members, which includes a large number of extracellular proteins. The C terminus of laminin alpha chain contains a tandem repeat of five laminin G domains, which are critical for heparin-binding and cell attachment activity []. Laminin alpha4 is distributed in a variety of tissues including peripheral nerves, dorsal root ganglion, skeletal muscle and capillaries; in the neuromuscular junction, it is required for synaptic specialisation []. The structure of the laminin-G domain has been predicted to resemble that of pentraxin []. Laminin G domains can vary in their function, and a variety of binding functions has been ascribed to different LamG modules. For example, the laminin alpha1 and alpha2 chains each has five C-teminal laminin G domains, where only domains LG4 and LG5 contain binding sites for heparin, sulphatides and the cell surface receptor dystroglycan []. Laminin G-containing proteins appear to have a wide variety of roles in cell adhesion, signalling, migration, assembly and differentiation. This entry represents one subtype of laminin G domains, which is sometimes found in association with thrombospondin-type laminin G domains (IPR012680 from INTERPRO).; PDB: 1OKQ_A 1DYK_A 2C5D_A 1H30_A 1LHW_A 1KDK_A 1LHU_A 1KDM_A 1LHO_A 1D2S_A ....
Probab=22.19 E-value=72 Score=15.82 Aligned_cols=11 Identities=27% Similarity=0.024 Sum_probs=8.9
Q ss_pred CCcEEEEecCC
Q psy7050 12 GMVSLKVDNLT 22 (61)
Q Consensus 12 ~~~~l~v~~l~ 22 (61)
....||||++|
T Consensus 91 ~~~~lyvGG~p 101 (131)
T PF00054_consen 91 VDGPLYVGGLP 101 (131)
T ss_dssp ECSEEEESSSS
T ss_pred cccCEEEccCC
Confidence 34569999999
No 204
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=22.18 E-value=80 Score=13.02 Aligned_cols=18 Identities=6% Similarity=0.370 Sum_probs=13.3
Q ss_pred ecCCCCCCHHHHHHHhhh
Q psy7050 19 DNLTYRTTPDDLRRVFER 36 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~ 36 (61)
-+++..++.++|+..|..
T Consensus 7 Lgl~~~~~~~~ik~ay~~ 24 (60)
T smart00271 7 LGVPRDASLDEIKKAYRK 24 (60)
T ss_pred cCCCCCCCHHHHHHHHHH
Confidence 367777888888877754
No 205
>PF15023 DUF4523: Protein of unknown function (DUF4523)
Probab=22.06 E-value=1.5e+02 Score=16.00 Aligned_cols=33 Identities=12% Similarity=0.172 Sum_probs=21.6
Q ss_pred CcEEEEecCCCCCC----HHHHHHHhhhCCCeEEEEE
Q psy7050 13 MVSLKVDNLTYRTT----PDDLRRVFERCGEVGDIYI 45 (61)
Q Consensus 13 ~~~l~v~~l~~~~~----~~~l~~~f~~~G~i~~~~~ 45 (61)
-.+|-|+=|...+. -+.+...++.||+|.++.+
T Consensus 86 MsTIVVRWlkknm~~~edl~sV~~~Ls~fGpI~SVT~ 122 (166)
T PF15023_consen 86 MSTIVVRWLKKNMQPTEDLKSVIQRLSVFGPIQSVTL 122 (166)
T ss_pred ceeEEeehhhhcCChHHHHHHHHHHHHhcCCcceeee
Confidence 36788876655553 2345556678999988753
No 206
>PRK00050 16S rRNA m(4)C1402 methyltranserfase; Provisional
Probab=22.05 E-value=78 Score=18.62 Aligned_cols=19 Identities=26% Similarity=0.464 Sum_probs=15.5
Q ss_pred CCCHHHHHHHhhhCCCeEE
Q psy7050 24 RTTPDDLRRVFERCGEVGD 42 (61)
Q Consensus 24 ~~~~~~l~~~f~~~G~i~~ 42 (61)
.+++++|..+|..||....
T Consensus 138 ~~~~~~L~~i~~~yGee~~ 156 (296)
T PRK00050 138 TYSEEELARIFKEYGEERF 156 (296)
T ss_pred hCCHHHHHHHHHHhcCcch
Confidence 4578899999999998654
No 207
>PF00533 BRCT: BRCA1 C Terminus (BRCT) domain; InterPro: IPR001357 The BRCT domain (after the C_terminal domain of a breast cancer susceptibility protein) is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage [], for example as found in the breast cancer DNA-repair protein BRCA1. The domain is an approximately 100 amino acid tandem repeat, which appears to act as a phospho-protein binding domain []. A chitin biosynthesis protein from yeast also seems to belong to this group. ; GO: 0005622 intracellular; PDB: 3L46_A 3AL3_A 3AL2_A 1WF6_A 3II6_X 2NTE_B 3FA2_A 2R1Z_A 2COK_A 2K7F_A ....
Probab=21.84 E-value=89 Score=13.38 Aligned_cols=29 Identities=7% Similarity=0.156 Sum_probs=19.6
Q ss_pred CCcEEEEecCCCCCCHHHHHHHhhhCCCeE
Q psy7050 12 GMVSLKVDNLTYRTTPDDLRRVFERCGEVG 41 (61)
Q Consensus 12 ~~~~l~v~~l~~~~~~~~l~~~f~~~G~i~ 41 (61)
+++++++.+ -.....+.|..+...+|...
T Consensus 7 ~g~~f~i~~-~~~~~~~~l~~~i~~~GG~v 35 (78)
T PF00533_consen 7 EGCTFCISG-FDSDEREELEQLIKKHGGTV 35 (78)
T ss_dssp TTEEEEESS-TSSSHHHHHHHHHHHTTEEE
T ss_pred CCEEEEEcc-CCCCCHHHHHHHHHHcCCEE
Confidence 467888833 33445677888888888543
No 208
>KOG0984|consensus
Probab=21.42 E-value=1.9e+02 Score=16.92 Aligned_cols=30 Identities=20% Similarity=0.449 Sum_probs=22.8
Q ss_pred CCCCCCCCcEEEEecCCCCCCHHHHHHHhh
Q psy7050 6 PPPRIEGMVSLKVDNLTYRTTPDDLRRVFE 35 (61)
Q Consensus 6 ~~~~~~~~~~l~v~~l~~~~~~~~l~~~f~ 35 (61)
+|...+..+.++|++-.+++.+++|...-+
T Consensus 24 pp~~Ld~~a~i~I~~~~~eV~ad~L~~i~e 53 (282)
T KOG0984|consen 24 PPRNLDSRALIYIGDRNFEVPADDLVGIEE 53 (282)
T ss_pred CCcccCCCceEEEecCccccchhhhhhhhh
Confidence 334456678899999889999999876653
No 209
>KOG0718|consensus
Probab=20.72 E-value=1e+02 Score=19.80 Aligned_cols=19 Identities=21% Similarity=0.462 Sum_probs=16.0
Q ss_pred ecCCCCCCHHHHHHHhhhC
Q psy7050 19 DNLTYRTTPDDLRRVFERC 37 (61)
Q Consensus 19 ~~l~~~~~~~~l~~~f~~~ 37 (61)
-|||.++++++|++.+.+.
T Consensus 15 LNlpkdAt~eeI~~AYrr~ 33 (546)
T KOG0718|consen 15 LNLPKDATDEEIKKAYRRL 33 (546)
T ss_pred hCCCcccCHHHHHHHHHHH
Confidence 5999999999999887653
No 210
>COG2118 DNA-binding protein [General function prediction only]
Probab=20.69 E-value=1.4e+02 Score=15.29 Aligned_cols=20 Identities=20% Similarity=0.499 Sum_probs=16.6
Q ss_pred EecCCCCCCHHHHHHHhhhC
Q psy7050 18 VDNLTYRTTPDDLRRVFERC 37 (61)
Q Consensus 18 v~~l~~~~~~~~l~~~f~~~ 37 (61)
.|.+...+++++|.+++...
T Consensus 82 tGri~~~I~e~~lk~IL~~i 101 (116)
T COG2118 82 TGRITHKIDEEELKEILERI 101 (116)
T ss_pred cCCCCCCCCHHHHHHHHHHH
Confidence 36788899999999998764
No 211
>KOG4285|consensus
Probab=20.17 E-value=98 Score=18.69 Aligned_cols=29 Identities=24% Similarity=0.322 Sum_probs=18.9
Q ss_pred EEEecCCCCCCHHHHHHHhhhCCCeEEEEE
Q psy7050 16 LKVDNLTYRTTPDDLRRVFERCGEVGDIYI 45 (61)
Q Consensus 16 l~v~~l~~~~~~~~l~~~f~~~G~i~~~~~ 45 (61)
|-|-++++... .-+..+|++||.|.+...
T Consensus 200 VTVfGFppg~~-s~vL~~F~~cG~Vvkhv~ 228 (350)
T KOG4285|consen 200 VTVFGFPPGQV-SIVLNLFSRCGEVVKHVT 228 (350)
T ss_pred EEEeccCccch-hHHHHHHHhhCeeeeeec
Confidence 44445555533 446678999999887643
Done!