Query psy2203
Match_columns 74
No_of_seqs 123 out of 604
Neff 5.8
Searched_HMMs 46136
Date Fri Aug 16 18:51:43 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy2203.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2203hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0120|consensus 99.8 1.7E-19 3.8E-24 139.0 4.5 66 8-73 393-459 (500)
2 TIGR01642 U2AF_lg U2 snRNP aux 99.6 5.1E-15 1.1E-19 110.7 7.0 64 10-73 405-469 (509)
3 TIGR01645 half-pint poly-U bin 99.6 5E-15 1.1E-19 116.6 5.8 64 6-74 506-571 (612)
4 KOG0147|consensus 99.5 1.6E-14 3.5E-19 112.1 3.8 55 12-73 441-495 (549)
5 TIGR01622 SF-CC1 splicing fact 99.4 1.7E-12 3.8E-17 96.2 7.5 57 11-73 359-415 (457)
6 KOG0124|consensus 99.3 5.7E-12 1.2E-16 95.9 4.9 62 7-73 439-502 (544)
7 KOG1548|consensus 99.2 5.9E-11 1.3E-15 89.0 5.9 58 3-60 254-312 (382)
8 KOG1996|consensus 98.9 1E-09 2.2E-14 81.6 2.3 57 11-73 278-334 (378)
9 smart00361 RRM_1 RNA recogniti 98.1 1.2E-05 2.6E-10 46.6 5.1 39 35-73 3-42 (70)
10 PF15519 RBM39linker: linker b 97.6 3.4E-05 7.3E-10 46.4 1.8 23 11-33 51-73 (73)
11 KOG2202|consensus 95.5 0.0022 4.7E-08 46.8 -1.0 59 12-73 41-115 (260)
12 PF13893 RRM_5: RNA recognitio 95.4 0.021 4.6E-07 31.1 3.0 20 41-60 1-20 (56)
13 PLN03134 glycine-rich RNA-bind 94.0 0.13 2.8E-06 33.8 4.5 49 13-71 33-81 (144)
14 smart00360 RRM RNA recognition 93.8 0.25 5.5E-06 25.9 4.7 23 39-61 11-33 (71)
15 TIGR01661 ELAV_HUD_SF ELAV/HuD 93.1 0.52 1.1E-05 33.7 6.6 48 14-71 269-316 (352)
16 TIGR01661 ELAV_HUD_SF ELAV/HuD 92.9 0.38 8.3E-06 34.4 5.7 48 14-71 3-50 (352)
17 TIGR01622 SF-CC1 splicing fact 92.0 0.78 1.7E-05 34.2 6.6 50 12-71 87-136 (457)
18 TIGR01659 sex-lethal sex-letha 90.8 0.51 1.1E-05 35.3 4.6 33 39-71 208-240 (346)
19 KOG0127|consensus 90.4 0.51 1.1E-05 38.3 4.4 33 39-72 132-164 (678)
20 PF14259 RRM_6: RNA recognitio 90.2 0.66 1.4E-05 25.6 3.8 32 39-73 13-44 (70)
21 TIGR01649 hnRNP-L_PTB hnRNP-L/ 89.5 0.84 1.8E-05 35.0 4.9 36 13-58 1-36 (481)
22 smart00362 RRM_2 RNA recogniti 89.5 1.5 3.2E-05 22.9 4.7 23 38-60 13-35 (72)
23 cd00590 RRM RRM (RNA recogniti 88.6 1.8 3.9E-05 22.7 4.6 25 38-62 13-37 (74)
24 KOG4207|consensus 88.3 0.71 1.5E-05 33.5 3.5 35 37-73 26-60 (256)
25 TIGR01649 hnRNP-L_PTB hnRNP-L/ 88.1 1.9 4.1E-05 33.1 5.9 40 12-60 273-312 (481)
26 TIGR01628 PABP-1234 polyadenyl 87.9 1.1 2.3E-05 34.7 4.5 48 13-71 284-331 (562)
27 TIGR01628 PABP-1234 polyadenyl 87.1 1.3 2.8E-05 34.2 4.6 34 38-71 14-47 (562)
28 PF00076 RRM_1: RNA recognitio 86.7 1.9 4.2E-05 23.1 4.1 22 38-59 12-33 (70)
29 KOG0114|consensus 86.7 2.4 5.1E-05 27.9 5.0 50 10-74 14-63 (124)
30 TIGR01645 half-pint poly-U bin 85.6 1.6 3.5E-05 35.4 4.5 34 39-72 219-252 (612)
31 TIGR01648 hnRNP-R-Q heterogene 85.2 5 0.00011 32.3 7.1 57 3-71 48-104 (578)
32 TIGR01642 U2AF_lg U2 snRNP aux 85.1 4.1 9E-05 30.7 6.3 48 14-71 295-342 (509)
33 KOG0113|consensus 83.7 2 4.4E-05 32.5 4.1 35 37-73 114-148 (335)
34 PLN03120 nucleic acid binding 83.3 2.8 6E-05 30.8 4.6 45 14-71 4-48 (260)
35 KOG0144|consensus 79.9 3.4 7.4E-05 32.8 4.2 35 37-71 47-81 (510)
36 KOG0107|consensus 79.6 3 6.5E-05 29.5 3.5 25 38-62 24-48 (195)
37 TIGR01659 sex-lethal sex-letha 79.1 4.7 0.0001 30.2 4.7 34 38-71 121-154 (346)
38 KOG0125|consensus 78.9 2.9 6.4E-05 32.0 3.5 32 39-72 111-142 (376)
39 PLN03121 nucleic acid binding 78.2 5 0.00011 29.2 4.4 36 15-60 6-41 (243)
40 PF00586 AIRS: AIR synthase re 78.0 2.7 5.8E-05 25.1 2.6 31 20-50 59-89 (96)
41 COG0724 RNA-binding proteins ( 77.9 7.6 0.00017 25.0 4.9 49 14-72 115-163 (306)
42 PF00522 VPR: VPR/VPX protein; 73.7 3.7 8E-05 26.0 2.5 18 31-48 19-36 (96)
43 KOG0144|consensus 69.4 4.6 0.0001 32.1 2.6 34 37-71 137-170 (510)
44 PHA02944 hypothetical protein; 69.0 8.7 0.00019 26.8 3.6 49 1-49 130-178 (180)
45 PRK07718 fliL flagellar basal 67.5 8.5 0.00018 25.2 3.3 37 21-57 98-137 (142)
46 KOG0149|consensus 64.9 17 0.00036 26.7 4.6 37 37-73 25-61 (247)
47 PLN03213 repressor of silencin 64.1 12 0.00026 30.6 4.1 23 38-60 24-46 (759)
48 TIGR01648 hnRNP-R-Q heterogene 60.7 24 0.00051 28.5 5.2 36 13-58 232-269 (578)
49 PF15513 DUF4651: Domain of un 58.4 34 0.00074 20.0 4.3 29 34-62 4-32 (62)
50 smart00545 JmjN Small domain f 57.0 7.5 0.00016 20.9 1.3 24 26-52 10-33 (42)
51 PF02375 JmjN: jmjN domain; I 56.5 4.1 8.9E-05 21.0 0.2 16 37-52 16-31 (34)
52 COG1864 NUC1 DNA/RNA endonucle 55.9 31 0.00066 25.6 4.7 45 11-55 137-181 (281)
53 KOG1190|consensus 53.6 20 0.00043 28.4 3.6 40 11-60 25-64 (492)
54 PF13043 DUF3903: Domain of un 52.0 12 0.00027 20.0 1.6 15 37-51 9-24 (40)
55 PRK05731 thiamine monophosphat 51.7 15 0.00033 26.4 2.6 31 19-50 85-115 (318)
56 KOG0148|consensus 51.5 18 0.00039 27.3 2.9 45 24-72 66-110 (321)
57 PF08776 VASP_tetra: VASP tetr 51.1 23 0.0005 19.1 2.6 16 32-47 7-22 (40)
58 cd02194 ThiL ThiL (Thiamine-mo 50.1 16 0.00035 25.8 2.5 21 30-50 92-112 (291)
59 TIGR01379 thiL thiamine-monoph 49.2 17 0.00038 26.1 2.6 30 20-50 84-113 (317)
60 PF02037 SAP: SAP domain; Int 47.3 18 0.0004 18.2 1.8 12 38-49 6-17 (35)
61 KOG0127|consensus 46.9 35 0.00075 28.1 4.1 47 15-71 293-339 (678)
62 cd02204 PurL_repeat2 PurL subu 46.3 37 0.0008 23.5 3.8 18 32-49 72-89 (264)
63 KOG0116|consensus 45.9 29 0.00063 27.1 3.4 33 39-71 303-335 (419)
64 cd02192 PurM-like3 AIR synthas 45.8 29 0.00063 24.9 3.3 20 31-50 98-117 (283)
65 cd07058 BMC_CsoS1 Carboxysome 44.6 24 0.00053 21.7 2.3 24 39-62 57-82 (88)
66 smart00540 LEM in nuclear memb 43.6 19 0.00041 19.6 1.5 12 39-50 9-20 (44)
67 PF14605 Nup35_RRM_2: Nup53/35 43.3 43 0.00093 18.3 3.1 22 36-57 12-33 (53)
68 cd02901 Macro_Poa1p_like Macro 43.3 64 0.0014 20.2 4.3 45 17-62 73-120 (140)
69 PRK12785 fliL flagellar basal 42.6 36 0.00077 22.8 3.1 34 23-56 124-160 (166)
70 KOG2287|consensus 41.9 68 0.0015 23.9 4.8 41 10-58 125-165 (349)
71 PF03748 FliL: Flagellar basal 41.7 47 0.001 19.5 3.3 35 23-57 57-94 (99)
72 PF01906 YbjQ_1: Putative heav 41.3 64 0.0014 19.9 3.9 28 33-60 57-86 (105)
73 KOG0117|consensus 40.2 30 0.00064 27.7 2.7 21 38-58 273-293 (506)
74 PF12683 DUF3798: Protein of u 40.1 65 0.0014 24.0 4.4 38 24-62 139-177 (275)
75 KOG0108|consensus 40.0 40 0.00087 26.4 3.4 39 34-72 28-66 (435)
76 KOG0110|consensus 39.1 41 0.00089 28.1 3.5 40 34-73 525-565 (725)
77 PF14234 DUF4336: Domain of un 38.4 50 0.0011 24.5 3.5 43 12-59 20-66 (285)
78 cd00396 PurM-like AIR (aminoim 37.5 44 0.00095 22.5 3.0 32 19-50 42-73 (222)
79 cd03400 Band_7_1 A subgroup of 37.5 48 0.001 20.3 3.0 25 34-58 84-109 (124)
80 KOG0124|consensus 37.2 67 0.0015 25.5 4.2 34 38-71 224-257 (544)
81 PF08777 RRM_3: RNA binding mo 37.0 35 0.00076 21.2 2.3 21 38-58 15-35 (105)
82 KOG0339|consensus 36.8 27 0.00058 28.8 2.0 22 35-56 306-327 (731)
83 cd02106 Band_7 The band 7 doma 36.6 64 0.0014 18.6 3.3 24 35-58 82-106 (121)
84 PF08887 GAD-like: GAD-like do 35.8 37 0.00081 21.5 2.3 31 28-58 55-85 (109)
85 KOG0122|consensus 35.8 69 0.0015 23.8 3.9 48 14-71 189-236 (270)
86 cd07045 BMC_CcmK_like Carbon d 35.6 85 0.0018 18.8 3.7 29 34-62 50-79 (84)
87 KOG4206|consensus 35.4 91 0.002 22.5 4.4 44 12-61 7-50 (221)
88 PF12897 Aminotran_MocR: Alani 34.0 52 0.0011 25.9 3.2 57 16-73 209-265 (425)
89 cd06061 PurM-like1 AIR synthas 33.5 52 0.0011 23.2 2.9 22 29-50 91-112 (298)
90 KOG0145|consensus 33.2 59 0.0013 24.6 3.2 35 37-71 54-88 (360)
91 smart00244 PHB prohibitin homo 33.1 73 0.0016 19.7 3.3 25 34-58 120-145 (160)
92 PF01762 Galactosyl_T: Galacto 32.9 47 0.001 22.0 2.5 23 36-58 36-58 (195)
93 KOG4205|consensus 32.6 61 0.0013 24.2 3.3 34 38-71 111-144 (311)
94 PTZ00370 STEVOR; Provisional 32.3 53 0.0012 24.7 2.9 37 12-49 38-75 (296)
95 COG2450 Uncharacterized conser 32.1 50 0.0011 21.8 2.5 24 26-49 72-96 (124)
96 PRK14105 selenophosphate synth 32.0 51 0.0011 24.2 2.8 21 29-49 110-130 (345)
97 PF05184 SapB_1: Saposin-like 31.9 31 0.00066 17.0 1.2 20 30-49 20-39 (39)
98 PRK00943 selenophosphate synth 31.2 39 0.00084 25.0 2.0 22 29-50 113-134 (347)
99 cd02196 PurM PurM (Aminoimidaz 31.1 69 0.0015 22.9 3.3 21 30-50 77-97 (297)
100 TIGR01478 STEVOR variant surfa 30.7 66 0.0014 24.2 3.1 37 12-49 38-76 (295)
101 cd03506 Delta6-FADS-like The D 30.4 47 0.001 22.3 2.2 17 33-49 187-203 (204)
102 KOG0145|consensus 30.4 58 0.0012 24.7 2.8 49 14-72 278-326 (360)
103 cd03403 Band_7_stomatin_like B 30.1 74 0.0016 21.2 3.1 24 35-58 115-139 (215)
104 COG0309 HypE Hydrogenase matur 30.0 78 0.0017 24.2 3.5 30 20-49 93-122 (339)
105 PF01545 Cation_efflux: Cation 29.4 1.1E+02 0.0024 21.1 3.9 27 31-57 250-277 (284)
106 TIGR00476 selD selenium donor 29.2 69 0.0015 23.5 3.1 22 29-50 114-135 (347)
107 PF14088 DUF4268: Domain of un 29.2 1.5E+02 0.0034 18.7 4.6 41 5-48 90-130 (140)
108 PF08285 DPM3: Dolichol-phosph 29.2 50 0.0011 20.4 2.0 21 21-43 59-79 (91)
109 TIGR02413 Bac_small_yrzI Bacil 28.4 34 0.00074 18.8 1.0 29 20-48 14-42 (46)
110 cd02197 HypE HypE (Hydrogenase 28.1 69 0.0015 22.8 2.8 20 30-49 91-110 (293)
111 PF10087 DUF2325: Uncharacteri 27.6 74 0.0016 19.0 2.5 27 14-50 50-76 (97)
112 KOG1031|consensus 27.2 1E+02 0.0022 26.2 3.8 47 3-49 447-494 (1169)
113 smart00513 SAP Putative DNA-bi 27.2 53 0.0011 16.2 1.6 12 38-49 6-17 (35)
114 PRK07021 fliL flagellar basal 27.0 89 0.0019 20.6 3.0 42 15-56 108-156 (162)
115 cd03399 Band_7_flotillin Band_ 26.8 1.1E+02 0.0024 18.8 3.3 25 35-59 89-114 (128)
116 COG1669 Predicted nucleotidylt 26.6 86 0.0019 19.8 2.7 26 31-57 4-30 (97)
117 PF08663 HalX: HalX domain; I 26.4 39 0.00084 20.0 1.1 20 24-43 28-47 (71)
118 PRK05385 phosphoribosylaminoim 26.0 95 0.0021 22.5 3.3 21 30-50 109-129 (327)
119 cd02195 SelD Selenophosphate s 25.5 76 0.0016 22.6 2.6 21 30-50 108-128 (287)
120 COG3146 Uncharacterized protei 25.3 1.2E+02 0.0027 23.5 3.8 55 1-58 103-158 (387)
121 cd03408 Band_7_5 A subgroup of 25.2 1.1E+02 0.0024 20.1 3.3 24 35-58 168-192 (207)
122 cd07607 BAR_SH3P_plant The Bin 25.2 58 0.0013 23.3 1.9 17 35-51 35-51 (209)
123 smart00596 PRE_C2HC PRE_C2HC d 24.9 1.4E+02 0.0031 17.7 3.3 25 39-63 2-27 (69)
124 PRK08455 fliL flagellar basal 24.7 1E+02 0.0022 21.1 3.1 35 23-57 140-177 (182)
125 KOG3298|consensus 24.7 79 0.0017 22.0 2.5 36 35-70 28-65 (170)
126 PHA02767 hypothetical protein; 24.7 34 0.00073 21.6 0.6 10 39-48 3-12 (101)
127 TIGR02124 hypE hydrogenase exp 24.4 87 0.0019 22.6 2.8 21 30-50 86-106 (320)
128 cd02203 PurL_repeat1 PurL subu 24.2 1.5E+02 0.0034 21.2 4.1 19 32-50 92-110 (313)
129 PF05886 Orthopox_F8: Orthopox 24.1 60 0.0013 19.1 1.6 18 42-59 28-45 (65)
130 KOG0131|consensus 24.1 1.3E+02 0.0027 21.5 3.4 35 37-71 21-56 (203)
131 PRK08284 precorrin 6A synthase 24.0 1.9E+02 0.0041 20.7 4.4 47 13-59 200-250 (253)
132 PF12668 DUF3791: Protein of u 23.7 86 0.0019 17.5 2.2 16 34-49 47-62 (62)
133 PRK10222 PTS system L-ascorbat 23.5 51 0.0011 19.7 1.3 20 11-30 51-70 (85)
134 COG4080 SpoU rRNA Methylase fa 23.5 39 0.00083 23.0 0.8 44 16-70 4-47 (147)
135 PF10256 Erf4: Golgin subfamil 23.3 1.6E+02 0.0035 18.1 3.6 34 2-47 15-48 (118)
136 KOG0130|consensus 23.3 73 0.0016 21.9 2.1 24 37-60 85-108 (170)
137 TIGR00878 purM phosphoribosyla 23.1 1.2E+02 0.0025 22.2 3.3 21 30-50 109-129 (332)
138 KOG0117|consensus 22.9 3.7E+02 0.0081 21.7 6.1 58 3-73 73-130 (506)
139 cd02691 PurM-like2 AIR synthas 22.7 1E+02 0.0022 23.0 2.9 25 25-50 96-120 (346)
140 COG5470 Uncharacterized conser 22.4 1.1E+02 0.0023 19.4 2.6 23 27-49 11-33 (96)
141 KOG0123|consensus 22.4 1.5E+02 0.0033 22.4 3.8 23 40-62 92-114 (369)
142 cd02685 MIT_C MIT_C; domain fo 22.0 2.4E+02 0.0053 19.0 4.4 41 17-57 52-92 (148)
143 TIGR00693 thiE thiamine-phosph 21.9 1.2E+02 0.0025 19.9 2.9 21 30-50 38-58 (196)
144 cd03405 Band_7_HflC Band_7_Hfl 21.9 1.3E+02 0.0028 20.4 3.2 24 34-57 122-146 (242)
145 PRK03892 ribonuclease P protei 21.7 1.1E+02 0.0025 22.0 2.9 33 28-61 38-70 (216)
146 cd04933 ACT_AK1-AT_1 ACT domai 21.7 98 0.0021 18.2 2.2 16 37-52 60-75 (78)
147 PF12021 DUF3509: Protein of u 21.6 1.1E+02 0.0024 19.1 2.5 29 17-45 37-65 (94)
148 KOG3233|consensus 21.5 1.7E+02 0.0037 22.1 3.8 34 15-48 142-181 (297)
149 PRK07226 fructose-bisphosphate 21.4 1.5E+02 0.0032 21.0 3.4 24 32-55 122-145 (267)
150 TIGR01821 5aminolev_synth 5-am 21.1 2.5E+02 0.0054 20.5 4.7 34 13-52 177-210 (402)
151 KOG0477|consensus 21.0 71 0.0015 27.1 1.9 54 1-59 236-309 (854)
152 TIGR01949 AroFGH_arch predicte 20.9 1.5E+02 0.0033 20.7 3.4 25 32-56 119-143 (258)
153 PF05722 Ustilago_mating: Usti 20.4 47 0.001 24.2 0.7 12 1-13 257-268 (286)
154 KOG4175|consensus 20.4 63 0.0014 23.7 1.4 13 38-50 137-149 (268)
155 KOG0105|consensus 20.1 1.4E+02 0.0031 21.5 3.1 21 40-60 22-42 (241)
No 1
>KOG0120|consensus
Probab=99.77 E-value=1.7e-19 Score=139.00 Aligned_cols=66 Identities=55% Similarity=0.969 Sum_probs=62.1
Q ss_pred CCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCC-CCCCCCCceEEEEe
Q psy2203 8 LVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRP-IENVDVPGCGKATL 73 (74)
Q Consensus 8 ~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp-~~~~~~~gvGkVFv 73 (74)
..+..||+|+||+||++++||.++++|+||+|||+.||+|||.|.+|.|||| ..+...+|+|||||
T Consensus 393 q~~g~~t~Vl~L~n~Vt~deLkdd~EyeeIlEdvr~ec~k~g~v~~v~ipr~~~~~~~~~G~GkVFV 459 (500)
T KOG0120|consen 393 QMAGIPTEVLCLTNVVTPDELKDDEEYEEILEDVRTECAKFGAVRSVEIPRPYPDENPVPGTGKVFV 459 (500)
T ss_pred ccCCCcchhhhhhhcCCHHHhcchHHHHHHHHHHHHHhcccCceeEEecCCCCCCCCcCCCcccEEE
Confidence 4567999999999999999999999999999999999999999999999999 55668999999998
No 2
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=99.57 E-value=5.1e-15 Score=110.71 Aligned_cols=64 Identities=44% Similarity=0.680 Sum_probs=57.7
Q ss_pred CCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCC-CCCCceEEEEe
Q psy2203 10 PPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIEN-VDVPGCGKATL 73 (74)
Q Consensus 10 ~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~-~~~~gvGkVFv 73 (74)
...||+||+|.||+++++|.++++|++|.+||++||++||.|++|.||++..+ ....|.|++||
T Consensus 405 ~~~~s~v~~l~N~~~~~~l~~d~~~~~~~edl~~~f~~~G~v~~v~i~~~~~~~~~~~~~G~~fV 469 (509)
T TIGR01642 405 GGKPTKVVQLTNLVTGDDLMDDEEYEEIYEDVKTEFSKYGPLINIVIPRPNGDRNSTPGVGKVFL 469 (509)
T ss_pred cCCCceEEEeccCCchhHhcCcchHHHHHHHHHHHHHhcCCeeEEEeeccCcCCCcCCCcceEEE
Confidence 34689999999999999999999999999999999999999999999998543 35678999998
No 3
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=99.56 E-value=5e-15 Score=116.61 Aligned_cols=64 Identities=30% Similarity=0.435 Sum_probs=54.4
Q ss_pred ccCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCC--CCCceEEEEeC
Q psy2203 6 DRLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENV--DVPGCGKATLV 74 (74)
Q Consensus 6 d~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~--~~~gvGkVFv~ 74 (74)
++|....+|+||+|.|||+++|+.+ ||++||++||+|||.|.+|.|+.++.+. +..++|+|||.
T Consensus 506 ~~l~rp~~S~vVvL~NMv~~~elde-----dl~eDV~eEC~K~G~V~~v~I~~~~~~~~~~~~~~g~VfV~ 571 (612)
T TIGR01645 506 QKLMRTNRSNVIVLRNMVTPQDIDE-----FLEGEIREECGKFGVVDRVIINFEKQGEEEDAEIIVKIFVE 571 (612)
T ss_pred HhhcCCCCCCEEEEeCCCChHHhHH-----HHHHHHHHHhhcCceeEEEEEecCCCCccccccceEEEEEE
Confidence 3566677899999999999999853 4999999999999999999999977653 45679999983
No 4
>KOG0147|consensus
Probab=99.49 E-value=1.6e-14 Score=112.09 Aligned_cols=55 Identities=27% Similarity=0.376 Sum_probs=51.5
Q ss_pred CCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 12 LRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 12 ~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
.||+|++|+|||+|.+.+++.|-+||.+||.|||+|||.|.||.|...+ .|.|||
T Consensus 441 i~t~C~lL~nMFdpstete~n~d~eI~edV~Eec~k~g~v~hi~vd~ns-------~g~VYv 495 (549)
T KOG0147|consen 441 IPTQCLLLSNMFDPSTETEPNWDQEIREDVIEECGKHGKVCHIFVDKNS-------AGCVYV 495 (549)
T ss_pred CccHHHHHhhcCCcccccCcchhhHHHHHHHHHHHhcCCeeEEEEccCC-------CceEEE
Confidence 8999999999999999999999999999999999999999999999966 367776
No 5
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=99.37 E-value=1.7e-12 Score=96.16 Aligned_cols=57 Identities=33% Similarity=0.491 Sum_probs=51.7
Q ss_pred CCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 11 PLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 11 ~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
..+++||+|.|||++.+..++.+|+||.+||++||+|||.|++|.|+++. +.|.+||
T Consensus 359 ~~~~~~l~l~n~~~~~~~~~~~~~~~~~~dv~~e~~k~G~v~~v~v~~~~------~~G~~fV 415 (457)
T TIGR01622 359 NLATTCLVLSNMFDPATEEEPNFDNEILDDVKEECSKYGGVVHIYVDTKN------SAGKIYL 415 (457)
T ss_pred CCCCcEEEEecCCCCcccccchHHHHHHHHHHHHHHhcCCeeEEEEeCCC------CceeEEE
Confidence 37899999999999999999999999999999999999999999998764 3577886
No 6
>KOG0124|consensus
Probab=99.26 E-value=5.7e-12 Score=95.94 Aligned_cols=62 Identities=27% Similarity=0.437 Sum_probs=53.6
Q ss_pred cCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCC--CCCceEEEEe
Q psy2203 7 RLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENV--DVPGCGKATL 73 (74)
Q Consensus 7 ~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~--~~~gvGkVFv 73 (74)
+|.....|+|++|+||++|+|++++ |+.+|+|||+|||.|.+|.|..-.++. ..+-+-||||
T Consensus 439 kLmR~~~S~VivLRNMV~P~DiDe~-----LegEi~EECgKfG~V~rViI~nekq~e~edaeiiVKIFV 502 (544)
T KOG0124|consen 439 KLMRKQESTVIVLRNMVDPKDIDED-----LEGEITEECGKFGAVNRVIIYNEKQGEEEDAEIIVKIFV 502 (544)
T ss_pred HHhccccCcEEEEeccCChhhhhhH-----HHHHHHHHHhcccceeEEEEEecccccccchhhhheeee
Confidence 5666788999999999999999876 999999999999999999999887654 3445788887
No 7
>KOG1548|consensus
Probab=99.16 E-value=5.9e-11 Score=88.98 Aligned_cols=58 Identities=31% Similarity=0.492 Sum_probs=48.4
Q ss_pred CCcccCCCCCCceEEEeeCCCCCCcCCCh-hHHHHHHHHHHHHHHhcCCeEEEEecCCC
Q psy2203 3 PPWDRLVPPLRFSVLCLLNMVTPDELKDE-DEYEDICEDIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 3 ~~~d~~~~~~ps~vv~L~Nm~~~~el~~~-~~~~ei~eDv~eEC~K~G~V~~V~Iprp~ 60 (74)
|-+++-.-....+.|+|+|||+|+++..+ +...||++|+++||+|||.|.+|+|...+
T Consensus 254 pd~~~~sk~r~~~tVi~kn~Ftp~~~~~~~~l~~dlkedl~eec~K~G~v~~vvv~d~h 312 (382)
T KOG1548|consen 254 PDRDDPSKARADRTVILKNMFTPEDFEKNPDLLNDLKEDLTEECEKFGQVRKVVVYDRH 312 (382)
T ss_pred CCccccccccCCcEEEeeecCCHHHhccCHHHHHHHHHHHHHHHHHhCCcceEEEeccC
Confidence 34445555567788999999999999877 45799999999999999999999998643
No 8
>KOG1996|consensus
Probab=98.87 E-value=1e-09 Score=81.63 Aligned_cols=57 Identities=30% Similarity=0.350 Sum_probs=46.8
Q ss_pred CCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 11 PLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 11 ~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
..|++||.|.|||.+++..++ |.+++++||+|||+|.+|.|+.-+. +...-.-+|||
T Consensus 278 ~~ptkvlllrnmVg~gevd~e-----lede~keEceKyg~V~~viifeip~-~p~deavRiFv 334 (378)
T KOG1996|consen 278 KCPTKVLLLRNMVGAGEVDEE-----LEDETKEECEKYGKVGNVIIFEIPS-QPEDEAVRIFV 334 (378)
T ss_pred hcchHHHHhhhhcCcccccHH-----HHHHHHHHHHhhcceeeEEEEecCC-Cccchhheeee
Confidence 468999999999999998765 9999999999999999999987632 33334556765
No 9
>smart00361 RRM_1 RNA recognition motif.
Probab=98.07 E-value=1.2e-05 Score=46.64 Aligned_cols=39 Identities=21% Similarity=0.221 Sum_probs=26.1
Q ss_pred HHHHHHHHHHHHhcCCeEEEE-ecCCCCCCCCCceEEEEe
Q psy2203 35 EDICEDIKEECNKYGIVRSLE-IPRPIENVDVPGCGKATL 73 (74)
Q Consensus 35 ~ei~eDv~eEC~K~G~V~~V~-Iprp~~~~~~~gvGkVFv 73 (74)
+||++.+++||++||.|.+|. |..+........-|.+||
T Consensus 3 ~~l~~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV 42 (70)
T smart00361 3 EDFEREFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYI 42 (70)
T ss_pred hhHHHHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEE
Confidence 479999999999999999985 544433211122355565
No 10
>PF15519 RBM39linker: linker between RRM2 and RRM3 domains in RBM39 protein; PDB: 3S6E_A 2LQ5_A.
Probab=97.61 E-value=3.4e-05 Score=46.42 Aligned_cols=23 Identities=30% Similarity=0.410 Sum_probs=16.7
Q ss_pred CCCceEEEeeCCCCCCcCCChhH
Q psy2203 11 PLRFSVLCLLNMVTPDELKDEDE 33 (74)
Q Consensus 11 ~~ps~vv~L~Nm~~~~el~~~~~ 33 (74)
..+|+|++|+|||+|.+.+++.|
T Consensus 51 ~~aS~C~lLkNMFDP~~Ete~~W 73 (73)
T PF15519_consen 51 PIASRCFLLKNMFDPAEETEPDW 73 (73)
T ss_dssp S---SEEEEESSS-TTCGGSTTH
T ss_pred CCCCceeeeecCCCcccccCCCC
Confidence 58899999999999999887765
No 11
>KOG2202|consensus
Probab=95.55 E-value=0.0022 Score=46.84 Aligned_cols=59 Identities=20% Similarity=0.282 Sum_probs=41.8
Q ss_pred CCceEEEeeCCCCCCcCC-----------Chh----HHHHHHHHHHHHHH-hcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 12 LRFSVLCLLNMVTPDELK-----------DED----EYEDICEDIKEECN-KYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 12 ~ps~vv~L~Nm~~~~el~-----------~~~----~~~ei~eDv~eEC~-K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
.-|..|+|.||.--.+.. .|+ .|+|..|||-.|++ |||.|+++.|-..... .=+|.|||
T Consensus 41 t~s~t~ll~nmyq~P~~~~~~~d~~~~~~~de~~q~~~defyEd~f~E~~~kygEiee~~Vc~Nl~~---hl~GNVYV 115 (260)
T KOG2202|consen 41 TFSQTVLLKNMYQNPENSWERRDAQGQFLTDEELQRHEDEFYEDVFTELEDKYGEIEELNVCDNLGD---HLVGNVYV 115 (260)
T ss_pred ccchHHHHHHHHhCCCCCchhhhhccccccHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccch---hhhhhhhh
Confidence 345677789987544322 222 36899999999998 9999999988886542 23566665
No 12
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=95.43 E-value=0.021 Score=31.06 Aligned_cols=20 Identities=20% Similarity=0.365 Sum_probs=16.7
Q ss_pred HHHHHHhcCCeEEEEecCCC
Q psy2203 41 IKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 41 v~eEC~K~G~V~~V~Iprp~ 60 (74)
+++.|++||.|.++.+.+..
T Consensus 1 L~~~f~~fG~V~~i~~~~~~ 20 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKKK 20 (56)
T ss_dssp HHHHHTTTS-EEEEEEETTS
T ss_pred ChHHhCCcccEEEEEEEeCC
Confidence 46889999999999998865
No 13
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=93.99 E-value=0.13 Score=33.79 Aligned_cols=49 Identities=12% Similarity=0.102 Sum_probs=34.0
Q ss_pred CceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 13 RFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 13 ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.++-|.+.|+-..-. .+|+++-+++||.|.++.|++...+....|.|.|
T Consensus 33 ~~~~lfVgnL~~~~t----------e~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV 81 (144)
T PLN03134 33 MSTKLFIGGLSWGTD----------DASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFV 81 (144)
T ss_pred CCCEEEEeCCCCCCC----------HHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEE
Confidence 345577888754322 4688999999999999999887654334455443
No 14
>smart00360 RRM RNA recognition motif.
Probab=93.80 E-value=0.25 Score=25.86 Aligned_cols=23 Identities=26% Similarity=0.566 Sum_probs=19.7
Q ss_pred HHHHHHHHhcCCeEEEEecCCCC
Q psy2203 39 EDIKEECNKYGIVRSLEIPRPIE 61 (74)
Q Consensus 39 eDv~eEC~K~G~V~~V~Iprp~~ 61 (74)
+|+++-++.||.|..+.+++...
T Consensus 11 ~~l~~~f~~~g~v~~~~i~~~~~ 33 (71)
T smart00360 11 EELRELFSKFGKIESVRLVRDKD 33 (71)
T ss_pred HHHHHHHHhhCCEeEEEEEeCCC
Confidence 57788888999999999998764
No 15
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=93.05 E-value=0.52 Score=33.74 Aligned_cols=48 Identities=17% Similarity=0.249 Sum_probs=33.5
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
+++|.+.|+-. +. + .+|+++-+++||.|.++.|++...+...-|.|.|
T Consensus 269 ~~~lfV~NL~~--~~-~-------e~~L~~~F~~fG~v~~v~i~~d~~t~~skG~aFV 316 (352)
T TIGR01661 269 GYCIFVYNLSP--DT-D-------ETVLWQLFGPFGAVQNVKIIRDLTTNQCKGYGFV 316 (352)
T ss_pred CcEEEEeCCCC--CC-C-------HHHHHHHHHhCCCeEEEEEeEcCCCCCccceEEE
Confidence 44788999843 33 2 3567788999999999999986644344555544
No 16
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=92.87 E-value=0.38 Score=34.40 Aligned_cols=48 Identities=19% Similarity=0.248 Sum_probs=32.7
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
+.-|.+.|+ |.++.+ +|+++.+++||.|.+|.|.+-..+....|.|.|
T Consensus 3 ~~~l~V~nL--p~~~~e--------~~l~~~F~~~G~i~~v~i~~d~~~g~s~g~afV 50 (352)
T TIGR01661 3 KTNLIVNYL--PQTMTQ--------EEIRSLFTSIGEIESCKLVRDKVTGQSLGYGFV 50 (352)
T ss_pred CcEEEEeCC--CCCCCH--------HHHHHHHHccCCEEEEEEEEcCCCCccceEEEE
Confidence 455667887 555543 578899999999999999875443233344443
No 17
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=92.04 E-value=0.78 Score=34.19 Aligned_cols=50 Identities=16% Similarity=0.151 Sum_probs=35.3
Q ss_pred CCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 12 LRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 12 ~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+++.|.+.|+ +.++. .+||++.+++||.|..|.|++...+...-|.|.|
T Consensus 87 ~~~~~l~V~nl--p~~~~--------~~~l~~~F~~~G~v~~v~i~~d~~~~~skg~afV 136 (457)
T TIGR01622 87 RDDRTVFVLQL--ALKAR--------ERDLYEFFSKVGKVRDVQCIKDRNSRRSKGVAYV 136 (457)
T ss_pred cCCcEEEEeCC--CCCCC--------HHHHHHHHHhcCCeeEEEEeecCCCCCcceEEEE
Confidence 45778889998 33443 3568889999999999999886543334455554
No 18
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=90.80 E-value=0.51 Score=35.27 Aligned_cols=33 Identities=18% Similarity=0.191 Sum_probs=24.3
Q ss_pred HHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 39 EDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 39 eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
+|+++.+++||.|+++.|++...+..+.|.|.|
T Consensus 208 e~L~~~F~~fG~V~~v~i~~d~~tg~~kG~aFV 240 (346)
T TIGR01659 208 DQLDTIFGKYGQIVQKNILRDKLTGTPRGVAFV 240 (346)
T ss_pred HHHHHHHHhcCCEEEEEEeecCCCCccceEEEE
Confidence 578888999999999999986543333455444
No 19
>KOG0127|consensus
Probab=90.40 E-value=0.51 Score=38.27 Aligned_cols=33 Identities=24% Similarity=0.368 Sum_probs=26.6
Q ss_pred HHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEE
Q psy2203 39 EDIKEECNKYGIVRSLEIPRPIENVDVPGCGKAT 72 (74)
Q Consensus 39 eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVF 72 (74)
.|++.=|++||.|..|+||+..++..+ |.|.|.
T Consensus 132 ~dLk~vFs~~G~V~Ei~IP~k~dgklc-GFaFV~ 164 (678)
T KOG0127|consen 132 PDLKNVFSNFGKVVEIVIPRKKDGKLC-GFAFVQ 164 (678)
T ss_pred HHHHHHHhhcceEEEEEcccCCCCCcc-ceEEEE
Confidence 388899999999999999998877544 555553
No 20
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=90.23 E-value=0.66 Score=25.62 Aligned_cols=32 Identities=19% Similarity=0.367 Sum_probs=24.8
Q ss_pred HHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 39 EDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 39 eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
+|+++.++.||.|.++.+.+...+ .+-|..||
T Consensus 13 ~~l~~~f~~~g~v~~v~~~~~~~~---~~~~~a~v 44 (70)
T PF14259_consen 13 EDLRNFFSRFGPVEKVRLIKNKDG---QSRGFAFV 44 (70)
T ss_dssp HHHHHHCTTSSBEEEEEEEESTTS---SEEEEEEE
T ss_pred HHHHHHHHhcCCcceEEEEeeecc---ccCCEEEE
Confidence 789999999999999999986542 34555554
No 21
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=89.49 E-value=0.84 Score=35.00 Aligned_cols=36 Identities=17% Similarity=0.145 Sum_probs=29.1
Q ss_pred CceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecC
Q psy2203 13 RFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPR 58 (74)
Q Consensus 13 ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Ipr 58 (74)
||++|.+.|+=.. .. .+|+++-+++||.|.++.|.+
T Consensus 1 ps~vv~V~nLp~~--~t--------e~~L~~~f~~fG~V~~v~i~~ 36 (481)
T TIGR01649 1 PSPVVHVRNLPQD--VV--------EADLVEALIPFGPVSYVMMLP 36 (481)
T ss_pred CccEEEEcCCCCC--CC--------HHHHHHHHHhcCCeeEEEEEC
Confidence 7999999997543 32 357788899999999999886
No 22
>smart00362 RRM_2 RNA recognition motif.
Probab=89.49 E-value=1.5 Score=22.89 Aligned_cols=23 Identities=39% Similarity=0.670 Sum_probs=18.8
Q ss_pred HHHHHHHHHhcCCeEEEEecCCC
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~ 60 (74)
.+|+++-.++||.|..+.+.+..
T Consensus 13 ~~~l~~~~~~~g~v~~~~~~~~~ 35 (72)
T smart00362 13 EEDLKELFSKFGPIESVKIPKDT 35 (72)
T ss_pred HHHHHHHHHhcCCEEEEEEecCC
Confidence 35677778899999999988765
No 23
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=88.61 E-value=1.8 Score=22.68 Aligned_cols=25 Identities=36% Similarity=0.534 Sum_probs=20.8
Q ss_pred HHHHHHHHHhcCCeEEEEecCCCCC
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPIEN 62 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~~~ 62 (74)
.+|+++-++.||.+.++.+.+...+
T Consensus 13 ~~~i~~~~~~~g~i~~~~~~~~~~~ 37 (74)
T cd00590 13 EEDLRELFSKFGKVESVRIVRDKDT 37 (74)
T ss_pred HHHHHHHHHhcCCEEEEEEeeCCCC
Confidence 4688888999999999999986543
No 24
>KOG4207|consensus
Probab=88.28 E-value=0.71 Score=33.53 Aligned_cols=35 Identities=29% Similarity=0.355 Sum_probs=25.8
Q ss_pred HHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 37 ICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
--+|++--.+|||.|-.|+||+-..+...- |..||
T Consensus 26 spd~LrrvFekYG~vgDVyIPrdr~Tr~sR--gFaFV 60 (256)
T KOG4207|consen 26 SPDDLRRVFEKYGRVGDVYIPRDRYTRQSR--GFAFV 60 (256)
T ss_pred CHHHHHHHHHHhCcccceeccccccccccc--ceeEE
Confidence 345677788999999999999977654444 44454
No 25
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=88.05 E-value=1.9 Score=33.10 Aligned_cols=40 Identities=15% Similarity=0.122 Sum_probs=30.3
Q ss_pred CCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCC
Q psy2203 12 LRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 12 ~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~ 60 (74)
.++++|.+.|+- .+.+.+ +++++-+++||.|.+|+|.+.+
T Consensus 273 ~~~~~l~v~nL~-~~~vt~--------~~L~~lF~~yG~V~~vki~~~~ 312 (481)
T TIGR01649 273 GPGSVLMVSGLH-QEKVNC--------DRLFNLFCVYGNVERVKFMKNK 312 (481)
T ss_pred CCCCEEEEeCCC-CCCCCH--------HHHHHHHHhcCCeEEEEEEeCC
Confidence 477899999983 222322 4788889999999999998854
No 26
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=87.89 E-value=1.1 Score=34.65 Aligned_cols=48 Identities=17% Similarity=0.140 Sum_probs=32.0
Q ss_pred CceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 13 RFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 13 ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+..|.+.|+-. ++. .+++++.|++||.|.++.|.+...+ ...|.|.|
T Consensus 284 ~~~~l~V~nl~~--~~~--------~~~L~~~F~~~G~i~~~~i~~d~~g-~~~g~gfV 331 (562)
T TIGR01628 284 QGVNLYVKNLDD--TVT--------DEKLRELFSECGEITSAKVMLDEKG-VSRGFGFV 331 (562)
T ss_pred CCCEEEEeCCCC--ccC--------HHHHHHHHHhcCCeEEEEEEECCCC-CcCCeEEE
Confidence 344577777632 222 3678999999999999999987433 23455544
No 27
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=87.12 E-value=1.3 Score=34.16 Aligned_cols=34 Identities=18% Similarity=0.179 Sum_probs=25.0
Q ss_pred HHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+|+++-+++||.|.+|.|.+-..+....|.|.|
T Consensus 14 e~~L~~~F~~~G~v~~v~v~~d~~t~~s~G~afV 47 (562)
T TIGR01628 14 EAKLYDLFKPFGPVLSVRVCRDSVTRRSLGYGYV 47 (562)
T ss_pred HHHHHHHHHhcCCEEEEEEEecCCCCCcceEEEE
Confidence 4678888999999999999987654333444443
No 28
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=86.72 E-value=1.9 Score=23.14 Aligned_cols=22 Identities=18% Similarity=0.501 Sum_probs=19.5
Q ss_pred HHHHHHHHHhcCCeEEEEecCC
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRP 59 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp 59 (74)
.+|+++-.++||.+.++.+.+.
T Consensus 12 ~~~l~~~f~~~g~i~~~~~~~~ 33 (70)
T PF00076_consen 12 EEELRDFFSQFGKIESIKVMRN 33 (70)
T ss_dssp HHHHHHHHHTTSTEEEEEEEEE
T ss_pred HHHHHHHHHHhhhccccccccc
Confidence 3688899999999999999986
No 29
>KOG0114|consensus
Probab=86.71 E-value=2.4 Score=27.88 Aligned_cols=50 Identities=30% Similarity=0.508 Sum_probs=36.2
Q ss_pred CCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEeC
Q psy2203 10 PPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATLV 74 (74)
Q Consensus 10 ~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv~ 74 (74)
++.-++++.+.|+ |-++.. |+|-+=..|||.|.+|.|--... .-|..||+
T Consensus 14 ppevnriLyirNL--p~~ITs--------eemydlFGkyg~IrQIRiG~~k~-----TrGTAFVV 63 (124)
T KOG0114|consen 14 PPEVNRILYIRNL--PFKITS--------EEMYDLFGKYGTIRQIRIGNTKE-----TRGTAFVV 63 (124)
T ss_pred ChhhheeEEEecC--CccccH--------HHHHHHhhcccceEEEEecCccC-----cCceEEEE
Confidence 4456889999997 444443 46777789999999999876443 35777774
No 30
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=85.56 E-value=1.6 Score=35.35 Aligned_cols=34 Identities=21% Similarity=0.296 Sum_probs=26.7
Q ss_pred HHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEE
Q psy2203 39 EDIKEECNKYGIVRSLEIPRPIENVDVPGCGKAT 72 (74)
Q Consensus 39 eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVF 72 (74)
+|+++.+++||.|.++.|++...+....|.|.|-
T Consensus 219 edLk~lFs~FG~I~svrl~~D~~tgksKGfGFVe 252 (612)
T TIGR01645 219 TDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIE 252 (612)
T ss_pred HHHHHHHhhcCCeeEEEEEecCCCCCcCCeEEEE
Confidence 5788889999999999999876554556777663
No 31
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=85.24 E-value=5 Score=32.30 Aligned_cols=57 Identities=16% Similarity=0.236 Sum_probs=37.1
Q ss_pred CCcccCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 3 PPWDRLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 3 ~~~d~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
|.|+...+...+++ .+.|+=. ++. .+|+++-+++||.|.++.|.+...+ ..-|.|.|
T Consensus 48 ~~~~~~~p~~~~~l-FVgnLp~--~~t--------Ed~L~~~F~~~G~I~~vrl~~D~sG-~sRGfaFV 104 (578)
T TIGR01648 48 PGWSGVQPGRGCEV-FVGKIPR--DLY--------EDELVPLFEKAGPIYELRLMMDFSG-QNRGYAFV 104 (578)
T ss_pred CcccCCCCCCCCEE-EeCCCCC--CCC--------HHHHHHHHHhhCCEEEEEEEECCCC-CccceEEE
Confidence 45877776666554 4666532 332 3578889999999999999887333 23344443
No 32
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=85.06 E-value=4.1 Score=30.69 Aligned_cols=48 Identities=19% Similarity=0.266 Sum_probs=32.0
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+.|.+.|+ +..+.+ ++|++-+++||.|..+.|.+...+....|.|.|
T Consensus 295 ~~~l~v~nl--p~~~~~--------~~l~~~f~~~G~i~~~~~~~~~~~g~~~g~afv 342 (509)
T TIGR01642 295 KDRIYIGNL--PLYLGE--------DQIKELLESFGDLKAFNLIKDIATGLSKGYAFC 342 (509)
T ss_pred CCEEEEeCC--CCCCCH--------HHHHHHHHhcCCeeEEEEEecCCCCCcCeEEEE
Confidence 456778888 344432 577888899999999998875443233445444
No 33
>KOG0113|consensus
Probab=83.67 E-value=2 Score=32.46 Aligned_cols=35 Identities=20% Similarity=0.303 Sum_probs=25.9
Q ss_pred HHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 37 ICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
-+.+++.|.++||.|++|.+-+...+..+ -|..||
T Consensus 114 ~EskLrreF~~YG~IkrirlV~d~vTgks--kGYAFI 148 (335)
T KOG0113|consen 114 SESKLRREFEKYGPIKRIRLVRDKVTGKS--KGYAFI 148 (335)
T ss_pred cHHHHHHHHHhcCcceeEEEeeecccCCc--cceEEE
Confidence 35789999999999999999886543333 345554
No 34
>PLN03120 nucleic acid binding protein; Provisional
Probab=83.29 E-value=2.8 Score=30.76 Aligned_cols=45 Identities=18% Similarity=0.078 Sum_probs=31.4
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
++.|.+.|+=.. . -.+||++-.+.||.|.+|.|++... ..|.|.|
T Consensus 4 ~rtVfVgNLs~~--t--------TE~dLrefFS~~G~I~~V~I~~d~~---~~GfAFV 48 (260)
T PLN03120 4 VRTVKVSNVSLK--A--------TERDIKEFFSFSGDIEYVEMQSENE---RSQIAYV 48 (260)
T ss_pred CCEEEEeCCCCC--C--------CHHHHHHHHHhcCCeEEEEEeecCC---CCCEEEE
Confidence 456677776433 1 1468899999999999999988653 2455544
No 35
>KOG0144|consensus
Probab=79.87 E-value=3.4 Score=32.77 Aligned_cols=35 Identities=23% Similarity=0.359 Sum_probs=28.5
Q ss_pred HHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 37 ICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
-+.|+|+-.+|||.|..|.|+|-..+....|...|
T Consensus 47 sE~dlr~lFe~yg~V~einl~kDk~t~~s~gcCFv 81 (510)
T KOG0144|consen 47 SEKDLRELFEKYGNVYEINLIKDKSTGQSKGCCFV 81 (510)
T ss_pred cHHHHHHHHHHhCceeEEEeecccccCcccceEEE
Confidence 46899999999999999999998776555565544
No 36
>KOG0107|consensus
Probab=79.63 E-value=3 Score=29.46 Aligned_cols=25 Identities=24% Similarity=0.408 Sum_probs=20.7
Q ss_pred HHHHHHHHHhcCCeEEEEecCCCCC
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPIEN 62 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~~~ 62 (74)
+.|++.+..+||.+.+|+|.+.+++
T Consensus 24 k~eLE~~F~~yG~lrsvWvArnPPG 48 (195)
T KOG0107|consen 24 KRELERAFSKYGPLRSVWVARNPPG 48 (195)
T ss_pred hHHHHHHHHhcCcceeEEEeecCCC
Confidence 4456788999999999999997654
No 37
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=79.12 E-value=4.7 Score=30.15 Aligned_cols=34 Identities=12% Similarity=0.038 Sum_probs=23.6
Q ss_pred HHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+|+++-+++||.|++|.|.+-..+....|.|.|
T Consensus 121 e~~L~~lF~~~G~V~~v~i~~d~~tg~srGyaFV 154 (346)
T TIGR01659 121 DRELYALFRTIGPINTCRIMRDYKTGYSFGYAFV 154 (346)
T ss_pred HHHHHHHHHhcCCEEEEEEEecCCCCccCcEEEE
Confidence 4578888899999999999875443233344444
No 38
>KOG0125|consensus
Probab=78.85 E-value=2.9 Score=32.03 Aligned_cols=32 Identities=28% Similarity=0.347 Sum_probs=26.1
Q ss_pred HHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEE
Q psy2203 39 EDIKEECNKYGIVRSLEIPRPIENVDVPGCGKAT 72 (74)
Q Consensus 39 eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVF 72 (74)
-|++.-.+|||+|+.|.|.-...| .-|.|.|-
T Consensus 111 pDL~aMF~kfG~VldVEIIfNERG--SKGFGFVT 142 (376)
T KOG0125|consen 111 PDLRAMFEKFGKVLDVEIIFNERG--SKGFGFVT 142 (376)
T ss_pred ccHHHHHHhhCceeeEEEEeccCC--CCccceEE
Confidence 388999999999999999987765 35777664
No 39
>PLN03121 nucleic acid binding protein; Provisional
Probab=78.24 E-value=5 Score=29.24 Aligned_cols=36 Identities=17% Similarity=0.173 Sum_probs=27.2
Q ss_pred eEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCC
Q psy2203 15 SVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 15 ~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~ 60 (74)
..|...|+-..-. .+|+++-.+.||.|.+|.|++..
T Consensus 6 ~TV~V~NLS~~tT----------E~dLrefFS~~G~I~~V~I~~D~ 41 (243)
T PLN03121 6 YTAEVTNLSPKAT----------EKDVYDFFSHCGAIEHVEIIRSG 41 (243)
T ss_pred eEEEEecCCCCCC----------HHHHHHHHHhcCCeEEEEEecCC
Confidence 4566777643322 47899999999999999999864
No 40
>PF00586 AIRS: AIR synthase related protein, N-terminal domain; InterPro: IPR000728 This family includes Hydrogen expression/formation protein, HypE, which may be involved in the maturation of NifE hydrogenase; AIR synthase and FGAM synthase, which are involved in de novo purine biosynthesis; and selenide, water dikinase, an enzyme which synthesizes selenophosphate from selenide and ATP.; GO: 0003824 catalytic activity; PDB: 3VIU_A 2Z1T_A 2Z1U_A 3C9U_B 3C9S_A 3C9R_A 1VQV_A 3C9T_B 3M84_A 3QTY_A ....
Probab=78.04 E-value=2.7 Score=25.06 Aligned_cols=31 Identities=26% Similarity=0.578 Sum_probs=23.0
Q ss_pred eCCCCCCcCCChhHHHHHHHHHHHHHHhcCC
Q psy2203 20 LNMVTPDELKDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 20 ~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
.+-+...+..+.++.+++.+-+.+.|++||.
T Consensus 59 ~~~l~~~~~~~~~~l~~~~~Gi~~~~~~~g~ 89 (96)
T PF00586_consen 59 LDSLGLPNPESPEELKEIVKGIAEACREFGI 89 (96)
T ss_dssp EEEEEESTTSBHHHHHHHHHHHHHHHHHHT-
T ss_pred EEEEEcCCCCCHHHHHHHHHHHHHHHHHhCC
Confidence 3333335555677899999999999999995
No 41
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=77.86 E-value=7.6 Score=24.97 Aligned_cols=49 Identities=22% Similarity=0.283 Sum_probs=34.3
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEE
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKAT 72 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVF 72 (74)
...|.+.|+= .++ -.+|+++...+||.+.++.+++-.......|.|.|-
T Consensus 115 ~~~l~v~nL~--~~~--------~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~ 163 (306)
T COG0724 115 NNTLFVGNLP--YDV--------TEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVE 163 (306)
T ss_pred CceEEEeCCC--CCC--------CHHHHHHHHHhcCceeEEEeeeccccCccCceEEEE
Confidence 4667777775 222 356889999999999999998876333445666553
No 42
>PF00522 VPR: VPR/VPX protein; InterPro: IPR000012 Human immunodeficiency virus (HIV) is the human retrovirus associated with AIDS (acquired immune deficiency syndrome), and SIV its simian counterpart. Three main groups of primate lentivirus are known, designated Human immunodeficiency virus 1 (HIV-1), Human immunodeficiency virus 2 (HIV-2)/Simian immunodeficiency virus - mac (SIVMAC)/Simian immunodeficiency virus - sm (SIVSM) and Simian immunodeficiency virus - agm (SIVAGM). Simian immunodeficiency virus - mnd (SIVMND) has been suggested to represent a fourth distinct group []. These groups are believed to have diverged from a common ancestor long before the spread of AIDS in humans. Genetic variation in HIV-1 and HIV-2 has been studied extensively, and the nucleotide sequences reported for several strains []. ORF analysis has revealed two open reading frames, yielding the so-called R- and X-ORF proteins, which show a high degree of sequence similarity. Vpx plays a role in nuclear translocation of the viral pre-integration complex (PIC) and is thus required for the virus to infect non-dividing cells. Vpr also plays a role in nuclear translocation of the (PIC) and may target specific host proteins for degradation by the 26S proteasome. It acts by associating with the cellular CUL4A-DDB1 E3 ligase complex through direct interaction with host VPRPB/DCAF-1. This would result in cell cycle arrest or apoptosis in infected cells, creating a favourable environment for maximizing viral expression and production by rendering the HIV-1 LTR transcription more active.; GO: 0019058 viral infectious cycle, 0042025 host cell nucleus; PDB: 1VPC_A 1X9V_B 1ESX_A 1CEU_A 1M8L_A 1DSJ_A 1BDE_A 1DSK_A 1FI0_A.
Probab=73.66 E-value=3.7 Score=25.99 Aligned_cols=18 Identities=22% Similarity=0.350 Sum_probs=15.9
Q ss_pred hhHHHHHHHHHHHHHHhc
Q psy2203 31 EDEYEDICEDIKEECNKY 48 (74)
Q Consensus 31 ~~~~~ei~eDv~eEC~K~ 48 (74)
++|..++.+|+++|+.||
T Consensus 19 ~eWl~~~Leelk~EAvrH 36 (96)
T PF00522_consen 19 NEWLLELLEELKEEAVRH 36 (96)
T ss_dssp CHHHHHHHHHHHHHHHHS
T ss_pred HHHHHHHHHHHHHHHhcc
Confidence 467899999999999886
No 43
>KOG0144|consensus
Probab=69.40 E-value=4.6 Score=32.06 Aligned_cols=34 Identities=18% Similarity=0.340 Sum_probs=25.5
Q ss_pred HHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 37 ICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
-+.||++=|++||.|+.+.|-|-..+ ..-|.+.|
T Consensus 137 te~evr~iFs~fG~Ied~~ilrd~~~-~sRGcaFV 170 (510)
T KOG0144|consen 137 TENEVREIFSRFGHIEDCYILRDPDG-LSRGCAFV 170 (510)
T ss_pred cHHHHHHHHHhhCccchhhheecccc-cccceeEE
Confidence 35699999999999999999996544 23444433
No 44
>PHA02944 hypothetical protein; Provisional
Probab=68.99 E-value=8.7 Score=26.76 Aligned_cols=49 Identities=16% Similarity=0.383 Sum_probs=41.5
Q ss_pred CCCCcccCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcC
Q psy2203 1 MYPPWDRLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 1 ~~~~~d~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G 49 (74)
||-+||.+.+.....|+.=.+-..+++++.+..|+++.+-+|.--.-||
T Consensus 130 ~f~gYdGl~~~sKhtv~k~v~f~~~~~iD~~~L~~el~~~aR~AyNA~~ 178 (180)
T PHA02944 130 MFIGYDGLFADSKHTVIKRVEFYAGEDIDAAQLYNELIEAARHAYNAIG 178 (180)
T ss_pred EEecccccccccceeEEEEEEecCcccccHHHHHHHHHHHHHHHHhccC
Confidence 5778999999888888877777779999999999999999887766665
No 45
>PRK07718 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=67.49 E-value=8.5 Score=25.20 Aligned_cols=37 Identities=24% Similarity=0.401 Sum_probs=29.5
Q ss_pred CCCCCCcCCChhHHHHHHHHHHHHHHhc---CCeEEEEec
Q psy2203 21 NMVTPDELKDEDEYEDICEDIKEECNKY---GIVRSLEIP 57 (74)
Q Consensus 21 Nm~~~~el~~~~~~~ei~eDv~eEC~K~---G~V~~V~Ip 57 (74)
-=.+.+++.+.+-.+.|++|+++...++ |.|.+|.+-
T Consensus 98 ~~~~~~~l~~~~G~~~Lr~el~~~in~~l~~g~V~~Vyft 137 (142)
T PRK07718 98 ADMNAEDFKGKKGLEALKEQLKEKINNLMQEGKVEKVYIT 137 (142)
T ss_pred HcCCHHHhcChhHHHHHHHHHHHHHHHhhccCceEEEEEE
Confidence 3346788888888899999999998765 888888753
No 46
>KOG0149|consensus
Probab=64.89 E-value=17 Score=26.67 Aligned_cols=37 Identities=14% Similarity=0.167 Sum_probs=28.9
Q ss_pred HHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 37 ICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
=.++++.=.++||.|+..+|..-....-.-|.|.|-+
T Consensus 25 ~~~~l~~yFeqfGeI~eavvitd~~t~rskGyGfVTf 61 (247)
T KOG0149|consen 25 HKETLRRYFEQFGEIVEAVVITDKNTGRSKGYGFVTF 61 (247)
T ss_pred chHHHHHHHHHhCceEEEEEEeccCCccccceeeEEe
Confidence 4678899999999999888777665555678888754
No 47
>PLN03213 repressor of silencing 3; Provisional
Probab=64.09 E-value=12 Score=30.56 Aligned_cols=23 Identities=22% Similarity=0.309 Sum_probs=19.3
Q ss_pred HHHHHHHHHhcCCeEEEEecCCC
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~ 60 (74)
.+|++.=.+.||.|.+|.|||-+
T Consensus 24 EDDLravFSeFGsVkdVEIpRET 46 (759)
T PLN03213 24 RDDLLKIFSPMGTVDAVEFVRTK 46 (759)
T ss_pred HHHHHHHHHhcCCeeEEEEeccc
Confidence 46778889999999999999643
No 48
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=60.70 E-value=24 Score=28.54 Aligned_cols=36 Identities=22% Similarity=0.366 Sum_probs=27.0
Q ss_pred CceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhc--CCeEEEEecC
Q psy2203 13 RFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKY--GIVRSLEIPR 58 (74)
Q Consensus 13 ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~--G~V~~V~Ipr 58 (74)
.+++|.+.|+-. +.. .+++++.+++| |.|.+|.+.+
T Consensus 232 ~~k~LfVgNL~~--~~t--------ee~L~~~F~~f~~G~I~rV~~~r 269 (578)
T TIGR01648 232 KVKILYVRNLMT--TTT--------EEIIEKSFSEFKPGKVERVKKIR 269 (578)
T ss_pred cccEEEEeCCCC--CCC--------HHHHHHHHHhcCCCceEEEEeec
Confidence 457788888732 222 35788899999 9999998875
No 49
>PF15513 DUF4651: Domain of unknown function (DUF4651)
Probab=58.43 E-value=34 Score=20.03 Aligned_cols=29 Identities=17% Similarity=0.248 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEecCCCCC
Q psy2203 34 YEDICEDIKEECNKYGIVRSLEIPRPIEN 62 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G~V~~V~Iprp~~~ 62 (74)
.+.|..+||+..+..|.|.=+.|-.....
T Consensus 4 re~i~~~iR~~fs~lG~I~vLYvn~~eS~ 32 (62)
T PF15513_consen 4 REEITAEIRQFFSQLGEIAVLYVNPYESD 32 (62)
T ss_pred HHHHHHHHHHHHHhcCcEEEEEEcccccC
Confidence 46899999999999999999888776654
No 50
>smart00545 JmjN Small domain found in the jumonji family of transcription factors. To date, this domain always co-occurs with the JmjC domain (although the reverse is not true).
Probab=57.04 E-value=7.5 Score=20.89 Aligned_cols=24 Identities=33% Similarity=0.610 Sum_probs=15.8
Q ss_pred CcCCChhHHHHHHHHHHHHHHhcCCeE
Q psy2203 26 DELKDEDEYEDICEDIKEECNKYGIVR 52 (74)
Q Consensus 26 ~el~~~~~~~ei~eDv~eEC~K~G~V~ 52 (74)
+|+.|.- ...+.|+..+.|||.++
T Consensus 10 eEF~Dp~---~yi~~i~~~~~~yGi~K 33 (42)
T smart00545 10 EEFKDPL---AYISKIRPQAEKYGICK 33 (42)
T ss_pred HHHHCHH---HHHHHHHHHHhhCCEEE
Confidence 4444443 44555777899999775
No 51
>PF02375 JmjN: jmjN domain; InterPro: IPR003349 Jumonji protein is required for neural tube formation in mice [].There is evidence of domain swapping within the jumonji family of transcription factors []. This domain is often associated with JmjC (see IPR003347 from INTERPRO).; PDB: 2XML_A 2W2I_C 3DXT_A 3DXU_A 2OX0_B 2OQ6_B 2WWJ_A 2Q8D_A 3PDQ_A 2YBK_A ....
Probab=56.52 E-value=4.1 Score=21.04 Aligned_cols=16 Identities=38% Similarity=0.634 Sum_probs=12.4
Q ss_pred HHHHHHHHHHhcCCeE
Q psy2203 37 ICEDIKEECNKYGIVR 52 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~ 52 (74)
..+.|+.++++||-++
T Consensus 16 yi~~i~~~g~~~Gi~K 31 (34)
T PF02375_consen 16 YISSIEPEGEKYGICK 31 (34)
T ss_dssp HHHHHHHTTGGGSEEE
T ss_pred HHHHHHHHHHHCCEEE
Confidence 4455888999999765
No 52
>COG1864 NUC1 DNA/RNA endonuclease G, NUC1 [Nucleotide transport and metabolism]
Probab=55.85 E-value=31 Score=25.65 Aligned_cols=45 Identities=13% Similarity=0.141 Sum_probs=37.4
Q ss_pred CCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEE
Q psy2203 11 PLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLE 55 (74)
Q Consensus 11 ~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~ 55 (74)
+.......|+||.-.....+...+.+|++-+|+=+.++|.+..|+
T Consensus 137 ~a~~dTF~~TN~~PQ~~~~Nr~~W~~lE~~~R~la~~~~~~~~V~ 181 (281)
T COG1864 137 QANNDTFYMTNMTPQAADLNRKIWLELEDYCRNLARKYGSVVIVF 181 (281)
T ss_pred hcchhhheeecCCCcchhhhhhHHHHHHHHHHHHHHhcCcEEEEE
Confidence 344455679999988877777888999999999999999988876
No 53
>KOG1190|consensus
Probab=53.60 E-value=20 Score=28.44 Aligned_cols=40 Identities=23% Similarity=0.202 Sum_probs=30.7
Q ss_pred CCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCC
Q psy2203 11 PLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 11 ~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~ 60 (74)
..||+||.|+|| |.|.. ++|+-+-|.-||+|+++..-+-.
T Consensus 25 ~~pSkV~HlRnl--p~e~t--------E~elI~Lg~pFG~vtn~~~lkGk 64 (492)
T KOG1190|consen 25 AEPSKVVHLRNL--PWEVT--------EEELISLGLPFGKVTNLLMLKGK 64 (492)
T ss_pred cCCcceeEeccC--Ccccc--------HHHHHHhcccccceeeeeeeccc
Confidence 469999999998 34443 35677788899999999876643
No 54
>PF13043 DUF3903: Domain of unknown function (DUF3903)
Probab=51.95 E-value=12 Score=19.99 Aligned_cols=15 Identities=20% Similarity=0.762 Sum_probs=11.2
Q ss_pred HHHHHHHHHH-hcCCe
Q psy2203 37 ICEDIKEECN-KYGIV 51 (74)
Q Consensus 37 i~eDv~eEC~-K~G~V 51 (74)
-.+.|+.||+ +||+-
T Consensus 9 ai~kvr~eckrrfgkt 24 (40)
T PF13043_consen 9 AIQKVRAECKRRFGKT 24 (40)
T ss_pred HHHHHHHHHHHHhchh
Confidence 3578999995 48863
No 55
>PRK05731 thiamine monophosphate kinase; Provisional
Probab=51.74 E-value=15 Score=26.39 Aligned_cols=31 Identities=32% Similarity=0.608 Sum_probs=23.5
Q ss_pred eeCCCCCCcCCChhHHHHHHHHHHHHHHhcCC
Q psy2203 19 LLNMVTPDELKDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 19 L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
+.++..|+.. +.++.+++.+.|.+.|.+||.
T Consensus 85 ~~~l~~p~~~-~~~~l~~i~~Gi~~~~~~~g~ 115 (318)
T PRK05731 85 LLALALPKDL-DEAWLEALADGLFELADRYGA 115 (318)
T ss_pred EEEEEcCCCC-CHHHHHHHHHHHHHHHHHhCC
Confidence 3555556554 456689999999999999985
No 56
>KOG0148|consensus
Probab=51.54 E-value=18 Score=27.28 Aligned_cols=45 Identities=20% Similarity=0.316 Sum_probs=32.2
Q ss_pred CCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEE
Q psy2203 24 TPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKAT 72 (74)
Q Consensus 24 ~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVF 72 (74)
..++|.-+-.+ +++|+-..+||+|-..+|-|-..+...-|-|.|-
T Consensus 66 fvgdls~eI~~----e~lr~aF~pFGevS~akvirD~~T~KsKGYgFVS 110 (321)
T KOG0148|consen 66 FVGDLSPEIDN----EKLREAFAPFGEVSDAKVIRDMNTGKSKGYGFVS 110 (321)
T ss_pred Eehhcchhcch----HHHHHHhccccccccceEeecccCCcccceeEEe
Confidence 45555544333 4566778999999999999988776667777663
No 57
>PF08776 VASP_tetra: VASP tetramerisation domain; InterPro: IPR014885 Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure []. ; PDB: 1USE_A 1USD_A.
Probab=51.09 E-value=23 Score=19.08 Aligned_cols=16 Identities=25% Similarity=0.439 Sum_probs=11.4
Q ss_pred hHHHHHHHHHHHHHHh
Q psy2203 32 DEYEDICEDIKEECNK 47 (74)
Q Consensus 32 ~~~~ei~eDv~eEC~K 47 (74)
...+||.++|+-|..|
T Consensus 7 ~~KqEIL~EvrkEl~K 22 (40)
T PF08776_consen 7 RLKQEILEEVRKELQK 22 (40)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3457888888887765
No 58
>cd02194 ThiL ThiL (Thiamine-monophosphate kinase) plays a dual role in de novo biosynthesis and in salvage of exogenous thiamine. Thiamine salvage occurs in two steps, with thiamine kinase catalyzing the formation of thiamine phosphate, and ThiL catalyzing the conversion of this intermediate to thiamine pyrophosphate. The N-terminal domain of ThiL binds ATP and is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, FGAM synthase and selenophosphate synthetase (SelD).
Probab=50.09 E-value=16 Score=25.81 Aligned_cols=21 Identities=29% Similarity=0.681 Sum_probs=18.7
Q ss_pred ChhHHHHHHHHHHHHHHhcCC
Q psy2203 30 DEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G~ 50 (74)
+.++.+++...+.+.|++||.
T Consensus 92 ~~~~l~~i~~Gi~~a~~~~g~ 112 (291)
T cd02194 92 DEEWLEEFYRGLAEAADRYGV 112 (291)
T ss_pred CHHHHHHHHHHHHHHHHHcCC
Confidence 566889999999999999985
No 59
>TIGR01379 thiL thiamine-monophosphate kinase. Proteins scoring between the trusted and noise cutoff for this model include short forms from the Thermoplasmas (which lack the N-terminal region) and a highly derived form from Campylobacter jejuni. Eukaryotes lack this enzyme, and add pyrophosphate from ATP to unphosphorylated thiamine in a single step.
Probab=49.25 E-value=17 Score=26.10 Aligned_cols=30 Identities=30% Similarity=0.541 Sum_probs=23.0
Q ss_pred eCCCCCCcCCChhHHHHHHHHHHHHHHhcCC
Q psy2203 20 LNMVTPDELKDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 20 ~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
.++..|++. +..+.+++.+.|.+.|.+||.
T Consensus 84 ~~~~~P~~~-~~~~l~~i~~Gi~~a~~~~g~ 113 (317)
T TIGR01379 84 LSLGLPSDL-DEAWLEAFYDGLFECAKQYGV 113 (317)
T ss_pred EEEEcCCCC-CHHHHHHHHHHHHHHHHHcCC
Confidence 445556554 455689999999999999985
No 60
>PF02037 SAP: SAP domain; InterPro: IPR003034 The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins involved in chromosomal organisation [], including in apoptosis []. In yeast, SAP is found in the most distal N-terminal region of E3 SUMO-protein ligase SIZ1, where it is involved in nuclear localization [].; GO: 0003676 nucleic acid binding; PDB: 2RNN_A 1JEQ_A 2KW9_A 2KVU_A 2DO1_A 1ZBU_B 1ZBH_A 2DO5_A 2RNO_A 1H1J_S ....
Probab=47.31 E-value=18 Score=18.21 Aligned_cols=12 Identities=33% Similarity=0.589 Sum_probs=10.2
Q ss_pred HHHHHHHHHhcC
Q psy2203 38 CEDIKEECNKYG 49 (74)
Q Consensus 38 ~eDv~eEC~K~G 49 (74)
..|++++|.++|
T Consensus 6 v~eLk~~l~~~g 17 (35)
T PF02037_consen 6 VAELKEELKERG 17 (35)
T ss_dssp HHHHHHHHHHTT
T ss_pred HHHHHHHHHHCC
Confidence 468899999988
No 61
>KOG0127|consensus
Probab=46.94 E-value=35 Score=28.12 Aligned_cols=47 Identities=19% Similarity=0.253 Sum_probs=33.7
Q ss_pred eEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 15 SVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 15 ~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
++|.+.|+-- .--.+++.+..++||.|....|-..+.+..+.|.+.|
T Consensus 293 ~tVFvRNL~f----------D~tEEel~~~fskFG~v~ya~iV~~k~T~~skGtAFv 339 (678)
T KOG0127|consen 293 KTVFVRNLPF----------DTTEEELKEHFSKFGEVKYAIIVKDKDTGHSKGTAFV 339 (678)
T ss_pred ceEEEecCCc----------cccHHHHHHHHHhhccceeEEEEeccCCCCcccceEE
Confidence 6677777632 1245788999999999999988777766555565554
No 62
>cd02204 PurL_repeat2 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), second repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM.
Probab=46.32 E-value=37 Score=23.46 Aligned_cols=18 Identities=11% Similarity=0.399 Sum_probs=15.9
Q ss_pred hHHHHHHHHHHHHHHhcC
Q psy2203 32 DEYEDICEDIKEECNKYG 49 (74)
Q Consensus 32 ~~~~ei~eDv~eEC~K~G 49 (74)
.+.+++...+++.|++||
T Consensus 72 ~~l~~~~~g~~~~~~~~g 89 (264)
T cd02204 72 GQLVEAVLGLGDACRALG 89 (264)
T ss_pred HHHHHHHHHHHHHHHHhC
Confidence 457899999999999997
No 63
>KOG0116|consensus
Probab=45.95 E-value=29 Score=27.05 Aligned_cols=33 Identities=15% Similarity=0.224 Sum_probs=25.5
Q ss_pred HHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 39 EDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 39 eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+++++..+||.|++.-|.+.+.+...+..|.|
T Consensus 303 ~~l~~~Fk~FG~Ik~~~I~vr~~~~~~~~fgFV 335 (419)
T KOG0116|consen 303 AELEEVFKQFGPIKEGGIQVRSPGGKNPCFGFV 335 (419)
T ss_pred HHHHHHHhhcccccccceEEeccCCCcCceEEE
Confidence 448889999999999999998754433366776
No 64
>cd02192 PurM-like3 AIR synthase (PurM) related protein, subgroup 3 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain.
Probab=45.83 E-value=29 Score=24.87 Aligned_cols=20 Identities=15% Similarity=0.567 Sum_probs=18.0
Q ss_pred hhHHHHHHHHHHHHHHhcCC
Q psy2203 31 EDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 31 ~~~~~ei~eDv~eEC~K~G~ 50 (74)
.++.+++.+.+++.|++||.
T Consensus 98 ~e~l~~i~~Gi~~a~~~~gi 117 (283)
T cd02192 98 AEAAAQVLEGMRDAAEKFGV 117 (283)
T ss_pred HHHHHHHHHHHHHHHHHcCC
Confidence 47789999999999999984
No 65
>cd07058 BMC_CsoS1 Carboxysome Shell 1 (CsoS1); Bacterial Micro-Compartment (BCM) domain. The cso operon in Halothiobacillus neapolitanus contains the genes involved in alpha carboxysome function including those for the carboxysome shell proteins: CsoS1A, CsoS1B, and CsoS1C. CsoS1A has been shown to form hexameric units which further assemble into the flat facets of the polyhedral carboxysome shell. The structures suggest that the central pores and the gaps between hexamers limit the transport of metabolites into and out of the the carboxysome. Although it has been suggested that other homologous proteins, CsoS1B and CsoS1C, in this family might also form hexamers and play similar functional roles in the construction of carboxysome outer shell at present no experimental evidence directly supports this view.
Probab=44.61 E-value=24 Score=21.66 Aligned_cols=24 Identities=21% Similarity=0.281 Sum_probs=17.2
Q ss_pred HHHHHHHHhcCC--eEEEEecCCCCC
Q psy2203 39 EDIKEECNKYGI--VRSLEIPRPIEN 62 (74)
Q Consensus 39 eDv~eEC~K~G~--V~~V~Iprp~~~ 62 (74)
+.-++.+++.|. +.+..||||++.
T Consensus 57 eag~~~a~~~g~~~v~~~VIp~p~~~ 82 (88)
T cd07058 57 RAGADACERVGDGLVAAHIIARVHSE 82 (88)
T ss_pred HHHHHHHHhcCCeEEEEEEeCCCCHH
Confidence 333555667885 788899999864
No 66
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=43.58 E-value=19 Score=19.57 Aligned_cols=12 Identities=25% Similarity=0.827 Sum_probs=9.9
Q ss_pred HHHHHHHHhcCC
Q psy2203 39 EDIKEECNKYGI 50 (74)
Q Consensus 39 eDv~eEC~K~G~ 50 (74)
++++++|.+||-
T Consensus 9 ~eL~~~L~~~G~ 20 (44)
T smart00540 9 AELRAELKQYGL 20 (44)
T ss_pred HHHHHHHHHcCC
Confidence 468899999985
No 67
>PF14605 Nup35_RRM_2: Nup53/35/40-type RNA recognition motif
Probab=43.35 E-value=43 Score=18.27 Aligned_cols=22 Identities=23% Similarity=0.602 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHhcCCeEEEEec
Q psy2203 36 DICEDIKEECNKYGIVRSLEIP 57 (74)
Q Consensus 36 ei~eDv~eEC~K~G~V~~V~Ip 57 (74)
+..++|..-...||.|+++.++
T Consensus 12 ~~~~~vl~~F~~fGeI~~~~~~ 33 (53)
T PF14605_consen 12 DLAEEVLEHFASFGEIVDIYVP 33 (53)
T ss_pred hHHHHHHHHHHhcCCEEEEEcC
Confidence 4567788888889999999999
No 68
>cd02901 Macro_Poa1p_like Macro domain, Poa1p_like family. The macro domain is a high-affinity ADP-ribose binding module found in a variety of proteins as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Some macro domains recognize poly ADP-ribose as a ligand. Previously identified as displaying an Appr-1"-p (ADP-ribose-1"-monophosphate) processing activity, the macro domain may play roles in distinct ADP-ribose pathways, such as the ADP-ribosylation of proteins, an important post-translational modification which occurs in DNA repair, transcription, chromatin biology, and long-term memory formation, among other processes. Members of this family show similarity to the yeast protein Poa1p, reported to be a phosphatase specific for Appr-1"-p, a tRNA splicing metabolite. Poa1p may play a role in tRNA splicing regulation.
Probab=43.26 E-value=64 Score=20.21 Aligned_cols=45 Identities=18% Similarity=0.324 Sum_probs=29.6
Q ss_pred EEeeCCCCCCcCCChhHHH---HHHHHHHHHHHhcCCeEEEEecCCCCC
Q psy2203 17 LCLLNMVTPDELKDEDEYE---DICEDIKEECNKYGIVRSLEIPRPIEN 62 (74)
Q Consensus 17 v~L~Nm~~~~el~~~~~~~---ei~eDv~eEC~K~G~V~~V~Iprp~~~ 62 (74)
..+.|++++.....+..|. .-...+.+-|.+.+ +.+|.+|+...|
T Consensus 73 ~~I~~~~t~~~~~~~~~~~~l~~~l~~~~~~a~~~~-~~sva~P~iG~G 120 (140)
T cd02901 73 RYIYNLPTKVHYGPKSRYEAIEKSLRELRAHARDNG-IKSVAMPRIGCG 120 (140)
T ss_pred eEEEEeeccCCCCCCCcHHHHHHHHHHHHHHHHHcC-CCEEeeCCCCCc
Confidence 4567777776654444444 44566666677654 789999997654
No 69
>PRK12785 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=42.64 E-value=36 Score=22.83 Aligned_cols=34 Identities=15% Similarity=0.230 Sum_probs=26.3
Q ss_pred CCCCcCCChhHHHHHHHHHHHHHHhc---CCeEEEEe
Q psy2203 23 VTPDELKDEDEYEDICEDIKEECNKY---GIVRSLEI 56 (74)
Q Consensus 23 ~~~~el~~~~~~~ei~eDv~eEC~K~---G~V~~V~I 56 (74)
.+.++|.+.+-.+.|++|+++..... |.|.+|.+
T Consensus 124 ~~~~~L~~~~Gk~~Lr~ei~~~in~~l~~~~V~~VlF 160 (166)
T PRK12785 124 LRPSDLNGSAGLFRLKEELLRRVNVALAPAQVNAVLF 160 (166)
T ss_pred CCHHHhcChHHHHHHHHHHHHHHHhhcCCCceeEEEE
Confidence 45677877777788999999998775 77777765
No 70
>KOG2287|consensus
Probab=41.93 E-value=68 Score=23.89 Aligned_cols=41 Identities=17% Similarity=0.325 Sum_probs=30.9
Q ss_pred CCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecC
Q psy2203 10 PPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPR 58 (74)
Q Consensus 10 ~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Ipr 58 (74)
....-+++.|.++-+.++ ++..++.+|...||.+..+-...
T Consensus 125 ~~~~v~~~FLvG~~~~~~--------~~~~~l~~Ea~~ygDIi~~df~D 165 (349)
T KOG2287|consen 125 RGGRVRVLFLVGLPSNED--------KLNKLLADEARLYGDIIQVDFED 165 (349)
T ss_pred CCCcEEEEEEecCCCcHH--------HHHHHHHHHHHHhCCEEEEeccc
Confidence 344566777777766555 47899999999999999886544
No 71
>PF03748 FliL: Flagellar basal body-associated protein FliL; InterPro: IPR005503 This FliL protein controls the rotational direction of the flagella during chemotaxis []. FliL is a cytoplasmic membrane protein associated with the basal body [].; GO: 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009425 bacterial-type flagellum basal body
Probab=41.69 E-value=47 Score=19.50 Aligned_cols=35 Identities=23% Similarity=0.430 Sum_probs=26.6
Q ss_pred CCCCcCCChhHHHHHHHHHHHHHHh-c--CCeEEEEec
Q psy2203 23 VTPDELKDEDEYEDICEDIKEECNK-Y--GIVRSLEIP 57 (74)
Q Consensus 23 ~~~~el~~~~~~~ei~eDv~eEC~K-~--G~V~~V~Ip 57 (74)
.+.+++.+.+-.+.|++++++...+ + |.|.+|.+-
T Consensus 57 ~~~~~l~~~~g~~~Lk~~l~~~in~~l~~~~V~~V~ft 94 (99)
T PF03748_consen 57 KTAEDLSGPEGKERLKDELKDRINKILGKGKVKDVYFT 94 (99)
T ss_pred CCHHHhcChhhHHHHHHHHHHHHHHhhccCcEEEEEEE
Confidence 4667788877788899999998875 3 578887653
No 72
>PF01906 YbjQ_1: Putative heavy-metal-binding; InterPro: IPR002765 This family of bacterial proteins have not been characterised.; PDB: 3QKB_A 1VR4_D 1Y2I_D 2GTC_C.
Probab=41.27 E-value=64 Score=19.88 Aligned_cols=28 Identities=25% Similarity=0.331 Sum_probs=22.2
Q ss_pred HHHHHHHHHHHHHHhcC--CeEEEEecCCC
Q psy2203 33 EYEDICEDIKEECNKYG--IVRSLEIPRPI 60 (74)
Q Consensus 33 ~~~ei~eDv~eEC~K~G--~V~~V~Iprp~ 60 (74)
.+++..++++++|.+.| .|..+.+.-.+
T Consensus 57 ~r~~A~~~L~~~A~~~GAnAVIgv~~~~~~ 86 (105)
T PF01906_consen 57 AREEALERLKEEAKELGANAVIGVRFDYSS 86 (105)
T ss_dssp HHHHHHHHHHHHHHHTT-SEEEEEEEEEEE
T ss_pred HHHHHHHHHHHHHhhCCCCEEEEEEEEeee
Confidence 46788999999999998 68777766543
No 73
>KOG0117|consensus
Probab=40.23 E-value=30 Score=27.69 Aligned_cols=21 Identities=33% Similarity=0.566 Sum_probs=17.3
Q ss_pred HHHHHHHHHhcCCeEEEEecC
Q psy2203 38 CEDIKEECNKYGIVRSLEIPR 58 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Ipr 58 (74)
+|-++++.++||.|++|+.+|
T Consensus 273 eE~lk~~F~~~G~veRVkk~r 293 (506)
T KOG0117|consen 273 EETLKKLFNEFGKVERVKKPR 293 (506)
T ss_pred HHHHHHHHHhccceEEeeccc
Confidence 345667789999999999997
No 74
>PF12683 DUF3798: Protein of unknown function (DUF3798); InterPro: IPR024258 This entry represents functionally uncharacterised proteins that are found in bacteria. They are typically between 247 and 417 amino acids in length. Most of the proteins in this entry have an N-terminal lipoprotein attachment site. These proteins have distant similarity to periplasmic ligand binding families suggesting that this family has a similar role.; PDB: 3QI7_A.
Probab=40.15 E-value=65 Score=23.95 Aligned_cols=38 Identities=16% Similarity=0.353 Sum_probs=25.8
Q ss_pred CCCcCCChhHHHHHHHHHHHHHHhcC-CeEEEEecCCCCC
Q psy2203 24 TPDELKDEDEYEDICEDIKEECNKYG-IVRSLEIPRPIEN 62 (74)
Q Consensus 24 ~~~el~~~~~~~ei~eDv~eEC~K~G-~V~~V~Iprp~~~ 62 (74)
.|..+. .+....-++.|+++|++.| +-+.+..|.|..+
T Consensus 139 fprhms-~~~l~~Rr~~M~~~C~~lGi~fv~~taPDP~sd 177 (275)
T PF12683_consen 139 FPRHMS-YELLARRRDIMEEACKDLGIKFVEVTAPDPTSD 177 (275)
T ss_dssp ETTGGG-SHHHHHHHHHHHHHHHHCT--EEEEEE---SST
T ss_pred chhhcc-hHHHHHHHHHHHHHHHHcCCeEEEEeCCCCCCC
Confidence 455555 3455667889999999999 7889999999865
No 75
>KOG0108|consensus
Probab=40.03 E-value=40 Score=26.39 Aligned_cols=39 Identities=23% Similarity=0.187 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEE
Q psy2203 34 YEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKAT 72 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVF 72 (74)
|.--+++|.+.|++.|.|.++.+--..++..+.|+|.+.
T Consensus 28 ~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~ 66 (435)
T KOG0108|consen 28 YEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCE 66 (435)
T ss_pred CcccHHHHHHHHhccCccceeeecccccCCCcCceeeEe
Confidence 444667899999999999999977766666778888764
No 76
>KOG0110|consensus
Probab=39.11 E-value=41 Score=28.13 Aligned_cols=40 Identities=23% Similarity=0.348 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHHhcCCeEEEEecCCCCCC-CCCceEEEEe
Q psy2203 34 YEDICEDIKEECNKYGIVRSLEIPRPIENV-DVPGCGKATL 73 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~-~~~gvGkVFv 73 (74)
|.+-+++++....|+|.|+++.|+...... -...+|..||
T Consensus 525 f~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFV 565 (725)
T KOG0110|consen 525 FDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFV 565 (725)
T ss_pred cccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEE
Confidence 345678899999999999999999877643 2344566565
No 77
>PF14234 DUF4336: Domain of unknown function (DUF4336)
Probab=38.41 E-value=50 Score=24.51 Aligned_cols=43 Identities=19% Similarity=0.293 Sum_probs=29.9
Q ss_pred CCceEEEeeC----CCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCC
Q psy2203 12 LRFSVLCLLN----MVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRP 59 (74)
Q Consensus 12 ~ps~vv~L~N----m~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp 59 (74)
.-.+||.|.| +.+|=.+++ |+++.|++--.++|.|.+|+.|..
T Consensus 20 ~RMTVVrL~~G~L~VhSPvapT~-----el~~~l~~L~~~~G~VkyIVaPn~ 66 (285)
T PF14234_consen 20 TRMTVVRLSDGGLWVHSPVAPTP-----ELKAELDELEAQHGPVKYIVAPNK 66 (285)
T ss_pred ceEEEEEECCCCEEEECCCCCCH-----HHHHHHHHHhccCCceeEEEcCCc
Confidence 3345777775 345555544 377777776689999999999963
No 78
>cd00396 PurM-like AIR (aminoimidazole ribonucleotide) synthase related protein. This family includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM (formylglycinamidine ribonucleotide) synthase and Selenophosphate synthetase (SelD). The N-terminal domain of AIR synthase forms the dimer interface of the protein, and is suggested as a putative ATP binding domain.
Probab=37.52 E-value=44 Score=22.50 Aligned_cols=32 Identities=28% Similarity=0.542 Sum_probs=23.7
Q ss_pred eeCCCCCCcCCChhHHHHHHHHHHHHHHhcCC
Q psy2203 19 LLNMVTPDELKDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 19 L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
+.+.+...+..+.++..++..-+++.|+.||.
T Consensus 42 ~~~~l~~~~~~~~~~~~~~~~Gi~~~~~~~gi 73 (222)
T cd00396 42 LLASLSLSNGLEVDILEDVVDGVAEACNQLGV 73 (222)
T ss_pred HhhheccCCCCCHHHHHHHHHHHHHHHHHcCC
Confidence 33444444445677889999999999999985
No 79
>cd03400 Band_7_1 A subgroup of the band 7 domain of flotillin (reggie) like proteins. This subgroup contains proteins similar to stomatin, prohibitin, flotillin, HlfK/C and podicin. Many of these band 7 domain-containing proteins are lipid raft-associated. Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Microdomains formed from flotillin proteins may in addition be dynamic units with their own regulatory functions. Flotillins have been implicated in signal transduction, vesicle trafficking, cytoskeleton rearrangement and are known to interact with a variety of proteins. Stomatin interacts with and regulates members of the degenerin/epithelia Na+ channel family in mechanosensory cells of Caenorhabditis elegans and vertebrate neurons and participates in trafficking of Glut1 glucose transporters. Prohibitin may act as a chaperone for the stabilization of mitochondrial proteins. Prokaryotic H
Probab=37.46 E-value=48 Score=20.32 Aligned_cols=25 Identities=20% Similarity=0.235 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHhcC-CeEEEEecC
Q psy2203 34 YEDICEDIKEECNKYG-IVRSLEIPR 58 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G-~V~~V~Ipr 58 (74)
-+++++.+++++++|| .|.+|.|-+
T Consensus 84 ~~~i~~~l~~~~~~~Gi~v~~v~i~~ 109 (124)
T cd03400 84 ESAIKKELIEEFVGDGLILEEVLLRN 109 (124)
T ss_pred HHHHHHHHHHHhccCCeEEEEEEEec
Confidence 4677888888889999 667776633
No 80
>KOG0124|consensus
Probab=37.24 E-value=67 Score=25.45 Aligned_cols=34 Identities=21% Similarity=0.267 Sum_probs=25.6
Q ss_pred HHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+||+.-.+.||.++++.+-|-+.+..--|.|.+
T Consensus 224 e~DiKSVFEAFG~I~~C~LAr~pt~~~HkGyGfi 257 (544)
T KOG0124|consen 224 ETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFI 257 (544)
T ss_pred HHHHHHHHHhhcceeeEEeeccCCCCCccceeeE
Confidence 3678888888999999999997766544555544
No 81
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=37.03 E-value=35 Score=21.23 Aligned_cols=21 Identities=38% Similarity=0.619 Sum_probs=14.4
Q ss_pred HHHHHHHHHhcCCeEEEEecC
Q psy2203 38 CEDIKEECNKYGIVRSLEIPR 58 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Ipr 58 (74)
.+|+++.|++||.|.-|-.-+
T Consensus 15 re~iK~~f~~~g~V~yVD~~~ 35 (105)
T PF08777_consen 15 REDIKEAFSQFGEVAYVDFSR 35 (105)
T ss_dssp HHHHHHHT-SS--EEEEE--T
T ss_pred HHHHHHHHHhcCCcceEEecC
Confidence 789999999999999887766
No 82
>KOG0339|consensus
Probab=36.77 E-value=27 Score=28.83 Aligned_cols=22 Identities=23% Similarity=0.490 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHhcCCeEEEEe
Q psy2203 35 EDICEDIKEECNKYGIVRSLEI 56 (74)
Q Consensus 35 ~ei~eDv~eEC~K~G~V~~V~I 56 (74)
.++...|..||+|||++.++.+
T Consensus 306 rela~Qi~~eaKkf~K~ygl~~ 327 (731)
T KOG0339|consen 306 RELASQIFSEAKKFGKAYGLRV 327 (731)
T ss_pred HHHHHHHHHHHHHhhhhccceE
Confidence 5899999999999998877654
No 83
>cd02106 Band_7 The band 7 domain of flotillin (reggie) like proteins. This group contains proteins similar to stomatin, prohibitin, flotillin, HlfK/C and podicin. Many of these band 7 domain-containing proteins are lipid raft-associated. Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Microdomains formed from flotillin proteins may in addition be dynamic units with their own regulatory functions. Flotillins have been implicated in signal transduction, vesicle trafficking, cytoskeleton rearrangement and are known to interact with a variety of proteins. Stomatin interacts with and regulates members of the degenerin/epithelia Na+ channel family in mechanosensory cells of Caenorhabditis elegans and vertebrate neurons and participates in trafficking of Glut1 glucose transporters. Prohibitin may act as a chaperone for the stabilization of mitochondrial proteins. Prokaryotic HflK/C plays a role i
Probab=36.63 E-value=64 Score=18.61 Aligned_cols=24 Identities=29% Similarity=0.560 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHhcC-CeEEEEecC
Q psy2203 35 EDICEDIKEECNKYG-IVRSLEIPR 58 (74)
Q Consensus 35 ~ei~eDv~eEC~K~G-~V~~V~Ipr 58 (74)
+++++++.+++++|| .|.++.|-.
T Consensus 82 ~~v~~~l~~~~~~~Gi~i~~v~i~~ 106 (121)
T cd02106 82 AEVREALQEDLDKYGIEVVDVRIKD 106 (121)
T ss_pred HHHHHHHHHHHHhcCCEEEEEEEEe
Confidence 566777777899999 455555543
No 84
>PF08887 GAD-like: GAD-like domain; InterPro: IPR014983 This domain is functionally uncharacterised, but it appears to be distantly related to the GAD domain IPR004115 from INTERPRO.
Probab=35.80 E-value=37 Score=21.52 Aligned_cols=31 Identities=23% Similarity=0.377 Sum_probs=19.3
Q ss_pred CCChhHHHHHHHHHHHHHHhcCCeEEEEecC
Q psy2203 28 LKDEDEYEDICEDIKEECNKYGIVRSLEIPR 58 (74)
Q Consensus 28 l~~~~~~~ei~eDv~eEC~K~G~V~~V~Ipr 58 (74)
+.++++|+++.++.-..=..+.....+.|.|
T Consensus 55 ~vnP~dy~~vl~~~~~~~~~~~~~~~~~ia~ 85 (109)
T PF08887_consen 55 LVNPDDYEDVLDEWLGGTPLFDPDNYIPIAR 85 (109)
T ss_pred EECHHHHHHHHHHHhcCCccccCceEEEEEE
Confidence 4567788888777654323345556666666
No 85
>KOG0122|consensus
Probab=35.79 E-value=69 Score=23.78 Aligned_cols=48 Identities=15% Similarity=0.302 Sum_probs=31.6
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
+..|.++|+ +++..++ |+++-..+||.|.+|.+-+...+...-|.+.|
T Consensus 189 ~~tvRvtNL--sed~~E~--------dL~eLf~~fg~i~rvylardK~TG~~kGFAFV 236 (270)
T KOG0122|consen 189 EATVRVTNL--SEDMRED--------DLEELFRPFGPITRVYLARDKETGLSKGFAFV 236 (270)
T ss_pred cceeEEecC--ccccChh--------HHHHHhhccCccceeEEEEccccCcccceEEE
Confidence 445778876 3344333 44666789999999999997765444444444
No 86
>cd07045 BMC_CcmK_like Carbon dioxide concentrating mechanism K (CcmK)-like proteins, Bacterial Micro-Compartment (BMC) domain. Bacterial micro-compartments are primitive protein-based organelles that sequester specific metabolic pathways in bacterial cells. The prototypical bacterial microcompartment is the carboxysome shell, a bacterial polyhedral organelle which increase the efficiency of CO2 fixation by encapsulating RuBisCO and carbonic anhydrase. They can be divided into two types: alpha-type carboxysomes (alpha-cyanobacteria and proteobacteria) and beta-type carboxysomes (beta-cyanobacteria). Potential functional differences between the two types are not yet fully understood. In addition to these proteins there are several homologous shell proteins including those found in pdu organelles involved in coenzyme B12-dependent degradation of 1,2-propanediol and eut organelles involved in the cobalamin-dependent degradation of ethanolamine. Structure evidence shows that several carbox
Probab=35.65 E-value=85 Score=18.82 Aligned_cols=29 Identities=28% Similarity=0.428 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHhcC-CeEEEEecCCCCC
Q psy2203 34 YEDICEDIKEECNKYG-IVRSLEIPRPIEN 62 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G-~V~~V~Iprp~~~ 62 (74)
-+.=.+..++-+++.| -|.+..||+|++.
T Consensus 50 V~~Av~a~~~~~~~~~~~v~~~vI~~p~~~ 79 (84)
T cd07045 50 VKAAVEAGAAAAERIGELVSSHVIPRPHDE 79 (84)
T ss_pred HHHHHHHHHHHHhccCcEEEEEEeCCCCHH
Confidence 3444555566677776 7778889999864
No 87
>KOG4206|consensus
Probab=35.39 E-value=91 Score=22.50 Aligned_cols=44 Identities=7% Similarity=0.088 Sum_probs=32.1
Q ss_pred CCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCC
Q psy2203 12 LRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIE 61 (74)
Q Consensus 12 ~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~ 61 (74)
.|...+.+.|+=. ...-.|++.-+..=.+.||+|+.|...++..
T Consensus 7 ~pn~TlYInnLne------kI~~~elkrsL~~LFsqfG~ildI~a~kt~K 50 (221)
T KOG4206|consen 7 NPNGTLYINNLNE------KIKKDELKRSLYLLFSQFGKILDISAFKTPK 50 (221)
T ss_pred CCCceEeehhccc------cccHHHHHHHHHHHHHhhCCeEEEEecCCCC
Confidence 4444666776543 3344578888888899999999999987653
No 88
>PF12897 Aminotran_MocR: Alanine-glyoxylate amino-transferase; InterPro: IPR024551 This entry represents a family of putative aminotransferases.; PDB: 3D6K_C 3EZ1_A 3PPL_B.
Probab=34.03 E-value=52 Score=25.91 Aligned_cols=57 Identities=21% Similarity=0.317 Sum_probs=32.1
Q ss_pred EEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 16 VLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 16 vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
-|.--|....-.|.++.....+ -+|-++|++.|.-.++.+.-...--..+|.|-.|+
T Consensus 209 RI~WDNAY~vHhL~~~~~~~~~-~nil~~~~~AGnpdrv~~F~STSKITf~GaGva~~ 265 (425)
T PF12897_consen 209 RIFWDNAYAVHHLYDEEPRDAL-LNILDACAKAGNPDRVYVFASTSKITFPGAGVAFF 265 (425)
T ss_dssp EEEEE-TTTT-BSSSSSS-------HHHHHHHTT-TTSEEEEEESTTTS-TTSS-EEE
T ss_pred EEEeecCceEeeccccccchhh-hHHHHHHHHcCCCCeEEEEecccccccCCcceeee
Confidence 3445666666677444322222 38999999999999999887766555566655554
No 89
>cd06061 PurM-like1 AIR synthase (PurM) related protein, subgroup 1 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain.
Probab=33.55 E-value=52 Score=23.18 Aligned_cols=22 Identities=32% Similarity=0.643 Sum_probs=19.0
Q ss_pred CChhHHHHHHHHHHHHHHhcCC
Q psy2203 29 KDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 29 ~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
.+.++.+++.+-+++.|.+||.
T Consensus 91 ~~~~~l~~~~~Gi~~~~~~~gi 112 (298)
T cd06061 91 TDEEELKAIMREINEAAKELGV 112 (298)
T ss_pred CCHHHHHHHHHHHHHHHHHcCC
Confidence 4567889999999999999984
No 90
>KOG0145|consensus
Probab=33.25 E-value=59 Score=24.64 Aligned_cols=35 Identities=14% Similarity=0.174 Sum_probs=26.7
Q ss_pred HHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 37 ICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
-++++|.-....|.|+++++-|-.-....-|-|.|
T Consensus 54 TqdE~rSLF~SiGeiEScKLvRDKitGqSLGYGFV 88 (360)
T KOG0145|consen 54 TQDELRSLFGSIGEIESCKLVRDKITGQSLGYGFV 88 (360)
T ss_pred CHHHHHHHhhcccceeeeeeeecccccccccccee
Confidence 46788888999999999999997754444566654
No 91
>smart00244 PHB prohibitin homologues. prohibitin homologues
Probab=33.05 E-value=73 Score=19.72 Aligned_cols=25 Identities=32% Similarity=0.553 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHHhcC-CeEEEEecC
Q psy2203 34 YEDICEDIKEECNKYG-IVRSLEIPR 58 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G-~V~~V~Ipr 58 (74)
-+++++.+.+++++|| .+.++.|-.
T Consensus 120 ~~~v~~~l~~~~~~~Gi~i~~v~i~~ 145 (160)
T smart00244 120 SENIREELNERAEAWGIEVEDVEIKD 145 (160)
T ss_pred HHHHHHHHHHHHHhCCCEEEEEEEEe
Confidence 3667778888889999 566666544
No 92
>PF01762 Galactosyl_T: Galactosyltransferase; InterPro: IPR002659 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. Glycosyltransferase family 31 (GH31 from CAZY) comprises enzymes with a number of known activities; N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase (2.4.1.149 from EC); beta-1,3-galactosyltransferase (2.4.1 from EC); fucose-specific beta-1,3-N-acetylglucosaminyltransferase (2.4.1 from EC); globotriosylceramide beta-1,3-GalNAc transferase (2.4.1.79 from EC) [, ].; GO: 0008378 galactosyltransferase activity, 0006486 protein glycosylation, 0016020 membrane
Probab=32.87 E-value=47 Score=21.98 Aligned_cols=23 Identities=30% Similarity=0.485 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHhcCCeEEEEecC
Q psy2203 36 DICEDIKEECNKYGIVRSLEIPR 58 (74)
Q Consensus 36 ei~eDv~eEC~K~G~V~~V~Ipr 58 (74)
++.+.+++|.++||.++......
T Consensus 36 ~~~~~l~~E~~~y~Dil~~d~~D 58 (195)
T PF01762_consen 36 DLQEALQEEAEKYGDILQGDFVD 58 (195)
T ss_pred HHHHHhhhhhhhcCceEeeeccc
Confidence 57777999999999998876533
No 93
>KOG4205|consensus
Probab=32.58 E-value=61 Score=24.24 Aligned_cols=34 Identities=24% Similarity=0.261 Sum_probs=26.8
Q ss_pred HHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 38 CEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 38 ~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
.+|.++..++||.|..+.+.--......-|.|+|
T Consensus 111 e~~~r~yfe~~g~v~~~~~~~d~~~~~~rgFgfv 144 (311)
T KOG4205|consen 111 EEDFKDYFEQFGKVADVVIMYDKTTSRPRGFGFV 144 (311)
T ss_pred hHHHhhhhhccceeEeeEEeecccccccccceee
Confidence 4688899999999999998876665556676665
No 94
>PTZ00370 STEVOR; Provisional
Probab=32.31 E-value=53 Score=24.68 Aligned_cols=37 Identities=22% Similarity=0.461 Sum_probs=31.3
Q ss_pred CCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHH-HhcC
Q psy2203 12 LRFSVLCLLNMVTPDELKDEDEYEDICEDIKEEC-NKYG 49 (74)
Q Consensus 12 ~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC-~K~G 49 (74)
..||.++=....+| ...+|.|.+++.+.+.+|+ +||-
T Consensus 38 ~~sR~L~Ecel~~p-~YdNDpemK~i~d~~n~eaikkyq 75 (296)
T PTZ00370 38 IKSRLLAQTQNHNP-HYHNDPELKEIIDKMNEEAIKKYQ 75 (296)
T ss_pred cceeehhhhhcCCC-CCCCcHHHHHHHHHHhHHHhhhhh
Confidence 45788888888888 8889999999999999987 6674
No 95
>COG2450 Uncharacterized conserved protein [Function unknown]
Probab=32.08 E-value=50 Score=21.81 Aligned_cols=24 Identities=25% Similarity=0.652 Sum_probs=19.5
Q ss_pred CcCC-ChhHHHHHHHHHHHHHHhcC
Q psy2203 26 DELK-DEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 26 ~el~-~~~~~~ei~eDv~eEC~K~G 49 (74)
..|. ++..+..+.+++|..+..+|
T Consensus 72 t~l~~d~~~~~~V~e~lr~~a~~~g 96 (124)
T COG2450 72 TPLERDDDLFERVIEELRDTAEEVG 96 (124)
T ss_pred CCcccChhHHHHHHHHHHHHHHHhC
Confidence 3455 67778999999999999986
No 96
>PRK14105 selenophosphate synthetase; Provisional
Probab=31.97 E-value=51 Score=24.16 Aligned_cols=21 Identities=5% Similarity=0.205 Sum_probs=18.6
Q ss_pred CChhHHHHHHHHHHHHHHhcC
Q psy2203 29 KDEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 29 ~~~~~~~ei~eDv~eEC~K~G 49 (74)
.+.++.++|.+.+.+.|++||
T Consensus 110 ~~~~~l~~i~~Gi~~~~~~~g 130 (345)
T PRK14105 110 LPIEVAKEMLQGFQDFCREND 130 (345)
T ss_pred CCHHHHHHHHHHHHHHHHHhC
Confidence 456788999999999999997
No 97
>PF05184 SapB_1: Saposin-like type B, region 1; InterPro: IPR007856 Synonym(s):cerebroside sulphate activator, CSAct Saposin B is a small non-enzymatic glycoprotein required for the breakdown of cerebroside sulphates (sulphatides) in lysosomes. Saposin B contains three intramolecular disulphide bridges, exists as a dimer and is remarkably heat, protease, and pH stable. The crystal structure of human saposin B reveals an unusual shell-like dimer consisting of a monolayer of alpha-helices enclosing a large hydrophobic cavity. Although the secondary structure of saposin B is similar to that of the known monomeric members of the saposin-like superfamily, the helices are repacked into a different tertiary arrangement to form the homodimer. A comparison of the two forms of the saposin B dimer suggests that extraction of target lipids from membranes involves a conformational change that facilitates access to the inner cavity [].; GO: 0006629 lipid metabolic process; PDB: 1N69_C 1QDM_C 4DDJ_A 2DOB_A 1OF9_A 2Z9A_A 1M12_A 2GTG_A 1SN6_A 2QYP_B ....
Probab=31.89 E-value=31 Score=17.05 Aligned_cols=20 Identities=25% Similarity=0.523 Sum_probs=15.2
Q ss_pred ChhHHHHHHHHHHHHHHhcC
Q psy2203 30 DEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G 49 (74)
++..-++|.+-+...|++.|
T Consensus 20 ~~~t~~~I~~~l~~~C~~lP 39 (39)
T PF05184_consen 20 NNKTEEEIKKALEKACNKLP 39 (39)
T ss_dssp STCHHHHHHHHHHHHHTTSC
T ss_pred cCccHHHHHHHHHHHHhhCc
Confidence 44556789999999998765
No 98
>PRK00943 selenophosphate synthetase; Provisional
Probab=31.23 E-value=39 Score=24.95 Aligned_cols=22 Identities=18% Similarity=0.359 Sum_probs=19.0
Q ss_pred CChhHHHHHHHHHHHHHHhcCC
Q psy2203 29 KDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 29 ~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
.++++.++|.+.+.+.|++||.
T Consensus 113 ~~~~~l~~i~~Gi~~~~~~~gv 134 (347)
T PRK00943 113 LPPEVAREVLEGGRAACRQAGI 134 (347)
T ss_pred CcHHHHHHHHHHHHHHHHHcCC
Confidence 4567889999999999999974
No 99
>cd02196 PurM PurM (Aminoimidazole Ribonucleotide [AIR] synthetase), one of eleven enzymes required for purine biosynthesis, catalyzes the conversion of formylglycinamide ribonucleotide (FGAM) and ATP to AIR, ADP, and Pi, the fifth step in de novo purine biosynthesis. The N-terminal domain of PurM is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, selenophosphate synthetase (SelD), and FGAM synthase and is thought to bind ATP.
Probab=31.15 E-value=69 Score=22.87 Aligned_cols=21 Identities=29% Similarity=0.537 Sum_probs=18.9
Q ss_pred ChhHHHHHHHHHHHHHHhcCC
Q psy2203 30 DEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G~ 50 (74)
++++..++...+.+.|.+||.
T Consensus 77 ~~~~l~~~~~Gi~~~~~~~gi 97 (297)
T cd02196 77 DPEVAAEIVKGIAEGCRQAGC 97 (297)
T ss_pred CHHHHHHHHHHHHHHHHHcCC
Confidence 677889999999999999974
No 100
>TIGR01478 STEVOR variant surface antigen, stevor family. This model represents the stevor branch of the rifin/stevor family (pfam02009) of predicted variant surface antigens as found in Plasmodium falciparum. This model is based on a set of stevor sequences kindly provided by Matt Berriman from the Sanger Center. This is a global model and assesses a penalty for incomplete sequence. Additional fragmentary sequences may be found with the fragment model and a cutoff of 8 bits.
Probab=30.69 E-value=66 Score=24.19 Aligned_cols=37 Identities=22% Similarity=0.460 Sum_probs=30.2
Q ss_pred CCceEEEeeCC-CCCCcCCChhHHHHHHHHHHHHH-HhcC
Q psy2203 12 LRFSVLCLLNM-VTPDELKDEDEYEDICEDIKEEC-NKYG 49 (74)
Q Consensus 12 ~ps~vv~L~Nm-~~~~el~~~~~~~ei~eDv~eEC-~K~G 49 (74)
..||.++=... .+| ...+|.|.+++.+.+.+|+ +||-
T Consensus 38 ~~sR~L~Ecel~~~p-~Y~nDpEmK~iid~~n~eaikkyq 76 (295)
T TIGR01478 38 TKSRLLAEIQRPKNP-HYHNDPELKEIIDKLNEEAIKKYQ 76 (295)
T ss_pred ccceehhhhccccCC-CCCCcHHHHHHHHHHhHHHhhhhh
Confidence 45778777777 777 7888999999999999987 6674
No 101
>cd03506 Delta6-FADS-like The Delta6 Fatty Acid Desaturase (Delta6-FADS)-like CD includes the integral-membrane enzymes: delta-4, delta-5, delta-6, delta-8, delta-8-sphingolipid, and delta-11 desaturases found in vertebrates, higher plants, fungi, and bacteria. These desaturases are required for the synthesis of highly unsaturated fatty acids (HUFAs), which are mainly esterified into phospholipids and contribute to maintaining membrane fluidity. While HUFAs may be required for cold tolerance in bacteria, plants and fish, the primary role of HUFAs in mammals is cell signaling. These enzymes are described as front-end desaturases because they introduce a double bond between the pre-exiting double bond and the carboxyl (front) end of the fatty acid. Various substrates are involved, with both acyl-coenzyme A (CoA) and acyl-lipid desaturases present in this CD. Acyl-lipid desaturases are localized in the membranes of cyanobacterial thylakoid, plant endoplasmic reticulum (ER), and plastid; an
Probab=30.44 E-value=47 Score=22.33 Aligned_cols=17 Identities=29% Similarity=0.673 Sum_probs=15.0
Q ss_pred HHHHHHHHHHHHHHhcC
Q psy2203 33 EYEDICEDIKEECNKYG 49 (74)
Q Consensus 33 ~~~ei~eDv~eEC~K~G 49 (74)
.|.++..-||+-|+|||
T Consensus 187 ~l~~~~~~v~~~~~~~g 203 (204)
T cd03506 187 NYPKVAPLVRELCKKHG 203 (204)
T ss_pred hHHHHHHHHHHHHHHhC
Confidence 46789999999999998
No 102
>KOG0145|consensus
Probab=30.39 E-value=58 Score=24.69 Aligned_cols=49 Identities=18% Similarity=0.283 Sum_probs=33.2
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEE
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKAT 72 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVF 72 (74)
.-|+.+.|+-...| +. + +-+-..-||.|.+|+|-|--.++..-|.|.|-
T Consensus 278 g~ciFvYNLspd~d---e~----~---LWQlFgpFGAv~nVKvirD~ttnkCKGfgFVt 326 (360)
T KOG0145|consen 278 GWCIFVYNLSPDAD---ES----I---LWQLFGPFGAVTNVKVIRDFTTNKCKGFGFVT 326 (360)
T ss_pred eeEEEEEecCCCch---Hh----H---HHHHhCcccceeeEEEEecCCcccccceeEEE
Confidence 34777888755433 11 1 22346789999999999977666677777764
No 103
>cd03403 Band_7_stomatin_like Band_7_stomatin_like: A subgroup of the band 7 domain of flotillin (reggie) like proteins similar to stomatin and podicin (two lipid raft-associated integral membrane proteins). Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Stomatin is widely expressed and, highly expressed in red blood cells. It localizes predominantly to the plasma membrane and to intracellular vesicles of the endocytic pathway, where it is present in higher order homo-oligomeric complexes (of between 9 and 12 monomers). Stomatin interacts with and regulates members of the degenerin/epithelia Na+ channel family in mechanosensory cells of Caenorhabditis elegans and vertebrate neurons and, is implicated in trafficking of Glut1 glucose transporters. Prohibitin is a mitochondrial inner-membrane protein hypothesized to act as a chaperone for the stabilization of mitochondrial proteins. Podicin local
Probab=30.09 E-value=74 Score=21.22 Aligned_cols=24 Identities=25% Similarity=0.473 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHhcC-CeEEEEecC
Q psy2203 35 EDICEDIKEECNKYG-IVRSLEIPR 58 (74)
Q Consensus 35 ~ei~eDv~eEC~K~G-~V~~V~Ipr 58 (74)
++|++.+++++.+|| .|.++.|-+
T Consensus 115 ~~i~~~l~~~l~~~Gi~v~~v~i~~ 139 (215)
T cd03403 115 AELVEILDEATDPWGVKVERVEIKD 139 (215)
T ss_pred HHHHHHHHHHHhccCeEEEEEEEee
Confidence 678888888899999 455555543
No 104
>COG0309 HypE Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=29.99 E-value=78 Score=24.18 Aligned_cols=30 Identities=23% Similarity=0.176 Sum_probs=22.8
Q ss_pred eCCCCCCcCCChhHHHHHHHHHHHHHHhcC
Q psy2203 20 LNMVTPDELKDEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 20 ~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G 49 (74)
+.-+..++=...++++.|..+|.++|+++|
T Consensus 93 ~~~lil~eg~~~e~l~~i~~si~e~a~~~G 122 (339)
T COG0309 93 SVGLILPEGLPIEDLERILKSIDEEAEEAG 122 (339)
T ss_pred eEeEecCCCCCHHHHHHHHHHHHHHHHHcC
Confidence 333444444566789999999999999998
No 105
>PF01545 Cation_efflux: Cation efflux family; InterPro: IPR002524 Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are considered to be efflux pumps that remove these ions from cells [, ], however others are implicated in ion uptake []. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3BYP_B 3BYR_A 3H90_B 2QFI_B 2ZZT_A.
Probab=29.36 E-value=1.1e+02 Score=21.13 Aligned_cols=27 Identities=22% Similarity=0.305 Sum_probs=20.7
Q ss_pred hhHHHHHHHHHHHHHH-hcCCeEEEEec
Q psy2203 31 EDEYEDICEDIKEECN-KYGIVRSLEIP 57 (74)
Q Consensus 31 ~~~~~ei~eDv~eEC~-K~G~V~~V~Ip 57 (74)
-.+..++.+++++++. +|+.|.++.|.
T Consensus 250 v~~~~~i~~~i~~~l~~~~~~i~~v~I~ 277 (284)
T PF01545_consen 250 VEEAHEIRERIEKRLREKFPGIYDVTIH 277 (284)
T ss_dssp HHHHHHHHHHHHHHHHHHSTTCEEEEEE
T ss_pred HHHHHHHHHHHHHHHHHhCCCcEEEEEE
Confidence 3456789999999985 47899887653
No 106
>TIGR00476 selD selenium donor protein. In prokaryotes, the incorporation of selenocysteine as the 21st amino acid, encoded by TGA, requires several elements: SelC is the tRNA itself, SelD acts as a donor of reduced selenium, SelA modifies a serine residue on SelC into selenocysteine, and SelB is a selenocysteine-specific translation elongation factor. 3-prime or 5-prime non-coding elements of mRNA have been found as probable structures for directing selenocysteine incorporation. This model describes SelD, known as selenophosphate synthetase, selenium donor protein, and selenide,water dikinase. SelD provides reduced selenium for the selenium transferase SelA. This protein itself contains selenocysteine in many species; any sequence not aligning to the beginning of the model is likely have selenocysteine residue incorrectly interpreted as a stop codon upstream of the given sequence. The SelD protein also provides selenophosphate for the enzyme tRNA 2-selenouridine synthase, which catalyz
Probab=29.19 E-value=69 Score=23.48 Aligned_cols=22 Identities=14% Similarity=0.401 Sum_probs=19.0
Q ss_pred CChhHHHHHHHHHHHHHHhcCC
Q psy2203 29 KDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 29 ~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
.+.++.+++.+.+.+.|++||.
T Consensus 114 ~~~~~l~~i~~Gi~~~~~~~g~ 135 (347)
T TIGR00476 114 LTIEVMREVIQGFKDACREAGT 135 (347)
T ss_pred CCHHHHHHHHHHHHHHHHHcCC
Confidence 3467889999999999999984
No 107
>PF14088 DUF4268: Domain of unknown function (DUF4268)
Probab=29.18 E-value=1.5e+02 Score=18.71 Aligned_cols=41 Identities=22% Similarity=0.388 Sum_probs=29.4
Q ss_pred cccCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhc
Q psy2203 5 WDRLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKY 48 (74)
Q Consensus 5 ~d~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~ 48 (74)
|.++-....||+.+....++ +.+.+.+.++.+=+.+.+.++
T Consensus 90 W~~l~~kk~sRI~~~~~~~~---~~d~~~W~~~~~w~~~~m~~f 130 (140)
T PF14088_consen 90 WERLDDKKASRIYVYKEGVD---IFDEEDWPEIIDWLVENMIKF 130 (140)
T ss_pred eeeCCCCceEEEEEEccCCC---CcCHHHHHHHHHHHHHHHHHH
Confidence 66666677889877776666 556777777777777776664
No 108
>PF08285 DPM3: Dolichol-phosphate mannosyltransferase subunit 3 (DPM3); InterPro: IPR013174 This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex [].
Probab=29.16 E-value=50 Score=20.39 Aligned_cols=21 Identities=19% Similarity=0.370 Sum_probs=14.1
Q ss_pred CCCCCCcCCChhHHHHHHHHHHH
Q psy2203 21 NMVTPDELKDEDEYEDICEDIKE 43 (74)
Q Consensus 21 Nm~~~~el~~~~~~~ei~eDv~e 43 (74)
++++-.|- ++.|+||++||+|
T Consensus 59 ~v~tFnDc--peA~~eL~~eI~e 79 (91)
T PF08285_consen 59 GVATFNDC--PEAAKELQKEIKE 79 (91)
T ss_pred hhhccCCC--HHHHHHHHHHHHH
Confidence 44444442 5678999999876
No 109
>TIGR02413 Bac_small_yrzI Bacillus tandem small hypothetical proetin. Members of this family are very small proteins, about 47 residues each, in the genus Bacillus. Single members are found in Bacillus subtilis and Bacillus halodurans, but arrays of six in tandem in Bacillus cereus and Bacillus anthracis. An EIxxE motif present in most members of this family resembles cleavage sites by the germination protease GPR in a number small, acid-soluble spore proteins (SASP). A role in sporulation is possible.
Probab=28.41 E-value=34 Score=18.85 Aligned_cols=29 Identities=24% Similarity=0.452 Sum_probs=23.5
Q ss_pred eCCCCCCcCCChhHHHHHHHHHHHHHHhc
Q psy2203 20 LNMVTPDELKDEDEYEDICEDIKEECNKY 48 (74)
Q Consensus 20 ~Nm~~~~el~~~~~~~ei~eDv~eEC~K~ 48 (74)
++-.|.+|+.-+..++-+.++|++-=.+|
T Consensus 14 Kr~~S~~Ei~~eqq~k~~~deik~rq~~Y 42 (46)
T TIGR02413 14 KRKLSEAEIEREQQIEKIMDEVKERQSLY 42 (46)
T ss_pred eccCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678888888888999999999986665
No 110
>cd02197 HypE HypE (Hydrogenase expression/formation protein). HypE is involved in Ni-Fe hydrogenase biosynthesis. HypE dehydrates its own carbamoyl moiety in an ATP-dependent process to yield the enzyme thiocyanate. The N-terminal domain of HypE is related to the ATP-binding domains of the AIR synthases, selenophosphate synthetase (SelD), and FGAM synthase and is thought to bind ATP.
Probab=28.07 E-value=69 Score=22.78 Aligned_cols=20 Identities=20% Similarity=0.385 Sum_probs=17.8
Q ss_pred ChhHHHHHHHHHHHHHHhcC
Q psy2203 30 DEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G 49 (74)
+.++.+++.+.+.+.|++||
T Consensus 91 ~~~~l~~~~~Gi~~~~~~~g 110 (293)
T cd02197 91 PLEDLERIVKSMAEAAREAG 110 (293)
T ss_pred CHHHHHHHHHHHHHHHHHcC
Confidence 35678999999999999997
No 111
>PF10087 DUF2325: Uncharacterized protein conserved in bacteria (DUF2325); InterPro: IPR016772 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=27.59 E-value=74 Score=18.95 Aligned_cols=27 Identities=26% Similarity=0.411 Sum_probs=20.4
Q ss_pred ceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCC
Q psy2203 14 FSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 14 s~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
-=||++++.++-. ....++++|+|+|.
T Consensus 50 D~VIv~t~~vsH~----------~~~~vk~~akk~~i 76 (97)
T PF10087_consen 50 DLVIVFTDYVSHN----------AMWKVKKAAKKYGI 76 (97)
T ss_pred CEEEEEeCCcChH----------HHHHHHHHHHHcCC
Confidence 4467777766643 67789999999996
No 112
>KOG1031|consensus
Probab=27.21 E-value=1e+02 Score=26.22 Aligned_cols=47 Identities=21% Similarity=0.266 Sum_probs=36.8
Q ss_pred CCcccCCCCCCceEEEeeCCCCCCcCC-ChhHHHHHHHHHHHHHHhcC
Q psy2203 3 PPWDRLVPPLRFSVLCLLNMVTPDELK-DEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 3 ~~~d~~~~~~ps~vv~L~Nm~~~~el~-~~~~~~ei~eDv~eEC~K~G 49 (74)
|.|-...++..|--+.=.|-|+.+|-+ .|.|+.||..+|+.-+...|
T Consensus 447 pafvgimgntrsyklldwnsfnsdepe~rdawwaeir~eikshakalg 494 (1169)
T KOG1031|consen 447 PAFVGIMGNTRSYKLLDWNSFNSDEPETRDAWWAEIRMEIKSHAKALG 494 (1169)
T ss_pred cchhccccccccceecccccccCCCcchhhHHHHHHHHHHHHHHHhcC
Confidence 555566666666667778999888854 67899999999999988877
No 113
>smart00513 SAP Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation.
Probab=27.16 E-value=53 Score=16.24 Aligned_cols=12 Identities=33% Similarity=0.625 Sum_probs=9.7
Q ss_pred HHHHHHHHHhcC
Q psy2203 38 CEDIKEECNKYG 49 (74)
Q Consensus 38 ~eDv~eEC~K~G 49 (74)
..+++++|..+|
T Consensus 6 ~~~Lk~~l~~~g 17 (35)
T smart00513 6 VSELKDELKKRG 17 (35)
T ss_pred HHHHHHHHHHcC
Confidence 357889999887
No 114
>PRK07021 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=26.98 E-value=89 Score=20.65 Aligned_cols=42 Identities=24% Similarity=0.383 Sum_probs=29.5
Q ss_pred eEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhc---C----CeEEEEe
Q psy2203 15 SVLCLLNMVTPDELKDEDEYEDICEDIKEECNKY---G----IVRSLEI 56 (74)
Q Consensus 15 ~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~---G----~V~~V~I 56 (74)
+++.+..=-+.++|.+.+-.+.|++|+++...+. | .|.+|.+
T Consensus 108 ~ii~~Ls~k~~~~L~~~eGk~~Lk~ei~~~in~~l~~~~~~~~V~~VlF 156 (162)
T PRK07021 108 RLLLLLSRKHAAELATEEGKQKLAAEIKQTLSQPLVPGQPPQVVTDVLF 156 (162)
T ss_pred HHHHHHhcCCHHHhcCHHHHHHHHHHHHHHHHHHHhccCCCCceeEEee
Confidence 3444444557778888888888999999998763 2 4777654
No 115
>cd03399 Band_7_flotillin Band_7_flotillin: a subgroup of the band 7 domain of flotillin (reggie) like proteins. This subgroup contains proteins similar to stomatin, prohibitin, flotillin, HlfK/C and podicin. These two proteins are lipid raft-associated. Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Microdomains formed from flotillin proteins may in addition be dynamic units with their own regulatory functions. Flotillins have been implicated in signal transduction, vesicle trafficking, cytoskeleton rearrangement and, interact with a variety of proteins. Flotillins may play a role in the progression of prion disease, in the pathogenesis of neurodegenerative diseases such as Parkinson's and Alzheimer's disease and, in cancer invasion and metastasis.
Probab=26.77 E-value=1.1e+02 Score=18.85 Aligned_cols=25 Identities=28% Similarity=0.349 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHhcC-CeEEEEecCC
Q psy2203 35 EDICEDIKEECNKYG-IVRSLEIPRP 59 (74)
Q Consensus 35 ~ei~eDv~eEC~K~G-~V~~V~Iprp 59 (74)
.+|++.+.++.++|| .|.++.|-+-
T Consensus 89 ~~i~~~v~~~~~~~Gi~i~~v~I~~i 114 (128)
T cd03399 89 EQVQEVVAPDLNKMGLELDSFTIKDI 114 (128)
T ss_pred HHHHHHHHHHHHHCCCEEEEEeeEEe
Confidence 678888888899999 6666766443
No 116
>COG1669 Predicted nucleotidyltransferases [General function prediction only]
Probab=26.57 E-value=86 Score=19.77 Aligned_cols=26 Identities=35% Similarity=0.533 Sum_probs=19.4
Q ss_pred hhHHHHHHHHHHHHHH-hcCCeEEEEec
Q psy2203 31 EDEYEDICEDIKEECN-KYGIVRSLEIP 57 (74)
Q Consensus 31 ~~~~~ei~eDv~eEC~-K~G~V~~V~Ip 57 (74)
...++++...++.+|. ||| |.++.|+
T Consensus 4 ~~~~~~~lr~~~~~l~~k~g-v~~~~vF 30 (97)
T COG1669 4 VSELKKILRKIKPELKEKYG-VKRVAVF 30 (97)
T ss_pred HHHHHHHHHHHHHHHHHHhC-CceEEEe
Confidence 4567788999999998 676 5666554
No 117
>PF08663 HalX: HalX domain; InterPro: IPR013971 HalX is a protein of unknown function, previously mis-annotated as HoxA-like transcriptional regulator. It is C-terminal to a CheY-like superfamily domain and suggests a role as a response regulator.
Probab=26.39 E-value=39 Score=19.99 Aligned_cols=20 Identities=30% Similarity=0.699 Sum_probs=15.0
Q ss_pred CCCcCCChhHHHHHHHHHHH
Q psy2203 24 TPDELKDEDEYEDICEDIKE 43 (74)
Q Consensus 24 ~~~el~~~~~~~ei~eDv~e 43 (74)
+..+|.+.++|.+|.+.|.+
T Consensus 28 ~~~eL~~seeY~eL~~ri~~ 47 (71)
T PF08663_consen 28 SEAELEESEEYQELEDRIEE 47 (71)
T ss_pred CHHHHhccHHHHHHHHHHHH
Confidence 34578888999988877654
No 118
>PRK05385 phosphoribosylaminoimidazole synthetase; Provisional
Probab=25.99 E-value=95 Score=22.53 Aligned_cols=21 Identities=24% Similarity=0.504 Sum_probs=18.5
Q ss_pred ChhHHHHHHHHHHHHHHhcCC
Q psy2203 30 DEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G~ 50 (74)
++++.+++.+.+.+.|++||.
T Consensus 109 ~~~~~~~~~~Gi~~~~~~~g~ 129 (327)
T PRK05385 109 DPEVAAQVVKGIAEGCEQAGC 129 (327)
T ss_pred CHHHHHHHHHHHHHHHHHcCC
Confidence 467889999999999999973
No 119
>cd02195 SelD Selenophosphate synthetase (SelD) catalyzes the conversion of selenium to selenophosphate which is required by a number of bacterial, archaeal and eukaryotic organisms for synthesis of Secys-tRNA, the precursor of selenocysteine in selenoenzymes. The N-terminal domain of SelD is related to the ATP-binding domains of hydrogen expression/formation protein HypE, the AIR synthases, and FGAM synthase and is thought to bind ATP.
Probab=25.47 E-value=76 Score=22.60 Aligned_cols=21 Identities=19% Similarity=0.275 Sum_probs=18.3
Q ss_pred ChhHHHHHHHHHHHHHHhcCC
Q psy2203 30 DEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G~ 50 (74)
+.++.+++.+.+.+.|++||.
T Consensus 108 ~~~~l~~~~~Gi~~~~~~~g~ 128 (287)
T cd02195 108 QEEVLREILAGGKDKLREAGA 128 (287)
T ss_pred hHHHHHHHHHHHHHHHHHcCC
Confidence 566789999999999999973
No 120
>COG3146 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.26 E-value=1.2e+02 Score=23.52 Aligned_cols=55 Identities=20% Similarity=0.276 Sum_probs=36.5
Q ss_pred CCCCcccCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCC-eEEEEecC
Q psy2203 1 MYPPWDRLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGI-VRSLEIPR 58 (74)
Q Consensus 1 ~~~~~d~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~-V~~V~Ipr 58 (74)
.||||...+|=+|.+=-.|..- +....++-.+-+...+.+.|.+.|- =.||.-+.
T Consensus 103 YYPKll~~vPFTPvtG~RlL~~---~~~d~~~~~~~L~~~l~~~~~~~glSS~Hv~F~~ 158 (387)
T COG3146 103 YYPKLLCAVPFTPVTGPRLLAR---DGEDEEEVRQALLAGLDELCEQSGLSSAHVTFVD 158 (387)
T ss_pred ccchhhcCCCCCCCCCceeecC---ccccHHHHHHHHHHHHHHHHHhcCCCceeEecCC
Confidence 3899999988877665555421 2222234467899999999999993 33444433
No 121
>cd03408 Band_7_5 A subgroup of the band 7 domain of flotillin (reggie) like proteins. This subgroup contains proteins similar to stomatin, prohibitin, flotillin, HlfK/C and podicin. Many of these band 7 domain-containing proteins are lipid raft-associated. Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Microdomains formed from flotillin proteins may in addition be dynamic units with their own regulatory functions. Flotillins have been implicated in signal transduction, vesicle trafficking, cytoskeleton rearrangement and are known to interact with a variety of proteins. Stomatin interacts with and regulates members of the degenerin/epithelia Na+ channel family in mechanosensory cells of Caenorhabditis elegans and vertebrate neurons and participates in trafficking of Glut1 glucose transporters. Prohibitin may act as a chaperone for the stabilization of mitochondrial proteins. Prokaryotic H
Probab=25.22 E-value=1.1e+02 Score=20.08 Aligned_cols=24 Identities=17% Similarity=0.212 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHhcC-CeEEEEecC
Q psy2203 35 EDICEDIKEECNKYG-IVRSLEIPR 58 (74)
Q Consensus 35 ~ei~eDv~eEC~K~G-~V~~V~Ipr 58 (74)
+++++.+.++.++|| .|.++.|-.
T Consensus 168 ~~v~~~l~~~~~~~Gi~i~~v~I~~ 192 (207)
T cd03408 168 KAVREALAPWFASFGLELVSVYIES 192 (207)
T ss_pred HHHHHHHHHHHHhcCcEEEEEEEEe
Confidence 567778888889999 666666543
No 122
>cd07607 BAR_SH3P_plant The Bin/Amphiphysin/Rvs (BAR) domain of the plant SH3 domain-containing proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. This group is composed of proteins with similarity to Arabidopsis thaliana SH3 domain-containing proteins 1 (SH3P1) and 2 (SH3P2). SH3P1 is involved in the trafficking of clathrin-coated vesicles. It is localized at the plasma membrane and is associated with vesicles of the trans-Golgi network. Yeast complementation studies reveal that SH3P1 has similar functions to the Saccharomyces cerevisiae Rvs167p, which is involved in endocytosis and actin cytoskeletal arrangement. Members of this group contain an N-terminal BAR domain and a C-terminal SH3 domain. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be i
Probab=25.19 E-value=58 Score=23.28 Aligned_cols=17 Identities=35% Similarity=0.730 Sum_probs=13.7
Q ss_pred HHHHHHHHHHHHhcCCe
Q psy2203 35 EDICEDIKEECNKYGIV 51 (74)
Q Consensus 35 ~ei~eDv~eEC~K~G~V 51 (74)
-||-.-+-|.|.|||.=
T Consensus 35 ~Ei~~KlaeDc~KYG~e 51 (209)
T cd07607 35 LEIGTKLAEDCKKYGSE 51 (209)
T ss_pred HHHHHHHHHHHHHhccC
Confidence 46777888999999963
No 123
>smart00596 PRE_C2HC PRE_C2HC domain.
Probab=24.93 E-value=1.4e+02 Score=17.72 Aligned_cols=25 Identities=16% Similarity=0.074 Sum_probs=20.1
Q ss_pred HHHHHHHHhcC-CeEEEEecCCCCCC
Q psy2203 39 EDIKEECNKYG-IVRSLEIPRPIENV 63 (74)
Q Consensus 39 eDv~eEC~K~G-~V~~V~Iprp~~~~ 63 (74)
+.|+++.+++| .|..|.-++...+.
T Consensus 2 ~~I~~~L~~~G~~v~~i~~m~~~~~r 27 (69)
T smart00596 2 SQIEEALKDIGFPVLFIHNMLNRDTK 27 (69)
T ss_pred HHHHHHHHHcCCceeEEEcccccCCC
Confidence 46888999999 78888888877753
No 124
>PRK08455 fliL flagellar basal body-associated protein FliL; Reviewed
Probab=24.74 E-value=1e+02 Score=21.07 Aligned_cols=35 Identities=17% Similarity=0.373 Sum_probs=25.0
Q ss_pred CCCCcCCChhHHHHHHHHHHHHHHhc---CCeEEEEec
Q psy2203 23 VTPDELKDEDEYEDICEDIKEECNKY---GIVRSLEIP 57 (74)
Q Consensus 23 ~~~~el~~~~~~~ei~eDv~eEC~K~---G~V~~V~Ip 57 (74)
-+.+||.+.+-.+.|++++++...++ |.|.+|.+-
T Consensus 140 kt~~dL~t~~Gk~~Lk~ei~~~iN~~L~~g~V~~VyFT 177 (182)
T PRK08455 140 KTVEEVSTNKGKERLKDEIVGKLNEFLIDGFIKNVFFT 177 (182)
T ss_pred CCHHHhcCHHHHHHHHHHHHHHHHHHhccCceeEEEeE
Confidence 45567777776777888888877554 778887654
No 125
>KOG3298|consensus
Probab=24.73 E-value=79 Score=21.99 Aligned_cols=36 Identities=28% Similarity=0.514 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHH-hcCCeEEEEecCCC-CCCCCCceEE
Q psy2203 35 EDICEDIKEECN-KYGIVRSLEIPRPI-ENVDVPGCGK 70 (74)
Q Consensus 35 ~ei~eDv~eEC~-K~G~V~~V~Iprp~-~~~~~~gvGk 70 (74)
.-|++++++-|. +||-+..|..-..- .+.-.+|-|.
T Consensus 28 ~~L~~~veg~ctg~~Gyvi~vt~ld~Ig~g~I~~~~G~ 65 (170)
T KOG3298|consen 28 RKLLAEVEGKCTGKYGYVIAVTTLDNIGEGRIRPGTGF 65 (170)
T ss_pred HHHHHHhhccccccccEEEEEEEhhhccCCccccCCce
Confidence 457888888896 89999999855443 3333344443
No 126
>PHA02767 hypothetical protein; Provisional
Probab=24.68 E-value=34 Score=21.56 Aligned_cols=10 Identities=70% Similarity=1.215 Sum_probs=8.4
Q ss_pred HHHHHHHHhc
Q psy2203 39 EDIKEECNKY 48 (74)
Q Consensus 39 eDv~eEC~K~ 48 (74)
-|++.||.||
T Consensus 3 idikn~Cr~~ 12 (101)
T PHA02767 3 IDIKNECRKY 12 (101)
T ss_pred cchHHHHHHH
Confidence 4789999987
No 127
>TIGR02124 hypE hydrogenase expression/formation protein HypE. This family contains HypE (or HupE), a protein required for expression of catalytically active hydrogenase in many systems. It appears to be an accessory protein involved in maturation rather than a regulatory protein involved in expression. HypE shows considerable homology to the thiamine-monophosphate kinase ThiL (TIGR01379) and other enzymes.
Probab=24.36 E-value=87 Score=22.60 Aligned_cols=21 Identities=24% Similarity=0.449 Sum_probs=18.2
Q ss_pred ChhHHHHHHHHHHHHHHhcCC
Q psy2203 30 DEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G~ 50 (74)
+.++.+++.+.+.+.|.+||.
T Consensus 86 ~~~~l~~~~~Gi~~a~~~~gi 106 (320)
T TIGR02124 86 PIEDLERIVKSMAEAARKAGV 106 (320)
T ss_pred CHHHHHHHHHHHHHHHHHcCC
Confidence 356789999999999999973
No 128
>cd02203 PurL_repeat1 PurL subunit of the formylglycinamide ribonucleotide amidotransferase (FGAR-AT), first repeat. FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, phosphate, and glutamate in the fourth step of the purine biosynthetic pathway. In eukaryotes and Gram-negative bacteria, FGAR-AT is encoded by the purL gene as a multidomain protein with a molecular mass of about 140 kDa. In Gram-positive bacteria and archaea FGAR-AT is a complex of three proteins: PurS, PurL, and PurQ. PurL itself contains two tandem N- and C-terminal domains (four domains altogether). The N-terminal domains bind ATP and are related to the ATP-binding domains of HypE, ThiL, SelD and PurM.
Probab=24.18 E-value=1.5e+02 Score=21.23 Aligned_cols=19 Identities=21% Similarity=0.282 Sum_probs=16.5
Q ss_pred hHHHHHHHHHHHHHHhcCC
Q psy2203 32 DEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 32 ~~~~ei~eDv~eEC~K~G~ 50 (74)
++.+++...+.+.|.+||.
T Consensus 92 ~~l~~~~~Gi~~~~~~~gv 110 (313)
T cd02203 92 RILDGVVAGISDYGNCIGI 110 (313)
T ss_pred HHHHHHHHHHHHHHHHhCC
Confidence 4578999999999999984
No 129
>PF05886 Orthopox_F8: Orthopoxvirus F8 protein; InterPro: IPR008726 This family consists of several poxvirus F8 proteins. The function of this family is unknown.
Probab=24.09 E-value=60 Score=19.06 Aligned_cols=18 Identities=22% Similarity=0.285 Sum_probs=14.2
Q ss_pred HHHHHhcCCeEEEEecCC
Q psy2203 42 KEECNKYGIVRSLEIPRP 59 (74)
Q Consensus 42 ~eEC~K~G~V~~V~Iprp 59 (74)
.+|-.|||.-.+|+-+..
T Consensus 28 ~dEI~KYG~clNIKk~t~ 45 (65)
T PF05886_consen 28 YDEIKKYGHCLNIKKFTK 45 (65)
T ss_pred HHHHHHhcceeeeecccC
Confidence 367789999999887654
No 130
>KOG0131|consensus
Probab=24.05 E-value=1.3e+02 Score=21.54 Aligned_cols=35 Identities=29% Similarity=0.343 Sum_probs=24.1
Q ss_pred HHHHHHHH-HHhcCCeEEEEecCCCCCCCCCceEEE
Q psy2203 37 ICEDIKEE-CNKYGIVRSLEIPRPIENVDVPGCGKA 71 (74)
Q Consensus 37 i~eDv~eE-C~K~G~V~~V~Iprp~~~~~~~gvGkV 71 (74)
+-+.+.-| |-.-|.|+++.||+-.-.+...|.|.+
T Consensus 21 vs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~ 56 (203)
T KOG0131|consen 21 VSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFA 56 (203)
T ss_pred HHHHHHHHHHHhcCceeeeecchhhhcccccceeEE
Confidence 33344444 577899999999997766555666654
No 131
>PRK08284 precorrin 6A synthase; Provisional
Probab=23.97 E-value=1.9e+02 Score=20.67 Aligned_cols=47 Identities=17% Similarity=0.190 Sum_probs=36.4
Q ss_pred CceEEEeeCCCCCCcCC---ChhHHHHHHHHHHHHH-HhcCCeEEEEecCC
Q psy2203 13 RFSVLCLLNMVTPDELK---DEDEYEDICEDIKEEC-NKYGIVRSLEIPRP 59 (74)
Q Consensus 13 ps~vv~L~Nm~~~~el~---~~~~~~ei~eDv~eEC-~K~G~V~~V~Iprp 59 (74)
..++.+..|+..++|-. .-++..+...|++.|+ .+||-+..-.+-|.
T Consensus 200 ~~~v~v~e~l~~p~E~I~~~~l~ei~~~~~~~~~~~~~~~~~~~~~~~~~~ 250 (253)
T PRK08284 200 DLEIWWGAYLGTPDEILIAGPLAEVAEEILRVRAEARARHGWIMDTYLLRR 250 (253)
T ss_pred CCEEEEEecCCCCCeEEEEccHHHHHHHHHHHHHHHHHhcCceehhhhhcc
Confidence 45789999999999843 3467788889999998 67998877665443
No 132
>PF12668 DUF3791: Protein of unknown function (DUF3791); InterPro: IPR024269 This entry represents proteins of unknown function.
Probab=23.68 E-value=86 Score=17.48 Aligned_cols=16 Identities=44% Similarity=0.376 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHhcC
Q psy2203 34 YEDICEDIKEECNKYG 49 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G 49 (74)
.+.+.+|+.+.|++-|
T Consensus 47 ~~yivedi~~~l~~~g 62 (62)
T PF12668_consen 47 DEYIVEDIIEYLKNRG 62 (62)
T ss_pred HHHHHHHHHHHHHhcC
Confidence 3578999999998765
No 133
>PRK10222 PTS system L-ascorbate-specific transporter subunit IIB; Provisional
Probab=23.50 E-value=51 Score=19.69 Aligned_cols=20 Identities=20% Similarity=0.255 Sum_probs=16.0
Q ss_pred CCCceEEEeeCCCCCCcCCC
Q psy2203 11 PLRFSVLCLLNMVTPDELKD 30 (74)
Q Consensus 11 ~~ps~vv~L~Nm~~~~el~~ 30 (74)
.....++.+.|+++.+++..
T Consensus 51 ~g~~~l~gI~N~~d~~ei~~ 70 (85)
T PRK10222 51 TGNKYVVGVRNMLSPADFGP 70 (85)
T ss_pred CCCceEEEEecccCHHHHHH
Confidence 34478999999999987765
No 134
>COG4080 SpoU rRNA Methylase family enzyme [General function prediction only]
Probab=23.50 E-value=39 Score=22.99 Aligned_cols=44 Identities=11% Similarity=0.263 Sum_probs=25.9
Q ss_pred EEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEE
Q psy2203 16 VLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGK 70 (74)
Q Consensus 16 vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGk 70 (74)
+++|+|..|...+ .|+--=.-.|| +...++-++......+|+++
T Consensus 4 ~vvlHN~~S~~rv----------~e~ariaygfg-~k~lV~tka~g~AAQsGIp~ 47 (147)
T COG4080 4 IVVLHNVSSVQRV----------LEFARIAYGFG-AKRLVLTKAKGSAAQSGIPE 47 (147)
T ss_pred EEEEecCCchHHH----------HHHHHHHcccC-ccEEEEEecccHhhhhccHH
Confidence 5788888887663 23333344556 66666666655444455543
No 135
>PF10256 Erf4: Golgin subfamily A member 7/ERF4 family; InterPro: IPR019383 Proteins in this entry include Golgin subfamily A member 7 and the Ras modification protein ERF4.
Probab=23.28 E-value=1.6e+02 Score=18.10 Aligned_cols=34 Identities=21% Similarity=0.408 Sum_probs=20.6
Q ss_pred CCCcccCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHh
Q psy2203 2 YPPWDRLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNK 47 (74)
Q Consensus 2 ~~~~d~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K 47 (74)
+|.|+..-|.. |.|. ..+++|+++..+|-.-+.+
T Consensus 15 ~~~Fs~~~P~~------L~~~------is~~ef~~iI~~IN~~l~~ 48 (118)
T PF10256_consen 15 FPQFSTEYPGE------LSGY------ISPEEFEEIINTINQILKE 48 (118)
T ss_pred CCccCccCCHh------hcCC------CCHHHHHHHHHHHHHHHHH
Confidence 56666644433 3333 4566778888887776644
No 136
>KOG0130|consensus
Probab=23.26 E-value=73 Score=21.92 Aligned_cols=24 Identities=21% Similarity=0.429 Sum_probs=19.1
Q ss_pred HHHHHHHHHHhcCCeEEEEecCCC
Q psy2203 37 ICEDIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~~V~Iprp~ 60 (74)
-.+||.+-..-||.|+++.+---.
T Consensus 85 tEedi~d~F~dyGeiKNihLNLDR 108 (170)
T KOG0130|consen 85 TEEDIHDKFADYGEIKNIHLNLDR 108 (170)
T ss_pred hHHHHHHHHhhcccccceeecccc
Confidence 457889999999999999865433
No 137
>TIGR00878 purM phosphoribosylaminoimidazole synthetase. This enzyme is found as a homodimeric monofunctional protein in prokaryotes and as part of a larger, multifunctional protein, sometimes with two copies of this enzyme in tandem, in eukaryotes.
Probab=23.13 E-value=1.2e+02 Score=22.24 Aligned_cols=21 Identities=29% Similarity=0.520 Sum_probs=18.5
Q ss_pred ChhHHHHHHHHHHHHHHhcCC
Q psy2203 30 DEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G~ 50 (74)
++++++++.+.+.+.|++||.
T Consensus 109 ~~~~~~~i~~Gi~~a~~~~g~ 129 (332)
T TIGR00878 109 DPEVASQIVKGIAEGCKQAGC 129 (332)
T ss_pred CHHHHHHHHHHHHHHHHHcCC
Confidence 467889999999999999973
No 138
>KOG0117|consensus
Probab=22.90 E-value=3.7e+02 Score=21.71 Aligned_cols=58 Identities=19% Similarity=0.242 Sum_probs=40.2
Q ss_pred CCcccCCCCCCceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCCCCCCCceEEEEe
Q psy2203 3 PPWDRLVPPLRFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIENVDVPGCGKATL 73 (74)
Q Consensus 3 ~~~d~~~~~~ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~~~~~~gvGkVFv 73 (74)
|.|+...+...+.|.+=. -|-|+. ++|+.-=|+|-|.|-.+.+-+-+- +..+=|..||
T Consensus 73 P~weg~~p~~G~EVfvGk---IPrD~~--------EdeLvplfEkiG~I~elRLMmD~~--sG~nRGYAFV 130 (506)
T KOG0117|consen 73 PGWEGPPPPRGCEVFVGK---IPRDVF--------EDELVPLFEKIGKIYELRLMMDPF--SGDNRGYAFV 130 (506)
T ss_pred CcccCCCCCCCceEEecC---CCcccc--------chhhHHHHHhccceeeEEEeeccc--CCCCcceEEE
Confidence 789999998888877433 233443 346778899999999999887532 2233566665
No 139
>cd02691 PurM-like2 AIR synthase (PurM) related protein, archaeal subgroup 2 of unknown function. The family of PurM related proteins includes Hydrogen expression/formation protein HypE, AIR synthases, FGAM synthase and Selenophosphate synthetase (SelD). They all contain two conserved domains and seem to dimerize. The N-terminal domain forms the dimer interface and is a putative ATP binding domain.
Probab=22.70 E-value=1e+02 Score=22.97 Aligned_cols=25 Identities=16% Similarity=0.292 Sum_probs=19.7
Q ss_pred CCcCCChhHHHHHHHHHHHHHHhcCC
Q psy2203 25 PDELKDEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 25 ~~el~~~~~~~ei~eDv~eEC~K~G~ 50 (74)
|.+ .+.++.+++.+.+.+.|.+||.
T Consensus 96 p~~-~~~~~l~~i~~Gi~~a~~~~gv 120 (346)
T cd02691 96 ADD-GDVGKLFDFTAGVTAVSEATGV 120 (346)
T ss_pred CCC-CCHHHHHHHHHHHHHHHHHcCC
Confidence 443 3456788999999999999984
No 140
>COG5470 Uncharacterized conserved protein [Function unknown]
Probab=22.43 E-value=1.1e+02 Score=19.42 Aligned_cols=23 Identities=26% Similarity=0.668 Sum_probs=20.7
Q ss_pred cCCChhHHHHHHHHHHHHHHhcC
Q psy2203 27 ELKDEDEYEDICEDIKEECNKYG 49 (74)
Q Consensus 27 el~~~~~~~ei~eDv~eEC~K~G 49 (74)
++.|.+.|++..+-+..=.+|||
T Consensus 11 dv~D~e~y~~Y~~~~~~a~~~~G 33 (96)
T COG5470 11 DVRDPEQYKDYVSKAKPAIEKFG 33 (96)
T ss_pred eecCHHHHHHHHHHhHHHHHHhC
Confidence 46788999999999999999997
No 141
>KOG0123|consensus
Probab=22.41 E-value=1.5e+02 Score=22.45 Aligned_cols=23 Identities=9% Similarity=0.261 Sum_probs=17.9
Q ss_pred HHHHHHHhcCCeEEEEecCCCCC
Q psy2203 40 DIKEECNKYGIVRSLEIPRPIEN 62 (74)
Q Consensus 40 Dv~eEC~K~G~V~~V~Iprp~~~ 62 (74)
++.+=.+.||.|+++++-+-..|
T Consensus 92 ~~~d~f~~~g~ilS~kv~~~~~g 114 (369)
T KOG0123|consen 92 SLYDTFSEFGNILSCKVATDENG 114 (369)
T ss_pred HHHHHHHhhcCeeEEEEEEcCCC
Confidence 34445688999999999997655
No 142
>cd02685 MIT_C MIT_C; domain found C-terminal to MIT (contained within Microtubule Interacting and Trafficking molecules) domains, as well as in some bacterial proteins. The function of this domain is unknown.
Probab=22.04 E-value=2.4e+02 Score=18.99 Aligned_cols=41 Identities=20% Similarity=0.302 Sum_probs=33.1
Q ss_pred EEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEec
Q psy2203 17 LCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIP 57 (74)
Q Consensus 17 v~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Ip 57 (74)
+.-..++|..|-.+.++-.+..+.|++-|.++|....+.-.
T Consensus 52 ~~~i~LvT~~d~~~~~~Q~~~l~~i~~sl~~~gI~~~~~f~ 92 (148)
T cd02685 52 LKYIHLVTGEDEDNGKQQIEALEEIKQSLASHGVEFTWEFS 92 (148)
T ss_pred eEEEEEEecCCCCCHHHHHHHHHHHHHHHHhCCcEEEEEEC
Confidence 44455667777777777888999999999999999888875
No 143
>TIGR00693 thiE thiamine-phosphate pyrophosphorylase. This model includes ThiE from Bacillus subtilis but excludes its paralog, the regulatory protein TenI, and neighbors of TenI.
Probab=21.93 E-value=1.2e+02 Score=19.93 Aligned_cols=21 Identities=29% Similarity=0.714 Sum_probs=17.6
Q ss_pred ChhHHHHHHHHHHHHHHhcCC
Q psy2203 30 DEDEYEDICEDIKEECNKYGI 50 (74)
Q Consensus 30 ~~~~~~ei~eDv~eEC~K~G~ 50 (74)
++.++..+...+++.|.+||.
T Consensus 38 ~~~~~~~~~~~l~~~~~~~~~ 58 (196)
T TIGR00693 38 NTRERLALAEKLQELCRRYGV 58 (196)
T ss_pred CHHHHHHHHHHHHHHHHHhCC
Confidence 456788899999999999983
No 144
>cd03405 Band_7_HflC Band_7_HflC: The band 7 domain of flotillin (reggie) like proteins. This group includes proteins similar to prokaryotic HlfC (High frequency of lysogenization C). Although many members of the band 7 family are lipid raft associated, prokaryote plasma membranes lack cholesterol and are unlikely to have lipid raft domains. Individual proteins of this band 7 domain family may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Escherichia coli HflC is an integral membrane protein which may localize to the plasma membrane. HflC associates with another band 7 family member (HflK) to form an HflKC complex. HflKC interacts with FtsH in a large complex termed the FtsH holo-enzyme. FtsH is an AAA ATP-dependent protease which exerts progressive proteolysis against membrane-embedded and soluble substrate proteins. HflKC can modulate the activity of FtsH. HflKC plays a role in the decision between lysogenic and lytic cycle growth during la
Probab=21.87 E-value=1.3e+02 Score=20.43 Aligned_cols=24 Identities=29% Similarity=0.443 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHhcC-CeEEEEec
Q psy2203 34 YEDICEDIKEECNKYG-IVRSLEIP 57 (74)
Q Consensus 34 ~~ei~eDv~eEC~K~G-~V~~V~Ip 57 (74)
-++|.+.+++++.+|| .|.++.|-
T Consensus 122 ~~~i~~~l~~~l~~~Gi~i~~v~i~ 146 (242)
T cd03405 122 MEEIRRAVAEEAKELGIEVVDVRIK 146 (242)
T ss_pred HHHHHHHHHHHHHccCcEEEEEEEE
Confidence 3677888888899999 56666654
No 145
>PRK03892 ribonuclease P protein component 3; Provisional
Probab=21.72 E-value=1.1e+02 Score=21.96 Aligned_cols=33 Identities=27% Similarity=0.524 Sum_probs=25.2
Q ss_pred CCChhHHHHHHHHHHHHHHhcCCeEEEEecCCCC
Q psy2203 28 LKDEDEYEDICEDIKEECNKYGIVRSLEIPRPIE 61 (74)
Q Consensus 28 l~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp~~ 61 (74)
|.+...+..+++|.++=+..||.+ .|.|-.+.+
T Consensus 38 ~~~~~~~~~~~~d~~~l~~~yg~~-gv~i~~~np 70 (216)
T PRK03892 38 LEDSPDFGSLKEELKELKKEYGKV-AILLVTPKP 70 (216)
T ss_pred ccCCCChhhhHHHHHHHHHhcCcc-eEEEecCCH
Confidence 344556778999999999999987 777766543
No 146
>cd04933 ACT_AK1-AT_1 ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1). This CD includes the first of two ACT domains located C-terminal to the catalytic domain of a monofunctional, lysine-sensitive, plant aspartate kinase 1 (AK1), which can be synergistically inhibited by S-adenosylmethionine. This isoenzyme is found in higher plants, Arabidopsis thaliana (AT) and Zea mays, and also in Chlorophyta. Like the Escherichia coli AKIII (LysC), Arabidopsis AK1 binds two feedback allosteric inhibitor lysine molecules at the dimer interface located between the ACT1 domain of two subunits. A loop in common is involved in the binding of both Lys and S-adenosylmethionine providing an explanation for the synergistic inhibition by these effectors. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=21.70 E-value=98 Score=18.24 Aligned_cols=16 Identities=25% Similarity=0.306 Sum_probs=12.4
Q ss_pred HHHHHHHHHHhcCCeE
Q psy2203 37 ICEDIKEECNKYGIVR 52 (74)
Q Consensus 37 i~eDv~eEC~K~G~V~ 52 (74)
-.+.+++|-+|||.|.
T Consensus 60 ~l~~~~~~~~~~~~~~ 75 (78)
T cd04933 60 ELDHVVEELEKDAVVN 75 (78)
T ss_pred HHHHHHHHHHHcCeEE
Confidence 3448899999999764
No 147
>PF12021 DUF3509: Protein of unknown function (DUF3509); InterPro: IPR021898 This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 92 to 110 amino acids in length. This protein has two completely conserved residues (G and R) that may be functionally important.
Probab=21.57 E-value=1.1e+02 Score=19.05 Aligned_cols=29 Identities=14% Similarity=0.382 Sum_probs=25.1
Q ss_pred EEeeCCCCCCcCCChhHHHHHHHHHHHHH
Q psy2203 17 LCLLNMVTPDELKDEDEYEDICEDIKEEC 45 (74)
Q Consensus 17 v~L~Nm~~~~el~~~~~~~ei~eDv~eEC 45 (74)
++++-+++...+.+....+++.++|+-+-
T Consensus 37 ~~~~R~is~~Ql~d~~~L~~vI~~irrdL 65 (94)
T PF12021_consen 37 VVVKRAISAAQLNDQQQLEWVIDSIRRDL 65 (94)
T ss_pred eEEEEecCHHHhcCHHHHHHHHHHHHHHH
Confidence 66788999999999999998888888774
No 148
>KOG3233|consensus
Probab=21.52 E-value=1.7e+02 Score=22.05 Aligned_cols=34 Identities=24% Similarity=0.429 Sum_probs=30.0
Q ss_pred eEEEeeCCCCCCcCCChhHH------HHHHHHHHHHHHhc
Q psy2203 15 SVLCLLNMVTPDELKDEDEY------EDICEDIKEECNKY 48 (74)
Q Consensus 15 ~vv~L~Nm~~~~el~~~~~~------~ei~eDv~eEC~K~ 48 (74)
+|..|.|+....+++.-+|| -|..+++.+-|.+|
T Consensus 142 KvYmLy~leP~~elTGG~WytDqdlDvEfIe~L~~~c~~f 181 (297)
T KOG3233|consen 142 KVYMLYDLEPDSELTGGTWYTDQDLDVEFIEVLKQICVRF 181 (297)
T ss_pred eEEEEecccccccccCCcccccccccHHHHHHHHHHHHHH
Confidence 79999999999999988888 36889999999877
No 149
>PRK07226 fructose-bisphosphate aldolase; Provisional
Probab=21.42 E-value=1.5e+02 Score=21.01 Aligned_cols=24 Identities=21% Similarity=0.376 Sum_probs=19.0
Q ss_pred hHHHHHHHHHHHHHHhcCCeEEEE
Q psy2203 32 DEYEDICEDIKEECNKYGIVRSLE 55 (74)
Q Consensus 32 ~~~~ei~eDv~eEC~K~G~V~~V~ 55 (74)
..+.+....|.++|.+||....+.
T Consensus 122 ~~~~~~~~~v~~~~~~~g~pl~vi 145 (267)
T PRK07226 122 AEMLEDLGEVAEECEEWGMPLLAM 145 (267)
T ss_pred HHHHHHHHHHHHHHHHcCCcEEEE
Confidence 345677788899999999887764
No 150
>TIGR01821 5aminolev_synth 5-aminolevulinic acid synthase. This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
Probab=21.15 E-value=2.5e+02 Score=20.46 Aligned_cols=34 Identities=21% Similarity=0.310 Sum_probs=25.0
Q ss_pred CceEEEeeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeE
Q psy2203 13 RFSVLCLLNMVTPDELKDEDEYEDICEDIKEECNKYGIVR 52 (74)
Q Consensus 13 ps~vv~L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~ 52 (74)
.+++|++.++.++.-..-+ .+++.+-|.+||.+.
T Consensus 177 ~~~~v~~e~~~~~~G~~~~------l~~i~~l~~~~~~~l 210 (402)
T TIGR01821 177 RPKIIAFESVYSMDGDIAP------IEEICDLADKYGALT 210 (402)
T ss_pred CCeEEEEcCCCCCCCCccC------HHHHHHHHHHcCCEE
Confidence 4678999998887543333 578888999998643
No 151
>KOG0477|consensus
Probab=21.04 E-value=71 Score=27.05 Aligned_cols=54 Identities=19% Similarity=0.217 Sum_probs=34.6
Q ss_pred CCCCcccCCCCCCceEEE--------------------eeCCCCCCcCCChhHHHHHHHHHHHHHHhcCCeEEEEecCC
Q psy2203 1 MYPPWDRLVPPLRFSVLC--------------------LLNMVTPDELKDEDEYEDICEDIKEECNKYGIVRSLEIPRP 59 (74)
Q Consensus 1 ~~~~~d~~~~~~ps~vv~--------------------L~Nm~~~~el~~~~~~~ei~eDv~eEC~K~G~V~~V~Iprp 59 (74)
|||+|++..+.+..|+-. ++.+++..--.-+ .+.-|+.-|.|||.|..-++-..
T Consensus 236 ~~p~yeri~~~ihvris~lP~~~~lr~lRq~Hln~Lvr~~GvVtr~tgV~p-----ql~~vky~C~KC~~vlgPF~qs~ 309 (854)
T KOG0477|consen 236 HYPNYERIHNEIHVRISDLPVCESLRSLRQLHLNQLVRTSGVVTRRTGVFP-----QLSVVKYDCLKCGFVLGPFVQSS 309 (854)
T ss_pred hCCChhhcccceeeeeecCCccccHHHHHHhccCceEEeeeEEEecceeeh-----hhHHHhhhHHhhCCccCceeecc
Confidence 689998887776555432 3333333221112 45678899999999988877654
No 152
>TIGR01949 AroFGH_arch predicted phospho-2-dehydro-3-deoxyheptonate aldolase. Together these two genes appear to perform the synthesis of 3-dehydroquinate. It is presumed that the substrates and the chemical transformations involved are identical, but this has not yet been proven experimentally.
Probab=20.93 E-value=1.5e+02 Score=20.73 Aligned_cols=25 Identities=20% Similarity=0.139 Sum_probs=19.4
Q ss_pred hHHHHHHHHHHHHHHhcCCeEEEEe
Q psy2203 32 DEYEDICEDIKEECNKYGIVRSLEI 56 (74)
Q Consensus 32 ~~~~ei~eDv~eEC~K~G~V~~V~I 56 (74)
.++.+....++++|.+||...-|.+
T Consensus 119 ~~~~~~~~~i~~~~~~~g~~liv~~ 143 (258)
T TIGR01949 119 WEQIRDLGMIAEICDDWGVPLLAMM 143 (258)
T ss_pred HHHHHHHHHHHHHHHHcCCCEEEEE
Confidence 3455778889999999998776643
No 153
>PF05722 Ustilago_mating: Ustilago B locus mating-type protein; InterPro: IPR008888 This domain consists of several Ustilago mating-type proteins. The b locus of the phytopathogenic fungus Ustilago maydis encodes a multiallelic recognition function that controls the ability of the fungus to form a dikaryon and complete the sexual stage of the life cycle. The b locus has at least 25 alleles and any combination of two different alleles, brought together by mating between haploid cells, allows the fungus to cause disease and undergo sexual development within the plant [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=20.40 E-value=47 Score=24.25 Aligned_cols=12 Identities=33% Similarity=0.941 Sum_probs=8.4
Q ss_pred CCCCcccCCCCCC
Q psy2203 1 MYPPWDRLVPPLR 13 (74)
Q Consensus 1 ~~~~~d~~~~~~p 13 (74)
|||+|| ..+++|
T Consensus 257 ~fppyd-a~gq~p 268 (286)
T PF05722_consen 257 MFPPYD-AMGQIP 268 (286)
T ss_pred cCCchh-hcccCC
Confidence 799999 444444
No 154
>KOG4175|consensus
Probab=20.39 E-value=63 Score=23.66 Aligned_cols=13 Identities=38% Similarity=0.680 Sum_probs=10.7
Q ss_pred HHHHHHHHHhcCC
Q psy2203 38 CEDIKEECNKYGI 50 (74)
Q Consensus 38 ~eDv~eEC~K~G~ 50 (74)
.+-+|+||.|+|.
T Consensus 137 a~~~Rne~~k~gi 149 (268)
T KOG4175|consen 137 AETLRNEARKHGI 149 (268)
T ss_pred HHHHHHHHHhcCc
Confidence 3568999999994
No 155
>KOG0105|consensus
Probab=20.09 E-value=1.4e+02 Score=21.51 Aligned_cols=21 Identities=29% Similarity=0.544 Sum_probs=16.7
Q ss_pred HHHHHHHhcCCeEEEEecCCC
Q psy2203 40 DIKEECNKYGIVRSLEIPRPI 60 (74)
Q Consensus 40 Dv~eEC~K~G~V~~V~Iprp~ 60 (74)
||++-.-|||.|..|.+-.++
T Consensus 22 eieDlFyKyg~i~~ieLK~r~ 42 (241)
T KOG0105|consen 22 EIEDLFYKYGRIREIELKNRP 42 (241)
T ss_pred cHHHHHhhhcceEEEEeccCC
Confidence 566778999999999876554
Done!