Query psy4687
Match_columns 339
No_of_seqs 172 out of 235
Neff 5.4
Searched_HMMs 46136
Date Fri Aug 16 23:20:39 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4687.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4687hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2295|consensus 100.0 7.7E-73 1.7E-77 565.8 11.8 304 31-339 228-545 (648)
2 PF04959 ARS2: Arsenite-resist 100.0 1.4E-37 2.9E-42 286.6 -0.9 111 229-339 1-111 (214)
3 PF13821 DUF4187: Domain of un 99.5 4.5E-15 9.7E-20 110.1 2.8 41 197-237 11-51 (55)
4 PF00076 RRM_1: RNA recognitio 98.6 1.4E-07 3.1E-12 69.5 6.3 69 37-106 1-69 (70)
5 KOG1994|consensus 98.2 7.2E-07 1.6E-11 83.1 2.7 45 196-240 222-266 (268)
6 PLN03134 glycine-rich RNA-bind 98.1 2.3E-05 5E-10 68.5 10.5 83 26-108 26-108 (144)
7 PF14259 RRM_6: RNA recognitio 97.8 4.7E-05 1E-09 56.8 6.0 66 37-103 1-66 (70)
8 smart00362 RRM_2 RNA recogniti 97.7 0.00012 2.6E-09 52.4 6.8 67 36-104 1-67 (72)
9 cd00590 RRM RRM (RNA recogniti 97.7 0.00017 3.6E-09 52.0 7.2 69 36-105 1-69 (74)
10 smart00360 RRM RNA recognition 97.3 0.00068 1.5E-08 48.2 6.0 65 39-103 1-65 (71)
11 TIGR01659 sex-lethal sex-letha 97.0 0.0037 8E-08 62.2 9.7 75 32-107 105-180 (346)
12 TIGR01661 ELAV_HUD_SF ELAV/HuD 97.0 0.0023 5E-08 61.9 7.4 73 34-107 3-76 (352)
13 TIGR01661 ELAV_HUD_SF ELAV/HuD 96.8 0.0048 1.1E-07 59.7 8.7 73 33-106 268-341 (352)
14 TIGR01659 sex-lethal sex-letha 96.5 0.0077 1.7E-07 59.9 7.7 71 33-104 192-263 (346)
15 PF00096 zf-C2H2: Zinc finger, 96.4 0.0015 3.3E-08 39.2 1.2 23 306-329 1-23 (23)
16 TIGR01642 U2AF_lg U2 snRNP aux 96.4 0.01 2.2E-07 60.6 7.8 73 34-107 295-368 (509)
17 TIGR01622 SF-CC1 splicing fact 96.3 0.011 2.3E-07 59.6 7.3 73 34-107 186-259 (457)
18 TIGR01648 hnRNP-R-Q heterogene 96.3 0.025 5.4E-07 60.0 10.1 98 4-104 23-127 (578)
19 TIGR01649 hnRNP-L_PTB hnRNP-L/ 96.2 0.021 4.6E-07 58.7 9.1 72 31-104 391-464 (481)
20 PLN03120 nucleic acid binding 96.1 0.016 3.4E-07 55.8 7.1 68 34-106 4-72 (260)
21 TIGR01628 PABP-1234 polyadenyl 96.1 0.014 3E-07 60.8 7.1 76 31-107 282-357 (562)
22 TIGR01628 PABP-1234 polyadenyl 96.1 0.017 3.7E-07 60.1 7.7 71 35-106 1-72 (562)
23 PF13894 zf-C2H2_4: C2H2-type 96.0 0.0041 8.9E-08 36.8 1.6 24 306-329 1-24 (24)
24 TIGR01648 hnRNP-R-Q heterogene 96.0 0.02 4.4E-07 60.7 7.7 73 28-108 227-301 (578)
25 COG0724 RNA-binding proteins ( 95.7 0.04 8.6E-07 48.6 7.3 74 34-107 115-188 (306)
26 KOG0114|consensus 95.6 0.048 1E-06 46.1 7.1 68 33-103 17-84 (124)
27 KOG3152|consensus 95.6 0.0065 1.4E-07 58.1 2.0 92 33-127 73-177 (278)
28 PLN03121 nucleic acid binding 95.6 0.042 9.1E-07 52.3 7.3 68 33-104 4-71 (243)
29 TIGR01645 half-pint poly-U bin 95.4 0.039 8.4E-07 58.9 7.3 72 34-107 204-277 (612)
30 TIGR01649 hnRNP-L_PTB hnRNP-L/ 95.3 0.055 1.2E-06 55.7 7.7 72 32-108 273-345 (481)
31 PHA00616 hypothetical protein 95.1 0.0066 1.4E-07 43.3 0.2 26 306-331 2-27 (44)
32 PF08777 RRM_3: RNA binding mo 95.1 0.081 1.7E-06 44.1 6.7 60 34-99 1-60 (105)
33 TIGR01642 U2AF_lg U2 snRNP aux 95.1 0.051 1.1E-06 55.4 6.8 66 34-106 175-252 (509)
34 TIGR01622 SF-CC1 splicing fact 94.3 0.14 3E-06 51.7 7.6 71 34-106 89-160 (457)
35 TIGR01645 half-pint poly-U bin 94.2 0.1 2.2E-06 55.8 6.7 78 29-107 102-180 (612)
36 KOG0117|consensus 93.8 0.13 2.7E-06 52.9 6.0 76 27-110 252-327 (506)
37 KOG1457|consensus 93.7 0.055 1.2E-06 51.2 3.1 63 35-101 211-273 (284)
38 smart00355 ZnF_C2H2 zinc finge 93.3 0.049 1.1E-06 32.2 1.5 24 306-330 1-24 (26)
39 smart00361 RRM_1 RNA recogniti 93.2 0.21 4.5E-06 37.8 5.2 55 49-103 7-64 (70)
40 PF12171 zf-C2H2_jaz: Zinc-fin 93.2 0.05 1.1E-06 34.1 1.3 24 305-328 1-24 (27)
41 KOG0117|consensus 93.0 0.5 1.1E-05 48.7 8.8 87 17-105 67-154 (506)
42 KOG0108|consensus 92.6 0.39 8.4E-06 49.5 7.5 72 35-106 19-90 (435)
43 KOG1457|consensus 92.5 0.33 7.2E-06 46.1 6.4 67 35-102 35-103 (284)
44 smart00451 ZnF_U1 U1-like zinc 92.5 0.076 1.7E-06 34.7 1.6 31 304-334 2-33 (35)
45 KOG0127|consensus 92.4 0.28 6E-06 51.6 6.2 73 34-107 5-78 (678)
46 PF03467 Smg4_UPF3: Smg-4/UPF3 92.3 0.17 3.8E-06 45.8 4.1 68 35-102 8-81 (176)
47 PLN03213 repressor of silencin 92.1 0.47 1E-05 49.6 7.3 72 32-107 8-81 (759)
48 KOG4206|consensus 92.0 0.8 1.7E-05 43.2 8.2 70 33-105 8-81 (221)
49 PF04059 RRM_2: RNA recognitio 91.9 0.76 1.6E-05 38.0 7.1 69 35-104 2-73 (97)
50 PF12874 zf-met: Zinc-finger o 91.5 0.11 2.3E-06 31.7 1.3 23 306-328 1-23 (25)
51 KOG0122|consensus 91.3 0.93 2E-05 43.5 7.9 75 32-106 187-261 (270)
52 PF13912 zf-C2H2_6: C2H2-type 91.0 0.13 2.9E-06 31.8 1.4 25 306-330 2-26 (27)
53 PF02892 zf-BED: BED zinc fing 91.0 0.15 3.3E-06 35.4 1.8 29 302-330 13-45 (45)
54 KOG4212|consensus 89.5 0.62 1.4E-05 48.0 5.5 75 27-106 529-603 (608)
55 KOG0121|consensus 88.5 1.4 3.1E-05 38.7 6.2 77 27-103 29-105 (153)
56 PF12756 zf-C2H2_2: C2H2 type 88.0 0.33 7.1E-06 38.0 1.9 28 305-332 50-78 (100)
57 KOG4206|consensus 87.8 1.4 3E-05 41.6 6.2 66 32-102 144-209 (221)
58 KOG0116|consensus 87.5 1.5 3.2E-05 45.1 6.7 65 37-107 291-360 (419)
59 KOG0533|consensus 87.2 2 4.3E-05 41.1 7.0 74 33-107 82-155 (243)
60 PF05605 zf-Di19: Drought indu 87.0 0.39 8.4E-06 35.0 1.7 27 304-331 1-27 (54)
61 PF13913 zf-C2HC_2: zinc-finge 86.2 0.49 1.1E-05 29.6 1.6 20 306-326 3-22 (25)
62 PHA02768 hypothetical protein; 85.7 0.45 9.7E-06 35.6 1.4 24 306-329 6-29 (55)
63 KOG0107|consensus 85.5 2.9 6.3E-05 38.4 6.8 65 34-105 10-76 (195)
64 KOG3993|consensus 84.9 0.32 6.8E-06 49.8 0.4 29 302-330 353-381 (500)
65 PF13909 zf-H2C2_5: C2H2-type 84.5 0.51 1.1E-05 28.5 1.1 24 306-330 1-24 (24)
66 PF13893 RRM_5: RNA recognitio 82.7 2.4 5.1E-05 30.2 4.1 51 51-106 1-51 (56)
67 KOG4207|consensus 82.5 5.1 0.00011 37.9 7.2 82 28-110 7-89 (256)
68 KOG0127|consensus 81.7 3.5 7.6E-05 43.7 6.4 70 35-105 118-187 (678)
69 KOG0109|consensus 78.1 2.2 4.7E-05 42.1 3.4 87 32-129 76-162 (346)
70 PHA00733 hypothetical protein 76.1 1.7 3.8E-05 37.4 1.9 27 304-330 98-124 (128)
71 KOG0717|consensus 75.1 1.3 2.9E-05 45.9 1.1 34 306-339 293-327 (508)
72 COG4049 Uncharacterized protei 74.8 1.3 2.8E-05 33.5 0.6 34 300-333 12-45 (65)
73 PF04780 DUF629: Protein of un 74.7 1.6 3.5E-05 45.4 1.6 42 289-334 44-86 (466)
74 KOG0131|consensus 73.8 5.9 0.00013 36.7 4.8 76 31-106 6-81 (203)
75 PF13465 zf-H2C2_2: Zinc-finge 72.8 1.6 3.4E-05 27.3 0.6 22 292-316 4-25 (26)
76 PF04423 Rad50_zn_hook: Rad50 72.7 3.3 7.2E-05 30.1 2.4 44 289-338 8-51 (54)
77 KOG0123|consensus 68.9 11 0.00023 38.2 5.8 67 37-107 79-146 (369)
78 KOG3623|consensus 68.2 1.9 4.1E-05 47.1 0.4 31 302-332 278-308 (1007)
79 KOG1548|consensus 67.0 14 0.0003 37.3 6.1 72 33-108 133-215 (382)
80 KOG0145|consensus 66.7 9.6 0.00021 37.2 4.8 74 31-105 124-198 (360)
81 COG2871 NqrF Na+-transporting 64.8 6.6 0.00014 39.0 3.3 35 42-76 316-351 (410)
82 smart00614 ZnF_BED BED zinc fi 62.8 4.6 9.9E-05 28.9 1.4 28 304-331 17-49 (50)
83 KOG4212|consensus 62.8 19 0.00041 37.6 6.2 70 34-106 44-116 (608)
84 KOG0113|consensus 62.0 23 0.0005 35.2 6.5 70 35-107 102-174 (335)
85 PF02891 zf-MIZ: MIZ/SP-RING z 61.0 5.7 0.00012 28.7 1.7 22 289-313 28-49 (50)
86 smart00734 ZnF_Rad18 Rad18-lik 60.6 6.1 0.00013 24.9 1.5 19 307-326 3-21 (26)
87 KOG0149|consensus 57.9 27 0.00059 33.5 6.0 71 35-106 13-87 (247)
88 PHA00732 hypothetical protein 56.6 7.6 0.00016 30.8 1.8 25 306-330 2-26 (79)
89 KOG0105|consensus 56.6 31 0.00067 32.3 6.0 71 33-106 5-75 (241)
90 PF05605 zf-Di19: Drought indu 54.3 7.3 0.00016 28.2 1.3 26 303-330 29-54 (54)
91 KOG0110|consensus 52.9 17 0.00036 39.7 4.2 68 34-103 613-682 (725)
92 KOG0124|consensus 52.6 88 0.0019 32.1 8.9 72 33-104 209-280 (544)
93 KOG0145|consensus 51.4 35 0.00076 33.5 5.7 91 35-126 42-133 (360)
94 KOG0123|consensus 51.0 33 0.00072 34.7 5.8 76 29-107 265-342 (369)
95 PRK12496 hypothetical protein; 50.0 7.1 0.00015 35.0 0.8 19 298-316 120-138 (164)
96 KOG0146|consensus 48.0 42 0.00091 33.1 5.7 83 28-113 279-364 (371)
97 PF09538 FYDLN_acid: Protein o 47.9 7.5 0.00016 32.8 0.6 14 303-316 7-20 (108)
98 smart00719 Plus3 Short conserv 47.3 37 0.00079 28.2 4.6 36 46-82 12-55 (109)
99 KOG0110|consensus 46.5 33 0.00071 37.5 5.1 72 36-107 517-591 (725)
100 PF00403 HMA: Heavy-metal-asso 45.9 85 0.0018 22.4 5.9 56 37-98 2-58 (62)
101 PF05443 ROS_MUCR: ROS/MUCR tr 45.5 13 0.00028 32.5 1.6 31 302-335 69-99 (132)
102 KOG0131|consensus 44.3 49 0.0011 30.8 5.2 88 4-98 70-161 (203)
103 KOG0125|consensus 43.9 1E+02 0.0022 31.2 7.7 70 35-106 97-166 (376)
104 KOG2591|consensus 43.5 56 0.0012 35.1 6.1 66 25-97 166-233 (684)
105 KOG2462|consensus 41.6 11 0.00023 36.9 0.6 29 304-332 186-214 (279)
106 KOG0153|consensus 41.6 55 0.0012 33.2 5.5 78 26-109 220-298 (377)
107 PHA00733 hypothetical protein 40.7 25 0.00055 30.2 2.7 26 302-327 70-95 (128)
108 KOG2462|consensus 39.7 14 0.0003 36.1 1.1 36 289-327 230-265 (279)
109 TIGR02300 FYDLN_acid conserved 38.4 13 0.00029 32.4 0.6 15 302-316 6-20 (129)
110 KOG0132|consensus 37.8 70 0.0015 35.6 6.0 63 35-103 422-484 (894)
111 KOG0105|consensus 37.6 1.4E+02 0.003 28.2 7.0 72 25-103 106-177 (241)
112 PF09237 GAGA: GAGA factor; I 36.7 19 0.00041 26.8 1.1 31 302-332 21-51 (54)
113 PF01927 Mut7-C: Mut7-C RNAse 35.7 12 0.00026 32.6 -0.0 26 290-315 109-134 (147)
114 PF01709 Transcrip_reg: Transc 35.4 69 0.0015 30.3 5.0 59 32-94 88-150 (234)
115 KOG3576|consensus 34.3 17 0.00037 34.4 0.7 25 302-326 114-138 (267)
116 KOG0144|consensus 33.0 45 0.00097 34.8 3.5 90 13-107 105-197 (510)
117 TIGR01033 DNA-binding regulato 32.9 1.3E+02 0.0029 28.6 6.5 59 32-94 92-154 (238)
118 KOG0144|consensus 31.4 1.3E+02 0.0029 31.5 6.5 74 33-109 33-109 (510)
119 smart00834 CxxC_CXXC_SSSS Puta 30.7 23 0.00049 23.6 0.7 14 305-318 5-18 (41)
120 KOG1190|consensus 30.1 56 0.0012 33.9 3.6 47 15-61 377-441 (492)
121 TIGR02098 MJ0042_CXXC MJ0042 f 30.1 29 0.00064 23.0 1.1 16 302-317 22-37 (38)
122 PF03126 Plus-3: Plus-3 domain 29.8 75 0.0016 26.1 3.8 38 45-82 7-52 (108)
123 TIGR03636 L23_arch archaeal ri 29.2 1.8E+02 0.0038 23.1 5.6 55 41-96 20-74 (77)
124 COG4957 Predicted transcriptio 28.6 22 0.00047 31.5 0.4 31 302-335 73-103 (148)
125 COG1656 Uncharacterized conser 28.5 30 0.00065 31.4 1.2 33 288-320 113-146 (165)
126 PRK00110 hypothetical protein; 28.3 1.4E+02 0.0031 28.5 5.9 58 32-94 92-152 (245)
127 KOG2482|consensus 27.6 34 0.00074 34.6 1.6 28 302-329 192-219 (423)
128 PRK14548 50S ribosomal protein 27.2 1.9E+02 0.0041 23.3 5.6 55 41-96 27-81 (84)
129 COG3677 Transposase and inacti 26.9 28 0.00062 30.0 0.8 15 302-316 50-64 (129)
130 TIGR02728 spore_gerQ spore coa 25.9 80 0.0017 25.4 3.1 29 59-87 44-76 (82)
131 KOG0148|consensus 25.1 2E+02 0.0043 28.6 6.2 68 30-103 160-227 (321)
132 PF11341 DUF3143: Protein of u 24.9 59 0.0013 25.0 2.1 20 36-55 41-60 (63)
133 COG1997 RPL43A Ribosomal prote 24.8 34 0.00073 28.1 0.8 21 300-320 48-68 (89)
134 PF07754 DUF1610: Domain of un 23.9 41 0.00088 21.1 0.9 11 303-313 14-24 (24)
135 PF08741 YwhD: YwhD family; I 23.8 89 0.0019 28.3 3.3 61 14-87 6-85 (163)
136 PF05265 DUF723: Protein of un 23.3 34 0.00075 26.1 0.6 10 303-312 51-60 (60)
137 PF03317 ELF: ELF protein; In 23.2 23 0.0005 33.4 -0.5 42 34-84 90-131 (284)
138 KOG0109|consensus 23.2 97 0.0021 30.9 3.7 65 35-107 3-67 (346)
139 KOG4205|consensus 22.4 1.4E+02 0.0031 29.7 4.8 61 33-94 96-157 (311)
140 COG1439 Predicted nucleic acid 22.3 32 0.00069 31.6 0.2 19 298-316 132-150 (177)
141 PRK12378 hypothetical protein; 20.9 3.1E+02 0.0067 26.2 6.6 58 32-94 89-149 (235)
142 PRK00464 nrdR transcriptional 20.9 30 0.00065 30.9 -0.2 18 306-323 29-46 (154)
143 KOG4660|consensus 20.6 2.3E+02 0.005 30.4 6.1 72 25-101 66-137 (549)
144 KOG4211|consensus 20.3 2.6E+02 0.0057 29.6 6.3 75 17-93 83-162 (510)
145 PTZ00380 microtubule-associate 20.1 1.3E+02 0.0028 26.0 3.5 37 35-71 71-109 (121)
146 KOG0106|consensus 20.1 85 0.0018 29.7 2.6 62 36-105 3-64 (216)
No 1
>KOG2295|consensus
Probab=100.00 E-value=7.7e-73 Score=565.83 Aligned_cols=304 Identities=45% Similarity=0.747 Sum_probs=279.8
Q ss_pred cCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccccccc
Q psy4687 31 LHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELGAIV 110 (339)
Q Consensus 31 lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~~~v 110 (339)
+|+|++||+++|+|+||+++|+.+|+++|||+++|||++++.|+|+|+|||+|+.++||++|||+||+++++.+.+++.+
T Consensus 228 ~hke~sll~rni~Pnis~aeIe~~ck~i~~~lrfals~~~aek~~~r~lwv~fk~~~ni~~a~~aLn~irl~s~~~se~e 307 (648)
T KOG2295|consen 228 THKECSLLVRNILPNISVAEIENLCKGIPGFLRFALSTINAEKNFERRLWVTFKRGTNIKEACWALNGIRLRSNFLSESE 307 (648)
T ss_pred hhHHHHHHHhccCCcccHHHHHHHhccCchheeeeccCchHHHHHHHHhhHhhccccchHHHHHHhhhcccccccccccc
Confidence 59999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccccccccccccCcccchhhhhh-hhhh--------hccccCCCCCCCCcccccCCCCccccchhhhhhhhhchhHHhhh
Q psy4687 111 NRDLSRRVRTVSGLTSLKPVVQF-DMKL--------SLWTDGQEPAQQPMSYTLSSKNPVLQNITDYLIEEASAEEEELL 181 (339)
Q Consensus 111 ~r~~~~~~r~~~~it~~~~rl~~-Dl~~--------~L~~~~~~~~~~~~~~~~~~~np~~~~i~d~liee~s~eeeell 181 (339)
|+++.+|+|.+||||.|++++.+ --++ .||... ++.++....+.++||++++|++++++|+|++|++++
T Consensus 308 n~~i~rrvr~~~Gia~~keiasn~a~k~~~lldk~irl~~~a--~~t~al~tkl~s~n~i~kni~d~l~te~S~eE~e~~ 385 (648)
T KOG2295|consen 308 NPDITRRVRPINGIAKHKEIASNAAQKLKNLLDKLIRLIDRA--SQTQALVTKLSSPNPIAKNIEDRLKTEASMEEDELL 385 (648)
T ss_pred ccCccceeccCCchhhHHHHHHHHHHhhhHHHhhhHHHhhcc--cccccchhhccCCCHHHHHHHHHhhhhcchhhhhhc
Confidence 99999999999999999999987 2222 666543 233344456889999999999999999999999999
Q ss_pred cCCCCCcc---cccccChhHHHHHHHHHHHHHHhcceeeeccccCCCcccccccCCCCccCCCCCCCCCChhhHHHHHHH
Q psy4687 182 GKSEENTE---ETEQVDPNVLRVLDRLILYLRIVHSVDYYNHCEYPNEDEMPNRCGIIHTRGSAPASKVTPQEITDYCST 258 (339)
Q Consensus 182 g~~~~~~~---~~~e~d~~l~~~LD~~i~YLR~vh~~cyyc~~ey~~~~el~~~Cp~~H~R~~~p~~~~t~~e~~~W~~~ 258 (339)
|+++.+.- .+.++|...++.||+|+.|||+||++|||..++|.+++.|+++||.+|+|++ +.++..++.+|.+.
T Consensus 386 gssg~e~p~eg~~~er~d~~lk~LDll~~YLr~VhslDfyn~~ey~~e~~mpnrcgl~hvR~~---~~vs~~ev~e~es~ 462 (648)
T KOG2295|consen 386 GSSGPEVPIEGLTSERDDIRLKLLDLLAEYLRIVHSLDFYNSKEYESEDAMPNRCGLIHVRGK---GFVSSKEVGEEESI 462 (648)
T ss_pred CCCCCcCccccCccccchhHHHHHHHHHHHHHHHHhcccccccccchhhhcccccCceeeccc---CCCCcccchhHHHH
Confidence 98874421 4677888999999999999999999999999999999999999999999998 35666778899999
Q ss_pred HHHhHHhccCCCCCCChHHHHhhcCcccHHHHHHHHHhhhhhccCCeecCCC--CcccccChhhHHhhhhhcchHHHHHh
Q psy4687 259 FQNKMSSLVSSPQEPTPEELDQLGAKKTDAEVEKFIAANTQEISTDKWLCPL--SGKKFKGPDFIRKHIFNKHAEKVEEV 336 (339)
Q Consensus 259 ~e~k~~~~l~~~~~~~~~d~~klG~k~~eeevek~v~~~~~e~~e~K~rC~~--C~KlFk~~eFV~KHI~nKH~e~v~~v 336 (339)
|+++++..|.....+.+++++++|.+++|+||++||++||++++++||+|++ |+|||||||||+|||++||.|||+++
T Consensus 463 f~s~le~~l~~~~~lee~eakkkg~k~~e~eve~~v~s~t~e~~kdKy~C~lsgc~KlF~gpEFvrKHi~~KH~d~leei 542 (648)
T KOG2295|consen 463 FLSDLENNLACLLELEEEEAKKKGAKDVEDEVENFVDSNTMELDKDKYLCPLSGCAKLFKGPEFVRKHINKKHKDKLEEI 542 (648)
T ss_pred HHHHhhhcccccccchHHHHHHhcccCHHHHHHHHHHHHHHHhhcccccCCCcchHhhccCHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999999999999999999999999999999999999999975 77999999999999999999999999
Q ss_pred hcC
Q psy4687 337 KKE 339 (339)
Q Consensus 337 ~~e 339 (339)
|+|
T Consensus 543 rke 545 (648)
T KOG2295|consen 543 RKE 545 (648)
T ss_pred HHH
Confidence 975
No 2
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=100.00 E-value=1.4e-37 Score=286.61 Aligned_cols=111 Identities=57% Similarity=1.031 Sum_probs=77.2
Q ss_pred ccccCCCCccCCCCCCCCCChhhHHHHHHHHHHhHHhccCCCCCCChHHHHhhcCcccHHHHHHHHHhhhhhccCCeecC
Q psy4687 229 MPNRCGIIHTRGSAPASKVTPQEITDYCSTFQNKMSSLVSSPQEPTPEELDQLGAKKTDAEVEKFIAANTQEISTDKWLC 308 (339)
Q Consensus 229 l~~~Cp~~H~R~~~p~~~~t~~e~~~W~~~~e~k~~~~l~~~~~~~~~d~~klG~k~~eeevek~v~~~~~e~~e~K~rC 308 (339)
|+++||++|||+++|++.+|+.++.+|+++||+||+++|++...++++++++||++++|+||++||+++|++++++||||
T Consensus 1 mp~rcGi~HvRg~~p~~~~~~~e~~~w~k~~e~kl~~~L~~~~~l~~~~~~klG~k~~e~eve~~v~~~~~e~~~~K~~C 80 (214)
T PF04959_consen 1 MPNRCGIIHVRGPPPSNRVTSDEVSEWVKTFEEKLQQLLEPRETLSEEELKKLGGKDPEEEVEKFVQKNTKEEDEDKWRC 80 (214)
T ss_dssp -------------------------HHHHHHHHHHHHHHHS--HHHHHH-HHHHHHHHHHHHGGGEEEEE-SSSSEEEEE
T ss_pred CcccccccccccccccCccCHHHHHHHHHHHHHHHHHHcCCcCcCChHHHHHHccCcHHHHHHHHHHHHHHHHcCCEECC
Confidence 78999999999999998999999999999999999999999988889999999999999999999999999999999999
Q ss_pred CCCcccccChhhHHhhhhhcchHHHHHhhcC
Q psy4687 309 PLSGKKFKGPDFIRKHIFNKHAEKVEEVKKE 339 (339)
Q Consensus 309 ~~C~KlFk~~eFV~KHI~nKH~e~v~~v~~e 339 (339)
++|+|||||+|||+|||+|||+|+|+++++|
T Consensus 81 ~lc~KlFkg~eFV~KHI~nKH~e~ve~~~~e 111 (214)
T PF04959_consen 81 PLCGKLFKGPEFVRKHIFNKHPEKVEEVKKE 111 (214)
T ss_dssp -SSS-EESSHHHHHHHHHHH-HHHHHHHHHH
T ss_pred CCCCcccCChHHHHHHHhhcCHHHHHHHHHH
Confidence 9999999999999999999999999999875
No 3
>PF13821 DUF4187: Domain of unknown function (DUF4187)
Probab=99.52 E-value=4.5e-15 Score=110.12 Aligned_cols=41 Identities=24% Similarity=0.454 Sum_probs=38.9
Q ss_pred hHHHHHHHHHHHHHHhcceeeeccccCCCcccccccCCCCc
Q psy4687 197 NVLRVLDRLILYLRIVHSVDYYNHCEYPNEDEMPNRCGIIH 237 (339)
Q Consensus 197 ~l~~~LD~~i~YLR~vh~~cyyc~~ey~~~~el~~~Cp~~H 237 (339)
.+.++|++++.|||.+|+||||||++|+|.+||.++|||.+
T Consensus 11 ~~~e~L~~l~~YLR~~~~YC~~Cg~~Y~d~~dL~~~CPG~t 51 (55)
T PF13821_consen 11 SPEERLDKLLSYLREEHNYCFWCGTKYDDEEDLERNCPGPT 51 (55)
T ss_pred CHHHHHHHHHHHHHhhCceeeeeCCccCCHHHHHhCCCCCC
Confidence 45799999999999999999999999999999999999976
No 4
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=98.58 E-value=1.4e-07 Score=69.48 Aligned_cols=69 Identities=14% Similarity=0.346 Sum_probs=62.1
Q ss_pred EEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 37 IFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 37 lfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
|||++||++++.++|.+++++++....+.+..- +.++..+.|||.|....+++.|+..+++..+.+..+
T Consensus 1 l~v~nlp~~~t~~~l~~~f~~~g~i~~~~~~~~-~~~~~~~~a~V~F~~~~~a~~a~~~l~g~~~~~~~i 69 (70)
T PF00076_consen 1 LYVGNLPPDVTEEELRDFFSQFGKIESIKVMRN-SSGKSKGYAFVEFESEEDAEKALEELNGKKINGRKI 69 (70)
T ss_dssp EEEESETTTSSHHHHHHHHHTTSTEEEEEEEEE-TTSSEEEEEEEEESSHHHHHHHHHHHTTEEETTEEE
T ss_pred cEEcCCCCcCCHHHHHHHHHHhhhccccccccc-ccccccceEEEEEcCHHHHHHHHHHcCCCEECccCc
Confidence 799999999999999999999999988888743 777899999999999999999999999988877543
No 5
>KOG1994|consensus
Probab=98.20 E-value=7.2e-07 Score=83.14 Aligned_cols=45 Identities=16% Similarity=0.378 Sum_probs=40.1
Q ss_pred hhHHHHHHHHHHHHHHhcceeeeccccCCCcccccccCCCCccCC
Q psy4687 196 PNVLRVLDRLILYLRIVHSVDYYNHCEYPNEDEMPNRCGIIHTRG 240 (339)
Q Consensus 196 ~~l~~~LD~~i~YLR~vh~~cyyc~~ey~~~~el~~~Cp~~H~R~ 240 (339)
+.+.+.|-.+.-|||.-|.||||||+.|.++++|..+|||..-|.
T Consensus 222 ei~Ee~lt~in~~LR~eh~YC~fCG~~y~~~edl~ehCPGvnedd 266 (268)
T KOG1994|consen 222 EIIEERLTKINIFLRSEHYYCFFCGIKYKDEEDLYEHCPGVNEDD 266 (268)
T ss_pred HHHHHHHHHHHHHHhccceEEEEeccccCCHHHHHHhCCCCCccc
Confidence 345689999999999999999999999999999999999875543
No 6
>PLN03134 glycine-rich RNA-binding protein 4; Provisional
Probab=98.13 E-value=2.3e-05 Score=68.46 Aligned_cols=83 Identities=10% Similarity=0.117 Sum_probs=67.0
Q ss_pred CCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 26 PPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 26 ~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
--...-+.+++|||.+||++++-++|.++|++++-...+.+.-.....+.-..|||+|....++..|+..||+..|.+..
T Consensus 26 ~~~~~~~~~~~lfVgnL~~~~te~~L~~~F~~~G~I~~v~i~~d~~tg~~kGfaFV~F~~~e~A~~Al~~lng~~i~Gr~ 105 (144)
T PLN03134 26 MLGSLRLMSTKLFIGGLSWGTDDASLRDAFAHFGDVVDAKVIVDRETGRSRGFGFVNFNDEGAATAAISEMDGKELNGRH 105 (144)
T ss_pred ccccccCCCCEEEEeCCCCCCCHHHHHHHHhcCCCeEEEEEEecCCCCCcceEEEEEECCHHHHHHHHHHcCCCEECCEE
Confidence 34455677789999999999999999999999999888888522222233468999999999999999999999888866
Q ss_pred ccc
Q psy4687 106 LGA 108 (339)
Q Consensus 106 l~~ 108 (339)
+..
T Consensus 106 l~V 108 (144)
T PLN03134 106 IRV 108 (144)
T ss_pred EEE
Confidence 553
No 7
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=97.83 E-value=4.7e-05 Score=56.83 Aligned_cols=66 Identities=21% Similarity=0.356 Sum_probs=52.1
Q ss_pred EEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 37 IFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 37 lfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
|+|++||++++.++|.+++..++-...+.+. .....++.+.|||.|.+..++..++..+++..+++
T Consensus 1 v~i~nlp~~~~~~~l~~~f~~~g~v~~v~~~-~~~~~~~~~~a~v~f~~~~~a~~al~~~~~~~~~g 66 (70)
T PF14259_consen 1 VYISNLPPSTTEEDLRNFFSRFGPVEKVRLI-KNKDGQSRGFAFVEFSSEEDAKRALELLNGKEIDG 66 (70)
T ss_dssp EEEESSTTT--HHHHHHHCTTSSBEEEEEEE-ESTTSSEEEEEEEEESSHHHHHHHHHHHTTEEETT
T ss_pred CEEeCCCCCCCHHHHHHHHHhcCCcceEEEE-eeeccccCCEEEEEeCCHHHHHHHHHHCCCcEECC
Confidence 7999999999999999999999546666665 22335789999999999999999888777676665
No 8
>smart00362 RRM_2 RNA recognition motif.
Probab=97.74 E-value=0.00012 Score=52.45 Aligned_cols=67 Identities=21% Similarity=0.367 Sum_probs=55.5
Q ss_pred eEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 36 SIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 36 slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
+|+|++||++++.++|.+++++++...++.+.... ......|+|.|....++..++..+++..+.+.
T Consensus 1 ~v~i~~l~~~~~~~~l~~~~~~~g~v~~~~~~~~~--~~~~~~~~v~f~~~~~a~~a~~~~~~~~~~~~ 67 (72)
T smart00362 1 TLFVGNLPPDVTEEDLKELFSKFGPIESVKIPKDT--GKSKGFAFVEFESEEDAEKAIEALNGTKLGGR 67 (72)
T ss_pred CEEEcCCCCcCCHHHHHHHHHhcCCEEEEEEecCC--CCCCceEEEEeCCHHHHHHHHHHhCCcEECCE
Confidence 48999999999999999999999998887765322 34557899999999999988888888766653
No 9
>cd00590 RRM RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Probab=97.72 E-value=0.00017 Score=52.02 Aligned_cols=69 Identities=22% Similarity=0.353 Sum_probs=56.9
Q ss_pred eEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 36 SIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 36 slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
+|+|+++|++++.++|.++++.+++...+.+.- .+..+....|||.|....++..++..+++..+.+..
T Consensus 1 ~i~i~~l~~~~~~~~i~~~~~~~g~i~~~~~~~-~~~~~~~~~~~v~f~s~~~a~~a~~~~~~~~~~~~~ 69 (74)
T cd00590 1 TLFVGNLPPDVTEEDLRELFSKFGKVESVRIVR-DKDTKSKGFAFVEFEDEEDAEKALEALNGKELGGRP 69 (74)
T ss_pred CEEEeCCCCccCHHHHHHHHHhcCCEEEEEEee-CCCCCcceEEEEEECCHHHHHHHHHHhCCCeECCeE
Confidence 589999999999999999999998888877762 222256679999999999999998888887666544
No 10
>smart00360 RRM RNA recognition motif.
Probab=97.33 E-value=0.00068 Score=48.22 Aligned_cols=65 Identities=20% Similarity=0.309 Sum_probs=54.0
Q ss_pred eeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 39 LRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 39 Ik~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
|++||+.++.++|.+++++++....+.+.......+....|+|.|....++..++..+++..+.+
T Consensus 1 i~~l~~~~~~~~l~~~f~~~g~v~~~~i~~~~~~~~~~~~a~v~f~~~~~a~~a~~~~~~~~~~~ 65 (71)
T smart00360 1 VGNLPPDVTEEELRELFSKFGKIESVRLVRDKDTGKSKGFAFVEFESEEDAEKALEALNGKELDG 65 (71)
T ss_pred CCCCCcccCHHHHHHHHHhhCCEeEEEEEeCCCCCCCCceEEEEeCCHHHHHHHHHHcCCCeeCC
Confidence 57899999999999999999988888887544345666799999999999998888888766654
No 11
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=97.02 E-value=0.0037 Score=62.17 Aligned_cols=75 Identities=21% Similarity=0.293 Sum_probs=61.8
Q ss_pred CCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEe-cCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 32 HRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAI-ADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 32 h~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsL-SDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
-..++|||.+||++++.++|.++++.++-+..+-| -|+...+ --..|||.|....++..|+..||++.+.+-.+.
T Consensus 105 ~~~~~LfVgnLp~~~te~~L~~lF~~~G~V~~v~i~~d~~tg~-srGyaFVeF~~~e~A~~Ai~~LnG~~l~gr~i~ 180 (346)
T TIGR01659 105 NSGTNLIVNYLPQDMTDRELYALFRTIGPINTCRIMRDYKTGY-SFGYAFVDFGSEADSQRAIKNLNGITVRNKRLK 180 (346)
T ss_pred CCCcEEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCCCc-cCcEEEEEEccHHHHHHHHHHcCCCccCCceee
Confidence 35679999999999999999999999998888876 4555433 235799999999999999999999888775543
No 12
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=96.95 E-value=0.0023 Score=61.92 Aligned_cols=73 Identities=19% Similarity=0.339 Sum_probs=61.1
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
.+.|||++||++++-++|.+++++++-...+.|- |+.. .+--..|||.|....++..|+..||+..+.+-.+.
T Consensus 3 ~~~l~V~nLp~~~~e~~l~~~F~~~G~i~~v~i~~d~~~-g~s~g~afV~f~~~~~A~~Ai~~l~g~~l~g~~i~ 76 (352)
T TIGR01661 3 KTNLIVNYLPQTMTQEEIRSLFTSIGEIESCKLVRDKVT-GQSLGYGFVNYVRPEDAEKAVNSLNGLRLQNKTIK 76 (352)
T ss_pred CcEEEEeCCCCCCCHHHHHHHHHccCCEEEEEEEEcCCC-CccceEEEEEECcHHHHHHHHhhcccEEECCeeEE
Confidence 6789999999999999999999999988888874 3332 23446899999999999999999999888876654
No 13
>TIGR01661 ELAV_HUD_SF ELAV/HuD family splicing factor. These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Probab=96.85 E-value=0.0048 Score=59.67 Aligned_cols=73 Identities=21% Similarity=0.249 Sum_probs=59.6
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
++.+|||++||+.++.++|.++++.++...++-+- |+... +---.|+|.|+...++..|+..||+..+.+-.+
T Consensus 268 ~~~~lfV~NL~~~~~e~~L~~~F~~fG~v~~v~i~~d~~t~-~skG~aFV~F~~~~~A~~Ai~~lnG~~~~gr~i 341 (352)
T TIGR01661 268 AGYCIFVYNLSPDTDETVLWQLFGPFGAVQNVKIIRDLTTN-QCKGYGFVSMTNYDEAAMAILSLNGYTLGNRVL 341 (352)
T ss_pred CCcEEEEeCCCCCCCHHHHHHHHHhCCCeEEEEEeEcCCCC-CccceEEEEECCHHHHHHHHHHhCCCEECCeEE
Confidence 34479999999999999999999999999998874 44322 222468999999999999999999988877444
No 14
>TIGR01659 sex-lethal sex-lethal family splicing factor. This model describes the sex-lethal family of splicing factors found in Dipteran insects. The sex-lethal phenotype, however, may be limited to the Melanogasters and closely related species. In Drosophila the protein acts as an inhibitor of splicing. This subfamily is most closely related to the ELAV/HUD subfamily of splicing factors (TIGR01661).
Probab=96.52 E-value=0.0077 Score=59.92 Aligned_cols=71 Identities=20% Similarity=0.330 Sum_probs=57.3
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
+.+.|||.+||++++.++|.+++++++....+.+- |.... +--..|+|.|....++.+|+..||+..+.+.
T Consensus 192 ~~~~lfV~nLp~~vtee~L~~~F~~fG~V~~v~i~~d~~tg-~~kG~aFV~F~~~e~A~~Ai~~lng~~~~g~ 263 (346)
T TIGR01659 192 KDTNLYVTNLPRTITDDQLDTIFGKYGQIVQKNILRDKLTG-TPRGVAFVRFNKREEAQEAISALNNVIPEGG 263 (346)
T ss_pred ccceeEEeCCCCcccHHHHHHHHHhcCCEEEEEEeecCCCC-ccceEEEEEECCHHHHHHHHHHhCCCccCCC
Confidence 45679999999999999999999999998877763 32211 2225899999999999999999999877653
No 15
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.40 E-value=0.0015 Score=39.20 Aligned_cols=23 Identities=30% Similarity=0.840 Sum_probs=21.7
Q ss_pred ecCCCCcccccChhhHHhhhhhcc
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKH 329 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH 329 (339)
|.|+.|+|.|..+.-+.+|++. |
T Consensus 1 y~C~~C~~~f~~~~~l~~H~~~-H 23 (23)
T PF00096_consen 1 YKCPICGKSFSSKSNLKRHMRR-H 23 (23)
T ss_dssp EEETTTTEEESSHHHHHHHHHH-H
T ss_pred CCCCCCCCccCCHHHHHHHHhH-C
Confidence 7899999999999999999987 5
No 16
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=96.39 E-value=0.01 Score=60.58 Aligned_cols=73 Identities=12% Similarity=0.162 Sum_probs=60.3
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEe-cCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAI-ADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsL-SDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
+..|||.+||..++-++|.++++.++.+..+-| -|+... +....|+|.|....++..|+..||+..+.+..|.
T Consensus 295 ~~~l~v~nlp~~~~~~~l~~~f~~~G~i~~~~~~~~~~~g-~~~g~afv~f~~~~~a~~A~~~l~g~~~~~~~l~ 368 (509)
T TIGR01642 295 KDRIYIGNLPLYLGEDQIKELLESFGDLKAFNLIKDIATG-LSKGYAFCEYKDPSVTDVAIAALNGKDTGDNKLH 368 (509)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEEecCCCC-CcCeEEEEEECCHHHHHHHHHHcCCCEECCeEEE
Confidence 468999999999999999999999999888776 344322 2345799999999888899999999988876654
No 17
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=96.29 E-value=0.011 Score=59.61 Aligned_cols=73 Identities=21% Similarity=0.323 Sum_probs=59.9
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
+..|||.+||++++.++|.++++.++....+.|. |+... +--..|+|.|....++..|+..||+..|.+-.+.
T Consensus 186 ~~~l~v~nl~~~~te~~l~~~f~~~G~i~~v~~~~d~~~g-~~~g~afV~f~~~e~A~~A~~~l~g~~i~g~~i~ 259 (457)
T TIGR01622 186 FLKLYVGNLHFNITEQELRQIFEPFGDIEDVQLHRDPETG-RSKGFGFIQFHDAEEAKEALEVMNGFELAGRPIK 259 (457)
T ss_pred CCEEEEcCCCCCCCHHHHHHHHHhcCCeEEEEEEEcCCCC-ccceEEEEEECCHHHHHHHHHhcCCcEECCEEEE
Confidence 6789999999999999999999999998888774 44321 2224799999999999999999999888775543
No 18
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=96.27 E-value=0.025 Score=60.01 Aligned_cols=98 Identities=19% Similarity=0.221 Sum_probs=70.1
Q ss_pred hhHHHhhhccCCc------cCCCCCCCCCCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEe-cCCCCCCCce
Q psy4687 4 LLNLMMLNSYNNI------INGGESKNEPPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAI-ADPQPDRRWF 76 (339)
Q Consensus 4 ~~~~~~~~~~~~~------~~~~~~~~~~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsL-SDP~p~krf~ 76 (339)
+..|+-+|.|.-. .+|+|...-.. .....++.|||.+||++++-++|.++|.+++-...+-| -| ...+--
T Consensus 23 ~~a~~~~~gy~~~~~~g~r~~g~Pp~~~~~-~~p~~~~~lFVgnLp~~~tEd~L~~~F~~~G~I~~vrl~~D--~sG~sR 99 (578)
T TIGR01648 23 LKALLERTGYTLVQENGQRKYGGPPPGWSG-VQPGRGCEVFVGKIPRDLYEDELVPLFEKAGPIYELRLMMD--FSGQNR 99 (578)
T ss_pred HHHHHHhhCccccccCCcccCCCCCCcccC-CCCCCCCEEEeCCCCCCCCHHHHHHHHHhhCCEEEEEEEEC--CCCCcc
Confidence 5566667777532 34444333111 12346799999999999999999999999987666655 33 222334
Q ss_pred EEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 77 RRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 77 R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
..|+|+|....++..|+..||+..++..
T Consensus 100 GfaFV~F~~~e~A~~Ai~~lng~~i~~G 127 (578)
T TIGR01648 100 GYAFVTFCGKEEAKEAVKLLNNYEIRPG 127 (578)
T ss_pred ceEEEEeCCHHHHHHHHHHcCCCeecCC
Confidence 5899999999999999999998888643
No 19
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=96.21 E-value=0.021 Score=58.75 Aligned_cols=72 Identities=14% Similarity=0.239 Sum_probs=58.0
Q ss_pred cCCCCeEEeeecCCCCCHHHHHHHHhhcCC--eeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 31 LHRTSSIFLRNLAPTITKAEVENLCKRYSG--FLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 31 lh~t~slfIk~IpP~isr~ele~~ck~~~G--F~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
...+..|||++||++++-++|.++++++++ +..+-+- |.... -...|.|.|....++.+|+..||+..|.+-
T Consensus 391 ~~ps~~L~v~NLp~~~tee~L~~lF~~~G~~~i~~ik~~-~~~~~-~~~~gfVeF~~~e~A~~Al~~ln~~~l~~~ 464 (481)
T TIGR01649 391 QPPSATLHLSNIPLSVSEEDLKELFAENGVHKVKKFKFF-PKDNE-RSKMGLLEWESVEDAVEALIALNHHQLNEP 464 (481)
T ss_pred CCCCcEEEEecCCCCCCHHHHHHHHHhcCCccceEEEEe-cCCCC-cceeEEEEcCCHHHHHHHHHHhcCCccCCC
Confidence 345678999999999999999999999998 6666664 22211 235799999999999999999999888763
No 20
>PLN03120 nucleic acid binding protein; Provisional
Probab=96.13 E-value=0.016 Score=55.79 Aligned_cols=68 Identities=12% Similarity=0.200 Sum_probs=56.4
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
+.+|||.+||+.++-.+|.+++..++...++.|- |.. .+ +.|+|+|....+++.++ .||+..|.|-.+
T Consensus 4 ~rtVfVgNLs~~tTE~dLrefFS~~G~I~~V~I~~d~~-~~---GfAFVtF~d~eaAe~Al-lLnG~~l~gr~V 72 (260)
T PLN03120 4 VRTVKVSNVSLKATERDIKEFFSFSGDIEYVEMQSENE-RS---QIAYVTFKDPQGAETAL-LLSGATIVDQSV 72 (260)
T ss_pred CCEEEEeCCCCCCCHHHHHHHHHhcCCeEEEEEeecCC-CC---CEEEEEeCcHHHHHHHH-HhcCCeeCCceE
Confidence 5789999999999999999999999999999883 322 22 58999999999888888 488887776443
No 21
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=96.10 E-value=0.014 Score=60.81 Aligned_cols=76 Identities=13% Similarity=0.156 Sum_probs=60.3
Q ss_pred cCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 31 LHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 31 lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
...+..|||++|+++++.++|.++|++++....+.+--.. ..+---.|+|+|....++..|+..+|+..+.+-.+.
T Consensus 282 ~~~~~~l~V~nl~~~~~~~~L~~~F~~~G~i~~~~i~~d~-~g~~~g~gfV~f~~~~~A~~A~~~~~g~~~~gk~l~ 357 (562)
T TIGR01628 282 KAQGVNLYVKNLDDTVTDEKLRELFSECGEITSAKVMLDE-KGVSRGFGFVCFSNPEEANRAVTEMHGRMLGGKPLY 357 (562)
T ss_pred ccCCCEEEEeCCCCccCHHHHHHHHHhcCCeEEEEEEECC-CCCcCCeEEEEeCCHHHHHHHHHHhcCCeeCCceeE
Confidence 4567889999999999999999999999987777663221 112224799999999999999999999877775554
No 22
>TIGR01628 PABP-1234 polyadenylate binding protein, human types 1, 2, 3, 4 family. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, or of unknown tissue range.
Probab=96.07 E-value=0.017 Score=60.11 Aligned_cols=71 Identities=27% Similarity=0.455 Sum_probs=58.5
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
.||||++||++++.++|.++|++++-...+-+. |... ++-...|+|.|....++..|+..+++..+.+-.+
T Consensus 1 ~sl~VgnLp~~vte~~L~~~F~~~G~v~~v~v~~d~~t-~~s~G~afV~F~~~~~A~~Al~~ln~~~i~gk~i 72 (562)
T TIGR01628 1 ASLYVGDLDPDVTEAKLYDLFKPFGPVLSVRVCRDSVT-RRSLGYGYVNFQNPADAERALETMNFKRLGGKPI 72 (562)
T ss_pred CeEEEeCCCCCCCHHHHHHHHHhcCCEEEEEEEecCCC-CCcceEEEEEECCHHHHHHHHHHhCCCEECCeeE
Confidence 489999999999999999999999988877773 4442 3445689999999999999999898877766443
No 23
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.00 E-value=0.0041 Score=36.80 Aligned_cols=24 Identities=33% Similarity=0.852 Sum_probs=20.7
Q ss_pred ecCCCCcccccChhhHHhhhhhcc
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKH 329 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH 329 (339)
|.|+.|++.|....=+++|+.+.|
T Consensus 1 ~~C~~C~~~~~~~~~l~~H~~~~H 24 (24)
T PF13894_consen 1 FQCPICGKSFRSKSELRQHMRTHH 24 (24)
T ss_dssp EE-SSTS-EESSHHHHHHHHHHHS
T ss_pred CCCcCCCCcCCcHHHHHHHHHhhC
Confidence 789999999999999999999876
No 24
>TIGR01648 hnRNP-R-Q heterogeneous nuclear ribonucleoprotein R, Q family. Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Probab=95.96 E-value=0.02 Score=60.68 Aligned_cols=73 Identities=22% Similarity=0.309 Sum_probs=60.6
Q ss_pred CcccCCCCeEEeeecCCCCCHHHHHHHHhhc--CCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 28 PRALHRTSSIFLRNLAPTITKAEVENLCKRY--SGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 28 ~~~lh~t~slfIk~IpP~isr~ele~~ck~~--~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
+..+.+...|||++||.+++-++|.++|+++ +-..++.+. +. .|+|.|....++..|+..||+..|.+..
T Consensus 227 ~~~~~~~k~LfVgNL~~~~tee~L~~~F~~f~~G~I~rV~~~-----rg---fAFVeF~s~e~A~kAi~~lnG~~i~Gr~ 298 (578)
T TIGR01648 227 EDVMAKVKILYVRNLMTTTTEEIIEKSFSEFKPGKVERVKKI-----RD---YAFVHFEDREDAVKAMDELNGKELEGSE 298 (578)
T ss_pred ccccccccEEEEeCCCCCCCHHHHHHHHHhcCCCceEEEEee-----cC---eEEEEeCCHHHHHHHHHHhCCCEECCEE
Confidence 3445566889999999999999999999999 667777542 33 5999999999999999999999888876
Q ss_pred ccc
Q psy4687 106 LGA 108 (339)
Q Consensus 106 l~~ 108 (339)
+..
T Consensus 299 I~V 301 (578)
T TIGR01648 299 IEV 301 (578)
T ss_pred EEE
Confidence 653
No 25
>COG0724 RNA-binding proteins (RRM domain) [General function prediction only]
Probab=95.68 E-value=0.04 Score=48.63 Aligned_cols=74 Identities=19% Similarity=0.303 Sum_probs=62.3
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
..+|||.+||+.++.++|.+++++++...++.+.-=....+.-..|.|.|....++..|+..+++..+.+..+.
T Consensus 115 ~~~l~v~nL~~~~~~~~l~~~F~~~g~~~~~~~~~d~~~~~~~g~~~v~f~~~~~~~~a~~~~~~~~~~~~~~~ 188 (306)
T COG0724 115 NNTLFVGNLPYDVTEEDLRELFKKFGPVKRVRLVRDRETGKSRGFAFVEFESEESAEKAIEELNGKELEGRPLR 188 (306)
T ss_pred CceEEEeCCCCCCCHHHHHHHHHhcCceeEEEeeeccccCccCceEEEEecCHHHHHHHHHHcCCCeECCceeE
Confidence 59999999999999999999999999998888742223346667999999999999999999988887776654
No 26
>KOG0114|consensus
Probab=95.64 E-value=0.048 Score=46.13 Aligned_cols=68 Identities=22% Similarity=0.424 Sum_probs=59.7
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
-+.-|||||+|-.||.++.-+++-+++-..-+-++---..|. .++|.|..=.+++.||..|++.-+.+
T Consensus 17 vnriLyirNLp~~ITseemydlFGkyg~IrQIRiG~~k~TrG---TAFVVYedi~dAk~A~dhlsg~n~~~ 84 (124)
T KOG0114|consen 17 VNRILYIRNLPFKITSEEMYDLFGKYGTIRQIRIGNTKETRG---TAFVVYEDIFDAKKACDHLSGYNVDN 84 (124)
T ss_pred hheeEEEecCCccccHHHHHHHhhcccceEEEEecCccCcCc---eEEEEehHhhhHHHHHHHhcccccCC
Confidence 456789999999999999999999999999999987666655 89999999999999999998765555
No 27
>KOG3152|consensus
Probab=95.57 E-value=0.0065 Score=58.09 Aligned_cols=92 Identities=21% Similarity=0.383 Sum_probs=70.1
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCC-------------CCCceEEEEEEecCcccHHHHHHhhccc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQP-------------DRRWFRRGWVTFRRDVDIKEICWNLNNI 99 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p-------------~krf~R~GWV~f~~~~~~~~~~~~L~~~ 99 (339)
+|..+||-+|||...-..|-+++..|+-.-||.|. |.. .+.-|+-|||-|..-..++.+...|||.
T Consensus 73 k~GVvylS~IPp~m~~~rlReil~~yGeVGRvylq-pE~~s~~~~r~~~~~n~~~~y~EGWvEF~~KrvAK~iAe~Lnn~ 151 (278)
T KOG3152|consen 73 KTGVVYLSNIPPYMDPVRLREILSQYGEVGRVYLQ-PEDDSKRAARKRKGGNYKKLYSEGWVEFISKRVAKRIAELLNNT 151 (278)
T ss_pred cceEEEeccCCCccCHHHHHHHHHhccccceEEec-chhhHHHHHHhhcCCCccccchhHHHHHHHHHHHHHHHHHhCCC
Confidence 88999999999999999999999999999999985 322 2355788999999999999999999998
Q ss_pred cccCcccccccccccccccccccCcccc
Q psy4687 100 RLRDCELGAIVNRDLSRRVRTVSGLTSL 127 (339)
Q Consensus 100 ~i~d~el~~~v~r~~~~~~r~~~~it~~ 127 (339)
.|.|-.-++. ++.-=.+++.|+..++
T Consensus 152 ~Iggkk~S~~--~~dlWNmKYLprFKW~ 177 (278)
T KOG3152|consen 152 PIGGKKKSPF--RDDLWNMKYLPRFKWV 177 (278)
T ss_pred ccCCCCCCch--HHhhhhhhhccCcchH
Confidence 8877442211 2222234455555543
No 28
>PLN03121 nucleic acid binding protein; Provisional
Probab=95.56 E-value=0.042 Score=52.33 Aligned_cols=68 Identities=18% Similarity=0.254 Sum_probs=56.6
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
.+.+||+.+|+|.++..+|.+|+..++-..++-|-.. ++.-..|||+|....++..|+ .|++..|.|-
T Consensus 4 ~g~TV~V~NLS~~tTE~dLrefFS~~G~I~~V~I~~D---~et~gfAfVtF~d~~aaetAl-lLnGa~l~d~ 71 (243)
T PLN03121 4 GGYTAEVTNLSPKATEKDVYDFFSHCGAIEHVEIIRS---GEYACTAYVTFKDAYALETAV-LLSGATIVDQ 71 (243)
T ss_pred CceEEEEecCCCCCCHHHHHHHHHhcCCeEEEEEecC---CCcceEEEEEECCHHHHHHHH-hcCCCeeCCc
Confidence 4579999999999999999999999999998888543 233368999999988887776 6888877663
No 29
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=95.42 E-value=0.039 Score=58.94 Aligned_cols=72 Identities=14% Similarity=0.321 Sum_probs=59.9
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCce-EEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWF-RRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~-R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
...|||.+||++++-++|.++++.++....+.|. ||.. +.+ ..|||.|....++.+|...+|+..+.|-.|.
T Consensus 204 ~~rLfVgnLp~~vteedLk~lFs~FG~I~svrl~~D~~t--gksKGfGFVeFe~~e~A~kAI~amNg~elgGr~Lr 277 (612)
T TIGR01645 204 FNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTG--RGHKGYGFIEYNNLQSQSEAIASMNLFDLGGQYLR 277 (612)
T ss_pred cceEEeecCCCCCCHHHHHHHHhhcCCeeEEEEEecCCC--CCcCCeEEEEECCHHHHHHHHHHhCCCeeCCeEEE
Confidence 4689999999999999999999999998888774 3432 223 3899999999999999999998888776653
No 30
>TIGR01649 hnRNP-L_PTB hnRNP-L/PTB/hephaestus splicing factor family. Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
Probab=95.28 E-value=0.055 Score=55.74 Aligned_cols=72 Identities=15% Similarity=0.168 Sum_probs=61.3
Q ss_pred CCCCeEEeeecCC-CCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccccc
Q psy4687 32 HRTSSIFLRNLAP-TITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELGA 108 (339)
Q Consensus 32 h~t~slfIk~IpP-~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~~ 108 (339)
+.+++|||++||+ .++-++|.+++..++...++.+-- ..+ -.|+|.|....++..|+..||+..+.+..|..
T Consensus 273 ~~~~~l~v~nL~~~~vt~~~L~~lF~~yG~V~~vki~~--~~~---g~afV~f~~~~~A~~Ai~~lng~~l~g~~l~v 345 (481)
T TIGR01649 273 GPGSVLMVSGLHQEKVNCDRLFNLFCVYGNVERVKFMK--NKK---ETALIEMADPYQAQLALTHLNGVKLFGKPLRV 345 (481)
T ss_pred CCCCEEEEeCCCCCCCCHHHHHHHHHhcCCeEEEEEEe--CCC---CEEEEEECCHHHHHHHHHHhCCCEECCceEEE
Confidence 4678999999999 699999999999999999998731 122 47999999999999999999999998876653
No 31
>PHA00616 hypothetical protein
Probab=95.08 E-value=0.0066 Score=43.33 Aligned_cols=26 Identities=19% Similarity=0.473 Sum_probs=24.8
Q ss_pred ecCCCCcccccChhhHHhhhhhcchH
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKHAE 331 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH~e 331 (339)
|.|+.|||.|.-..-+.+|+++.|.+
T Consensus 2 YqC~~CG~~F~~~s~l~~H~r~~hg~ 27 (44)
T PHA00616 2 YQCLRCGGIFRKKKEVIEHLLSVHKQ 27 (44)
T ss_pred CccchhhHHHhhHHHHHHHHHHhcCC
Confidence 78999999999999999999999987
No 32
>PF08777 RRM_3: RNA binding motif; InterPro: IPR014886 This domain is found in protein La which functions as an RNA chaperone during RNA polymerase III transcription, and can also stimulate translation initiation. It contains a five stranded beta sheet which forms an atypical RNA recognition motif []. ; PDB: 1OWX_A.
Probab=95.07 E-value=0.081 Score=44.06 Aligned_cols=60 Identities=10% Similarity=0.236 Sum_probs=38.9
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNI 99 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~ 99 (339)
++.|.|..+.+.+||++|.++|++++...||-++.=.. .|||-|+.+..++.++..+...
T Consensus 1 G~il~~~g~~~~~~re~iK~~f~~~g~V~yVD~~~G~~------~g~VRf~~~~~A~~a~~~~~~~ 60 (105)
T PF08777_consen 1 GCILKFSGLGEPTSREDIKEAFSQFGEVAYVDFSRGDT------EGYVRFKTPEAAQKALEKLKEA 60 (105)
T ss_dssp --EEEEEE--SS--HHHHHHHT-SS--EEEEE--TT-S------EEEEEESS---HHHHHHHHHHT
T ss_pred CeEEEEecCCCCcCHHHHHHHHHhcCCcceEEecCCCC------EEEEEECCcchHHHHHHHHHhc
Confidence 35688999999999999999999999999998885444 8999999999888888776543
No 33
>TIGR01642 U2AF_lg U2 snRNP auxilliary factor, large subunit, splicing factor. Members of this subfamily are found in plants, metazoa and fungi.
Probab=95.07 E-value=0.051 Score=55.41 Aligned_cols=66 Identities=14% Similarity=0.271 Sum_probs=49.3
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhc----C--------CeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRY----S--------GFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRL 101 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~----~--------GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i 101 (339)
...|||.+||++++-++|.++++.+ + .+..+-+ ...++ .|||.|....++..|+ +||++.+
T Consensus 175 ~r~lyVgnLp~~~t~~~l~~~F~~~~~~~~~~~~~~~~~v~~~~~---~~~kg---~afVeF~~~e~A~~Al-~l~g~~~ 247 (509)
T TIGR01642 175 ARRLYVGGIPPEFVEEAVVDFFNDLMIATGYHKAEDGKHVSSVNI---NKEKN---FAFLEFRTVEEATFAM-ALDSIIY 247 (509)
T ss_pred ccEEEEeCCCCCCCHHHHHHHHHHHHHhcCCCCCCCCCceEEEEE---CCCCC---EEEEEeCCHHHHhhhh-cCCCeEe
Confidence 3579999999999999999999985 1 1222222 23444 4999999998888887 5888877
Q ss_pred cCccc
Q psy4687 102 RDCEL 106 (339)
Q Consensus 102 ~d~el 106 (339)
.+..|
T Consensus 248 ~g~~l 252 (509)
T TIGR01642 248 SNVFL 252 (509)
T ss_pred eCcee
Confidence 77554
No 34
>TIGR01622 SF-CC1 splicing factor, CC1-like family. A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the model.
Probab=94.31 E-value=0.14 Score=51.66 Aligned_cols=71 Identities=15% Similarity=0.168 Sum_probs=55.8
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
..+|||.+||++++-++|.+++.+++....+-|- |+.. .+--..|+|.|....++..|+ .|++..+.+-.+
T Consensus 89 ~~~l~V~nlp~~~~~~~l~~~F~~~G~v~~v~i~~d~~~-~~skg~afVeF~~~e~A~~Al-~l~g~~~~g~~i 160 (457)
T TIGR01622 89 DRTVFVLQLALKARERDLYEFFSKVGKVRDVQCIKDRNS-RRSKGVAYVEFYDVESVIKAL-ALTGQMLLGRPI 160 (457)
T ss_pred CcEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEeecCCC-CCcceEEEEEECCHHHHHHHH-HhCCCEECCeee
Confidence 4689999999999999999999999988777773 3332 233457999999998888876 488877766443
No 35
>TIGR01645 half-pint poly-U binding splicing factor, half-pint family. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
Probab=94.20 E-value=0.1 Score=55.75 Aligned_cols=78 Identities=9% Similarity=0.178 Sum_probs=63.2
Q ss_pred cccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 29 RALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 29 ~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
.++-..+.|||.+||++++.++|.+++.+++....+.+. |+...+ --..|||.|+...++..++..+|+..+.+-.+.
T Consensus 102 ~a~~~~~rLfVGnLp~~~tEe~Lr~lF~~fG~I~sV~I~~D~~Tgk-skGfAFVeF~s~e~A~~Ai~~lnG~~i~GR~Ik 180 (612)
T TIGR01645 102 QALAIMCRVYVGSISFELREDTIRRAFDPFGPIKSINMSWDPATGK-HKGFAFVEYEVPEAAQLALEQMNGQMLGGRNIK 180 (612)
T ss_pred hhhcCCCEEEEcCCCCCCCHHHHHHHHHccCCEEEEEEeecCCCCC-cCCeEEEEeCcHHHHHHHHHhcCCeEEecceee
Confidence 334455789999999999999999999999999888874 554432 234799999999999999999999888776543
No 36
>KOG0117|consensus
Probab=93.77 E-value=0.13 Score=52.87 Aligned_cols=76 Identities=17% Similarity=0.292 Sum_probs=62.4
Q ss_pred CCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 27 PPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 27 ~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
.+..+-+-..||||||+-++|-+.|++++++|+-..||--- |. .|+|.|..-.++-.||.++|+..|++..+
T Consensus 252 ded~ms~VKvLYVRNL~~~tTeE~lk~~F~~~G~veRVkk~-----rD---YaFVHf~eR~davkAm~~~ngkeldG~~i 323 (506)
T KOG0117|consen 252 DEDTMSKVKVLYVRNLMESTTEETLKKLFNEFGKVERVKKP-----RD---YAFVHFAEREDAVKAMKETNGKELDGSPI 323 (506)
T ss_pred ChhhhhheeeeeeeccchhhhHHHHHHHHHhccceEEeecc-----cc---eeEEeecchHHHHHHHHHhcCceecCceE
Confidence 33355556899999999999999999999999878887643 44 79999998888888999999999998766
Q ss_pred cccc
Q psy4687 107 GAIV 110 (339)
Q Consensus 107 ~~~v 110 (339)
..+.
T Consensus 324 EvtL 327 (506)
T KOG0117|consen 324 EVTL 327 (506)
T ss_pred EEEe
Confidence 5443
No 37
>KOG1457|consensus
Probab=93.69 E-value=0.055 Score=51.23 Aligned_cols=63 Identities=27% Similarity=0.417 Sum_probs=52.4
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRL 101 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i 101 (339)
++|||-|+.|+|+-++|..+++++|||.+|-|- .|+=.-++++-|..=..+.++|..|++..|
T Consensus 211 stlfianl~~~~~ed~l~~~~~~~~gf~~l~~~----~~~g~~vaf~~~~~~~~at~am~~lqg~~~ 273 (284)
T KOG1457|consen 211 STLFIANLGPNCTEDELKQLLSRYPGFHILKIR----ARGGMPVAFADFEEIEQATDAMNHLQGNLL 273 (284)
T ss_pred hhHhhhccCCCCCHHHHHHHHHhCCCceEEEEe----cCCCcceEeecHHHHHHHHHHHHHhhccee
Confidence 689999999999999999999999999999985 334445788888777777788888876544
No 38
>smart00355 ZnF_C2H2 zinc finger.
Probab=93.33 E-value=0.049 Score=32.23 Aligned_cols=24 Identities=33% Similarity=0.724 Sum_probs=21.5
Q ss_pred ecCCCCcccccChhhHHhhhhhcch
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKHA 330 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH~ 330 (339)
|.|+.|++.|....-+..|+. .|.
T Consensus 1 ~~C~~C~~~f~~~~~l~~H~~-~H~ 24 (26)
T smart00355 1 YRCPECGKVFKSKSALKEHMR-THX 24 (26)
T ss_pred CCCCCCcchhCCHHHHHHHHH-Hhc
Confidence 679999999999999999998 654
No 39
>smart00361 RRM_1 RNA recognition motif.
Probab=93.24 E-value=0.21 Score=37.80 Aligned_cols=55 Identities=15% Similarity=0.112 Sum_probs=38.9
Q ss_pred HHHHHHHhhcCCeeEEE--ecCCCCC-CCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 49 AEVENLCKRYSGFLRVA--IADPQPD-RRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 49 ~ele~~ck~~~GF~~ls--LSDP~p~-krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
+.+.+.|.+++.+..+. +-|+... .+---.|+|+|....++..|+..||+..+.+
T Consensus 7 ~~~~~~~~~fG~v~~v~~v~~~~~~~~~~~rG~~fV~f~~~~dA~~A~~~l~g~~~~g 64 (70)
T smart00361 7 REFSEEEEYFGEVGKINKIYIDNVGYENHKRGNVYITFERSEDAARAIVDLNGRYFDG 64 (70)
T ss_pred HHHHHHHHhcCCeeEEEEEEeCCCCCCCCCcEEEEEEECCHHHHHHHHHHhCCCEECC
Confidence 44555666999999884 4443220 1111359999999999999999999877666
No 40
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.16 E-value=0.05 Score=34.15 Aligned_cols=24 Identities=21% Similarity=0.541 Sum_probs=21.5
Q ss_pred eecCCCCcccccChhhHHhhhhhc
Q psy4687 305 KWLCPLSGKKFKGPDFIRKHIFNK 328 (339)
Q Consensus 305 K~rC~~C~KlFk~~eFV~KHI~nK 328 (339)
+|-|..|+|-|.+..-+..|+..|
T Consensus 1 q~~C~~C~k~f~~~~~~~~H~~sk 24 (27)
T PF12171_consen 1 QFYCDACDKYFSSENQLKQHMKSK 24 (27)
T ss_dssp -CBBTTTTBBBSSHHHHHCCTTSH
T ss_pred CCCcccCCCCcCCHHHHHHHHccC
Confidence 488999999999999999999874
No 41
>KOG0117|consensus
Probab=92.98 E-value=0.5 Score=48.66 Aligned_cols=87 Identities=20% Similarity=0.302 Sum_probs=69.1
Q ss_pred cCCCCCCCCCCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEE-ecCCCCCCCceEEEEEEecCcccHHHHHHh
Q psy4687 17 INGGESKNEPPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVA-IADPQPDRRWFRRGWVTFRRDVDIKEICWN 95 (339)
Q Consensus 17 ~~~~~~~~~~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~ls-LSDP~p~krf~R~GWV~f~~~~~~~~~~~~ 95 (339)
=+|||+.+..+|.+ ..++-+||=.||-++.-++|.-+|.+++---=|- +-||....+ -=.++|||.....+++|...
T Consensus 67 k~ggPpP~weg~~p-~~G~EVfvGkIPrD~~EdeLvplfEkiG~I~elRLMmD~~sG~n-RGYAFVtf~~Ke~Aq~Aik~ 144 (506)
T KOG0117|consen 67 KYGGPPPGWEGPPP-PRGCEVFVGKIPRDVFEDELVPLFEKIGKIYELRLMMDPFSGDN-RGYAFVTFCTKEEAQEAIKE 144 (506)
T ss_pred ccCCCCCcccCCCC-CCCceEEecCCCccccchhhHHHHHhccceeeEEEeecccCCCC-cceEEEEeecHHHHHHHHHH
Confidence 37888888667776 7899999999999999999999999997632233 356664332 12689999999999999999
Q ss_pred hccccccCcc
Q psy4687 96 LNNIRLRDCE 105 (339)
Q Consensus 96 L~~~~i~d~e 105 (339)
|||..|+-+.
T Consensus 145 lnn~Eir~GK 154 (506)
T KOG0117|consen 145 LNNYEIRPGK 154 (506)
T ss_pred hhCccccCCC
Confidence 9999888655
No 42
>KOG0108|consensus
Probab=92.57 E-value=0.39 Score=49.52 Aligned_cols=72 Identities=15% Similarity=0.198 Sum_probs=64.2
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
.++||.+||.++|.++|..+|++.++..-+-+-..-..=+|-=.|++.|....++..+..+||+..+.+-.|
T Consensus 19 ~~v~vgnip~~~se~~l~~~~~~~g~v~s~~~v~D~~tG~~~G~~f~~~~~~~~~~~a~~~lNg~~~~gr~l 90 (435)
T KOG0108|consen 19 SSVFVGNIPYEGSEEQLLSIFSGVGPVLSFRLVYDRETGKPKGFGFCEFTDEETAERAIRNLNGAEFNGRKL 90 (435)
T ss_pred cceEecCCCCcccHHHHHHHHhccCccceeeecccccCCCcCceeeEecCchhhHHHHHHhcCCcccCCceE
Confidence 899999999999999999999999998888887777766888899999999988888888999887776554
No 43
>KOG1457|consensus
Probab=92.51 E-value=0.33 Score=46.12 Aligned_cols=67 Identities=19% Similarity=0.319 Sum_probs=54.6
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeE--EEecCCCCCCCceEEEEEEecCcccHHHHHHhhcccccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLR--VAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLR 102 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~--lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~ 102 (339)
.+||+-.+|-+|.--||..++++++||.- |-+++-. .+-.--+|+|+|..-..+-++++.||+++.+
T Consensus 35 RTLFVSGLP~DvKpREiynLFR~f~GYEgslLK~Tsk~-~~~~~pvaFatF~s~q~A~aamnaLNGvrFD 103 (284)
T KOG1457|consen 35 RTLFVSGLPNDVKPREIYNLFRRFHGYEGSLLKYTSKG-DQVCKPVAFATFTSHQFALAAMNALNGVRFD 103 (284)
T ss_pred ceeeeccCCcccCHHHHHHHhccCCCccceeeeeccCC-CccccceEEEEecchHHHHHHHHHhcCeeec
Confidence 57999999999999999999999999864 3334332 1112258999999999999999999999886
No 44
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=92.46 E-value=0.076 Score=34.70 Aligned_cols=31 Identities=23% Similarity=0.441 Sum_probs=25.5
Q ss_pred CeecCCCCcccccChhhHHhhhhhc-chHHHH
Q psy4687 304 DKWLCPLSGKKFKGPDFIRKHIFNK-HAEKVE 334 (339)
Q Consensus 304 ~K~rC~~C~KlFk~~eFV~KHI~nK-H~e~v~ 334 (339)
++|.|.+|++-|.++.-+..|+..| |...++
T Consensus 2 ~~~~C~~C~~~~~~~~~~~~H~~gk~H~~~~~ 33 (35)
T smart00451 2 GGFYCKLCNVTFTDEISVEAHLKGKKHKKNVK 33 (35)
T ss_pred cCeEccccCCccCCHHHHHHHHChHHHHHHHH
Confidence 5789999999999999999999764 555544
No 45
>KOG0127|consensus
Probab=92.36 E-value=0.28 Score=51.65 Aligned_cols=73 Identities=14% Similarity=0.202 Sum_probs=59.4
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEe-cCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAI-ADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsL-SDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
+.+||+++||++++-++|+++++.++++.+-.+ .++..+. ---.|+|+|.=..|...+...+++.++.++-+.
T Consensus 5 g~TlfV~~lp~~~~~~qL~e~FS~vGPik~~~vVt~~gs~~-~RGfgfVtFam~ED~qrA~~e~~~~kf~Gr~l~ 78 (678)
T KOG0127|consen 5 GATLFVSRLPFSSTGEQLEEFFSYVGPIKHAVVVTNKGSSE-KRGFGFVTFAMEEDVQRALAETEQSKFEGRILN 78 (678)
T ss_pred CceEEEecCCCccchhHHHHhhhcccCcceeEEecCCCccc-ccCccceeeehHhHHHHHHHHhhcCcccceecc
Confidence 378999999999999999999999999887555 5555421 112599999999999999998998888887665
No 46
>PF03467 Smg4_UPF3: Smg-4/UPF3 family; InterPro: IPR005120 Nonsense-mediated mRNA decay (NMD) is a surveillance mechanism by which eukaryotic cells detect and degrade transcripts containing premature termination codons. Three 'up-frameshift' proteins, UPF1, UPF2 and UPF3, are essential for this process in organisms ranging from yeast, human to plants []. Exon junction complexes (EJCs) are deposited ~24 nucleotides upstream of exon-exon junctions after splicing. Translation causes displacement of the EJCs, however, premature translation termination upstream of one or more EJCs triggers the recruitment of UPF1, UPF2 and UPF3 and activates the NMD pathway [, ]. This family contains UPF3. The crystal structure of the complex between human UPF2 and UPF3b, which are, respectively, a MIF4G (middle portion of eIF4G) domain and an RNP domain (ribonucleoprotein-type RNA-binding domain) has been determined to 1.95A. The protein-protein interface is mediated by highly conserved charged residues in UPF2 and UPF3b and involves the beta-sheet surface of the UPF3b ribonucleoprotein (RNP) domain, which is generally used by these domains to bind nucleic acids. In UPF3b the RNP domain does not bind RNA, whereas the UPF2 construct and the complex do. It is clear that some RNP domains have evolved for specific protein-protein interactions rather than as nucleic acid binding modules [].; PDB: 1UW4_A 2L08_A.
Probab=92.27 E-value=0.17 Score=45.77 Aligned_cols=68 Identities=13% Similarity=0.212 Sum_probs=48.3
Q ss_pred CeEEeeecCCCCCHHHHHHHHhh-cCCe---eEEE--ecCCCCCCCceEEEEEEecCcccHHHHHHhhcccccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKR-YSGF---LRVA--IADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLR 102 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~-~~GF---~~ls--LSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~ 102 (339)
.-|.||+|||+++.+++.+.++. +++. .++. .++..-...-+-+++|.|....++.+.+..+++....
T Consensus 8 ~KvVIR~LPP~LteeeF~~~i~~~l~~~~~w~y~~g~~~~~~~~~~~~SRaYi~F~~~~~~~~F~~~~~g~~F~ 81 (176)
T PF03467_consen 8 TKVVIRRLPPNLTEEEFWEQISPWLPDEWDWYYFQGKYGKKSFKPPTYSRAYINFKNPEDLLEFRDRFDGHVFV 81 (176)
T ss_dssp -EEEEEEE-TTS-HHHHCCCCSS--SSE---EEEEEEES-SSSTTS--EEEEEEESSCHHHHHHHHHCTTEEEE
T ss_pred ceEEEeCCCCCCCHHHHHHHhhhhcccccceEEEecCCCCccCCCCcceEEEEEeCCHHHHHHHHHhcCCcEEE
Confidence 46889999999999999997776 5554 5665 5665555566788999999999999888888874443
No 47
>PLN03213 repressor of silencing 3; Provisional
Probab=92.07 E-value=0.47 Score=49.57 Aligned_cols=72 Identities=7% Similarity=0.124 Sum_probs=60.5
Q ss_pred CCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCc--ccHHHHHHhhccccccCcccc
Q psy4687 32 HRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRD--VDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 32 h~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~--~~~~~~~~~L~~~~i~d~el~ 107 (339)
-.+.+|||-+|+.+|+.++|.+++.+++.+.+|.|- ..+. -.-|+|.|..+ .++..+...||+....|..|.
T Consensus 8 ~~gMRIYVGNLSydVTEDDLravFSeFGsVkdVEIp---RETG-RGFAFVEMssdddaEeeKAISaLNGAEWKGR~LK 81 (759)
T PLN03213 8 GGGVRLHVGGLGESVGRDDLLKIFSPMGTVDAVEFV---RTKG-RSFAYIDFSPSSTNSLTKLFSTYNGCVWKGGRLR 81 (759)
T ss_pred CcceEEEEeCCCCCCCHHHHHHHHHhcCCeeEEEEe---cccC-CceEEEEecCCcHHHHHHHHHHhcCCeecCceeE
Confidence 356899999999999999999999999999999994 2222 45689999987 568888899999888887764
No 48
>KOG4206|consensus
Probab=91.98 E-value=0.8 Score=43.19 Aligned_cols=70 Identities=16% Similarity=0.302 Sum_probs=58.6
Q ss_pred CCCeEEeeecCCCCCHHHHHH----HHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 33 RTSSIFLRNLAPTITKAEVEN----LCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~----~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
...+|+|++++--|.+++|-. +++++++..-++..-+.+-|. -+||+|+....+..|+.+|+|...-|=.
T Consensus 8 pn~TlYInnLnekI~~~elkrsL~~LFsqfG~ildI~a~kt~KmRG---QA~VvFk~~~~As~A~r~l~gfpFygK~ 81 (221)
T KOG4206|consen 8 PNGTLYINNLNEKIKKDELKRSLYLLFSQFGKILDISAFKTPKMRG---QAFVVFKETEAASAALRALQGFPFYGKP 81 (221)
T ss_pred CCceEeehhccccccHHHHHHHHHHHHHhhCCeEEEEecCCCCccC---ceEEEecChhHHHHHHHHhcCCcccCch
Confidence 334999999999999999988 999999988887776777666 8999999988888888899987665533
No 49
>PF04059 RRM_2: RNA recognition motif 2; InterPro: IPR007201 This RNA recognition motif 2 is found in Meiosis protein mei2. It is found C-terminal to the RNA-binding region RNP-1 (IPR000504 from INTERPRO).
Probab=91.90 E-value=0.76 Score=38.01 Aligned_cols=69 Identities=10% Similarity=0.136 Sum_probs=49.8
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhc--CCeeEEEe-cCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRY--SGFLRVAI-ADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~--~GF~~lsL-SDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
++|+||+||...|++.|.+++.+. +.|..|.| -|.... .=.-.|+|-|.....+..-....++.+...+
T Consensus 2 TTvMirNIPn~~t~~~L~~~l~~~~~g~yDF~YLPiDf~~~-~N~GYAFVNf~~~~~~~~F~~~f~g~~w~~~ 73 (97)
T PF04059_consen 2 TTVMIRNIPNKYTQEMLIQILDEHFKGKYDFFYLPIDFKNK-CNLGYAFVNFTSPQAAIRFYKAFNGKKWPNF 73 (97)
T ss_pred eeEEEecCCCCCCHHHHHHHHHHhccCcceEEEeeeeccCC-CceEEEEEEcCCHHHHHHHHHHHcCCccccC
Confidence 689999999999999999988876 34666666 222222 2234788999888777777777777766543
No 50
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=91.55 E-value=0.11 Score=31.65 Aligned_cols=23 Identities=26% Similarity=0.668 Sum_probs=21.2
Q ss_pred ecCCCCcccccChhhHHhhhhhc
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNK 328 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nK 328 (339)
|.|.+|++.|.++.-++.|+..|
T Consensus 1 ~~C~~C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDICNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETTTTEEESSHHHHHHHHTTH
T ss_pred CCCCCCCCCcCCHHHHHHHHCcC
Confidence 78999999999999999999764
No 51
>KOG0122|consensus
Probab=91.26 E-value=0.93 Score=43.53 Aligned_cols=75 Identities=19% Similarity=0.238 Sum_probs=61.3
Q ss_pred CCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 32 HRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 32 h~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
+-+++|-|-|+|-+....+|++++..++++.|++|.==...-.+-=-|+|+|..-.++..|...||+...+.--|
T Consensus 187 ~D~~tvRvtNLsed~~E~dL~eLf~~fg~i~rvylardK~TG~~kGFAFVtF~sRddA~rAI~~LnG~gyd~LIL 261 (270)
T KOG0122|consen 187 DDEATVRVTNLSEDMREDDLEELFRPFGPITRVYLARDKETGLSKGFAFVTFESRDDAARAIADLNGYGYDNLIL 261 (270)
T ss_pred CccceeEEecCccccChhHHHHHhhccCccceeEEEEccccCcccceEEEEEecHHHHHHHHHHccCcccceEEE
Confidence 467889999999999999999999999999999997443333444469999999999888888899876555333
No 52
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=91.03 E-value=0.13 Score=31.78 Aligned_cols=25 Identities=24% Similarity=0.440 Sum_probs=22.5
Q ss_pred ecCCCCcccccChhhHHhhhhhcch
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKHA 330 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH~ 330 (339)
|.|..|++.|....=+..|++..|+
T Consensus 2 ~~C~~C~~~F~~~~~l~~H~~~h~~ 26 (27)
T PF13912_consen 2 FECDECGKTFSSLSALREHKRSHCS 26 (27)
T ss_dssp EEETTTTEEESSHHHHHHHHCTTTT
T ss_pred CCCCccCCccCChhHHHHHhHHhcC
Confidence 7899999999999999999977653
No 53
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=90.96 E-value=0.15 Score=35.38 Aligned_cols=29 Identities=28% Similarity=0.457 Sum_probs=22.0
Q ss_pred cCCeecCCCCcccccCh----hhHHhhhhhcch
Q psy4687 302 STDKWLCPLSGKKFKGP----DFIRKHIFNKHA 330 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~----eFV~KHI~nKH~ 330 (339)
+..+.+|+.|+|.|++. .=+..||.++||
T Consensus 13 ~~~~a~C~~C~~~~~~~~~~ts~l~~HL~~~hp 45 (45)
T PF02892_consen 13 DKKKAKCKYCGKVIKYSSGGTSNLKRHLKKKHP 45 (45)
T ss_dssp CSS-EEETTTTEE-----SSTHHHHHHHHHTTH
T ss_pred CcCeEEeCCCCeEEeeCCCcHHHHHHhhhhhCc
Confidence 57899999999999984 788999999997
No 54
>KOG4212|consensus
Probab=89.52 E-value=0.62 Score=48.00 Aligned_cols=75 Identities=21% Similarity=0.278 Sum_probs=64.9
Q ss_pred CCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 27 PPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 27 ~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
-+-+..|.+|||||++|-++|=+-|..-+++++-.+|.-+-++=.+ .|-|-|.+..++.-||..+++.++++-++
T Consensus 529 a~gaarKa~qIiirNlP~dfTWqmlrDKfre~G~v~yadime~Gks-----kGVVrF~s~edAEra~a~Mngs~l~Gr~I 603 (608)
T KOG4212|consen 529 AVGAARKACQIIIRNLPFDFTWQMLRDKFREIGHVLYADIMENGKS-----KGVVRFFSPEDAERACALMNGSRLDGRNI 603 (608)
T ss_pred cccccccccEEEEecCCccccHHHHHHHHHhccceehhhhhccCCc-----cceEEecCHHHHHHHHHHhccCcccCcee
Confidence 4566778899999999999999999999999999999888666665 56888999999999999999999987544
No 55
>KOG0121|consensus
Probab=88.47 E-value=1.4 Score=38.73 Aligned_cols=77 Identities=16% Similarity=0.234 Sum_probs=68.1
Q ss_pred CCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 27 PPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 27 ~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
.-+++-+.++|++-||+--.+-++|-+++.+-+-.+++-++=-.-.+.+-=-++|.|-...++..|+.-|+++++++
T Consensus 29 ~~~a~r~S~tvyVgNlSfyttEEqiyELFs~cG~irriiMGLdr~kktpCGFCFVeyy~~~dA~~AlryisgtrLdd 105 (153)
T KOG0121|consen 29 QLEALRKSCTVYVGNLSFYTTEEQIYELFSKCGDIRRIIMGLDRFKKTPCGFCFVEYYSRDDAEDALRYISGTRLDD 105 (153)
T ss_pred HHHHHhhcceEEEeeeeeeecHHHHHHHHHhccchheeEeccccCCcCccceEEEEEecchhHHHHHHHhccCcccc
Confidence 45678889999999999999999999999999999999998655666777788999988888888888899999887
No 56
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=88.04 E-value=0.33 Score=37.99 Aligned_cols=28 Identities=25% Similarity=0.695 Sum_probs=23.7
Q ss_pred eecCCCCcccccChhhHHhhhhhc-chHH
Q psy4687 305 KWLCPLSGKKFKGPDFIRKHIFNK-HAEK 332 (339)
Q Consensus 305 K~rC~~C~KlFk~~eFV~KHI~nK-H~e~ 332 (339)
.|.|..|++.|+..+=+..||..+ |...
T Consensus 50 ~~~C~~C~~~f~s~~~l~~Hm~~~~H~~~ 78 (100)
T PF12756_consen 50 SFRCPYCNKTFRSREALQEHMRSKHHKKR 78 (100)
T ss_dssp SEEBSSSS-EESSHHHHHHHHHHTTTTC-
T ss_pred CCCCCccCCCCcCHHHHHHHHcCccCCCc
Confidence 899999999999999999999976 5544
No 57
>KOG4206|consensus
Probab=87.83 E-value=1.4 Score=41.62 Aligned_cols=66 Identities=17% Similarity=0.371 Sum_probs=58.7
Q ss_pred CCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhcccccc
Q psy4687 32 HRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLR 102 (339)
Q Consensus 32 h~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~ 102 (339)
.....||+-+||.....+-++.+...++||+.+.+-++.+. +++|.|..++-...+...|++.+|.
T Consensus 144 ppn~ilf~~niP~es~~e~l~~lf~qf~g~keir~i~~~~~-----iAfve~~~d~~a~~a~~~lq~~~it 209 (221)
T KOG4206|consen 144 PPNNILFLTNIPSESESEMLSDLFEQFPGFKEIRLIPPRSG-----IAFVEFLSDRQASAAQQALQGFKIT 209 (221)
T ss_pred CCceEEEEecCCcchhHHHHHHHHhhCcccceeEeccCCCc-----eeEEecchhhhhHHHhhhhccceec
Confidence 44578999999999999999999999999999999999887 9999999998777777788887776
No 58
>KOG0116|consensus
Probab=87.52 E-value=1.5 Score=45.12 Aligned_cols=65 Identities=23% Similarity=0.400 Sum_probs=47.4
Q ss_pred EEeeecCCCCCHHHHHHHHhhc-----CCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 37 IFLRNLAPTITKAEVENLCKRY-----SGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 37 lfIk~IpP~isr~ele~~ck~~-----~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
||+++||++++-.+|+++++.+ .|+...+.. .+.. =-|+|.|..+.+++.++.+ +-+.|.+..|.
T Consensus 291 i~V~nlP~da~~~~l~~~Fk~FG~Ik~~~I~vr~~~----~~~~-~fgFV~f~~~~~~~~~i~A-sp~~ig~~kl~ 360 (419)
T KOG0116|consen 291 IFVKNLPPDATPAELEEVFKQFGPIKEGGIQVRSPG----GKNP-CFGFVEFENAAAVQNAIEA-SPLEIGGRKLN 360 (419)
T ss_pred eEeecCCCCCCHHHHHHHHhhcccccccceEEeccC----CCcC-ceEEEEEeecchhhhhhhc-CccccCCeeEE
Confidence 9999999999999999999998 577776622 2233 5799999999988876642 13334444443
No 59
>KOG0533|consensus
Probab=87.24 E-value=2 Score=41.13 Aligned_cols=74 Identities=20% Similarity=0.291 Sum_probs=62.6
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
+.+-|.|.++|..|+=.+|.+++..+....++++. +.+.-+=.=.|=|+|+...|+..++..+|++.++++.+.
T Consensus 82 ~~~~v~v~NL~~~V~~~Dl~eLF~~~~~~~r~~vh-y~~~G~s~Gta~v~~~r~~DA~~avk~~~gv~ldG~~mk 155 (243)
T KOG0533|consen 82 RSTKVNVSNLPYGVIDADLKELFAEFGELKRVAVH-YDRAGRSLGTADVSFNRRDDAERAVKKYNGVALDGRPMK 155 (243)
T ss_pred CcceeeeecCCcCcchHHHHHHHHHhccceEEeec-cCCCCCCCccceeeecchHhHHHHHHHhcCcccCCceee
Confidence 33779999999999999999999999988888885 444444445789999999999999999999999988754
No 60
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=87.03 E-value=0.39 Score=34.99 Aligned_cols=27 Identities=30% Similarity=0.630 Sum_probs=19.2
Q ss_pred CeecCCCCcccccChhhHHhhhhhcchH
Q psy4687 304 DKWLCPLSGKKFKGPDFIRKHIFNKHAE 331 (339)
Q Consensus 304 ~K~rC~~C~KlFk~~eFV~KHI~nKH~e 331 (339)
+.|.||.|+|.|.. .-+..|+..+|+.
T Consensus 1 ~~f~CP~C~~~~~~-~~L~~H~~~~H~~ 27 (54)
T PF05605_consen 1 DSFTCPYCGKGFSE-SSLVEHCEDEHRS 27 (54)
T ss_pred CCcCCCCCCCccCH-HHHHHHHHhHCcC
Confidence 35788888885544 4577888888875
No 61
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=86.23 E-value=0.49 Score=29.57 Aligned_cols=20 Identities=45% Similarity=0.874 Sum_probs=17.7
Q ss_pred ecCCCCcccccChhhHHhhhh
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIF 326 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~ 326 (339)
..|+.|+.+| .++-+.||+.
T Consensus 3 ~~C~~CgR~F-~~~~l~~H~~ 22 (25)
T PF13913_consen 3 VPCPICGRKF-NPDRLEKHEK 22 (25)
T ss_pred CcCCCCCCEE-CHHHHHHHHH
Confidence 3699999999 8999999975
No 62
>PHA02768 hypothetical protein; Provisional
Probab=85.71 E-value=0.45 Score=35.60 Aligned_cols=24 Identities=17% Similarity=0.502 Sum_probs=21.9
Q ss_pred ecCCCCcccccChhhHHhhhhhcc
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKH 329 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH 329 (339)
|.|+.|||.|.-..-+..|+++-+
T Consensus 6 y~C~~CGK~Fs~~~~L~~H~r~H~ 29 (55)
T PHA02768 6 YECPICGEIYIKRKSMITHLRKHN 29 (55)
T ss_pred cCcchhCCeeccHHHHHHHHHhcC
Confidence 789999999999999999999833
No 63
>KOG0107|consensus
Probab=85.48 E-value=2.9 Score=38.45 Aligned_cols=65 Identities=22% Similarity=0.340 Sum_probs=53.9
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCC--eeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSG--FLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~G--F~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
.+.+++=++++..++.|||.++..++- =.|||- +|. . -++|.|+...|+.+|+-.||+.+|-+..
T Consensus 10 ~~kVYVGnL~~~a~k~eLE~~F~~yG~lrsvWvAr---nPP-G---fAFVEFed~RDA~DAvr~LDG~~~cG~r 76 (195)
T KOG0107|consen 10 NTKVYVGNLGSRATKRELERAFSKYGPLRSVWVAR---NPP-G---FAFVEFEDPRDAEDAVRYLDGKDICGSR 76 (195)
T ss_pred CceEEeccCCCCcchHHHHHHHHhcCcceeEEEee---cCC-C---ceEEeccCcccHHHHHhhcCCccccCce
Confidence 689999999999999999999999975 356665 332 2 4789999999999999999998877744
No 64
>KOG3993|consensus
Probab=84.89 E-value=0.32 Score=49.81 Aligned_cols=29 Identities=31% Similarity=0.658 Sum_probs=25.8
Q ss_pred cCCeecCCCCcccccChhhHHhhhhhcch
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIFNKHA 330 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~nKH~ 330 (339)
.++-|-|..|+|+|+-.-|++||...-|.
T Consensus 353 s~gi~~C~~C~KkFrRqAYLrKHqlthq~ 381 (500)
T KOG3993|consen 353 SSGIFSCHTCGKKFRRQAYLRKHQLTHQR 381 (500)
T ss_pred cCceeecHHhhhhhHHHHHHHHhHHhhhc
Confidence 57799999999999999999999876654
No 65
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=84.53 E-value=0.51 Score=28.49 Aligned_cols=24 Identities=17% Similarity=0.463 Sum_probs=19.0
Q ss_pred ecCCCCcccccChhhHHhhhhhcch
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKHA 330 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH~ 330 (339)
|+|+.|+=... +.-+.+|++.+||
T Consensus 1 y~C~~C~y~t~-~~~l~~H~~~~H~ 24 (24)
T PF13909_consen 1 YKCPHCSYSTS-KSNLKRHLKRHHP 24 (24)
T ss_dssp EE-SSSS-EES-HHHHHHHHHHHHS
T ss_pred CCCCCCCCcCC-HHHHHHHHHhhCc
Confidence 78999997777 7789999999886
No 66
>PF13893 RRM_5: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); PDB: 2PE8_A 2PEH_A 2D9O_A 1A9N_D 2DIT_A 3S6E_A 2LQ5_A 1SJQ_A 2AD9_A 1X4D_A ....
Probab=82.68 E-value=2.4 Score=30.20 Aligned_cols=51 Identities=16% Similarity=0.202 Sum_probs=41.1
Q ss_pred HHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 51 VENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 51 le~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
|-++|+++|-...+.+.... -..++|.|....++..+...||+..+.|..+
T Consensus 1 L~~~f~~fG~V~~i~~~~~~-----~~~a~V~f~~~~~A~~a~~~l~~~~~~g~~l 51 (56)
T PF13893_consen 1 LYKLFSKFGEVKKIKIFKKK-----RGFAFVEFASVEDAQKAIEQLNGRQFNGRPL 51 (56)
T ss_dssp HHHHHTTTS-EEEEEEETTS-----TTEEEEEESSHHHHHHHHHHHTTSEETTEEE
T ss_pred ChHHhCCcccEEEEEEEeCC-----CCEEEEEECCHHHHHHHHHHhCCCEECCcEE
Confidence 45789999999999997555 2479999999999999999999988877544
No 67
>KOG4207|consensus
Probab=82.47 E-value=5.1 Score=37.88 Aligned_cols=82 Identities=18% Similarity=0.227 Sum_probs=64.4
Q ss_pred CcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEe-cCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 28 PRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAI-ADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 28 ~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsL-SDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
|-..--+++|.|.||---.|-.+|-.++++++-.-=|.| -|+.. +.=-.-|+|-|+---++.+|+.+|++.-|++-+|
T Consensus 7 PPdv~gm~SLkVdNLTyRTspd~LrrvFekYG~vgDVyIPrdr~T-r~sRgFaFVrf~~k~daedA~damDG~~ldgRel 85 (256)
T KOG4207|consen 7 PPDVEGMTSLKVDNLTYRTSPDDLRRVFEKYGRVGDVYIPRDRYT-RQSRGFAFVRFHDKRDAEDALDAMDGAVLDGREL 85 (256)
T ss_pred CCCcccceeEEecceeccCCHHHHHHHHHHhCcccceeccccccc-ccccceeEEEeeecchHHHHHHhhcceeecccee
Confidence 334467899999999999999999999999987666666 33332 1222358999999999999999999999998887
Q ss_pred cccc
Q psy4687 107 GAIV 110 (339)
Q Consensus 107 ~~~v 110 (339)
-+++
T Consensus 86 rVq~ 89 (256)
T KOG4207|consen 86 RVQM 89 (256)
T ss_pred eehh
Confidence 5443
No 68
>KOG0127|consensus
Probab=81.70 E-value=3.5 Score=43.72 Aligned_cols=70 Identities=16% Similarity=0.285 Sum_probs=57.1
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
.-|.|||+|=.|+..+|..+|..++-|.-+.|- -.+.-+.-=-|+|.|..-.++..++..+|+..|.+-.
T Consensus 118 ~rLIIRNLPf~~k~~dLk~vFs~~G~V~Ei~IP-~k~dgklcGFaFV~fk~~~dA~~Al~~~N~~~i~gR~ 187 (678)
T KOG0127|consen 118 WRLIIRNLPFKCKKPDLKNVFSNFGKVVEIVIP-RKKDGKLCGFAFVQFKEKKDAEKALEFFNGNKIDGRP 187 (678)
T ss_pred ceEEeecCCcccCcHHHHHHHhhcceEEEEEcc-cCCCCCccceEEEEEeeHHHHHHHHHhccCceecCce
Confidence 568899999999999999999999999888874 2222233356899999999999999999988887633
No 69
>KOG0109|consensus
Probab=78.15 E-value=2.2 Score=42.09 Aligned_cols=87 Identities=22% Similarity=0.372 Sum_probs=66.3
Q ss_pred CCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccccccc
Q psy4687 32 HRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELGAIVN 111 (339)
Q Consensus 32 h~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~~~v~ 111 (339)
..++.|++=||.|.|+-+|+.+.+.+++-..- -|-+.. .|+|.|+...++-++...|||..+.+--++.++
T Consensus 76 k~stkl~vgNis~tctn~ElRa~fe~ygpvie---cdivkd-----y~fvh~d~~eda~~air~l~~~~~~gk~m~vq~- 146 (346)
T KOG0109|consen 76 KASTKLHVGNISPTCTNQELRAKFEKYGPVIE---CDIVKD-----YAFVHFDRAEDAVEAIRGLDNTEFQGKRMHVQL- 146 (346)
T ss_pred CCccccccCCCCccccCHHHhhhhcccCCcee---eeeecc-----eeEEEEeeccchHHHHhcccccccccceeeeee-
Confidence 34678999999999999999997777653221 122222 799999999999988888999888887777554
Q ss_pred cccccccccccCcccchh
Q psy4687 112 RDLSRRVRTVSGLTSLKP 129 (339)
Q Consensus 112 r~~~~~~r~~~~it~~~~ 129 (339)
.+.|+|+.||+--.-.
T Consensus 147 --stsrlrtapgmgDq~~ 162 (346)
T KOG0109|consen 147 --STSRLRTAPGMGDQSG 162 (346)
T ss_pred --eccccccCCCCCCHHH
Confidence 4678999999764443
No 70
>PHA00733 hypothetical protein
Probab=76.14 E-value=1.7 Score=37.35 Aligned_cols=27 Identities=33% Similarity=0.724 Sum_probs=24.9
Q ss_pred CeecCCCCcccccChhhHHhhhhhcch
Q psy4687 304 DKWLCPLSGKKFKGPDFIRKHIFNKHA 330 (339)
Q Consensus 304 ~K~rC~~C~KlFk~~eFV~KHI~nKH~ 330 (339)
..|.|+.|+|-|....-+..|+..||.
T Consensus 98 ~~~~C~~CgK~F~~~~sL~~H~~~~h~ 124 (128)
T PHA00733 98 HSKVCPVCGKEFRNTDSTLDHVCKKHN 124 (128)
T ss_pred cCccCCCCCCccCCHHHHHHHHHHhcC
Confidence 358999999999999999999999996
No 71
>KOG0717|consensus
Probab=75.13 E-value=1.3 Score=45.91 Aligned_cols=34 Identities=32% Similarity=0.586 Sum_probs=31.5
Q ss_pred ecCCCCcccccChhhHHhhhh-hcchHHHHHhhcC
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIF-NKHAEKVEEVKKE 339 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~-nKH~e~v~~v~~e 339 (339)
.-|..|+|.||...=+..|+. +||-+.|.++|||
T Consensus 293 lyC~vCnKsFKseKq~kNHEnSKKHkenv~eLrqe 327 (508)
T KOG0717|consen 293 LYCVVCNKSFKSEKQLKNHENSKKHKENVAELRQE 327 (508)
T ss_pred eEEeeccccccchHHHHhhHHHHHHHHHHHHHHHH
Confidence 889999999999999999997 5799999999975
No 72
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=74.78 E-value=1.3 Score=33.52 Aligned_cols=34 Identities=18% Similarity=0.236 Sum_probs=27.7
Q ss_pred hccCCeecCCCCcccccChhhHHhhhhhcchHHH
Q psy4687 300 EISTDKWLCPLSGKKFKGPDFIRKHIFNKHAEKV 333 (339)
Q Consensus 300 e~~e~K~rC~~C~KlFk~~eFV~KHI~nKH~e~v 333 (339)
...+.-++||-|+++|....=...|+.+-|.-+.
T Consensus 12 RDGE~~lrCPRC~~~FR~~K~Y~RHVNKaH~~~~ 45 (65)
T COG4049 12 RDGEEFLRCPRCGMVFRRRKDYIRHVNKAHGWLF 45 (65)
T ss_pred cCCceeeeCCchhHHHHHhHHHHHHhhHHhhhhh
Confidence 3356778999999999998888889988887554
No 73
>PF04780 DUF629: Protein of unknown function (DUF629); InterPro: IPR006865 This domain represents a region of several plant proteins of unknown function. A C2H2 zinc finger is predicted in this region in some family members, but the spacing between the cysteine residues is not conserved throughout the family.
Probab=74.74 E-value=1.6 Score=45.43 Aligned_cols=42 Identities=26% Similarity=0.552 Sum_probs=31.5
Q ss_pred HHHHHHHhhhhhccCCe-ecCCCCcccccChhhHHhhhhhcchHHHH
Q psy4687 289 EVEKFIAANTQEISTDK-WLCPLSGKKFKGPDFIRKHIFNKHAEKVE 334 (339)
Q Consensus 289 evek~v~~~~~e~~e~K-~rC~~C~KlFk~~eFV~KHI~nKH~e~v~ 334 (339)
++-.|++.| ..|+ |.||.|.|+|...+=...|+..+|+..+.
T Consensus 44 eal~fak~n----~sWrFWiCp~CskkF~d~~~~~~H~~~eH~~~l~ 86 (466)
T PF04780_consen 44 EALSFAKEN----KSWRFWICPRCSKKFSDAESCLSHMEQEHPAGLK 86 (466)
T ss_pred HHHHHHHhc----CceeEeeCCcccceeCCHHHHHHHHHHhhhhhcC
Confidence 444555532 2332 45799999999999999999999998763
No 74
>KOG0131|consensus
Probab=73.76 E-value=5.9 Score=36.73 Aligned_cols=76 Identities=14% Similarity=0.247 Sum_probs=62.9
Q ss_pred cCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 31 LHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 31 lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
.-+..+||+=+|++-||++-|-++|-+.+-+..|-+..=.......--|+|-|....++.=++.-||.+++-+--+
T Consensus 6 rnqd~tiyvgnld~kvs~~~l~EL~iqagpVv~i~iPkDrv~~~~qGygF~Ef~~eedadYAikiln~VkLYgrpI 81 (203)
T KOG0131|consen 6 RNQDATLYVGNLDEKVSEELLYELFIQAGPVVNLHIPKDRVTQKHQGYGFAEFRTEEDADYAIKILNMVKLYGRPI 81 (203)
T ss_pred cCCCceEEEecCCHHHHHHHHHHHHHhcCceeeeecchhhhcccccceeEEEEechhhhHHHHHHHHHHHhcCcee
Confidence 3567899999999999999999999999988888886544444455679999999999998988888877776433
No 75
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=72.83 E-value=1.6 Score=27.26 Aligned_cols=22 Identities=27% Similarity=0.715 Sum_probs=15.8
Q ss_pred HHHHhhhhhccCCeecCCCCccccc
Q psy4687 292 KFIAANTQEISTDKWLCPLSGKKFK 316 (339)
Q Consensus 292 k~v~~~~~e~~e~K~rC~~C~KlFk 316 (339)
..++.++ .+-.|.|+.|+|.|.
T Consensus 4 ~H~~~H~---~~k~~~C~~C~k~F~ 25 (26)
T PF13465_consen 4 RHMRTHT---GEKPYKCPYCGKSFS 25 (26)
T ss_dssp HHHHHHS---SSSSEEESSSSEEES
T ss_pred HHhhhcC---CCCCCCCCCCcCeeC
Confidence 3444444 456699999999996
No 76
>PF04423 Rad50_zn_hook: Rad50 zinc hook motif; InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=72.68 E-value=3.3 Score=30.07 Aligned_cols=44 Identities=25% Similarity=0.535 Sum_probs=26.8
Q ss_pred HHHHHHHhhhhhccCCeecCCCCcccccChhhHHhhhhhcchHHHHHhhc
Q psy4687 289 EVEKFIAANTQEISTDKWLCPLSGKKFKGPDFIRKHIFNKHAEKVEEVKK 338 (339)
Q Consensus 289 evek~v~~~~~e~~e~K~rC~~C~KlFk~~eFV~KHI~nKH~e~v~~v~~ 338 (339)
++.+++.. ..+.+=.||+|+--|-..+. .+|..+..+.++.+.+
T Consensus 8 ~~~k~i~~----l~~~~~~CPlC~r~l~~e~~--~~li~~~~~~i~~~~~ 51 (54)
T PF04423_consen 8 ELKKYIEE----LKEAKGCCPLCGRPLDEEHR--QELIKKYKSEIEELPE 51 (54)
T ss_dssp HHHHHHHH----HTT-SEE-TTT--EE-HHHH--HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHH----HhcCCCcCCCCCCCCCHHHH--HHHHHHHHHHHHhhhh
Confidence 44455553 33333399999999999987 8888888887777654
No 77
>KOG0123|consensus
Probab=68.92 E-value=11 Score=38.22 Aligned_cols=67 Identities=18% Similarity=0.338 Sum_probs=54.5
Q ss_pred EEeeecCCCCCHHHHHHHHhhcCCeeEEEe-cCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 37 IFLRNLAPTITKAEVENLCKRYSGFLRVAI-ADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 37 lfIk~IpP~isr~ele~~ck~~~GF~~lsL-SDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
+|||+++|+|+=..|..+++.++--...=+ .|.+-. +-++|.|++...++++.+.+||.-+.+-.+.
T Consensus 79 ~~i~nl~~~~~~~~~~d~f~~~g~ilS~kv~~~~~g~----kg~FV~f~~e~~a~~ai~~~ng~ll~~kki~ 146 (369)
T KOG0123|consen 79 VFIKNLDESIDNKSLYDTFSEFGNILSCKVATDENGS----KGYFVQFESEESAKKAIEKLNGMLLNGKKIY 146 (369)
T ss_pred eeecCCCcccCcHHHHHHHHhhcCeeEEEEEEcCCCc----eeeEEEeCCHHHHHHHHHHhcCcccCCCeeE
Confidence 999999999999999999999988776655 344443 3349999999999999999998777665543
No 78
>KOG3623|consensus
Probab=68.16 E-value=1.9 Score=47.09 Aligned_cols=31 Identities=29% Similarity=0.444 Sum_probs=26.2
Q ss_pred cCCeecCCCCcccccChhhHHhhhhhcchHH
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIFNKHAEK 332 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~nKH~e~ 332 (339)
.--||.|+.|||-||=.+-++.||+.--+|+
T Consensus 278 ~lRKFKCtECgKAFKfKHHLKEHlRIHSGEK 308 (1007)
T KOG3623|consen 278 LLRKFKCTECGKAFKFKHHLKEHLRIHSGEK 308 (1007)
T ss_pred hhccccccccchhhhhHHHHHhhheeecCCC
Confidence 4569999999999999999999998765553
No 79
>KOG1548|consensus
Probab=66.96 E-value=14 Score=37.29 Aligned_cols=72 Identities=18% Similarity=0.293 Sum_probs=57.7
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCC---CceEE--------EEEEecCcccHHHHHHhhccccc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDR---RWFRR--------GWVTFRRDVDIKEICWNLNNIRL 101 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~k---rf~R~--------GWV~f~~~~~~~~~~~~L~~~~i 101 (339)
..++|+|.++|++||-.|+.++.++.++- +.||+..+ ++||- |=++|=...++.=|+.-|++..+
T Consensus 133 ~Nt~VYVsgLP~DiT~dE~~~~~sKcGiI----~~d~~t~epk~KlYrd~~G~lKGDaLc~y~K~ESVeLA~~ilDe~~~ 208 (382)
T KOG1548|consen 133 VNTSVYVSGLPLDITVDEFAEVMSKCGII----MRDPQTGEPKVKLYRDNQGKLKGDALCCYIKRESVELAIKILDEDEL 208 (382)
T ss_pred cCceEEecCCCCcccHHHHHHHHHhcceE----eccCCCCCeeEEEEecCCCCccCceEEEeecccHHHHHHHHhCcccc
Confidence 45789999999999999999999997754 46887554 77774 77888777777777888998888
Q ss_pred cCccccc
Q psy4687 102 RDCELGA 108 (339)
Q Consensus 102 ~d~el~~ 108 (339)
+++.+..
T Consensus 209 rg~~~rV 215 (382)
T KOG1548|consen 209 RGKKLRV 215 (382)
T ss_pred cCcEEEE
Confidence 8877653
No 80
>KOG0145|consensus
Probab=66.72 E-value=9.6 Score=37.24 Aligned_cols=74 Identities=16% Similarity=0.354 Sum_probs=56.9
Q ss_pred cCCCCeEEeeecCCCCCHHHHHHHHhhcCCee-EEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 31 LHRTSSIFLRNLAPTITKAEVENLCKRYSGFL-RVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 31 lh~t~slfIk~IpP~isr~ele~~ck~~~GF~-~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
-.++.-|++..||-+.+-.|||.+++.++--. .=-|+|-+..- =--+|+|.||.-..+.+|...||+.+-.+|.
T Consensus 124 ~Ik~aNLYvSGlPktMtqkelE~iFs~fGrIItSRiL~dqvtg~-srGVgFiRFDKr~EAe~AIk~lNG~~P~g~t 198 (360)
T KOG0145|consen 124 SIKDANLYVSGLPKTMTQKELEQIFSPFGRIITSRILVDQVTGL-SRGVGFIRFDKRIEAEEAIKGLNGQKPSGCT 198 (360)
T ss_pred hhcccceEEecCCccchHHHHHHHHHHhhhhhhhhhhhhcccce-ecceeEEEecchhHHHHHHHhccCCCCCCCC
Confidence 35778899999999999999999999986522 22344544311 1138999999999999999999998887776
No 81
>COG2871 NqrF Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrF [Energy production and conversion]
Probab=64.79 E-value=6.6 Score=39.04 Aligned_cols=35 Identities=23% Similarity=0.500 Sum_probs=26.3
Q ss_pred cCCCCCHHHHHHHHhhcCCeeE-EEecCCCCCCCce
Q psy4687 42 LAPTITKAEVENLCKRYSGFLR-VAIADPQPDRRWF 76 (339)
Q Consensus 42 IpP~isr~ele~~ck~~~GF~~-lsLSDP~p~krf~ 76 (339)
.--.+=.++.+.+-++.|.|.| |+||||+|+-+|.
T Consensus 316 ~rE~fY~Ed~d~L~ae~pNF~wH~aLSdplpEDnW~ 351 (410)
T COG2871 316 LREMFYQEDFDQLQAENPNFHWHLALSDPLPEDNWD 351 (410)
T ss_pred HHHhHHHHHHHHHHhhCCCcEEEEEecCCCCcCCcc
Confidence 3334445677788888899986 9999999997543
No 82
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=62.84 E-value=4.6 Score=28.87 Aligned_cols=28 Identities=29% Similarity=0.553 Sum_probs=24.3
Q ss_pred CeecCCCCcccccCh-----hhHHhhhhhcchH
Q psy4687 304 DKWLCPLSGKKFKGP-----DFIRKHIFNKHAE 331 (339)
Q Consensus 304 ~K~rC~~C~KlFk~~-----eFV~KHI~nKH~e 331 (339)
.+-.|..|+|.+.+. .=+.+|+.++||+
T Consensus 17 ~~a~C~~C~~~l~~~~~~gTs~L~rHl~~~h~~ 49 (50)
T smart00614 17 QRAKCKYCGKKLSRSSKGGTSNLRRHLRRKHPA 49 (50)
T ss_pred eEEEecCCCCEeeeCCCCCcHHHHHHHHhHCcC
Confidence 578999999999766 5899999999985
No 83
>KOG4212|consensus
Probab=62.79 E-value=19 Score=37.59 Aligned_cols=70 Identities=20% Similarity=0.306 Sum_probs=54.7
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCC-eeEEEec-CCC-CCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSG-FLRVAIA-DPQ-PDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~G-F~~lsLS-DP~-p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
.-++||-+||-++.=++|..++++.-| ..|+-|= |-. ..|. -|-|-|+...+++++++.||...+.+-+|
T Consensus 44 ~R~vfItNIpyd~rWqdLKdLvrekvGev~yveLl~D~~GK~rG---cavVEFk~~E~~qKa~E~lnk~~~~GR~l 116 (608)
T KOG4212|consen 44 DRSVFITNIPYDYRWQDLKDLVREKVGEVEYVELLFDESGKARG---CAVVEFKDPENVQKALEKLNKYEVNGREL 116 (608)
T ss_pred cceEEEecCcchhhhHhHHHHHHHhcCceEeeeeecccCCCcCC---ceEEEeeCHHHHHHHHHHhhhccccCceE
Confidence 347999999999999999999999866 7777763 222 2222 57899999999999999999766655444
No 84
>KOG0113|consensus
Probab=62.03 E-value=23 Score=35.17 Aligned_cols=70 Identities=10% Similarity=0.302 Sum_probs=59.9
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCC---CCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADP---QPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP---~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
.+|||-.|+-+.|-.+|...+.+++--++|-|--= ...|. .|+|.|...-+|+.|....++++|++-.+-
T Consensus 102 ~TLFv~RLnydT~EskLrreF~~YG~IkrirlV~d~vTgkskG---YAFIeye~erdm~~AYK~adG~~Idgrri~ 174 (335)
T KOG0113|consen 102 KTLFVARLNYDTSESKLRREFEKYGPIKRIRLVRDKVTGKSKG---YAFIEYEHERDMKAAYKDADGIKIDGRRIL 174 (335)
T ss_pred ceeeeeeccccccHHHHHHHHHhcCcceeEEEeeecccCCccc---eEEEEeccHHHHHHHHHhccCceecCcEEE
Confidence 58999999999999999999999999999888533 23334 699999999999999999999999885543
No 85
>PF02891 zf-MIZ: MIZ/SP-RING zinc finger; InterPro: IPR004181 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents MIZ-type zinc finger domains. Miz1 (Msx-interacting-zinc finger) is a zinc finger-containing protein with homology to the yeast protein, Nfi-1. Miz1 is a sequence specific DNA binding protein that can function as a positive-acting transcription factor. Miz1 binds to the homeobox protein Msx2, enhancing the specific DNA-binding ability of Msx2 []. Other proteins containing this domain include the human pias family (protein inhibitor of activated STAT protein). More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3I2D_A.
Probab=60.97 E-value=5.7 Score=28.72 Aligned_cols=22 Identities=41% Similarity=1.016 Sum_probs=11.3
Q ss_pred HHHHHHHhhhhhccCCeecCCCCcc
Q psy4687 289 EVEKFIAANTQEISTDKWLCPLSGK 313 (339)
Q Consensus 289 evek~v~~~~~e~~e~K~rC~~C~K 313 (339)
+++.|+..+.. ..+|+||.|+|
T Consensus 28 Dl~~fl~~~~~---~~~W~CPiC~~ 49 (50)
T PF02891_consen 28 DLESFLESNQR---TPKWKCPICNK 49 (50)
T ss_dssp EHHHHHHHHHH---S---B-TTT--
T ss_pred CHHHHHHHhhc---cCCeECcCCcC
Confidence 36778886543 35599999987
No 86
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=60.56 E-value=6.1 Score=24.92 Aligned_cols=19 Identities=21% Similarity=0.509 Sum_probs=16.2
Q ss_pred cCCCCcccccChhhHHhhhh
Q psy4687 307 LCPLSGKKFKGPDFIRKHIF 326 (339)
Q Consensus 307 rC~~C~KlFk~~eFV~KHI~ 326 (339)
.||.|++.+ ....+..|+.
T Consensus 3 ~CPiC~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPVCFREV-PENLINSHLD 21 (26)
T ss_pred cCCCCcCcc-cHHHHHHHHH
Confidence 599999999 6678888886
No 87
>KOG0149|consensus
Probab=57.89 E-value=27 Score=33.52 Aligned_cols=71 Identities=27% Similarity=0.402 Sum_probs=56.6
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCC-eeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhc---cccccCccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSG-FLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLN---NIRLRDCEL 106 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~G-F~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~---~~~i~d~el 106 (339)
+-||+=.+|=...++.|.+.+.+++. ..-+.++|=+..| =--.|+|||...+.+..||..-| +.|.-+|.|
T Consensus 13 TKifVggL~w~T~~~~l~~yFeqfGeI~eavvitd~~t~r-skGyGfVTf~d~~aa~rAc~dp~piIdGR~aNcnl 87 (247)
T KOG0149|consen 13 TKIFVGGLAWETHKETLRRYFEQFGEIVEAVVITDKNTGR-SKGYGFVTFRDAEAATRACKDPNPIIDGRKANCNL 87 (247)
T ss_pred EEEEEcCcccccchHHHHHHHHHhCceEEEEEEeccCCcc-ccceeeEEeecHHHHHHHhcCCCCcccccccccch
Confidence 68999999999999999999999987 5667889987664 23479999999999999996533 334446655
No 88
>PHA00732 hypothetical protein
Probab=56.59 E-value=7.6 Score=30.84 Aligned_cols=25 Identities=20% Similarity=0.394 Sum_probs=21.2
Q ss_pred ecCCCCcccccChhhHHhhhhhcch
Q psy4687 306 WLCPLSGKKFKGPDFIRKHIFNKHA 330 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~KHI~nKH~ 330 (339)
|.|+.|+|.|....=+..|+..-|.
T Consensus 2 y~C~~Cgk~F~s~s~Lk~H~r~~H~ 26 (79)
T PHA00732 2 FKCPICGFTTVTLFALKQHARRNHT 26 (79)
T ss_pred ccCCCCCCccCCHHHHHHHhhcccC
Confidence 6899999999999999999875453
No 89
>KOG0105|consensus
Probab=56.58 E-value=31 Score=32.30 Aligned_cols=71 Identities=18% Similarity=0.371 Sum_probs=55.4
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
.+..|++-|+||+|--.+||.++-+++-..-+-|-.|.-. -=-++|-|...-|+.+|...-++-..++|-|
T Consensus 5 ~~~~iyvGNLP~diRekeieDlFyKyg~i~~ieLK~r~g~---ppfafVeFEd~RDAeDAiygRdGYdydg~rL 75 (241)
T KOG0105|consen 5 NSRRIYVGNLPGDIREKEIEDLFYKYGRIREIELKNRPGP---PPFAFVEFEDPRDAEDAIYGRDGYDYDGCRL 75 (241)
T ss_pred ccceEEecCCCcchhhccHHHHHhhhcceEEEEeccCCCC---CCeeEEEecCccchhhhhhcccccccCcceE
Confidence 4578999999999999999999999998888888654311 1268999999988888766556665566555
No 90
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=54.30 E-value=7.3 Score=28.22 Aligned_cols=26 Identities=15% Similarity=0.526 Sum_probs=20.9
Q ss_pred CCeecCCCCcccccChhhHHhhhhhcch
Q psy4687 303 TDKWLCPLSGKKFKGPDFIRKHIFNKHA 330 (339)
Q Consensus 303 e~K~rC~~C~KlFk~~eFV~KHI~nKH~ 330 (339)
...+.||+|.+.+.. =+..||...|.
T Consensus 29 ~~~v~CPiC~~~~~~--~l~~Hl~~~H~ 54 (54)
T PF05605_consen 29 SKNVVCPICSSRVTD--NLIRHLNSQHR 54 (54)
T ss_pred CCCccCCCchhhhhh--HHHHHHHHhcC
Confidence 446999999998875 56779998884
No 91
>KOG0110|consensus
Probab=52.93 E-value=17 Score=39.70 Aligned_cols=68 Identities=24% Similarity=0.290 Sum_probs=53.9
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCce--EEEEEEecCcccHHHHHHhhccccccC
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWF--RRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~--R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
++-|+||+||=--++.+|..++..++-++-+.| |....+.. +-|||.|-+..++..|+.+|-++.+-+
T Consensus 613 ~tKIlVRNipFeAt~rEVr~LF~aFGqlksvRl--PKK~~k~a~rGF~Fv~f~t~~ea~nA~~al~STHlyG 682 (725)
T KOG0110|consen 613 GTKILVRNIPFEATKREVRKLFTAFGQLKSVRL--PKKIGKGAHRGFGFVDFLTPREAKNAFDALGSTHLYG 682 (725)
T ss_pred cceeeeeccchHHHHHHHHHHHhcccceeeecc--chhhcchhhccceeeeccCcHHHHHHHHhhcccceec
Confidence 579999999999999999999999999987776 44422222 358999999888888887787665544
No 92
>KOG0124|consensus
Probab=52.58 E-value=88 Score=32.13 Aligned_cols=72 Identities=11% Similarity=0.301 Sum_probs=58.0
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
+-..|++-+|-|++|-.+|..++..++--.+-.|.---..+.---.|+|-|.....+.++...+|=..+.+.
T Consensus 209 ~fnRiYVaSvHpDLSe~DiKSVFEAFG~I~~C~LAr~pt~~~HkGyGfiEy~n~qs~~eAiasMNlFDLGGQ 280 (544)
T KOG0124|consen 209 KFNRIYVASVHPDLSETDIKSVFEAFGEIVKCQLARAPTGRGHKGYGFIEYNNLQSQSEAIASMNLFDLGGQ 280 (544)
T ss_pred hhheEEeeecCCCccHHHHHHHHHhhcceeeEEeeccCCCCCccceeeEEeccccchHHHhhhcchhhcccc
Confidence 347899999999999999999999999999988875444444445899999999999988877765544443
No 93
>KOG0145|consensus
Probab=51.39 E-value=35 Score=33.50 Aligned_cols=91 Identities=18% Similarity=0.271 Sum_probs=67.4
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccccccccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELGAIVNRD 113 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~~~v~r~ 113 (339)
+-|.|.-+|-+.+.+||..++..++.....-|- |-...+ =.-.|+|-|-+..|+..|.+.||+.|+..-++..--.||
T Consensus 42 TNLIvNYLPQ~MTqdE~rSLF~SiGeiEScKLvRDKitGq-SLGYGFVNYv~p~DAe~AintlNGLrLQ~KTIKVSyARP 120 (360)
T KOG0145|consen 42 TNLIVNYLPQNMTQDELRSLFGSIGEIESCKLVRDKITGQ-SLGYGFVNYVRPKDAEKAINTLNGLRLQNKTIKVSYARP 120 (360)
T ss_pred ceeeeeecccccCHHHHHHHhhcccceeeeeeeecccccc-ccccceeeecChHHHHHHHhhhcceeeccceEEEEeccC
Confidence 567788999999999999999999876665552 222221 223799999999999999999999998876654434477
Q ss_pred cccccccccCccc
Q psy4687 114 LSRRVRTVSGLTS 126 (339)
Q Consensus 114 ~~~~~r~~~~it~ 126 (339)
.+..||-+|=+.+
T Consensus 121 Ss~~Ik~aNLYvS 133 (360)
T KOG0145|consen 121 SSDSIKDANLYVS 133 (360)
T ss_pred ChhhhcccceEEe
Confidence 7777776664443
No 94
>KOG0123|consensus
Probab=51.01 E-value=33 Score=34.65 Aligned_cols=76 Identities=13% Similarity=0.280 Sum_probs=60.1
Q ss_pred cccCCCCeEEeeecCCCCCHHHHHHHHhhcCCe--eEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 29 RALHRTSSIFLRNLAPTITKAEVENLCKRYSGF--LRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 29 ~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF--~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
...-...+|+|++++..++.+.|.+.+..++-. .+|-.-+..+.+. .|.|.|....+++.|+..+|+..+.+-.+
T Consensus 265 ~~~~~~~nl~vknld~~~~~e~L~~~f~~~GeI~s~kv~~~~~g~skG---~gfV~fs~~eeA~~A~~~~n~~~i~~k~l 341 (369)
T KOG0123|consen 265 SVSLQGANLYVKNLDETLSDEKLRKIFSSFGEITSAKVMVDENGKSKG---FGFVEFSSPEEAKKAMTEMNGRLIGGKPL 341 (369)
T ss_pred cccccccccccccCccccchhHHHHHHhcccceeeEEEEeccCCCccc---eEEEEcCCHHHHHHHHHhhChhhhcCCch
Confidence 334567899999999999999999999998764 4555555666666 69999999999999988888876666444
Q ss_pred c
Q psy4687 107 G 107 (339)
Q Consensus 107 ~ 107 (339)
.
T Consensus 342 ~ 342 (369)
T KOG0123|consen 342 Y 342 (369)
T ss_pred h
Confidence 4
No 95
>PRK12496 hypothetical protein; Provisional
Probab=50.01 E-value=7.1 Score=35.00 Aligned_cols=19 Identities=16% Similarity=0.277 Sum_probs=15.6
Q ss_pred hhhccCCeecCCCCccccc
Q psy4687 298 TQEISTDKWLCPLSGKKFK 316 (339)
Q Consensus 298 ~~e~~e~K~rC~~C~KlFk 316 (339)
+.+.-.|+|+|+.|+|.|.
T Consensus 120 i~~~~~w~~~C~gC~~~~~ 138 (164)
T PRK12496 120 IKKVIKWRKVCKGCKKKYP 138 (164)
T ss_pred chhheeeeEECCCCCcccc
Confidence 3355689999999999995
No 96
>KOG0146|consensus
Probab=47.96 E-value=42 Score=33.06 Aligned_cols=83 Identities=14% Similarity=0.253 Sum_probs=56.8
Q ss_pred CcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEE-ecC--CCCCCCceEEEEEEecCcccHHHHHHhhccccccCc
Q psy4687 28 PRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVA-IAD--PQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDC 104 (339)
Q Consensus 28 ~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~ls-LSD--P~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~ 104 (339)
-|.=+.++-|||=++|-..+-.||..++--++-...-- .-| -|.+|= -|+|.||.-.+...+..++|+..|.--
T Consensus 279 qreGPeGCNlFIYHLPQEFgDaEliQmF~PFGhivSaKVFvDRATNQSKC---FGFVSfDNp~SaQaAIqAMNGFQIGMK 355 (371)
T KOG0146|consen 279 QREGPEGCNLFIYHLPQEFGDAELIQMFLPFGHIVSAKVFVDRATNQSKC---FGFVSFDNPASAQAAIQAMNGFQIGMK 355 (371)
T ss_pred hhcCCCcceEEEEeCchhhccHHHHHHhccccceeeeeeeehhccccccc---eeeEecCCchhHHHHHHHhcchhhhhh
Confidence 34446789999999999999999999887765432211 111 344444 399999998888877788888776544
Q ss_pred ccccccccc
Q psy4687 105 ELGAIVNRD 113 (339)
Q Consensus 105 el~~~v~r~ 113 (339)
.|..+..||
T Consensus 356 RLKVQLKRP 364 (371)
T KOG0146|consen 356 RLKVQLKRP 364 (371)
T ss_pred hhhhhhcCc
Confidence 444444443
No 97
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=47.94 E-value=7.5 Score=32.77 Aligned_cols=14 Identities=43% Similarity=0.729 Sum_probs=11.8
Q ss_pred CCeecCCCCccccc
Q psy4687 303 TDKWLCPLSGKKFK 316 (339)
Q Consensus 303 e~K~rC~~C~KlFk 316 (339)
+-|..|+.||++|=
T Consensus 7 GtKR~Cp~CG~kFY 20 (108)
T PF09538_consen 7 GTKRTCPSCGAKFY 20 (108)
T ss_pred CCcccCCCCcchhc
Confidence 56889999999993
No 98
>smart00719 Plus3 Short conserved domain in transcriptional regulators. Plus3 domains occur in the Saccharomyces cerevisiae Rtf1p protein, which interacts with Spt6p, and in parsley CIP, which interacts with the bZIP protein CPRF1.
Probab=47.33 E-value=37 Score=28.23 Aligned_cols=36 Identities=33% Similarity=0.451 Sum_probs=27.1
Q ss_pred CCHHHHHHHHhhc-------CCeeEEEecCCCCCC-CceEEEEEE
Q psy4687 46 ITKAEVENLCKRY-------SGFLRVAIADPQPDR-RWFRRGWVT 82 (339)
Q Consensus 46 isr~ele~~ck~~-------~GF~~lsLSDP~p~k-rf~R~GWV~ 82 (339)
++|..|+.+|..- +=|.|+.++ |++.. .-||++-|+
T Consensus 12 L~R~~l~~~~~~p~Fe~~v~GcFVRv~ig-~~~~~~~~Yrl~qI~ 55 (109)
T smart00719 12 LRRSLVEELLKPPTFESKVVGCFVRVKIG-PNDQKQPIYRLVQVT 55 (109)
T ss_pred EEHHHHHHHhcCCchhHhceEeEEEEEEC-CCCCCCCeEEEEEEe
Confidence 6899999999941 238899998 33333 789999886
No 99
>KOG0110|consensus
Probab=46.51 E-value=33 Score=37.54 Aligned_cols=72 Identities=21% Similarity=0.378 Sum_probs=58.7
Q ss_pred eEEeeecCCCCCHHHHHHHHhhcCCeeEEEecC-CCCCCCce--EEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 36 SIFLRNLAPTITKAEVENLCKRYSGFLRVAIAD-PQPDRRWF--RRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 36 slfIk~IpP~isr~ele~~ck~~~GF~~lsLSD-P~p~krf~--R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
.||++|++-+-+-++++..++..+.++.+.|+= +.|.++|- =.|+|-|+...++..++.+||++.|++..+.
T Consensus 517 ~lfvkNlnf~Tt~e~l~~~F~k~G~VlS~~I~kkkd~~~k~lSmGfgFVEF~~~e~A~~a~k~lqgtvldGH~l~ 591 (725)
T KOG0110|consen 517 KLFVKNLNFDTTLEDLEDLFSKQGTVLSIEISKKKDPANKYLSMGFGFVEFAKPESAQAALKALQGTVLDGHKLE 591 (725)
T ss_pred hhhhhcCCcccchhHHHHHHHhcCeEEEEEEeccccccccccccceeEEEecCHHHHHHHHHHhcCceecCceEE
Confidence 399999999999999999999988888888863 33322222 2588999999999999999999999886654
No 100
>PF00403 HMA: Heavy-metal-associated domain; InterPro: IPR006121 Proteins that transport heavy metals in micro-organisms and mammals share similarities in their sequences and structures. These proteins provide an important focus for research, some being involved in bacterial resistance to toxic metals, such as lead and cadmium, while others are involved in inherited human syndromes, such as Wilson's and Menke's diseases []. A conserved domain has been found in a number of these heavy metal transport or detoxification proteins []. The domain, which has been termed Heavy-Metal-Associated (HMA), contains two conserved cysteines that are probably involved in metal binding. Structure solution of the fourth HMA domain of the Menke's copper transporting ATPase shows a well-defined structure comprising a four-stranded antiparallel beta-sheet and two alpha helices packed in an alpha-beta sandwich fold []. This fold is common to other domains and is classified as "ferredoxin-like".; GO: 0046872 metal ion binding, 0030001 metal ion transport; PDB: 2VOY_A 1P6T_A 1KQK_A 2RML_A 1JWW_A 3K7R_F 1FES_A 1CC8_A 1FD8_A 2GGP_A ....
Probab=45.91 E-value=85 Score=22.44 Aligned_cols=56 Identities=16% Similarity=0.162 Sum_probs=41.7
Q ss_pred EEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcc-cHHHHHHhhcc
Q psy4687 37 IFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDV-DIKEICWNLNN 98 (339)
Q Consensus 37 lfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~-~~~~~~~~L~~ 98 (339)
+.|.++...=-...|++.++++||...+.+.=+. ....|.|+++. ++.++...|..
T Consensus 2 ~~v~~m~C~~C~~~v~~~l~~~~GV~~v~vd~~~------~~v~v~~~~~~~~~~~i~~~i~~ 58 (62)
T PF00403_consen 2 FKVPGMTCEGCAKKVEKALSKLPGVKSVKVDLET------KTVTVTYDPDKTSIEKIIEAIEK 58 (62)
T ss_dssp EEEESTTSHHHHHHHHHHHHTSTTEEEEEEETTT------TEEEEEESTTTSCHHHHHHHHHH
T ss_pred EEECCcccHHHHHHHHHHHhcCCCCcEEEEECCC------CEEEEEEecCCCCHHHHHHHHHH
Confidence 4555566666678899999999999999886322 37899999885 66777666654
No 101
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=45.54 E-value=13 Score=32.52 Aligned_cols=31 Identities=35% Similarity=0.568 Sum_probs=20.3
Q ss_pred cCCeecCCCCcccccChhhHHhhhhhcchHHHHH
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIFNKHAEKVEE 335 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~nKH~e~v~~ 335 (339)
.++.--|-+|||.||.- .+|+...|+=--++
T Consensus 69 ~~d~i~clecGk~~k~L---krHL~~~~gltp~e 99 (132)
T PF05443_consen 69 TPDYIICLECGKKFKTL---KRHLRTHHGLTPEE 99 (132)
T ss_dssp -SS-EE-TBT--EESBH---HHHHHHTT-S-HHH
T ss_pred ccCeeEEccCCcccchH---HHHHHHccCCCHHH
Confidence 57889999999999987 99999998654444
No 102
>KOG0131|consensus
Probab=44.26 E-value=49 Score=30.82 Aligned_cols=88 Identities=16% Similarity=0.316 Sum_probs=54.3
Q ss_pred hhHHHhhhccCCccCCCCCCCCCCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeE--EEecCCC--CCCCceEEE
Q psy4687 4 LLNLMMLNSYNNIINGGESKNEPPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLR--VAIADPQ--PDRRWFRRG 79 (339)
Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~--lsLSDP~--p~krf~R~G 79 (339)
+|| |.--|+-||-=.-+.. ....+--+.-|||+|+.|.|-+.-|-..++.++--.. =-+-||. ..+. -|
T Consensus 70 iln--~VkLYgrpIrv~kas~--~~~nl~vganlfvgNLd~~vDe~~L~dtFsafG~l~~~P~i~rd~~tg~~~~---~g 142 (203)
T KOG0131|consen 70 ILN--MVKLYGRPIRVNKASA--HQKNLDVGANLFVGNLDPEVDEKLLYDTFSAFGVLISPPKIMRDPDTGNPKG---FG 142 (203)
T ss_pred HHH--HHHhcCceeEEEeccc--ccccccccccccccccCcchhHHHHHHHHHhccccccCCcccccccCCCCCC---Ce
Confidence 456 6667888774322221 1223344588999999999999999999998753221 0111222 3333 47
Q ss_pred EEEecCcccHHHHHHhhcc
Q psy4687 80 WVTFRRDVDIKEICWNLNN 98 (339)
Q Consensus 80 WV~f~~~~~~~~~~~~L~~ 98 (339)
.|.|..-+....|...+|+
T Consensus 143 ~i~~~sfeasd~ai~s~ng 161 (203)
T KOG0131|consen 143 FINYASFEASDAAIGSMNG 161 (203)
T ss_pred EEechhHHHHHHHHHHhcc
Confidence 8888877766666666665
No 103
>KOG0125|consensus
Probab=43.86 E-value=1e+02 Score=31.23 Aligned_cols=70 Identities=14% Similarity=0.240 Sum_probs=56.7
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCEL 106 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el 106 (339)
..|.+.|||=.---.+|..++.+++-..-|-|- -.+|.=---|+|||+...|++.+-.+|++.-|++=..
T Consensus 97 kRLhVSNIPFrFRdpDL~aMF~kfG~VldVEII--fNERGSKGFGFVTmen~~dadRARa~LHgt~VEGRkI 166 (376)
T KOG0125|consen 97 KRLHVSNIPFRFRDPDLRAMFEKFGKVLDVEII--FNERGSKGFGFVTMENPADADRARAELHGTVVEGRKI 166 (376)
T ss_pred ceeEeecCCccccCccHHHHHHhhCceeeEEEE--eccCCCCccceEEecChhhHHHHHHHhhcceeeceEE
Confidence 578999999999999999999999988877763 2333222349999999999999999999988877543
No 104
>KOG2591|consensus
Probab=43.51 E-value=56 Score=35.15 Aligned_cols=66 Identities=24% Similarity=0.407 Sum_probs=51.9
Q ss_pred CCCCcccCCCCeEEeeecCCCCCHHHHHHHHhh--cCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhc
Q psy4687 25 EPPPRALHRTSSIFLRNLAPTITKAEVENLCKR--YSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLN 97 (339)
Q Consensus 25 ~~~~~~lh~t~slfIk~IpP~isr~ele~~ck~--~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~ 97 (339)
-.|.|+.||-+.+.||.||-+-.-++|..+++. .|-|+.+-.+ ..+ +| +|||..++|+.+|...|.
T Consensus 166 gekVrp~~kRcIvilREIpettp~e~Vk~lf~~encPk~iscefa--~N~-nW----yITfesd~DAQqAykylr 233 (684)
T KOG2591|consen 166 GEKVRPNHKRCIVILREIPETTPIEVVKALFKGENCPKVISCEFA--HND-NW----YITFESDTDAQQAYKYLR 233 (684)
T ss_pred ccccccCcceeEEEEeecCCCChHHHHHHHhccCCCCCceeeeee--ecC-ce----EEEeecchhHHHHHHHHH
Confidence 358999999999999999999888888888887 3667666655 221 23 499999999988876664
No 105
>KOG2462|consensus
Probab=41.62 E-value=11 Score=36.85 Aligned_cols=29 Identities=31% Similarity=0.488 Sum_probs=23.8
Q ss_pred CeecCCCCcccccChhhHHhhhhhcchHH
Q psy4687 304 DKWLCPLSGKKFKGPDFIRKHIFNKHAEK 332 (339)
Q Consensus 304 ~K~rC~~C~KlFk~~eFV~KHI~nKH~e~ 332 (339)
--+.|.+|||.|.-|=++..||++-=+|+
T Consensus 186 l~c~C~iCGKaFSRPWLLQGHiRTHTGEK 214 (279)
T KOG2462|consen 186 LPCECGICGKAFSRPWLLQGHIRTHTGEK 214 (279)
T ss_pred CCcccccccccccchHHhhcccccccCCC
Confidence 34668999999999999999998765553
No 106
>KOG0153|consensus
Probab=41.61 E-value=55 Score=33.16 Aligned_cols=78 Identities=13% Similarity=0.258 Sum_probs=58.7
Q ss_pred CCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHh-hccccccCc
Q psy4687 26 PPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWN-LNNIRLRDC 104 (339)
Q Consensus 26 ~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~-L~~~~i~d~ 104 (339)
-+|----.-.+|+|+.+-+.|+-++|.+++-++++-..+++. |.. +-++|+|.+-..+.-|... +|..-|.+|
T Consensus 220 lepPeD~~I~tLyIg~l~d~v~e~dIrdhFyqyGeirsi~~~-~~~-----~CAFv~ftTR~aAE~Aae~~~n~lvI~G~ 293 (377)
T KOG0153|consen 220 LEPPEDTSIKTLYIGGLNDEVLEQDIRDHFYQYGEIRSIRIL-PRK-----GCAFVTFTTREAAEKAAEKSFNKLVINGF 293 (377)
T ss_pred cCCCcccceeEEEecccccchhHHHHHHHHhhcCCeeeEEee-ccc-----ccceeeehhhHHHHHHHHhhcceeeecce
Confidence 344444566899999999999999999999999998888876 222 2789999888766655544 566666777
Q ss_pred ccccc
Q psy4687 105 ELGAI 109 (339)
Q Consensus 105 el~~~ 109 (339)
.+..+
T Consensus 294 Rl~i~ 298 (377)
T KOG0153|consen 294 RLKIK 298 (377)
T ss_pred EEEEE
Confidence 76543
No 107
>PHA00733 hypothetical protein
Probab=40.68 E-value=25 Score=30.17 Aligned_cols=26 Identities=23% Similarity=0.492 Sum_probs=23.2
Q ss_pred cCCeecCCCCcccccChhhHHhhhhh
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIFN 327 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~n 327 (339)
....|.|+.|+|.|....-+..|+..
T Consensus 70 ~~kPy~C~~Cgk~Fss~s~L~~H~r~ 95 (128)
T PHA00733 70 AVSPYVCPLCLMPFSSSVSLKQHIRY 95 (128)
T ss_pred CCCCccCCCCCCcCCCHHHHHHHHhc
Confidence 56779999999999999999999874
No 108
>KOG2462|consensus
Probab=39.70 E-value=14 Score=36.08 Aligned_cols=36 Identities=22% Similarity=0.479 Sum_probs=26.0
Q ss_pred HHHHHHHhhhhhccCCeecCCCCcccccChhhHHhhhhh
Q psy4687 289 EVEKFIAANTQEISTDKWLCPLSGKKFKGPDFIRKHIFN 327 (339)
Q Consensus 289 evek~v~~~~~e~~e~K~rC~~C~KlFk~~eFV~KHI~n 327 (339)
.|-+.++-+. +.-||.|+-|+|-|-=..|+.||...
T Consensus 230 NLRAHmQTHS---~~K~~qC~~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 230 NLRAHMQTHS---DVKKHQCPRCGKSFALKSYLNKHSES 265 (279)
T ss_pred HHHHHHHhhc---CCccccCcchhhHHHHHHHHHHhhhh
Confidence 3444455433 34499999999999988888888753
No 109
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=38.36 E-value=13 Score=32.36 Aligned_cols=15 Identities=40% Similarity=0.747 Sum_probs=11.3
Q ss_pred cCCeecCCCCccccc
Q psy4687 302 STDKWLCPLSGKKFK 316 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk 316 (339)
=+-|+.||.|||+|=
T Consensus 6 lGtKr~Cp~cg~kFY 20 (129)
T TIGR02300 6 LGTKRICPNTGSKFY 20 (129)
T ss_pred hCccccCCCcCcccc
Confidence 356778888888883
No 110
>KOG0132|consensus
Probab=37.76 E-value=70 Score=35.65 Aligned_cols=63 Identities=14% Similarity=0.410 Sum_probs=51.4
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
++|||..|+-+|+-++|-.++.+++-..+|-|--|. +-|+|+-..--++..|+.+|+|.++-+
T Consensus 422 rTLwvG~i~k~v~e~dL~~~feefGeiqSi~li~~R------~cAfI~M~~RqdA~kalqkl~n~kv~~ 484 (894)
T KOG0132|consen 422 RTLWVGGIPKNVTEQDLANLFEEFGEIQSIILIPPR------GCAFIKMVRRQDAEKALQKLSNVKVAD 484 (894)
T ss_pred eeeeeccccchhhHHHHHHHHHhcccceeEeeccCC------ceeEEEEeehhHHHHHHHHHhcccccc
Confidence 689999999999999999999999999999987442 478886655556777778888765544
No 111
>KOG0105|consensus
Probab=37.60 E-value=1.4e+02 Score=28.17 Aligned_cols=72 Identities=21% Similarity=0.325 Sum_probs=58.2
Q ss_pred CCCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 25 EPPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 25 ~~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
..-|-.-|....+++..+||+-|=|+|..+..+.+- |..+|-...- .|-|-|-+-.||+=|+-+|++.+.+-
T Consensus 106 ~rgppsrrSe~RVvVsGLp~SgSWQDLKDHmReaGd---vCfadv~rDg----~GvV~~~r~eDMkYAvr~ld~~~~~s 177 (241)
T KOG0105|consen 106 RRGPPSRRSEYRVVVSGLPPSGSWQDLKDHMREAGD---VCFADVQRDG----VGVVEYLRKEDMKYAVRKLDDQKFRS 177 (241)
T ss_pred ccCCcccccceeEEEecCCCCCchHHHHHHHHhhCC---eeeeeeeccc----ceeeeeeehhhHHHHHHhhccccccC
Confidence 455667788899999999999999999999999876 3345544442 89999999999999998898766553
No 112
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=36.75 E-value=19 Score=26.83 Aligned_cols=31 Identities=26% Similarity=0.504 Sum_probs=23.4
Q ss_pred cCCeecCCCCcccccChhhHHhhhhhcchHH
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIFNKHAEK 332 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~nKH~e~ 332 (339)
.+.-.-||+|+--+.-+.=++-|+...|.-+
T Consensus 21 S~~PatCP~C~a~~~~srnLrRHle~~H~~k 51 (54)
T PF09237_consen 21 SEQPATCPICGAVIRQSRNLRRHLEIRHFKK 51 (54)
T ss_dssp TS--EE-TTT--EESSHHHHHHHHHHHTTTS
T ss_pred cCCCCCCCcchhhccchhhHHHHHHHHhccc
Confidence 5777889999999999999999999999754
No 113
>PF01927 Mut7-C: Mut7-C RNAse domain; InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=35.72 E-value=12 Score=32.63 Aligned_cols=26 Identities=27% Similarity=0.547 Sum_probs=18.8
Q ss_pred HHHHHHhhhhhccCCeecCCCCcccc
Q psy4687 290 VEKFIAANTQEISTDKWLCPLSGKKF 315 (339)
Q Consensus 290 vek~v~~~~~e~~e~K~rC~~C~KlF 315 (339)
+..-|-..+.+..+.=|+|+.|+|.|
T Consensus 109 v~~~vp~~v~~~~~~f~~C~~C~kiy 134 (147)
T PF01927_consen 109 VKDRVPPYVYETYDEFWRCPGCGKIY 134 (147)
T ss_pred cccccCccccccCCeEEECCCCCCEe
Confidence 33335555556667799999999998
No 114
>PF01709 Transcrip_reg: Transcriptional regulator; InterPro: IPR002876 This entry represents the core region of several hypothetical proteins found in bacteria, plants, and yeast proteins. This core region can be subdivided into three domains: a 3-helical bundle domain, and two alpha+beta domains with different folds, where domain 3 (ferredoxin-like fold) is inserted within domain 2. This core region is found in the following hypothetical proteins: YebC from Escherichia coli, HP0162 from Helicobacter pylori (Campylobacter pylori) and aq1575 from Aquifex aeolicus []. The crystal structure of a conserved hypothetical protein, Aq1575, from Aquifex aeolicus has been determined. A structural homology search reveals that this protein has a new fold with no obvious similarity to those of other proteins of known three-dimensional structure. The protein reveals a monomer consisting of three domains arranged along a pseudo threefold symmetry axis. There is a large cleft with approximate dimensions of 10 A x 10 A x 20 A in the centre of the three domains along the symmetry axis. Two possible active sites are suggested based on the structure and multiple sequence alignment. There are several highly conserved residues in these putative active sites [].; PDB: 1LFP_A 1MW7_A 1KON_A.
Probab=35.44 E-value=69 Score=30.32 Aligned_cols=59 Identities=15% Similarity=0.303 Sum_probs=43.3
Q ss_pred CCCCeEEeeecCCCCCH--HHHHHHHhhcCCeeEEEecCCC-CCCCceEEEEEEec-CcccHHHHHH
Q psy4687 32 HRTSSIFLRNLAPTITK--AEVENLCKRYSGFLRVAIADPQ-PDRRWFRRGWVTFR-RDVDIKEICW 94 (339)
Q Consensus 32 h~t~slfIk~IpP~isr--~ele~~ck~~~GF~~lsLSDP~-p~krf~R~GWV~f~-~~~~~~~~~~ 94 (339)
+.+..++|..+..+..| ++|-.+|++++| +|++|- ...-|.|.|.|.|. ...+..+++.
T Consensus 88 P~Gvaiive~lTDN~nRt~~~ir~~~~K~gg----~l~~~gsv~~~F~~kG~i~~~~~~~~~d~~~e 150 (234)
T PF01709_consen 88 PGGVAIIVECLTDNKNRTVSDIRSIFKKNGG----SLGPSGSVSFMFERKGVIEVSKKDLDEDELME 150 (234)
T ss_dssp TTTEEEEEEEEES-HHHHHHHHHHHHHTTT-----EEE-TTSSGGGEEEEEEEEEEHCCS-HHHHHH
T ss_pred CCCcEEEEEEeCCCHhHHHHHHHHHHHHcCc----eeCCCCcceeeeeeeEEEEEEeCCCChHHHHH
Confidence 45688899998888766 689999999999 677765 33479999999999 5566666654
No 115
>KOG3576|consensus
Probab=34.29 E-value=17 Score=34.38 Aligned_cols=25 Identities=28% Similarity=0.627 Sum_probs=20.3
Q ss_pred cCCeecCCCCcccccChhhHHhhhh
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIF 326 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~ 326 (339)
+.++|-|.+|+|.|.=.-.+-.|++
T Consensus 114 d~d~ftCrvCgK~F~lQRmlnrh~k 138 (267)
T KOG3576|consen 114 DQDSFTCRVCGKKFGLQRMLNRHLK 138 (267)
T ss_pred CCCeeeeehhhhhhhHHHHHHHHhh
Confidence 6788889999999888777777764
No 116
>KOG0144|consensus
Probab=33.02 E-value=45 Score=34.79 Aligned_cols=90 Identities=18% Similarity=0.253 Sum_probs=59.7
Q ss_pred cCCccCCCCCCCCCCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEe-cCCC-CCCCceEEEEEEecCcccHH
Q psy4687 13 YNNIINGGESKNEPPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAI-ADPQ-PDRRWFRRGWVTFRRDVDIK 90 (339)
Q Consensus 13 ~~~~~~~~~~~~~~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsL-SDP~-p~krf~R~GWV~f~~~~~~~ 90 (339)
-+-||-=-+-|+|. +-+....-|||..++-.++..||+++|.+++-..-++| -||. ..|. -|+|+|..-+-+-
T Consensus 105 ~~~pvqvk~Ad~E~--er~~~e~KLFvg~lsK~~te~evr~iFs~fG~Ied~~ilrd~~~~sRG---caFV~fstke~A~ 179 (510)
T KOG0144|consen 105 MHHPVQVKYADGER--ERIVEERKLFVGMLSKQCTENEVREIFSRFGHIEDCYILRDPDGLSRG---CAFVKFSTKEMAV 179 (510)
T ss_pred CCcceeecccchhh--hccccchhhhhhhccccccHHHHHHHHHhhCccchhhheecccccccc---eeEEEEehHHHHH
Confidence 33444444445532 22355677999999999999999999999976544433 2343 3333 5899998876444
Q ss_pred HHHHhhc-cccccCcccc
Q psy4687 91 EICWNLN-NIRLRDCELG 107 (339)
Q Consensus 91 ~~~~~L~-~~~i~d~el~ 107 (339)
.|+..|| ..++++|.--
T Consensus 180 ~Aika~ng~~tmeGcs~P 197 (510)
T KOG0144|consen 180 AAIKALNGTQTMEGCSQP 197 (510)
T ss_pred HHHHhhccceeeccCCCc
Confidence 4556777 4677888754
No 117
>TIGR01033 DNA-binding regulatory protein, YebC/PmpR family. This model describes a minimally characterized protein family, restricted to bacteria excepting for some eukaryotic sequences that have possible transit peptides. YebC from E. coli is crystallized, and PA0964 from Pseudomonas aeruginosa has been shown to be a sequence-specific DNA-binding regulatory protein.
Probab=32.95 E-value=1.3e+02 Score=28.62 Aligned_cols=59 Identities=17% Similarity=0.337 Sum_probs=44.4
Q ss_pred CCCCeEEeeecCCCCCH--HHHHHHHhhcCCeeEEEecCCCCCC-CceEEEEEEecCc-ccHHHHHH
Q psy4687 32 HRTSSIFLRNLAPTITK--AEVENLCKRYSGFLRVAIADPQPDR-RWFRRGWVTFRRD-VDIKEICW 94 (339)
Q Consensus 32 h~t~slfIk~IpP~isr--~ele~~ck~~~GF~~lsLSDP~p~k-rf~R~GWV~f~~~-~~~~~~~~ 94 (339)
+.+..++|-.+..+..| ++|..++++++| +|++|-.-+ -|-|.|.|.|... .+..+++.
T Consensus 92 P~GvaiiVe~lTDN~nRt~~~ir~~f~K~gg----~l~~~gsv~~~Fe~kG~i~~~~~~~~~d~~~e 154 (238)
T TIGR01033 92 PGGVAIIVECLTDNKNRTASEVRSAFNKNGG----SLGEPGSVSYLFSRKGVIEVPKNEVDEEDLME 154 (238)
T ss_pred CCceEEEEEEecCCHHhHHHHHHHHHHHcCC----eeCCCCceeeeeecceEEEECCCCCCHHHHHH
Confidence 56688999999988876 689999999999 566554433 7999999999854 34445543
No 118
>KOG0144|consensus
Probab=31.40 E-value=1.3e+02 Score=31.47 Aligned_cols=74 Identities=12% Similarity=0.254 Sum_probs=56.3
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEec-CCC--CCCCceEEEEEEecCcccHHHHHHhhccccccCcccccc
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIA-DPQ--PDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELGAI 109 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLS-DP~--p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~~~ 109 (339)
.+--||+-.||-+-|-.+|.+++.+|+.+.-+.|- |=. ..|. -++|+|....+..++..+|.|++.=-+..+++
T Consensus 33 ~~vKlfVgqIprt~sE~dlr~lFe~yg~V~einl~kDk~t~~s~g---cCFv~~~trk~a~~a~~Alhn~ktlpG~~~pv 109 (510)
T KOG0144|consen 33 SAVKLFVGQIPRTASEKDLRELFEKYGNVYEINLIKDKSTGQSKG---CCFVKYYTRKEADEAINALHNQKTLPGMHHPV 109 (510)
T ss_pred hhhhheeccCCccccHHHHHHHHHHhCceeEEEeecccccCcccc---eEEEEeccHHHHHHHHHHhhcccccCCCCcce
Confidence 34568999999999999999999999998777762 222 2222 47999999998889989998876644444443
No 119
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=30.74 E-value=23 Score=23.63 Aligned_cols=14 Identities=21% Similarity=0.479 Sum_probs=11.0
Q ss_pred eecCCCCcccccCh
Q psy4687 305 KWLCPLSGKKFKGP 318 (339)
Q Consensus 305 K~rC~~C~KlFk~~ 318 (339)
-|+|+.|++.|...
T Consensus 5 ~y~C~~Cg~~fe~~ 18 (41)
T smart00834 5 EYRCEDCGHTFEVL 18 (41)
T ss_pred EEEcCCCCCEEEEE
Confidence 37999999988643
No 120
>KOG1190|consensus
Probab=30.06 E-value=56 Score=33.88 Aligned_cols=47 Identities=23% Similarity=0.434 Sum_probs=36.9
Q ss_pred CccCCCCCCC---CCCCcccCCC---------------CeEEeeecCCCCCHHHHHHHHhhcCCe
Q psy4687 15 NIINGGESKN---EPPPRALHRT---------------SSIFLRNLAPTITKAEVENLCKRYSGF 61 (339)
Q Consensus 15 ~~~~~~~~~~---~~~~~~lh~t---------------~slfIk~IpP~isr~ele~~ck~~~GF 61 (339)
-|-+|+++.| ++-+-+||+- .+|.+-+|||+||-+++-+++.+.+|-
T Consensus 377 lp~egq~d~glT~dy~~spLhrfkkpgsKN~~ni~PpsatlHlsnip~svsee~lk~~f~~~g~~ 441 (492)
T KOG1190|consen 377 LPREGQEDQGLTKDYGNSPLHRFKKPGSKNYQNIFPPSATLHLSNIPPSVSEEDLKNLFQEPGGQ 441 (492)
T ss_pred CCCCCCccccccccCCCCchhhccCcccccccccCCchhheeeccCCcccchhHHHHhhhcCCce
Confidence 3567777755 5566666654 489999999999999999999997664
No 121
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=30.05 E-value=29 Score=23.02 Aligned_cols=16 Identities=19% Similarity=0.287 Sum_probs=13.1
Q ss_pred cCCeecCCCCcccccC
Q psy4687 302 STDKWLCPLSGKKFKG 317 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~ 317 (339)
...+.+|+.|+..|.+
T Consensus 22 ~~~~v~C~~C~~~~~~ 37 (38)
T TIGR02098 22 NGGKVRCGKCGHVWYA 37 (38)
T ss_pred CCCEEECCCCCCEEEe
Confidence 3458999999999975
No 122
>PF03126 Plus-3: Plus-3 domain; InterPro: IPR004343 The yeast Paf1 complex consists of Pfa1, Rtf1, Cdc73, Ctr9, and Leo1. The complex regulates histone H2B ubiquitination, histone H3 methylation, RNA polymerase II carboxy-terminal domain (CTD) Ser2 phosphorylation, and RNA 3' end processing. The conservation of Paf1 complex function in higher eukaryotes has been confirmed in human cells, Drosophila and Arabidopsis. The Plus3 domain spans the most conserved regions of the Rtf1 protein and is surrounded by regions of low complexity and coiled-coil propensity []. It contains only a limited number of highly conserved amino acids, among which are three positively charged residues that gave the Plus3 domain its name. The capacity to bind single-stranded DNA is at least one function of the Plus3 domain []. The plus-3 domain is about 90 residues in length and is often found associated with the GYF domain (IPR003169 from INTERPRO). The Plus3 domain structure consists of six alpha helices intervened by a sequence of six beta strands in a mixed alpha/beta topology. Beta strands 1, 2, 5, and 6 compose a four-stranded antiparallel beta sheet with a beta-hairpin insertion formed by strands 3 and 4. The N-terminal helices alpha1-alpha3 and C-terminal helix alpha6 pack together to form an alpha subdomain, while the beta strands and the small 3(10) helix alpha 4 form a beta subdomain. The two subdomains pack together to form a compact, globular protein [].; GO: 0003677 DNA binding, 0006352 transcription initiation, DNA-dependent, 0016570 histone modification, 0005634 nucleus; PDB: 2BZE_A 3U1U_B 2DB9_A.
Probab=29.76 E-value=75 Score=26.08 Aligned_cols=38 Identities=16% Similarity=0.272 Sum_probs=26.6
Q ss_pred CCCHHHHHHHHhhc-------CCeeEEEecCCCCCCC-ceEEEEEE
Q psy4687 45 TITKAEVENLCKRY-------SGFLRVAIADPQPDRR-WFRRGWVT 82 (339)
Q Consensus 45 ~isr~ele~~ck~~-------~GF~~lsLSDP~p~kr-f~R~GWV~ 82 (339)
-++|..|+++|..- +-|.||.++......+ -||++-|+
T Consensus 7 ~l~Rs~l~~~~~~P~Fe~~v~G~fVRv~ig~~~~~~~~~Yrl~qI~ 52 (108)
T PF03126_consen 7 RLSRSQLEKWLFKPFFEDAVVGCFVRVNIGPNDYTQNPVYRLCQIV 52 (108)
T ss_dssp BE-HHHHHHHTTSTTHHHHHTT-EEEEEEETTSSS-E--EEEEEEE
T ss_pred EEEHHHHHHHHcCCChhHheeEEEEEEEecCCCCCCceeEEEEEEE
Confidence 37899999999873 5599999995443333 59999885
No 123
>TIGR03636 L23_arch archaeal ribosomal protein L23. This model describes the archaeal ribosomal protein L23P and rigorously excludes the bacterial counterpart L23. In order to capture every known instance of archaeal L23P, the trusted cutoff is set lower than a few of the highest scoring eukaryotic cytosolic ribosomal counterparts.
Probab=29.20 E-value=1.8e+02 Score=23.06 Aligned_cols=55 Identities=13% Similarity=0.153 Sum_probs=41.1
Q ss_pred ecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhh
Q psy4687 41 NLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNL 96 (339)
Q Consensus 41 ~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L 96 (339)
.+++..+|.+|.+.+.++=|..=.++--=+. +.-+..++|++.++.++.++...|
T Consensus 20 ~V~~~anK~eIK~avE~lf~VkV~~Vnt~~~-~~~~KKA~VtL~~g~~a~~va~k~ 74 (77)
T TIGR03636 20 IVDRKATKGDIKRAVEKLFDVKVEKVNTLIT-PRGEKKAYVKLAEEYAAEEIASRL 74 (77)
T ss_pred EECCCCCHHHHHHHHHHHhCCceEEEEeEEc-CCCceEEEEEECCCCcHHHHHHhh
Confidence 4789999999999999987765555432122 245689999999999888876554
No 124
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=28.58 E-value=22 Score=31.48 Aligned_cols=31 Identities=29% Similarity=0.502 Sum_probs=24.6
Q ss_pred cCCeecCCCCcccccChhhHHhhhhhcchHHHHH
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIFNKHAEKVEE 335 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~nKH~e~v~~ 335 (339)
..+-.-|-+|||+||.- ..|+.++|.=--++
T Consensus 73 tpD~IicLEDGkkfKSL---KRHL~t~~gmTPd~ 103 (148)
T COG4957 73 TPDYIICLEDGKKFKSL---KRHLTTHYGLTPDE 103 (148)
T ss_pred CCCeEEEeccCcchHHH---HHHHhcccCCCHHH
Confidence 46778899999999985 89999988643333
No 125
>COG1656 Uncharacterized conserved protein [Function unknown]
Probab=28.49 E-value=30 Score=31.43 Aligned_cols=33 Identities=24% Similarity=0.502 Sum_probs=23.0
Q ss_pred HHHHHHHHhhhhhccCCeecCCCCcccc-cChhh
Q psy4687 288 AEVEKFIAANTQEISTDKWLCPLSGKKF-KGPDF 320 (339)
Q Consensus 288 eevek~v~~~~~e~~e~K~rC~~C~KlF-k~~eF 320 (339)
+|+..-|-..+....+.=|+|+.|+|.| +|.+|
T Consensus 113 eev~~~Vp~~~~~~~~~f~~C~~CgkiYW~GsHw 146 (165)
T COG1656 113 EEVKEKVPEKVYRNYEEFYRCPKCGKIYWKGSHW 146 (165)
T ss_pred HHHhhccchhhhhcccceeECCCCcccccCchHH
Confidence 4555445544556678899999999998 56654
No 126
>PRK00110 hypothetical protein; Validated
Probab=28.35 E-value=1.4e+02 Score=28.55 Aligned_cols=58 Identities=17% Similarity=0.235 Sum_probs=44.4
Q ss_pred CCCCeEEeeecCCCCCH--HHHHHHHhhcCCeeEEEecCCCCCC-CceEEEEEEecCcccHHHHHH
Q psy4687 32 HRTSSIFLRNLAPTITK--AEVENLCKRYSGFLRVAIADPQPDR-RWFRRGWVTFRRDVDIKEICW 94 (339)
Q Consensus 32 h~t~slfIk~IpP~isr--~ele~~ck~~~GF~~lsLSDP~p~k-rf~R~GWV~f~~~~~~~~~~~ 94 (339)
+.+..|+|..+-.+..| ++|..++++++| +|++|-.-. -|-|.|.|.|+.+ +..+++.
T Consensus 92 P~GvaiiVe~lTDN~nRt~~~vR~~f~K~gG----~l~~~Gsv~~~Fe~kG~i~~~~~-~~d~~~e 152 (245)
T PRK00110 92 PGGVAIIVEALTDNRNRTAAEVRHAFSKNGG----NLGETGSVSYMFDRKGVIVIEPL-DEDELME 152 (245)
T ss_pred CCCeEEEEEEecCCHHHHHHHHHHHHHhcCc----eeCCCcceEEEeccceEEEeCCC-CHHHHHH
Confidence 55688999999988876 689999999999 566654332 7999999999865 4445543
No 127
>KOG2482|consensus
Probab=27.64 E-value=34 Score=34.64 Aligned_cols=28 Identities=18% Similarity=0.438 Sum_probs=25.4
Q ss_pred cCCeecCCCCcccccChhhHHhhhhhcc
Q psy4687 302 STDKWLCPLSGKKFKGPDFIRKHIFNKH 329 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk~~eFV~KHI~nKH 329 (339)
.=+.++|-.|.|-|+...-++-|+++|.
T Consensus 192 kL~r~~CLyCekifrdkntLkeHMrkK~ 219 (423)
T KOG2482|consen 192 KLERLRCLYCEKIFRDKNTLKEHMRKKR 219 (423)
T ss_pred HHhhheeeeeccccCCcHHHHHHHHhcc
Confidence 4577899999999999999999999984
No 128
>PRK14548 50S ribosomal protein L23P; Provisional
Probab=27.22 E-value=1.9e+02 Score=23.28 Aligned_cols=55 Identities=13% Similarity=0.156 Sum_probs=41.5
Q ss_pred ecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhh
Q psy4687 41 NLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNL 96 (339)
Q Consensus 41 ~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L 96 (339)
-+++..++.+|.+.+.++=|+.=.+.--=+.. +-++.++|+|.++.++.++...|
T Consensus 27 ~V~~~anK~eIK~AvE~lf~VkV~~VnT~~~~-~~~KKA~V~L~~g~~A~~va~ki 81 (84)
T PRK14548 27 IVDRRATKPDIKRAVEELFDVKVEKVNTLITP-KGEKKAYVKLAEEYDAEEIASRL 81 (84)
T ss_pred EECCCCCHHHHHHHHHHHhCCceEEEEeEEcC-CCcEEEEEEeCCCCcHHHHHHhh
Confidence 47899999999999999877665555322222 45689999999999888876544
No 129
>COG3677 Transposase and inactivated derivatives [DNA replication, recombination, and repair]
Probab=26.89 E-value=28 Score=30.05 Aligned_cols=15 Identities=20% Similarity=0.634 Sum_probs=13.3
Q ss_pred cCCeecCCCCccccc
Q psy4687 302 STDKWLCPLSGKKFK 316 (339)
Q Consensus 302 ~e~K~rC~~C~KlFk 316 (339)
...+|+|+.|++-|-
T Consensus 50 ~~qRyrC~~C~~tf~ 64 (129)
T COG3677 50 GHQRYKCKSCGSTFT 64 (129)
T ss_pred cccccccCCcCccee
Confidence 488999999999885
No 130
>TIGR02728 spore_gerQ spore coat protein GerQ. Members of this protein family are the spore coat protein GerQ of endospore-forming Firmicutes (low GC Gram-positive bacteria). This protein is cross-linked by a spore coat-associated transglutaminase.
Probab=25.87 E-value=80 Score=25.40 Aligned_cols=29 Identities=28% Similarity=0.591 Sum_probs=24.5
Q ss_pred CCeeEEEecCCCCCCCce----EEEEEEecCcc
Q psy4687 59 SGFLRVAIADPQPDRRWF----RRGWVTFRRDV 87 (339)
Q Consensus 59 ~GF~~lsLSDP~p~krf~----R~GWV~f~~~~ 87 (339)
.|=.++-||||+..+||. =.-||+||..-
T Consensus 44 AGRDhiiisdp~tg~RyLl~mvyldyvtFdeei 76 (82)
T TIGR02728 44 AGRDHIVISDPQSGMRYLLLMVYLDYLTFDEEI 76 (82)
T ss_pred cCcceEEEcCCCCCcEEEeeEeeeeeEEecccc
Confidence 688899999999999885 46899998763
No 131
>KOG0148|consensus
Probab=25.13 E-value=2e+02 Score=28.56 Aligned_cols=68 Identities=15% Similarity=0.240 Sum_probs=52.8
Q ss_pred ccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccC
Q psy4687 30 ALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRD 103 (339)
Q Consensus 30 ~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d 103 (339)
.-...+++++=+|++-++-++|...+..++--.-|-+= +. .+.++|-|++...+..|.-.+||..|.+
T Consensus 160 ssp~NtsVY~G~I~~~lte~~mr~~Fs~fG~I~EVRvF---k~---qGYaFVrF~tkEaAahAIv~mNntei~G 227 (321)
T KOG0148|consen 160 SSPDNTSVYVGNIASGLTEDLMRQTFSPFGPIQEVRVF---KD---QGYAFVRFETKEAAAHAIVQMNNTEIGG 227 (321)
T ss_pred CCCCCceEEeCCcCccccHHHHHHhcccCCcceEEEEe---cc---cceEEEEecchhhHHHHHHHhcCceeCc
Confidence 34556899999999999999999999988754433332 33 3578999999988777777899988876
No 132
>PF11341 DUF3143: Protein of unknown function (DUF3143); InterPro: IPR021489 This family of proteins has no known function.
Probab=24.91 E-value=59 Score=25.03 Aligned_cols=20 Identities=15% Similarity=0.272 Sum_probs=16.4
Q ss_pred eEEeeecCCCCCHHHHHHHH
Q psy4687 36 SIFLRNLAPTITKAEVENLC 55 (339)
Q Consensus 36 slfIk~IpP~isr~ele~~c 55 (339)
+-.=|..|..+||++||+.+
T Consensus 41 ~~~~rsF~YsLSR~DvE~Ai 60 (63)
T PF11341_consen 41 QDIQRSFPYSLSREDVEAAI 60 (63)
T ss_pred cccEEeccCcCCHHHHHHHH
Confidence 34468889999999999965
No 133
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=24.84 E-value=34 Score=28.05 Aligned_cols=21 Identities=38% Similarity=0.833 Sum_probs=18.0
Q ss_pred hccCCeecCCCCcccccChhh
Q psy4687 300 EISTDKWLCPLSGKKFKGPDF 320 (339)
Q Consensus 300 e~~e~K~rC~~C~KlFk~~eF 320 (339)
...-+-|.|.-|+..|.|.-|
T Consensus 48 R~a~GIW~C~kCg~~fAGgay 68 (89)
T COG1997 48 RIATGIWKCRKCGAKFAGGAY 68 (89)
T ss_pred eeccCeEEcCCCCCeeccccc
Confidence 457899999999999998765
No 134
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=23.86 E-value=41 Score=21.12 Aligned_cols=11 Identities=27% Similarity=0.543 Sum_probs=9.3
Q ss_pred CCeecCCCCcc
Q psy4687 303 TDKWLCPLSGK 313 (339)
Q Consensus 303 e~K~rC~~C~K 313 (339)
...|.||.||+
T Consensus 14 ~v~f~CPnCG~ 24 (24)
T PF07754_consen 14 AVPFPCPNCGF 24 (24)
T ss_pred CceEeCCCCCC
Confidence 67899999985
No 135
>PF08741 YwhD: YwhD family; InterPro: IPR014852 The members of this entry are currently uncharacterised. They are around 170 amino acids in length.
Probab=23.80 E-value=89 Score=28.26 Aligned_cols=61 Identities=30% Similarity=0.446 Sum_probs=42.5
Q ss_pred CCccCCCCCCCCCCCcccCCC---CeEEeeecCCCC----------------CHHHHHHHHhhcCCeeEEEecCCCCCCC
Q psy4687 14 NNIINGGESKNEPPPRALHRT---SSIFLRNLAPTI----------------TKAEVENLCKRYSGFLRVAIADPQPDRR 74 (339)
Q Consensus 14 ~~~~~~~~~~~~~~~~~lh~t---~slfIk~IpP~i----------------sr~ele~~ck~~~GF~~lsLSDP~p~kr 74 (339)
-|||.+++.|+ |++ .+|-+.+|+|-| .|..+| .|.+++.=-+-+|.-|
T Consensus 6 fnIi~~d~tdg-------H~G~g~Gsi~L~nvspViID~e~~ea~vD~GAmHArS~VE------kgIKf~~dke~vp~gk 72 (163)
T PF08741_consen 6 FNIIKNDSTDG-------HGGFGAGSISLNNVSPVIIDVEEGEAFVDMGAMHARSKVE------KGIKFSPDKEEVPNGK 72 (163)
T ss_pred eEEecCCCCCC-------CCCcceeeEeccccccEEEecCCCEEEEecccccchhhHh------cCceeCCCHHHCCCCc
Confidence 37899999888 766 466677777653 233333 3666665555566778
Q ss_pred ceEEEEEEecCcc
Q psy4687 75 WFRRGWVTFRRDV 87 (339)
Q Consensus 75 f~R~GWV~f~~~~ 87 (339)
-|-+-||+.++..
T Consensus 73 ~yWlVWVtvdr~e 85 (163)
T PF08741_consen 73 PYWLVWVTVDRKE 85 (163)
T ss_pred EEEEEEEEecccC
Confidence 8999999999885
No 136
>PF05265 DUF723: Protein of unknown function (DUF723); InterPro: IPR007929 This family contains several uncharacterised proteins from Neisseria meningitidis. These proteins may have a role in DNA binding.
Probab=23.34 E-value=34 Score=26.08 Aligned_cols=10 Identities=40% Similarity=0.720 Sum_probs=7.8
Q ss_pred CCeecCCCCc
Q psy4687 303 TDKWLCPLSG 312 (339)
Q Consensus 303 e~K~rC~~C~ 312 (339)
..|++||.|+
T Consensus 51 ~sk~GCP~Cg 60 (60)
T PF05265_consen 51 KSKHGCPECG 60 (60)
T ss_pred hhccCCCCCC
Confidence 4678899996
No 137
>PF03317 ELF: ELF protein; InterPro: IPR004990 This is a family of hypothetical proteins from cereal crops.
Probab=23.25 E-value=23 Score=33.38 Aligned_cols=42 Identities=33% Similarity=0.520 Sum_probs=30.7
Q ss_pred CCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEec
Q psy4687 34 TSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFR 84 (339)
Q Consensus 34 t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~ 84 (339)
|++-|+=|+||.|---+--+.+..+.-|..|.=-|| |||+|=
T Consensus 90 ~~~sflitlppeiqdpqalahl~glnfylslyeqdp---------~wv~fi 131 (284)
T PF03317_consen 90 TCCSFLITLPPEIQDPQALAHLAGLNFYLSLYEQDP---------GWVTFI 131 (284)
T ss_pred hhhheEEecCcccCCHHHHHHHhcchheehhhhcCc---------cHHHHH
Confidence 567789999999987777777766665555554554 899983
No 138
>KOG0109|consensus
Probab=23.18 E-value=97 Score=30.89 Aligned_cols=65 Identities=22% Similarity=0.397 Sum_probs=53.0
Q ss_pred CeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcccc
Q psy4687 35 SSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCELG 107 (339)
Q Consensus 35 ~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~el~ 107 (339)
.-|||-|+|-..+-++|..++.+++-+.- -|-+.. .|+|..+..+...+|.-+|++-+|+++..+
T Consensus 3 ~KLFIGNLp~~~~~~elr~lFe~ygkVlE---CDIvKN-----YgFVHiEdktaaedairNLhgYtLhg~nIn 67 (346)
T KOG0109|consen 3 VKLFIGNLPREATEQELRSLFEQYGKVLE---CDIVKN-----YGFVHIEDKTAAEDAIRNLHGYTLHGVNIN 67 (346)
T ss_pred cchhccCCCcccchHHHHHHHHhhCceEe---eeeecc-----cceEEeecccccHHHHhhcccceecceEEE
Confidence 35899999999999999999988876443 344443 699999999888888888999999887765
No 139
>KOG4205|consensus
Probab=22.45 E-value=1.4e+02 Score=29.68 Aligned_cols=61 Identities=23% Similarity=0.363 Sum_probs=47.3
Q ss_pred CCCeEEeeecCCCCCHHHHHHHHhhcC-CeeEEEecCCCCCCCceEEEEEEecCcccHHHHHH
Q psy4687 33 RTSSIFLRNLAPTITKAEVENLCKRYS-GFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICW 94 (339)
Q Consensus 33 ~t~slfIk~IpP~isr~ele~~ck~~~-GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~ 94 (339)
.|.-+|+-.||++++-.++.+.+.+++ =..-+..-|....+ =..-|.|+|++...++.++.
T Consensus 96 ~tkkiFvGG~~~~~~e~~~r~yfe~~g~v~~~~~~~d~~~~~-~rgFgfv~~~~e~sVdkv~~ 157 (311)
T KOG4205|consen 96 RTKKIFVGGLPPDTTEEDFKDYFEQFGKVADVVIMYDKTTSR-PRGFGFVTFDSEDSVDKVTL 157 (311)
T ss_pred ceeEEEecCcCCCCchHHHhhhhhccceeEeeEEeecccccc-cccceeeEeccccccceecc
Confidence 577999999999999999999999987 34445556665543 23459999999998887763
No 140
>COG1439 Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module [General function prediction only]
Probab=22.26 E-value=32 Score=31.59 Aligned_cols=19 Identities=11% Similarity=0.207 Sum_probs=16.4
Q ss_pred hhhccCCeecCCCCccccc
Q psy4687 298 TQEISTDKWLCPLSGKKFK 316 (339)
Q Consensus 298 ~~e~~e~K~rC~~C~KlFk 316 (339)
+++.-.|++||..|.|.|.
T Consensus 132 I~~v~~w~~rC~GC~~~f~ 150 (177)
T COG1439 132 IKKVRKWRLRCHGCKRIFP 150 (177)
T ss_pred cceEeeeeEEEecCceecC
Confidence 3455799999999999999
No 141
>PRK12378 hypothetical protein; Provisional
Probab=20.91 E-value=3.1e+02 Score=26.15 Aligned_cols=58 Identities=16% Similarity=0.256 Sum_probs=45.5
Q ss_pred CCCCeEEeeecCCCCCH--HHHHHHHhhcCCeeEEEecCCCCCC-CceEEEEEEecCcccHHHHHH
Q psy4687 32 HRTSSIFLRNLAPTITK--AEVENLCKRYSGFLRVAIADPQPDR-RWFRRGWVTFRRDVDIKEICW 94 (339)
Q Consensus 32 h~t~slfIk~IpP~isr--~ele~~ck~~~GF~~lsLSDP~p~k-rf~R~GWV~f~~~~~~~~~~~ 94 (339)
+.+..|+|..+..+..| ++|..++++++| +|++|-.-+ -|.|.|-|.|... +..+++.
T Consensus 89 PgGvaiiVe~lTDN~nRt~~~vr~~f~K~gg----~l~~~gsv~~~Fe~kG~i~i~~~-~~d~~~e 149 (235)
T PRK12378 89 PNGVMVIVECLTDNVNRTVANVRSAFNKNGG----NLGTSGSVAFMFDHKGVFVFEGD-DEDELLE 149 (235)
T ss_pred CCCcEEEEEECCCCHHHHHHHHHHHHhhcCC----eECCCCceeeeeecceEEEeCCC-CHHHHHH
Confidence 56688999999998877 689999999999 667665433 8999999999855 5555553
No 142
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=20.85 E-value=30 Score=30.92 Aligned_cols=18 Identities=22% Similarity=0.359 Sum_probs=14.8
Q ss_pred ecCCCCcccccChhhHHh
Q psy4687 306 WLCPLSGKKFKGPDFIRK 323 (339)
Q Consensus 306 ~rC~~C~KlFk~~eFV~K 323 (339)
++|+.||+.|.+-|=|.+
T Consensus 29 ~~c~~c~~~f~~~e~~~~ 46 (154)
T PRK00464 29 RECLACGKRFTTFERVEL 46 (154)
T ss_pred eeccccCCcceEeEeccC
Confidence 899999999998765543
No 143
>KOG4660|consensus
Probab=20.64 E-value=2.3e+02 Score=30.40 Aligned_cols=72 Identities=15% Similarity=0.254 Sum_probs=51.2
Q ss_pred CCCCcccCCCCeEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccc
Q psy4687 25 EPPPRALHRTSSIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRL 101 (339)
Q Consensus 25 ~~~~~~lh~t~slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i 101 (339)
+..+..-|.+-+|+|=++|++||-++|..++..++-.+=+-.+ |+ +++|+ .|-|=.=-+++.|+..|+...+
T Consensus 66 ~np~~~~~~~~~L~v~nl~~~Vsn~~L~~~f~~yGeir~ir~t-~~--~~~~~--~v~FyDvR~A~~Alk~l~~~~~ 137 (549)
T KOG4660|consen 66 DNPSEKDMNQGTLVVFNLPRSVSNDTLLRIFGAYGEIREIRET-PN--KRGIV--FVEFYDVRDAERALKALNRREI 137 (549)
T ss_pred CCCCcccCccceEEEEecCCcCCHHHHHHHHHhhcchhhhhcc-cc--cCceE--EEEEeehHhHHHHHHHHHHHHh
Confidence 4455557888999999999999999999999999876653333 33 34543 5666544466677777775444
No 144
>KOG4211|consensus
Probab=20.27 E-value=2.6e+02 Score=29.64 Aligned_cols=75 Identities=15% Similarity=0.126 Sum_probs=51.2
Q ss_pred cCCCCCCCCCCCcccCC---CCeEEeeecCCCCCHHHHHHHHhhcCCeeE--EEecCCCCCCCceEEEEEEecCcccHHH
Q psy4687 17 INGGESKNEPPPRALHR---TSSIFLRNLAPTITKAEVENLCKRYSGFLR--VAIADPQPDRRWFRRGWVTFRRDVDIKE 91 (339)
Q Consensus 17 ~~~~~~~~~~~~~~lh~---t~slfIk~IpP~isr~ele~~ck~~~GF~~--lsLSDP~p~krf~R~GWV~f~~~~~~~~ 91 (339)
+.+.|-|..-+|-..|. ...+-+|-||=+|+.++|.+|+.-++=..- +=+-|+.- |.-=-++|.|.+...+..
T Consensus 83 ~~~~e~d~~~~~~g~~s~~~d~vVRLRGLPfscte~dI~~FFaGL~Iv~~gi~l~~d~rg--R~tGEAfVqF~sqe~ae~ 160 (510)
T KOG4211|consen 83 AGGAEADWVMRPGGPNSSANDGVVRLRGLPFSCTEEDIVEFFAGLEIVPDGILLPMDQRG--RPTGEAFVQFESQESAEI 160 (510)
T ss_pred cCCccccccccCCCCCCCCCCceEEecCCCccCcHHHHHHHhcCCcccccceeeeccCCC--CcccceEEEecCHHHHHH
Confidence 44555555556655555 478999999999999999999998732221 11222332 466789999999876665
Q ss_pred HH
Q psy4687 92 IC 93 (339)
Q Consensus 92 ~~ 93 (339)
|+
T Consensus 161 Al 162 (510)
T KOG4211|consen 161 AL 162 (510)
T ss_pred HH
Confidence 54
No 145
>PTZ00380 microtubule-associated protein (MAP); Provisional
Probab=20.11 E-value=1.3e+02 Score=25.98 Aligned_cols=37 Identities=24% Similarity=0.484 Sum_probs=27.2
Q ss_pred CeEEeee-cCCC-CCHHHHHHHHhhcCCeeEEEecCCCC
Q psy4687 35 SSIFLRN-LAPT-ITKAEVENLCKRYSGFLRVAIADPQP 71 (339)
Q Consensus 35 ~slfIk~-IpP~-isr~ele~~ck~~~GF~~lsLSDP~p 71 (339)
.-||+.+ +||+ .+=.+|-+-+|.-.||+|++-|+=+.
T Consensus 71 ~flfVnn~lp~~s~~mg~lYe~~KDeDGFLYi~Ys~e~t 109 (121)
T PTZ00380 71 VTLAIEGSTPAVTATVGDIADACKRDDGFLYVSVRTEQA 109 (121)
T ss_pred EEEEECCccCCccchHHHHHHHhcCCCCeEEEEEccccc
Confidence 4577777 5543 35677888888889999999987543
No 146
>KOG0106|consensus
Probab=20.08 E-value=85 Score=29.69 Aligned_cols=62 Identities=18% Similarity=0.310 Sum_probs=46.3
Q ss_pred eEEeeecCCCCCHHHHHHHHhhcCCeeEEEecCCCCCCCceEEEEEEecCcccHHHHHHhhccccccCcc
Q psy4687 36 SIFLRNLAPTITKAEVENLCKRYSGFLRVAIADPQPDRRWFRRGWVTFRRDVDIKEICWNLNNIRLRDCE 105 (339)
Q Consensus 36 slfIk~IpP~isr~ele~~ck~~~GF~~lsLSDP~p~krf~R~GWV~f~~~~~~~~~~~~L~~~~i~d~e 105 (339)
-+||..+|+.+...+||.|+++++-...+.+- -. .|+|.|...-++.+|+..||+..+-+-.
T Consensus 3 rv~vg~~~~~~~~~d~E~~f~~yg~~~d~~mk-----~g---f~fv~fed~rda~Dav~~l~~~~l~~e~ 64 (216)
T KOG0106|consen 3 RVYIGRLPYRARERDVERFFKGYGKIPDADMK-----NG---FGFVEFEDPRDADDAVHDLDGKELCGER 64 (216)
T ss_pred ceeecccCCccchhHHHHHHhhccccccceee-----cc---cceeccCchhhhhcccchhcCceeccee
Confidence 47899999999999999999998765554442 11 2267888888888888888876665433
Done!