Query psy15934
Match_columns 70
No_of_seqs 88 out of 90
Neff 4.1
Searched_HMMs 46136
Date Fri Aug 16 16:49:47 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15934.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15934hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13637 Ank_4: Ankyrin repeat 98.3 5.2E-07 1.1E-11 50.3 2.7 35 34-69 5-39 (54)
2 PF13857 Ank_5: Ankyrin repeat 97.9 5.5E-06 1.2E-10 47.1 2.0 36 34-70 20-55 (56)
3 PF12796 Ank_2: Ankyrin repeat 97.7 4.2E-05 9.1E-10 45.0 3.5 36 33-69 29-64 (89)
4 PHA02875 ankyrin repeat protei 97.4 0.00015 3.3E-09 53.1 3.1 36 33-69 5-40 (413)
5 PHA02878 ankyrin repeat protei 97.2 0.00029 6.4E-09 53.1 3.1 36 34-70 41-76 (477)
6 KOG0818|consensus 97.1 0.00053 1.1E-08 56.5 3.7 39 32-70 135-173 (669)
7 PHA03095 ankyrin-like protein; 97.1 0.00052 1.1E-08 50.6 3.0 36 34-70 261-296 (471)
8 PHA02798 ankyrin-like protein; 97.0 0.00052 1.1E-08 52.2 2.8 36 34-70 262-297 (489)
9 PHA02989 ankyrin repeat protei 96.9 0.00086 1.9E-08 51.0 2.9 35 34-69 260-294 (494)
10 PHA02875 ankyrin repeat protei 96.8 0.0013 2.7E-08 48.3 3.2 35 34-69 106-140 (413)
11 KOG0514|consensus 96.8 0.0017 3.8E-08 51.8 4.1 54 10-69 325-378 (452)
12 TIGR00870 trp transient-recept 96.8 0.0012 2.7E-08 52.7 3.2 39 30-69 17-57 (743)
13 PHA03100 ankyrin repeat protei 96.7 0.0015 3.3E-08 48.4 3.1 35 34-69 254-288 (480)
14 PLN03192 Voltage-dependent pot 96.7 0.0015 3.2E-08 53.4 3.2 38 32-70 527-564 (823)
15 PHA02743 Viral ankyrin protein 96.7 0.0014 3E-08 43.6 2.5 36 34-69 61-99 (166)
16 PHA02874 ankyrin repeat protei 96.6 0.0018 4E-08 48.2 3.2 35 34-69 128-162 (434)
17 PF13606 Ank_3: Ankyrin repeat 96.5 0.0043 9.4E-08 31.8 3.3 25 34-58 6-30 (30)
18 PHA03095 ankyrin-like protein; 96.5 0.0027 5.9E-08 46.8 3.3 34 35-69 19-52 (471)
19 PF00023 Ank: Ankyrin repeat H 96.5 0.0064 1.4E-07 30.9 3.8 27 34-60 6-32 (33)
20 PHA02736 Viral ankyrin protein 96.4 0.0033 7.2E-08 40.5 3.0 35 34-69 96-131 (154)
21 PHA02878 ankyrin repeat protei 96.4 0.0031 6.8E-08 47.6 3.1 35 34-69 172-206 (477)
22 PLN03192 Voltage-dependent pot 96.4 0.0032 7E-08 51.4 3.2 36 34-70 626-661 (823)
23 PTZ00322 6-phosphofructo-2-kin 96.3 0.0039 8.4E-08 50.4 3.3 37 33-70 85-121 (664)
24 PHA02884 ankyrin repeat protei 96.3 0.0041 8.8E-08 46.6 3.1 35 35-69 38-75 (300)
25 PHA02791 ankyrin-like protein; 96.2 0.0046 1E-07 45.5 3.1 35 34-69 65-99 (284)
26 PHA02946 ankyin-like protein; 96.2 0.0054 1.2E-07 47.1 3.6 35 34-69 76-110 (446)
27 PHA02795 ankyrin-like protein; 96.2 0.0043 9.3E-08 49.1 3.1 36 34-70 225-260 (437)
28 PHA03100 ankyrin repeat protei 96.2 0.0046 1E-07 45.8 3.1 36 33-69 218-255 (480)
29 PHA02874 ankyrin repeat protei 96.2 0.006 1.3E-07 45.4 3.6 35 34-69 39-73 (434)
30 PF13857 Ank_5: Ankyrin repeat 96.1 0.001 2.2E-08 37.6 -0.6 21 49-70 1-22 (56)
31 PHA02741 hypothetical protein; 96.0 0.0081 1.8E-07 39.8 3.1 35 34-69 102-137 (169)
32 PHA02743 Viral ankyrin protein 96.0 0.0076 1.6E-07 40.1 2.9 36 34-70 98-134 (166)
33 PHA02859 ankyrin repeat protei 95.9 0.0085 1.8E-07 41.3 3.2 28 42-69 65-92 (209)
34 PHA02884 ankyrin repeat protei 95.9 0.0083 1.8E-07 45.0 3.2 35 34-69 108-142 (300)
35 PHA02741 hypothetical protein; 95.8 0.011 2.4E-07 39.1 3.1 37 34-70 64-104 (169)
36 KOG4412|consensus 95.8 0.0087 1.9E-07 44.3 2.9 36 34-70 76-112 (226)
37 PHA02730 ankyrin-like protein; 95.6 0.013 2.9E-07 48.6 3.6 35 34-69 45-82 (672)
38 PF12796 Ank_2: Ankyrin repeat 95.5 0.019 4.1E-07 33.5 3.2 31 34-69 1-31 (89)
39 PHA02736 Viral ankyrin protein 95.5 0.0079 1.7E-07 38.8 1.6 34 37-70 62-98 (154)
40 PHA02798 ankyrin-like protein; 95.5 0.031 6.7E-07 42.6 5.1 35 34-69 40-76 (489)
41 PHA02859 ankyrin repeat protei 95.5 0.012 2.7E-07 40.5 2.6 29 40-69 169-197 (209)
42 PHA02876 ankyrin repeat protei 95.5 0.014 3E-07 46.2 3.1 35 34-69 149-183 (682)
43 KOG0509|consensus 95.5 0.011 2.4E-07 48.8 2.7 34 36-70 118-151 (600)
44 COG0666 Arp FOG: Ankyrin repea 95.4 0.019 4E-07 36.1 3.1 39 31-70 74-112 (235)
45 PHA02946 ankyin-like protein; 95.4 0.011 2.4E-07 45.4 2.3 32 38-70 47-78 (446)
46 PTZ00322 6-phosphofructo-2-kin 95.4 0.016 3.4E-07 46.9 3.2 36 34-70 119-154 (664)
47 PHA02716 CPXV016; CPX019; EVM0 95.2 0.022 4.7E-07 47.9 3.6 38 31-69 285-322 (764)
48 cd00204 ANK ankyrin repeats; 95.1 0.038 8.2E-07 31.9 3.5 35 34-69 44-78 (126)
49 PHA02989 ankyrin repeat protei 95.0 0.015 3.3E-07 44.3 2.1 31 38-69 119-150 (494)
50 COG0666 Arp FOG: Ankyrin repea 94.9 0.046 1E-06 34.3 3.6 35 34-69 110-152 (235)
51 PHA02876 ankyrin repeat protei 94.8 0.03 6.4E-07 44.3 3.2 36 33-69 181-216 (682)
52 PHA02716 CPXV016; CPX019; EVM0 94.7 0.034 7.4E-07 46.7 3.5 35 34-69 216-252 (764)
53 PHA02791 ankyrin-like protein; 94.6 0.041 8.8E-07 40.5 3.3 36 33-69 163-199 (284)
54 PHA02917 ankyrin-like protein; 94.2 0.044 9.6E-07 44.7 3.1 34 35-69 108-141 (661)
55 smart00248 ANK ankyrin repeats 93.2 0.25 5.4E-06 21.2 3.5 24 34-57 6-29 (30)
56 cd00204 ANK ankyrin repeats; 93.2 0.15 3.3E-06 29.3 3.4 35 34-69 77-111 (126)
57 KOG0509|consensus 93.0 0.11 2.3E-06 43.2 3.4 43 26-69 40-83 (600)
58 KOG4214|consensus 92.8 0.074 1.6E-06 36.1 1.8 36 32-69 4-39 (117)
59 KOG3676|consensus 92.6 0.083 1.8E-06 45.0 2.3 33 37-70 247-279 (782)
60 KOG0502|consensus 92.5 0.06 1.3E-06 41.1 1.1 32 37-69 233-264 (296)
61 PHA02917 ankyrin-like protein; 92.2 0.14 3.1E-06 41.8 3.1 35 34-69 456-490 (661)
62 KOG0505|consensus 92.1 0.1 2.2E-06 42.7 2.2 37 32-69 42-78 (527)
63 KOG0510|consensus 91.8 0.094 2E-06 45.3 1.7 33 37-70 128-160 (929)
64 PHA02792 ankyrin-like protein; 91.6 0.18 4E-06 41.8 3.1 35 34-69 414-448 (631)
65 KOG1710|consensus 91.4 0.22 4.9E-06 39.3 3.2 36 34-69 49-84 (396)
66 KOG0508|consensus 91.1 0.14 3E-06 42.5 1.9 27 43-70 508-534 (615)
67 KOG0512|consensus 90.5 0.3 6.6E-06 36.2 3.1 36 33-69 100-135 (228)
68 TIGR00870 trp transient-recept 90.4 0.11 2.5E-06 41.6 0.8 36 34-70 179-214 (743)
69 PHA02792 ankyrin-like protein; 89.7 0.31 6.7E-06 40.5 2.8 36 33-69 342-379 (631)
70 KOG4412|consensus 88.6 0.37 8E-06 35.9 2.3 36 34-70 143-178 (226)
71 PHA02795 ankyrin-like protein; 88.3 0.44 9.5E-06 37.9 2.7 33 36-69 194-226 (437)
72 KOG0508|consensus 88.1 0.42 9E-06 39.7 2.5 35 35-70 122-156 (615)
73 PHA02730 ankyrin-like protein; 87.2 0.73 1.6E-05 38.5 3.5 36 34-69 466-501 (672)
74 KOG0515|consensus 86.6 0.69 1.5E-05 39.0 3.0 33 36-69 589-621 (752)
75 KOG4177|consensus 86.2 0.6 1.3E-05 41.3 2.6 38 31-69 541-578 (1143)
76 KOG0512|consensus 85.7 1 2.2E-05 33.5 3.2 42 27-69 60-102 (228)
77 KOG2505|consensus 84.1 1 2.2E-05 37.5 2.8 34 36-70 436-469 (591)
78 PF08332 CaMKII_AD: Calcium/ca 83.8 1.6 3.6E-05 29.3 3.4 30 19-53 2-31 (128)
79 KOG0502|consensus 82.8 0.73 1.6E-05 35.3 1.5 36 34-70 164-199 (296)
80 KOG0195|consensus 82.3 1.1 2.3E-05 35.8 2.3 32 37-69 74-105 (448)
81 KOG4177|consensus 78.6 1.7 3.6E-05 38.6 2.5 42 28-70 240-281 (1143)
82 KOG0783|consensus 78.3 0.99 2.2E-05 40.0 1.0 32 38-69 60-91 (1267)
83 PF10152 DUF2360: Predicted co 77.9 2.2 4.7E-05 29.0 2.4 33 28-60 113-147 (148)
84 KOG0818|consensus 72.9 2.5 5.5E-05 35.5 2.0 32 37-69 174-205 (669)
85 KOG4496|consensus 72.6 4.5 9.7E-05 29.4 3.0 36 30-67 134-171 (194)
86 KOG1710|consensus 72.3 3 6.6E-05 33.1 2.2 39 30-69 12-50 (396)
87 KOG0510|consensus 71.5 4.1 8.8E-05 35.7 2.9 35 34-69 277-311 (929)
88 KOG0521|consensus 70.1 2.4 5.2E-05 36.1 1.3 33 36-69 695-727 (785)
89 PF12870 Lumazine_bd: Lumazine 70.0 4.1 9E-05 23.9 2.0 24 29-52 11-34 (111)
90 KOG4214|consensus 68.8 5.5 0.00012 27.1 2.6 34 33-67 70-103 (117)
91 PF12169 DNA_pol3_gamma3: DNA 64.2 9.5 0.0002 24.3 3.0 31 27-57 14-48 (143)
92 PF07729 FCD: FCD domain; Int 59.8 19 0.00041 20.9 3.5 28 25-52 94-121 (125)
93 KOG0505|consensus 59.4 11 0.00023 31.3 3.1 38 31-69 74-111 (527)
94 PF01649 Ribosomal_S20p: Ribos 59.2 18 0.0004 22.6 3.6 23 28-50 28-50 (84)
95 PRK14135 recX recombination re 59.1 15 0.00033 26.1 3.5 55 6-60 31-92 (263)
96 TIGR00756 PPR pentatricopeptid 58.4 18 0.0004 16.6 3.0 22 37-58 10-35 (35)
97 KOG0507|consensus 54.4 12 0.00025 32.7 2.6 33 36-69 121-153 (854)
98 PF13041 PPR_2: PPR repeat fam 52.2 22 0.00048 18.7 2.7 21 38-58 14-38 (50)
99 KOG3609|consensus 48.1 19 0.00041 31.4 2.9 41 29-70 24-68 (822)
100 PRK06585 holA DNA polymerase I 47.7 43 0.00094 24.3 4.4 41 16-56 197-244 (343)
101 COG0268 RpsT Ribosomal protein 47.1 37 0.00079 21.9 3.5 23 28-50 29-51 (88)
102 KOG0511|consensus 47.0 16 0.00036 30.0 2.3 39 30-69 36-74 (516)
103 cd00781 ketosteroid_isomerase 46.6 32 0.0007 20.7 3.1 26 28-53 6-31 (122)
104 TIGR00029 S20 ribosomal protei 44.4 43 0.00093 21.2 3.5 24 27-50 28-51 (87)
105 PRK14155 heat shock protein Gr 44.4 6.8 0.00015 28.4 -0.2 33 29-61 101-133 (208)
106 PF04908 SH3BGR: SH3-binding, 44.3 13 0.00028 24.0 1.1 21 29-49 78-98 (99)
107 PRK00239 rpsT 30S ribosomal pr 44.0 45 0.00097 21.1 3.6 24 27-50 28-51 (88)
108 PF11252 DUF3051: Protein of u 43.5 20 0.00044 25.8 2.1 46 3-56 51-99 (189)
109 TIGR02957 SigX4 RNA polymerase 43.4 36 0.00077 24.5 3.4 27 28-54 167-193 (281)
110 COG2605 Predicted kinase relat 43.2 64 0.0014 25.5 4.9 52 3-54 201-253 (333)
111 PF05223 MecA_N: NTF2-like N-t 43.0 26 0.00056 22.1 2.4 23 29-51 5-27 (118)
112 PF12680 SnoaL_2: SnoaL-like d 42.7 15 0.00032 20.5 1.1 20 32-51 2-21 (102)
113 PF02607 B12-binding_2: B12 bi 42.5 17 0.00038 20.9 1.4 25 30-54 4-28 (79)
114 PF13474 SnoaL_3: SnoaL-like d 41.5 42 0.0009 19.6 3.0 23 29-51 3-25 (121)
115 PRK14156 heat shock protein Gr 40.3 7.8 0.00017 27.5 -0.4 31 28-61 108-138 (177)
116 PRK09636 RNA polymerase sigma 39.6 46 0.00099 23.9 3.5 26 28-53 174-199 (293)
117 PF13963 Transpos_assoc: Trans 39.1 18 0.00038 21.9 1.1 13 47-59 58-70 (77)
118 PRK09635 sigI RNA polymerase s 38.8 44 0.00094 24.5 3.3 24 28-51 177-200 (290)
119 PRK14139 heat shock protein Gr 37.8 8.4 0.00018 27.5 -0.6 31 30-61 116-146 (185)
120 TIGR02960 SigX5 RNA polymerase 36.7 52 0.0011 23.6 3.3 27 28-54 207-233 (324)
121 PF04651 Pox_A12: Poxvirus A12 36.5 30 0.00064 25.4 2.1 23 28-51 8-30 (189)
122 smart00668 CTLH C-terminal to 36.5 71 0.0015 17.0 3.3 23 30-52 4-26 (58)
123 TIGR02096 conserved hypothetic 36.4 49 0.0011 19.9 2.8 24 30-53 3-26 (129)
124 PF07739 TipAS: TipAS antibiot 35.8 28 0.0006 21.2 1.6 16 43-58 41-56 (118)
125 PF07890 Rrp15p: Rrp15p; Inte 35.7 62 0.0014 21.4 3.4 35 18-53 72-106 (130)
126 PRK14154 heat shock protein Gr 35.5 8 0.00017 28.2 -1.0 31 31-61 140-170 (208)
127 PF09572 RE_XamI: XamI restric 35.3 37 0.0008 25.8 2.5 25 43-67 67-92 (251)
128 PF14534 DUF4440: Domain of un 34.3 28 0.0006 19.7 1.4 23 29-51 3-25 (107)
129 KOG0522|consensus 33.6 48 0.001 27.8 3.1 35 34-69 59-93 (560)
130 PF05714 Borrelia_lipo_1: Borr 33.6 33 0.00071 24.7 1.9 22 4-25 67-89 (211)
131 KOG0521|consensus 33.5 30 0.00066 29.6 2.0 34 35-69 661-694 (785)
132 PRK14163 heat shock protein Gr 33.2 11 0.00024 27.6 -0.6 32 29-61 118-149 (214)
133 PRK08241 RNA polymerase factor 32.6 61 0.0013 23.5 3.2 26 29-54 218-243 (339)
134 PRK14159 heat shock protein Gr 32.3 9.1 0.0002 27.1 -1.1 29 31-61 110-138 (176)
135 PF13356 DUF4102: Domain of un 32.1 25 0.00055 21.1 1.0 17 43-59 67-83 (89)
136 PF12893 Lumazine_bd_2: Putati 32.1 39 0.00084 20.8 1.9 27 28-58 7-33 (116)
137 PRK14146 heat shock protein Gr 31.8 13 0.00029 27.0 -0.3 33 28-61 138-170 (215)
138 PRK14151 heat shock protein Gr 31.7 8.8 0.00019 27.0 -1.2 30 31-61 108-137 (176)
139 CHL00102 rps20 ribosomal prote 31.4 85 0.0018 20.2 3.4 24 27-50 28-58 (93)
140 KOG3676|consensus 31.3 33 0.00071 29.8 1.8 25 36-60 190-214 (782)
141 PF15472 DUF4638: Domain of un 31.0 41 0.00088 25.8 2.1 21 40-60 227-248 (268)
142 COG0576 GrpE Molecular chapero 30.8 12 0.00026 26.6 -0.7 30 31-61 124-153 (193)
143 PRK10325 heat shock protein Gr 30.1 14 0.00031 26.3 -0.4 30 31-61 127-156 (197)
144 PRK14161 heat shock protein Gr 30.1 10 0.00023 26.7 -1.1 31 31-61 108-138 (178)
145 PRK07417 arogenate dehydrogena 30.0 66 0.0014 23.0 3.0 25 29-53 246-270 (279)
146 PRK14150 heat shock protein Gr 29.8 14 0.0003 26.3 -0.5 30 31-61 126-155 (193)
147 KOG0195|consensus 29.8 56 0.0012 26.3 2.8 36 34-70 38-73 (448)
148 PRK05629 hypothetical protein; 29.6 1.9E+02 0.0041 21.0 5.4 47 11-57 174-227 (318)
149 PF01399 PCI: PCI domain; Int 29.3 91 0.002 18.0 3.1 24 30-53 2-25 (105)
150 smart00165 UBA Ubiquitin assoc 29.2 35 0.00077 17.2 1.1 15 43-57 2-16 (37)
151 PRK14157 heat shock protein Gr 29.2 15 0.00033 27.2 -0.4 31 30-61 156-186 (227)
152 PF00627 UBA: UBA/TS-N domain; 29.2 16 0.00034 19.0 -0.3 15 43-57 3-17 (37)
153 TIGR02246 conserved hypothetic 29.1 96 0.0021 18.4 3.2 24 28-51 7-30 (128)
154 cd08323 CARD_APAF1 Caspase act 29.1 93 0.002 19.5 3.2 26 30-55 61-86 (86)
155 KOG2204|consensus 28.7 51 0.0011 28.0 2.5 25 43-67 355-384 (625)
156 PRK14525 rpsP 30S ribosomal pr 28.7 52 0.0011 21.1 2.1 17 41-57 51-67 (88)
157 KOG0506|consensus 28.7 59 0.0013 27.5 2.8 39 31-70 507-545 (622)
158 PRK00040 rpsP 30S ribosomal pr 28.6 50 0.0011 20.5 1.9 17 41-57 52-68 (75)
159 PF12554 MOZART1: Mitotic-spin 28.6 31 0.00068 19.9 0.9 13 45-57 26-38 (48)
160 PHA02608 67 prohead core prote 27.3 76 0.0016 20.4 2.6 18 32-49 2-19 (80)
161 PRK14153 heat shock protein Gr 27.3 16 0.00034 26.4 -0.6 32 29-61 118-149 (194)
162 PRK10219 DNA-binding transcrip 26.9 90 0.002 18.9 2.9 45 9-53 25-69 (107)
163 PRK14148 heat shock protein Gr 26.7 14 0.00031 26.5 -0.9 30 31-61 127-156 (195)
164 PRK07502 cyclohexadienyl dehyd 26.6 83 0.0018 22.7 3.0 26 28-53 256-281 (307)
165 PRK14140 heat shock protein Gr 26.1 18 0.00038 26.0 -0.5 30 31-61 124-153 (191)
166 PF06896 DUF1268: Protein of u 25.8 1.7E+02 0.0036 19.4 4.2 29 10-39 61-89 (114)
167 PHA01976 helix-turn-helix prot 25.8 92 0.002 17.2 2.6 48 13-60 12-62 (67)
168 KOG0507|consensus 25.8 60 0.0013 28.5 2.5 33 37-70 89-121 (854)
169 TIGR00002 S16 ribosomal protei 25.7 63 0.0014 20.1 2.0 18 40-57 48-65 (78)
170 cd08332 CARD_CASP2 Caspase act 25.5 1E+02 0.0022 19.1 3.0 31 22-52 57-89 (90)
171 PRK14145 heat shock protein Gr 25.3 19 0.0004 26.0 -0.5 31 30-61 128-158 (196)
172 PRK14134 recX recombination re 25.2 95 0.0021 23.0 3.2 55 6-60 35-96 (283)
173 PF02153 PDH: Prephenate dehyd 24.9 1.1E+02 0.0025 21.7 3.5 24 28-51 234-257 (258)
174 COG5626 Uncharacterized small 24.9 77 0.0017 21.0 2.4 24 31-54 38-61 (97)
175 PRK14956 DNA polymerase III su 24.9 1.1E+02 0.0024 24.9 3.7 30 28-57 248-282 (484)
176 KOG3835|consensus 24.8 97 0.0021 25.5 3.3 38 16-53 8-45 (495)
177 PRK11199 tyrA bifunctional cho 24.5 90 0.002 23.7 3.0 26 28-53 321-346 (374)
178 KOG2384|consensus 24.1 62 0.0013 24.3 2.0 35 34-69 16-51 (223)
179 PRK14144 heat shock protein Gr 23.9 17 0.00038 26.3 -0.9 31 30-61 130-160 (199)
180 CHL00005 rps16 ribosomal prote 23.7 67 0.0015 20.3 1.9 18 40-57 47-64 (82)
181 PRK14142 heat shock protein Gr 23.5 24 0.00052 26.2 -0.2 31 30-61 113-143 (223)
182 PF13812 PPR_3: Pentatricopept 23.3 96 0.0021 14.3 2.6 19 38-56 12-34 (34)
183 PF07862 Nif11: Nitrogen fixat 23.1 83 0.0018 17.0 2.0 29 27-55 2-42 (49)
184 KOG2785|consensus 23.0 1E+02 0.0023 24.8 3.2 21 7-27 32-52 (390)
185 PF03013 Pyr_excise: Pyrimidin 23.0 52 0.0011 22.3 1.3 26 36-61 59-84 (130)
186 TIGR00370 conserved hypothetic 22.9 43 0.00094 23.9 1.0 23 10-32 95-117 (202)
187 cd07218 Pat_iPLA2 Calcium-inde 22.9 87 0.0019 22.6 2.6 29 27-55 205-233 (245)
188 KOG4023|consensus 22.8 40 0.00088 22.7 0.8 22 26-47 75-96 (108)
189 PRK08507 prephenate dehydrogen 22.8 1.1E+02 0.0024 21.7 3.0 25 29-53 244-268 (275)
190 PRK14149 heat shock protein Gr 22.7 18 0.00038 26.1 -1.0 30 30-61 122-151 (191)
191 KOG0514|consensus 22.6 41 0.00088 27.5 0.9 30 40-70 278-308 (452)
192 PF11815 DUF3336: Domain of un 22.5 1.3E+02 0.0028 20.2 3.2 32 29-60 61-92 (145)
193 PF13877 RPAP3_C: Potential Mo 22.4 1.3E+02 0.0028 18.1 3.0 27 32-58 22-48 (94)
194 PRK14160 heat shock protein Gr 22.3 21 0.00046 26.0 -0.7 30 30-61 144-173 (211)
195 PRK14158 heat shock protein Gr 22.2 18 0.00039 26.0 -1.1 31 31-61 126-156 (194)
196 PRK14162 heat shock protein Gr 21.9 22 0.00047 25.6 -0.7 30 31-61 126-155 (194)
197 PRK05574 holA DNA polymerase I 21.6 2.1E+02 0.0045 20.3 4.3 40 17-56 201-247 (340)
198 smart00796 AHS1 Allophanate hy 21.6 47 0.001 23.5 1.0 23 10-32 105-127 (201)
199 PF04910 Tcf25: Transcriptiona 21.5 1.3E+02 0.0029 22.9 3.4 24 29-52 12-35 (360)
200 COG1393 ArsC Arsenate reductas 21.4 57 0.0012 21.3 1.2 27 40-68 11-38 (117)
201 PF03489 SapB_2: Saposin-like 21.3 48 0.001 16.7 0.7 18 43-60 12-29 (35)
202 cd07221 Pat_PNPLA3 Patatin-lik 21.2 88 0.0019 22.7 2.3 30 26-55 207-236 (252)
203 PF10052 DUF2288: Protein of u 21.2 1.1E+02 0.0023 19.8 2.5 20 35-54 41-60 (93)
204 COG3041 Uncharacterized protei 21.2 1.7E+02 0.0038 19.1 3.4 32 30-61 10-48 (91)
205 PRK14143 heat shock protein Gr 21.2 25 0.00054 26.0 -0.6 31 30-61 154-184 (238)
206 PF12997 DUF3881: Domain of un 21.1 1.6E+02 0.0034 22.7 3.7 34 16-49 151-185 (283)
207 PRK14147 heat shock protein Gr 20.9 22 0.00047 24.9 -0.9 29 32-61 104-132 (172)
208 PRK14962 DNA polymerase III su 20.9 1.3E+02 0.0029 23.9 3.4 42 17-58 230-278 (472)
209 PF12976 DUF3860: Domain of Un 20.9 1E+02 0.0022 20.1 2.3 29 31-61 10-38 (92)
210 PRK06545 prephenate dehydrogen 20.8 1.3E+02 0.0028 22.5 3.2 25 29-53 248-272 (359)
211 TIGR01470 cysG_Nterm siroheme 20.8 2.7E+02 0.0058 19.4 4.6 31 24-54 174-204 (205)
212 TIGR03123 one_C_unchar_1 proba 20.6 1.6E+02 0.0034 22.6 3.6 49 8-56 231-280 (318)
213 PRK14164 heat shock protein Gr 20.6 24 0.00051 25.9 -0.8 31 30-61 150-180 (218)
214 smart00342 HTH_ARAC helix_turn 20.4 1.6E+02 0.0034 15.9 2.8 16 11-26 7-22 (84)
215 COG1541 PaaK Coenzyme F390 syn 20.3 2.6E+02 0.0056 22.7 4.9 43 15-58 16-58 (438)
216 COG1937 Uncharacterized protei 20.2 1.5E+02 0.0033 18.9 3.0 29 27-55 4-34 (89)
217 PF00886 Ribosomal_S16: Riboso 20.2 49 0.0011 19.5 0.7 15 42-56 47-61 (62)
218 PF13936 HTH_38: Helix-turn-he 20.1 1.5E+02 0.0032 15.8 2.6 16 12-27 2-18 (44)
No 1
>PF13637 Ank_4: Ankyrin repeats (many copies); PDB: 3B95_A 3B7B_A 3F6Q_A 2KBX_A 3IXE_A 2DWZ_C 2DVW_A 3AJI_A 1S70_B 2HE0_A ....
Probab=98.32 E-value=5.2e-07 Score=50.30 Aligned_cols=35 Identities=20% Similarity=0.146 Sum_probs=28.1
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
++.|+..|+.+-+.-|+++|.|||.+| +.|+|||+
T Consensus 5 lh~A~~~g~~~~~~~Ll~~~~din~~d-~~g~t~lh 39 (54)
T PF13637_consen 5 LHWAARSGNLEIVKLLLEHGADINAQD-EDGRTPLH 39 (54)
T ss_dssp HHHHHHTT-HHHHHHHHHTTSGTT-B--TTS--HHH
T ss_pred HHHHHHhCCHHHHHHHHHCCCCCCCCC-CCCCCHHH
Confidence 678999999999999999999999998 79999986
No 2
>PF13857 Ank_5: Ankyrin repeats (many copies); PDB: 1SW6_A 3EHR_B 3EHQ_A.
Probab=97.95 E-value=5.5e-06 Score=47.08 Aligned_cols=36 Identities=19% Similarity=0.173 Sum_probs=25.4
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+..|+..|+.+-|.-||+.|.|||..| ..|.|||.+
T Consensus 20 LH~A~~~g~~~~v~~Ll~~g~d~~~~d-~~G~Tpl~~ 55 (56)
T PF13857_consen 20 LHWAARYGHSEVVRLLLQNGADPNAKD-KDGQTPLHY 55 (56)
T ss_dssp HHHHHHHT-HHHHHHHHHCT--TT----TTS--HHHH
T ss_pred HHHHHHcCcHHHHHHHHHCcCCCCCCc-CCCCCHHHh
Confidence 678999999999999999999999999 699999863
No 3
>PF12796 Ank_2: Ankyrin repeats (3 copies); InterPro: IPR020683 This entry represents the ankyrin repeat-containing domain. These domains contain multiple repeats of a beta(2)-alpha(2) motif. The ankyrin repeat is one of the most common protein-protein interaction motifs in nature. Ankyrin repeats are tandemly repeated modules of about 33 amino acids. They occur in a large number of functionally diverse proteins mainly from eukaryotes. The few known examples from prokaryotes and viruses may be the result of horizontal gene transfers []. The repeat has been found in proteins of diverse function such as transcriptional initiators, cell-cycle regulators, cytoskeletal, ion transporters and signal transducers. The ankyrin fold appears to be defined by its structure rather than its function since there is no specific sequence or structure which is universally recognised by it. The conserved fold of the ankyrin repeat unit is known from several crystal and solution structures [, , , ]. Each repeat folds into a helix-loop-helix structure with a beta-hairpin/loop region projecting out from the helices at a 90o angle. The repeats stack together to form an L-shaped structure [, ].; PDB: 3AAA_C 3F6Q_A 2KBX_A 3IXE_A 3TWR_D 3TWV_A 3TWT_B 3TWQ_A 3TWS_A 3TWX_B ....
Probab=97.74 E-value=4.2e-05 Score=44.96 Aligned_cols=36 Identities=19% Similarity=0.185 Sum_probs=32.3
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
=++-|+..|+.+-+..|+++|.|||..| ..|.|||+
T Consensus 29 ~l~~A~~~~~~~~~~~Ll~~g~~~~~~~-~~g~t~L~ 64 (89)
T PF12796_consen 29 ALHYAAENGNLEIVKLLLENGADINSQD-KNGNTALH 64 (89)
T ss_dssp HHHHHHHTTTHHHHHHHHHTTTCTT-BS-TTSSBHHH
T ss_pred HHHHHHHcCCHHHHHHHHHhcccccccC-CCCCCHHH
Confidence 4778999999999999999999999999 69999996
No 4
>PHA02875 ankyrin repeat protein; Provisional
Probab=97.38 E-value=0.00015 Score=53.09 Aligned_cols=36 Identities=17% Similarity=0.258 Sum_probs=32.2
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
.+.+|++.|+.+-|..|++.|+|||+.+ ..|.|||+
T Consensus 5 ~L~~A~~~g~~~iv~~Ll~~g~~~n~~~-~~g~tpL~ 40 (413)
T PHA02875 5 ALCDAILFGELDIARRLLDIGINPNFEI-YDGISPIK 40 (413)
T ss_pred HHHHHHHhCCHHHHHHHHHCCCCCCccC-CCCCCHHH
Confidence 4678999999999999999999999887 58999986
No 5
>PHA02878 ankyrin repeat protein; Provisional
Probab=97.22 E-value=0.00029 Score=53.12 Aligned_cols=36 Identities=14% Similarity=0.127 Sum_probs=33.3
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+..||..|+.+-|.-|+++|.|||..| ..|.|||++
T Consensus 41 Lh~A~~~g~~e~vk~Ll~~gadvn~~d-~~g~TpLh~ 76 (477)
T PHA02878 41 LHQAVEARNLDVVKSLLTRGHNVNQPD-HRDLTPLHI 76 (477)
T ss_pred HHHHHHcCCHHHHHHHHHCCCCCCCCC-CCCCCHHHH
Confidence 678999999999999999999999999 599999973
No 6
>KOG0818|consensus
Probab=97.10 E-value=0.00053 Score=56.50 Aligned_cols=39 Identities=26% Similarity=0.413 Sum_probs=36.0
Q ss_pred HHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 32 KRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+.+...|+.|+.+.--+||.-|.||||-+|++|.|||++
T Consensus 135 rQLhasvRt~nlet~LRll~lGA~~N~~hpekg~TpLHv 173 (669)
T KOG0818|consen 135 KQLHSSVRTGNLETCLRLLSLGAQANFFHPEKGNTPLHV 173 (669)
T ss_pred HHHHHHhhcccHHHHHHHHHcccccCCCCcccCCchhHH
Confidence 457788999999999999999999999999999999973
No 7
>PHA03095 ankyrin-like protein; Provisional
Probab=97.06 E-value=0.00052 Score=50.60 Aligned_cols=36 Identities=17% Similarity=0.057 Sum_probs=31.8
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+..|+..|+.+-|..||+.|.|||..| ..|+|||++
T Consensus 261 Lh~A~~~~~~~~v~~LL~~gad~n~~~-~~g~tpl~~ 296 (471)
T PHA03095 261 LHYAAVFNNPRACRRLIALGADINAVS-SDGNTPLSL 296 (471)
T ss_pred HHHHHHcCCHHHHHHHHHcCCCCcccC-CCCCCHHHH
Confidence 567888999999999999999999998 599999863
No 8
>PHA02798 ankyrin-like protein; Provisional
Probab=97.02 E-value=0.00052 Score=52.16 Aligned_cols=36 Identities=19% Similarity=0.155 Sum_probs=32.1
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+.-|+..|+.+-+.-||++|.|||..| ..|+|||++
T Consensus 262 L~~A~~~~~~~~v~~LL~~GAdin~~d-~~G~TpL~~ 297 (489)
T PHA02798 262 LYYSVSHNNRKIFEYLLQLGGDINIIT-ELGNTCLFT 297 (489)
T ss_pred HHHHHHcCcHHHHHHHHHcCCcccccC-CCCCcHHHH
Confidence 556888999999999999999999999 699999963
No 9
>PHA02989 ankyrin repeat protein; Provisional
Probab=96.86 E-value=0.00086 Score=51.04 Aligned_cols=35 Identities=20% Similarity=0.108 Sum_probs=31.9
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|++.|+.+.|.-||++|.|||..| ..|+|||+
T Consensus 260 L~~Aa~~~~~~~v~~LL~~Gadin~~d-~~G~TpL~ 294 (494)
T PHA02989 260 LLISAKVDNYEAFNYLLKLGDDIYNVS-KDGDTVLT 294 (494)
T ss_pred HHHHHHhcCHHHHHHHHHcCCCccccC-CCCCCHHH
Confidence 567889999999999999999999998 69999996
No 10
>PHA02875 ankyrin repeat protein; Provisional
Probab=96.80 E-value=0.0013 Score=48.30 Aligned_cols=35 Identities=14% Similarity=0.078 Sum_probs=27.3
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..|+.+-|.-|+++|.|||..+ ..|.|||+
T Consensus 106 L~~A~~~~~~~iv~~Ll~~gad~~~~~-~~g~tpLh 140 (413)
T PHA02875 106 LHLATILKKLDIMKLLIARGADPDIPN-TDKFSPLH 140 (413)
T ss_pred HHHHHHhCCHHHHHHHHhCCCCCCCCC-CCCCCHHH
Confidence 556777888888888888888888877 47888875
No 11
>KOG0514|consensus
Probab=96.78 E-value=0.0017 Score=51.79 Aligned_cols=54 Identities=17% Similarity=0.257 Sum_probs=44.9
Q ss_pred hhhhccCCCHHHHHHHhhHhHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 10 FISLLLNLDEKQLKALHTRSNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 10 rvy~~~~l~ekql~klhtk~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
|+|++=+++-|. +++..--+|=||.+|.++-|.-||+-|.|-|.+| +.|-|+||
T Consensus 325 ~LF~mgnVNaKA-----sQ~gQTALMLAVSHGr~d~vk~LLacgAdVNiQD-dDGSTALM 378 (452)
T KOG0514|consen 325 RLFKMGDVNAKA-----SQHGQTALMLAVSHGRVDMVKALLACGADVNIQD-DDGSTALM 378 (452)
T ss_pred HHHhccCcchhh-----hhhcchhhhhhhhcCcHHHHHHHHHccCCCcccc-CCccHHHh
Confidence 556666666554 3445567899999999999999999999999999 79999998
No 12
>TIGR00870 trp transient-receptor-potential calcium channel protein. after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Probab=96.77 E-value=0.0012 Score=52.70 Aligned_cols=39 Identities=15% Similarity=0.170 Sum_probs=35.9
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhc--CCCCCCCCCCCCccccc
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAK--GLDPNFHCPETGEMVAL 69 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~k--GlDPN~~d~~~GeTpL~ 69 (70)
.-+.|+.||+.|+.+.|.+++++ |+|+|..| ..|.|||+
T Consensus 17 ~~~~~l~A~~~g~~~~v~~lL~~~~~~~in~~d-~~G~t~Lh 57 (743)
T TIGR00870 17 EEKAFLPAAERGDLASVYRDLEEPKKLNINCPD-RLGRSALF 57 (743)
T ss_pred HHHHHHHHHHcCCHHHHHHHhccccccCCCCcC-ccchhHHH
Confidence 35789999999999999999999 99999999 59999997
No 13
>PHA03100 ankyrin repeat protein; Provisional
Probab=96.71 E-value=0.0015 Score=48.36 Aligned_cols=35 Identities=23% Similarity=0.212 Sum_probs=24.0
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..++.+-|.-|+++|.|||..| ..|+|||.
T Consensus 254 L~~A~~~~~~~iv~~Ll~~gad~n~~d-~~g~tpl~ 288 (480)
T PHA03100 254 LHYAVYNNNPEFVKYLLDLGANPNLVN-KYGDTPLH 288 (480)
T ss_pred HHHHHHcCCHHHHHHHHHcCCCCCccC-CCCCcHHH
Confidence 455666777777777777777777776 47777764
No 14
>PLN03192 Voltage-dependent potassium channel; Provisional
Probab=96.70 E-value=0.0015 Score=53.41 Aligned_cols=38 Identities=18% Similarity=0.274 Sum_probs=34.1
Q ss_pred HHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 32 KRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
..++.|+..|+.+-+..|++.|.|||..| ..|.|||++
T Consensus 527 ~~L~~Aa~~g~~~~l~~Ll~~G~d~n~~d-~~G~TpLh~ 564 (823)
T PLN03192 527 SNLLTVASTGNAALLEELLKAKLDPDIGD-SKGRTPLHI 564 (823)
T ss_pred hHHHHHHHcCCHHHHHHHHHCCCCCCCCC-CCCCCHHHH
Confidence 34788999999999999999999999999 599999973
No 15
>PHA02743 Viral ankyrin protein; Provisional
Probab=96.68 E-value=0.0014 Score=43.65 Aligned_cols=36 Identities=19% Similarity=0.061 Sum_probs=25.1
Q ss_pred HHHHHHcCCHHH---HHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEK---IAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~k---V~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..|..+. |.-|++.|.|+|.+|...|+|||+
T Consensus 61 Lh~Aa~~g~~~~~~~i~~Ll~~Gadin~~d~~~g~TpLh 99 (166)
T PHA02743 61 THMVAWYDRANAVMKIELLVNMGADINARELGTGNTLLH 99 (166)
T ss_pred HHHHHHhCccCHHHHHHHHHHcCCCCCCCCCCCCCcHHH
Confidence 556666665443 566778888888877557888876
No 16
>PHA02874 ankyrin repeat protein; Provisional
Probab=96.65 E-value=0.0018 Score=48.20 Aligned_cols=35 Identities=20% Similarity=0.215 Sum_probs=31.2
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..|+.+-|.-|+++|.|||..| ..|+|||+
T Consensus 128 Lh~A~~~~~~~~v~~Ll~~gad~n~~d-~~g~tpLh 162 (434)
T PHA02874 128 LHYAIKKGDLESIKMLFEYGADVNIED-DNGCYPIH 162 (434)
T ss_pred HHHHHHCCCHHHHHHHHhCCCCCCCcC-CCCCCHHH
Confidence 677899999999999999999999988 58999986
No 17
>PF13606 Ank_3: Ankyrin repeat
Probab=96.55 E-value=0.0043 Score=31.83 Aligned_cols=25 Identities=24% Similarity=0.256 Sum_probs=22.6
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNF 58 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~ 58 (70)
+..|+..|+.+-|.-||+.|.|||.
T Consensus 6 Lh~A~~~g~~e~v~~Ll~~gadvn~ 30 (30)
T PF13606_consen 6 LHLAASNGNIEIVKYLLEHGADVNA 30 (30)
T ss_pred HHHHHHhCCHHHHHHHHHcCCCCCC
Confidence 5678999999999999999999983
No 18
>PHA03095 ankyrin-like protein; Provisional
Probab=96.51 E-value=0.0027 Score=46.83 Aligned_cols=34 Identities=35% Similarity=0.425 Sum_probs=17.9
Q ss_pred HHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 35 LEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 35 ~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+.+...++++.|..|++.|.|+|..+ ..|+|||+
T Consensus 19 ~~~~~~~~~~~v~~Ll~~ga~vn~~~-~~g~t~Lh 52 (471)
T PHA03095 19 LLNASNVTVEEVRRLLAAGADVNFRG-EYGKTPLH 52 (471)
T ss_pred HHcCCCCCHHHHHHHHHcCCCcccCC-CCCCCHHH
Confidence 34444555555555555555555555 35555553
No 19
>PF00023 Ank: Ankyrin repeat Hereditary spherocytosis; InterPro: IPR002110 The ankyrin repeat is one of the most common protein-protein interaction motifs in nature. Ankyrin repeats are tandemly repeated modules of about 33 amino acids. They occur in a large number of functionally diverse proteins mainly from eukaryotes. The few known examples from prokaryotes and viruses may be the result of horizontal gene transfers []. The repeat has been found in proteins of diverse function such as transcriptional initiators, cell-cycle regulators, cytoskeletal, ion transporters and signal transducers. The ankyrin fold appears to be defined by its structure rather than its function since there is no specific sequence or structure which is universally recognised by it. The conserved fold of the ankyrin repeat unit is known from several crystal and solution structures [, , , ]. Each repeat folds into a helix-loop-helix structure with a beta-hairpin/loop region projecting out from the helices at a 90o angle. The repeats stack together to form an L-shaped structure [, ].; GO: 0005515 protein binding; PDB: 1D9S_A 1NFI_F 1IKN_D 1WDY_A 1OT8_C 1QYM_A 1TR4_A 1UOH_A 1N11_A 1K1A_A ....
Probab=96.50 E-value=0.0064 Score=30.90 Aligned_cols=27 Identities=15% Similarity=0.081 Sum_probs=24.8
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHC 60 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d 60 (70)
+..|+..|+.+-|.-|+++|.|||.+|
T Consensus 6 Lh~A~~~~~~~~v~~Ll~~ga~~~~~d 32 (33)
T PF00023_consen 6 LHYAAQRGHPDIVKLLLKHGADINARD 32 (33)
T ss_dssp HHHHHHTTCHHHHHHHHHTTSCTTCBC
T ss_pred HHHHHHHHHHHHHHHHHHCcCCCCCCC
Confidence 467999999999999999999999876
No 20
>PHA02736 Viral ankyrin protein; Provisional
Probab=96.43 E-value=0.0033 Score=40.52 Aligned_cols=35 Identities=14% Similarity=0.056 Sum_probs=27.5
Q ss_pred HHHHHHcCCHHHHHHHHh-cCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCA-KGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~-kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..++.+-+.-||. .|.|+|..| ..|+|||+
T Consensus 96 Lh~A~~~~~~~i~~~Ll~~~g~d~n~~~-~~g~tpL~ 131 (154)
T PHA02736 96 LHIAVYTQNYELATWLCNQPGVNMEILN-YAFKTPYY 131 (154)
T ss_pred HHHHHHhCCHHHHHHHHhCCCCCCcccc-CCCCCHHH
Confidence 566778888888877886 488888888 58888875
No 21
>PHA02878 ankyrin repeat protein; Provisional
Probab=96.39 E-value=0.0031 Score=47.60 Aligned_cols=35 Identities=14% Similarity=0.017 Sum_probs=25.5
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..|+.+-|.-|+++|.|||..| ..|.|||+
T Consensus 172 Lh~A~~~~~~~iv~~Ll~~gad~n~~d-~~g~tpLh 206 (477)
T PHA02878 172 LHYATENKDQRLTELLLSYGANVNIPD-KTNNSPLH 206 (477)
T ss_pred HHHHHhCCCHHHHHHHHHCCCCCCCcC-CCCCCHHH
Confidence 456677777777777777777777777 47777775
No 22
>PLN03192 Voltage-dependent potassium channel; Provisional
Probab=96.36 E-value=0.0032 Score=51.45 Aligned_cols=36 Identities=11% Similarity=0.222 Sum_probs=33.0
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+..|+..|+.+-+..|++.|.|||..| ..|+|||++
T Consensus 626 L~~Aa~~g~~~~v~~Ll~~Gadin~~d-~~G~TpLh~ 661 (823)
T PLN03192 626 LCTAAKRNDLTAMKELLKQGLNVDSED-HQGATALQV 661 (823)
T ss_pred HHHHHHhCCHHHHHHHHHCCCCCCCCC-CCCCCHHHH
Confidence 678999999999999999999999999 599999973
No 23
>PTZ00322 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
Probab=96.31 E-value=0.0039 Score=50.37 Aligned_cols=37 Identities=19% Similarity=0.192 Sum_probs=32.2
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
.++.++..|+.+.|.-|++.|.|+|..| ..|.|||++
T Consensus 85 ~L~~aa~~G~~~~vk~LL~~Gadin~~d-~~G~TpLh~ 121 (664)
T PTZ00322 85 ELCQLAASGDAVGARILLTGGADPNCRD-YDGRTPLHI 121 (664)
T ss_pred HHHHHHHcCCHHHHHHHHHCCCCCCCcC-CCCCcHHHH
Confidence 4888999999999999999999999988 589999863
No 24
>PHA02884 ankyrin repeat protein; Provisional
Probab=96.28 E-value=0.0041 Score=46.62 Aligned_cols=35 Identities=14% Similarity=0.199 Sum_probs=17.4
Q ss_pred HHHHHcCCHHHHHHHHhcCCCCCCCCC---CCCccccc
Q psy15934 35 LEYVNNSQVEKIAKLCAKGLDPNFHCP---ETGEMVAL 69 (70)
Q Consensus 35 ~~aV~~~~~~kV~kLL~kGlDPN~~d~---~~GeTpL~ 69 (70)
..|+..|+.+-+.-|++.|+|||..|+ ..|.|||+
T Consensus 38 ~~A~~~~~~eivk~LL~~GAdiN~~~~~sd~~g~TpLh 75 (300)
T PHA02884 38 YSSIKFHYTDIIDAILKLGADPEAPFPLSENSKTNPLI 75 (300)
T ss_pred HHHHHcCCHHHHHHHHHCCCCccccCcccCCCCCCHHH
Confidence 344445555555555555555555431 24555554
No 25
>PHA02791 ankyrin-like protein; Provisional
Probab=96.23 E-value=0.0046 Score=45.50 Aligned_cols=35 Identities=11% Similarity=0.148 Sum_probs=27.1
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..|+.+-|.-|++.|.|+|..| ..|+|||+
T Consensus 65 Lh~Aa~~g~~eiV~lLL~~Gadvn~~d-~~G~TpLh 99 (284)
T PHA02791 65 LHQAATLEDTKIVKILLFSGMDDSQFD-DKGNTALY 99 (284)
T ss_pred HHHHHHCCCHHHHHHHHHCCCCCCCCC-CCCCCHHH
Confidence 566777888877777788888888777 47888876
No 26
>PHA02946 ankyin-like protein; Provisional
Probab=96.23 E-value=0.0054 Score=47.10 Aligned_cols=35 Identities=17% Similarity=0.095 Sum_probs=22.7
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+.-|+..|+.+-|.-||+.|.|||..| ..|.|||+
T Consensus 76 Lh~Aa~~g~~eiv~lLL~~GAdin~~d-~~g~TpLh 110 (446)
T PHA02946 76 LHIASKINNNRIVAMLLTHGADPNACD-KQHKTPLY 110 (446)
T ss_pred HHHHHHcCCHHHHHHHHHCcCCCCCCC-CCCCCHHH
Confidence 445666666666666667777777666 46677664
No 27
>PHA02795 ankyrin-like protein; Provisional
Probab=96.22 E-value=0.0043 Score=49.07 Aligned_cols=36 Identities=11% Similarity=0.134 Sum_probs=32.3
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+..|+..|+.+-|.-|++.|.|+|..| ..|.|||++
T Consensus 225 Lh~Aa~~g~~eiVelLL~~GAdIN~~d-~~G~TpLh~ 260 (437)
T PHA02795 225 LYRAIYAGYIDLVSWLLENGANVNAVM-SNGYTCLDV 260 (437)
T ss_pred HHHHHHcCCHHHHHHHHHCCCCCCCcC-CCCCCHHHH
Confidence 678889999999999999999999999 599999963
No 28
>PHA03100 ankyrin repeat protein; Provisional
Probab=96.21 E-value=0.0046 Score=45.85 Aligned_cols=36 Identities=14% Similarity=0.006 Sum_probs=32.1
Q ss_pred HHHHHHHcCC--HHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 33 RFLEYVNNSQ--VEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 33 df~~aV~~~~--~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
-+..|+..++ .+-|.-|++.|.|+|..| ..|+|||+
T Consensus 218 ~l~~a~~~~~~~~~iv~~Ll~~g~din~~d-~~g~TpL~ 255 (480)
T PHA03100 218 PLHIAACYNEITLEVVNYLLSYGVPINIKD-VYGFTPLH 255 (480)
T ss_pred HHHHHHHhCcCcHHHHHHHHHcCCCCCCCC-CCCCCHHH
Confidence 3567888888 999999999999999999 69999996
No 29
>PHA02874 ankyrin repeat protein; Provisional
Probab=96.18 E-value=0.006 Score=45.45 Aligned_cols=35 Identities=11% Similarity=0.136 Sum_probs=32.4
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|++.|+.+-|.-|++.|.|+|..+ ..|.|||+
T Consensus 39 L~~A~~~g~~~iv~~Ll~~Ga~~n~~~-~~~~t~L~ 73 (434)
T PHA02874 39 LIDAIRSGDAKIVELFIKHGADINHIN-TKIPHPLL 73 (434)
T ss_pred HHHHHHcCCHHHHHHHHHCCCCCCCCC-CCCCCHHH
Confidence 788999999999999999999999888 58999986
No 30
>PF13857 Ank_5: Ankyrin repeats (many copies); PDB: 1SW6_A 3EHR_B 3EHQ_A.
Probab=96.11 E-value=0.001 Score=37.60 Aligned_cols=21 Identities=24% Similarity=0.167 Sum_probs=7.7
Q ss_pred HHhcC-CCCCCCCCCCCcccccC
Q psy15934 49 LCAKG-LDPNFHCPETGEMVALI 70 (70)
Q Consensus 49 LL~kG-lDPN~~d~~~GeTpL~L 70 (70)
||+.| .|+|..| ..|+|||++
T Consensus 1 LL~~~~~~~n~~d-~~G~T~LH~ 22 (56)
T PF13857_consen 1 LLEHGPADVNAQD-KYGNTPLHW 22 (56)
T ss_dssp -----T--TT----TTS--HHHH
T ss_pred CCccCcCCCcCcC-CCCCcHHHH
Confidence 68889 9999999 699999973
No 31
>PHA02741 hypothetical protein; Provisional
Probab=95.99 E-value=0.0081 Score=39.77 Aligned_cols=35 Identities=14% Similarity=0.167 Sum_probs=21.1
Q ss_pred HHHHHHcCCHHHHHHHHh-cCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCA-KGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~-kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..++.+-+.-||. .|.|+|..| ..|+|||+
T Consensus 102 Lh~A~~~~~~~iv~~Ll~~~g~~~~~~n-~~g~tpL~ 137 (169)
T PHA02741 102 LHLAAHRRDHDLAEWLCCQPGIDLHFCN-ADNKSPFE 137 (169)
T ss_pred HHHHHHcCCHHHHHHHHhCCCCCCCcCC-CCCCCHHH
Confidence 455666666665555554 366666665 46666664
No 32
>PHA02743 Viral ankyrin protein; Provisional
Probab=95.97 E-value=0.0076 Score=40.10 Aligned_cols=36 Identities=17% Similarity=0.074 Sum_probs=31.1
Q ss_pred HHHHHHcCCHHHHHHHHh-cCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCA-KGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~-kGlDPN~~d~~~GeTpL~L 70 (70)
|..|+..++.+-+.-||. .|.|+|..| ..|+|||++
T Consensus 98 Lh~A~~~g~~~iv~~Ll~~~gad~~~~d-~~g~tpL~~ 134 (166)
T PHA02743 98 LHIAASTKNYELAEWLCRQLGVNLGAIN-YQHETAYHI 134 (166)
T ss_pred HHHHHHhCCHHHHHHHHhccCCCccCcC-CCCCCHHHH
Confidence 677899999988888885 799999998 699999963
No 33
>PHA02859 ankyrin repeat protein; Provisional
Probab=95.94 E-value=0.0085 Score=41.31 Aligned_cols=28 Identities=25% Similarity=0.277 Sum_probs=15.9
Q ss_pred CHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 42 QVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 42 ~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+.+-|.-|+++|.|+|..+...|.|||+
T Consensus 65 ~~eiv~~Ll~~gadvn~~~~~~g~TpLh 92 (209)
T PHA02859 65 NVEILKFLIENGADVNFKTRDNNLSALH 92 (209)
T ss_pred CHHHHHHHHHCCCCCCccCCCCCCCHHH
Confidence 5555555566666666554335666654
No 34
>PHA02884 ankyrin repeat protein; Provisional
Probab=95.89 E-value=0.0083 Score=44.97 Aligned_cols=35 Identities=14% Similarity=0.021 Sum_probs=18.4
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..++.+-+.-|+..|.|||..| ..|.|||+
T Consensus 108 Lh~Aa~~~~~eivklLL~~GAdin~kd-~~G~TpL~ 142 (300)
T PHA02884 108 LYISVLHGCLKCLEILLSYGADINIQT-NDMVTPIE 142 (300)
T ss_pred HHHHHHcCCHHHHHHHHHCCCCCCCCC-CCCCCHHH
Confidence 344455555555555555555555555 35555553
No 35
>PHA02741 hypothetical protein; Provisional
Probab=95.80 E-value=0.011 Score=39.13 Aligned_cols=37 Identities=19% Similarity=0.134 Sum_probs=29.5
Q ss_pred HHHHHHcCCH----HHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQV----EKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~----~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+.-|+..|.. +-+.-|++.|.|+|..|...|+|||++
T Consensus 64 Lh~A~~~g~~~~~~~ii~~Ll~~gadin~~~~~~g~TpLh~ 104 (169)
T PHA02741 64 IHIAAEKHEAQLAAEIIDHLIELGADINAQEMLEGDTALHL 104 (169)
T ss_pred HHHHHHcCChHHHHHHHHHHHHcCCCCCCCCcCCCCCHHHH
Confidence 6678888873 556677899999999885589999973
No 36
>KOG4412|consensus
Probab=95.80 E-value=0.0087 Score=44.32 Aligned_cols=36 Identities=19% Similarity=0.202 Sum_probs=33.1
Q ss_pred HHHHHHcCCHHHHHHHHhc-CCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAK-GLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~k-GlDPN~~d~~~GeTpL~L 70 (70)
|.-|+..|+.+-|..|+.| |.|||... ..|.|+|++
T Consensus 76 lhia~s~g~~evVk~Ll~r~~advna~t-n~G~T~LHy 112 (226)
T KOG4412|consen 76 LHIAASNGNDEVVKELLNRSGADVNATT-NGGQTCLHY 112 (226)
T ss_pred hhhhhhcCcHHHHHHHhcCCCCCcceec-CCCcceehh
Confidence 7888999999999999999 99999988 699999974
No 37
>PHA02730 ankyrin-like protein; Provisional
Probab=95.62 E-value=0.013 Score=48.60 Aligned_cols=35 Identities=17% Similarity=0.237 Sum_probs=31.0
Q ss_pred HHHHHHcC---CHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNS---QVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~---~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
++.|+..+ +.+-|.-||+.|+|||..| ..|+|||+
T Consensus 45 Lh~A~~~~~~~~~eivklLLs~GAdin~kD-~~G~TPLh 82 (672)
T PHA02730 45 LHCYVSNKCDTDIKIVRLLLSRGVERLCRN-NEGLTPLG 82 (672)
T ss_pred HHHHHHcCCcCcHHHHHHHHhCCCCCcccC-CCCCChHH
Confidence 77888886 4888888999999999999 69999996
No 38
>PF12796 Ank_2: Ankyrin repeats (3 copies); InterPro: IPR020683 This entry represents the ankyrin repeat-containing domain. These domains contain multiple repeats of a beta(2)-alpha(2) motif. The ankyrin repeat is one of the most common protein-protein interaction motifs in nature. Ankyrin repeats are tandemly repeated modules of about 33 amino acids. They occur in a large number of functionally diverse proteins mainly from eukaryotes. The few known examples from prokaryotes and viruses may be the result of horizontal gene transfers []. The repeat has been found in proteins of diverse function such as transcriptional initiators, cell-cycle regulators, cytoskeletal, ion transporters and signal transducers. The ankyrin fold appears to be defined by its structure rather than its function since there is no specific sequence or structure which is universally recognised by it. The conserved fold of the ankyrin repeat unit is known from several crystal and solution structures [, , , ]. Each repeat folds into a helix-loop-helix structure with a beta-hairpin/loop region projecting out from the helices at a 90o angle. The repeats stack together to form an L-shaped structure [, ].; PDB: 3AAA_C 3F6Q_A 2KBX_A 3IXE_A 3TWR_D 3TWV_A 3TWT_B 3TWQ_A 3TWS_A 3TWX_B ....
Probab=95.55 E-value=0.019 Score=33.49 Aligned_cols=31 Identities=26% Similarity=0.283 Sum_probs=27.5
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
++.|++.|+.+-+..|++.|.+.|. |+|||+
T Consensus 1 L~~A~~~~~~~~~~~ll~~~~~~~~-----~~~~l~ 31 (89)
T PF12796_consen 1 LHIAAQNGNLEILKFLLEKGADINL-----GNTALH 31 (89)
T ss_dssp HHHHHHTTTHHHHHHHHHTTSTTTS-----SSBHHH
T ss_pred CHHHHHcCCHHHHHHHHHCcCCCCC-----CCCHHH
Confidence 3679999999999999999988886 888886
No 39
>PHA02736 Viral ankyrin protein; Provisional
Probab=95.53 E-value=0.0079 Score=38.75 Aligned_cols=34 Identities=26% Similarity=0.142 Sum_probs=25.4
Q ss_pred HHHcCCH---HHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 37 YVNNSQV---EKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 37 aV~~~~~---~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
|+..++. +.+.-|++.|.|+|..|...|+|||++
T Consensus 62 a~~~~~~~~~e~v~~Ll~~gadin~~~~~~g~T~Lh~ 98 (154)
T PHA02736 62 VSNPDKADPQEKLKLLMEWGADINGKERVFGNTPLHI 98 (154)
T ss_pred ecccCchhHHHHHHHHHHcCCCccccCCCCCCcHHHH
Confidence 4555554 346678899999999885589999973
No 40
>PHA02798 ankyrin-like protein; Provisional
Probab=95.52 E-value=0.031 Score=42.58 Aligned_cols=35 Identities=11% Similarity=0.086 Sum_probs=26.6
Q ss_pred HHHHHHcC--CHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNS--QVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~--~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
|..|.... +.+-|.-|++.|.|+|..| ..|+|||+
T Consensus 40 ~~~yl~~~~~~~~iv~~Ll~~Gadvn~~d-~~g~TpL~ 76 (489)
T PHA02798 40 FQKYLQRDSPSTDIVKLFINLGANVNGLD-NEYSTPLC 76 (489)
T ss_pred HHHHHhCCCCCHHHHHHHHHCCCCCCCCC-CCCCChHH
Confidence 33455544 6777888889999999888 68999985
No 41
>PHA02859 ankyrin repeat protein; Provisional
Probab=95.49 E-value=0.012 Score=40.50 Aligned_cols=29 Identities=21% Similarity=0.107 Sum_probs=16.1
Q ss_pred cCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 40 NSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 40 ~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
.++.+-|.-|++.|.|||..| ..|.|||.
T Consensus 169 ~~~~~iv~~Ll~~Gadi~~~d-~~g~tpl~ 197 (209)
T PHA02859 169 HSDKKIFDFLTSLGIDINETN-KSGYNCYD 197 (209)
T ss_pred cCCHHHHHHHHHcCCCCCCCC-CCCCCHHH
Confidence 445544555555666666555 46666654
No 42
>PHA02876 ankyrin repeat protein; Provisional
Probab=95.48 E-value=0.014 Score=46.16 Aligned_cols=35 Identities=14% Similarity=-0.033 Sum_probs=27.0
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..+++.++.+-+.-|+++|.|+|..| ..|.|||+
T Consensus 149 l~~~i~~~~~~i~k~Ll~~Gadvn~~d-~~G~TpLh 183 (682)
T PHA02876 149 IKERIQQDELLIAEMLLEGGADVNAKD-IYCITPIH 183 (682)
T ss_pred HHHHHHCCcHHHHHHHHhCCCCCCCCC-CCCCCHHH
Confidence 456667788877777788888888887 47888886
No 43
>KOG0509|consensus
Probab=95.47 E-value=0.011 Score=48.82 Aligned_cols=34 Identities=18% Similarity=0.241 Sum_probs=27.8
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+|.++|++.-|.-|++-|+||+..| .+|.+||++
T Consensus 118 WAar~G~~~vv~lLlqhGAdpt~~D-~~G~~~lHl 151 (600)
T KOG0509|consen 118 WAARNGHISVVDLLLQHGADPTLKD-KQGLTPLHL 151 (600)
T ss_pred HHHHcCcHHHHHHHHHcCCCCceec-CCCCcHHHH
Confidence 6778888888888888888888888 688888764
No 44
>COG0666 Arp FOG: Ankyrin repeat [General function prediction only]
Probab=95.44 E-value=0.019 Score=36.08 Aligned_cols=39 Identities=15% Similarity=0.101 Sum_probs=33.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
...++.+...++.+.+..++..|+|||..| ..|.|||++
T Consensus 74 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~g~t~l~~ 112 (235)
T COG0666 74 RLPLHSAASKGDDKIVKLLLASGADVNAKD-ADGDTPLHL 112 (235)
T ss_pred cCHHHHHHHcCcHHHHHHHHHcCCCccccc-CCCCcHHHH
Confidence 455778888899988899999999999999 699999973
No 45
>PHA02946 ankyin-like protein; Provisional
Probab=95.37 E-value=0.011 Score=45.38 Aligned_cols=32 Identities=19% Similarity=0.285 Sum_probs=27.0
Q ss_pred HHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 38 VNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 38 V~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+...+.+-|.-|+++|.|||..| ..|.|||++
T Consensus 47 ~~~~~~~iv~~Ll~~Gadvn~~d-~~G~TpLh~ 78 (446)
T PHA02946 47 IKGLDERFVEELLHRGYSPNETD-DDGNYPLHI 78 (446)
T ss_pred hcCCCHHHHHHHHHCcCCCCccC-CCCCCHHHH
Confidence 34556778899999999999999 599999973
No 46
>PTZ00322 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
Probab=95.35 E-value=0.016 Score=46.88 Aligned_cols=36 Identities=19% Similarity=0.226 Sum_probs=33.1
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+..|+..|+.+-|.-||+.|.|||..| ..|.|||++
T Consensus 119 Lh~Aa~~g~~eiv~~LL~~Gadvn~~d-~~G~TpLh~ 154 (664)
T PTZ00322 119 LHIACANGHVQVVRVLLEFGADPTLLD-KDGKTPLEL 154 (664)
T ss_pred HHHHHHCCCHHHHHHHHHCCCCCCCCC-CCCCCHHHH
Confidence 778999999999999999999999999 699999973
No 47
>PHA02716 CPXV016; CPX019; EVM010; Provisional
Probab=95.22 E-value=0.022 Score=47.86 Aligned_cols=38 Identities=11% Similarity=0.284 Sum_probs=31.9
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+.|++.|+.+-|..|++.|.|+|..| ..|.|||+
T Consensus 285 L~~~i~AA~~g~leiVklLLe~GAdIN~kD-~~G~TPLH 322 (764)
T PHA02716 285 LHSYITLARNIDISVVYSFLQPGVKLHYKD-SAGRTCLH 322 (764)
T ss_pred hHHHHHHHHcCCHHHHHHHHhCCCceeccC-CCCCCHHH
Confidence 445677888899999999999999999888 58999986
No 48
>cd00204 ANK ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Probab=95.12 E-value=0.038 Score=31.92 Aligned_cols=35 Identities=23% Similarity=0.206 Sum_probs=18.7
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..++.+-+..|+..|.++|..+ ..|.||++
T Consensus 44 l~~a~~~~~~~~~~~ll~~~~~~~~~~-~~~~~~l~ 78 (126)
T cd00204 44 LHLAAKNGHLEIVKLLLEKGADVNARD-KDGNTPLH 78 (126)
T ss_pred HHHHHHcCCHHHHHHHHHcCCCccccC-CCCCCHHH
Confidence 445555555555555555555555555 35555543
No 49
>PHA02989 ankyrin repeat protein; Provisional
Probab=95.02 E-value=0.015 Score=44.28 Aligned_cols=31 Identities=16% Similarity=0.216 Sum_probs=22.6
Q ss_pred HHcCCHHHHHHHHhcCCCC-CCCCCCCCccccc
Q psy15934 38 VNNSQVEKIAKLCAKGLDP-NFHCPETGEMVAL 69 (70)
Q Consensus 38 V~~~~~~kV~kLL~kGlDP-N~~d~~~GeTpL~ 69 (70)
...++.+-|.-|+++|+|| |..| ..|.|||+
T Consensus 119 ~~~~~~eiv~~Ll~~Gadin~~~d-~~g~tpLh 150 (494)
T PHA02989 119 SNINNCDMLRFLLSKGINVNDVKN-SRGYNLLH 150 (494)
T ss_pred cccCcHHHHHHHHHCCCCcccccC-CCCCCHHH
Confidence 3446777777778888888 6666 57888875
No 50
>COG0666 Arp FOG: Ankyrin repeat [General function prediction only]
Probab=94.85 E-value=0.046 Score=34.28 Aligned_cols=35 Identities=20% Similarity=0.110 Sum_probs=29.2
Q ss_pred HHHHHHcCC-----HHHHHHHHhcCC---CCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQ-----VEKIAKLCAKGL---DPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~-----~~kV~kLL~kGl---DPN~~d~~~GeTpL~ 69 (70)
+..|+..++ .+-+..||+.|. +++-.| +.|.|||+
T Consensus 110 l~~a~~~~~~~~~~~~~~~~ll~~g~~~~~~~~~~-~~g~tpl~ 152 (235)
T COG0666 110 LHLAALNGNPPEGNIEVAKLLLEAGADLDVNNLRD-EDGNTPLH 152 (235)
T ss_pred HHHHHhcCCcccchHHHHHHHHHcCCCCCCccccC-CCCCchhH
Confidence 667888888 999999999999 555447 79999996
No 51
>PHA02876 ankyrin repeat protein; Provisional
Probab=94.81 E-value=0.03 Score=44.29 Aligned_cols=36 Identities=14% Similarity=0.018 Sum_probs=32.7
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
=+.-|++.|+.+-|.-|+++|.|+|..+ ..|.|||+
T Consensus 181 pLh~Aa~~G~~~iv~~LL~~Gad~n~~~-~~g~t~L~ 216 (682)
T PHA02876 181 PIHYAAERGNAKMVNLLLSYGADVNIIA-LDDLSVLE 216 (682)
T ss_pred HHHHHHHCCCHHHHHHHHHCCCCcCccC-CCCCCHHH
Confidence 3778999999999999999999999998 69999986
No 52
>PHA02716 CPXV016; CPX019; EVM010; Provisional
Probab=94.74 E-value=0.034 Score=46.72 Aligned_cols=35 Identities=20% Similarity=0.342 Sum_probs=22.0
Q ss_pred HHHHHHcCC--HHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQ--VEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~--~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|+..|+ .+-|.-|++.|.|+|..| ..|.|||+
T Consensus 216 LH~Aa~~g~~~~eIVklLLe~GADVN~kD-~~G~TPLh 252 (764)
T PHA02716 216 LHTYLITGNVCASVIKKIIELGGDMDMKC-VNGMSPIM 252 (764)
T ss_pred HHHHHHcCCCCHHHHHHHHHcCCCCCCCC-CCCCCHHH
Confidence 344566663 355666667777777776 46777764
No 53
>PHA02791 ankyrin-like protein; Provisional
Probab=94.59 E-value=0.041 Score=40.54 Aligned_cols=36 Identities=11% Similarity=-0.079 Sum_probs=27.5
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccc-cc
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMV-AL 69 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTp-L~ 69 (70)
=+.-|++.|+.+-|.-||+.|.|||..| ..|.|| |+
T Consensus 163 pLh~Aa~~g~~eiv~lLL~~gAd~n~~d-~~g~t~~L~ 199 (284)
T PHA02791 163 CIHITIKNGHVDMMILLLDYMTSTNTNN-SLLFIPDIK 199 (284)
T ss_pred HHHHHHHcCCHHHHHHHHHCCCCCCccc-CCCCChHHH
Confidence 3677888888888888888888888877 467766 64
No 54
>PHA02917 ankyrin-like protein; Provisional
Probab=94.23 E-value=0.044 Score=44.68 Aligned_cols=34 Identities=26% Similarity=0.349 Sum_probs=30.3
Q ss_pred HHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 35 LEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 35 ~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+.|...++.+-|.-|+++|.|||..| ..|.|||+
T Consensus 108 ~~a~~~~~~e~vk~Ll~~Gadin~~d-~~g~T~L~ 141 (661)
T PHA02917 108 YMKSKNVDVDLIKVLVEHGFDLSVKC-ENHRSVIE 141 (661)
T ss_pred HHHhhcCCHHHHHHHHHcCCCCCccC-CCCccHHH
Confidence 45778899999999999999999998 69999995
No 55
>smart00248 ANK ankyrin repeats. Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.
Probab=93.25 E-value=0.25 Score=21.15 Aligned_cols=24 Identities=29% Similarity=0.307 Sum_probs=21.1
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPN 57 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN 57 (70)
|..|+..++.+-+..|++.|.+||
T Consensus 6 l~~~~~~~~~~~~~~ll~~~~~~~ 29 (30)
T smart00248 6 LHLAAENGNLEVVKLLLDKGADIN 29 (30)
T ss_pred HHHHHHcCCHHHHHHHHHcCCCCC
Confidence 566888899999999999999887
No 56
>cd00204 ANK ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Probab=93.20 E-value=0.15 Score=29.32 Aligned_cols=35 Identities=17% Similarity=0.158 Sum_probs=20.8
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|++.++.+-+.-|+.+|.+.+..| ..|+|||+
T Consensus 77 l~~a~~~~~~~~~~~L~~~~~~~~~~~-~~~~~~l~ 111 (126)
T cd00204 77 LHLAARNGNLDVVKLLLKHGADVNARD-KDGRTPLH 111 (126)
T ss_pred HHHHHHcCcHHHHHHHHHcCCCCcccC-CCCCCHHH
Confidence 555666666655666666666666555 45666653
No 57
>KOG0509|consensus
Probab=93.05 E-value=0.11 Score=43.20 Aligned_cols=43 Identities=16% Similarity=0.203 Sum_probs=38.2
Q ss_pred hhHhHHHHHHHHHHcCCHHHHHHHHhc-CCCCCCCCCCCCccccc
Q psy15934 26 HTRSNLKRFLEYVNNSQVEKIAKLCAK-GLDPNFHCPETGEMVAL 69 (70)
Q Consensus 26 htk~~~~df~~aV~~~~~~kV~kLL~k-GlDPN~~d~~~GeTpL~ 69 (70)
-+..+..+++.|.+.|+.+.|..|++. |.|+|-.| ..|.|+|+
T Consensus 40 e~d~~~~~~v~A~q~G~l~~v~~lve~~g~~v~~~D-~~g~tlLH 83 (600)
T KOG0509|consen 40 EEDSSLDDIVKATQYGELETVKELVESEGESVNNPD-REGVTLLH 83 (600)
T ss_pred ccchhhhhhhhHhhcchHHHHHHHHhhcCcCCCCCC-cCCcccee
Confidence 355678899999999999999999999 99999888 59999986
No 58
>KOG4214|consensus
Probab=92.76 E-value=0.074 Score=36.06 Aligned_cols=36 Identities=17% Similarity=0.170 Sum_probs=31.0
Q ss_pred HHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 32 KRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
++|+++|++|..+.|.+...+|+|-|-. -.|.|||+
T Consensus 4 ~~~~W~vkNG~~DeVk~~v~~g~nVn~~--~ggR~plh 39 (117)
T KOG4214|consen 4 MSVAWNVKNGEIDEVKQSVNEGLNVNEI--YGGRTPLH 39 (117)
T ss_pred hhHhhhhccCcHHHHHHHHHccccHHHH--hCCcccch
Confidence 6799999999999999999999888754 37899986
No 59
>KOG3676|consensus
Probab=92.63 E-value=0.083 Score=44.96 Aligned_cols=33 Identities=30% Similarity=0.341 Sum_probs=27.5
Q ss_pred HHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 37 YVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 37 aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
|+=.++.+-|.=|+++|.|||.+|+ .|+|-|++
T Consensus 247 AAC~nq~eivrlLl~~gAd~~aqDS-~GNTVLH~ 279 (782)
T KOG3676|consen 247 AACTNQPEIVRLLLAHGADPNAQDS-NGNTVLHM 279 (782)
T ss_pred HHHcCCHHHHHHHHhcCCCCCcccc-CCChHHHH
Confidence 4456788778888899999999995 99998863
No 60
>KOG0502|consensus
Probab=92.45 E-value=0.06 Score=41.14 Aligned_cols=32 Identities=19% Similarity=0.212 Sum_probs=24.8
Q ss_pred HHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 37 YVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 37 aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
||+-+.+.=+..||++|.|||..| +.|++++.
T Consensus 233 Avrgnhvkcve~Ll~sGAd~t~e~-dsGy~~md 264 (296)
T KOG0502|consen 233 AVRGNHVKCVESLLNSGADVTQED-DSGYWIMD 264 (296)
T ss_pred eecCChHHHHHHHHhcCCCccccc-ccCCcHHH
Confidence 555666777777889999999766 79998875
No 61
>PHA02917 ankyrin-like protein; Provisional
Probab=92.25 E-value=0.14 Score=41.80 Aligned_cols=35 Identities=17% Similarity=0.199 Sum_probs=30.0
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+.-|+..++.+-+.-|++.|.|+|..| ..|.|||+
T Consensus 456 Lh~Aa~~~~~~~v~~Ll~~GAdin~~d-~~G~T~L~ 490 (661)
T PHA02917 456 LHKAVRYNKQSLVSLLLESGSDVNIRS-NNGYTCIA 490 (661)
T ss_pred HHHHHHcCCHHHHHHHHHCcCCCCCCC-CCCCCHHH
Confidence 566778888888889999999999988 58999986
No 62
>KOG0505|consensus
Probab=92.15 E-value=0.1 Score=42.74 Aligned_cols=37 Identities=27% Similarity=0.399 Sum_probs=33.2
Q ss_pred HHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 32 KRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
.-|+.|-..++.+.|.+|+.+|.+||.+. .+|.|+|+
T Consensus 42 a~~l~A~~~~d~~ev~~ll~~ga~~~~~n-~DglTalh 78 (527)
T KOG0505|consen 42 AVFLEACSRGDLEEVRKLLNRGASPNLCN-VDGLTALH 78 (527)
T ss_pred HHHHhccccccHHHHHHHhccCCCccccC-CccchhHH
Confidence 34899999999999999999999999988 59999985
No 63
>KOG0510|consensus
Probab=91.85 E-value=0.094 Score=45.29 Aligned_cols=33 Identities=18% Similarity=0.145 Sum_probs=27.7
Q ss_pred HHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 37 YVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 37 aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
|+..++.+.+..|+++|+|||..| +.|.|||++
T Consensus 128 A~~~~~~s~L~~Ll~~~~dvnl~d-e~~~TpLh~ 160 (929)
T KOG0510|consen 128 AADSGNYSCLKLLLDYGADVNLED-ENGFTPLHL 160 (929)
T ss_pred ccccchHHHHHHHHHhcCCccccc-cCCCchhhH
Confidence 456788888999999999999888 788999874
No 64
>PHA02792 ankyrin-like protein; Provisional
Probab=91.60 E-value=0.18 Score=41.82 Aligned_cols=35 Identities=17% Similarity=0.192 Sum_probs=29.9
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+.-|+..++.+-+.-|++.|.|+|..| ..|+|||+
T Consensus 414 Lh~Aa~~~n~eivelLLs~GADIN~kD-~~G~TpL~ 448 (631)
T PHA02792 414 LYYCIESHSVSLVEWLIDNGADINITT-KYGSTCIG 448 (631)
T ss_pred HHHHHHcCCHHHHHHHHHCCCCCCCcC-CCCCCHHH
Confidence 777888888888888889999999888 68999986
No 65
>KOG1710|consensus
Probab=91.36 E-value=0.22 Score=39.30 Aligned_cols=36 Identities=14% Similarity=-0.006 Sum_probs=32.5
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..|...|+.+-|..||+-|.|-|-+--.+|+||||
T Consensus 49 LahAaykGnl~~v~lll~~gaDvN~~qhg~~YTpLm 84 (396)
T KOG1710|consen 49 LAHAAYKGNLTLVELLLELGADVNDKQHGTLYTPLM 84 (396)
T ss_pred HHHHHhcCcHHHHHHHHHhCCCcCcccccccccHHH
Confidence 677888999999999999999999877689999997
No 66
>KOG0508|consensus
Probab=91.14 E-value=0.14 Score=42.48 Aligned_cols=27 Identities=19% Similarity=-0.011 Sum_probs=22.7
Q ss_pred HHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 43 VEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 43 ~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+.-+..||+=|+|||.+| ..|+|||++
T Consensus 508 ~~V~k~Llecgadvna~D-~~~ntplHI 534 (615)
T KOG0508|consen 508 AHVTKVLLECGADVNARD-FDNNTPLHI 534 (615)
T ss_pred HHHHHHHHHcCCCCCccc-CCCCchhHH
Confidence 445667789999999999 699999974
No 67
>KOG0512|consensus
Probab=90.47 E-value=0.3 Score=36.23 Aligned_cols=36 Identities=11% Similarity=0.114 Sum_probs=26.6
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
-+-+|+.+|-.+-|..|+..|.+|+... ..|.|||+
T Consensus 100 pLHRAaYn~h~div~~ll~~gAn~~a~T-~~GWTPLh 135 (228)
T KOG0512|consen 100 PLHRAAYNGHLDIVHELLLSGANKEAKT-NEGWTPLH 135 (228)
T ss_pred HHHHHHhcCchHHHHHHHHccCCccccc-ccCccchh
Confidence 3567777777777777777888887776 57777774
No 68
>TIGR00870 trp transient-receptor-potential calcium channel protein. after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Probab=90.41 E-value=0.11 Score=41.64 Aligned_cols=36 Identities=11% Similarity=-0.049 Sum_probs=31.4
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+.-|+..++.+-|.-|++.|.|+|..| ..|+|||++
T Consensus 179 L~~Aa~~~~~~iv~lLl~~gadin~~d-~~g~T~Lh~ 214 (743)
T TIGR00870 179 LNAAACLGSPSIVALLSEDPADILTAD-SLGNTLLHL 214 (743)
T ss_pred HHHHHHhCCHHHHHHHhcCCcchhhHh-hhhhHHHHH
Confidence 456788899999999999999999999 599999963
No 69
>PHA02792 ankyrin-like protein; Provisional
Probab=89.67 E-value=0.31 Score=40.51 Aligned_cols=36 Identities=8% Similarity=0.026 Sum_probs=28.5
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCC--ccccc
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETG--EMVAL 69 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~G--eTpL~ 69 (70)
.++.|+..++.+-|.-|++.|+|+|..|. .| .|||+
T Consensus 342 ~~~~Aa~~gn~eIVelLIs~GADIN~kD~-~g~~~TpLh 379 (631)
T PHA02792 342 KYFQKFDNRDPKVVEYILKNGNVVVEDDD-NIINIMPLF 379 (631)
T ss_pred HHHHHHHcCCHHHHHHHHHcCCchhhhcC-CCCChhHHH
Confidence 37788888999888888899999998774 44 57764
No 70
>KOG4412|consensus
Probab=88.64 E-value=0.37 Score=35.85 Aligned_cols=36 Identities=14% Similarity=0.028 Sum_probs=30.2
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+-+|..-|+.+-|.=||..|..-|++| ..|.|||+.
T Consensus 143 lHRAAavGklkvie~Li~~~a~~n~qD-k~G~TpL~~ 178 (226)
T KOG4412|consen 143 LHRAAAVGKLKVIEYLISQGAPLNTQD-KYGFTPLHH 178 (226)
T ss_pred hHHHHhccchhhHHHHHhcCCCCCccc-ccCccHHHH
Confidence 456778899988888889999999999 599999973
No 71
>PHA02795 ankyrin-like protein; Provisional
Probab=88.31 E-value=0.44 Score=37.90 Aligned_cols=33 Identities=12% Similarity=-0.141 Sum_probs=27.4
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
-+.+.+..+-+.-|+..|.|.|..| ..|.|||+
T Consensus 194 ~a~~~~~~eIve~LIs~GADIN~kD-~~G~TpLh 226 (437)
T PHA02795 194 LVDEPTVLEIYKLCIPYIEDINQLD-AGGRTLLY 226 (437)
T ss_pred HHHhcCHHHHHHHHHhCcCCcCcCC-CCCCCHHH
Confidence 3456667777888889999999999 69999996
No 72
>KOG0508|consensus
Probab=88.08 E-value=0.42 Score=39.73 Aligned_cols=35 Identities=20% Similarity=0.324 Sum_probs=22.5
Q ss_pred HHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 35 LEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 35 ~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
.-|--.|..+-|.-|++.|+||+..+ ..|.|-||+
T Consensus 122 raACfDG~leivKyLvE~gad~~Ian-rhGhTcLmI 156 (615)
T KOG0508|consen 122 RAACFDGHLEIVKYLVEHGADPEIAN-RHGHTCLMI 156 (615)
T ss_pred HHHHhcchhHHHHHHHHcCCCCcccc-cCCCeeEEe
Confidence 34445566666666777777777666 477776664
No 73
>PHA02730 ankyrin-like protein; Provisional
Probab=87.20 E-value=0.73 Score=38.55 Aligned_cols=36 Identities=11% Similarity=0.030 Sum_probs=29.1
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+.-|+..++.+-+.-|+..|.|+|..|...|+|||+
T Consensus 466 Lh~Aa~~~~~eive~LI~~GAdIN~~d~~~g~TaL~ 501 (672)
T PHA02730 466 LYYAVDVNNIQFARRLLEYGASVNTTSRSIINTAIQ 501 (672)
T ss_pred HHHHHHhCCHHHHHHHHHCCCCCCCCCCcCCcCHHH
Confidence 566777888888888889999999888435899885
No 74
>KOG0515|consensus
Probab=86.61 E-value=0.69 Score=39.05 Aligned_cols=33 Identities=15% Similarity=0.015 Sum_probs=30.3
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
.||=-|+.+-|.=|++.|.|-|..|+ .|.|||+
T Consensus 589 NAiCaghyeIVkFLi~~ganVNa~DS-dGWTPLH 621 (752)
T KOG0515|consen 589 NAICAGHYEIVKFLIEFGANVNAADS-DGWTPLH 621 (752)
T ss_pred hhhhcchhHHHHHHHhcCCcccCccC-CCCchhh
Confidence 68888999989999999999999995 9999997
No 75
>KOG4177|consensus
Probab=86.22 E-value=0.6 Score=41.32 Aligned_cols=38 Identities=18% Similarity=0.103 Sum_probs=30.9
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
|-=..-|+..|++.-|.-||++|.|-|..| +-|.|||+
T Consensus 541 ~TpLh~A~~~g~v~~VkfLLe~gAdv~ak~-~~G~TPLH 578 (1143)
T KOG4177|consen 541 YTPLHVAVHYGNVDLVKFLLEHGADVNAKD-KLGYTPLH 578 (1143)
T ss_pred cchHHHHHhcCCchHHHHhhhCCccccccC-CCCCChhh
Confidence 334667888888888888888888888888 68888886
No 76
>KOG0512|consensus
Probab=85.69 E-value=1 Score=33.51 Aligned_cols=42 Identities=10% Similarity=0.101 Sum_probs=34.6
Q ss_pred hHhHHHHHHHHHHcCCHHHHHHHHhcCC-CCCCCCCCCCccccc
Q psy15934 27 TRSNLKRFLEYVNNSQVEKIAKLCAKGL-DPNFHCPETGEMVAL 69 (70)
Q Consensus 27 tk~~~~df~~aV~~~~~~kV~kLL~kGl-DPN~~d~~~GeTpL~ 69 (70)
-|...+..++|+..+-...|..||.--. --|+.| +.|+|||+
T Consensus 60 e~dp~rl~lwaae~nrl~eV~~lL~e~an~vNtrD-~D~YTpLH 102 (228)
T KOG0512|consen 60 EKDPIRLLLWAAEKNRLTEVQRLLSEKANHVNTRD-EDEYTPLH 102 (228)
T ss_pred ccCHHHHHHHHHhhccHHHHHHHHHhccccccccc-cccccHHH
Confidence 3466788999999999999999996444 468888 69999995
No 77
>KOG2505|consensus
Probab=84.07 E-value=1 Score=37.50 Aligned_cols=34 Identities=12% Similarity=0.109 Sum_probs=30.6
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
.|++.|+..-|..+|+-|.||-+-| ..|.||-+|
T Consensus 436 ~aa~qg~~k~v~~~Leeg~Dp~~kd-~~Grtpy~l 469 (591)
T KOG2505|consen 436 YAAAQGARKCVKYFLEEGCDPSTKD-GAGRTPYSL 469 (591)
T ss_pred HHHhcchHHHHHHHHHhcCCchhcc-cCCCCcccc
Confidence 4677899999999999999999999 799999875
No 78
>PF08332 CaMKII_AD: Calcium/calmodulin dependent protein kinase II Association; InterPro: IPR013543 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []: Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins) Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. This domain is found at the C terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (IPR000719 from INTERPRO) at their N terminus []. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers [] and is a prominent kinase in the central nervous system that may function in long-term potentiation and neurotransmitter release. ; GO: 0004683 calmodulin-dependent protein kinase activity, 0005516 calmodulin binding, 0006468 protein phosphorylation; PDB: 2W2C_F 3H51_B 3SOA_A 2UX0_A 1HKX_M 2F86_B.
Probab=83.82 E-value=1.6 Score=29.27 Aligned_cols=30 Identities=30% Similarity=0.464 Sum_probs=24.4
Q ss_pred HHHHHHHhhHhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 19 EKQLKALHTRSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 19 ekql~klhtk~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
|+++++| .++.++||+.||.+++++|++..
T Consensus 2 e~eI~~l-----~~~w~~ai~tgD~~~~~~ly~~d 31 (128)
T PF08332_consen 2 EQEIAAL-----FDRWNDAIQTGDPETYAKLYAPD 31 (128)
T ss_dssp HHHHHHH-----HHHHHHHHHHT-HHHHHHHEEEE
T ss_pred HHHHHHH-----HHHHHHHHHcCCHHHHhhhcCCC
Confidence 5667665 68899999999999999999765
No 79
>KOG0502|consensus
Probab=82.81 E-value=0.73 Score=35.34 Aligned_cols=36 Identities=14% Similarity=0.160 Sum_probs=28.2
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+++|...|..+-|.=||+-|.||...- .--||+|+|
T Consensus 164 LiWAaa~G~i~vV~fLL~~GAdp~~lg-k~resALsL 199 (296)
T KOG0502|consen 164 LIWAAAKGHIPVVQFLLNSGADPDALG-KYRESALSL 199 (296)
T ss_pred hHHHHhcCchHHHHHHHHcCCChhhhh-hhhhhhHhH
Confidence 788889999988888888888888765 345666654
No 80
>KOG0195|consensus
Probab=82.28 E-value=1.1 Score=35.76 Aligned_cols=32 Identities=19% Similarity=0.158 Sum_probs=28.1
Q ss_pred HHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 37 YVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 37 aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
|..+|.-+-|.+||.+-.|-|.++ +.|+|||+
T Consensus 74 aaahghrdivqkll~~kadvnavn-ehgntplh 105 (448)
T KOG0195|consen 74 AAAHGHRDIVQKLLSRKADVNAVN-EHGNTPLH 105 (448)
T ss_pred hhhcccHHHHHHHHHHhcccchhh-ccCCCchh
Confidence 556788888999999999999988 89999996
No 81
>KOG4177|consensus
Probab=78.60 E-value=1.7 Score=38.64 Aligned_cols=42 Identities=17% Similarity=0.136 Sum_probs=36.3
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
.+.+.-+-.|...|..+.+..|+.+|..||..+ -.|+|||+.
T Consensus 240 ~~gl~~lh~a~~~g~~~i~~~l~~~ga~~~~~~-vr~~tplh~ 281 (1143)
T KOG4177|consen 240 ESGLTPLHVAAFMGHLDIVKLLLQHGASVNVST-VRGETPLHM 281 (1143)
T ss_pred ccCccHHHHHHhccchhHHHHHHhcccccCccc-ccccCcchh
Confidence 445666778899999999999999999999888 699999963
No 82
>KOG0783|consensus
Probab=78.32 E-value=0.99 Score=39.96 Aligned_cols=32 Identities=19% Similarity=0.170 Sum_probs=28.0
Q ss_pred HHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 38 VNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 38 V~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+..+.-+-+.-||++|.|-+-+|-|+|.|||+
T Consensus 60 ~S~~k~~~l~wLlqhGidv~vqD~ESG~taLH 91 (1267)
T KOG0783|consen 60 VSENKNSFLRWLLQHGIDVFVQDEESGYTALH 91 (1267)
T ss_pred eccchhHHHHHHHhcCceeeeccccccchHhh
Confidence 44667778888999999999999999999996
No 83
>PF10152 DUF2360: Predicted coiled-coil domain-containing protein (DUF2360); InterPro: IPR019309 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53.
Probab=77.91 E-value=2.2 Score=29.03 Aligned_cols=33 Identities=18% Similarity=0.373 Sum_probs=26.1
Q ss_pred HhHHHHHHHHHHcCCHHHH--HHHHhcCCCCCCCC
Q psy15934 28 RSNLKRFLEYVNNSQVEKI--AKLCAKGLDPNFHC 60 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV--~kLL~kGlDPN~~d 60 (70)
--.|.+||.-++.|=.... .||..-|+||+..|
T Consensus 113 dP~y~kYfKMl~~GvP~~aVk~KM~~eGlDp~~Ld 147 (148)
T PF10152_consen 113 DPRYAKYFKMLKMGVPREAVKQKMQAEGLDPSLLD 147 (148)
T ss_pred CccHHHHHHHHHcCCCHHHHHHHHHHcCCCHHHhc
Confidence 3469999999999954433 46889999999876
No 84
>KOG0818|consensus
Probab=72.87 E-value=2.5 Score=35.47 Aligned_cols=32 Identities=19% Similarity=0.176 Sum_probs=28.3
Q ss_pred HHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 37 YVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 37 aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
|.+-|+.-.+.=|+--|.||+..| -.|.||+.
T Consensus 174 AAk~Gq~~Q~ElL~vYGAD~~a~d-~~GmtP~~ 205 (669)
T KOG0818|consen 174 AAKAGQILQAELLAVYGADPGAQD-SSGMTPVD 205 (669)
T ss_pred HHhccchhhhhHHhhccCCCCCCC-CCCCcHHH
Confidence 566788999999999999999999 59999975
No 85
>KOG4496|consensus
Probab=72.58 E-value=4.5 Score=29.37 Aligned_cols=36 Identities=25% Similarity=0.530 Sum_probs=28.4
Q ss_pred HHHHHHHHHHcCCHH--HHHHHHhcCCCCCCCCCCCCccc
Q psy15934 30 NLKRFLEYVNNSQVE--KIAKLCAKGLDPNFHCPETGEMV 67 (70)
Q Consensus 30 ~~~df~~aV~~~~~~--kV~kLL~kGlDPN~~d~~~GeTp 67 (70)
.|.+||.-++.|=.+ -+.||+..|+||++. ++|.-|
T Consensus 134 ~YakY~KMlklGV~~~ai~nKMiSEGlDPdiL--eR~DaP 171 (194)
T KOG4496|consen 134 AYAKYFKMLKLGVLEAAIINKMISEGLDPDIL--ERGDAP 171 (194)
T ss_pred HHHHHHHHHHhccHHHHHHHHHHHccCCHHHH--hCCCCC
Confidence 588999999988655 467999999999975 356544
No 86
>KOG1710|consensus
Probab=72.30 E-value=3 Score=33.11 Aligned_cols=39 Identities=21% Similarity=0.186 Sum_probs=33.3
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
.-+.++++|-.|+++..-.||.-==.-|++|| +|.|+|+
T Consensus 12 ~~~~Lle~i~Kndt~~a~~LLs~vr~vn~~D~-sGMs~La 50 (396)
T KOG1710|consen 12 PKSPLLEAIDKNDTEAALALLSTVRQVNQRDP-SGMSVLA 50 (396)
T ss_pred hhhHHHHHHccCcHHHHHHHHHHhhhhhccCC-CcccHHH
Confidence 34568999999999999999977556999994 9999986
No 87
>KOG0510|consensus
Probab=71.55 E-value=4.1 Score=35.69 Aligned_cols=35 Identities=14% Similarity=0.019 Sum_probs=31.4
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
+=.|.+.|..+.|.-|+..|+|-|... ..++|||+
T Consensus 277 LH~a~r~G~~~svd~Ll~~Ga~I~~kn-~d~~spLH 311 (929)
T KOG0510|consen 277 LHYAARQGGPESVDNLLGFGASINSKN-KDEESPLH 311 (929)
T ss_pred HHHHHHcCChhHHHHHHHcCCcccccC-CCCCCchH
Confidence 345788999999999999999999887 79999996
No 88
>KOG0521|consensus
Probab=70.13 E-value=2.4 Score=36.07 Aligned_cols=33 Identities=21% Similarity=0.284 Sum_probs=24.3
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
.++..|.+..+..++.+|.|||.-|+ +|.+||.
T Consensus 695 ~~~~~g~~~~~~~ll~~~a~~~a~~~-~~~~~l~ 727 (785)
T KOG0521|consen 695 HATASGHTSIACLLLKRGADPNAFDP-DGKLPLD 727 (785)
T ss_pred hhhhhcccchhhhhccccccccccCc-cCcchhh
Confidence 45667777777788888888887774 7777764
No 89
>PF12870 Lumazine_bd: Lumazine-binding domain; InterPro: IPR024267 This entry represents a lumazine-binding domain found in a family of putative lipoproteins from bacteria. Lumazine is a fluorescent accessory protein having 6,7-dimethyl-8-(1'-D-ribityl) lumazine (DMRL) as its authentic chromophore; it modulates the emission of bacterial luciferase to shorter wavelengths with increasing luminous strength.; PDB: 3K7C_C.
Probab=69.98 E-value=4.1 Score=23.95 Aligned_cols=24 Identities=21% Similarity=0.304 Sum_probs=18.7
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhc
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAK 52 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~k 52 (70)
...+.|++|+..||.++..+++..
T Consensus 11 ~~v~~f~~al~~gd~~~a~~~~~~ 34 (111)
T PF12870_consen 11 EVVKNFFDALKNGDYEKAYAYLSP 34 (111)
T ss_dssp HHHHHHHHHHCTT-HHHHHHTB--
T ss_pred HHHHHHHHHHHcCCHHHHHHhhCc
Confidence 457899999999999999988753
No 90
>KOG4214|consensus
Probab=68.84 E-value=5.5 Score=27.05 Aligned_cols=34 Identities=24% Similarity=0.388 Sum_probs=26.7
Q ss_pred HHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccc
Q psy15934 33 RFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMV 67 (70)
Q Consensus 33 df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTp 67 (70)
-++.||-.|.++-|.=||+.|.|--.|-| .|.+.
T Consensus 70 PLLsAvwEGH~~cVklLL~~GAdrt~~~P-dG~~~ 103 (117)
T KOG4214|consen 70 PLLSAVWEGHRDCVKLLLQNGADRTIHAP-DGTAL 103 (117)
T ss_pred HHHHHHHHhhHHHHHHHHHcCcccceeCC-CchhH
Confidence 37788888888888888888888888876 67654
No 91
>PF12169 DNA_pol3_gamma3: DNA polymerase III subunits gamma and tau domain III; InterPro: IPR022754 This domain is found in bacteria and eukaryotes, and is approximately 110 amino acids in length. It is found in association with PF00004 from PFAM. This domain is also present in the tau subunit before it undergoes cleavage. Domains I-III are shared between the tau and the gamma subunits, while most of the DnaB-binding Domain IV and all of the alpha-interacting Domain V are unique to tau. ; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 1NJF_B 3GLG_G 1XXH_I 1NJG_A 3GLF_B 3GLI_G.
Probab=64.22 E-value=9.5 Score=24.29 Aligned_cols=31 Identities=16% Similarity=0.347 Sum_probs=22.0
Q ss_pred hHhHHHHHHHHHHcCCHHHHH----HHHhcCCCCC
Q psy15934 27 TRSNLKRFLEYVNNSQVEKIA----KLCAKGLDPN 57 (70)
Q Consensus 27 tk~~~~df~~aV~~~~~~kV~----kLL~kGlDPN 57 (70)
+...+.+++++|..+|+.++- ++++.|.||.
T Consensus 14 ~~~~i~~l~~ai~~~d~~~~l~~~~~l~~~G~d~~ 48 (143)
T PF12169_consen 14 DEEQIFELLDAILEGDAAEALELLNELLEQGKDPK 48 (143)
T ss_dssp STHHHHHHHHHHHTT-HHHHHHHHHHHHHCT--HH
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCHH
Confidence 456688999999999998765 4568898885
No 92
>PF07729 FCD: FCD domain; InterPro: IPR011711 Many bacterial transcription regulation proteins bind DNA through a helix-turn-helix (HTH) motif, which can be classified into subfamilies on the basis of sequence similarities. The HTH GntR family has many members distributed among diverse bacterial groups that regulate various biological processes. It was named GntR after the Bacillus subtilis repressor of the gluconate operon []. In general, these proteins contain a DNA-binding HTH domain at the N terminus, and an effector binding or oligomerisation domain at the C terminus. The winged-helix DNA-binding domain is well conserved in structure for the whole of the GntR family (IPR000524 from INTERPRO), and is similar in structure to other transcriptional regulator families. The C-terminal effector-binding and oligomerisation domains are more variable and are consequently used to define the subfamilies. Based on the sequence and structure of the C-terminal domains, the GtnR family can be divided into four major groups, as represented by FadR (IPR008920 from INTERPRO), HutC, MocR and YtrA, as well as some minor groups such as those represented by AraR and PlmA []. This entry represents the C-terminal ligand binding domain of many members of the GntR family. This domain probably binds to a range of effector molecules that regulate the transcription of genes through the action of the N-terminal DNA-binding domain. This domain is found in P45427 from SWISSPROT and P31460 from SWISSPROT that are regulators of sugar biosynthesis operons.; PDB: 3SXK_A 3SXY_A 3SXM_B 3SXZ_A 3FMS_A 2DI3_B 2HS5_A 3IHU_B 3C7J_A.
Probab=59.80 E-value=19 Score=20.91 Aligned_cols=28 Identities=4% Similarity=0.092 Sum_probs=21.9
Q ss_pred HhhHhHHHHHHHHHHcCCHHHHHHHHhc
Q psy15934 25 LHTRSNLKRFLEYVNNSQVEKIAKLCAK 52 (70)
Q Consensus 25 lhtk~~~~df~~aV~~~~~~kV~kLL~k 52 (70)
-.+-...++.++|+..||++.+.+++..
T Consensus 94 ~~~~~~h~~i~~ai~~~d~~~a~~~~~~ 121 (125)
T PF07729_consen 94 ERSLEEHREIIDAIRAGDPEAAREALRQ 121 (125)
T ss_dssp HHHHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred hhhHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 3445568899999999999999998753
No 93
>KOG0505|consensus
Probab=59.37 E-value=11 Score=31.30 Aligned_cols=38 Identities=16% Similarity=0.132 Sum_probs=32.1
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
|--.-+++..++.+-|.-||+.|.+-|..| .+|.|||+
T Consensus 74 lTalhq~~id~~~e~v~~l~e~ga~Vn~~d-~e~wtPlh 111 (527)
T KOG0505|consen 74 LTALHQACIDDNLEMVKFLVENGANVNAQD-NEGWTPLH 111 (527)
T ss_pred chhHHHHHhcccHHHHHHHHHhcCCccccc-cccCCcch
Confidence 334556777888989999999999999999 59999996
No 94
>PF01649 Ribosomal_S20p: Ribosomal protein S20; InterPro: IPR002583 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This family consists of bacterial (and chloroplast) examples of the ribosomal small subunit protein S20. Bacterial ribosomal protein S20 forms part of the 30S ribosomal subunit, and interacts with 16S rRNA.; GO: 0003723 RNA binding, 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1I94_T 1FJG_T 4DH9_T 3KNJ_T 3TVG_W 3UYF_W 3V28_T 3KIS_t 3HUY_T 1HNX_T ....
Probab=59.22 E-value=18 Score=22.62 Aligned_cols=23 Identities=26% Similarity=0.456 Sum_probs=17.7
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHH
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLC 50 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL 50 (70)
++..++|..+|..|+.+...+++
T Consensus 28 rT~iKk~~~ai~~~~~~~a~~~l 50 (84)
T PF01649_consen 28 RTAIKKFREAIEAGDKEEAKELL 50 (84)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHH
T ss_pred HHHHHHHHHHHHccChHHHHHHH
Confidence 56788999999999999665554
No 95
>PRK14135 recX recombination regulator RecX; Provisional
Probab=59.09 E-value=15 Score=26.12 Aligned_cols=55 Identities=11% Similarity=0.145 Sum_probs=40.0
Q ss_pred hhhhhhhhccCCCHHHHHHHhhHhHHHHHHHH----HH---cCCHHHHHHHHhcCCCCCCCC
Q psy15934 6 IANTFISLLLNLDEKQLKALHTRSNLKRFLEY----VN---NSQVEKIAKLCAKGLDPNFHC 60 (70)
Q Consensus 6 ~~k~rvy~~~~l~ekql~klhtk~~~~df~~a----V~---~~~~~kV~kLL~kGlDPN~~d 60 (70)
+.+-++.+...||+.++..|-....+....++ +. ....+=..+|.++|++|+..+
T Consensus 31 ~~~~~L~~g~~l~~~~~~~i~~~~~~~~a~~~Al~~L~~r~~s~~el~~kL~~kg~~~~~Ie 92 (263)
T PRK14135 31 LVKFMLKKGKELDEEDLEEIQYADQVSKGKNLALYYLSYQMRTEKEVRDYLKKHEISEEIIS 92 (263)
T ss_pred HHHhcCcCCCcCCHHHHHHHHHHHHHHHHHHHHHHHhhhccccHHHHHHHHHHCCCCHHHHH
Confidence 56778888999999999999887776665444 33 333445567788999987654
No 96
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=58.45 E-value=18 Score=16.64 Aligned_cols=22 Identities=18% Similarity=0.543 Sum_probs=14.9
Q ss_pred HHHcCCHHHHHH----HHhcCCCCCC
Q psy15934 37 YVNNSQVEKIAK----LCAKGLDPNF 58 (70)
Q Consensus 37 aV~~~~~~kV~k----LL~kGlDPN~ 58 (70)
+++.|+.++..+ |.++|+.||.
T Consensus 10 ~~~~~~~~~a~~~~~~M~~~g~~p~~ 35 (35)
T TIGR00756 10 LCKAGRVEEALELFKEMLERGIEPDV 35 (35)
T ss_pred HHHCCCHHHHHHHHHHHHHcCCCCCC
Confidence 456677666554 4678999883
No 97
>KOG0507|consensus
Probab=54.38 E-value=12 Score=32.73 Aligned_cols=33 Identities=21% Similarity=0.174 Sum_probs=29.2
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
-+|+++..+-+.=||.+|.||=.++ ++|+|+|-
T Consensus 121 laaqhgh~dvv~~Ll~~~adp~i~n-ns~~t~ld 153 (854)
T KOG0507|consen 121 LAAQHGHLEVVFYLLKKNADPFIRN-NSKETVLD 153 (854)
T ss_pred hhhhhcchHHHHHHHhcCCCccccC-cccccHHH
Confidence 4789999999999999999999999 59999873
No 98
>PF13041 PPR_2: PPR repeat family
Probab=52.18 E-value=22 Score=18.75 Aligned_cols=21 Identities=14% Similarity=0.409 Sum_probs=14.5
Q ss_pred HHcCCHHHHH----HHHhcCCCCCC
Q psy15934 38 VNNSQVEKIA----KLCAKGLDPNF 58 (70)
Q Consensus 38 V~~~~~~kV~----kLL~kGlDPN~ 58 (70)
.+.|+.++.. +|.++|+.||.
T Consensus 14 ~~~~~~~~a~~l~~~M~~~g~~P~~ 38 (50)
T PF13041_consen 14 CKAGKFEEALKLFKEMKKRGIKPDS 38 (50)
T ss_pred HHCcCHHHHHHHHHHHHHcCCCCCH
Confidence 4456665544 55789999995
No 99
>KOG3609|consensus
Probab=48.11 E-value=19 Score=31.38 Aligned_cols=41 Identities=29% Similarity=0.284 Sum_probs=32.2
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcC----CCCCCCCCCCCcccccC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKG----LDPNFHCPETGEMVALI 70 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kG----lDPN~~d~~~GeTpL~L 70 (70)
.+.++|+.+++.||...|.++++.- +.-|-+|| =|.++|++
T Consensus 24 ~~e~~fL~a~E~gd~~~V~k~l~~~~~~~lninc~d~-lGr~al~i 68 (822)
T KOG3609|consen 24 EGEKGFLLAHENGDVPLVAKALEYKAVSKLNINCRDP-LGRLALHI 68 (822)
T ss_pred hhhHHHHHHHHcCChHHHHHHHHhccccccchhccCh-Hhhhceec
Confidence 4578999999999999999999643 45666775 77777754
No 100
>PRK06585 holA DNA polymerase III subunit delta; Reviewed
Probab=47.75 E-value=43 Score=24.33 Aligned_cols=41 Identities=12% Similarity=0.228 Sum_probs=31.4
Q ss_pred CCCHHHHHHH---hhHhHHHHHHHHHHcCCHHHHH----HHHhcCCCC
Q psy15934 16 NLDEKQLKAL---HTRSNLKRFLEYVNNSQVEKIA----KLCAKGLDP 56 (70)
Q Consensus 16 ~l~ekql~kl---htk~~~~df~~aV~~~~~~kV~----kLL~kGlDP 56 (70)
.++...+.++ ....+.-+|.+|+-.|+..+.. .|++.|.+|
T Consensus 197 ~It~edV~~lv~~~~e~~if~l~dai~~~~~~~a~~~l~~ll~~g~~p 244 (343)
T PRK06585 197 EITLDDVRAVVGDASALSLDDAADAALAGDLAAFERALDRALAEGTAP 244 (343)
T ss_pred CCCHHHHHHHhCCcccccHHHHHHHHHCCCHHHHHHHHHHHHHcCCCH
Confidence 5777777777 5667888999999999998754 455666666
No 101
>COG0268 RpsT Ribosomal protein S20 [Translation, ribosomal structure and biogenesis]
Probab=47.13 E-value=37 Score=21.94 Aligned_cols=23 Identities=13% Similarity=0.174 Sum_probs=18.3
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHH
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLC 50 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL 50 (70)
|+..++|..+|..||.+....++
T Consensus 29 rT~iKk~~~ai~~gd~~~A~~~l 51 (88)
T COG0268 29 RTAIKKVEAAIEAGDKEAAKAAL 51 (88)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHH
Confidence 56788999999999877666554
No 102
>KOG0511|consensus
Probab=47.03 E-value=16 Score=29.98 Aligned_cols=39 Identities=13% Similarity=0.140 Sum_probs=34.2
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
...+.++|-+.|+++.|..|.+-|..-|.+| .=-..||.
T Consensus 36 ~f~elceacR~GD~d~v~~LVetgvnVN~vD-~fD~spL~ 74 (516)
T KOG0511|consen 36 PFGELCEACRAGDVDRVRYLVETGVNVNAVD-RFDSSPLY 74 (516)
T ss_pred chHHHHHHhhcccHHHHHHHHHhCCCcchhh-cccccHHH
Confidence 3778999999999999999999999999999 46667764
No 103
>cd00781 ketosteroid_isomerase ketosteroid isomerase: Many biological reactions proceed by enzymatic cleavage of a C-H bond adjacent to carbonyl or a carboxyl group, leading to an enol or a enolate intermediate that is subsequently re-protonated at the same or an adjacent carbon. Ketosteroid isomerases are important members of this class of enzymes which are the most proficient of all enzymes known and have served as a paradigm for enzymatic enolizations since its discovery in 1954. This CD includes members of this class that calalyze the isomerization of various beta,gamma-unsaturated isomers at nearly a diffusion-controlled rate. These enzymes are widely distributed in bacteria.
Probab=46.63 E-value=32 Score=20.65 Aligned_cols=26 Identities=31% Similarity=0.496 Sum_probs=21.7
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
+.-.++|++++..+|.+.+..|+.-.
T Consensus 6 ~~~v~~~~~a~~~~D~~~~~~l~aed 31 (122)
T cd00781 6 KAAVQRYVEAVNAGDPEGIVALFADD 31 (122)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHcCCC
Confidence 45588999999999999999996443
No 104
>TIGR00029 S20 ribosomal protein S20. This family consists of bacterial (and chloroplast) examples of the bacteria ribosomal small subunit protein S20.
Probab=44.41 E-value=43 Score=21.23 Aligned_cols=24 Identities=8% Similarity=0.191 Sum_probs=18.7
Q ss_pred hHhHHHHHHHHHHcCCHHHHHHHH
Q psy15934 27 TRSNLKRFLEYVNNSQVEKIAKLC 50 (70)
Q Consensus 27 tk~~~~df~~aV~~~~~~kV~kLL 50 (70)
-|+..++|..+|..|+.+....++
T Consensus 28 ~kT~iKk~~~ai~~~d~~~a~~~l 51 (87)
T TIGR00029 28 MKTIIKKVYAAIAAGDKDKAQEAF 51 (87)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHH
Confidence 366789999999999987655553
No 105
>PRK14155 heat shock protein GrpE; Provisional
Probab=44.40 E-value=6.8 Score=28.42 Aligned_cols=33 Identities=21% Similarity=0.312 Sum_probs=22.8
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
-.++.|+.....-.+++|...+-.-||||+|..
T Consensus 101 mi~k~~~~~L~k~GV~~I~~~~G~~FDP~~HEA 133 (208)
T PRK14155 101 MTEKELLGAFERNGLKKIDPAKGDKFDPHLHQA 133 (208)
T ss_pred HHHHHHHHHHHHCCCceecCCCCCCCChhHhce
Confidence 345667777777666666544567799999984
No 106
>PF04908 SH3BGR: SH3-binding, glutamic acid-rich protein; InterPro: IPR006993 This family of proteins, which contains SH3BGRL3, is functionally uncharacterised. SH3BGRL3 is a highly conserved small protein, which is widely expressed and shows a significant similarity to glutaredoxin 1 (GRX1) of Escherichia coli which is predicted to belong to the thioredoxin superfamily. However, SH3BGRL3 lacks both conserved cysteine residues, which characterise the enzymatic active site of GRX. This structural feature raises the possibility that SH3BGRL3 and its homologues could function as endogenous modulators of GRX activity []. ; PDB: 1SJ6_A 1U6T_A 1WRY_A 1T1V_B 1J0F_A 2CT6_A.
Probab=44.30 E-value=13 Score=24.02 Aligned_cols=21 Identities=29% Similarity=0.434 Sum_probs=15.6
Q ss_pred hHHHHHHHHHHcCCHHHHHHH
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKL 49 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kL 49 (70)
++|++|++|++.+..+..-+|
T Consensus 78 Gdye~f~ea~E~~~L~~fL~L 98 (99)
T PF04908_consen 78 GDYEDFEEANENGELEEFLKL 98 (99)
T ss_dssp EEHHHHHHHHCTT-HHHHHT-
T ss_pred eeHHHHHHHHhhCHHHHHhCc
Confidence 579999999999988765443
No 107
>PRK00239 rpsT 30S ribosomal protein S20; Reviewed
Probab=43.96 E-value=45 Score=21.12 Aligned_cols=24 Identities=13% Similarity=0.178 Sum_probs=19.0
Q ss_pred hHhHHHHHHHHHHcCCHHHHHHHH
Q psy15934 27 TRSNLKRFLEYVNNSQVEKIAKLC 50 (70)
Q Consensus 27 tk~~~~df~~aV~~~~~~kV~kLL 50 (70)
-|+..++|..+|..|+.+....++
T Consensus 28 ~kT~iKk~~~ai~~~~~~~a~~~~ 51 (88)
T PRK00239 28 VRTAIKKVEAAIAAGDKEAAEEAL 51 (88)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHH
Confidence 366789999999999987766654
No 108
>PF11252 DUF3051: Protein of unknown function (DUF3051); InterPro: IPR021411 This entry is represented by Sugarcane yellow leaf virus (SCYLV), Orf0; it is a family of uncharacterised viral proteins.
Probab=43.47 E-value=20 Score=25.79 Aligned_cols=46 Identities=17% Similarity=0.415 Sum_probs=28.2
Q ss_pred cchhhhhhhhhccCCCHHHHHHHhhHhHHHHHHHHHHc---CCHHHHHHHHhcCCCC
Q psy15934 3 TSNIANTFISLLLNLDEKQLKALHTRSNLKRFLEYVNN---SQVEKIAKLCAKGLDP 56 (70)
Q Consensus 3 ~~~~~k~rvy~~~~l~ekql~klhtk~~~~df~~aV~~---~~~~kV~kLL~kGlDP 56 (70)
+.-+-||.+..|-++...-+. + .++.|+-. .++..+..+|+||.||
T Consensus 51 ~rc~s~rf~leq~~i~~e~fr-----t---qll~y~g~difqdvarlq~~lergyd~ 99 (189)
T PF11252_consen 51 KRCFSRRFTLEQDHISPEFFR-----T---QLLQYVGMDIFQDVARLQNILERGYDP 99 (189)
T ss_pred hhHhhcceeeecCcCCHHHHH-----H---HHHHHhchHHHHHHHHHHHHHHhcchh
Confidence 334456666777676655432 1 23334332 3677888899999998
No 109
>TIGR02957 SigX4 RNA polymerase sigma-70 factor, TIGR02957 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building and bidirectional best hits, to represent a conserved family. This family is found in a limited number of bacterial lineages. This family includes apparent paralogous expansion in Streptomyces coelicolor A3(2), and multiple copies in Mycobacterium smegmatis MC2, Streptomyces avermitilis MA-4680 and Nocardia farcinica IFM10152.
Probab=43.39 E-value=36 Score=24.49 Aligned_cols=27 Identities=26% Similarity=0.480 Sum_probs=23.0
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcCC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kGl 54 (70)
+.-.+.|++|+..||.+.+..||.-..
T Consensus 167 ~~~~~~f~~a~~~gD~~~l~~lL~~dv 193 (281)
T TIGR02957 167 RQLLERFVEAAQTGDLDGLLELLAEDV 193 (281)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHhhce
Confidence 355889999999999999999987543
No 110
>COG2605 Predicted kinase related to galactokinase and mevalonate kinase [General function prediction only]
Probab=43.23 E-value=64 Score=25.49 Aligned_cols=52 Identities=15% Similarity=0.319 Sum_probs=40.8
Q ss_pred cchhhhhhhhhccCCCHHHHHHHhh-HhHHHHHHHHHHcCCHHHHHHHHhcCC
Q psy15934 3 TSNIANTFISLLLNLDEKQLKALHT-RSNLKRFLEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 3 ~~~~~k~rvy~~~~l~ekql~klht-k~~~~df~~aV~~~~~~kV~kLL~kGl 54 (70)
.|+|.+..|=+...=||+.++++|. |+--.+|.+++..+|......+|.+|-
T Consensus 201 Ss~V~~dQ~~~~~~~~~~~~e~~~~mk~~A~~~~~al~~nd~~~f~~~l~~gW 253 (333)
T COG2605 201 SSEVIEDQVRNVVDGDEETLEALHEMKALAYEMKDALVRNDIPEFGQILDRGW 253 (333)
T ss_pred hhHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHhHH
Confidence 3556666665555667788999886 677778999999999999999998874
No 111
>PF05223 MecA_N: NTF2-like N-terminal transpeptidase domain; InterPro: IPR007887 The multiple antibiotic resistance of methicillin-resistant strains of Staphylococcus aureus (MRSA) has become a major clinical problem worldwide. Methicillin resistance in MRSA strains is due to the acquisition of the mecA gene via horizontal transfer from an unidentified species which encodes penicillin-binding protein 2a (PBP2a). The structure of the N-terminal domain from MecA is known [] Q53707 from SWISSPROT and is found to be similar to that found in NTF2 IPR002075 from INTERPRO. The length of the PBP2A N-terminal domain (which positions the transpeptidase active site more than 100A from the expected C terminus of the transmembrane anchor) suggests a possible structural role and potentially gives the transpeptidase domain substantial reach from the cell membrane. This domain seems unlikely to have an enzymatic function.; GO: 0046677 response to antibiotic; PDB: 1MWS_B 1MWT_B 1MWR_A 1MWU_A 1VQQ_A.
Probab=42.99 E-value=26 Score=22.12 Aligned_cols=23 Identities=17% Similarity=0.255 Sum_probs=18.6
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHh
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~ 51 (70)
...+.|+++.+.++.+++..++.
T Consensus 5 ~~~~~f~~aw~~~dy~~m~~~~~ 27 (118)
T PF05223_consen 5 ETAEAFLEAWEKGDYAAMYELTS 27 (118)
T ss_dssp HHHHHHHHHHHTT-HHHHHHTB-
T ss_pred HHHHHHHHHHHcCCHHHHHHhhc
Confidence 45789999999999999998764
No 112
>PF12680 SnoaL_2: SnoaL-like domain; PDB: 3F40_A 3RGA_A 3G8Z_A 3DMC_A 3FH1_A 1TUH_A 3F14_A 3ER7_A 1Z1S_A 3F7X_A ....
Probab=42.68 E-value=15 Score=20.46 Aligned_cols=20 Identities=40% Similarity=0.596 Sum_probs=17.5
Q ss_pred HHHHHHHHcCCHHHHHHHHh
Q psy15934 32 KRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kLL~ 51 (70)
++|++|+..+|.+.+..++.
T Consensus 2 ~~~~~a~~~~d~~~i~~~~~ 21 (102)
T PF12680_consen 2 RRFFEAWNAGDLDAIAALFA 21 (102)
T ss_dssp HHHHHHHHTTHHHHHHHTEE
T ss_pred HHHHHHHHcCCHHHHHHHcC
Confidence 68999999999999988764
No 113
>PF02607 B12-binding_2: B12 binding domain; InterPro: IPR003759 Cobalamin-dependent methionine synthase (2.1.1.13 from EC) is a large modular protein that catalyses methyl transfer from methyltetrahydrofolate (CH3-H4folate) to homocysteine. During the catalytic cycle, it supports three distinct methyl transfer reactions, each involving the cobalamin (vitamin B12) cofactor and a substrate bound to its own functional unit []. The cobalamin cofactor plays an essential role in this reaction, accepting the methyl group from CH3-H4folate to form methylcob(III)alamin, and in turn donating the methyl group to homocysteine to generate methionine and cob(I)alamin. Methionine synthase is a large enzyme composed of four structurally and functionally distinct modules: the first two modules bind homocysteine and CH3-H4folate, the third module binds the cobalamin cofactor and the C-terminal module binds S-adenosylmethionine. The cobalamin-binding module is composed of two structurally distinct domains: a 4-helical bundle cap domain (residues 651-740 in the Escherichia coli enzyme) and an alpha/beta B12-binding domain (residues 741-896) (IPR006158 from INTERPRO). The 4-helical bundle forms a cap over the alpha/beta domain, which acts to shield the methyl ligand of cobalamin from solvent []. Furthermore, in the conversion to the active conformation of this enzyme, the 4-helical cap rotates to allow the cobalamin cofactor to bind the activation domain (IPR004223 from INTERPRO). The alpha/beta domain is a common cobalamin-binding motif, whereas the 4-helical bundle domain with its methyl cap is a distinctive feature of methionine synthases. This entry represents the 4-helical bundle cap domain. This domain is also present in other shorter proteins that bind to B12, and is always found N terminus to the alpha/beta B12-binding domain.; GO: 0008705 methionine synthase activity, 0031419 cobalamin binding, 0046872 metal ion binding, 0009086 methionine biosynthetic process; PDB: 3EZX_A 3BUL_A 1K7Y_A 1BMT_A 3IV9_A 1K98_A 3IVA_A 2I2X_P.
Probab=42.50 E-value=17 Score=20.90 Aligned_cols=25 Identities=20% Similarity=0.294 Sum_probs=19.5
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGl 54 (70)
-.++|++||..++.+.+.+++++.+
T Consensus 4 ~~~~l~~al~~~d~~~~~~~~~~~l 28 (79)
T PF02607_consen 4 LIERLLDALLAGDEEEAEALLEEAL 28 (79)
T ss_dssp HHHHHHHHHHTT-CCHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 4788999999999999888865443
No 114
>PF13474 SnoaL_3: SnoaL-like domain; PDB: 2GXF_A 3KSP_A 3KE7_A 3BB9_E 3CNX_A 3F7S_A 3GWR_B.
Probab=41.53 E-value=42 Score=19.64 Aligned_cols=23 Identities=13% Similarity=0.371 Sum_probs=19.9
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHh
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~ 51 (70)
+.+++|.+|...+|++.+..++.
T Consensus 3 ~~~~~~~~a~~~~D~~~~~~~~~ 25 (121)
T PF13474_consen 3 ALLEEWIEAFERGDIDALLSLFS 25 (121)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHEE
T ss_pred HHHHHHHHHHHhCCHHHHHHhhC
Confidence 45889999999999999999986
No 115
>PRK14156 heat shock protein GrpE; Provisional
Probab=40.31 E-value=7.8 Score=27.49 Aligned_cols=31 Identities=19% Similarity=0.355 Sum_probs=22.6
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
+..++.|+.....--+..|.. .-||||+|..
T Consensus 108 ~mi~k~l~~~L~~~GV~~i~~---~~FDP~~HEA 138 (177)
T PRK14156 108 EMVQESLIQALKEEGVEEVAV---DSFDHNLHMA 138 (177)
T ss_pred HHHHHHHHHHHHHCCCeecCC---CCCChhHhhc
Confidence 455677777777777666654 2899999985
No 116
>PRK09636 RNA polymerase sigma factor SigJ; Provisional
Probab=39.64 E-value=46 Score=23.92 Aligned_cols=26 Identities=12% Similarity=0.264 Sum_probs=22.2
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
..-.+.|++|+..||.+.+..|+.-.
T Consensus 174 ~~~v~~f~~A~~~gD~~~l~~Lla~D 199 (293)
T PRK09636 174 AELVEAFFAALASGDLDALVALLAPD 199 (293)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHhhC
Confidence 34588999999999999999997643
No 117
>PF13963 Transpos_assoc: Transposase-associated domain
Probab=39.12 E-value=18 Score=21.89 Aligned_cols=13 Identities=31% Similarity=0.759 Sum_probs=10.7
Q ss_pred HHHHhcCCCCCCC
Q psy15934 47 AKLCAKGLDPNFH 59 (70)
Q Consensus 47 ~kLL~kGlDPN~~ 59 (70)
.-|+.+||+||+.
T Consensus 58 ~HL~~~Gf~~~Y~ 70 (77)
T PF13963_consen 58 EHLVCRGFMPNYT 70 (77)
T ss_pred HHHHHhCCCCCCC
Confidence 4578999999984
No 118
>PRK09635 sigI RNA polymerase sigma factor SigI; Provisional
Probab=38.78 E-value=44 Score=24.55 Aligned_cols=24 Identities=17% Similarity=0.375 Sum_probs=21.5
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHh
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~ 51 (70)
+.-.+.|+.|...||++.+..||.
T Consensus 177 ~~~~~~f~~a~~~gd~~~l~~ll~ 200 (290)
T PRK09635 177 RVVTRAFIEACSNGDLDTLLEVLD 200 (290)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHhh
Confidence 355889999999999999999994
No 119
>PRK14139 heat shock protein GrpE; Provisional
Probab=37.85 E-value=8.4 Score=27.51 Aligned_cols=31 Identities=23% Similarity=0.376 Sum_probs=20.5
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|++......++.|.. +-.-||||+|..
T Consensus 116 i~k~l~~vL~k~Gv~~I~~-~G~~FDP~~HEA 146 (185)
T PRK14139 116 TLKQLTSAFEKGRVVEINP-VGEKFDPHQHQA 146 (185)
T ss_pred HHHHHHHHHHHCCCceeCC-CCCCCChHHhhe
Confidence 3455666666666655555 467799999974
No 120
>TIGR02960 SigX5 RNA polymerase sigma-70 factor, TIGR02960 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and one-to-a-genome distribution, to represent a conserved family.
Probab=36.68 E-value=52 Score=23.56 Aligned_cols=27 Identities=11% Similarity=0.303 Sum_probs=23.1
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcCC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kGl 54 (70)
++.++.|++|+..||.+.+..|+.-..
T Consensus 207 ~~~v~~~~~a~~~gD~~~l~~Lla~Dv 233 (324)
T TIGR02960 207 QDLLERYIAAFESYDLDALTALLHEDA 233 (324)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHhcCCe
Confidence 455899999999999999999987543
No 121
>PF04651 Pox_A12: Poxvirus A12 protein; InterPro: IPR006744 This family contains vaccinia virus protein A12 and its homologues. VVA12 is a virion protein though its function is unknown.
Probab=36.52 E-value=30 Score=25.37 Aligned_cols=23 Identities=26% Similarity=0.442 Sum_probs=18.8
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHh
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~ 51 (70)
+++|+||++.|..-.. .+..||.
T Consensus 8 RSSYDDYIeTinKitP-QLrtlLa 30 (189)
T PF04651_consen 8 RSSYDDYIETINKITP-QLRTLLA 30 (189)
T ss_pred cccHHHHHHHHHHhCH-HHHHHHH
Confidence 6999999999987655 7777774
No 122
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=36.45 E-value=71 Score=17.01 Aligned_cols=23 Identities=13% Similarity=0.157 Sum_probs=19.1
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhc
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAK 52 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~k 52 (70)
....+.++|..|+.+.+.++|+.
T Consensus 4 ~~~~i~~~i~~g~~~~a~~~~~~ 26 (58)
T smart00668 4 ERKRIRELILKGDWDEALEWLSS 26 (58)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHH
Confidence 45678899999999999999854
No 123
>TIGR02096 conserved hypothetical protein, steroid delta-isomerase-related. This family of proteins about 135 amino acids in length largely restricted to the Proteobacteria. This family and a delta5-3-ketosteroid isomerase from Pseudomonas testosteroni appear homologous, especially toward their respective N-termini. Members, therefore, probably are enzymes.
Probab=36.42 E-value=49 Score=19.90 Aligned_cols=24 Identities=17% Similarity=0.318 Sum_probs=20.3
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
-.++|++++..++.+.+.+++...
T Consensus 3 iv~~~~~a~~~~d~~~~~~~~~~d 26 (129)
T TIGR02096 3 LAQHWIEAFNRGDMDAVLALLAED 26 (129)
T ss_pred HHHHHHHHHHCCCHHHHHHhcCCC
Confidence 367899999999999999996544
No 124
>PF07739 TipAS: TipAS antibiotic-recognition domain; InterPro: IPR012925 TipAL is a bacterial transcriptional regulator of the MerR family. The tipA gene can be expressed as a long form, TipAL, and a short form, TipAS, which constitutes the C-terminal part of TipAL. TipAS forms the antibiotic-recognition domain []. This domain, which has an alpha-helical globin-like fold, is also found at the C terminus of other MerR family transcription factors, including Mta, a central regulator of multidrug resistance in Bacillus subtilis [], and SkgA from Caulobacter crescentus []. ; GO: 0003677 DNA binding, 0045892 negative regulation of transcription, DNA-dependent; PDB: 1NY9_A 3HH0_A 3QAO_A.
Probab=35.80 E-value=28 Score=21.24 Aligned_cols=16 Identities=31% Similarity=0.636 Sum_probs=10.8
Q ss_pred HHHHHHHHhcCCCCCC
Q psy15934 43 VEKIAKLCAKGLDPNF 58 (70)
Q Consensus 43 ~~kV~kLL~kGlDPN~ 58 (70)
..++..+++.|+||+.
T Consensus 41 ~~~l~~~~~~g~~p~s 56 (118)
T PF07739_consen 41 FAELAALMEEGVDPDS 56 (118)
T ss_dssp HHHHHHHHHHT--TT-
T ss_pred HHHHHHHHHcCCCcCC
Confidence 3578889999999985
No 125
>PF07890 Rrp15p: Rrp15p; InterPro: IPR012459 This family contains sequences from a number of hypothetical eukaryotic proteins of unknown function. The region featured is approximately 150 amino acids long.
Probab=35.72 E-value=62 Score=21.36 Aligned_cols=35 Identities=29% Similarity=0.428 Sum_probs=27.0
Q ss_pred CHHHHHHHhhHhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 18 DEKQLKALHTRSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 18 ~ekql~klhtk~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
-|+.|.++=||+ .-+||+||..-+...-..+-+.|
T Consensus 72 ~Er~LrkiAtrG-VVqLFNAV~~~Q~~~~~~~~~~~ 106 (130)
T PF07890_consen 72 RERRLRKIATRG-VVQLFNAVRKAQKEAEKKLEEAG 106 (130)
T ss_pred HHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHhhcc
Confidence 388999999999 67899999987776555554443
No 126
>PRK14154 heat shock protein GrpE; Provisional
Probab=35.50 E-value=8 Score=28.21 Aligned_cols=31 Identities=19% Similarity=0.234 Sum_probs=18.3
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|......--++.|....-.-||||+|..
T Consensus 140 ~k~l~~vL~k~GVe~I~~~~G~~FDP~~HEA 170 (208)
T PRK14154 140 LDLLHNTLAKHGVQVINPNPGDPFDPALHEA 170 (208)
T ss_pred HHHHHHHHHHCCCEEecCCCCCCCChhHhhe
Confidence 4445555554444444333456799999975
No 127
>PF09572 RE_XamI: XamI restriction endonuclease; InterPro: IPR019072 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry includes the XamI restriction endonuclease which recognises GTCGAC but cleavage site unknown. ; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=35.34 E-value=37 Score=25.79 Aligned_cols=25 Identities=28% Similarity=0.356 Sum_probs=19.8
Q ss_pred HHHHHHHHhcCCCCCC-CCCCCCccc
Q psy15934 43 VEKIAKLCAKGLDPNF-HCPETGEMV 67 (70)
Q Consensus 43 ~~kV~kLL~kGlDPN~-~d~~~GeTp 67 (70)
..+|.+++.+++||+. .+-.+|++|
T Consensus 67 l~kI~~iI~~llDp~~FpWl~~~~~P 92 (251)
T PF09572_consen 67 LQKICDIIEKLLDPDRFPWLGRGRAP 92 (251)
T ss_pred HHHHHHHHHHhcCcccccccCCCCCC
Confidence 6789999999999994 455666665
No 128
>PF14534 DUF4440: Domain of unknown function (DUF4440); PDB: 3HX8_A 3SOY_A 3ROB_B 3GZR_A 3B7C_A 3CU3_A 3FSD_A 2R4I_C 1TP6_A.
Probab=34.28 E-value=28 Score=19.72 Aligned_cols=23 Identities=17% Similarity=0.325 Sum_probs=20.1
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHh
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~ 51 (70)
+-++.|.+|...+|.+.+.+|+.
T Consensus 3 a~~~~~~~A~~~~D~~~~~~~~~ 25 (107)
T PF14534_consen 3 ALEEQYEDAFNAGDIDALASLYA 25 (107)
T ss_dssp HHHHHHHHHHHTTHHHHHHTTEE
T ss_pred HHHHHHHHHHHhCCHHHHHhhhC
Confidence 45788999999999999988876
No 129
>KOG0522|consensus
Probab=33.57 E-value=48 Score=27.84 Aligned_cols=35 Identities=20% Similarity=0.093 Sum_probs=31.8
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
..-||..|+.+.++-|+..|.|--..+ ..|.+||+
T Consensus 59 LhlAV~Lg~~~~a~~Ll~a~Adv~~kN-~~gWs~L~ 93 (560)
T KOG0522|consen 59 LHLAVRLGHVEAARILLSAGADVSIKN-NEGWSPLH 93 (560)
T ss_pred HHHHHHhcCHHHHHHHHhcCCCccccc-cccccHHH
Confidence 678999999999999999999998888 69999985
No 130
>PF05714 Borrelia_lipo_1: Borrelia burgdorferi virulent strain associated lipoprotein; InterPro: IPR008421 This entry consists of several virulent strain associated lipoproteins from the Lyme disease spirochete Borrelia burgdorferi.; PDB: 1W33_A 1W3Z_A.
Probab=33.56 E-value=33 Score=24.72 Aligned_cols=22 Identities=32% Similarity=0.366 Sum_probs=18.0
Q ss_pred chhhhhhhhhccC-CCHHHHHHH
Q psy15934 4 SNIANTFISLLLN-LDEKQLKAL 25 (70)
Q Consensus 4 ~~~~k~rvy~~~~-l~ekql~kl 25 (70)
+..+||++|..|+ .++..+..|
T Consensus 67 ~~~~rR~~YssLn~y~~~kIk~l 89 (211)
T PF05714_consen 67 SKNIRRIIYSSLNTYDTEKIKTL 89 (211)
T ss_dssp HHHHHHHHHHHTT-T-HHHHHHH
T ss_pred HHHHHHHHHHHHHHhhHHHHHHH
Confidence 4578999999999 999988765
No 131
>KOG0521|consensus
Probab=33.46 E-value=30 Score=29.63 Aligned_cols=34 Identities=18% Similarity=0.081 Sum_probs=29.4
Q ss_pred HHHHHcCCHHHHHHHHhcCCCCCCCCCCCCccccc
Q psy15934 35 LEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVAL 69 (70)
Q Consensus 35 ~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~ 69 (70)
-.|+..+..-.+.-||+.|.|-|..| ..|.|||+
T Consensus 661 h~a~~~~~~~~~e~ll~~ga~vn~~d-~~g~~plh 694 (785)
T KOG0521|consen 661 HVAVGTGDSGAVELLLQNGADVNALD-SKGRTPLH 694 (785)
T ss_pred hhhhccchHHHHHHHHhcCCcchhhh-ccCCCcch
Confidence 35677888888888999999999999 69999986
No 132
>PRK14163 heat shock protein GrpE; Provisional
Probab=33.17 E-value=11 Score=27.62 Aligned_cols=32 Identities=19% Similarity=0.249 Sum_probs=19.2
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
..++.|+.....--+++|.. +-.-||||+|..
T Consensus 118 mi~k~l~~~L~k~Gv~~I~~-~G~~FDP~~HEA 149 (214)
T PRK14163 118 SVAESLETTVAKLGLQQFGK-EGEPFDPTIHEA 149 (214)
T ss_pred HHHHHHHHHHHHCCCEEeCC-CCCCCChhHhce
Confidence 33445555555555544443 456799999974
No 133
>PRK08241 RNA polymerase factor sigma-70; Validated
Probab=32.61 E-value=61 Score=23.54 Aligned_cols=26 Identities=15% Similarity=0.286 Sum_probs=22.3
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcCC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kGl 54 (70)
...+.|++|...||++.+..|+.-+.
T Consensus 218 ~~v~~~~~A~~~gD~~~l~~lla~Dv 243 (339)
T PRK08241 218 ALLARYVAAFEAYDVDALVALLTEDA 243 (339)
T ss_pred HHHHHHHHHHhcCCHHHHHHHhcCCE
Confidence 34789999999999999999986654
No 134
>PRK14159 heat shock protein GrpE; Provisional
Probab=32.28 E-value=9.1 Score=27.08 Aligned_cols=29 Identities=31% Similarity=0.486 Sum_probs=17.9
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|+.....--+++|... .. ||||+|+.
T Consensus 110 ~k~l~~vL~k~Gv~~I~~~-G~-FDP~~HEA 138 (176)
T PRK14159 110 LDLFLKKLEKHGVALIKEE-KE-FDPNLHEA 138 (176)
T ss_pred HHHHHHHHHHCcCEecCCC-CC-CChHHhhh
Confidence 4555555555555555442 33 99999985
No 135
>PF13356 DUF4102: Domain of unknown function (DUF4102); PDB: 3JU0_A 3RMP_A 3JTZ_A 2KJ8_A.
Probab=32.13 E-value=25 Score=21.15 Aligned_cols=17 Identities=24% Similarity=0.528 Sum_probs=11.8
Q ss_pred HHHHHHHHhcCCCCCCC
Q psy15934 43 VEKIAKLCAKGLDPNFH 59 (70)
Q Consensus 43 ~~kV~kLL~kGlDPN~~ 59 (70)
+..+..+++.|.||+..
T Consensus 67 a~~~~~~~~~G~dP~~~ 83 (89)
T PF13356_consen 67 ARELRALVRQGIDPREE 83 (89)
T ss_dssp HHHHHHHHCTT--GGGS
T ss_pred HHHHHHHHHcCCCHHHH
Confidence 45778899999999853
No 136
>PF12893 Lumazine_bd_2: Putative lumazine-binding; PDB: 3BLZ_C 3DUK_F 3FKA_C.
Probab=32.12 E-value=39 Score=20.83 Aligned_cols=27 Identities=15% Similarity=0.431 Sum_probs=19.2
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcCCCCCC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKGLDPNF 58 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kGlDPN~ 58 (70)
++.+.+||+++..||.+.+. +.|.|++
T Consensus 7 ~~~v~~Y~dg~~~gD~~~l~----~~f~~~a 33 (116)
T PF12893_consen 7 EATVQDYFDGLYNGDSEKLR----SAFHPDA 33 (116)
T ss_dssp HHHHHHHHHHHHHT-HHHHG----GGEEEEE
T ss_pred HHHHHHHHHHHHhcCHHHHH----HhhCCCc
Confidence 45688999999999997664 4555554
No 137
>PRK14146 heat shock protein GrpE; Provisional
Probab=31.79 E-value=13 Score=26.97 Aligned_cols=33 Identities=21% Similarity=0.251 Sum_probs=21.6
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
+-.++.|++....--++.|.. +-.-||||+|..
T Consensus 138 ~mi~k~l~~~L~k~Gv~~i~~-~G~~FDP~~HeA 170 (215)
T PRK14146 138 KMILKEFYSVLEKSNVIRFDP-KGEPFDPMSMEA 170 (215)
T ss_pred HHHHHHHHHHHHHCcCeeeCC-CCCCCChhHhce
Confidence 344566666666665655543 366799999985
No 138
>PRK14151 heat shock protein GrpE; Provisional
Probab=31.74 E-value=8.8 Score=27.04 Aligned_cols=30 Identities=23% Similarity=0.422 Sum_probs=17.6
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|.+......++.|.. .-.-||||+|..
T Consensus 108 ~k~l~~~L~k~Gv~~i~~-~G~~FDP~~HEA 137 (176)
T PRK14151 108 LKMFQDTLKRYQLEAVDP-HGEPFNPEHHQA 137 (176)
T ss_pred HHHHHHHHHHCCCEEeCC-CCCCCCHHHhhc
Confidence 444555555554444443 345699999974
No 139
>CHL00102 rps20 ribosomal protein S20
Probab=31.40 E-value=85 Score=20.17 Aligned_cols=24 Identities=13% Similarity=0.408 Sum_probs=18.5
Q ss_pred hHhHHHHHHHHHHc-------CCHHHHHHHH
Q psy15934 27 TRSNLKRFLEYVNN-------SQVEKIAKLC 50 (70)
Q Consensus 27 tk~~~~df~~aV~~-------~~~~kV~kLL 50 (70)
-|+..++|..+|.. |+.+....++
T Consensus 28 ~rT~iKk~~~ai~~~~~~~~~~d~~~a~~~l 58 (93)
T CHL00102 28 VKTLIKKYLKNLEDYKTSPNSNNKKKVQETL 58 (93)
T ss_pred HHHHHHHHHHHHHhhcccCCcccHHHHHHHH
Confidence 35678899999998 8877766664
No 140
>KOG3676|consensus
Probab=31.27 E-value=33 Score=29.81 Aligned_cols=25 Identities=24% Similarity=0.206 Sum_probs=22.2
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCC
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHC 60 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d 60 (70)
-||.+.|.+-|..||++|+|-|-++
T Consensus 190 iAIv~~~~~~V~lLl~~gADV~aRa 214 (782)
T KOG3676|consen 190 IAIVNRDAELVRLLLAAGADVHARA 214 (782)
T ss_pred HHHHhccHHHHHHHHHcCCchhhHh
Confidence 5889999999999999999998654
No 141
>PF15472 DUF4638: Domain of unknown function (DUF4638)
Probab=31.05 E-value=41 Score=25.78 Aligned_cols=21 Identities=29% Similarity=0.698 Sum_probs=18.4
Q ss_pred cCCHHHHHHHH-hcCCCCCCCC
Q psy15934 40 NSQVEKIAKLC-AKGLDPNFHC 60 (70)
Q Consensus 40 ~~~~~kV~kLL-~kGlDPN~~d 60 (70)
.+|+..+.+|| +-|+|++||-
T Consensus 227 ~nNi~Tl~~lCkdaG~~vdiHp 248 (268)
T PF15472_consen 227 KNNIRTLLKLCKDAGMDVDIHP 248 (268)
T ss_pred hhhHHHHHHHHHHcCCCcccCc
Confidence 57899999999 7899999983
No 142
>COG0576 GrpE Molecular chaperone GrpE (heat shock protein) [Posttranslational modification, protein turnover, chaperones]
Probab=30.85 E-value=12 Score=26.58 Aligned_cols=30 Identities=30% Similarity=0.512 Sum_probs=16.1
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|+++...-.++.|.- .-.=||||+|-.
T Consensus 124 ~~~l~~~L~k~Gv~~i~~-~Ge~FDP~~HeA 153 (193)
T COG0576 124 LDQLLDALEKLGVEEIGP-EGEKFDPNLHEA 153 (193)
T ss_pred HHHHHHHHHHCCCEEeCC-CCCCCCHHHhhh
Confidence 344444444444444433 333599999864
No 143
>PRK10325 heat shock protein GrpE; Provisional
Probab=30.15 E-value=14 Score=26.33 Aligned_cols=30 Identities=43% Similarity=0.532 Sum_probs=18.0
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|++....-.++.|.. .-.-||||+|+.
T Consensus 127 ~~~l~~~L~~~Gv~~i~~-~G~~FDP~~HEA 156 (197)
T PRK10325 127 LKSMLDVVRKFGVEVIAE-TNVPLDPNVHQA 156 (197)
T ss_pred HHHHHHHHHHCcCeeeCC-CCCCCChhHhce
Confidence 445555555555555432 334599999985
No 144
>PRK14161 heat shock protein GrpE; Provisional
Probab=30.13 E-value=10 Score=26.75 Aligned_cols=31 Identities=16% Similarity=0.195 Sum_probs=15.6
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|......--+++|....-.-||||+|+.
T Consensus 108 ~k~l~~vL~~~Gv~~I~~~~G~~FDP~~HEA 138 (178)
T PRK14161 108 KDELDKVFHKHHIEEIKPEIGSMFDYNLHNA 138 (178)
T ss_pred HHHHHHHHHHCCCEEecCCCCCCCChHHhhh
Confidence 3344444443333333222233499999984
No 145
>PRK07417 arogenate dehydrogenase; Reviewed
Probab=29.99 E-value=66 Score=22.96 Aligned_cols=25 Identities=8% Similarity=0.186 Sum_probs=21.6
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
..+.+|.++++.+|.+.+.++++++
T Consensus 246 ~~l~~~~~~l~~~d~~~l~~~~~~~ 270 (279)
T PRK07417 246 QSLDQLEELIEQENWSALEQKLEQT 270 (279)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4578899999999999999999875
No 146
>PRK14150 heat shock protein GrpE; Provisional
Probab=29.80 E-value=14 Score=26.33 Aligned_cols=30 Identities=27% Similarity=0.434 Sum_probs=16.8
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|......--+..|.. .-.-||||+|..
T Consensus 126 ~~~l~~~L~~~Gv~~i~~-~G~~FDP~~HeA 155 (193)
T PRK14150 126 LKSLLDTVAKFGVEVVGP-VGEPFNPEVHQA 155 (193)
T ss_pred HHHHHHHHHHCCCeeeCC-CCCCCCHhHcce
Confidence 344444444444444432 344599999984
No 147
>KOG0195|consensus
Probab=29.78 E-value=56 Score=26.31 Aligned_cols=36 Identities=11% Similarity=0.053 Sum_probs=28.9
Q ss_pred HHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 34 FLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
+-++.+.|.+.-|.-||.||.--|..+ -...|||+|
T Consensus 38 lhwaakegh~aivemll~rgarvn~tn-mgddtplhl 73 (448)
T KOG0195|consen 38 LHWAAKEGHVAIVEMLLSRGARVNSTN-MGDDTPLHL 73 (448)
T ss_pred hhhhhhcccHHHHHHHHhccccccccc-CCCCcchhh
Confidence 346788899988888899999999866 344899875
No 148
>PRK05629 hypothetical protein; Validated
Probab=29.61 E-value=1.9e+02 Score=21.02 Aligned_cols=47 Identities=15% Similarity=0.090 Sum_probs=35.7
Q ss_pred hhhccCCCHHHHHHH---hhHhHHHHHHHHHHcCCHHHHHHH----HhcCCCCC
Q psy15934 11 ISLLLNLDEKQLKAL---HTRSNLKRFLEYVNNSQVEKIAKL----CAKGLDPN 57 (70)
Q Consensus 11 vy~~~~l~ekql~kl---htk~~~~df~~aV~~~~~~kV~kL----L~kGlDPN 57 (70)
.|..=.+++..+.++ ....+.-++.+||-.|+..+..++ ++.|-+|-
T Consensus 174 ~~~~~~It~e~V~~~v~~~~~~~iF~l~dAv~~g~~~~Al~~l~~l~~~g~~pi 227 (318)
T PRK05629 174 EDTQGNVTVEKVRAYYVGVAEVSGFDIADLACAGQVSKAVASTRRALQLGVSPV 227 (318)
T ss_pred hcCCCCcCHHHHHHHhCCCccchHHHHHHHHHcCCHHHHHHHHHHHHHcCCCcH
Confidence 454446888888888 556778889999999999877655 56777763
No 149
>PF01399 PCI: PCI domain; InterPro: IPR000717 A homology domain of unclear function, occurs in the C-terminal region of several regulatory components of the 26S proteasome as well as in other proteins. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15) []. Apparently, all of the characterised proteins containing PCI domains are parts of larger multi-protein complexes. Proteins with PCI domains include budding yeast proteasome regulatory components Rpn3(Sun2), Rpn5, Rpn6, Rpn7and Rpn9 []; mammalian proteasome regulatory components p55, p58 and p44.5, and translation initiation factor 3 complex subunits p110 and INT6 [, ]; Arabidopsis COP9 and FUS6/COP11 []; mammalian G-protein pathway suppressor GPS1, and several uncharacterised ORFs from plant, nematodes and mammals. The complete homology domain comprises approx. 200 residues, the highest conservation is found in the C-terminal half. Several of the proteins mentioned above have no detectable homology to the N-terminal half of the domain.; GO: 0005515 protein binding; PDB: 3TXM_A 3TXN_A 1UFM_A 3CHM_A 3T5X_A 3T5V_B.
Probab=29.33 E-value=91 Score=18.03 Aligned_cols=24 Identities=8% Similarity=0.218 Sum_probs=18.7
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
.|.++++++..|+.....+.+++.
T Consensus 2 ~~~~l~~~~~~~~~~~~~~~l~~~ 25 (105)
T PF01399_consen 2 PYSELLRAFRSGDLQEFEEFLEKH 25 (105)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHT
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHH
Confidence 467888888888888888888765
No 150
>smart00165 UBA Ubiquitin associated domain. Present in Rad23, SNF1-like kinases. The newly-found UBA in p62 is known to bind ubiquitin.
Probab=29.24 E-value=35 Score=17.18 Aligned_cols=15 Identities=33% Similarity=0.452 Sum_probs=11.7
Q ss_pred HHHHHHHHhcCCCCC
Q psy15934 43 VEKIAKLCAKGLDPN 57 (70)
Q Consensus 43 ~~kV~kLL~kGlDPN 57 (70)
.++|.+|.+-||+++
T Consensus 2 ~~~v~~L~~mGf~~~ 16 (37)
T smart00165 2 EEKIDQLLEMGFSRE 16 (37)
T ss_pred HHHHHHHHHcCCCHH
Confidence 367888888898875
No 151
>PRK14157 heat shock protein GrpE; Provisional
Probab=29.17 E-value=15 Score=27.20 Aligned_cols=31 Identities=19% Similarity=0.210 Sum_probs=20.1
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.+++|......-.+++|.. +-.-||||+|+.
T Consensus 156 i~k~l~~vL~k~GVe~I~~-~Ge~FDP~~HEA 186 (227)
T PRK14157 156 VAAKIDKAFEKFGVEKFGE-KGEDFDPTKHDA 186 (227)
T ss_pred HHHHHHHHHHHCCCEEeCC-CCCCCChhhhce
Confidence 4566666666666655543 244599999984
No 152
>PF00627 UBA: UBA/TS-N domain; InterPro: IPR000449 UBA domains are a commonly occurring sequence motif of approximately 45 amino acid residues that are found in diverse proteins involved in the ubiquitin/proteasome pathway, DNA excision-repair, and cell signalling via protein kinases []. The human homologue of yeast Rad23A is one example of a nucleotide excision-repair protein that contains both an internal and a C-terminal UBA domain. The solution structure of human Rad23A UBA(2) showed that the domain forms a compact three-helix bundle []. Comparison of the structures of UBA(1) and UBA(2) reveals that both form very similar folds and have a conserved large hydrophobic surface patch which may be a common protein-interacting surface present in diverse UBA domains. Evidence that ubiquitin binds to UBA domains leads to the prediction that the hydrophobic surface patch of UBA domains interacts with the hydrophobic surface on the five-stranded beta-sheet of ubiquitin []. This domain is similar in sequence to the N-terminal domain of translation elongation factor EF1B (or EF-Ts) from bacteria, mitochondria and chloroplasts. More information about EF1B (EF-Ts) proteins can be found at Protein of the Month: Elongation Factors [].; GO: 0005515 protein binding; PDB: 2DAI_A 2OO9_C 2JUJ_A 1WHC_A 1YLA_A 2O25_B 3K9O_A 3K9P_A 3F92_A 3E46_A ....
Probab=29.16 E-value=16 Score=18.97 Aligned_cols=15 Identities=27% Similarity=0.448 Sum_probs=10.7
Q ss_pred HHHHHHHHhcCCCCC
Q psy15934 43 VEKIAKLCAKGLDPN 57 (70)
Q Consensus 43 ~~kV~kLL~kGlDPN 57 (70)
.+.|.+|.+-||+++
T Consensus 3 ~~~v~~L~~mGf~~~ 17 (37)
T PF00627_consen 3 EEKVQQLMEMGFSRE 17 (37)
T ss_dssp HHHHHHHHHHTS-HH
T ss_pred HHHHHHHHHcCCCHH
Confidence 467888888898865
No 153
>TIGR02246 conserved hypothetical protein. This family consists of uncharacterized proteins found in a number of genera and species, including Streptomyces, Xanthomonas, Oceanobacillus iheyensis, Caulobacter crescentus CB15, and Xylella fastidiosa. The function is unknown.
Probab=29.11 E-value=96 Score=18.36 Aligned_cols=24 Identities=17% Similarity=0.155 Sum_probs=20.0
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHh
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~ 51 (70)
++-+..|++++..+|.+.+..+..
T Consensus 7 ~~l~~~~~~a~~~~D~~~~~~~~~ 30 (128)
T TIGR02246 7 RALVATWEAAWAAGDAEGFADLFT 30 (128)
T ss_pred HHHHHHHHHHHHcCCHHHHHHhhC
Confidence 344788999999999999998854
No 154
>cd08323 CARD_APAF1 Caspase activation and recruitment domain similar to that found in Apoptotic Protease-Activating Factor 1. Caspase activation and recruitment domain (CARD) similar to that found in apoptotic protease-activating factor 1 (APAF-1), which is an activator of caspase-9. APAF-1 contains WD-40 repeats, a CARD, and an ATPase domain. Upon stimulation, APAF-1, together with caspase-9, forms the heptameric 'apoptosome', which leads to the processing and activation of caspase-9, starting a caspase cascade which leads to apoptosis. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effect
Probab=29.09 E-value=93 Score=19.46 Aligned_cols=26 Identities=12% Similarity=0.146 Sum_probs=20.9
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLD 55 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlD 55 (70)
.+.-|.+|.+...-..+++||..|+.
T Consensus 61 Af~~F~~aL~~~~~~~La~lL~~~~~ 86 (86)
T cd08323 61 AYVSFYNALLHEGYKDLALLLHDAIP 86 (86)
T ss_pred HHHHHHHHHHhcCChHHHHHHhccCC
Confidence 46779999987667799999998863
No 155
>KOG2204|consensus
Probab=28.75 E-value=51 Score=28.03 Aligned_cols=25 Identities=44% Similarity=0.640 Sum_probs=18.3
Q ss_pred HHHHHHHH-----hcCCCCCCCCCCCCccc
Q psy15934 43 VEKIAKLC-----AKGLDPNFHCPETGEMV 67 (70)
Q Consensus 43 ~~kV~kLL-----~kGlDPN~~d~~~GeTp 67 (70)
+.+|++.| ..|+-||+..|.+|+..
T Consensus 355 v~~IRk~l~k~ekP~GLypnYinP~sg~wg 384 (625)
T KOG2204|consen 355 VVKIRKVLNKSEKPHGLYPNYINPSSGEWG 384 (625)
T ss_pred HHHHHHHHHhhcCCCCCCcccccCCCCchh
Confidence 34555555 45999999999999753
No 156
>PRK14525 rpsP 30S ribosomal protein S16; Provisional
Probab=28.71 E-value=52 Score=21.09 Aligned_cols=17 Identities=29% Similarity=0.493 Sum_probs=14.5
Q ss_pred CCHHHHHHHHhcCCCCC
Q psy15934 41 SQVEKIAKLCAKGLDPN 57 (70)
Q Consensus 41 ~~~~kV~kLL~kGlDPN 57 (70)
=|.++|..||.+|..|.
T Consensus 51 ln~eri~~WL~~GAqpT 67 (88)
T PRK14525 51 LKVERIEHWLKAGAKPS 67 (88)
T ss_pred EcHHHHHHHHHCCCccC
Confidence 37889999999999886
No 157
>KOG0506|consensus
Probab=28.68 E-value=59 Score=27.50 Aligned_cols=39 Identities=8% Similarity=0.160 Sum_probs=32.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
.-.+|-|...|++..+.++.-.|.|-+..| -...|.|++
T Consensus 507 ~i~~~~aa~~GD~~alrRf~l~g~D~~~~D-yD~RTaLHv 545 (622)
T KOG0506|consen 507 VINVMYAAKNGDLSALRRFALQGMDLETKD-YDDRTALHV 545 (622)
T ss_pred hhhhhhhhhcCCHHHHHHHHHhcccccccc-cccchhhee
Confidence 445888999999999999999999999888 477787763
No 158
>PRK00040 rpsP 30S ribosomal protein S16; Reviewed
Probab=28.61 E-value=50 Score=20.47 Aligned_cols=17 Identities=18% Similarity=0.339 Sum_probs=15.2
Q ss_pred CCHHHHHHHHhcCCCCC
Q psy15934 41 SQVEKIAKLCAKGLDPN 57 (70)
Q Consensus 41 ~~~~kV~kLL~kGlDPN 57 (70)
=|.+++..+|.+|..|.
T Consensus 52 ln~eri~~Wl~~GAqpt 68 (75)
T PRK00040 52 LDEERVLYWLGQGAQPT 68 (75)
T ss_pred EcHHHHHHHHHCCCccC
Confidence 37899999999999986
No 159
>PF12554 MOZART1: Mitotic-spindle organizing gamma-tubulin ring associated; InterPro: IPR022214 This family of proteins is found in eukaryotes. Proteins in this family are typically between 71 and 105 amino acids in length. There is a single completely conserved residue L that may be functionally important.
Probab=28.57 E-value=31 Score=19.90 Aligned_cols=13 Identities=31% Similarity=0.836 Sum_probs=9.5
Q ss_pred HHHHHHhcCCCCC
Q psy15934 45 KIAKLCAKGLDPN 57 (70)
Q Consensus 45 kV~kLL~kGlDPN 57 (70)
-..+||+.|.+|.
T Consensus 26 ici~L~e~GVnPe 38 (48)
T PF12554_consen 26 ICIELCENGVNPE 38 (48)
T ss_pred HHHHHHHCCCCHH
Confidence 3457888888884
No 160
>PHA02608 67 prohead core protein; Provisional
Probab=27.34 E-value=76 Score=20.39 Aligned_cols=18 Identities=11% Similarity=0.252 Sum_probs=14.1
Q ss_pred HHHHHHHHcCCHHHHHHH
Q psy15934 32 KRFLEYVNNSQVEKIAKL 49 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kL 49 (70)
++|++||++|+.=.+.+.
T Consensus 2 e~lIeAIKS~DLV~akK~ 19 (80)
T PHA02608 2 EDLIEAIKSGDLVEAKKE 19 (80)
T ss_pred hHHHHHHhcCcHHHHHHH
Confidence 679999999987666554
No 161
>PRK14153 heat shock protein GrpE; Provisional
Probab=27.26 E-value=16 Score=26.37 Aligned_cols=32 Identities=22% Similarity=0.325 Sum_probs=19.7
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
..++.|+.....-.+..|... -.-||||+|..
T Consensus 118 mi~k~~~~vL~k~Gv~~I~~~-G~~FDP~~HEA 149 (194)
T PRK14153 118 MVSKQFFSILEKYGLERIECE-GEEFDPHRHEA 149 (194)
T ss_pred HHHHHHHHHHHHCCCeeeCCC-CCCCChhHhce
Confidence 445556655555555555442 45599999985
No 162
>PRK10219 DNA-binding transcriptional regulator SoxS; Provisional
Probab=26.95 E-value=90 Score=18.86 Aligned_cols=45 Identities=7% Similarity=0.042 Sum_probs=31.8
Q ss_pred hhhhhccCCCHHHHHHHhhHhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 9 TFISLLLNLDEKQLKALHTRSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 9 ~rvy~~~~l~ekql~klhtk~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
..+.+.+++++.+|.++-.+..=..|.+++..--.+....+|..|
T Consensus 25 ~~lA~~~~~S~~~l~r~f~~~~g~s~~~~i~~~Rl~~a~~~L~~~ 69 (107)
T PRK10219 25 DVVAKKSGYSKWYLQRMFRTVTHQTLGDYIRQRRLLLAAVELRTT 69 (107)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHCcCHHHHHHHHHHHHHHHHHHcc
Confidence 345677788888888877776555566667766677777777654
No 163
>PRK14148 heat shock protein GrpE; Provisional
Probab=26.71 E-value=14 Score=26.52 Aligned_cols=30 Identities=23% Similarity=0.453 Sum_probs=16.4
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|......--+.+|.. .-.-||||+|..
T Consensus 127 ~k~l~~vL~k~Gv~~I~~-~G~~FDP~~HEA 156 (195)
T PRK14148 127 AKMLVDILKKNGVEELDP-KGEKFDPNLHEA 156 (195)
T ss_pred HHHHHHHHHHCCCEEeCC-CCCCCChhHhhe
Confidence 444444444444444332 234599999974
No 164
>PRK07502 cyclohexadienyl dehydrogenase; Validated
Probab=26.60 E-value=83 Score=22.68 Aligned_cols=26 Identities=8% Similarity=0.134 Sum_probs=20.8
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
...+.+|.++++.+|.+.+.++++++
T Consensus 256 ~~~l~~~~~~l~~~d~~~l~~~~~~~ 281 (307)
T PRK07502 256 TEDLAALQRAIRWGDGDALFDLFTRT 281 (307)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 34577788999999999999887654
No 165
>PRK14140 heat shock protein GrpE; Provisional
Probab=26.08 E-value=18 Score=25.99 Aligned_cols=30 Identities=30% Similarity=0.511 Sum_probs=17.5
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|+.....-.+..|.. +-.-||||+|..
T Consensus 124 ~k~l~~~L~k~GV~~i~~-~Ge~FDP~~HEA 153 (191)
T PRK14140 124 HRQLLEALKKEGVEVIEA-VGEQFDPNLHQA 153 (191)
T ss_pred HHHHHHHHHHCCCEeeCC-CCCCCChHHhcc
Confidence 444555555444444432 345699999985
No 166
>PF06896 DUF1268: Protein of unknown function (DUF1268); InterPro: IPR009681 This entry is represented by Bacteriophage bIL286, Orf51. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacterial and phage proteins of around 115 residues in length. The function of this family is unknown.
Probab=25.83 E-value=1.7e+02 Score=19.40 Aligned_cols=29 Identities=31% Similarity=0.537 Sum_probs=19.8
Q ss_pred hhhhccCCCHHHHHHHhhHhHHHHHHHHHH
Q psy15934 10 FISLLLNLDEKQLKALHTRSNLKRFLEYVN 39 (70)
Q Consensus 10 rvy~~~~l~ekql~klhtk~~~~df~~aV~ 39 (70)
++-.-++|+.+|..+| ---..++|.+++.
T Consensus 61 Fi~~iL~L~dkq~ekl-~~iD~~~~~e~~~ 89 (114)
T PF06896_consen 61 FIQDILKLNDKQVEKL-EDIDFEDLQEIVS 89 (114)
T ss_pred HHHHHHCCCHHHHHHH-hcCCHHHHHHHHH
Confidence 4455677888888888 4466777766654
No 167
>PHA01976 helix-turn-helix protein
Probab=25.78 E-value=92 Score=17.17 Aligned_cols=48 Identities=13% Similarity=0.159 Sum_probs=29.7
Q ss_pred hccCCCHHHHHHH--hhHhHHHHHHHHHHcCCHHHHHHHHh-cCCCCCCCC
Q psy15934 13 LLLNLDEKQLKAL--HTRSNLKRFLEYVNNSQVEKIAKLCA-KGLDPNFHC 60 (70)
Q Consensus 13 ~~~~l~ekql~kl--htk~~~~df~~aV~~~~~~kV~kLL~-kGlDPN~~d 60 (70)
...+++-++||+. -+++.+-++......=+.+.+.++++ -|++|+.--
T Consensus 12 ~~~glt~~~lA~~~gvs~~~v~~~e~g~~~p~~~~l~~ia~~l~v~~~~l~ 62 (67)
T PHA01976 12 NARAWSAPELSRRAGVRHSLIYDFEADKRLPNLKTLLRLADALGVTLDWLC 62 (67)
T ss_pred HHcCCCHHHHHHHhCCCHHHHHHHHcCCCCCCHHHHHHHHHHHCcCHHHHh
Confidence 4457888888876 35666666554444345566666663 377777543
No 168
>KOG0507|consensus
Probab=25.75 E-value=60 Score=28.53 Aligned_cols=33 Identities=15% Similarity=-0.082 Sum_probs=28.1
Q ss_pred HHHcCCHHHHHHHHhcCCCCCCCCCCCCcccccC
Q psy15934 37 YVNNSQVEKIAKLCAKGLDPNFHCPETGEMVALI 70 (70)
Q Consensus 37 aV~~~~~~kV~kLL~kGlDPN~~d~~~GeTpL~L 70 (70)
|.-.|+-+-|.-+|-.|--+|... ..|+|||++
T Consensus 89 aaw~g~~e~vkmll~q~d~~na~~-~e~~tplhl 121 (854)
T KOG0507|consen 89 AAWNGNLEIVKMLLLQTDILNAVN-IENETPLHL 121 (854)
T ss_pred hhhcCcchHHHHHHhcccCCCccc-ccCcCccch
Confidence 456788888888899998899876 799999985
No 169
>TIGR00002 S16 ribosomal protein S16. This model describes ribosomal S16 of bacteria and organelles.
Probab=25.68 E-value=63 Score=20.14 Aligned_cols=18 Identities=33% Similarity=0.503 Sum_probs=15.2
Q ss_pred cCCHHHHHHHHhcCCCCC
Q psy15934 40 NSQVEKIAKLCAKGLDPN 57 (70)
Q Consensus 40 ~~~~~kV~kLL~kGlDPN 57 (70)
.=|.+++..+|.+|.-|.
T Consensus 48 ~l~~~ri~~Wl~~GAqps 65 (78)
T TIGR00002 48 KLNVERIKYWLSKGAQPT 65 (78)
T ss_pred EEcHHHHHHHHHCCCccC
Confidence 346889999999999986
No 170
>cd08332 CARD_CASP2 Caspase activation and recruitment domain of Caspase-2. Caspase activation and recruitment domain (CARD) similar to that found in caspase-2. Caspases are aspartate-specific cysteine proteases with functions in apoptosis and immune signaling. Caspase-2 (also known as ICH1, NEDD2, or CASP2) is one of the most evolutionarily conserved caspases, and plays a role in apoptosis, DNA damage response, cell cycle regulation, and tumor suppression. It is localized in the nucleus and exhibits properties of both an initiator and an effector caspase. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and
Probab=25.49 E-value=1e+02 Score=19.09 Aligned_cols=31 Identities=26% Similarity=0.396 Sum_probs=22.9
Q ss_pred HHHHhhHh--HHHHHHHHHHcCCHHHHHHHHhc
Q psy15934 22 LKALHTRS--NLKRFLEYVNNSQVEKIAKLCAK 52 (70)
Q Consensus 22 l~klhtk~--~~~df~~aV~~~~~~kV~kLL~k 52 (70)
|..|=+++ .+..|.+|.+..+-.-+++||+|
T Consensus 57 ld~L~~RG~~AF~~F~~aL~~~~~~~La~lL~~ 89 (90)
T cd08332 57 LNLLPKRGPRAFSAFCEALRETSQEHLCDLLEK 89 (90)
T ss_pred HHHHHHhChhHHHHHHHHHHhcChHHHHHHHhh
Confidence 34445553 57789999988777889999876
No 171
>PRK14145 heat shock protein GrpE; Provisional
Probab=25.33 E-value=19 Score=26.04 Aligned_cols=31 Identities=19% Similarity=0.284 Sum_probs=18.1
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|......--++.|.. +-.-||||+|..
T Consensus 128 i~k~l~~vL~k~GVe~I~~-~Ge~FDP~~HEA 158 (196)
T PRK14145 128 IYRQFKKILDKFGVKEIEA-EGQIFDPYKHHA 158 (196)
T ss_pred HHHHHHHHHHHCCCEEeCC-CCCCCCchhhhe
Confidence 3444555555444444443 356799999984
No 172
>PRK14134 recX recombination regulator RecX; Provisional
Probab=25.18 E-value=95 Score=23.03 Aligned_cols=55 Identities=16% Similarity=0.172 Sum_probs=38.0
Q ss_pred hhhhhhhhccCCCHHHHHHHhhHhHHHH----HHHHHH---cCCHHHHHHHHhcCCCCCCCC
Q psy15934 6 IANTFISLLLNLDEKQLKALHTRSNLKR----FLEYVN---NSQVEKIAKLCAKGLDPNFHC 60 (70)
Q Consensus 6 ~~k~rvy~~~~l~ekql~klhtk~~~~d----f~~aV~---~~~~~kV~kLL~kGlDPN~~d 60 (70)
+.+-++.+..-||+.++..+=....+.+ -+.++. .-..+=..+|.++|++|+.++
T Consensus 35 l~~~~L~kG~eld~e~~~ei~~~~~~~~a~~~AL~~Ls~r~rSe~Elr~KL~~k~~~~~~Ie 96 (283)
T PRK14134 35 VYYHNLKKGKVIDVNSLNDIIKEDNYIKCKGYALKYIEKSYKTEKQIKEKLYLKEYDEDAVN 96 (283)
T ss_pred HHHhCCcCCCCcCHHHHHHHHHHHHHHHHHHHHHHHhccCcchHHHHHHHHHhCCCCHHHHH
Confidence 5677888999999999999987666555 333333 233344556677999987654
No 173
>PF02153 PDH: Prephenate dehydrogenase; InterPro: IPR003099 Members of this family are prephenate dehydrogenases 1.3.1.12 from EC involved in tyrosine biosynthesis. ; GO: 0004665 prephenate dehydrogenase (NADP+) activity, 0008977 prephenate dehydrogenase activity, 0006571 tyrosine biosynthetic process, 0055114 oxidation-reduction process; PDB: 2F1K_B 2PV7_A 3DZB_B 3KTD_B 3B1F_A 2G5C_D 3GGP_C 3GGG_C 3GGO_D.
Probab=24.95 E-value=1.1e+02 Score=21.72 Aligned_cols=24 Identities=17% Similarity=0.253 Sum_probs=20.2
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHh
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCA 51 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~ 51 (70)
...+.+|.++++.++.+.+.++++
T Consensus 234 ~~~L~~l~~~l~~~d~~~l~~~l~ 257 (258)
T PF02153_consen 234 IKELNELREALEAGDEEELEELLE 257 (258)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHhc
Confidence 356778889999999999999886
No 174
>COG5626 Uncharacterized small conserved protein [Function unknown]
Probab=24.94 E-value=77 Score=20.95 Aligned_cols=24 Identities=29% Similarity=0.349 Sum_probs=20.2
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGl 54 (70)
+-+--+||..++.++|..++.-|+
T Consensus 38 Ll~VgeAiA~D~~~~V~~Wise~l 61 (97)
T COG5626 38 LLEVGEAIAGDDREKVAAWISEGL 61 (97)
T ss_pred HHHHHHHHhcCcHHHHHHHHhccC
Confidence 445568999999999999998876
No 175
>PRK14956 DNA polymerase III subunits gamma and tau; Provisional
Probab=24.93 E-value=1.1e+02 Score=24.95 Aligned_cols=30 Identities=20% Similarity=0.253 Sum_probs=22.3
Q ss_pred HhHHHHHHHHHHcCCH-----HHHHHHHhcCCCCC
Q psy15934 28 RSNLKRFLEYVNNSQV-----EKIAKLCAKGLDPN 57 (70)
Q Consensus 28 k~~~~df~~aV~~~~~-----~kV~kLL~kGlDPN 57 (70)
...+.+|++++..++. ..+.+|++.|.||.
T Consensus 248 ~~~~~~l~~si~~~d~~~~al~~l~~l~~~G~d~~ 282 (484)
T PRK14956 248 IEFLTSFIKSLIDPDNHSKSLEILESLYQEGQDIY 282 (484)
T ss_pred HHHHHHHHHHHHcCCcHHHHHHHHHHHHHcCCCHH
Confidence 3446778888887543 36788899999985
No 176
>KOG3835|consensus
Probab=24.76 E-value=97 Score=25.53 Aligned_cols=38 Identities=24% Similarity=0.433 Sum_probs=30.5
Q ss_pred CCCHHHHHHHhhHhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 16 NLDEKQLKALHTRSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 16 ~l~ekql~klhtk~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
-|.|=||-++=.|||+-.|++....---+-|.+||+-|
T Consensus 8 TLgElQLyrVLqkANLlsYyd~FIqQGGDDvqQlceag 45 (495)
T KOG3835|consen 8 TLGELQLYRVLQKANLLSYYDVFIQQGGDDVQQLCEAG 45 (495)
T ss_pred chHHHHHHHHHHHhhHHHHHHHHHHhcchHHHHHHHhh
Confidence 46789999999999999999876654455788888765
No 177
>PRK11199 tyrA bifunctional chorismate mutase/prephenate dehydrogenase; Provisional
Probab=24.49 E-value=90 Score=23.68 Aligned_cols=26 Identities=12% Similarity=0.088 Sum_probs=22.5
Q ss_pred HhHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 28 RSNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 28 k~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
..++.+|.+++..+|.+++.++++++
T Consensus 321 ~~~l~~l~~~l~~~d~~~l~~~~~~a 346 (374)
T PRK11199 321 YQRFGEALELLEQGDKQAFIDSFRKV 346 (374)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 35678899999999999999999875
No 178
>KOG2384|consensus
Probab=24.07 E-value=62 Score=24.27 Aligned_cols=35 Identities=14% Similarity=0.045 Sum_probs=28.5
Q ss_pred HHHHHHcCCHHHHHHHHhcC-CCCCCCCCCCCccccc
Q psy15934 34 FLEYVNNSQVEKIAKLCAKG-LDPNFHCPETGEMVAL 69 (70)
Q Consensus 34 f~~aV~~~~~~kV~kLL~kG-lDPN~~d~~~GeTpL~ 69 (70)
.|-|.-.|+.+.+.=|+.|| .+|-..| ++|.+.+.
T Consensus 16 lmcaa~eg~~eavsyllgrg~a~vgv~d-~ssldaaq 51 (223)
T KOG2384|consen 16 LMCAAMEGSNEAVSYLLGRGVAFVGVTD-ESSLDAAQ 51 (223)
T ss_pred HHHHhhhcchhHHHHHhccCcccccccc-cccchHHH
Confidence 35577789999999999999 9999877 68876653
No 179
>PRK14144 heat shock protein GrpE; Provisional
Probab=23.90 E-value=17 Score=26.28 Aligned_cols=31 Identities=29% Similarity=0.473 Sum_probs=20.7
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|+.....-.+..|.. +-.-||||+|..
T Consensus 130 i~k~l~~~L~k~GV~~I~~-~G~~FDP~~HEA 160 (199)
T PRK14144 130 TMKLFLDALQKFDVEQIDP-LGQTFDPQQHEA 160 (199)
T ss_pred HHHHHHHHHHHCCCEEeCC-CCCCCChhHhce
Confidence 3456666666666666554 456799999975
No 180
>CHL00005 rps16 ribosomal protein S16
Probab=23.70 E-value=67 Score=20.30 Aligned_cols=18 Identities=28% Similarity=0.344 Sum_probs=15.1
Q ss_pred cCCHHHHHHHHhcCCCCC
Q psy15934 40 NSQVEKIAKLCAKGLDPN 57 (70)
Q Consensus 40 ~~~~~kV~kLL~kGlDPN 57 (70)
.=|.++|..+|.+|.-|.
T Consensus 47 ~ln~eri~~Wl~~GAqpt 64 (82)
T CHL00005 47 YLNVPAILYFLEKGAQPT 64 (82)
T ss_pred EEeHHHHHHHHHCcCccC
Confidence 347889999999999886
No 181
>PRK14142 heat shock protein GrpE; Provisional
Probab=23.49 E-value=24 Score=26.15 Aligned_cols=31 Identities=10% Similarity=0.133 Sum_probs=22.0
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|+.....--++.|.. +-.-||||+|..
T Consensus 113 I~kqL~~iLek~GVe~I~~-~Ge~FDP~~HEA 143 (223)
T PRK14142 113 VADKLDSALTGLGLVAFGA-EGEDFDPVLHEA 143 (223)
T ss_pred HHHHHHHHHHHCCCEEeCC-CCCCCChhhhce
Confidence 4667777777666666643 467799999974
No 182
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=23.32 E-value=96 Score=14.32 Aligned_cols=19 Identities=16% Similarity=0.372 Sum_probs=11.6
Q ss_pred HHcCCHHHHHHH----HhcCCCC
Q psy15934 38 VNNSQVEKIAKL----CAKGLDP 56 (70)
Q Consensus 38 V~~~~~~kV~kL----L~kGlDP 56 (70)
++.|+.+.+.++ .++|+.|
T Consensus 12 ~~~g~~~~a~~~~~~M~~~gv~P 34 (34)
T PF13812_consen 12 AKAGDPDAALQLFDEMKEQGVKP 34 (34)
T ss_pred HHCCCHHHHHHHHHHHHHhCCCC
Confidence 446666665544 5677766
No 183
>PF07862 Nif11: Nitrogen fixation protein of unknown function; InterPro: IPR012903 This domain is found in the cyanobacteria, and the nitrogen-fixing proteobacterium Azotobacter vinelandii and may be involved in nitrogen fixation, but no role has been assigned [].
Probab=23.14 E-value=83 Score=17.01 Aligned_cols=29 Identities=34% Similarity=0.539 Sum_probs=15.9
Q ss_pred hHhHHHHHHHHHHcC-----------CHHHHHHHH-hcCCC
Q psy15934 27 TRSNLKRFLEYVNNS-----------QVEKIAKLC-AKGLD 55 (70)
Q Consensus 27 tk~~~~df~~aV~~~-----------~~~kV~kLL-~kGlD 55 (70)
++.+++.|+..|+.+ +.+.+.++. +.|++
T Consensus 2 S~~~l~~Fl~~~~~d~~l~~~l~~~~~~~e~~~lA~~~Gy~ 42 (49)
T PF07862_consen 2 SIESLKAFLEKVKSDPELREQLKACQNPEEVVALAREAGYD 42 (49)
T ss_pred CHHHHHHHHHHHhcCHHHHHHHHhcCCHHHHHHHHHHcCCC
Confidence 345566666666532 555555554 45554
No 184
>KOG2785|consensus
Probab=23.05 E-value=1e+02 Score=24.80 Aligned_cols=21 Identities=5% Similarity=0.036 Sum_probs=17.8
Q ss_pred hhhhhhhccCCCHHHHHHHhh
Q psy15934 7 ANTFISLLLNLDEKQLKALHT 27 (70)
Q Consensus 7 ~k~rvy~~~~l~ekql~klht 27 (70)
+||||+..+++...+++.=|.
T Consensus 32 LKRkVA~lPPItaE~F~~k~~ 52 (390)
T KOG2785|consen 32 LKRKVASLPPITAEEFNEKVL 52 (390)
T ss_pred HHhHhhcCCCcCHHHHhHHHh
Confidence 799999999999988865544
No 185
>PF03013 Pyr_excise: Pyrimidine dimer DNA glycosylase; InterPro: IPR004260 Pyrimidine dimer DNA glycosylases are enzymes responsible for initiating the base excision repair pathway, excising pyrimidine dimers by hydrolysis of the glycosylic bond of the 5' pyrimidine, followed by the intra-pyrimidine phosphodiester bond []. One such enzyme is T4 endonuclease V, an enzyme responsible for the first step of a pyrimidine-dimer-specific excision-repair pathway []. Bacteriophage T4 that are deficient in these enzymes are extremely sensitive to UV.; PDB: 2FCC_B 1ENJ_A 1ENI_A 1ENK_A 1VAS_A 2END_A.
Probab=22.97 E-value=52 Score=22.27 Aligned_cols=26 Identities=19% Similarity=0.249 Sum_probs=14.3
Q ss_pred HHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 36 EYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 36 ~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.+.......-+.+|..||..||+.+.
T Consensus 59 ~~L~~rh~~l~~EM~~RGY~~~~~~~ 84 (130)
T PF03013_consen 59 YYLYKRHQLLMAEMQRRGYKPNSPWF 84 (130)
T ss_dssp HHHHHHHHHHHHHHHHTT---S--S-
T ss_pred HHHHHHHHHHHHHHHHcCCCCChhhh
Confidence 34444455678999999999998764
No 186
>TIGR00370 conserved hypothetical protein TIGR00370.
Probab=22.94 E-value=43 Score=23.87 Aligned_cols=23 Identities=9% Similarity=0.222 Sum_probs=19.4
Q ss_pred hhhhccCCCHHHHHHHhhHhHHH
Q psy15934 10 FISLLLNLDEKQLKALHTRSNLK 32 (70)
Q Consensus 10 rvy~~~~l~ekql~klhtk~~~~ 32 (70)
+|-+..+|++.++..+||.+.|.
T Consensus 95 ~vA~~~gLs~eevi~~Hs~~~y~ 117 (202)
T TIGR00370 95 EVAKINQLSPEEVIDIHSNGEYV 117 (202)
T ss_pred HHHHHhCcCHHHHHHHHhCCceE
Confidence 56678889999999999998775
No 187
>cd07218 Pat_iPLA2 Calcium-independent phospholipase A2; Classified as Group IVA-1 PLA2. Calcium-independent phospholipase A2; otherwise known as Group IVA-1 PLA2. It contains the lipase consensus sequence (Gly-X-Ser-X-Gly);mutagenesis experiments confirm the role of this serine as a nucleophile. Some members of this group show triacylglycerol lipase activity (EC 3:1:1:3). Members include iPLA-1, iPLA-2, and iPLA-3 from Aedes aegypti and show acylglycerol transacylase/lipase activity. Also includes putative iPLA2-eta from Pediculus humanus corporis which shows patatin-like phospholipase activity.
Probab=22.88 E-value=87 Score=22.59 Aligned_cols=29 Identities=28% Similarity=0.610 Sum_probs=22.7
Q ss_pred hHhHHHHHHHHHHcCCHHHHHHHHhcCCC
Q psy15934 27 TRSNLKRFLEYVNNSQVEKIAKLCAKGLD 55 (70)
Q Consensus 27 tk~~~~df~~aV~~~~~~kV~kLL~kGlD 55 (70)
|..|+.++.+++---+.+.+.+||++|++
T Consensus 205 ~~~n~~r~~~~l~pp~~~~l~~~~~~g~~ 233 (245)
T cd07218 205 SRQNIYRLVRILFPPRPEVLSSLCQQGFD 233 (245)
T ss_pred eHHHHHHHHHhhcCCCHHHHHHHHHhhhH
Confidence 34677788888888888888888888864
No 188
>KOG4023|consensus
Probab=22.80 E-value=40 Score=22.70 Aligned_cols=22 Identities=14% Similarity=0.297 Sum_probs=15.9
Q ss_pred hhHhHHHHHHHHHHcCCHHHHH
Q psy15934 26 HTRSNLKRFLEYVNNSQVEKIA 47 (70)
Q Consensus 26 htk~~~~df~~aV~~~~~~kV~ 47 (70)
|=.+.|+.||+|++.+-+-..-
T Consensus 75 ~Y~Gdye~F~ea~E~ntl~eFL 96 (108)
T KOG4023|consen 75 QYCGDYELFFEAVEQNTLQEFL 96 (108)
T ss_pred cccccHHHHHHHHHHHHHHHHH
Confidence 4467899999999776554433
No 189
>PRK08507 prephenate dehydrogenase; Validated
Probab=22.76 E-value=1.1e+02 Score=21.66 Aligned_cols=25 Identities=16% Similarity=0.399 Sum_probs=20.4
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
..+.+|.+++..+|.+.+.+++++|
T Consensus 244 ~~l~~~~~~l~~~d~~~~~~~~~~~ 268 (275)
T PRK08507 244 KELEQFKQLIENEDWEELEEWMEQA 268 (275)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3467788888999999999998765
No 190
>PRK14149 heat shock protein GrpE; Provisional
Probab=22.65 E-value=18 Score=26.06 Aligned_cols=30 Identities=30% Similarity=0.578 Sum_probs=17.6
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|+.....--++.|... .. ||||+|..
T Consensus 122 i~k~l~~vL~k~GV~~I~~~-G~-FDP~~HEA 151 (191)
T PRK14149 122 TMEKLHEVLARHGIEGIECL-EE-FDPNFHNA 151 (191)
T ss_pred HHHHHHHHHHHCCCEEeCCC-CC-CChHHhhe
Confidence 34455555555555555442 33 99999974
No 191
>KOG0514|consensus
Probab=22.64 E-value=41 Score=27.49 Aligned_cols=30 Identities=13% Similarity=0.302 Sum_probs=19.2
Q ss_pred cCCHHHHHHHHhcCC-CCCCCCCCCCcccccC
Q psy15934 40 NSQVEKIAKLCAKGL-DPNFHCPETGEMVALI 70 (70)
Q Consensus 40 ~~~~~kV~kLL~kGl-DPN~~d~~~GeTpL~L 70 (70)
+.|-+-|..||+-|. |-|-++ .-|+||.||
T Consensus 278 HaNF~VV~~LLDSgvC~VD~qN-rAGYtpiML 308 (452)
T KOG0514|consen 278 HANFDVVSILLDSGVCDVDQQN-RAGYTPVML 308 (452)
T ss_pred ccchHHHHHHhccCcccccccc-cccccHHHH
Confidence 456677777777774 455444 567777664
No 192
>PF11815 DUF3336: Domain of unknown function (DUF3336); InterPro: IPR021771 This family of proteins is characterised by an N-terminal domain that is found adjacent to the patatin/phospholipase A2-related domain (see PF01734 from PFAM). The family of proteins, which contain these two domains, have been characterised in Saccharomyces cerevisiae (Baker's yeast) as a bifunctional enzyme with triacylglycerol lipase and lysophosphatidic acid acyltransferase or lysophosphatidylethanolamine acyltransferase activity. They are generally involved in triacylglycerol mobilisation and localized to lipid particles [, ].
Probab=22.54 E-value=1.3e+02 Score=20.23 Aligned_cols=32 Identities=22% Similarity=0.307 Sum_probs=27.7
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcCCCCCCCC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKGLDPNFHC 60 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d 60 (70)
..+..+-+|-..||++.+.-+|+-|+-.|+..
T Consensus 61 ~rl~~L~~aR~~~d~~~l~~~Lr~~l~RNlgg 92 (145)
T PF11815_consen 61 ERLRELREARQSGDIEALMFLLRTGLHRNLGG 92 (145)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHhcCccc
Confidence 44677888889999999999999999999864
No 193
>PF13877 RPAP3_C: Potential Monad-binding region of RPAP3
Probab=22.43 E-value=1.3e+02 Score=18.07 Aligned_cols=27 Identities=22% Similarity=0.539 Sum_probs=21.3
Q ss_pred HHHHHHHHcCCHHHHHHHHhcCCCCCC
Q psy15934 32 KRFLEYVNNSQVEKIAKLCAKGLDPNF 58 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kLL~kGlDPN~ 58 (70)
+...+|+..-..+.+.++..-|+||++
T Consensus 22 ~~~~~yL~~i~p~~l~~if~~~l~~~~ 48 (94)
T PF13877_consen 22 EERYEYLKSIPPDSLPKIFKNSLEPEF 48 (94)
T ss_pred HHHHHHHHhCChHHHHHHHHccCCHHH
Confidence 366677777788888888888888875
No 194
>PRK14160 heat shock protein GrpE; Provisional
Probab=22.33 E-value=21 Score=26.01 Aligned_cols=30 Identities=30% Similarity=0.503 Sum_probs=19.3
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|......--++.|... . -||||+|..
T Consensus 144 i~kql~~vL~k~GVe~I~~~-G-~FDP~~HEA 173 (211)
T PRK14160 144 TVKQFKTSLEKLGVEEISTE-G-EFDPNLHNA 173 (211)
T ss_pred HHHHHHHHHHHCCCEEeCCC-C-CCChHHhce
Confidence 35556666666666555442 3 599999984
No 195
>PRK14158 heat shock protein GrpE; Provisional
Probab=22.25 E-value=18 Score=26.04 Aligned_cols=31 Identities=19% Similarity=0.304 Sum_probs=15.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|.+....--++.|....-.-||||+|..
T Consensus 126 ~k~l~~vLek~Gv~~I~~~~G~~FDP~~HEA 156 (194)
T PRK14158 126 LSMLLSTLKKFGVTPVEAEKGTPFDPAYHQA 156 (194)
T ss_pred HHHHHHHHHHCCCEEecCCCCCCCChHHhhh
Confidence 3344444444444333322233499999964
No 196
>PRK14162 heat shock protein GrpE; Provisional
Probab=21.89 E-value=22 Score=25.61 Aligned_cols=30 Identities=20% Similarity=0.416 Sum_probs=16.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
++.|......-.+..|.. +-.-||||+|..
T Consensus 126 ~k~l~~vL~~~GV~~I~~-~G~~FDP~~HEA 155 (194)
T PRK14162 126 LDHLVKALKDHGVTEIKA-DGEKFDPTLHQA 155 (194)
T ss_pred HHHHHHHHHHCCCEEeCC-CCCCCChhHhhh
Confidence 334444444444444433 345699999975
No 197
>PRK05574 holA DNA polymerase III subunit delta; Reviewed
Probab=21.59 E-value=2.1e+02 Score=20.35 Aligned_cols=40 Identities=8% Similarity=0.211 Sum_probs=28.5
Q ss_pred CCHHHHHHHhhH---hHHHHHHHHHHcCCHHHHH----HHHhcCCCC
Q psy15934 17 LDEKQLKALHTR---SNLKRFLEYVNNSQVEKIA----KLCAKGLDP 56 (70)
Q Consensus 17 l~ekql~klhtk---~~~~df~~aV~~~~~~kV~----kLL~kGlDP 56 (70)
++...+.++-.. .+.-+|++|+..++..+.- +|+..|.||
T Consensus 201 It~~~I~~~i~~~~~~~~f~l~dai~~~~~~~a~~~l~~l~~~~~~~ 247 (340)
T PRK05574 201 ITLEDVEEAVPDSARFDVFDLVDAILAGKIKRALRILDGLRLEGEEP 247 (340)
T ss_pred CCHHHHHHHHhhhhcCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCcH
Confidence 777777776443 5677899999999987544 445556665
No 198
>smart00796 AHS1 Allophanate hydrolase subunit 1. This domain represents subunit 1 of allophanate hydrolase (AHS1).
Probab=21.56 E-value=47 Score=23.46 Aligned_cols=23 Identities=17% Similarity=0.349 Sum_probs=19.0
Q ss_pred hhhhccCCCHHHHHHHhhHhHHH
Q psy15934 10 FISLLLNLDEKQLKALHTRSNLK 32 (70)
Q Consensus 10 rvy~~~~l~ekql~klhtk~~~~ 32 (70)
+|-+..+|++.++.++||.+.|.
T Consensus 105 ~vA~~~gLs~~evi~~Hs~~~y~ 127 (201)
T smart00796 105 FVARHNGLSVDEVIRLHSAAEYR 127 (201)
T ss_pred HHHHHhCcCHHHHHHHHhCCCeE
Confidence 45677889999999999998774
No 199
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=21.48 E-value=1.3e+02 Score=22.95 Aligned_cols=24 Identities=17% Similarity=0.259 Sum_probs=19.7
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhc
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAK 52 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~k 52 (70)
.....|..+|...|.+.+..+|++
T Consensus 12 ~~q~~F~~~v~~~Dp~~l~~ll~~ 35 (360)
T PF04910_consen 12 EAQEQFYAAVQSHDPNALINLLQK 35 (360)
T ss_pred HHHHHHHHHHHccCHHHHHHHHHH
Confidence 456779999999999999998843
No 200
>COG1393 ArsC Arsenate reductase and related proteins, glutaredoxin family [Inorganic ion transport and metabolism]
Probab=21.41 E-value=57 Score=21.34 Aligned_cols=27 Identities=15% Similarity=0.036 Sum_probs=19.6
Q ss_pred cCCHHHHHHHH-hcCCCCCCCCCCCCcccc
Q psy15934 40 NSQVEKIAKLC-AKGLDPNFHCPETGEMVA 68 (70)
Q Consensus 40 ~~~~~kV~kLL-~kGlDPN~~d~~~GeTpL 68 (70)
++.+.+..++| +.|.+|.+||= .++|+
T Consensus 11 C~t~rka~~~L~~~gi~~~~~~y--~~~~~ 38 (117)
T COG1393 11 CSTCRKALAWLEEHGIEYTFIDY--LKTPP 38 (117)
T ss_pred ChHHHHHHHHHHHcCCCcEEEEe--ecCCC
Confidence 45667777777 89999999873 44444
No 201
>PF03489 SapB_2: Saposin-like type B, region 2; InterPro: IPR008138 Saposins are small lysosomal proteins that serve as activators of various lysosomal lipid-degrading enzymes []. They probably act by isolating the lipid substrate from the membrane surroundings, thus making it more accessible to the soluble degradative enzymes. All mammalian saposins are synthesized as a single precursor molecule (prosaposin) which contains four Saposin-B domains, yielding the active saposins after proteolytic cleavage, and two Saposin-A domains that are removed in the activation reaction. The Saposin-B domains also occur in other proteins, many of them active in the lysis of membranes [, ].; PDB: 3BQQ_A 2RB3_B 2R0R_A 3BQP_A 2R1Q_A 1NKL_A 1L9L_A 1QDM_C 3RFI_A 4DDJ_A ....
Probab=21.29 E-value=48 Score=16.68 Aligned_cols=18 Identities=39% Similarity=0.571 Sum_probs=12.7
Q ss_pred HHHHHHHHhcCCCCCCCC
Q psy15934 43 VEKIAKLCAKGLDPNFHC 60 (70)
Q Consensus 43 ~~kV~kLL~kGlDPN~~d 60 (70)
.+.|.+++.+++||+..+
T Consensus 12 ~~~ii~~l~~~~~p~~iC 29 (35)
T PF03489_consen 12 GPQIIQLLEKQLDPQQIC 29 (35)
T ss_dssp HHHHHHHHHTTSTHHHHH
T ss_pred HHHHHHHHHhcCChHHHH
Confidence 356777888888887554
No 202
>cd07221 Pat_PNPLA3 Patatin-like phospholipase domain containing protein 3. PNPLA3 is a triacylglycerol lipase that mediates triacylglycerol hydrolysis in adipocytes and is an indicator of the nutritional state. PNPLA3 is also known as adiponutrin (ADPN) or iPLA2-epsilon. Human adiponutrins are bound to the cell membrane of adipocytes and show transacylase, TG hydrolase, and PLA2 activity. This family includes patatin-like proteins: ADPN (adiponutrin) from mammals, PNPLA3 (Patatin-like phospholipase domain-containing protein 3), and iPLA2-epsilon (Calcium-independent phospholipase A2) from Homo sapiens.
Probab=21.20 E-value=88 Score=22.72 Aligned_cols=30 Identities=17% Similarity=0.278 Sum_probs=25.7
Q ss_pred hhHhHHHHHHHHHHcCCHHHHHHHHhcCCC
Q psy15934 26 HTRSNLKRFLEYVNNSQVEKIAKLCAKGLD 55 (70)
Q Consensus 26 htk~~~~df~~aV~~~~~~kV~kLL~kGlD 55 (70)
.+..|...+.+|+.--+.+.+.++|++|.+
T Consensus 207 ~~~~n~~~~~~a~~pp~~~~l~~~~~~G~~ 236 (252)
T cd07221 207 LCTENLYLLTRALFPPDVKVLGEICLRGYL 236 (252)
T ss_pred eeHHHHHHHHHHhCCCCHHHHHHHHHhhHH
Confidence 366778889999999999999999999964
No 203
>PF10052 DUF2288: Protein of unknown function (DUF2288); InterPro: IPR018741 Members of this family of hypothetical bacterial proteins have no known function.
Probab=21.19 E-value=1.1e+02 Score=19.81 Aligned_cols=20 Identities=10% Similarity=0.209 Sum_probs=17.5
Q ss_pred HHHHHcCCHHHHHHHHhcCC
Q psy15934 35 LEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 35 ~~aV~~~~~~kV~kLL~kGl 54 (70)
=.++..||+..|..+++.|.
T Consensus 41 a~aia~D~~~~V~~Wi~~g~ 60 (93)
T PF10052_consen 41 AEAIAEDNTAQVQAWIESGQ 60 (93)
T ss_pred HHHHHhcCHHHHHHHHHCCC
Confidence 45788999999999999985
No 204
>COG3041 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.19 E-value=1.7e+02 Score=19.07 Aligned_cols=32 Identities=22% Similarity=0.368 Sum_probs=20.3
Q ss_pred HHHHHHHHHHcC-----CHHHHHHHHhc--CCCCCCCCC
Q psy15934 30 NLKRFLEYVNNS-----QVEKIAKLCAK--GLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~-----~~~kV~kLL~k--GlDPN~~d~ 61 (70)
--+||=.++.++ ...+|..+|.. .++|++.|-
T Consensus 10 F~kD~k~~~k~~~~d~~~l~~vi~~L~~~~Pl~~~~kDH 48 (91)
T COG3041 10 FKKDFKKLIKRGPLDMDKLKKVITLLANDLPLPPRYKDH 48 (91)
T ss_pred hhHHHHHHHhcCcchHHHHHHHHHHHHhCCCCCccccCC
Confidence 345666677666 34445555544 589999884
No 205
>PRK14143 heat shock protein GrpE; Provisional
Probab=21.19 E-value=25 Score=26.02 Aligned_cols=31 Identities=19% Similarity=0.374 Sum_probs=19.8
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|++....--++.|.. .-.-||||+|..
T Consensus 154 i~k~l~~~L~k~GV~~i~~-~G~~FDP~~HEA 184 (238)
T PRK14143 154 LYKQLVDVLKRLGVSPMRV-VGQEFDPNLHEA 184 (238)
T ss_pred HHHHHHHHHHHCCCeeeCC-CCCCCChHHhhe
Confidence 3566666666655555543 244599999984
No 206
>PF12997 DUF3881: Domain of unknown function, E. rectale Gene description (DUF3881); InterPro: IPR024541 This entry represents proteins of unknown function found primarily in Firmicutes. The Eubacterium rectale gene appears to be upregulated in the presence of Bacteroides thetaiotaomicron compared to growth in pure culture [].
Probab=21.07 E-value=1.6e+02 Score=22.73 Aligned_cols=34 Identities=21% Similarity=0.225 Sum_probs=24.2
Q ss_pred CCCHHHHHHH-hhHhHHHHHHHHHHcCCHHHHHHH
Q psy15934 16 NLDEKQLKAL-HTRSNLKRFLEYVNNSQVEKIAKL 49 (70)
Q Consensus 16 ~l~ekql~kl-htk~~~~df~~aV~~~~~~kV~kL 49 (70)
.-+++|.+.- ....+-..+++|+++||-+++..|
T Consensus 151 ~K~~~q~~~~~~~~~~R~~Li~AAk~GDedAiEsL 185 (283)
T PF12997_consen 151 KKTEEQEENEKEESRNRNMLIEAAKNGDEDAIESL 185 (283)
T ss_pred ccchhhhHHHHHHHHHHHHHHHHHHcCCHHHHHhc
Confidence 3466666633 233345569999999999999887
No 207
>PRK14147 heat shock protein GrpE; Provisional
Probab=20.94 E-value=22 Score=24.95 Aligned_cols=29 Identities=14% Similarity=0.359 Sum_probs=15.2
Q ss_pred HHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 32 KRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 32 ~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
+.|......--++.|.. .-.-||||+|..
T Consensus 104 k~l~~~L~~~Gv~~i~~-~G~~FDP~~HeA 132 (172)
T PRK14147 104 KQLLKVAADNGLTLLDP-VGQPFNPEHHQA 132 (172)
T ss_pred HHHHHHHHHCCCEEeCC-CCCCCChHHhce
Confidence 33444444443333332 234599999974
No 208
>PRK14962 DNA polymerase III subunits gamma and tau; Provisional
Probab=20.93 E-value=1.3e+02 Score=23.88 Aligned_cols=42 Identities=10% Similarity=0.230 Sum_probs=29.2
Q ss_pred CCHHHHHHHh---hHhHHHHHHHHHHcCCHHH----HHHHHhcCCCCCC
Q psy15934 17 LDEKQLKALH---TRSNLKRFLEYVNNSQVEK----IAKLCAKGLDPNF 58 (70)
Q Consensus 17 l~ekql~klh---tk~~~~df~~aV~~~~~~k----V~kLL~kGlDPN~ 58 (70)
++...+.++= ...+.-+|++++..++..+ +.+|+..|-||.+
T Consensus 230 It~e~V~~~l~~~~~~~i~~li~si~~~d~~~Al~~l~~ll~~Gedp~~ 278 (472)
T PRK14962 230 ITLETVHEALGLIPIEVVRDYINAIFNGDVKRVFTVLDDVYYSGKDYEV 278 (472)
T ss_pred CCHHHHHHHHcCCCHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCHHH
Confidence 5665554432 2356778999999999887 4556677888864
No 209
>PF12976 DUF3860: Domain of Unknown Function with PDB structure (DUF3860); InterPro: IPR024619 This protein family includes hypothetical protein (JCVI_PEP_1096688149193) whose crystal structure has been solved PDB:2OD5. The hypothetical protein is from an environmental metagenome (unidentified marine microbe).
Probab=20.89 E-value=1e+02 Score=20.10 Aligned_cols=29 Identities=28% Similarity=0.549 Sum_probs=21.9
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 31 LKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 31 ~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
+++|+.-- -.|+-.|...+++-+|||+ +|
T Consensus 10 Ir~~L~ER-~~NT~EI~~~~~~~M~~~s-~P 38 (92)
T PF12976_consen 10 IRNYLSER-PRNTIEISAWLASQMDPNS-CP 38 (92)
T ss_pred HHHHHhcC-cccHHHHHHHHHhccCCCC-CH
Confidence 45566544 5677789999999999996 55
No 210
>PRK06545 prephenate dehydrogenase; Validated
Probab=20.79 E-value=1.3e+02 Score=22.53 Aligned_cols=25 Identities=28% Similarity=0.434 Sum_probs=21.0
Q ss_pred hHHHHHHHHHHcCCHHHHHHHHhcC
Q psy15934 29 SNLKRFLEYVNNSQVEKIAKLCAKG 53 (70)
Q Consensus 29 ~~~~df~~aV~~~~~~kV~kLL~kG 53 (70)
..+..|.++++.++.+.+.+++++|
T Consensus 248 ~~l~~~~~~l~~~d~~~l~~~~~~~ 272 (359)
T PRK06545 248 EDLDRARDALESGDAEAIAELFDAG 272 (359)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4577888999999999999998764
No 211
>TIGR01470 cysG_Nterm siroheme synthase, N-terminal domain. This model represents a subfamily of CysG N-terminal region-related sequences. All sequences in the seed alignment for this model are N-terminal regions of known or predicted siroheme synthases. The C-terminal region of each is uroporphyrin-III C-methyltransferase (EC 2.1.1.107), which catalyzes the first step committed to the biosynthesis of either siroheme or cobalamin (vitamin B12) rather than protoheme (heme). The region represented by this model completes the process of oxidation and iron insertion to yield siroheme. Siroheme is a cofactor for nitrite and sulfite reductases, so siroheme synthase is CysG of cysteine biosynthesis in some organisms.
Probab=20.76 E-value=2.7e+02 Score=19.42 Aligned_cols=31 Identities=19% Similarity=0.132 Sum_probs=22.0
Q ss_pred HHhhHhHHHHHHHHHHcCCHHHHHHHHhcCC
Q psy15934 24 ALHTRSNLKRFLEYVNNSQVEKIAKLCAKGL 54 (70)
Q Consensus 24 klhtk~~~~df~~aV~~~~~~kV~kLL~kGl 54 (70)
++-.+..-++|++++..|+.+.+.+++...+
T Consensus 174 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 204 (205)
T TIGR01470 174 RFWEKFFDGAFAERVLAGREEQAERVLATRL 204 (205)
T ss_pred HHHHHHhccHHHHHHHcCCHHHHHHHHHHhh
Confidence 3333333456888899999999999887654
No 212
>TIGR03123 one_C_unchar_1 probable H4MPT-linked C1 transfer pathway protein. This protein family was identified, by the method of partial phylogenetic profiling, as related to the use of tetrahydromethanopterin (H4MPT) as a C-1 carrier. Characteristic markers of the H4MPT-linked C1 transfer pathway include formylmethanofuran dehydrogenase subunits, methenyltetrahydromethanopterin cyclohydrolase, etc. Tetrahydromethanopterin, a tetrahydrofolate analog, occurs in methanogenic archaea, bacterial methanotrophs, planctomycetes, and a few other lineages.
Probab=20.65 E-value=1.6e+02 Score=22.65 Aligned_cols=49 Identities=18% Similarity=0.169 Sum_probs=25.5
Q ss_pred hhhhhhccCCCHHHHHHHhhHhHHHHHHHHHHcCCHHHHHHHH-hcCCCC
Q psy15934 8 NTFISLLLNLDEKQLKALHTRSNLKRFLEYVNNSQVEKIAKLC-AKGLDP 56 (70)
Q Consensus 8 k~rvy~~~~l~ekql~klhtk~~~~df~~aV~~~~~~kV~kLL-~kGlDP 56 (70)
-+|+-+++.+|..++..---..--.-+++.....-.+.|.+++ ++|.||
T Consensus 231 ~~rlAr~lg~d~~e~~~~~~~~~A~~i~~~~~~~m~~ai~~v~~~~G~Dp 280 (318)
T TIGR03123 231 MRRLARMVCADLEELGEEDVRNLAKYYYEAQLEQLTEAIEEVLERYGLKT 280 (318)
T ss_pred HHHHHHHhCCChhHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCC
Confidence 3578899999977632211111122233333333334443333 799999
No 213
>PRK14164 heat shock protein GrpE; Provisional
Probab=20.62 E-value=24 Score=25.90 Aligned_cols=31 Identities=19% Similarity=0.332 Sum_probs=19.4
Q ss_pred HHHHHHHHHHcCCHHHHHHHHhcCCCCCCCCC
Q psy15934 30 NLKRFLEYVNNSQVEKIAKLCAKGLDPNFHCP 61 (70)
Q Consensus 30 ~~~df~~aV~~~~~~kV~kLL~kGlDPN~~d~ 61 (70)
.++.|+.....--+++|.. +-.-||||+|..
T Consensus 150 i~~~l~~vL~k~Gve~I~~-~Ge~FDP~~HEA 180 (218)
T PRK14164 150 FSDKLTNVLAGLKVEKFGE-EGDAFDPEIHEA 180 (218)
T ss_pred HHHHHHHHHHHCCCEEeCC-CCCCCChhHhhe
Confidence 3555666666555555543 345599999974
No 214
>smart00342 HTH_ARAC helix_turn_helix, arabinose operon control protein.
Probab=20.39 E-value=1.6e+02 Score=15.86 Aligned_cols=16 Identities=19% Similarity=0.414 Sum_probs=8.4
Q ss_pred hhhccCCCHHHHHHHh
Q psy15934 11 ISLLLNLDEKQLKALH 26 (70)
Q Consensus 11 vy~~~~l~ekql~klh 26 (70)
+...+++++.+|.++=
T Consensus 7 la~~~~~s~~~l~~~f 22 (84)
T smart00342 7 LAEALGMSPRHLQRLF 22 (84)
T ss_pred HHHHhCCCHHHHHHHH
Confidence 4445556666554443
No 215
>COG1541 PaaK Coenzyme F390 synthetase [Coenzyme metabolism]
Probab=20.28 E-value=2.6e+02 Score=22.67 Aligned_cols=43 Identities=28% Similarity=0.383 Sum_probs=33.6
Q ss_pred cCCCHHHHHHHhhHhHHHHHHHHHHcCCHHHHHHHHhcCCCCCC
Q psy15934 15 LNLDEKQLKALHTRSNLKRFLEYVNNSQVEKIAKLCAKGLDPNF 58 (70)
Q Consensus 15 ~~l~ekql~klhtk~~~~df~~aV~~~~~~kV~kLL~kGlDPN~ 58 (70)
--+..+.|.+|-.+. ++.++.+|..+..--=+.+-++|++|.-
T Consensus 16 e~~~r~eL~~lq~~r-l~~~~k~~yeNsPfYr~~f~~~~v~p~~ 58 (438)
T COG1541 16 ETLSRKELEKLQEKR-LKSTVKYVYENSPFYRKKFKEAGVDPDD 58 (438)
T ss_pred hhcCHHHHHHHHHHH-HHHHHHHHHhcCHHHHHHHHHcCCChHH
Confidence 356788899998887 8889999888777555556678999874
No 216
>COG1937 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.22 E-value=1.5e+02 Score=18.92 Aligned_cols=29 Identities=21% Similarity=0.343 Sum_probs=18.9
Q ss_pred hHhHHHHHHHHHH--cCCHHHHHHHHhcCCC
Q psy15934 27 TRSNLKRFLEYVN--NSQVEKIAKLCAKGLD 55 (70)
Q Consensus 27 tk~~~~df~~aV~--~~~~~kV~kLL~kGlD 55 (70)
+...-++.+.-+. .|+++.|.+|++-|-|
T Consensus 4 ~~~~kkkl~~RlrRi~GQv~gI~rMlEe~~~ 34 (89)
T COG1937 4 SIEEKKKLLNRLRRIEGQVRGIERMLEEDRD 34 (89)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc
Confidence 3333444444444 6999999999987643
No 217
>PF00886 Ribosomal_S16: Ribosomal protein S16; InterPro: IPR000307 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein S16 is one of the proteins from the small ribosomal subunit. It belongs to a family of ribosomal proteins which, on the basis of sequence similarities [], groups: Eubacterial S16. Algal and plant chloroplast S16. Cyanelle S16. Neurospora crassa mitochondrial S24 (cyt-21). S16 proteins have about 100 amino-acid residues.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2Y14_P 3UZ6_S 2J02_P 1HNZ_P 3V26_P 3KNL_P 1N34_P 2J00_P 1HNW_P 3OHC_P ....
Probab=20.16 E-value=49 Score=19.55 Aligned_cols=15 Identities=33% Similarity=0.547 Sum_probs=12.3
Q ss_pred CHHHHHHHHhcCCCC
Q psy15934 42 QVEKIAKLCAKGLDP 56 (70)
Q Consensus 42 ~~~kV~kLL~kGlDP 56 (70)
|.++|..+|.+|.-|
T Consensus 47 ~~eri~~Wl~~GAqp 61 (62)
T PF00886_consen 47 NFERIKYWLSKGAQP 61 (62)
T ss_dssp THHHHHHHHHTTEEE
T ss_pred CHHHHHHHHHcCcCC
Confidence 678888999998765
No 218
>PF13936 HTH_38: Helix-turn-helix domain; PDB: 2W48_A.
Probab=20.06 E-value=1.5e+02 Score=15.85 Aligned_cols=16 Identities=31% Similarity=0.343 Sum_probs=2.8
Q ss_pred hhccCCCHH-HHHHHhh
Q psy15934 12 SLLLNLDEK-QLKALHT 27 (70)
Q Consensus 12 y~~~~l~ek-ql~klht 27 (70)
|+.|..+|. +++.++.
T Consensus 2 ~~~Lt~~eR~~I~~l~~ 18 (44)
T PF13936_consen 2 YKHLTPEERNQIEALLE 18 (44)
T ss_dssp ----------HHHHHHC
T ss_pred ccchhhhHHHHHHHHHH
Confidence 334444443 3344443
Done!