Query psy12170
Match_columns 73
No_of_seqs 112 out of 369
Neff 5.1
Searched_HMMs 46136
Date Fri Aug 16 20:29:46 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12170.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12170hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0044|consensus 99.6 4.9E-15 1.1E-19 104.5 5.7 49 23-71 6-54 (193)
2 COG5126 FRQ1 Ca2+-binding prot 98.0 8.8E-06 1.9E-10 56.3 4.1 37 34-70 8-46 (160)
3 KOG0034|consensus 97.4 0.00038 8.2E-09 48.9 5.1 52 19-70 1-60 (187)
4 cd00052 EH Eps15 homology doma 95.8 0.063 1.4E-06 29.7 5.9 48 22-70 12-59 (67)
5 KOG0038|consensus 95.7 0.013 2.7E-07 41.4 3.1 39 19-57 1-39 (189)
6 smart00027 EH Eps15 homology d 95.4 0.065 1.4E-06 32.8 5.5 50 20-70 21-70 (96)
7 PF13833 EF-hand_8: EF-hand do 94.5 0.11 2.4E-06 28.3 4.2 46 24-70 3-51 (54)
8 cd00086 homeodomain Homeodomai 94.1 0.1 2.2E-06 28.5 3.6 30 23-52 22-51 (59)
9 PF11569 Homez: Homeodomain le 93.9 0.068 1.5E-06 31.5 2.7 30 24-53 21-50 (56)
10 PF00036 EF-hand_1: EF hand; 93.1 0.11 2.3E-06 26.2 2.3 26 44-70 1-26 (29)
11 cd05031 S-100A10_like S-100A10 93.1 0.23 5.1E-06 30.2 4.2 48 23-71 24-78 (94)
12 cd05029 S-100A6 S-100A6: S-100 92.9 0.31 6.7E-06 30.0 4.6 48 23-71 26-78 (88)
13 PF05920 Homeobox_KN: Homeobox 92.7 0.2 4.3E-06 27.2 3.1 31 22-52 7-37 (40)
14 smart00389 HOX Homeodomain. DN 92.7 0.24 5.1E-06 26.9 3.5 27 25-51 24-50 (56)
15 cd05030 calgranulins Calgranul 92.4 0.48 1E-05 28.8 5.0 48 23-71 24-78 (88)
16 PF13499 EF-hand_7: EF-hand do 92.2 0.28 6.1E-06 27.3 3.5 48 22-70 13-66 (66)
17 cd00213 S-100 S-100: S-100 dom 92.0 0.89 1.9E-05 26.9 5.8 49 22-71 23-78 (88)
18 cd00051 EFh EF-hand, calcium b 92.0 0.93 2E-05 23.2 5.4 48 22-70 13-62 (63)
19 cd05027 S-100B S-100B: S-100B 91.5 0.54 1.2E-05 28.9 4.5 50 22-72 22-79 (88)
20 PF00046 Homeobox: Homeobox do 91.0 0.31 6.8E-06 26.8 2.8 29 23-51 22-50 (57)
21 PF13405 EF-hand_6: EF-hand do 90.8 0.27 5.7E-06 24.4 2.2 27 44-71 1-27 (31)
22 cd05023 S-100A11 S-100A11: S-1 90.7 0.86 1.9E-05 28.1 4.9 45 25-70 27-78 (89)
23 cd00213 S-100 S-100: S-100 dom 90.6 0.41 8.9E-06 28.4 3.3 27 44-70 9-36 (88)
24 PTZ00183 centrin; Provisional 90.6 0.46 9.9E-06 30.0 3.6 30 39-68 10-41 (158)
25 cd05026 S-100Z S-100Z: S-100Z 90.5 0.67 1.5E-05 28.4 4.3 50 22-72 24-81 (93)
26 cd05025 S-100A1 S-100A1: S-100 90.1 0.97 2.1E-05 27.2 4.7 51 20-71 21-79 (92)
27 smart00054 EFh EF-hand, calciu 89.9 0.42 9.1E-06 21.0 2.3 28 44-72 1-28 (29)
28 PTZ00184 calmodulin; Provision 89.1 0.75 1.6E-05 28.4 3.7 44 24-69 26-72 (149)
29 PLN02964 phosphatidylserine de 88.1 0.49 1.1E-05 39.0 2.9 36 24-59 119-156 (644)
30 smart00027 EH Eps15 homology d 87.9 0.81 1.8E-05 27.8 3.2 32 39-70 3-36 (96)
31 PTZ00183 centrin; Provisional 87.4 2.8 6.2E-05 26.3 5.7 49 23-72 104-154 (158)
32 PF14658 EF-hand_9: EF-hand do 85.0 2.6 5.7E-05 25.4 4.3 47 24-70 13-62 (66)
33 cd05022 S-100A13 S-100A13: S-1 84.8 1.8 4E-05 26.8 3.7 49 22-71 22-74 (89)
34 KOG0041|consensus 84.7 0.98 2.1E-05 33.3 2.7 33 38-70 91-125 (244)
35 cd04790 HTH_Cfa-like_unk Helix 84.2 2.6 5.7E-05 28.8 4.6 35 25-59 115-149 (172)
36 KOG0028|consensus 83.9 3.4 7.4E-05 29.2 5.1 48 22-70 119-168 (172)
37 cd00252 SPARC_EC SPARC_EC; ext 83.8 5.6 0.00012 25.8 5.8 46 22-70 61-106 (116)
38 PTZ00184 calmodulin; Provision 83.4 4.2 9.2E-05 25.0 4.9 48 22-70 97-146 (149)
39 PF13202 EF-hand_5: EF hand; P 83.2 1.3 2.9E-05 21.3 2.1 23 46-69 2-24 (25)
40 PF04546 Sigma70_ner: Sigma-70 82.6 0.56 1.2E-05 32.7 0.8 48 25-72 123-171 (211)
41 PF09820 AAA-ATPase_like: Pred 77.8 1.8 3.9E-05 31.5 2.1 40 33-72 218-260 (284)
42 PF14788 EF-hand_10: EF hand; 77.7 3.9 8.4E-05 23.6 3.1 45 26-71 2-48 (51)
43 cd05023 S-100A11 S-100A11: S-1 77.2 3.4 7.3E-05 25.4 2.9 29 43-71 6-38 (89)
44 PF13518 HTH_28: Helix-turn-he 74.6 2.9 6.3E-05 22.1 1.9 31 30-60 14-44 (52)
45 PF12174 RST: RCD1-SRO-TAF4 (R 74.4 2.9 6.4E-05 25.2 2.1 23 43-69 28-50 (70)
46 PF09693 Phage_XkdX: Phage unc 74.0 4.1 8.9E-05 21.9 2.4 16 56-71 22-37 (40)
47 PF12763 EF-hand_4: Cytoskelet 73.8 6.3 0.00014 25.1 3.6 46 22-68 22-67 (104)
48 TIGR01669 phage_XkdX phage unc 73.7 12 0.00025 20.8 4.2 36 29-71 7-42 (45)
49 PF13384 HTH_23: Homeodomain-l 73.4 2.9 6.3E-05 22.2 1.7 31 30-60 19-49 (50)
50 KOG0027|consensus 71.6 13 0.00029 24.0 4.9 50 22-72 98-149 (151)
51 cd05029 S-100A6 S-100A6: S-100 68.5 6.7 0.00015 23.9 2.8 26 45-70 12-38 (88)
52 cd05026 S-100Z S-100Z: S-100Z 67.1 10 0.00022 23.1 3.4 27 44-70 11-38 (93)
53 COG5126 FRQ1 Ca2+-binding prot 66.7 28 0.0006 24.1 5.8 29 41-71 54-82 (160)
54 cd05031 S-100A10_like S-100A10 64.4 11 0.00024 22.7 3.1 28 43-70 8-36 (94)
55 PRK12722 transcriptional activ 63.6 12 0.00026 26.6 3.6 32 31-62 26-58 (187)
56 PF14513 DAG_kinase_N: Diacylg 62.5 6.2 0.00014 26.7 1.9 45 24-68 6-56 (138)
57 KOG0027|consensus 62.2 14 0.00029 24.0 3.5 46 24-70 59-111 (151)
58 PLN02964 phosphatidylserine de 62.1 23 0.00049 29.5 5.4 48 23-71 193-242 (644)
59 PF13011 LZ_Tnp_IS481: leucine 60.4 13 0.00028 23.4 3.0 31 33-63 30-60 (85)
60 PRK04387 hypothetical protein; 58.7 9.9 0.00021 24.3 2.2 23 38-60 32-54 (90)
61 PF05280 FlhC: Flagellar trans 58.6 12 0.00025 26.2 2.8 30 34-63 29-59 (175)
62 PF09966 DUF2200: Uncharacteri 58.0 13 0.00028 24.7 2.8 31 27-57 24-61 (111)
63 cd05022 S-100A13 S-100A13: S-1 57.8 17 0.00037 22.4 3.2 27 44-70 9-35 (89)
64 KOG0488|consensus 56.7 13 0.00028 28.0 3.0 27 25-51 196-222 (309)
65 COG4875 Uncharacterized protei 55.5 11 0.00025 25.9 2.3 30 34-63 72-101 (156)
66 PF09851 SHOCT: Short C-termin 54.0 28 0.00061 17.5 3.2 22 44-69 4-25 (31)
67 PF15187 Augurin: Oesophageal 53.5 8.9 0.00019 25.4 1.4 16 39-54 44-59 (114)
68 KOG0031|consensus 51.2 52 0.0011 23.3 5.1 46 24-70 116-163 (171)
69 PRK12860 transcriptional activ 50.1 28 0.0006 24.8 3.6 33 31-63 26-59 (189)
70 KOG0487|consensus 49.8 6.7 0.00015 29.9 0.5 25 26-50 260-284 (308)
71 PF09279 EF-hand_like: Phospho 49.6 19 0.00041 21.0 2.4 24 44-69 1-24 (83)
72 cd00252 SPARC_EC SPARC_EC; ext 49.1 32 0.00069 22.2 3.5 28 42-70 47-74 (116)
73 PF05344 DUF746: Domain of Unk 48.8 25 0.00055 21.2 2.8 33 30-62 15-51 (65)
74 TIGR01565 homeo_ZF_HD homeobox 48.3 9.6 0.00021 22.2 0.9 26 26-51 30-55 (58)
75 KOG0483|consensus 47.9 26 0.00057 25.0 3.3 29 23-51 72-100 (198)
76 PF03871 RNA_pol_Rpb5_N: RNA p 47.3 32 0.00069 21.3 3.2 31 42-72 4-46 (93)
77 PF05193 Peptidase_M16_C: Pept 47.0 15 0.00033 22.6 1.8 30 39-68 1-35 (184)
78 KOG0488|consensus 46.7 12 0.00026 28.2 1.4 22 34-55 173-194 (309)
79 KOG0030|consensus 46.5 49 0.0011 23.1 4.3 49 22-72 101-152 (152)
80 KOG0037|consensus 45.4 27 0.00059 25.6 3.0 26 45-70 123-150 (221)
81 KOG4223|consensus 44.8 26 0.00057 27.0 3.0 30 40-69 157-188 (325)
82 KOG0491|consensus 44.4 27 0.00058 25.1 2.8 28 25-52 124-151 (194)
83 PF05256 UPF0223: Uncharacteri 43.4 12 0.00026 23.8 0.8 23 37-59 31-53 (88)
84 PF15658 Latrotoxin_C: Latroto 43.0 29 0.00063 23.5 2.7 27 42-68 85-112 (127)
85 KOG0028|consensus 42.9 97 0.0021 22.0 5.4 27 42-69 68-94 (172)
86 PF05263 DUF722: Protein of un 42.4 26 0.00057 23.4 2.4 24 32-55 103-126 (130)
87 PF03979 Sigma70_r1_1: Sigma-7 40.8 27 0.00059 20.9 2.1 38 17-54 13-50 (82)
88 smart00411 BHL bacterial (prok 40.7 47 0.001 19.5 3.2 26 30-55 6-31 (90)
89 cd00591 HU_IHF Integration hos 40.1 47 0.001 19.3 3.1 26 29-54 4-29 (87)
90 PRK13979 DNA topoisomerase IV 39.1 21 0.00045 30.9 1.9 36 34-71 202-239 (957)
91 PF08707 PriCT_2: Primase C te 38.9 62 0.0013 19.0 3.5 28 37-67 48-75 (78)
92 PHA02970 hypothetical protein; 38.8 52 0.0011 21.8 3.3 32 25-56 22-53 (115)
93 PF00226 DnaJ: DnaJ domain; I 38.7 32 0.0007 18.8 2.1 21 39-59 11-32 (64)
94 PF06902 Fer4_19: Divergent 4F 37.8 13 0.00028 21.8 0.3 10 54-63 52-61 (64)
95 PF13758 Prefoldin_3: Prefoldi 37.0 74 0.0016 20.6 3.8 38 29-70 19-56 (99)
96 PF00220 Hormone_4: Neurohypop 36.4 17 0.00037 14.2 0.5 8 52-59 2-9 (9)
97 PF08640 U3_assoc_6: U3 small 36.2 66 0.0014 19.7 3.4 23 29-51 6-28 (83)
98 PRK02628 nadE NAD synthetase; 36.2 57 0.0012 26.8 3.9 30 25-54 581-628 (679)
99 PF00216 Bac_DNA_binding: Bact 36.1 60 0.0013 18.9 3.1 26 29-54 5-30 (90)
100 KOG0844|consensus 36.1 28 0.0006 27.3 2.0 23 34-56 182-204 (408)
101 KOG0775|consensus 35.8 37 0.0008 26.0 2.6 34 22-55 197-230 (304)
102 PRK05658 RNA polymerase sigma 35.8 30 0.00066 27.9 2.2 41 31-72 259-314 (619)
103 PRK10664 transcriptional regul 35.4 62 0.0013 19.8 3.2 26 29-54 5-30 (90)
104 KOG0037|consensus 35.3 1E+02 0.0022 22.6 4.7 46 25-71 73-121 (221)
105 PHA03102 Small T antigen; Revi 34.9 63 0.0014 22.1 3.4 36 34-69 11-49 (153)
106 PF08763 Ca_chan_IQ: Voltage g 34.3 26 0.00056 18.8 1.1 10 45-54 15-24 (35)
107 TIGR02675 tape_meas_nterm tape 34.3 47 0.001 19.7 2.4 36 35-70 5-40 (75)
108 TIGR00987 himA integration hos 33.2 68 0.0015 19.5 3.1 25 30-54 7-31 (96)
109 PF02885 Glycos_trans_3N: Glyc 33.1 97 0.0021 17.6 4.3 32 30-65 5-36 (66)
110 PF14516 AAA_35: AAA-like doma 32.9 45 0.00097 24.7 2.6 34 37-72 198-231 (331)
111 COG2944 Predicted transcriptio 32.3 87 0.0019 20.3 3.6 24 25-48 43-66 (104)
112 COG4476 Uncharacterized protei 32.2 45 0.00097 21.3 2.1 23 37-59 31-53 (90)
113 PF13551 HTH_29: Winged helix- 32.1 38 0.00083 20.0 1.8 25 31-55 15-39 (112)
114 PF06648 DUF1160: Protein of u 31.6 87 0.0019 20.9 3.6 31 25-55 50-83 (122)
115 PF04695 Pex14_N: Peroxisomal 31.5 79 0.0017 20.7 3.4 37 18-55 14-50 (136)
116 cd01104 HTH_MlrA-CarA Helix-Tu 30.9 74 0.0016 17.4 2.8 36 32-67 4-45 (68)
117 PF08356 EF_assoc_2: EF hand a 30.8 96 0.0021 19.5 3.5 31 38-68 2-36 (89)
118 PF02256 Fe_hyd_SSU: Iron hydr 30.5 46 0.001 19.0 1.9 16 41-56 28-43 (60)
119 PF06401 Alpha-2-MRAP_C: Alpha 30.5 43 0.00094 24.4 2.1 26 29-54 6-35 (214)
120 PF13443 HTH_26: Cro/C1-type H 30.3 34 0.00074 18.6 1.3 20 31-50 13-32 (63)
121 KOG0046|consensus 30.3 62 0.0013 27.0 3.2 13 23-35 10-22 (627)
122 PRK08402 replication factor A; 30.0 1.3E+02 0.0027 23.2 4.7 38 26-66 4-42 (355)
123 KOG3956|consensus 29.9 81 0.0017 24.5 3.5 31 24-54 150-183 (359)
124 PF04699 P16-Arc: ARP2/3 compl 29.5 52 0.0011 22.5 2.3 21 24-54 96-116 (152)
125 COG5573 Predicted nucleic-acid 29.4 88 0.0019 21.6 3.4 57 12-68 27-88 (142)
126 PF05603 DUF775: Protein of un 29.2 28 0.00061 24.6 0.9 15 41-55 180-194 (202)
127 cd06257 DnaJ DnaJ domain or J- 29.2 82 0.0018 16.4 2.7 32 38-69 10-46 (55)
128 PF01527 HTH_Tnp_1: Transposas 29.1 73 0.0016 17.8 2.6 25 30-54 25-49 (76)
129 COG0484 DnaJ DnaJ-class molecu 29.0 60 0.0013 25.4 2.8 32 38-69 14-50 (371)
130 PRK00095 mutL DNA mismatch rep 28.9 1.2E+02 0.0025 24.7 4.5 38 34-71 568-615 (617)
131 TIGR02653 Lon_rel_chp conserve 28.6 68 0.0015 27.1 3.2 29 41-69 407-436 (675)
132 PF08667 BetR: BetR domain; I 28.6 1E+02 0.0022 21.1 3.6 30 32-68 41-70 (147)
133 PF03672 UPF0154: Uncharacteri 28.4 1.4E+02 0.003 17.9 3.7 32 23-54 29-62 (64)
134 PRK09019 translation initiatio 28.1 59 0.0013 21.2 2.3 21 40-60 55-75 (108)
135 TIGR02688 conserved hypothetic 27.9 74 0.0016 25.6 3.2 29 41-69 400-429 (449)
136 PF02337 Gag_p10: Retroviral G 27.9 70 0.0015 20.2 2.5 50 19-68 1-57 (90)
137 PF13411 MerR_1: MerR HTH fami 27.8 75 0.0016 17.5 2.4 23 31-53 3-25 (69)
138 PF10437 Lip_prot_lig_C: Bacte 27.8 1.4E+02 0.0029 17.6 3.8 42 28-69 44-85 (86)
139 KOG0031|consensus 27.6 94 0.002 22.0 3.3 33 39-71 25-59 (171)
140 cd00159 RhoGAP RhoGAP: GTPase- 27.2 1.2E+02 0.0025 19.2 3.5 32 38-69 48-79 (169)
141 PF09373 PMBR: Pseudomurein-bi 27.1 40 0.00087 17.1 1.1 14 58-71 2-15 (33)
142 PRK12279 50S ribosomal protein 27.0 69 0.0015 24.5 2.8 27 44-73 249-275 (311)
143 PRK10753 transcriptional regul 27.0 1E+02 0.0023 18.7 3.2 26 29-54 5-30 (90)
144 PF14410 GH-E: HNH/ENDO VII su 26.8 42 0.00092 20.0 1.3 44 15-62 17-66 (70)
145 cd04761 HTH_MerR-SF Helix-Turn 26.7 99 0.0021 15.7 3.4 23 31-53 3-25 (49)
146 KOG0489|consensus 26.5 55 0.0012 23.8 2.1 19 33-51 191-209 (261)
147 PRK11426 hypothetical protein; 26.5 1.4E+02 0.0031 20.1 3.9 40 25-64 69-113 (132)
148 PTZ00100 DnaJ chaperone protei 26.3 1.5E+02 0.0034 19.4 4.0 39 31-69 68-107 (116)
149 cd04236 AAK_NAGS-Urea AAK_NAGS 26.3 96 0.0021 22.9 3.4 28 30-57 4-31 (271)
150 PF13048 DUF3908: Protein of u 26.0 34 0.00073 23.0 0.9 40 30-71 4-43 (127)
151 PRK14561 hypothetical protein; 26.0 62 0.0013 22.2 2.2 28 39-66 133-162 (194)
152 PTZ00232 variable surface prot 25.9 85 0.0018 24.2 3.1 29 43-71 167-198 (363)
153 PF13560 HTH_31: Helix-turn-he 25.8 95 0.0021 17.1 2.6 38 10-48 27-64 (64)
154 PRK00285 ihfA integration host 25.6 1.3E+02 0.0027 18.3 3.4 26 30-55 8-33 (99)
155 PF09435 DUF2015: Fungal prote 25.4 1.7E+02 0.0036 19.8 4.1 33 28-60 87-123 (128)
156 PF00165 HTH_AraC: Bacterial r 25.4 96 0.0021 15.7 2.4 24 29-52 9-32 (42)
157 PF01465 GRIP: GRIP domain; I 25.2 87 0.0019 17.1 2.3 24 26-49 23-46 (46)
158 cd07971 OBF_DNA_ligase_LigD Th 25.1 73 0.0016 19.9 2.3 18 37-54 40-57 (115)
159 PRK00523 hypothetical protein; 25.0 1.7E+02 0.0036 18.0 3.7 32 23-54 37-70 (72)
160 PF13348 Y_phosphatase3C: Tyro 25.0 1E+02 0.0023 17.2 2.7 19 34-52 49-67 (68)
161 PF14550 Peptidase_U35_2: Puta 24.9 69 0.0015 21.2 2.2 22 40-61 35-56 (122)
162 PF13720 Acetyltransf_11: Udp 24.9 78 0.0017 19.2 2.3 28 36-64 24-51 (83)
163 KOG0485|consensus 24.8 58 0.0013 24.4 2.0 26 26-51 129-154 (268)
164 PF07037 DUF1323: Putative tra 24.8 79 0.0017 21.3 2.4 20 33-52 5-24 (122)
165 smart00342 HTH_ARAC helix_turn 24.7 1.2E+02 0.0026 16.3 2.9 28 30-57 52-80 (84)
166 KOG3218|consensus 24.2 64 0.0014 23.5 2.0 31 42-72 6-48 (208)
167 COG4397 Mu-like prophage major 24.0 68 0.0015 24.3 2.2 32 38-69 2-34 (308)
168 PRK12461 UDP-N-acetylglucosami 23.9 90 0.002 22.5 2.8 26 36-61 196-221 (255)
169 PF05120 GvpG: Gas vesicle pro 23.7 1.4E+02 0.0031 18.3 3.3 24 42-69 34-57 (79)
170 PF10281 Ish1: Putative stress 23.7 1.2E+02 0.0026 15.5 3.6 29 40-68 4-32 (38)
171 PF09124 Endonuc-dimeris: T4 r 23.6 92 0.002 18.1 2.3 34 18-51 8-47 (54)
172 TIGR01557 myb_SHAQKYF myb-like 23.6 1.5E+02 0.0033 16.8 3.3 28 26-53 27-54 (57)
173 cd00043 CYCLIN Cyclin box fold 23.5 1.2E+02 0.0026 16.3 2.8 23 30-52 64-87 (88)
174 PF09808 SNAPc_SNAP43: Small n 23.4 1.2E+02 0.0025 20.8 3.2 26 42-70 3-28 (194)
175 PF13373 DUF2407_C: DUF2407 C- 23.0 1.1E+02 0.0023 20.6 2.8 24 33-57 5-28 (140)
176 KOG2047|consensus 23.0 93 0.002 26.8 3.0 53 4-58 5-58 (835)
177 PF09570 RE_SinI: SinI restric 23.0 73 0.0016 23.4 2.2 25 44-68 177-212 (221)
178 smart00751 BSD domain in trans 22.8 1.4E+02 0.0031 16.1 3.1 36 27-70 8-43 (51)
179 PF01395 PBP_GOBP: PBP/GOBP fa 22.7 1.7E+02 0.0037 17.0 3.7 47 25-71 30-80 (121)
180 PF02208 Sorb: Sorbin homologo 22.6 45 0.00098 19.0 0.8 10 45-54 25-35 (47)
181 TIGR03830 CxxCG_CxxCG_HTH puta 22.6 1.5E+02 0.0032 18.2 3.3 26 24-49 63-88 (127)
182 KOG2758|consensus 22.6 1.2E+02 0.0025 24.3 3.3 33 27-59 111-145 (432)
183 cd05024 S-100A10 S-100A10: A s 22.5 2.1E+02 0.0045 17.9 4.5 46 24-70 22-74 (91)
184 PF12419 DUF3670: SNF2 Helicas 22.3 2.4E+02 0.0051 18.4 4.7 59 10-68 68-137 (141)
185 COG3763 Uncharacterized protei 22.1 1.9E+02 0.0042 17.7 3.6 33 22-54 35-69 (71)
186 PF00612 IQ: IQ calmodulin-bin 22.0 73 0.0016 14.2 1.4 11 44-54 6-16 (21)
187 smart00271 DnaJ DnaJ molecular 21.9 1.2E+02 0.0026 16.1 2.5 22 39-60 12-34 (60)
188 PRK12274 serine/threonine prot 21.8 1.2E+02 0.0025 22.0 3.0 25 42-67 154-178 (218)
189 PF00767 Poty_coat: Potyvirus 21.3 1.2E+02 0.0025 22.3 3.0 34 36-72 52-85 (237)
190 smart00015 IQ Short calmodulin 21.2 71 0.0015 14.9 1.3 11 44-54 8-18 (26)
191 PF00620 RhoGAP: RhoGAP domain 20.9 1.9E+02 0.004 18.0 3.6 34 37-70 48-81 (151)
192 TIGR03561 organ_hyd_perox pero 20.8 1.1E+02 0.0023 19.6 2.4 20 39-58 101-120 (134)
193 KOG0384|consensus 20.7 72 0.0016 29.1 2.0 20 37-56 999-1018(1373)
194 COG3592 Uncharacterized conser 20.3 38 0.00082 20.9 0.2 20 39-62 51-70 (74)
195 PF06014 DUF910: Bacterial pro 20.2 1.7E+02 0.0036 17.4 3.0 21 41-68 30-50 (62)
196 PF14591 AF0941-like: AF0941-l 20.1 73 0.0016 21.6 1.6 30 41-70 34-67 (127)
197 TIGR03760 ICE_TraI_Pfluor inte 20.1 1E+02 0.0022 22.0 2.4 28 27-54 18-45 (218)
No 1
>KOG0044|consensus
Probab=99.56 E-value=4.9e-15 Score=104.52 Aligned_cols=49 Identities=35% Similarity=0.497 Sum_probs=47.5
Q ss_pred ccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 23 VQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 23 ~skls~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
+++++|+.+++|.+.|+|+++||++|||+|+++||+|.++.++|+.||-
T Consensus 6 ~~~~~~~~~e~l~~~t~f~~~ei~~~Yr~Fk~~cP~G~~~~~~F~~i~~ 54 (193)
T KOG0044|consen 6 NSKLQPESLEQLVQQTKFSKKEIQQWYRGFKNECPSGRLTLEEFREIYA 54 (193)
T ss_pred cccCCcHHHHHHHHhcCCCHHHHHHHHHHhcccCCCCccCHHHHHHHHH
Confidence 7899999999999999999999999999999999999999999999984
No 2
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=97.99 E-value=8.8e-06 Score=56.31 Aligned_cols=37 Identities=22% Similarity=0.211 Sum_probs=35.3
Q ss_pred HHhcCCCCHHHHHHHHHHhhcccC--CCCcCHHHHhhhh
Q psy12170 34 LLKQTKFTRQEIRVMYRGFKQFLT--FIGLNHPTLKDNT 70 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~FkkecP--sG~Lt~~EFk~i~ 70 (73)
+...|+|+++||++||++|...|+ +|.|++++|..||
T Consensus 8 ~~~~~~~t~~qi~~lkeaF~l~D~d~~G~I~~~el~~il 46 (160)
T COG5126 8 LLTFTQLTEEQIQELKEAFQLFDRDSDGLIDRNELGKIL 46 (160)
T ss_pred hhhcccCCHHHHHHHHHHHHHhCcCCCCCCcHHHHHHHH
Confidence 788899999999999999999998 9999999999997
No 3
>KOG0034|consensus
Probab=97.39 E-value=0.00038 Score=48.95 Aligned_cols=52 Identities=21% Similarity=0.179 Sum_probs=41.5
Q ss_pred ccccccC-CCHHHHHHHHhcCC----CCHHHHHHHHHHhhcccC---CCCcCHHHHhhhh
Q psy12170 19 SSKHVQK-YRPIALEDLLKQTK----FTRQEIRVMYRGFKQFLT---FIGLNHPTLKDNT 70 (73)
Q Consensus 19 ~~~~~sk-ls~e~l~~L~~~T~----Fs~~EIk~lYk~FkkecP---sG~Lt~~EFk~i~ 70 (73)
|+...|. ++.+++..+...|+ ||..||..||.+|++.++ .|.|+.++|..|.
T Consensus 1 Mg~~~s~~~~~~~~~~~~~~~~~~~~fs~~EI~~L~~rF~kl~~~~~~g~lt~eef~~i~ 60 (187)
T KOG0034|consen 1 MGNLSSTLLSDEDLEELQMYTGDPTQFSANEIERLYERFKKLDRNNGDGYLTKEEFLSIP 60 (187)
T ss_pred CCcccccccchhhhHHHHhccCCCcccCHHHHHHHHHHHHHhccccccCccCHHHHHHHH
Confidence 3444443 45677888888999 999999999999999754 5799999998875
No 4
>cd00052 EH Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
Probab=95.79 E-value=0.063 Score=29.68 Aligned_cols=48 Identities=6% Similarity=-0.090 Sum_probs=39.6
Q ss_pred cccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 22 HVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+.+.++.+++..+....+++..+++++++.|-. ..+|.|+.++|..++
T Consensus 12 ~~G~i~~~el~~~l~~~g~~~~~~~~i~~~~d~-~~~g~i~~~ef~~~~ 59 (67)
T cd00052 12 GDGLISGDEARPFLGKSGLPRSVLAQIWDLADT-DKDGKLDKEEFAIAM 59 (67)
T ss_pred CCCcCcHHHHHHHHHHcCCCHHHHHHHHHHhcC-CCCCcCCHHHHHHHH
Confidence 445788899988777789999999999988854 357899999998765
No 5
>KOG0038|consensus
Probab=95.66 E-value=0.013 Score=41.42 Aligned_cols=39 Identities=18% Similarity=0.273 Sum_probs=34.4
Q ss_pred ccccccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcccC
Q psy12170 19 SSKHVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQFLT 57 (73)
Q Consensus 19 ~~~~~skls~e~l~~L~~~T~Fs~~EIk~lYk~FkkecP 57 (73)
||+.+...+.++++....+|-|++++|-++|++|....|
T Consensus 1 MGNK~~vFT~eqLd~YQDCTFFtrKdIlrl~~Rf~~L~P 39 (189)
T KOG0038|consen 1 MGNKQTVFTEEQLDEYQDCTFFTRKDILRLHKRFYELAP 39 (189)
T ss_pred CCCccceeeHHHHhhhcccccccHHHHHHHHHHHHHhCc
Confidence 677777788999999999999999999999999987544
No 6
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=95.43 E-value=0.065 Score=32.75 Aligned_cols=50 Identities=4% Similarity=-0.065 Sum_probs=41.4
Q ss_pred cccccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 20 SKHVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 20 ~~~~skls~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
..+.+.++.+++..+.+..+++.+++.++++.+-. .-+|.|+.++|..+.
T Consensus 21 ~d~~G~Is~~el~~~l~~~~~~~~ev~~i~~~~d~-~~~g~I~~~eF~~~~ 70 (96)
T smart00027 21 KNQDGTVTGAQAKPILLKSGLPQTLLAKIWNLADI-DNDGELDKDEFALAM 70 (96)
T ss_pred CCCCCeEeHHHHHHHHHHcCCCHHHHHHHHHHhcC-CCCCCcCHHHHHHHH
Confidence 34556899999999887889999999999998854 347899999998765
No 7
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=94.47 E-value=0.11 Score=28.26 Aligned_cols=46 Identities=13% Similarity=0.135 Sum_probs=36.2
Q ss_pred cCCCHHHHHHHHhc--CC-CCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 24 QKYRPIALEDLLKQ--TK-FTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 24 skls~e~l~~L~~~--T~-Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
..++++++...... .+ ++++|+..+++.|=. ..+|.|+.+||..+.
T Consensus 3 G~i~~~~~~~~l~~~g~~~~s~~e~~~l~~~~D~-~~~G~I~~~EF~~~~ 51 (54)
T PF13833_consen 3 GKITREEFRRALSKLGIKDLSEEEVDRLFREFDT-DGDGYISFDEFISMM 51 (54)
T ss_dssp SEEEHHHHHHHHHHTTSSSSCHHHHHHHHHHHTT-SSSSSEEHHHHHHHH
T ss_pred CEECHHHHHHHHHHhCCCCCCHHHHHHHHHhccc-CCCCCCCHHHHHHHH
Confidence 35677787775543 35 999999999999954 467999999998764
No 8
>cd00086 homeodomain Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Probab=94.09 E-value=0.1 Score=28.47 Aligned_cols=30 Identities=13% Similarity=0.200 Sum_probs=26.0
Q ss_pred ccCCCHHHHHHHHhcCCCCHHHHHHHHHHh
Q psy12170 23 VQKYRPIALEDLLKQTKFTRQEIRVMYRGF 52 (73)
Q Consensus 23 ~skls~e~l~~L~~~T~Fs~~EIk~lYk~F 52 (73)
+...+..+++.|...|+++..+|+.||+.=
T Consensus 22 ~~~P~~~~~~~la~~~~l~~~qV~~WF~nr 51 (59)
T cd00086 22 NPYPSREEREELAKELGLTERQVKIWFQNR 51 (59)
T ss_pred CCCCCHHHHHHHHHHHCcCHHHHHHHHHHH
Confidence 446788899999999999999999998753
No 9
>PF11569 Homez: Homeodomain leucine-zipper encoding, Homez; PDB: 2YS9_A.
Probab=93.95 E-value=0.068 Score=31.49 Aligned_cols=30 Identities=13% Similarity=0.310 Sum_probs=21.6
Q ss_pred cCCCHHHHHHHHhcCCCCHHHHHHHHHHhh
Q psy12170 24 QKYRPIALEDLLKQTKFTRQEIRVMYRGFK 53 (73)
Q Consensus 24 skls~e~l~~L~~~T~Fs~~EIk~lYk~Fk 53 (73)
..|...+|+.|+.+|++|.++|+.||..=.
T Consensus 21 ~~L~E~DL~~L~~kS~ms~qqVr~WFa~~~ 50 (56)
T PF11569_consen 21 KQLQEEDLDELCDKSRMSYQQVRDWFAERM 50 (56)
T ss_dssp ----TTHHHHHHHHTT--HHHHHHHHHHHS
T ss_pred CCccHhhHHHHHHHHCCCHHHHHHHHHHhc
Confidence 367788999999999999999999997643
No 10
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=93.09 E-value=0.11 Score=26.24 Aligned_cols=26 Identities=15% Similarity=0.241 Sum_probs=21.0
Q ss_pred HHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 44 EIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 44 EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
||+++++.|=+ ..+|.|+.+||+.+.
T Consensus 1 E~~~~F~~~D~-d~dG~I~~~Ef~~~~ 26 (29)
T PF00036_consen 1 ELKEAFREFDK-DGDGKIDFEEFKEMM 26 (29)
T ss_dssp HHHHHHHHHST-TSSSEEEHHHHHHHH
T ss_pred CHHHHHHHHCC-CCCCcCCHHHHHHHH
Confidence 67788888754 578999999999875
No 11
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=93.06 E-value=0.23 Score=30.19 Aligned_cols=48 Identities=8% Similarity=0.061 Sum_probs=38.3
Q ss_pred ccCCCHHHHHHHHhc-------CCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 23 VQKYRPIALEDLLKQ-------TKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 23 ~skls~e~l~~L~~~-------T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
.+.++..++..+... ..++.++|+.|.+.|-. ..+|.|+..+|..+..
T Consensus 24 dG~Is~~El~~~l~~~~g~~lg~~~s~~ei~~~~~~~D~-~~dg~I~f~eF~~l~~ 78 (94)
T cd05031 24 KNTLSRKELKKLMEKELSEFLKNQKDPMAVDKIMKDLDQ-NRDGKVNFEEFVSLVA 78 (94)
T ss_pred CCeECHHHHHHHHHHHhHHHhhccccHHHHHHHHHHhCC-CCCCcCcHHHHHHHHH
Confidence 368999998886553 36799999999999843 4678999999987754
No 12
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=92.87 E-value=0.31 Score=29.96 Aligned_cols=48 Identities=13% Similarity=0.107 Sum_probs=37.9
Q ss_pred ccCCCHHHHHHHHh-----cCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 23 VQKYRPIALEDLLK-----QTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 23 ~skls~e~l~~L~~-----~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
+..++.+++..+.. ..+++.+||.+|.+.+ ....+|.|+..+|..+-.
T Consensus 26 ~g~Is~~EL~~~l~~~~~lg~k~t~~ev~~m~~~~-D~d~dG~Idf~EFv~lm~ 78 (88)
T cd05029 26 KNTLSKKELKELIQKELTIGSKLQDAEIAKLMEDL-DRNKDQEVNFQEYVTFLG 78 (88)
T ss_pred CCEECHHHHHHHHHHHHhcCCCCCHHHHHHHHHHh-cCCCCCCCcHHHHHHHHH
Confidence 45789999888664 3468999999998887 346789999999977643
No 13
>PF05920 Homeobox_KN: Homeobox KN domain; InterPro: IPR008422 This entry represents a homeobox transcription factor KN domain conserved from fungi to human and plants [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3K2A_B 2LK2_A 1X2N_A 2DMN_A.
Probab=92.68 E-value=0.2 Score=27.17 Aligned_cols=31 Identities=26% Similarity=0.238 Sum_probs=24.0
Q ss_pred cccCCCHHHHHHHHhcCCCCHHHHHHHHHHh
Q psy12170 22 HVQKYRPIALEDLLKQTKFTRQEIRVMYRGF 52 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T~Fs~~EIk~lYk~F 52 (73)
.++-.++++.+.|+..|++|..+|..|+..-
T Consensus 7 ~nPYPs~~ek~~L~~~tgls~~Qi~~WF~Na 37 (40)
T PF05920_consen 7 HNPYPSKEEKEELAKQTGLSRKQISNWFINA 37 (40)
T ss_dssp TSGS--HHHHHHHHHHHTS-HHHHHHHHHHH
T ss_pred CCCCCCHHHHHHHHHHcCCCHHHHHHHHHHh
Confidence 4556788899999999999999999998753
No 14
>smart00389 HOX Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes
Probab=92.66 E-value=0.24 Score=26.94 Aligned_cols=27 Identities=11% Similarity=0.235 Sum_probs=24.6
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHH
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~ 51 (73)
..++.+++.|...++++..+|+.||..
T Consensus 24 ~P~~~~~~~la~~~~l~~~qV~~WF~n 50 (56)
T smart00389 24 YPSREEREELAAKLGLSERQVKVWFQN 50 (56)
T ss_pred CCCHHHHHHHHHHHCcCHHHHHHhHHH
Confidence 567888999999999999999999975
No 15
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=92.39 E-value=0.48 Score=28.82 Aligned_cols=48 Identities=6% Similarity=0.015 Sum_probs=37.4
Q ss_pred ccCCCHHHHHHHHhcC---CCC----HHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 23 VQKYRPIALEDLLKQT---KFT----RQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 23 ~skls~e~l~~L~~~T---~Fs----~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
+..++..++..+.... .++ ..++..|++.|- ...+|.|+.++|..+..
T Consensus 24 ~~~Is~~El~~ll~~~~g~~~t~~~~~~~v~~i~~~~D-~d~dG~I~f~eF~~~~~ 78 (88)
T cd05030 24 PDTLYKKEFKQLVEKELPNFLKKEKNQKAIDKIFEDLD-TNQDGQLSFEEFLVLVI 78 (88)
T ss_pred cccCCHHHHHHHHHHHhhHhhccCCCHHHHHHHHHHcC-CCCCCcCcHHHHHHHHH
Confidence 4578899988877532 244 899999999994 35789999999988764
No 16
>PF13499 EF-hand_7: EF-hand domain pair; PDB: 1TCF_A 2TN4_A 1TN4_A 1A2X_A 2CT9_B 2OTG_B 2OS8_B 1SNL_A 3O4Y_A 3J04_E ....
Probab=92.16 E-value=0.28 Score=27.35 Aligned_cols=48 Identities=15% Similarity=0.027 Sum_probs=32.4
Q ss_pred cccCCCHHHHHHHHhcCCC--CHHHHHHH----HHHhhcccCCCCcCHHHHhhhh
Q psy12170 22 HVQKYRPIALEDLLKQTKF--TRQEIRVM----YRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T~F--s~~EIk~l----Yk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+.+.++.+++..+...... +..++... ++.+=. ..+|.|+.+||..+|
T Consensus 13 ~~G~i~~~el~~~~~~~~~~~~~~~~~~~~~~~~~~~D~-d~dG~i~~~Ef~~~~ 66 (66)
T PF13499_consen 13 GDGYISKEELRRALKHLGRDMSDEESDEMIDQIFREFDT-DGDGRISFDEFLNFM 66 (66)
T ss_dssp SSSEEEHHHHHHHHHHTTSHSTHHHHHHHHHHHHHHHTT-TSSSSEEHHHHHHHH
T ss_pred ccCCCCHHHHHHHHHHhcccccHHHHHHHHHHHHHHhCC-CCcCCCcHHHHhccC
Confidence 4457888888887666543 25555554 444422 378999999999875
No 17
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=92.03 E-value=0.89 Score=26.93 Aligned_cols=49 Identities=6% Similarity=0.036 Sum_probs=37.2
Q ss_pred cccCCCHHHHHHHHhc-------CCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 22 HVQKYRPIALEDLLKQ-------TKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 22 ~~skls~e~l~~L~~~-------T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
..+.++..++..+.+. ..++.++|..|++.|-. ..+|.|+.++|..+..
T Consensus 23 ~~G~Is~~el~~~l~~~~g~~~~~~~~~~ei~~i~~~~d~-~~~g~I~f~eF~~~~~ 78 (88)
T cd00213 23 DKDTLSKKELKELLETELPNFLKNQKDPEAVDKIMKDLDV-NKDGKVDFQEFLVLIG 78 (88)
T ss_pred CCCcCcHHHHHHHHHHHhhhhccCCCCHHHHHHHHHHhcc-CCCCcCcHHHHHHHHH
Confidence 4457888888876643 23469999999999954 4578999999988754
No 18
>cd00051 EFh EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Probab=92.01 E-value=0.93 Score=23.19 Aligned_cols=48 Identities=15% Similarity=0.056 Sum_probs=37.2
Q ss_pred cccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 22 HVQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+.+.+..+++..+.+.. ..+.++++.+++.|-. ..+|.|+.++|..+.
T Consensus 13 ~~g~l~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~ef~~~~ 62 (63)
T cd00051 13 GDGTISADELKAALKSLGEGLSEEEIDEMIREVDK-DGDGKIDFEEFLELM 62 (63)
T ss_pred CCCcCcHHHHHHHHHHhCCCCCHHHHHHHHHHhCC-CCCCeEeHHHHHHHh
Confidence 34568888888776665 6899999999998854 356899999998754
No 19
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=91.50 E-value=0.54 Score=28.87 Aligned_cols=50 Identities=6% Similarity=-0.053 Sum_probs=39.5
Q ss_pred ccc-CCCHHHHHHHHhc-----C--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhcC
Q psy12170 22 HVQ-KYRPIALEDLLKQ-----T--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTNG 72 (73)
Q Consensus 22 ~~s-kls~e~l~~L~~~-----T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~~ 72 (73)
+++ .++..++..+.+. . ..++++|..|.+..- ...+|.|+..+|..+-.+
T Consensus 22 gdG~~I~~~eL~~ll~~~~~~~lg~~~~~~~v~~~i~~~D-~n~dG~v~f~eF~~li~~ 79 (88)
T cd05027 22 GDKHKLKKSELKELINNELSHFLEEIKEQEVVDKVMETLD-SDGDGECDFQEFMAFVAM 79 (88)
T ss_pred CCcCEECHHHHHHHHHHHhHHHhcCCCCHHHHHHHHHHhC-CCCCCcCcHHHHHHHHHH
Confidence 455 4899999987776 3 468999999999883 457799999999877544
No 20
>PF00046 Homeobox: Homeobox domain not present here.; InterPro: IPR001356 The homeobox domain was first identified in a number of drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates [, , ]. Hox genes encode homeodomain-containing transcriptional regulators that operate differential genetic programs along the anterior-posterior axis of animal bodies []. The domain binds DNA through a helix-turn-helix (HTH) structure. The HTH motif is characterised by two alpha-helices, which make intimate contacts with the DNA and are joined by a short turn. The second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA []. The first helix helps to stabilise the structure. The motif is very similar in sequence and structure in a wide range of DNA-binding proteins (e.g., cro and repressor proteins, homeotic proteins, etc.). One of the principal differences between HTH motifs in these different proteins arises from the stereo-chemical requirement for glycine in the turn which is needed to avoid steric interference of the beta-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2DA3_A 1LFB_A 2LFB_A 2ECB_A 2DA5_A 3D1N_O 3A03_A 2XSD_C 3CMY_A 1AHD_P ....
Probab=91.03 E-value=0.31 Score=26.80 Aligned_cols=29 Identities=14% Similarity=0.186 Sum_probs=25.1
Q ss_pred ccCCCHHHHHHHHhcCCCCHHHHHHHHHH
Q psy12170 23 VQKYRPIALEDLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 23 ~skls~e~l~~L~~~T~Fs~~EIk~lYk~ 51 (73)
+..++.++++.|...++++...|+.||..
T Consensus 22 ~~~p~~~~~~~la~~l~l~~~~V~~WF~n 50 (57)
T PF00046_consen 22 NPYPSKEEREELAKELGLTERQVKNWFQN 50 (57)
T ss_dssp SSSCHHHHHHHHHHHHTSSHHHHHHHHHH
T ss_pred hccccccccccccccccccccccccCHHH
Confidence 34677788999999999999999999974
No 21
>PF13405 EF-hand_6: EF-hand domain; PDB: 2AMI_A 3QRX_A 1W7J_B 1OE9_B 1W7I_B 1KFU_S 1KFX_S 2BL0_B 1Y1X_B 3MSE_B ....
Probab=90.83 E-value=0.27 Score=24.38 Aligned_cols=27 Identities=11% Similarity=0.025 Sum_probs=19.5
Q ss_pred HHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 44 EIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 44 EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
+++.+|+.| -...+|.|+.+||+.+..
T Consensus 1 ~l~~~F~~~-D~d~dG~I~~~el~~~l~ 27 (31)
T PF13405_consen 1 RLREAFKMF-DKDGDGFIDFEELRAILR 27 (31)
T ss_dssp HHHHHHHHH--TTSSSEEEHHHHHHHHH
T ss_pred CHHHHHHHH-CCCCCCcCcHHHHHHHHH
Confidence 355556666 235899999999998865
No 22
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=90.69 E-value=0.86 Score=28.06 Aligned_cols=45 Identities=4% Similarity=-0.000 Sum_probs=36.5
Q ss_pred CCCHHHHHHHHhcC-------CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 25 KYRPIALEDLLKQT-------KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 25 kls~e~l~~L~~~T-------~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
.++.+++..+.... .-+..++.+|.+.+-. +.+|.|+.+||..+-
T Consensus 27 ~Ls~~Elk~ll~~e~~~~~~~~~~~~~~~~ll~~~D~-d~DG~I~f~EF~~l~ 78 (89)
T cd05023 27 QLSKTEFLSFMNTELASFTKNQKDPGVLDRMMKKLDL-NSDGQLDFQEFLNLI 78 (89)
T ss_pred eECHHHHHHHHHHhhhHhhcCCCCHHHHHHHHHHcCC-CCCCcCcHHHHHHHH
Confidence 78999998877553 4578999999998743 689999999998764
No 23
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=90.64 E-value=0.41 Score=28.42 Aligned_cols=27 Identities=15% Similarity=0.046 Sum_probs=19.1
Q ss_pred HHHHHHHHhhcc-cCCCCcCHHHHhhhh
Q psy12170 44 EIRVMYRGFKQF-LTFIGLNHPTLKDNT 70 (73)
Q Consensus 44 EIk~lYk~Fkke-cPsG~Lt~~EFk~i~ 70 (73)
+++..|+.|=+. +++|.|+.++|+.++
T Consensus 9 ~l~~~F~~~D~~~~~~G~Is~~el~~~l 36 (88)
T cd00213 9 TIIDVFHKYSGKEGDKDTLSKKELKELL 36 (88)
T ss_pred HHHHHHHHHhhccCCCCcCcHHHHHHHH
Confidence 344445555543 788999999999876
No 24
>PTZ00183 centrin; Provisional
Probab=90.58 E-value=0.46 Score=29.98 Aligned_cols=30 Identities=17% Similarity=0.153 Sum_probs=13.9
Q ss_pred CCCHHHHHHHHHHhhccc--CCCCcCHHHHhh
Q psy12170 39 KFTRQEIRVMYRGFKQFL--TFIGLNHPTLKD 68 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkkec--PsG~Lt~~EFk~ 68 (73)
+|+..+++.|.+.|..-. .+|.|+..+|..
T Consensus 10 ~~~~~~~~~~~~~F~~~D~~~~G~i~~~e~~~ 41 (158)
T PTZ00183 10 GLTEDQKKEIREAFDLFDTDGSGTIDPKELKV 41 (158)
T ss_pred CCCHHHHHHHHHHHHHhCCCCCCcccHHHHHH
Confidence 344445555544444422 344455555543
No 25
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=90.54 E-value=0.67 Score=28.44 Aligned_cols=50 Identities=8% Similarity=0.016 Sum_probs=38.3
Q ss_pred ccc-CCCHHHHHHHHhc-------CCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhcC
Q psy12170 22 HVQ-KYRPIALEDLLKQ-------TKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTNG 72 (73)
Q Consensus 22 ~~s-kls~e~l~~L~~~-------T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~~ 72 (73)
++. +++..++..+... ..-+..+|.++.+.+-. ..+|.|+.+||..+..+
T Consensus 24 gdg~~Is~~EL~~ll~~~~~~~~~~~~~~~~v~~i~~elD~-n~dG~Idf~EF~~l~~~ 81 (93)
T cd05026 24 GDRYKLSKGELKELLQRELTDFLSSQKDPMLVDKIMNDLDS-NKDNEVDFNEFVVLVAA 81 (93)
T ss_pred CCCCEECHHHHHHHHHHHhHHhcccccCHHHHHHHHHHhCC-CCCCCCCHHHHHHHHHH
Confidence 444 5899999887643 23478899999999954 57899999999887543
No 26
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=90.13 E-value=0.97 Score=27.24 Aligned_cols=51 Identities=10% Similarity=0.059 Sum_probs=38.8
Q ss_pred ccccc-CCCHHHHHHHHhc-C------CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 20 SKHVQ-KYRPIALEDLLKQ-T------KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 20 ~~~~s-kls~e~l~~L~~~-T------~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
..+.. ++++.++..+.+. - ..+..+++.+.+.+-. ..+|.|+..+|..+..
T Consensus 21 ~dg~G~~Is~~El~~~l~~~lg~~~~~~~s~~~v~~i~~~~D~-d~~G~I~f~eF~~l~~ 79 (92)
T cd05025 21 KEGDKYKLSKKELKDLLQTELSDFLDAQKDADAVDKIMKELDE-NGDGEVDFQEFVVLVA 79 (92)
T ss_pred ccCCCCeECHHHHHHHHHHHHHHHccCCCCHHHHHHHHHHHCC-CCCCcCcHHHHHHHHH
Confidence 34555 4899999887653 2 3589999999999844 4679999999988754
No 27
>smart00054 EFh EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Probab=89.89 E-value=0.42 Score=21.01 Aligned_cols=28 Identities=18% Similarity=0.203 Sum_probs=20.5
Q ss_pred HHHHHHHHhhcccCCCCcCHHHHhhhhcC
Q psy12170 44 EIRVMYRGFKQFLTFIGLNHPTLKDNTNG 72 (73)
Q Consensus 44 EIk~lYk~FkkecPsG~Lt~~EFk~i~~~ 72 (73)
|++++++.|-. ..+|.|+..+|..+.+.
T Consensus 1 ~~~~~f~~~d~-~~~g~i~~~e~~~~~~~ 28 (29)
T smart00054 1 ELKEAFRLFDK-DGDGKIDFEEFKDLLKA 28 (29)
T ss_pred CHHHHHHHHCC-CCCCcEeHHHHHHHHHh
Confidence 45667777744 35689999999988754
No 28
>PTZ00184 calmodulin; Provisional
Probab=89.13 E-value=0.75 Score=28.44 Aligned_cols=44 Identities=11% Similarity=0.079 Sum_probs=23.6
Q ss_pred cCCCHHHHHHHHhcC---CCCHHHHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 24 QKYRPIALEDLLKQT---KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 24 skls~e~l~~L~~~T---~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
+.++.+++..+. .. ..+..+++++++.|-.. .+|.|+.++|..+
T Consensus 26 G~i~~~e~~~~l-~~~~~~~~~~~~~~~~~~~d~~-~~g~i~~~ef~~~ 72 (149)
T PTZ00184 26 GTITTKELGTVM-RSLGQNPTEAELQDMINEVDAD-GNGTIDFPEFLTL 72 (149)
T ss_pred CcCCHHHHHHHH-HHhCCCCCHHHHHHHHHhcCcC-CCCcCcHHHHHHH
Confidence 345555554433 23 24455666666665432 4567777777654
No 29
>PLN02964 phosphatidylserine decarboxylase
Probab=88.06 E-value=0.49 Score=38.98 Aligned_cols=36 Identities=14% Similarity=0.075 Sum_probs=32.8
Q ss_pred cCCCHHHHHHHHhc--CCCCHHHHHHHHHHhhcccCCC
Q psy12170 24 QKYRPIALEDLLKQ--TKFTRQEIRVMYRGFKQFLTFI 59 (73)
Q Consensus 24 skls~e~l~~L~~~--T~Fs~~EIk~lYk~FkkecPsG 59 (73)
++++++.+..+++. |+|+.+|++.+++.|...||+|
T Consensus 119 ~~~s~n~lv~~~e~~~t~f~~kqi~elkeaF~lfD~dg 156 (644)
T PLN02964 119 NRLSKNTLVGYCELDLFDFVTQEPESACESFDLLDPSS 156 (644)
T ss_pred CCCCHHHhhhheeecHhhccHHHHHHHHHHHHHHCCCC
Confidence 47899999999999 9999999999999999988775
No 30
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=87.91 E-value=0.81 Score=27.80 Aligned_cols=32 Identities=6% Similarity=-0.086 Sum_probs=26.3
Q ss_pred CCCHHHHHHHHHHhhccc--CCCCcCHHHHhhhh
Q psy12170 39 KFTRQEIRVMYRGFKQFL--TFIGLNHPTLKDNT 70 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkkec--PsG~Lt~~EFk~i~ 70 (73)
.+|.+++..+.+.|..-+ .+|.|+.++|+.++
T Consensus 3 ~ls~~~~~~l~~~F~~~D~d~~G~Is~~el~~~l 36 (96)
T smart00027 3 AISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPIL 36 (96)
T ss_pred CCCHHHHHHHHHHHHHhCCCCCCeEeHHHHHHHH
Confidence 367889999999998855 57899999998875
No 31
>PTZ00183 centrin; Provisional
Probab=87.41 E-value=2.8 Score=26.28 Aligned_cols=49 Identities=12% Similarity=0.032 Sum_probs=35.8
Q ss_pred ccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhcC
Q psy12170 23 VQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTNG 72 (73)
Q Consensus 23 ~skls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~~ 72 (73)
.+.++.+++..+.... +++..+++.++..|-. ...|.|+.++|+.+..+
T Consensus 104 ~G~i~~~e~~~~l~~~~~~l~~~~~~~~~~~~d~-~~~g~i~~~ef~~~~~~ 154 (158)
T PTZ00183 104 TGKISLKNLKRVAKELGETITDEELQEMIDEADR-NGDGEISEEEFYRIMKK 154 (158)
T ss_pred CCcCcHHHHHHHHHHhCCCCCHHHHHHHHHHhCC-CCCCcCcHHHHHHHHhc
Confidence 3467777777655543 4899999999988854 35689999999887643
No 32
>PF14658 EF-hand_9: EF-hand domain
Probab=85.01 E-value=2.6 Score=25.36 Aligned_cols=47 Identities=13% Similarity=-0.077 Sum_probs=33.7
Q ss_pred cCCCHHHHH-HHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 24 QKYRPIALE-DLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 24 skls~e~l~-~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+...-..|- .|...| +.++.+|+.|++.+=-++..|.|+.+.|..|-
T Consensus 13 G~V~v~~l~~~Lra~~~~~p~e~~Lq~l~~elDP~g~~~~v~~d~F~~iM 62 (66)
T PF14658_consen 13 GRVPVSDLITYLRAVTGRSPEESELQDLINELDPEGRDGSVNFDTFLAIM 62 (66)
T ss_pred ceEeHHHHHHHHHHHcCCCCcHHHHHHHHHHhCCCCCCceEeHHHHHHHH
Confidence 334444544 455555 47999999999998666666789999998774
No 33
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=84.81 E-value=1.8 Score=26.78 Aligned_cols=49 Identities=8% Similarity=0.082 Sum_probs=37.5
Q ss_pred cccCCCHHHHHHHHhc---CCCCH-HHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 22 HVQKYRPIALEDLLKQ---TKFTR-QEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 22 ~~skls~e~l~~L~~~---T~Fs~-~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
.+..++..++..+..+ ..++. +++..+.+..- ...+|.|+.+||..+..
T Consensus 22 ~~g~i~~~ELk~ll~~elg~~ls~~~~v~~mi~~~D-~d~DG~I~F~EF~~l~~ 74 (89)
T cd05022 22 GKESLTASEFQELLTQQLPHLLKDVEGLEEKMKNLD-VNQDSKLSFEEFWELIG 74 (89)
T ss_pred CCCeECHHHHHHHHHHHhhhhccCHHHHHHHHHHhC-CCCCCCCcHHHHHHHHH
Confidence 4457899999987766 34677 89999987653 36789999999987653
No 34
>KOG0041|consensus
Probab=84.69 E-value=0.98 Score=33.29 Aligned_cols=33 Identities=24% Similarity=0.205 Sum_probs=29.0
Q ss_pred CCCCHHHHHHHHHHhhcc--cCCCCcCHHHHhhhh
Q psy12170 38 TKFTRQEIRVMYRGFKQF--LTFIGLNHPTLKDNT 70 (73)
Q Consensus 38 T~Fs~~EIk~lYk~Fkke--cPsG~Lt~~EFk~i~ 70 (73)
..|++++|+.+|+-|++. .-+|-|+.-|+|.+.
T Consensus 91 ~eFsrkqIk~~~~~Fk~yDe~rDgfIdl~ELK~mm 125 (244)
T KOG0041|consen 91 SEFSRKQIKDAESMFKQYDEDRDGFIDLMELKRMM 125 (244)
T ss_pred hHHHHHHHHHHHHHHHHhcccccccccHHHHHHHH
Confidence 379999999999999994 589999999998764
No 35
>cd04790 HTH_Cfa-like_unk Helix-Turn-Helix DNA binding domain of putative Cfa-like transcription regulators. Putative helix-turn-helix (HTH) MerR-like transcription regulator; conserved, Cfa-like, unknown proteins (~172 a.a.). The N-terminal domain of these proteins appears to be related to the HTH domain of Cfa, a cyclopropane fatty acid synthase. These Cfa-like proteins have a unique C-terminal domain with conserved histidines (motif HXXFX7HXXF). Based on sequence similarity of the N-terminal domains, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domain
Probab=84.24 E-value=2.6 Score=28.78 Aligned_cols=35 Identities=11% Similarity=0.064 Sum_probs=31.2
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHHhhcccCCC
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRGFKQFLTFI 59 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG 59 (73)
..+......|.+..+++++.+++|+..|-+..|.|
T Consensus 115 ~V~~~~w~~l~~~~g~~~~~m~~wh~~fe~~~p~~ 149 (172)
T cd04790 115 LVTKEKWVAILKAAGMDEADMRRWHIEFEKMEPEA 149 (172)
T ss_pred cCCHHHHHHHHHHcCCChHHHHHHHHHHHHhCcHH
Confidence 46667777888999999999999999999999988
No 36
>KOG0028|consensus
Probab=83.93 E-value=3.4 Score=29.23 Aligned_cols=48 Identities=15% Similarity=0.020 Sum_probs=38.8
Q ss_pred cccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 22 HVQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
..+++++..+..+.... +|+.+||+.+-..|- .|.+|.|+.+||..|-
T Consensus 119 ~~Gkis~~~lkrvakeLgenltD~El~eMIeEAd-~d~dgevneeEF~~im 168 (172)
T KOG0028|consen 119 KTGKISQRNLKRVAKELGENLTDEELMEMIEEAD-RDGDGEVNEEEFIRIM 168 (172)
T ss_pred CCCCcCHHHHHHHHHHhCccccHHHHHHHHHHhc-ccccccccHHHHHHHH
Confidence 44688888888877665 799999999988885 3788999999998774
No 37
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=83.83 E-value=5.6 Score=25.79 Aligned_cols=46 Identities=7% Similarity=-0.111 Sum_probs=29.2
Q ss_pred cccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 22 HVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+...++.++|..+. -...+..+.+|.+.|- ...+|.||.+||...+
T Consensus 61 ~DG~Ls~~EL~~~~--l~~~e~~~~~f~~~~D-~n~Dg~IS~~Ef~~cl 106 (116)
T cd00252 61 YDGKLSHHELAPIR--LDPNEHCIKPFFESCD-LDKDGSISLDEWCYCF 106 (116)
T ss_pred CCCcCCHHHHHHHH--ccchHHHHHHHHHHHC-CCCCCCCCHHHHHHHH
Confidence 44567777777665 2333455655555552 2478899999997654
No 38
>PTZ00184 calmodulin; Provisional
Probab=83.36 E-value=4.2 Score=24.99 Aligned_cols=48 Identities=15% Similarity=0.031 Sum_probs=34.7
Q ss_pred cccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 22 HVQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
..+.++++++..+.... .++.+++..+++.+-+ ..+|.|+.++|..+.
T Consensus 97 ~~g~i~~~e~~~~l~~~~~~~~~~~~~~~~~~~d~-~~~g~i~~~ef~~~~ 146 (149)
T PTZ00184 97 GNGFISAAELRHVMTNLGEKLTDEEVDEMIREADV-DGDGQINYEEFVKMM 146 (149)
T ss_pred CCCeEeHHHHHHHHHHHCCCCCHHHHHHHHHhcCC-CCCCcCcHHHHHHHH
Confidence 34567888887766555 3688888888876633 456899999998765
No 39
>PF13202 EF-hand_5: EF hand; PDB: 3DD4_A 2Q4U_A 2BE4_A 1UHJ_B 1UHI_A 1UHH_B 1EJ3_B 1UHK_A 2ZFD_A 1UHN_A ....
Probab=83.22 E-value=1.3 Score=21.32 Aligned_cols=23 Identities=4% Similarity=-0.027 Sum_probs=15.9
Q ss_pred HHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 46 RVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 46 k~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
+++++.+= ...+|.|+.+||.++
T Consensus 2 ~~~F~~~D-~d~DG~is~~E~~~~ 24 (25)
T PF13202_consen 2 KDAFQQFD-TDGDGKISFEEFQRL 24 (25)
T ss_dssp HHHHHHHT-TTSSSEEEHHHHHHH
T ss_pred HHHHHHHc-CCCCCcCCHHHHHHH
Confidence 44444442 257899999999874
No 40
>PF04546 Sigma70_ner: Sigma-70, non-essential region; InterPro: IPR007631 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. This domain is found in the primary vegetative sigma factor. Its function is unclear, and it can be removed without apparent loss of function [, ].; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1SIG_A 3IYD_F.
Probab=82.60 E-value=0.56 Score=32.75 Aligned_cols=48 Identities=19% Similarity=0.265 Sum_probs=35.1
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHHhhccc-CCCCcCHHHHhhhhcC
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRGFKQFL-TFIGLNHPTLKDNTNG 72 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~Fkkec-PsG~Lt~~EFk~i~~~ 72 (73)
+|+|..++.|...-.=.-..|+..=|..++.| -..++.+.+|.+.|+|
T Consensus 123 kl~~k~id~L~~~lr~~~~~Ir~~Er~i~~l~v~~~gmpR~~Fi~~f~g 171 (211)
T PF04546_consen 123 KLSPKQIDRLVEQLREIVERIRQQERRIMRLCVRRAGMPRKEFIKSFPG 171 (211)
T ss_dssp EE-HHHHHHHCHHHHCCCHHHHHHHHHHHHCCCTTTT--HHHCCCCCTT
T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHHHccC
Confidence 46666666666666556677777888888899 6699999999999998
No 41
>PF09820 AAA-ATPase_like: Predicted AAA-ATPase; InterPro: IPR018631 This entry is predicted to be an AAA-ATPase domain []. It is usually found together with IPR012547 from INTERPRO.
Probab=77.84 E-value=1.8 Score=31.52 Aligned_cols=40 Identities=18% Similarity=0.164 Sum_probs=31.2
Q ss_pred HHHhcCCCCHHHHHHHHHHhhcccC---CCCcCHHHHhhhhcC
Q psy12170 33 DLLKQTKFTRQEIRVMYRGFKQFLT---FIGLNHPTLKDNTNG 72 (73)
Q Consensus 33 ~L~~~T~Fs~~EIk~lYk~FkkecP---sG~Lt~~EFk~i~~~ 72 (73)
....-.+||++||+.+++.+....+ +-....++.+..|+|
T Consensus 218 ~f~~~~GFT~~Ev~~ll~~~~~~~~~~~~~~~~~~~lk~wYdG 260 (284)
T PF09820_consen 218 RFSEYFGFTEEEVETLLKYYIENLAEEQDREELLEELKEWYDG 260 (284)
T ss_pred hHhhhcCcCHHHHHHHHHHHHHHhhhccchHHHHHHHHHHcCC
Confidence 4556679999999999999966543 445567788999988
No 42
>PF14788 EF-hand_10: EF hand; PDB: 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B 1DJZ_B 1DJY_B 1DJX_B 1QAT_A 1DJH_A ....
Probab=77.72 E-value=3.9 Score=23.55 Aligned_cols=45 Identities=11% Similarity=0.091 Sum_probs=28.4
Q ss_pred CCHHHHHHHHhcCCCCHHH--HHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 26 YRPIALEDLLKQTKFTRQE--IRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 26 ls~e~l~~L~~~T~Fs~~E--Ik~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
++-.+++.|.+..|..-.+ ...||+.=- ...+|.|..+||...|.
T Consensus 2 msf~Evk~lLk~~NI~~~~~yA~~LFq~~D-~s~~g~Le~~Ef~~Fy~ 48 (51)
T PF14788_consen 2 MSFKEVKKLLKMMNIEMDDEYARQLFQECD-KSQSGRLEGEEFEEFYK 48 (51)
T ss_dssp BEHHHHHHHHHHTT----HHHHHHHHHHH--SSSSSEBEHHHHHHHHH
T ss_pred CCHHHHHHHHHHHccCcCHHHHHHHHHHhc-ccCCCCccHHHHHHHHH
Confidence 4556888888888765444 444443221 25789999999999885
No 43
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=77.21 E-value=3.4 Score=25.37 Aligned_cols=29 Identities=14% Similarity=0.039 Sum_probs=19.4
Q ss_pred HHHHHHHHHhhc---ccC-CCCcCHHHHhhhhc
Q psy12170 43 QEIRVMYRGFKQ---FLT-FIGLNHPTLKDNTN 71 (73)
Q Consensus 43 ~EIk~lYk~Fkk---ecP-sG~Lt~~EFk~i~~ 71 (73)
+-|..|.+.|++ ... .|.|+.+||+.++.
T Consensus 6 ~~i~~l~~~F~~y~~~dg~~~~Ls~~Elk~ll~ 38 (89)
T cd05023 6 RCIESLIAVFQKYAGKDGDSYQLSKTEFLSFMN 38 (89)
T ss_pred HHHHHHHHHHHHHhccCCCcCeECHHHHHHHHH
Confidence 445666666666 223 34899999988764
No 44
>PF13518 HTH_28: Helix-turn-helix domain
Probab=74.56 E-value=2.9 Score=22.12 Aligned_cols=31 Identities=3% Similarity=0.022 Sum_probs=25.3
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhcccCCCC
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQFLTFIG 60 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~ 60 (73)
.+.++....+.+..-|.+|.+.|....+.|.
T Consensus 14 s~~~~a~~~gis~~tv~~w~~~y~~~G~~~l 44 (52)
T PF13518_consen 14 SVREIAREFGISRSTVYRWIKRYREGGIEGL 44 (52)
T ss_pred CHHHHHHHHCCCHhHHHHHHHHHHhcCHHHh
Confidence 4567888889999999999999998655543
No 45
>PF12174 RST: RCD1-SRO-TAF4 (RST) plant domain; InterPro: IPR022003 This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors.
Probab=74.42 E-value=2.9 Score=25.19 Aligned_cols=23 Identities=13% Similarity=0.288 Sum_probs=19.0
Q ss_pred HHHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 43 QEIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 43 ~EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
..|..+|..|++ +.|++++|.+.
T Consensus 28 ~~l~~~Y~~~k~----~kIsR~~fvr~ 50 (70)
T PF12174_consen 28 DLLQKHYEEFKK----KKISREEFVRK 50 (70)
T ss_pred HHHHHHHHHHHH----CCCCHHHHHHH
Confidence 457788999995 89999999764
No 46
>PF09693 Phage_XkdX: Phage uncharacterised protein (Phage_XkdX); InterPro: IPR010022 This entry is represented by Bacteriophage 69, Orf86. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry identifies a family of small (about 50 amino acid) phage proteins, found in at least 12 different phage and prophage regions of Gram-positive bacteria. In a number of these phage, the gene for this protein is found near the holin and endolysin genes.
Probab=73.97 E-value=4.1 Score=21.89 Aligned_cols=16 Identities=13% Similarity=0.007 Sum_probs=13.7
Q ss_pred cCCCCcCHHHHhhhhc
Q psy12170 56 LTFIGLNHPTLKDNTN 71 (73)
Q Consensus 56 cPsG~Lt~~EFk~i~~ 71 (73)
.+.|.||.+||+.|+.
T Consensus 22 V~~g~IT~eey~eITG 37 (40)
T PF09693_consen 22 VEAGWITKEEYKEITG 37 (40)
T ss_pred hhcCeECHHHHHHhhC
Confidence 4679999999999974
No 47
>PF12763 EF-hand_4: Cytoskeletal-regulatory complex EF hand; PDB: 2QPT_A 2KSP_A 2KFG_A 2JQ6_A 2KFH_A 2KFF_A 1IQ3_A 3FIA_A 2KHN_A 2KGR_A ....
Probab=73.84 E-value=6.3 Score=25.14 Aligned_cols=46 Identities=7% Similarity=0.053 Sum_probs=37.4
Q ss_pred cccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhh
Q psy12170 22 HVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKD 68 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~ 68 (73)
..+.++.+....+..+++++.+.|.+++. .--..-+|.|+.+||.-
T Consensus 22 ~~g~isg~~a~~~f~~S~L~~~~L~~IW~-LaD~~~dG~L~~~EF~i 67 (104)
T PF12763_consen 22 QDGKISGDQAREFFMKSGLPRDVLAQIWN-LADIDNDGKLDFEEFAI 67 (104)
T ss_dssp STTEEEHHHHHHHHHHTTSSHHHHHHHHH-HH-SSSSSEEEHHHHHH
T ss_pred CCCeEeHHHHHHHHHHcCCCHHHHHHHHh-hhcCCCCCcCCHHHHHH
Confidence 34578888888889999999999999996 33346899999999953
No 48
>TIGR01669 phage_XkdX phage uncharacterized protein, XkdX family. This model represents a family of small (about 50 amino acid) phage proteins, found in at least 12 different phage and prophage regions of Gram-positive bacteria. In a number of these phage, the gene for this protein is found near the holin and endolysin genes.
Probab=73.72 E-value=12 Score=20.78 Aligned_cols=36 Identities=6% Similarity=0.023 Sum_probs=25.8
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
+.+.+.-..-.+|.++|+.. .+-|.||.+|++.|+.
T Consensus 7 e~iK~~Y~~g~~t~e~v~~~-------V~~~~IT~eey~eITG 42 (45)
T TIGR01669 7 EKVKTYYLWGYYSNEDVNKF-------VEKKLITREQYKVITG 42 (45)
T ss_pred HHHHHHHHcCCCCHHHHHHH-------hhcCccCHHHHHHHhC
Confidence 45666666667777776543 3559999999999985
No 49
>PF13384 HTH_23: Homeodomain-like domain; PDB: 2X48_C.
Probab=73.42 E-value=2.9 Score=22.24 Aligned_cols=31 Identities=6% Similarity=0.042 Sum_probs=16.5
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhcccCCCC
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQFLTFIG 60 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~ 60 (73)
...++.+..+.|+.-|.+|.++|.+.-+.|.
T Consensus 19 s~~~ia~~lgvs~~Tv~~w~kr~~~~G~~gL 49 (50)
T PF13384_consen 19 SIREIAKRLGVSRSTVYRWIKRYREEGLEGL 49 (50)
T ss_dssp -HHHHHHHHTS-HHHHHHHHT----------
T ss_pred CHHHHHHHHCcCHHHHHHHHHHccccccccc
Confidence 3456778889999999999999987655553
No 50
>KOG0027|consensus
Probab=71.58 E-value=13 Score=24.04 Aligned_cols=50 Identities=18% Similarity=0.094 Sum_probs=33.2
Q ss_pred cccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhcC
Q psy12170 22 HVQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTNG 72 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~~ 72 (73)
+.+.++..++..+.... .++..++..+.+.+-. ..+|.++-.+|..+..+
T Consensus 98 ~~G~Is~~el~~~l~~lg~~~~~~e~~~mi~~~d~-d~dg~i~f~ef~~~m~~ 149 (151)
T KOG0027|consen 98 GDGFISASELKKVLTSLGEKLTDEECKEMIREVDV-DGDGKVNFEEFVKMMSG 149 (151)
T ss_pred CCCcCcHHHHHHHHHHhCCcCCHHHHHHHHHhcCC-CCCCeEeHHHHHHHHhc
Confidence 44567777777655554 5667777777777643 45677777777776654
No 51
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=68.53 E-value=6.7 Score=23.93 Aligned_cols=26 Identities=12% Similarity=0.102 Sum_probs=18.4
Q ss_pred HHHHHHHhhc-ccCCCCcCHHHHhhhh
Q psy12170 45 IRVMYRGFKQ-FLTFIGLNHPTLKDNT 70 (73)
Q Consensus 45 Ik~lYk~Fkk-ecPsG~Lt~~EFk~i~ 70 (73)
|-.+|..|-. .+++|.|+.+||+.+.
T Consensus 12 ~i~~F~~y~~~~~~~g~Is~~EL~~~l 38 (88)
T cd05029 12 LVAIFHKYSGREGDKNTLSKKELKELI 38 (88)
T ss_pred HHHHHHHHHccCCCCCEECHHHHHHHH
Confidence 4455666666 4568888888888776
No 52
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=67.07 E-value=10 Score=23.14 Aligned_cols=27 Identities=11% Similarity=0.068 Sum_probs=17.4
Q ss_pred HHHHHHHHhhcccCCC-CcCHHHHhhhh
Q psy12170 44 EIRVMYRGFKQFLTFI-GLNHPTLKDNT 70 (73)
Q Consensus 44 EIk~lYk~FkkecPsG-~Lt~~EFk~i~ 70 (73)
.|++.|+.|-....+| .|+..||+.+.
T Consensus 11 ~~~~~F~~~dd~dgdg~~Is~~EL~~ll 38 (93)
T cd05026 11 TLIRIFHNYSGKEGDRYKLSKGELKELL 38 (93)
T ss_pred HHHHHHHHHHccCCCCCEECHHHHHHHH
Confidence 4555566665555666 48888887765
No 53
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=66.74 E-value=28 Score=24.10 Aligned_cols=29 Identities=10% Similarity=0.210 Sum_probs=16.8
Q ss_pred CHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 41 TRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 41 s~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
+..+|.++...|=. .+|.|+..+|..+++
T Consensus 54 s~~ei~~l~~~~d~--~~~~idf~~Fl~~ms 82 (160)
T COG5126 54 SEAEINKLFEEIDA--GNETVDFPEFLTVMS 82 (160)
T ss_pred cHHHHHHHHHhccC--CCCccCHHHHHHHHH
Confidence 44444444444433 457888888877654
No 54
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=64.44 E-value=11 Score=22.69 Aligned_cols=28 Identities=11% Similarity=0.077 Sum_probs=18.8
Q ss_pred HHHHHHHHHhhccc-CCCCcCHHHHhhhh
Q psy12170 43 QEIRVMYRGFKQFL-TFIGLNHPTLKDNT 70 (73)
Q Consensus 43 ~EIk~lYk~Fkkec-PsG~Lt~~EFk~i~ 70 (73)
.+|...|+.|-... .+|.|+.++|+.++
T Consensus 8 ~~l~~~F~~~D~~dg~dG~Is~~El~~~l 36 (94)
T cd05031 8 ESLILTFHRYAGKDGDKNTLSRKELKKLM 36 (94)
T ss_pred HHHHHHHHHHhccCCCCCeECHHHHHHHH
Confidence 44556666665423 36899999998775
No 55
>PRK12722 transcriptional activator FlhC; Provisional
Probab=63.63 E-value=12 Score=26.61 Aligned_cols=32 Identities=19% Similarity=0.259 Sum_probs=26.6
Q ss_pred HHHHHhcCCCCHHHHHHHHHHhhcc-cCCCCcC
Q psy12170 31 LEDLLKQTKFTRQEIRVMYRGFKQF-LTFIGLN 62 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk~Fkke-cPsG~Lt 62 (73)
++.|...|++++..+.++||...-. -|.|.+-
T Consensus 26 ~q~leset~Ls~~rl~~Lyke~~G~spPkG~lP 58 (187)
T PRK12722 26 LQVLESETQLSRERLIKLYKELRGVSPPKGMLP 58 (187)
T ss_pred HHHHHHHHCcCHHHHHHHHHHHcCCCCCCCCCC
Confidence 4558888999999999999999765 5888764
No 56
>PF14513 DAG_kinase_N: Diacylglycerol kinase N-terminus; PDB: 1TUZ_A.
Probab=62.53 E-value=6.2 Score=26.65 Aligned_cols=45 Identities=11% Similarity=0.250 Sum_probs=32.0
Q ss_pred cCCCHHHHHHHHhcCCCCHHHHHHHHHHhh------cccCCCCcCHHHHhh
Q psy12170 24 QKYRPIALEDLLKQTKFTRQEIRVMYRGFK------QFLTFIGLNHPTLKD 68 (73)
Q Consensus 24 skls~e~l~~L~~~T~Fs~~EIk~lYk~Fk------kecPsG~Lt~~EFk~ 68 (73)
..|+|.+-.+|.+-..+|.+-|+..-+.|. +.+|.+.|+-+-|+.
T Consensus 6 ~~lsp~eF~qLq~y~eys~kklkdvl~eF~~~g~~~~~~~~~~Id~egF~~ 56 (138)
T PF14513_consen 6 VSLSPEEFAQLQKYSEYSTKKLKDVLKEFHGDGSLAKYNPEEPIDYEGFKL 56 (138)
T ss_dssp S-S-HHHHHHHHHHHHH----HHHHHHHH-HTSGGGGGEETTEE-HHHHHH
T ss_pred eccCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccCCCCCcCHHHHHH
Confidence 479999999999999999999999999995 356888888888865
No 57
>KOG0027|consensus
Probab=62.25 E-value=14 Score=24.02 Aligned_cols=46 Identities=15% Similarity=0.096 Sum_probs=31.7
Q ss_pred cCCCHHHHHHHHhcCC-------CCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 24 QKYRPIALEDLLKQTK-------FTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 24 skls~e~l~~L~~~T~-------Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+.+.-.+...|+.... -+.++++..++-|=++ .+|.|+..+|+.+-
T Consensus 59 g~I~~~eF~~l~~~~~~~~~~~~~~~~el~eaF~~fD~d-~~G~Is~~el~~~l 111 (151)
T KOG0027|consen 59 GTIDFEEFLDLMEKLGEEKTDEEASSEELKEAFRVFDKD-GDGFISASELKKVL 111 (151)
T ss_pred CeEcHHHHHHHHHhhhcccccccccHHHHHHHHHHHccC-CCCcCcHHHHHHHH
Confidence 3555555544444322 2456888888888766 88999999999874
No 58
>PLN02964 phosphatidylserine decarboxylase
Probab=62.13 E-value=23 Score=29.46 Aligned_cols=48 Identities=8% Similarity=-0.054 Sum_probs=36.1
Q ss_pred ccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 23 VQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 23 ~skls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
...++-++...+.... ..++++|+.+++.|-+ .++|.|+.+||+.+..
T Consensus 193 dG~IdfdEFl~lL~~lg~~~seEEL~eaFk~fDk-DgdG~Is~dEL~~vL~ 242 (644)
T PLN02964 193 DGQLSFSEFSDLIKAFGNLVAANKKEELFKAADL-NGDGVVTIDELAALLA 242 (644)
T ss_pred CCeEcHHHHHHHHHHhccCCCHHHHHHHHHHhCC-CCCCcCCHHHHHHHHH
Confidence 4567777766544443 4789999999999965 4689999999998753
No 59
>PF13011 LZ_Tnp_IS481: leucine-zipper of insertion element IS481
Probab=60.43 E-value=13 Score=23.39 Aligned_cols=31 Identities=3% Similarity=-0.148 Sum_probs=25.6
Q ss_pred HHHhcCCCCHHHHHHHHHHhhcccCCCCcCH
Q psy12170 33 DLLKQTKFTRQEIRVMYRGFKQFLTFIGLNH 63 (73)
Q Consensus 33 ~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~ 63 (73)
...+..+.|..-..+|..+|..+++.|..+.
T Consensus 30 ~aA~~~gVS~~Ta~kW~~Ryra~G~~GL~DR 60 (85)
T PF13011_consen 30 HAAAEFGVSRRTAYKWLARYRAEGEAGLQDR 60 (85)
T ss_pred HHHHHhCCCHHHHHHHHHHHHHcCccccccc
Confidence 3455678899999999999999988887664
No 60
>PRK04387 hypothetical protein; Provisional
Probab=58.66 E-value=9.9 Score=24.27 Aligned_cols=23 Identities=22% Similarity=0.330 Sum_probs=20.4
Q ss_pred CCCCHHHHHHHHHHhhcccCCCC
Q psy12170 38 TKFTRQEIRVMYRGFKQFLTFIG 60 (73)
Q Consensus 38 T~Fs~~EIk~lYk~FkkecPsG~ 60 (73)
.+..++++...|++||+-.|+-.
T Consensus 32 ~gv~re~ll~~Y~~FK~VVpsK~ 54 (90)
T PRK04387 32 KGVDAEELLDAYRRFKEIVPSKA 54 (90)
T ss_pred cCCcHHHHHHHHHHHHHhcCChH
Confidence 58899999999999999999853
No 61
>PF05280 FlhC: Flagellar transcriptional activator (FlhC); InterPro: IPR007944 This family consists of several bacterial flagellar transcriptional activator (FlhC) proteins. FlhC combines with FlhD to form a regulatory complex in Escherichia coli, this complex has been shown to be a global regulator involved in many cellular processes as well as a flagellar transcriptional activator [].; GO: 0003677 DNA binding, 0030092 regulation of flagellum assembly, 0045893 positive regulation of transcription, DNA-dependent; PDB: 2AVU_E.
Probab=58.63 E-value=12 Score=26.16 Aligned_cols=30 Identities=20% Similarity=0.267 Sum_probs=19.2
Q ss_pred HHhcCCCCHHHHHHHHHHhhcc-cCCCCcCH
Q psy12170 34 LLKQTKFTRQEIRVMYRGFKQF-LTFIGLNH 63 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~Fkke-cPsG~Lt~ 63 (73)
|...|+++.+.+.+||+...-. -|.|.+-.
T Consensus 29 l~~et~Ls~~rl~~Lykel~G~sppkG~lP~ 59 (175)
T PF05280_consen 29 LESETGLSRERLRRLYKELHGVSPPKGMLPF 59 (175)
T ss_dssp HHHHSSS-HHHHHHHHHHHCSS----S----
T ss_pred HHHHHCCCHHHHHHHHHHHcCCCCCCCCCCC
Confidence 6678999999999999998755 57887643
No 62
>PF09966 DUF2200: Uncharacterized protein conserved in bacteria (DUF2200); InterPro: IPR014580 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.; PDB: 3C9P_A.
Probab=58.01 E-value=13 Score=24.66 Aligned_cols=31 Identities=10% Similarity=0.199 Sum_probs=21.5
Q ss_pred CHHHHHH-HHhcCCCCHHHHHHH------HHHhhcccC
Q psy12170 27 RPIALED-LLKQTKFTRQEIRVM------YRGFKQFLT 57 (73)
Q Consensus 27 s~e~l~~-L~~~T~Fs~~EIk~l------Yk~FkkecP 57 (73)
+++++++ ++.-|+|+.++|+.+ |..|-.+.|
T Consensus 24 t~~Evd~vi~WLTGy~~~~l~~~~~~~~~~~~FF~~AP 61 (111)
T PF09966_consen 24 TKEEVDQVIRWLTGYDQEELQAQIESKVTFETFFAQAP 61 (111)
T ss_dssp -HHHHHHHHHHHH---HHHHHHHTTS--BHHHHHHT-S
T ss_pred CHHHHHHHHHHHhcCCHHHHHHHHHcCCCHHHHHHHcc
Confidence 4567777 677799999999999 888888766
No 63
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=57.78 E-value=17 Score=22.37 Aligned_cols=27 Identities=4% Similarity=-0.122 Sum_probs=19.1
Q ss_pred HHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 44 EIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 44 EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
.|...|+.|-+...+|.|+..||+.+-
T Consensus 9 ~l~~~F~~fd~~~~~g~i~~~ELk~ll 35 (89)
T cd05022 9 TLVSNFHKASVKGGKESLTASEFQELL 35 (89)
T ss_pred HHHHHHHHHhCCCCCCeECHHHHHHHH
Confidence 345556666543678899999998864
No 64
>KOG0488|consensus
Probab=56.72 E-value=13 Score=27.99 Aligned_cols=27 Identities=7% Similarity=0.129 Sum_probs=24.2
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHH
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~ 51 (73)
-|+..+=.+|.+..++|..+|+.||+.
T Consensus 196 YLS~~DR~~LA~~LgLTdaQVKtWfQN 222 (309)
T KOG0488|consen 196 YLSVADRIELAASLGLTDAQVKTWFQN 222 (309)
T ss_pred cccHHHHHHHHHHcCCchhhHHHHHhh
Confidence 477788788999999999999999987
No 65
>COG4875 Uncharacterized protein conserved in bacteria with a cystatin-like fold [Function unknown]
Probab=55.54 E-value=11 Score=25.94 Aligned_cols=30 Identities=17% Similarity=0.184 Sum_probs=26.1
Q ss_pred HHhcCCCCHHHHHHHHHHhhcccCCCCcCH
Q psy12170 34 LLKQTKFTRQEIRVMYRGFKQFLTFIGLNH 63 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~ 63 (73)
++.+-..|+.||...|+.|.+.-|-|.|+-
T Consensus 72 ~Sn~vR~s~~ei~DYF~~FLk~KPqG~Ids 101 (156)
T COG4875 72 MSNQVRSSRSEILDYFSHFLKLKPQGYIDS 101 (156)
T ss_pred cccccccCHHHHHHHHHHHhccCCcceecc
Confidence 555667899999999999999999998864
No 66
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=53.95 E-value=28 Score=17.48 Aligned_cols=22 Identities=0% Similarity=0.025 Sum_probs=14.3
Q ss_pred HHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 44 EIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 44 EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
+|++|-..| -+|.||.+||.+.
T Consensus 4 ~L~~L~~l~----~~G~IseeEy~~~ 25 (31)
T PF09851_consen 4 RLEKLKELY----DKGEISEEEYEQK 25 (31)
T ss_pred HHHHHHHHH----HcCCCCHHHHHHH
Confidence 444444444 3699999999654
No 67
>PF15187 Augurin: Oesophageal cancer-related gene 4
Probab=53.51 E-value=8.9 Score=25.42 Aligned_cols=16 Identities=19% Similarity=0.409 Sum_probs=13.7
Q ss_pred CCCHHHHHHHHHHhhc
Q psy12170 39 KFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkk 54 (73)
.=|+-++++||..|+=
T Consensus 44 DRsrPdVQQW~qQFlY 59 (114)
T PF15187_consen 44 DRSRPDVQQWYQQFLY 59 (114)
T ss_pred ccCCHHHHHHHHHHHH
Confidence 4588999999999984
No 68
>KOG0031|consensus
Probab=51.20 E-value=52 Score=23.30 Aligned_cols=46 Identities=22% Similarity=0.189 Sum_probs=34.0
Q ss_pred cCCCHHHHHHHHhc--CCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 24 QKYRPIALEDLLKQ--TKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 24 skls~e~l~~L~~~--T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+++..+.+.+|... -.|+.+||..+|+.+-- .-.|.++-..|..+-
T Consensus 116 G~I~~d~lre~Ltt~gDr~~~eEV~~m~r~~p~-d~~G~~dy~~~~~~i 163 (171)
T KOG0031|consen 116 GKIDEDYLRELLTTMGDRFTDEEVDEMYREAPI-DKKGNFDYKAFTYII 163 (171)
T ss_pred CccCHHHHHHHHHHhcccCCHHHHHHHHHhCCc-ccCCceeHHHHHHHH
Confidence 46777777774433 38999999999998742 236999999997653
No 69
>PRK12860 transcriptional activator FlhC; Provisional
Probab=50.08 E-value=28 Score=24.83 Aligned_cols=33 Identities=18% Similarity=0.206 Sum_probs=26.5
Q ss_pred HHHHHhcCCCCHHHHHHHHHHhhcc-cCCCCcCH
Q psy12170 31 LEDLLKQTKFTRQEIRVMYRGFKQF-LTFIGLNH 63 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk~Fkke-cPsG~Lt~ 63 (73)
++.|...|++++..+.++||.-.-. -|.|.+-.
T Consensus 26 lq~leset~Ls~~rl~~Lyke~~G~SpPkG~lP~ 59 (189)
T PRK12860 26 LQVLESETTLSRDRLIRLYKEVRGVSPPKGMLPF 59 (189)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHcCCCCCCCCCCC
Confidence 4557888999999999999988665 48887643
No 70
>KOG0487|consensus
Probab=49.77 E-value=6.7 Score=29.88 Aligned_cols=25 Identities=8% Similarity=0.266 Sum_probs=20.7
Q ss_pred CCHHHHHHHHhcCCCCHHHHHHHHH
Q psy12170 26 YRPIALEDLLKQTKFTRQEIRVMYR 50 (73)
Q Consensus 26 ls~e~l~~L~~~T~Fs~~EIk~lYk 50 (73)
++++.=-+|.+.+|+|+.+|+-||+
T Consensus 260 itkeKR~ElSr~lNLTeRQVKIWFQ 284 (308)
T KOG0487|consen 260 ITKEKRLELSRTLNLTERQVKIWFQ 284 (308)
T ss_pred HhHHHHHHHHHhcccchhheeeeeh
Confidence 4555555789999999999999987
No 71
>PF09279 EF-hand_like: Phosphoinositide-specific phospholipase C, efhand-like; InterPro: IPR015359 This domain is predominantly found in the enzyme phosphoinositol-specific phospholipase C. It adopts a structure consisting of a core of four alpha helices, in an EF like fold, and is required for functioning of the enzyme []. ; PDB: 3OHM_B 3QR0_A 2ZKM_X 2FJU_B 3QR1_D 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B ....
Probab=49.65 E-value=19 Score=20.99 Aligned_cols=24 Identities=13% Similarity=0.223 Sum_probs=15.5
Q ss_pred HHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 44 EIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 44 EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
||+.+|+.+.. ..+.||.++|.+.
T Consensus 1 ei~~if~~ys~--~~~~mt~~~f~~F 24 (83)
T PF09279_consen 1 EIEEIFRKYSS--DKEYMTAEEFRRF 24 (83)
T ss_dssp HHHHHHHHHCT--TSSSEEHHHHHHH
T ss_pred CHHHHHHHHhC--CCCcCCHHHHHHH
Confidence 56667777743 4667777777653
No 72
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=49.14 E-value=32 Score=22.24 Aligned_cols=28 Identities=14% Similarity=0.090 Sum_probs=19.3
Q ss_pred HHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 42 RQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 42 ~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+.+|+-.|..|=+ ..+|.|+.+|++.+.
T Consensus 47 ~~~l~w~F~~lD~-d~DG~Ls~~EL~~~~ 74 (116)
T cd00252 47 KDPVGWMFNQLDG-NYDGKLSHHELAPIR 74 (116)
T ss_pred HHHHHHHHHHHCC-CCCCcCCHHHHHHHH
Confidence 4445555555533 499999999998764
No 73
>PF05344 DUF746: Domain of Unknown Function (DUF746); InterPro: IPR008008 This is a short conserved region found in some transposons.
Probab=48.83 E-value=25 Score=21.18 Aligned_cols=33 Identities=12% Similarity=0.040 Sum_probs=26.3
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhc----ccCCCCcC
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQ----FLTFIGLN 62 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~Fkk----ecPsG~Lt 62 (73)
.+.+....-+.+..-|.+|-+.|.+ .+|||...
T Consensus 15 s~~~Aa~~lG~~~~~v~~wv~~fR~wll~LDPSG~~E 51 (65)
T PF05344_consen 15 SVAQAADRLGTDPGTVRRWVRMFRQWLLQLDPSGHWE 51 (65)
T ss_pred cHHHHHHHHCcCHHHHHHHHHHHHHHHHHcCCCCChH
Confidence 3556666778999999999999988 36999764
No 74
>TIGR01565 homeo_ZF_HD homeobox domain, ZF-HD class. This model represents a class of homoebox domain that differs substantially from the typical homoebox domain described in pfam model pfam00046. It is found in both C4 and C3 plants.
Probab=48.35 E-value=9.6 Score=22.19 Aligned_cols=26 Identities=15% Similarity=0.250 Sum_probs=22.5
Q ss_pred CCHHHHHHHHhcCCCCHHHHHHHHHH
Q psy12170 26 YRPIALEDLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 26 ls~e~l~~L~~~T~Fs~~EIk~lYk~ 51 (73)
++..+.+.|+..+++++..|+-|+..
T Consensus 30 ~~~~~r~~la~~lgl~~~vvKVWfqN 55 (58)
T TIGR01565 30 KRREEVREFCEEIGVTRKVFKVWMHN 55 (58)
T ss_pred CCHHHHHHHHHHhCCCHHHeeeeccc
Confidence 66778889999999999999999764
No 75
>KOG0483|consensus
Probab=47.91 E-value=26 Score=24.98 Aligned_cols=29 Identities=17% Similarity=0.330 Sum_probs=25.9
Q ss_pred ccCCCHHHHHHHHhcCCCCHHHHHHHHHH
Q psy12170 23 VQKYRPIALEDLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 23 ~skls~e~l~~L~~~T~Fs~~EIk~lYk~ 51 (73)
+.+|.|+....|.+..+++..+|.-||..
T Consensus 72 ~~~L~p~~K~~LAk~LgL~pRQVavWFQN 100 (198)
T KOG0483|consen 72 EKKLEPERKKKLAKELGLQPRQVAVWFQN 100 (198)
T ss_pred ccccChHHHHHHHHhhCCChhHHHHHHhh
Confidence 34799999999999999999999999874
No 76
>PF03871 RNA_pol_Rpb5_N: RNA polymerase Rpb5, N-terminal domain; InterPro: IPR005571 Prokaryotes contain a single DNA-dependent RNA polymerase (RNAP; 2.7.7.6 from EC) that is responsible for the transcription of all genes, while eukaryotes have three classes of RNAPs (I-III) that transcribe different sets of genes. Each class of RNA polymerase is an assemblage of ten to twelve different polypeptides. Certain subunits of RNAPs, including RPB5 (POLR2E in mammals), are common to all three eukaryotic polymerases. RPB5 plays a role in the transcription activation process. Eukaryotic RPB5 has a bipartite structure consisting of a unique N-terminal region, plus a C-terminal region that is structurally homologous to the prokaryotic RPB5 homologue, subunit H (gene rpoH) (IPR000783 from INTERPRO) [, , , ]. This entry represents the N-terminal domain of eukaryotic RPB5, which has a core structure consisting of 3 layers alpha/beta/alpha []. The N-terminal domain is involved in DNA binding and is part of the jaw module in the RNA pol II structure []. This module is important for positioning the downstream DNA.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent, 0005634 nucleus; PDB: 3H0G_Q 1DZF_A 3GTG_E 2VUM_E 3GTP_E 3GTO_E 3S17_E 3S1R_E 1I3Q_E 1I50_E ....
Probab=47.35 E-value=32 Score=21.32 Aligned_cols=31 Identities=10% Similarity=0.181 Sum_probs=21.7
Q ss_pred HHHHHHHHHHhhccc----CCC--------CcCHHHHhhhhcC
Q psy12170 42 RQEIRVMYRGFKQFL----TFI--------GLNHPTLKDNTNG 72 (73)
Q Consensus 42 ~~EIk~lYk~Fkkec----PsG--------~Lt~~EFk~i~~~ 72 (73)
..|+.+||+..+.-+ --| .++.++|++.|..
T Consensus 4 ~~e~~rl~rirrTv~eMl~DRGY~V~~~el~~s~~~F~~~~~~ 46 (93)
T PF03871_consen 4 DEEVSRLFRIRRTVMEMLRDRGYLVSEEELNMSLEEFKEKYGE 46 (93)
T ss_dssp THHHHHHHHHHCCCCCCCCCCTEE--CCCCS--HHHHHHHCB-
T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccChhhhcCCHHHHHHHHcc
Confidence 479999999998843 223 6789999998863
No 77
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=46.96 E-value=15 Score=22.59 Aligned_cols=30 Identities=17% Similarity=0.372 Sum_probs=20.1
Q ss_pred CCCHHHHHHHHHHhhcc-----cCCCCcCHHHHhh
Q psy12170 39 KFTRQEIRVMYRGFKQF-----LTFIGLNHPTLKD 68 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkke-----cPsG~Lt~~EFk~ 68 (73)
++|.++|+.+|++|..- +-.|-++.++...
T Consensus 1 ~it~e~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~ 35 (184)
T PF05193_consen 1 NITLEDLRAFYKKFYRPSNMTLVIVGDIDPDELEK 35 (184)
T ss_dssp C--HHHHHHHHHHHSSGGGEEEEEEESSGHHHHHH
T ss_pred CCCHHHHHHHHHHhcCccceEEEEEcCccHHHHHH
Confidence 46788888898888772 3667777766654
No 78
>KOG0488|consensus
Probab=46.69 E-value=12 Score=28.16 Aligned_cols=22 Identities=14% Similarity=0.261 Sum_probs=19.1
Q ss_pred HHhcCCCCHHHHHHHHHHhhcc
Q psy12170 34 LLKQTKFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~Fkke 55 (73)
-...|-||..+|+.|+|+|.+.
T Consensus 173 RksRTaFT~~Ql~~LEkrF~~Q 194 (309)
T KOG0488|consen 173 RKSRTAFSDHQLFELEKRFEKQ 194 (309)
T ss_pred ccchhhhhHHHHHHHHHHHHHh
Confidence 4557999999999999999873
No 79
>KOG0030|consensus
Probab=46.49 E-value=49 Score=23.05 Aligned_cols=49 Identities=20% Similarity=0.171 Sum_probs=36.4
Q ss_pred cccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccCCCCcCHHHH-hhhhcC
Q psy12170 22 HVQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLTFIGLNHPTL-KDNTNG 72 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecPsG~Lt~~EF-k~i~~~ 72 (73)
+++.+.-.+|..+...+ ++++.|+..|-.+ ++..+|.|+-+.| +.|.+|
T Consensus 101 g~G~i~~aeLRhvLttlGekl~eeEVe~Llag--~eD~nG~i~YE~fVk~i~~g 152 (152)
T KOG0030|consen 101 GNGTIMGAELRHVLTTLGEKLTEEEVEELLAG--QEDSNGCINYEAFVKHIMSG 152 (152)
T ss_pred CCcceeHHHHHHHHHHHHhhccHHHHHHHHcc--ccccCCcCcHHHHHHHHhcC
Confidence 34456666777766555 7999999998776 4788999999999 556554
No 80
>KOG0037|consensus
Probab=45.43 E-value=27 Score=25.61 Aligned_cols=26 Identities=12% Similarity=0.134 Sum_probs=15.6
Q ss_pred HHHHHHHhhcc--cCCCCcCHHHHhhhh
Q psy12170 45 IRVMYRGFKQF--LTFIGLNHPTLKDNT 70 (73)
Q Consensus 45 Ik~lYk~Fkke--cPsG~Lt~~EFk~i~ 70 (73)
|++|-.-|++- +.||+|+..||++..
T Consensus 123 i~~Wr~vF~~~D~D~SG~I~~sEL~~Al 150 (221)
T KOG0037|consen 123 INQWRNVFRTYDRDRSGTIDSSELRQAL 150 (221)
T ss_pred HHHHHHHHHhcccCCCCcccHHHHHHHH
Confidence 44555555552 477777777776543
No 81
>KOG4223|consensus
Probab=44.79 E-value=26 Score=27.03 Aligned_cols=30 Identities=13% Similarity=-0.150 Sum_probs=24.6
Q ss_pred CCHHHHHHHHHHhhcc--cCCCCcCHHHHhhh
Q psy12170 40 FTRQEIRVMYRGFKQF--LTFIGLNHPTLKDN 69 (73)
Q Consensus 40 Fs~~EIk~lYk~Fkke--cPsG~Lt~~EFk~i 69 (73)
.=++-|.++-++|+.- ..+|.||++||-..
T Consensus 157 ~~~km~~rDe~rFk~AD~d~dg~lt~EEF~aF 188 (325)
T KOG4223|consen 157 EYKKMIARDEERFKAADQDGDGSLTLEEFTAF 188 (325)
T ss_pred HHHHHHHHHHHHHhhcccCCCCcccHHHHHhc
Confidence 3467788999999994 58899999999754
No 82
>KOG0491|consensus
Probab=44.41 E-value=27 Score=25.09 Aligned_cols=28 Identities=4% Similarity=0.166 Sum_probs=23.8
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHHh
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRGF 52 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~F 52 (73)
-|+-.+..+|....+++..+++-||..=
T Consensus 124 YLS~~e~~ELan~L~LS~~QVKTWFQNr 151 (194)
T KOG0491|consen 124 YLSTPERQELANALSLSETQVKTWFQNR 151 (194)
T ss_pred hcccHHHHHHHHHhhhhHHHHHHHHHHH
Confidence 4677788899999999999999998653
No 83
>PF05256 UPF0223: Uncharacterised protein family (UPF0223); InterPro: IPR007920 This family of proteins is functionally uncharacterised.; PDB: 2OY9_B.
Probab=43.45 E-value=12 Score=23.76 Aligned_cols=23 Identities=26% Similarity=0.408 Sum_probs=16.5
Q ss_pred cCCCCHHHHHHHHHHhhcccCCC
Q psy12170 37 QTKFTRQEIRVMYRGFKQFLTFI 59 (73)
Q Consensus 37 ~T~Fs~~EIk~lYk~FkkecPsG 59 (73)
..+.+++++..-|++||+--|+-
T Consensus 31 E~gV~r~~ll~~Y~~FK~VVpsK 53 (88)
T PF05256_consen 31 EKGVDREELLDAYRRFKKVVPSK 53 (88)
T ss_dssp TT-EEHHHHHHHHHHHHHH---H
T ss_pred HccCcHHHHHHHHHHHHHHcccH
Confidence 36789999999999999988873
No 84
>PF15658 Latrotoxin_C: Latrotoxin C-terminal domain
Probab=42.98 E-value=29 Score=23.47 Aligned_cols=27 Identities=11% Similarity=0.048 Sum_probs=21.4
Q ss_pred HHHHHHHHHHhhcc-cCCCCcCHHHHhh
Q psy12170 42 RQEIRVMYRGFKQF-LTFIGLNHPTLKD 68 (73)
Q Consensus 42 ~~EIk~lYk~Fkke-cPsG~Lt~~EFk~ 68 (73)
-.||.++-.-|.++ ||.|.|+...|..
T Consensus 85 ~neIs~~L~Sy~e~a~p~~~Ls~kq~~k 112 (127)
T PF15658_consen 85 FNEISKILCSYAEKACPGSKLSPKQFDK 112 (127)
T ss_pred chHHHHHHHHHHHHhCcccCCCHHHHHH
Confidence 34788888888775 9999999987754
No 85
>KOG0028|consensus
Probab=42.92 E-value=97 Score=21.98 Aligned_cols=27 Identities=7% Similarity=0.011 Sum_probs=14.0
Q ss_pred HHHHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 42 RQEIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 42 ~~EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
++||.++-..+-+++ +|.|+-++|..+
T Consensus 68 k~ei~kll~d~dk~~-~g~i~fe~f~~~ 94 (172)
T KOG0028|consen 68 KEEILKLLADVDKEG-SGKITFEDFRRV 94 (172)
T ss_pred hHHHHHHHHhhhhcc-CceechHHHHHH
Confidence 555555555554433 355555555544
No 86
>PF05263 DUF722: Protein of unknown function (DUF722); InterPro: IPR007927 This entry is represented by Bacteriophage bIL285, Orf38. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=42.44 E-value=26 Score=23.44 Aligned_cols=24 Identities=13% Similarity=0.149 Sum_probs=20.1
Q ss_pred HHHHhcCCCCHHHHHHHHHHhhcc
Q psy12170 32 EDLLKQTKFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 32 ~~L~~~T~Fs~~EIk~lYk~Fkke 55 (73)
..+....+.++..+++||..||..
T Consensus 103 ~~IA~~l~i~erta~r~~~~fK~~ 126 (130)
T PF05263_consen 103 YQIAQKLHISERTARRWRDRFKND 126 (130)
T ss_pred HHHHHHhCccHHHHHHHHHHHHHH
Confidence 346667889999999999999974
No 87
>PF03979 Sigma70_r1_1: Sigma-70 factor, region 1.1; InterPro: IPR007127 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. This entry represents Region 1.1 which modulates DNA binding by region 2 and 4 when sigma is unbound by the core RNA polymerase [, ]. Region 1.1 is also involved in promoter binding.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2K6X_A.
Probab=40.81 E-value=27 Score=20.90 Aligned_cols=38 Identities=18% Similarity=0.283 Sum_probs=27.4
Q ss_pred hcccccccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 17 IESSKHVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 17 ~e~~~~~skls~e~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
|..|+.++.++-.+|......-.++..+|..+|..|..
T Consensus 13 i~~gK~~G~lT~~eI~~~L~~~~~~~e~id~i~~~L~~ 50 (82)
T PF03979_consen 13 IEKGKKKGYLTYDEINDALPEDDLDPEQIDEIYDTLED 50 (82)
T ss_dssp HHHHHHHSS-BHHHHHHH-S-S---HHHHHHHHHHHHT
T ss_pred HHHHhhcCcCCHHHHHHHcCccCCCHHHHHHHHHHHHH
Confidence 46677778899999998777778999999999999987
No 88
>smart00411 BHL bacterial (prokaryotic) histone like domain.
Probab=40.65 E-value=47 Score=19.50 Aligned_cols=26 Identities=4% Similarity=0.337 Sum_probs=22.6
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhcc
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~Fkke 55 (73)
-++.+.+.|++++.++......|...
T Consensus 6 li~~ia~~~~~~~~~v~~vl~~l~~~ 31 (90)
T smart00411 6 LIDAIAEKAGLSKKDAKAAVDAFLEI 31 (90)
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHH
Confidence 36679999999999999999999763
No 89
>cd00591 HU_IHF Integration host factor (IHF) and HU are small heterodimeric members of the DNABII protein family that bind and bend DNA, functioning as architectural factors in many cellular processes including transcription, site-specific recombination, and higher-order nucleoprotein complex assembly. The dimer subunits associate to form a compact globular core from which two beta ribbon arms (one from each subunit) protrude. The beta arms track and bind the DNA minor groove. Despite sequence and structural similarity, IHF and HU can be distinguished by their different DNA substrate preferences.
Probab=40.06 E-value=47 Score=19.32 Aligned_cols=26 Identities=12% Similarity=0.286 Sum_probs=22.3
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
+-++.|.+.|++++.+++.+...|..
T Consensus 4 ~l~~~ia~~~~~~~~~v~~vl~~~~~ 29 (87)
T cd00591 4 ELIEAIAEKTGLSKKDAEAAVDAFLD 29 (87)
T ss_pred HHHHHHHHHhCcCHHHHHHHHHHHHH
Confidence 34667999999999999999999876
No 90
>PRK13979 DNA topoisomerase IV subunit A; Provisional
Probab=39.10 E-value=21 Score=30.90 Aligned_cols=36 Identities=17% Similarity=0.214 Sum_probs=29.0
Q ss_pred HHhcCCCCHHHHHHHHHHhhcccCCCCc--CHHHHhhhhc
Q psy12170 34 LLKQTKFTRQEIRVMYRGFKQFLTFIGL--NHPTLKDNTN 71 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~FkkecPsG~L--t~~EFk~i~~ 71 (73)
+...-+++..||..|++++ ++|+|.+ ..+.+.++|.
T Consensus 202 ~i~~~~~~~~~l~~~~~gp--DfptGg~i~~~~~i~~~y~ 239 (957)
T PRK13979 202 YIDNNEITTKELMNYIKGP--DLPTGGILIGKKSLLSAYE 239 (957)
T ss_pred HhcCCCCCHHHHhhhccCC--CCCCCeEEECchhHHHHhh
Confidence 4455689999999999999 8999966 5677777775
No 91
>PF08707 PriCT_2: Primase C terminal 2 (PriCT-2) ; InterPro: IPR014819 This alpha helical domain is found at the C-terminal of primases. ; GO: 0016817 hydrolase activity, acting on acid anhydrides
Probab=38.89 E-value=62 Score=18.99 Aligned_cols=28 Identities=25% Similarity=0.278 Sum_probs=21.5
Q ss_pred cCCCCHHHHHHHHHHhhcccCCCCcCHHHHh
Q psy12170 37 QTKFTRQEIRVMYRGFKQFLTFIGLNHPTLK 67 (73)
Q Consensus 37 ~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk 67 (73)
..+|+..++...++.| .+.|.++...|.
T Consensus 48 ~~ky~~~e~~~~W~s~---~~~~~it~~Tl~ 75 (78)
T PF08707_consen 48 SPKYDEEECERKWRSF---DRPGGITIGTLF 75 (78)
T ss_pred CCCCCHHHHHHHHHhC---CCCCCccHHHHH
Confidence 3589999999999999 444557776664
No 92
>PHA02970 hypothetical protein; Provisional
Probab=38.75 E-value=52 Score=21.79 Aligned_cols=32 Identities=22% Similarity=0.319 Sum_probs=26.2
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHHhhccc
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRGFKQFL 56 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~Fkkec 56 (73)
+.+..-+.+++.---+...||..+|-+|.+..
T Consensus 22 ~C~~dl~~D~sey~~YN~~EIn~lY~~FLk~h 53 (115)
T PHA02970 22 KCSSDLVHDLSEYEVYNPKEINSLYISFLKHH 53 (115)
T ss_pred cchHHHHccHHHHhcCCHHHHHHHHHHHHHHc
Confidence 45556666788777899999999999999853
No 93
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=38.74 E-value=32 Score=18.85 Aligned_cols=21 Identities=19% Similarity=0.240 Sum_probs=18.0
Q ss_pred CCCHHHHHHHHHHhhccc-CCC
Q psy12170 39 KFTRQEIRVMYRGFKQFL-TFI 59 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkkec-PsG 59 (73)
+.+.++|++.|+++.+.+ |+.
T Consensus 11 ~~~~~eik~~y~~l~~~~HPD~ 32 (64)
T PF00226_consen 11 DASDEEIKKAYRRLSKQYHPDK 32 (64)
T ss_dssp TSSHHHHHHHHHHHHHHTSTTT
T ss_pred CCCHHHHHHHHHhhhhcccccc
Confidence 678999999999999975 654
No 94
>PF06902 Fer4_19: Divergent 4Fe-4S mono-cluster; InterPro: IPR010693 This entry represents bacterial ferredoxins such Ferredoxin-1, -2 and -soy from Streptomyces griseolus and Ferredoxin fas2 from Rhodococcus fascians, plus several bacterial hypothetical proteins that contain three highly conserved cysteine residues. These ferredoxins each bind a 3Fe-4S cluster. Ferredoxin-soy (SoyB) act as electron transport protein for the cytochrome P450-SOY system []. Ferredoxin-1 (SuaB) and Ferredoxin-2 (SubB) act as electron transport proteins for the herbicide-metabolising cytochrome P-450 SU1 and SU2 systems, respectively [, ]. Ferredoxin-fas2 also plays a role in electrontransfer, the fas operon encoding genes involved in cytokinin production and in host plant fasciation (leafy gall).
Probab=37.76 E-value=13 Score=21.83 Aligned_cols=10 Identities=10% Similarity=-0.257 Sum_probs=7.9
Q ss_pred cccCCCCcCH
Q psy12170 54 QFLTFIGLNH 63 (73)
Q Consensus 54 kecPsG~Lt~ 63 (73)
..||||.|+.
T Consensus 52 ~~CPSGAL~~ 61 (64)
T PF06902_consen 52 DRCPSGALSY 61 (64)
T ss_pred HcCCccCcEE
Confidence 3799999864
No 95
>PF13758 Prefoldin_3: Prefoldin subunit
Probab=37.02 E-value=74 Score=20.58 Aligned_cols=38 Identities=18% Similarity=0.233 Sum_probs=26.2
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+++..|-....-+.++|.++.++|.- -.++..|++.|.
T Consensus 19 EEi~~l~~~~~~~~e~l~~i~r~f~g----~lv~~kEi~~il 56 (99)
T PF13758_consen 19 EEIEALPEDDDATREDLLRIRRDFGG----SLVTEKEIKEIL 56 (99)
T ss_pred HHHHhccccCCCCHHHHHHHHHhcCc----ccccHHHHHHHh
Confidence 35555544457788889999999852 267777777664
No 96
>PF00220 Hormone_4: Neurohypophysial hormones, N-terminal Domain; InterPro: IPR022423 Oxytocin (or ocytocin) and vasopressin [] are small (nine amino acid residues), structurally and functionally related neurohypophysial peptide hormones. Oxytocin causes contraction of the smooth muscle of the uterus and of the mammary gland while vasopressin has a direct antidiuretic action on the kidney and also causes vasoconstriction of the peripheral vessels. Like the majority of active peptides, both hormones are synthesized as larger protein precursors that are enzymatically converted to their mature forms. Peptides belonging to this family are also found in birds, fish, reptiles and amphibians (mesotocin, isotocin, valitocin, glumitocin, aspargtocin, vasotocin, seritocin, asvatocin, phasvatocin), in worms (annetocin), octopi (cephalotocin), locust (locupressin or neuropeptide F1/F2) and in molluscs (conopressins G and S) []. The pattern developed to detect this category of peptides spans their entire sequence and includes four invariant amino acid residues. .; GO: 0005185 neurohypophyseal hormone activity, 0005576 extracellular region
Probab=36.37 E-value=17 Score=14.18 Aligned_cols=8 Identities=0% Similarity=-0.143 Sum_probs=5.4
Q ss_pred hhcccCCC
Q psy12170 52 FKQFLTFI 59 (73)
Q Consensus 52 FkkecPsG 59 (73)
|.+.||.|
T Consensus 2 ~i~nCP~G 9 (9)
T PF00220_consen 2 YIRNCPIG 9 (9)
T ss_pred ccccCCCC
Confidence 45678876
No 97
>PF08640 U3_assoc_6: U3 small nucleolar RNA-associated protein 6; InterPro: IPR013949 This entry represents U3 nucleolar RNA-associated proteins which are involved in nucleolar processing of pre-18S ribosomal RNA [].
Probab=36.24 E-value=66 Score=19.68 Aligned_cols=23 Identities=39% Similarity=0.569 Sum_probs=18.9
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHH
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~ 51 (73)
.++++|.+.-=||++||+.+-++
T Consensus 6 pele~l~~~~lFt~~EI~~Ivkk 28 (83)
T PF08640_consen 6 PELEDLERKGLFTKEEIREIVKK 28 (83)
T ss_pred HHHHHHHHhCCCCHHHHHHHHHH
Confidence 36788888888999999988654
No 98
>PRK02628 nadE NAD synthetase; Reviewed
Probab=36.24 E-value=57 Score=26.75 Aligned_cols=30 Identities=17% Similarity=0.259 Sum_probs=21.8
Q ss_pred CCCHHHHHHHHhcC------------------CCCHHHHHHHHHHhhc
Q psy12170 25 KYRPIALEDLLKQT------------------KFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 25 kls~e~l~~L~~~T------------------~Fs~~EIk~lYk~Fkk 54 (73)
.++|+++-.+.... .|++++|++|.+.|.+
T Consensus 581 ~~~~~~i~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~f~~ 628 (679)
T PRK02628 581 GFRPSKIAFLAWHAWKDAERGAWPGFPEDKRPAYDLATIKKWLEVFLR 628 (679)
T ss_pred CCCHHHHHHHHHHHhhccccccCCCCcchhcccCCHHHHHHHHHHHcc
Confidence 57777776654331 6999999999977664
No 99
>PF00216 Bac_DNA_binding: Bacterial DNA-binding protein; InterPro: IPR000119 Bacteria synthesise a set of small, usually basic proteins of about 90 residues that bind DNA and are known as histone-like proteins [, ]. Examples include the HU protein in Escherichia coli is a dimer of closely related alpha and beta chains and in other bacteria can be a dimer of identical chains. HU-type proteins have been found in a variety of eubacteria, cyanobacteria and archaebacteria, and are also encoded in the chloroplast genome of some algae []. The integration host factor (IHF), a dimer of closely related chains which seem to function in genetic recombination as well as in translational and transcriptional control [] is found in enterobacteria and viral proteins include the African Swine fever virus protein A104R (or LMW5-AR) []. The exact function of these proteins is not yet clear but they are capable of wrapping DNA and stabilising it from denaturation under extreme environmental conditions. The structure is known for one of these proteins []. The protein exists as a dimer and two "beta-arms" function as the non-specific binding site for bacterial DNA. ; GO: 0003677 DNA binding; PDB: 3C4I_B 2O97_A 1MUL_A 1P78_A 1P51_C 1P71_B 2HT0_A 1OWG_A 2IIF_A 1OUZ_A ....
Probab=36.08 E-value=60 Score=18.92 Aligned_cols=26 Identities=4% Similarity=0.270 Sum_probs=22.4
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
+-++.+...+++++.++......|..
T Consensus 5 eli~~ia~~~~~s~~~v~~vl~~~~~ 30 (90)
T PF00216_consen 5 ELIKRIAEKTGLSKKDVEAVLDALFD 30 (90)
T ss_dssp HHHHHHHHHHTSSHHHHHHHHHHHHH
T ss_pred HHHHHHHHhcCCCHHHHHHHHHHHHH
Confidence 44677999999999999999998876
No 100
>KOG0844|consensus
Probab=36.07 E-value=28 Score=27.33 Aligned_cols=23 Identities=26% Similarity=0.257 Sum_probs=19.7
Q ss_pred HHhcCCCCHHHHHHHHHHhhccc
Q psy12170 34 LLKQTKFTRQEIRVMYRGFKQFL 56 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~Fkkec 56 (73)
=+-.|.||+++|.+|-|.|.++.
T Consensus 182 RRYRTAFTReQIaRLEKEFyrEN 204 (408)
T KOG0844|consen 182 RRYRTAFTREQIARLEKEFYREN 204 (408)
T ss_pred HHHHhhhhHHHHHHHHHHHHHhc
Confidence 34568999999999999999863
No 101
>KOG0775|consensus
Probab=35.81 E-value=37 Score=26.00 Aligned_cols=34 Identities=15% Similarity=0.154 Sum_probs=29.2
Q ss_pred cccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcc
Q psy12170 22 HVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T~Fs~~EIk~lYk~Fkke 55 (73)
.+.-.+|++-..|.+.|+++-.++--|||.=.+.
T Consensus 197 ~~~YPsp~eKReLA~aTgLt~tQVsNWFKNRRQR 230 (304)
T KOG0775|consen 197 QNPYPSPREKRELAEATGLTITQVSNWFKNRRQR 230 (304)
T ss_pred cCCCCChHHHHHHHHHhCCchhhhhhhhhhhhhh
Confidence 4456788899999999999999999999987663
No 102
>PRK05658 RNA polymerase sigma factor RpoD; Validated
Probab=35.80 E-value=30 Score=27.92 Aligned_cols=41 Identities=17% Similarity=0.266 Sum_probs=28.1
Q ss_pred HHHHHhcCCCCHHHHHHHHHHh--------------hccc-CCCCcCHHHHhhhhcC
Q psy12170 31 LEDLLKQTKFTRQEIRVMYRGF--------------KQFL-TFIGLNHPTLKDNTNG 72 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk~F--------------kkec-PsG~Lt~~EFk~i~~~ 72 (73)
+..+...-+|+.+.|.++...+ ++ | ....+.+.+|...|+|
T Consensus 259 i~~~l~~lkL~~k~id~Lv~~lr~~~~rIr~~Er~i~~-~~~~~~m~R~~Fi~~f~g 314 (619)
T PRK05658 259 LKEELKSLRLTSKQIDELVEQLRDINKRVRGQERELLR-LVERLKMPRKDFLKLFQG 314 (619)
T ss_pred HHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHcCCCHHHHHHHccC
Confidence 3445556677777777555444 33 5 5678889999999976
No 103
>PRK10664 transcriptional regulator HU subunit beta; Provisional
Probab=35.37 E-value=62 Score=19.78 Aligned_cols=26 Identities=0% Similarity=0.049 Sum_probs=22.9
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
+-++.|...+++|+.++...+..|..
T Consensus 5 eli~~ia~~~~~s~~~~~~~v~~~~~ 30 (90)
T PRK10664 5 QLIDKIAAGADISKAAAGRALDAIIA 30 (90)
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 34778999999999999999999976
No 104
>KOG0037|consensus
Probab=35.34 E-value=1e+02 Score=22.61 Aligned_cols=46 Identities=15% Similarity=0.118 Sum_probs=37.5
Q ss_pred CCCHHHHHH-HH--hcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 25 KYRPIALED-LL--KQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 25 kls~e~l~~-L~--~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
++.-++|+. |. ...+|+-+-++-|--=|.++ -+|+|...||+..|+
T Consensus 73 ~i~~~eLq~aLsn~~~~~Fs~~TcrlmI~mfd~~-~~G~i~f~EF~~Lw~ 121 (221)
T KOG0037|consen 73 RILAKELQQALSNGTWSPFSIETCRLMISMFDRD-NSGTIGFKEFKALWK 121 (221)
T ss_pred cccHHHHHHHhhcCCCCCCCHHHHHHHHHHhcCC-CCCccCHHHHHHHHH
Confidence 566777776 44 56699999999998888754 679999999999986
No 105
>PHA03102 Small T antigen; Reviewed
Probab=34.88 E-value=63 Score=22.09 Aligned_cols=36 Identities=14% Similarity=-0.093 Sum_probs=25.9
Q ss_pred HHhcCCC--CHHHHHHHHHHhhcc-cCCCCcCHHHHhhh
Q psy12170 34 LLKQTKF--TRQEIRVMYRGFKQF-LTFIGLNHPTLKDN 69 (73)
Q Consensus 34 L~~~T~F--s~~EIk~lYk~Fkke-cPsG~Lt~~EFk~i 69 (73)
|.-...- |..||++-||+..+. .|+-.-+.+.|+.|
T Consensus 11 LGl~~~A~~s~~eIKkAYr~la~~~HPDkgg~~e~~k~i 49 (153)
T PHA03102 11 LGLPRSAWGNLPLMRKAYLRKCLEFHPDKGGDEEKMKEL 49 (153)
T ss_pred cCCCCCCCCCHHHHHHHHHHHHHHHCcCCCchhHHHHHH
Confidence 3333466 889999999998664 67776667777665
No 106
>PF08763 Ca_chan_IQ: Voltage gated calcium channel IQ domain; InterPro: IPR014873 Ca2+ ions are unique in that they not only carry charge but they are also the most widely used of diffusible second messengers. Voltage-dependent Ca2+ channels (VDCC) are a family of molecules that allow cells to couple electrical activity to intracellular Ca2+ signalling. The opening and closing of these channels by depolarizing stimuli, such as action potentials, allows Ca2+ ions to enter neurons down a steep electrochemical gradient, producing transient intracellular Ca2+ signals. Many of the processes that occur in neurons, including transmitter release, gene transcription and metabolism are controlled by Ca2+ influx occurring simultaneously at different cellular locales. The pore is formed by the alpha-1 subunit which incorporates the conduction pore, the voltage sensor and gating apparatus, and the known sites of channel regulation by second messengers, drugs, and toxins []. The activity of this pore is modulated by 4 tightly-coupled subunits: an intracellular beta subunit; a transmembrane gamma subunit; and a disulphide-linked complex of alpha-2 and delta subunits, which are proteolytically cleaved from the same gene product. Properties of the protein including gating voltage-dependence, G protein modulation and kinase susceptibility can be influenced by these subunits. Voltage-gated calcium channels are classified as T, L, N, P, Q and R, and are distinguished by their sensitivity to pharmacological blocks, single-channel conductance kinetics, and voltage-dependence. On the basis of their voltage activation properties, the voltage-gated calcium classes can be further divided into two broad groups: the low (T-type) and high (L, N, P, Q and R-type) threshold-activated channels. The voltage-gated calcium channel alpha 1 subunit contains an IQ domain, named for its isoleucine-glutamine (IQ) motif, which interacts with hydrophobic pockets of Ca2+/calmodulin []. The interaction regulates two self-regulatory calcium dependent feedback mechanisms, calcium dependent inactivation (CDI), and calcium-dependent facilitation (CDF). ; PDB: 3OXQ_F 2F3Z_B 3G43_E 2F3Y_B 2BE6_D 3DVM_B 3BXK_D 2VAY_B 3DVK_B 3BXL_B ....
Probab=34.28 E-value=26 Score=18.78 Aligned_cols=10 Identities=40% Similarity=0.763 Sum_probs=8.7
Q ss_pred HHHHHHHhhc
Q psy12170 45 IRVMYRGFKQ 54 (73)
Q Consensus 45 Ik~lYk~Fkk 54 (73)
|+.+||.|++
T Consensus 15 I~dyfr~~K~ 24 (35)
T PF08763_consen 15 IQDYFRQFKK 24 (35)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHHHHH
Confidence 7889999986
No 107
>TIGR02675 tape_meas_nterm tape measure domain. Proteins containing this domain are strictly bacterial, including bacteriophage and prophage regions of bacterial genomes. Most members are 800 to 1800 amino acids long, making them among the longest predicted proteins of their respective phage genomes, where they are encoded in tail protein regions. This roughly 80-residue domain described here usually begins between residue 100 and 250. Many members are known or predicted to act as phage tail tape measure proteins, a minor tail component that regulates tail length.
Probab=34.28 E-value=47 Score=19.72 Aligned_cols=36 Identities=11% Similarity=-0.039 Sum_probs=24.6
Q ss_pred HhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 35 LKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 35 ~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
....+.+.+++...-..|-+--.+|.++-++|..+.
T Consensus 5 ~~~~G~s~e~~~~~~~ql~Q~~~~Gkv~~ee~n~~~ 40 (75)
T TIGR02675 5 GAASGASAEEADGALIQLSQMLASGKLRGEEINSLL 40 (75)
T ss_pred HHHhCCCHHHHHHHHHHHHHHHHcCcccHHHHHHHH
Confidence 344566777777777777777777777777776654
No 108
>TIGR00987 himA integration host factor, alpha subunit. This protein forms a site-specific DNA-binding heterodimer with the integration host factor beta subunit. It is closely related to the DNA-binding protein HU.
Probab=33.18 E-value=68 Score=19.51 Aligned_cols=25 Identities=16% Similarity=0.244 Sum_probs=22.2
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
-++.|...+++++.+++.++..|..
T Consensus 7 li~~ia~~~~~s~~~v~~vv~~~~~ 31 (96)
T TIGR00987 7 MSEYLFDELGLSKREAKELVELFFE 31 (96)
T ss_pred HHHHHHHHhCcCHHHHHHHHHHHHH
Confidence 3677999999999999999999976
No 109
>PF02885 Glycos_trans_3N: Glycosyl transferase family, helical bundle domain Prosite entry for Thymidine and pyrimidine-nucleoside phosphorylases; InterPro: IPR017459 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. The glycosyl transferase family includes anthranilate phosphoribosyltransferase (TrpD, 2.4.2.18 from EC) and thymidine phosphorylase (2.4.2.2 from EC). All these proteins can transfer a phosphorylated ribose substrate. Thymidine phosphorylase (2.4.2.2 from EC) catalyses the reversible phosphorolysis of thymidine, deoxyuridine and their analogues to their respective bases and 2-deoxyribose 1-phosphate. This enzyme regulates the availability of thymidine and is therefore essential to nucleic acid metabolism. This N-terminal domain is found in various family 3 glycosyl transferases, including anthranilate phosphoribosyltransferase (TrpD, 2.4.2.18 from EC) and thymidine phosphorylase (2.4.2.2 from EC). All these proteins can transfer a phosphorylated ribose substrate. Thymidine phosphorylase catalyses the reversible phosphorolysis of thymidine, deoxyuridine and their analogues to their respective bases and 2-deoxyribose 1-phosphate. This enzyme regulates the availability of thymidine and is therefore essential to nucleic acid metabolism.; PDB: 2DSJ_B 2ELC_B 2BPQ_A 1ZVW_B 3QR9_B 1V8G_B 2WK5_C 2J0F_C 2WK6_B 1UOU_A ....
Probab=33.12 E-value=97 Score=17.58 Aligned_cols=32 Identities=13% Similarity=0.258 Sum_probs=19.8
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHH
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPT 65 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~E 65 (73)
-|+.|...-++|.+|++.+++.... |.++..+
T Consensus 5 ~l~~l~~g~~Ls~~e~~~~~~~i~~----g~~s~~q 36 (66)
T PF02885_consen 5 ILKKLRDGEDLSREEAKAAFDAILD----GEVSDAQ 36 (66)
T ss_dssp HHHHHHTT----HHHHHHHHHHHHT----TSS-HHH
T ss_pred HHHHHHcCCCCCHHHHHHHHHHHHc----CCCCHHH
Confidence 4667777888999999999888874 5555444
No 110
>PF14516 AAA_35: AAA-like domain
Probab=32.92 E-value=45 Score=24.66 Aligned_cols=34 Identities=21% Similarity=0.229 Sum_probs=22.3
Q ss_pred cCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhcC
Q psy12170 37 QTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTNG 72 (73)
Q Consensus 37 ~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~~ 72 (73)
-..||.+||+.|.+++......+. .+.+...+.|
T Consensus 198 L~~Ft~~ev~~L~~~~~~~~~~~~--~~~l~~~tgG 231 (331)
T PF14516_consen 198 LPDFTPEEVQELAQRYGLEFSQEQ--LEQLMDWTGG 231 (331)
T ss_pred CCCCCHHHHHHHHHhhhccCCHHH--HHHHHHHHCC
Confidence 458999999999998865533333 4445555554
No 111
>COG2944 Predicted transcriptional regulator [Transcription]
Probab=32.25 E-value=87 Score=20.35 Aligned_cols=24 Identities=4% Similarity=0.177 Sum_probs=16.0
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHH
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVM 48 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~l 48 (73)
.+++.+|..|++.+++|..+..+.
T Consensus 43 ~ls~~eIk~iRe~~~lSQ~vFA~~ 66 (104)
T COG2944 43 TLSPTEIKAIREKLGLSQPVFARY 66 (104)
T ss_pred CCCHHHHHHHHHHhCCCHHHHHHH
Confidence 477777777777777766665443
No 112
>COG4476 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.15 E-value=45 Score=21.34 Aligned_cols=23 Identities=17% Similarity=0.319 Sum_probs=20.0
Q ss_pred cCCCCHHHHHHHHHHhhcccCCC
Q psy12170 37 QTKFTRQEIRVMYRGFKQFLTFI 59 (73)
Q Consensus 37 ~T~Fs~~EIk~lYk~FkkecPsG 59 (73)
..+.++.++..=||+||+..|+.
T Consensus 31 E~gv~~~~ll~~Yr~FK~IVPsK 53 (90)
T COG4476 31 EKGVDAEDLLGSYRRFKEIVPSK 53 (90)
T ss_pred HccccHHHHHHHHHHHHHhcCch
Confidence 35789999999999999999875
No 113
>PF13551 HTH_29: Winged helix-turn helix
Probab=32.14 E-value=38 Score=20.03 Aligned_cols=25 Identities=4% Similarity=0.292 Sum_probs=21.5
Q ss_pred HHHHHhcCCCCHHHHHHHHHHhhcc
Q psy12170 31 LEDLLKQTKFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk~Fkke 55 (73)
+.++.+.++.++.-|.+|-++|...
T Consensus 15 ~~~ia~~lg~s~~Tv~r~~~~~~~~ 39 (112)
T PF13551_consen 15 IAEIARRLGISRRTVYRWLKRYREG 39 (112)
T ss_pred HHHHHHHHCcCHHHHHHHHHHHHcc
Confidence 5677888899999999999999874
No 114
>PF06648 DUF1160: Protein of unknown function (DUF1160); InterPro: IPR010594 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf75; it is a family of uncharacterised viral proteins.
Probab=31.57 E-value=87 Score=20.85 Aligned_cols=31 Identities=19% Similarity=0.268 Sum_probs=23.6
Q ss_pred CCCHHHHHHHHhcC---CCCHHHHHHHHHHhhcc
Q psy12170 25 KYRPIALEDLLKQT---KFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 25 kls~e~l~~L~~~T---~Fs~~EIk~lYk~Fkke 55 (73)
++..+.+-.|.... .+|+.+|..+|.+|+..
T Consensus 50 eid~e~~y~l~~~~d~~~LT~~Qi~Yl~~~~~~n 83 (122)
T PF06648_consen 50 EIDVEDMYNLFGAVDGLKLTRSQIDYLYNRVYNN 83 (122)
T ss_pred CCCHHHHHHHHhcccHhhcCHHHHHHHHHHHHcc
Confidence 56677777766654 68999999999998864
No 115
>PF04695 Pex14_N: Peroxisomal membrane anchor protein (Pex14p) conserved region; InterPro: IPR006785 This conserved region defines a group of peroxisomal membrane anchor proteins which bind the PTS1 (peroxisomal targeting signal) receptor and are required for the import of PTS1-containing proteins into peroxisomes. Loss of functional Pex14p results in defects in both the PTS1 and PTS2-dependent import pathways. Deletion analysis of this conserved region implicates it in selective peroxisome degradation. In the majority of members this region is situated at the N terminus of the protein [, ].; GO: 0005777 peroxisome, 0016020 membrane; PDB: 2W85_A 2W84_A 3FF5_B.
Probab=31.45 E-value=79 Score=20.74 Aligned_cols=37 Identities=14% Similarity=-0.040 Sum_probs=20.6
Q ss_pred cccccccCCCHHHHHHHHhcCCCCHHHHHHHHHHhhcc
Q psy12170 18 ESSKHVQKYRPIALEDLLKQTKFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 18 e~~~~~skls~e~l~~L~~~T~Fs~~EIk~lYk~Fkke 55 (73)
..+.....-.-+.++-|. +-++|.+||+...++-...
T Consensus 14 ~~p~V~~sp~~~k~~FL~-sKGLt~~EI~~al~~a~~~ 50 (136)
T PF04695_consen 14 QDPKVRNSPLEKKIAFLE-SKGLTEEEIDEALGRAGSP 50 (136)
T ss_dssp CTTTCCCS-HHHHHHHHH-HCT--HHHHHHHHHHHT--
T ss_pred CCcccccCCHHHHHHHHH-cCCCCHHHHHHHHHhcCCc
Confidence 444443332233455566 4599999999999987654
No 116
>cd01104 HTH_MlrA-CarA Helix-Turn-Helix DNA binding domain of the transcription regulators MlrA and CarA. Helix-turn-helix (HTH) transcription regulator MlrA (merR-like regulator A), N-terminal domain. The MlrA protein, also known as YehV, has been shown to control cell-cell aggregation by co-regulating the expression of curli and extracellular matrix production in Escherichia coli and Salmonella typhimurium. Its close homolog, CarA from Myxococcus xanthus, is involved in activation of the carotenoid biosynthesis genes by light. These proteins belong to the MerR superfamily of transcription regulators that promote expression of several stress regulon genes by reconfiguring the spacer between the -35 and -10 promoter elements. Their conserved N-terminal domains contain predicted HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules. Many MlrA- and CarA-like proteins in this group appear to lack the long dimerization helix seen i
Probab=30.90 E-value=74 Score=17.44 Aligned_cols=36 Identities=8% Similarity=-0.021 Sum_probs=24.2
Q ss_pred HHHHhcCCCCHHHHHHHHHHhhccc----CCC--CcCHHHHh
Q psy12170 32 EDLLKQTKFTRQEIRVMYRGFKQFL----TFI--GLNHPTLK 67 (73)
Q Consensus 32 ~~L~~~T~Fs~~EIk~lYk~Fkkec----PsG--~Lt~~EFk 67 (73)
.++.+.++++..-|++|.+.+.... ++| ..+.+++.
T Consensus 4 ~eva~~~gvs~~tlr~w~~~~g~~~~~r~~~~~r~yt~~~v~ 45 (68)
T cd01104 4 GAVARLTGVSPDTLRAWERRYGLPAPQRTDGGHRLYSEADVA 45 (68)
T ss_pred HHHHHHHCcCHHHHHHHHHhCCCCCCCcCCCCCeecCHHHHH
Confidence 4667788999999999998754432 344 44555543
No 117
>PF08356 EF_assoc_2: EF hand associated; InterPro: IPR013567 This region predominantly appears near EF-hands (IPR002048 from INTERPRO) in GTP-binding proteins. It is found in all three eukaryotic kingdoms.
Probab=30.82 E-value=96 Score=19.48 Aligned_cols=31 Identities=10% Similarity=0.044 Sum_probs=20.5
Q ss_pred CCCCHHHHHHHHHHhhcccCCC----CcCHHHHhh
Q psy12170 38 TKFTRQEIRVMYRGFKQFLTFI----GLNHPTLKD 68 (73)
Q Consensus 38 T~Fs~~EIk~lYk~FkkecPsG----~Lt~~EFk~ 68 (73)
+.++.+||..+-+--.+.+|+| .||.+-|.-
T Consensus 2 ~pL~~~el~~ik~~v~~~~~~gv~~~GiT~~GFl~ 36 (89)
T PF08356_consen 2 KPLQPQELEDIKKVVRENIPDGVNDNGITLDGFLF 36 (89)
T ss_pred CCCCHHHHHHHHHHHHHHCCCCcCCCccchhhHHH
Confidence 3466777777766666667766 677776653
No 118
>PF02256 Fe_hyd_SSU: Iron hydrogenase small subunit; InterPro: IPR003149 Many microorganisms, such as methanogenic, acetogenic, nitrogen-fixing, photosynthetic, or sulphate-reducing bacteria, metabolise hydrogen. Hydrogen activation is mediated by a family of enzymes, termed hydrogenases, which either provide these organisms with reducing power from hydrogen oxidation, or act as electron sinks. There are two hydrogenases families that differ functionally from each other: NiFe hydrogenases tend to be more involved in hydrogen oxidation, while Iron-only FeFe (Fe only) hydrogenases in hydrogen production. Fe only hydrogenases (1.12.7.2 from EC) show a common core structure, which contains a moiety, deeply buried inside the protein, with an Fe-Fe dinuclear centre, nonproteic bridging, terminal CO and CN- ligands attached to each of the iron atoms, and a dithio moiety, which also bridges the two iron atoms and has been tentatively assigned as a di(thiomethyl)amine. This common core also harbours three [4Fe-4S] iron-sulphur clusters []. In FeFe hydrogenases, as in NiFe hydrogenases, the set of iron-sulphur clusters is dispersed regularly between the dinuclear Fe-Fe centre and the molecular surface. These clusters are distant by about 1.2 nm from each other but the [4Fe-4S] cluster closest to the dinuclear centre is covalently bound to one of the iron atoms though a thiolate bridging ligand. The moiety including the dinuclear centre, the thiolate bridging ligand, and the proximal [4Fe-4S] cluster is known as the H-cluster. A channel, lined with hydrophobic amino acid side chains, nearly connects the dinuclear centre and the molecular surface. Furthermore hydrogen-bonded water molecule sites have been identified at the interior and at the surface of the protein. The small subunit is comprised of alternating random coil and alpha helical structures that encompass the large subunit in a novel protein fold [].; PDB: 3LX4_A 1C4C_A 1FEH_A 3C8Y_A 1C4A_A 1HFE_S 1GX7_D 1E08_D.
Probab=30.50 E-value=46 Score=19.04 Aligned_cols=16 Identities=19% Similarity=0.318 Sum_probs=12.2
Q ss_pred CHHHHHHHHHHhhccc
Q psy12170 41 TRQEIRVMYRGFKQFL 56 (73)
Q Consensus 41 s~~EIk~lYk~Fkkec 56 (73)
.-.+|+.||+.|...+
T Consensus 28 eNp~v~~lY~~~lg~p 43 (60)
T PF02256_consen 28 ENPEVQELYKEFLGGP 43 (60)
T ss_dssp G-HHHHHHHHHTTSST
T ss_pred cCHHHHHHHHHHhCCC
Confidence 4468999999999743
No 119
>PF06401 Alpha-2-MRAP_C: Alpha-2-macroglobulin RAP, C-terminal domain ; InterPro: IPR010483 The alpha-2-macroglobulin receptor-associated protein (RAP) is a intracellular glycoprotein that binds to the 2-macroglobulin receptor and other members of the low density lipoprotein receptor family. The protein inhibits binding of all currently known ligands of these receptors []. Two different studies have provided conflicting domain boundaries.; GO: 0008201 heparin binding, 0050750 low-density lipoprotein particle receptor binding, 0005783 endoplasmic reticulum; PDB: 2FCW_A 2P03_A 2FTU_A 2P01_A.
Probab=30.46 E-value=43 Score=24.35 Aligned_cols=26 Identities=23% Similarity=0.425 Sum_probs=18.1
Q ss_pred HHHHHHHhcC---C-CCHHHHHHHHHHhhc
Q psy12170 29 IALEDLLKQT---K-FTRQEIRVMYRGFKQ 54 (73)
Q Consensus 29 e~l~~L~~~T---~-Fs~~EIk~lYk~Fkk 54 (73)
..|+.|..+. + ||.+|++.|++.|+.
T Consensus 6 ~kL~KLW~kA~~sGkFt~eEL~~Lk~Ef~h 35 (214)
T PF06401_consen 6 KKLNKLWHKAETSGKFTDEELDKLKEEFQH 35 (214)
T ss_dssp HHHHHHHHHHHHHS---CHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHcCCCCHHHHHHHHHHHHH
Confidence 4566665543 6 999999999999975
No 120
>PF13443 HTH_26: Cro/C1-type HTH DNA-binding domain; PDB: 3TYR_A 3TYS_A 3B7H_A.
Probab=30.34 E-value=34 Score=18.62 Aligned_cols=20 Identities=20% Similarity=0.391 Sum_probs=12.3
Q ss_pred HHHHHhcCCCCHHHHHHHHH
Q psy12170 31 LEDLLKQTKFTRQEIRVMYR 50 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk 50 (73)
..+|.+.+++++..|.+|++
T Consensus 13 ~~~La~~~gis~~tl~~~~~ 32 (63)
T PF13443_consen 13 QKDLARKTGISRSTLSRILN 32 (63)
T ss_dssp HHHHHHHHT--HHHHHHHHT
T ss_pred HHHHHHHHCcCHHHHHHHHh
Confidence 34566777777777777766
No 121
>KOG0046|consensus
Probab=30.26 E-value=62 Score=27.03 Aligned_cols=13 Identities=15% Similarity=0.158 Sum_probs=5.6
Q ss_pred ccCCCHHHHHHHH
Q psy12170 23 VQKYRPIALEDLL 35 (73)
Q Consensus 23 ~skls~e~l~~L~ 35 (73)
++.++.+++..|.
T Consensus 10 ~~~~tq~El~~l~ 22 (627)
T KOG0046|consen 10 QSQLTQEELRELK 22 (627)
T ss_pred cccccHHHHHHHH
Confidence 3344444444433
No 122
>PRK08402 replication factor A; Reviewed
Probab=29.98 E-value=1.3e+02 Score=23.21 Aligned_cols=38 Identities=16% Similarity=0.189 Sum_probs=29.1
Q ss_pred CCHHH-HHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHH
Q psy12170 26 YRPIA-LEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTL 66 (73)
Q Consensus 26 ls~e~-l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EF 66 (73)
|++++ ++.|.+.++.+++||.+-.+.-..++| ++.++-
T Consensus 4 ~~~~~i~e~~~~~~~vs~eei~~~l~~lv~~~~---~~e~~A 42 (355)
T PRK08402 4 LTKEEIIERIERKTGMSREEIEEKIREIMKEEG---ISEHAA 42 (355)
T ss_pred ccHHHHHHHHHHhhCCCHHHHHHHHHHHHHHcC---CCHHHH
Confidence 55666 556888889999999999988888765 666544
No 123
>KOG3956|consensus
Probab=29.89 E-value=81 Score=24.50 Aligned_cols=31 Identities=23% Similarity=0.478 Sum_probs=25.7
Q ss_pred cCCCHHHHHHHHhc---CCCCHHHHHHHHHHhhc
Q psy12170 24 QKYRPIALEDLLKQ---TKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 24 skls~e~l~~L~~~---T~Fs~~EIk~lYk~Fkk 54 (73)
+.+....++.|..+ .+||.+|+..|++.|+-
T Consensus 150 ~~f~D~~L~kLW~kA~~sgFs~eEL~~L~~Ef~h 183 (359)
T KOG3956|consen 150 GKFTDQNLQKLWSKAQNSGFSQEELNALHGEFKH 183 (359)
T ss_pred cccCchhHHHHHHHHhccCCCHHHHHHHHHHHHH
Confidence 47777888887654 58999999999999974
No 124
>PF04699 P16-Arc: ARP2/3 complex 16 kDa subunit (p16-Arc); InterPro: IPR006789 The Arp2/3 protein complex has been implicated in the control of actin polymerisation. The human complex consists of seven subunits which include the actin related proteins Arp2 and Arp3, and five others referred to as p41-Arc, p34-Arc, p21-Arc, p20-Arc, and p16-Arc. The precise function of p16-Arc is currently unknown. Its structure consists of a single domain containing a bundle of seven alpha helices [, ].; GO: 0030833 regulation of actin filament polymerization, 0005856 cytoskeleton; PDB: 3DWL_G 1TYQ_G 1U2V_G 2P9U_G 2P9L_G 1K8K_G 3DXM_G 2P9N_G 3DXK_G 2P9I_G ....
Probab=29.51 E-value=52 Score=22.54 Aligned_cols=21 Identities=14% Similarity=0.414 Sum_probs=14.1
Q ss_pred cCCCHHHHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 24 QKYRPIALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 24 skls~e~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
..|.+++++-|+ +.+||+|-.
T Consensus 96 ~~L~~~~~D~LM----------KYiYkg~~~ 116 (152)
T PF04699_consen 96 KSLDSDQQDILM----------KYIYKGMES 116 (152)
T ss_dssp CCS-HHHHHHHH----------HHHHHHTTS
T ss_pred HhCCHHHHhHHH----------HHHHHhccC
Confidence 356666666665 688999954
No 125
>COG5573 Predicted nucleic-acid-binding protein, contains PIN domain [General function prediction only]
Probab=29.44 E-value=88 Score=21.56 Aligned_cols=57 Identities=16% Similarity=0.116 Sum_probs=40.2
Q ss_pred hhhhhhccccccc-CCCHHHH----HHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhh
Q psy12170 12 VVAFEIESSKHVQ-KYRPIAL----EDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKD 68 (73)
Q Consensus 12 ~~~~~~e~~~~~s-kls~e~l----~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~ 68 (73)
+-+.+|++...++ -.+-+-| -.|.+++.+++..|+.+--.|.+.|.--.+|.++...
T Consensus 27 ~~a~~L~~a~~~~~VVs~QVl~Et~~vl~RK~~~s~~~i~~lie~~~~~~~Iv~~t~~~~~~ 88 (142)
T COG5573 27 DAAEVLEQALGHTYVVSVQVLNETCYVLKRKYGASEQLIQTLIEAFRRQCRIVHLTHEEVVQ 88 (142)
T ss_pred HHHHHHHHhcCceEEEehHHHHHHHHHHHHhcCCcHHHHHHHHHHHHhhceeecCCHHHHHH
Confidence 3445555554433 1233333 3489999999999999999999999777788877654
No 126
>PF05603 DUF775: Protein of unknown function (DUF775); InterPro: IPR008493 This family consists of several eukaryotic proteins of unknown function.
Probab=29.18 E-value=28 Score=24.59 Aligned_cols=15 Identities=7% Similarity=0.091 Sum_probs=12.2
Q ss_pred CHHHHHHHHHHhhcc
Q psy12170 41 TRQEIRVMYRGFKQF 55 (73)
Q Consensus 41 s~~EIk~lYk~Fkke 55 (73)
.-.-+++||++|.++
T Consensus 180 P~~~~~~W~~kFe~K 194 (202)
T PF05603_consen 180 PLSVFDKWWEKFERK 194 (202)
T ss_pred eHHHHHHHHHHHHHH
Confidence 445899999999874
No 127
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=29.17 E-value=82 Score=16.36 Aligned_cols=32 Identities=16% Similarity=0.138 Sum_probs=21.1
Q ss_pred CCCCHHHHHHHHHHhhccc-CCCC----cCHHHHhhh
Q psy12170 38 TKFTRQEIRVMYRGFKQFL-TFIG----LNHPTLKDN 69 (73)
Q Consensus 38 T~Fs~~EIk~lYk~Fkkec-PsG~----Lt~~EFk~i 69 (73)
-+.+.++|++-|++..+.+ |+-. ...+.|..+
T Consensus 10 ~~~~~~~ik~~y~~l~~~~HPD~~~~~~~~~~~~~~l 46 (55)
T cd06257 10 PDASDEEIKKAYRKLALKYHPDKNPDDPEAEEKFKEI 46 (55)
T ss_pred CCCCHHHHHHHHHHHHHHHCcCCCCCcHHHHHHHHHH
Confidence 3568899999999998863 5433 234455443
No 128
>PF01527 HTH_Tnp_1: Transposase; InterPro: IPR002514 Transposase proteins are necessary for efficient DNA transposition. This family consists of various Escherichia coli insertion elements and other bacterial transposases some of which are members of the IS3 family. This region includes a helix-turn-helix motif (HTH) at the N terminus followed by a leucine zipper (LZ) motif. The LZ motif has been shown to mediate oligomerisation of the transposase components in IS911 []. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated; PDB: 2JN6_A 2RN7_A.
Probab=29.14 E-value=73 Score=17.83 Aligned_cols=25 Identities=0% Similarity=0.224 Sum_probs=20.9
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
.+.+++...+++..-|.+|-+.|..
T Consensus 25 sv~~va~~~gi~~~~l~~W~~~~~~ 49 (76)
T PF01527_consen 25 SVSEVAREYGISPSTLYNWRKQYRE 49 (76)
T ss_dssp HHHHHHHHHTS-HHHHHHHHHHHH-
T ss_pred ceEeeecccccccccccHHHHHHhc
Confidence 4678899999999999999999973
No 129
>COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]
Probab=29.03 E-value=60 Score=25.39 Aligned_cols=32 Identities=16% Similarity=0.160 Sum_probs=23.5
Q ss_pred CCCCHHHHHHHHHHhhcc-cCCCCc----CHHHHhhh
Q psy12170 38 TKFTRQEIRVMYRGFKQF-LTFIGL----NHPTLKDN 69 (73)
Q Consensus 38 T~Fs~~EIk~lYk~Fkke-cPsG~L----t~~EFk~i 69 (73)
-+-|++||++-||++.++ .|+-.= ..+.|+.|
T Consensus 14 k~As~~EIKkAYRkLA~kyHPD~n~g~~~AeeKFKEI 50 (371)
T COG0484 14 KDASEEEIKKAYRKLAKKYHPDRNPGDKEAEEKFKEI 50 (371)
T ss_pred CCCCHHHHHHHHHHHHHHhCCCCCCCCHHHHHHHHHH
Confidence 367999999999999986 465433 45567765
No 130
>PRK00095 mutL DNA mismatch repair protein; Reviewed
Probab=28.89 E-value=1.2e+02 Score=24.72 Aligned_cols=38 Identities=13% Similarity=0.155 Sum_probs=30.0
Q ss_pred HHhcCCCCHHHHHHHHHHhhc-----ccCCC-----CcCHHHHhhhhc
Q psy12170 34 LLKQTKFTRQEIRVMYRGFKQ-----FLTFI-----GLNHPTLKDNTN 71 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~Fkk-----ecPsG-----~Lt~~EFk~i~~ 71 (73)
++....++.+|++.|-+...+ .||.| .|++.++.+.|.
T Consensus 568 Ik~g~~Ls~~E~~~Ll~~L~~~~~P~~CPHGRPt~i~l~~~elek~F~ 615 (617)
T PRK00095 568 IRAGRRLTLEEMNALLRQLEATENPGTCPHGRPTYIELSLSDLEKLFK 615 (617)
T ss_pred hhccCCCCHHHHHHHHHHHHhcccccCCCCCCeeEEECCHHHHHHHhC
Confidence 444568999999999988776 59999 578888877664
No 131
>TIGR02653 Lon_rel_chp conserved hypothetical protein. This model describes a protein family of unknown function, about 690 residues in length, in which some members show C-terminal sequence similarity to Pfam model pfam05362, which is the Lon protease C-terminal proteolytic domain, from MEROPS family S16. However, the annotated catalytic sites of E. coli Lon protease are not conserved in members of this family. Members have a motif GP[RK][GS]TGKS, similar to the ATP-binding P-loop motif GxxGxGK[ST].
Probab=28.63 E-value=68 Score=27.09 Aligned_cols=29 Identities=7% Similarity=0.076 Sum_probs=25.0
Q ss_pred CHHHHHHHHHHhhcc-cCCCCcCHHHHhhh
Q psy12170 41 TRQEIRVMYRGFKQF-LTFIGLNHPTLKDN 69 (73)
Q Consensus 41 s~~EIk~lYk~Fkke-cPsG~Lt~~EFk~i 69 (73)
+..=|+++..+|+|. +|+|.++.++|..+
T Consensus 407 D~~aV~kt~SgllKLl~P~~~~~~ee~e~~ 436 (675)
T TIGR02653 407 DVIAVRKTVSGLLKLLYPDGEYTKDDVREC 436 (675)
T ss_pred hHHHHHHHHHHHHHHhCCCCCCCHHHHHHH
Confidence 455688999999997 99999999999865
No 132
>PF08667 BetR: BetR domain; InterPro: IPR013975 CheY-like phosphoacceptor (or receiver [REC]) domain is a common module in a variety of response regulators of the bacterial signal transduction systems. BetR is one of the many response regulators and is encoded mainly in Burkholderia spp. It is a N-terminal helix-turn-helix domain (HTH) and has been shown to be related to the XRE-type HTH domain (IPR001387 from INTERPRO), it has been suggested that BetR would have dimerization, protein-protein interaction, and activation/relief-of-inhibition properties [].
Probab=28.56 E-value=1e+02 Score=21.13 Aligned_cols=30 Identities=27% Similarity=0.419 Sum_probs=24.0
Q ss_pred HHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhh
Q psy12170 32 EDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKD 68 (73)
Q Consensus 32 ~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~ 68 (73)
..|.-.+.||-.||+++.+.| .++.++|..
T Consensus 41 RKL~G~~~ftl~EI~~Ia~~f-------gvS~d~l~g 70 (147)
T PF08667_consen 41 RKLNGKSPFTLEEIKKIAKHF-------GVSPDELFG 70 (147)
T ss_pred HHhcCCCCCCHHHHHHHHHHh-------CcCHHHHHh
Confidence 357778899999999999999 566777663
No 133
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=28.43 E-value=1.4e+02 Score=17.87 Aligned_cols=32 Identities=19% Similarity=0.282 Sum_probs=22.1
Q ss_pred ccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhc
Q psy12170 23 VQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 23 ~skls~e~l~~L~~~T--~Fs~~EIk~lYk~Fkk 54 (73)
|+..+.+.+..+..+- +-|++.|++.++..++
T Consensus 29 NPpine~mir~M~~QMG~kpSekqi~Q~m~~mk~ 62 (64)
T PF03672_consen 29 NPPINEKMIRAMMMQMGRKPSEKQIKQMMRSMKN 62 (64)
T ss_pred CCCCCHHHHHHHHHHhCCCccHHHHHHHHHHHHh
Confidence 4456667777766554 6788888888877654
No 134
>PRK09019 translation initiation factor Sui1; Validated
Probab=28.08 E-value=59 Score=21.20 Aligned_cols=21 Identities=14% Similarity=0.164 Sum_probs=17.8
Q ss_pred CCHHHHHHHHHHhhcccCCCC
Q psy12170 40 FTRQEIRVMYRGFKQFLTFIG 60 (73)
Q Consensus 40 Fs~~EIk~lYk~FkkecPsG~ 60 (73)
....+++.|-+.||+.|..|.
T Consensus 55 ~~~~dlk~l~K~lKkk~gcGG 75 (108)
T PRK09019 55 LDDAELKKLAAELKKKCGCGG 75 (108)
T ss_pred CCHHHHHHHHHHHHHHhcCCC
Confidence 368899999999999986663
No 135
>TIGR02688 conserved hypothetical protein TIGR02688. Members of this family are uncharacterized proteins sporadically distributed in bacteria and archaea, about 470 amino acids in length. Several members of this family appear in public databases with annotation as ATP-dependent protease La, despite the lack of similarity to families TIGR00763 (ATP-dependent protease La) or pfam02190 (ATP-dependent protease La (LON) domain). This protein is repeatedly found downstream of another uncharacterized protein of about 880 amino acids in length, described by model TIGR02687.
Probab=27.87 E-value=74 Score=25.57 Aligned_cols=29 Identities=3% Similarity=0.196 Sum_probs=24.4
Q ss_pred CHHHHHHHHHHhhcc-cCCCCcCHHHHhhh
Q psy12170 41 TRQEIRVMYRGFKQF-LTFIGLNHPTLKDN 69 (73)
Q Consensus 41 s~~EIk~lYk~Fkke-cPsG~Lt~~EFk~i 69 (73)
+..=|+++..+|.|. +|+|.++.++|..+
T Consensus 400 D~~aV~kt~SgllKLL~P~~~~~~ee~~~~ 429 (449)
T TIGR02688 400 DVIAVKKTFSGLMKILFPHGTITKEEFTEC 429 (449)
T ss_pred hHHHHHHHHHHHHHHhCCCCCCCHHHHHHH
Confidence 345578999999996 99999999999764
No 136
>PF02337 Gag_p10: Retroviral GAG p10 protein; InterPro: IPR003322 Retroviral matrix proteins (or major core proteins) are components of envelope-associated capsids, which line the inner surface of virus envelopes and are associated with viral membranes []. Matrix proteins are produced as part of Gag precursor polyproteins. During viral maturation, the Gag polyprotein is cleaved into major structural proteins by the viral protease, yielding the matrix (MA), capsid (CA), nucleocapsid (NC), and some smaller peptides. Gag-derived proteins govern the entire assembly and release of the virus particles, with matrix proteins playing key roles in Gag stability, capsid assembly, transport and budding. Although matrix proteins from different retroviruses appear to perform similar functions and can have similar structural folds, their primary sequences can be very different. This entry represents matrix proteins from beta-retroviruses such as Mason-Pfizer monkey virus (MPMV) (Simian Mason-Pfizer virus) and Mouse mammary tumor virus (MMTV) [, ]. This entry also identifies matrix proteins from several eukaryotic endogenous retroviruses, which arise when one or more copies of the retroviral genome becomes integrated into the host genome [].; GO: 0005198 structural molecule activity, 0019028 viral capsid; PDB: 2F77_X 2F76_X.
Probab=27.86 E-value=70 Score=20.18 Aligned_cols=50 Identities=14% Similarity=0.088 Sum_probs=27.9
Q ss_pred ccccccC-CCHHHHHHHHhcC--CCCHHHHHHHHHHhhcccC----CCCcCHHHHhh
Q psy12170 19 SSKHVQK-YRPIALEDLLKQT--KFTRQEIRVMYRGFKQFLT----FIGLNHPTLKD 68 (73)
Q Consensus 19 ~~~~~sk-ls~e~l~~L~~~T--~Fs~~EIk~lYk~FkkecP----sG~Lt~~EFk~ 68 (73)
||.+.|+ +=-..++.|.+.- ..+.+++...|+--.+-|| +|.|+.+..++
T Consensus 1 MG~~~S~~~fv~~Lk~lLk~rGi~v~~~~L~~f~~~i~~~~PWF~~eG~l~~~~W~k 57 (90)
T PF02337_consen 1 MGQSHSKQPFVSILKHLLKERGIRVKKKDLINFLSFIDKVCPWFPEEGTLDLDNWKK 57 (90)
T ss_dssp --SSS-HHHHHHHHHHHHHCCT----HHHHHHHHHHHHHHTT-SS--SS-HHHHHHH
T ss_pred CCccchhhHHHHHHHHHHHHcCeeecHHHHHHHHHHHHHhCCCCCCCCCcCHHHHHH
Confidence 4555444 2223455544443 6789999999998888884 78888876554
No 137
>PF13411 MerR_1: MerR HTH family regulatory protein; PDB: 2JML_A 3GP4_A 3GPV_B.
Probab=27.85 E-value=75 Score=17.48 Aligned_cols=23 Identities=13% Similarity=0.109 Sum_probs=20.1
Q ss_pred HHHHHhcCCCCHHHHHHHHHHhh
Q psy12170 31 LEDLLKQTKFTRQEIRVMYRGFK 53 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk~Fk 53 (73)
+.++.+.++.+..-|+.|-+.+.
T Consensus 3 i~eva~~~gvs~~tlr~y~~~gl 25 (69)
T PF13411_consen 3 IKEVAKLLGVSPSTLRYYEREGL 25 (69)
T ss_dssp HHHHHHHTTTTHHHHHHHHHTTS
T ss_pred HHHHHHHHCcCHHHHHHHHHhcC
Confidence 45788889999999999999887
No 138
>PF10437 Lip_prot_lig_C: Bacterial lipoate protein ligase C-terminus; InterPro: IPR019491 This is the C-terminal domain of a bacterial lipoate protein ligase. There is no conservation between this C terminus and that of vertebrate lipoate protein ligase C-termini, but both are associated with IPR004143 from INTERPRO, further upstream. This C-terminal domain is more stable than IPR004143 from INTERPRO and the hypothesis is that the C-terminal domain has a role in recognising the lipoyl domain and/or transferring the lipoyl group onto it from the lipoyl-AMP intermediate. C-terminal fragments of length 172 to 193 amino acid residues are observed in the eubacterial enzymes whereas in their archaeal counterparts the C-terminal segment is significantly smaller, ranging in size from 87 to 107 amino acid residues. ; PDB: 1X2G_A 3A7R_A 3A7A_A 1X2H_C 1VQZ_A 3R07_C.
Probab=27.79 E-value=1.4e+02 Score=17.59 Aligned_cols=42 Identities=17% Similarity=0.119 Sum_probs=31.3
Q ss_pred HHHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 28 PIALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 28 ~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
.++++....-+.|+++.|...-+..--..--|.++.++|.+.
T Consensus 44 i~~le~~L~G~~~~~~~i~~~l~~~~~~~~~~~~~~~el~~~ 85 (86)
T PF10437_consen 44 IEELEEALIGCPYDREAIKEALNSVDLEDYFGNISVEELIEL 85 (86)
T ss_dssp HHHHHHHHTTCBSSHHHHHHHHHHCHGGGTCCTHHHHHHHHH
T ss_pred HHHHHHHHHhcCCCHHHHHHHHHHhCHhhccccCCHHHHHHh
Confidence 456666677789999999999988843333358888888764
No 139
>KOG0031|consensus
Probab=27.60 E-value=94 Score=22.02 Aligned_cols=33 Identities=15% Similarity=0.122 Sum_probs=28.1
Q ss_pred CCCHHHHHHHHHHhhc--ccCCCCcCHHHHhhhhc
Q psy12170 39 KFTRQEIRVMYRGFKQ--FLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkk--ecPsG~Lt~~EFk~i~~ 71 (73)
-|+..+|+.+-.+|-- ...+|-|.++.++++|.
T Consensus 25 mf~q~QIqEfKEAF~~mDqnrDG~IdkeDL~d~~a 59 (171)
T KOG0031|consen 25 MFDQSQIQEFKEAFNLMDQNRDGFIDKEDLRDMLA 59 (171)
T ss_pred HhhHHHHHHHHHHHHHHhccCCCcccHHHHHHHHH
Confidence 4899999999999976 35889999999998874
No 140
>cd00159 RhoGAP RhoGAP: GTPase-activator protein (GAP) for Rho-like GTPases; GAPs towards Rho/Rac/Cdc42-like small GTPases. Small GTPases (G proteins) cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when bound to GDP. The Rho family of small G proteins, which includes Cdc42Hs, activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. G proteins generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. The RhoGAPs are one of the major classes of regulators of Rho G proteins.
Probab=27.15 E-value=1.2e+02 Score=19.17 Aligned_cols=32 Identities=3% Similarity=0.043 Sum_probs=26.3
Q ss_pred CCCCHHHHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 38 TKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 38 T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
..++...+..+.|.|.+..|.+.|+.+-+..+
T Consensus 48 ~~~~~~~va~~lK~~l~~Lp~pli~~~~~~~~ 79 (169)
T cd00159 48 EDYDVHDVASLLKLYLRELPEPLIPFELYDEF 79 (169)
T ss_pred cccCHHHHHHHHHHHHHcCCCccCCHHHHHHH
Confidence 35688999999999999999999997655443
No 141
>PF09373 PMBR: Pseudomurein-binding repeat; InterPro: IPR018975 Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) is a methanogenic Gram-positive microorganism with a cell wall consisting of pseudomurein. This repeat specifically binds to pseudomurein. This repeat is found at the N terminus of PeiW and PeiP which are pseudomurein binding phage proteins.
Probab=27.15 E-value=40 Score=17.10 Aligned_cols=14 Identities=7% Similarity=-0.064 Sum_probs=10.8
Q ss_pred CCCcCHHHHhhhhc
Q psy12170 58 FIGLNHPTLKDNTN 71 (73)
Q Consensus 58 sG~Lt~~EFk~i~~ 71 (73)
+|.|+.+++.++.+
T Consensus 2 ~~~i~~~~~~d~a~ 15 (33)
T PF09373_consen 2 SGTISKEEYLDMAS 15 (33)
T ss_pred CceecHHHHHHHHH
Confidence 57888888887754
No 142
>PRK12279 50S ribosomal protein L22/unknown domain fusion protein; Provisional
Probab=27.04 E-value=69 Score=24.54 Aligned_cols=27 Identities=11% Similarity=0.208 Sum_probs=19.7
Q ss_pred HHHHHHHHhhcccCCCCcCHHHHhhhhcCC
Q psy12170 44 EIRVMYRGFKQFLTFIGLNHPTLKDNTNGI 73 (73)
Q Consensus 44 EIk~lYk~FkkecPsG~Lt~~EFk~i~~~~ 73 (73)
....+++.|++ .+.|++++|.+.|.|.
T Consensus 249 ~P~~lW~k~~~---~sGIsk~~F~~Yf~g~ 275 (311)
T PRK12279 249 AISSIWRKYGK---QSVISKKEYDAYYEGK 275 (311)
T ss_pred CHHHHHHHHhh---ccCCCHHHHHHHhCCC
Confidence 34556677766 4688999999999883
No 143
>PRK10753 transcriptional regulator HU subunit alpha; Provisional
Probab=27.03 E-value=1e+02 Score=18.67 Aligned_cols=26 Identities=0% Similarity=0.135 Sum_probs=22.5
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
+-++.|.+.+++|+.+++.....|..
T Consensus 5 eli~~ia~~~~~s~~~~~~~v~~~~~ 30 (90)
T PRK10753 5 QLIDVIADKAELSKTQAKAALESTLA 30 (90)
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 34677899999999999999999976
No 144
>PF14410 GH-E: HNH/ENDO VII superfamily nuclease with conserved GHE residues
Probab=26.83 E-value=42 Score=19.98 Aligned_cols=44 Identities=11% Similarity=0.003 Sum_probs=27.1
Q ss_pred hhhcccccccCCCHHHHHHH---HhcCCCCHHHHHHHHH---HhhcccCCCCcC
Q psy12170 15 FEIESSKHVQKYRPIALEDL---LKQTKFTRQEIRVMYR---GFKQFLTFIGLN 62 (73)
Q Consensus 15 ~~~e~~~~~skls~e~l~~L---~~~T~Fs~~EIk~lYk---~FkkecPsG~Lt 62 (73)
...+|+..... +-..+ ....+.|++|+..||. .|.=+||+...+
T Consensus 17 ~~~dmgH~~~~----e~~~~~~~~~~~~~t~ke~~d~~n~p~nyrlE~~s~NrS 66 (70)
T PF14410_consen 17 GPWDMGHKPGV----EYWRLVGRAEEGGITRKEFLDWYNDPDNYRLEDPSCNRS 66 (70)
T ss_pred CCCCccCchHH----HHHHHHHHHHhcCCCHHHHHHHHhCccceeecCCcCCcC
Confidence 45677776321 22222 2236899999999998 344478887654
No 145
>cd04761 HTH_MerR-SF Helix-Turn-Helix DNA binding domain of transcription regulators from the MerR superfamily. Helix-turn-helix (HTH) transcription regulator MerR superfamily, N-terminal domain. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription of multidrug/metal ion transporter genes and oxidative stress regulons by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=26.74 E-value=99 Score=15.67 Aligned_cols=23 Identities=22% Similarity=0.114 Sum_probs=18.3
Q ss_pred HHHHHhcCCCCHHHHHHHHHHhh
Q psy12170 31 LEDLLKQTKFTRQEIRVMYRGFK 53 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk~Fk 53 (73)
+.++.+.++.+..-|+.|-+...
T Consensus 3 ~~e~a~~~gv~~~tlr~~~~~g~ 25 (49)
T cd04761 3 IGELAKLTGVSPSTLRYYERIGL 25 (49)
T ss_pred HHHHHHHHCcCHHHHHHHHHCCC
Confidence 34667778999999999977765
No 146
>KOG0489|consensus
Probab=26.51 E-value=55 Score=23.77 Aligned_cols=19 Identities=11% Similarity=0.494 Sum_probs=15.2
Q ss_pred HHHhcCCCCHHHHHHHHHH
Q psy12170 33 DLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 33 ~L~~~T~Fs~~EIk~lYk~ 51 (73)
+|....++|+.+||-||..
T Consensus 191 EiA~~L~LtErQIKIWFQN 209 (261)
T KOG0489|consen 191 EIAHALNLTERQIKIWFQN 209 (261)
T ss_pred HHHhhcchhHHHHHHHHHH
Confidence 4566678899999999864
No 147
>PRK11426 hypothetical protein; Provisional
Probab=26.47 E-value=1.4e+02 Score=20.13 Aligned_cols=40 Identities=8% Similarity=-0.054 Sum_probs=32.4
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHHhhc-----ccCCCCcCHH
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRGFKQ-----FLTFIGLNHP 64 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~Fkk-----ecPsG~Lt~~ 64 (73)
.+-++.++.|..+|+.+++++...-..+.= ..|+|.+..+
T Consensus 69 ~lG~d~i~~lA~q~Gl~~~~~~~~LA~~LP~~VDklTP~G~lp~~ 113 (132)
T PRK11426 69 ALGTNAVSDLGQKLGVDTSTASSLLAEQLPKIIDALSPQGEVSPQ 113 (132)
T ss_pred HhChHHHHHHHHHHCcCHHHHHHHHHHHhHHHHhccCCCCCCCcc
Confidence 466778889999999999999887777654 4699998764
No 148
>PTZ00100 DnaJ chaperone protein; Provisional
Probab=26.26 E-value=1.5e+02 Score=19.41 Aligned_cols=39 Identities=10% Similarity=-0.089 Sum_probs=27.8
Q ss_pred HHHHHhcCCCCHHHHHHHHHHhhcc-cCCCCcCHHHHhhh
Q psy12170 31 LEDLLKQTKFTRQEIRVMYRGFKQF-LTFIGLNHPTLKDN 69 (73)
Q Consensus 31 l~~L~~~T~Fs~~EIk~lYk~Fkke-cPsG~Lt~~EFk~i 69 (73)
.+-|.-....|.+||++-||+..+. .|+-.=+.+.|..|
T Consensus 68 y~ILGv~~~As~~eIkkaYRrLa~~~HPDkgGs~~~~~kI 107 (116)
T PTZ00100 68 YKILNISPTASKERIREAHKQLMLRNHPDNGGSTYIASKV 107 (116)
T ss_pred HHHcCCCCCCCHHHHHHHHHHHHHHhCCCCCCCHHHHHHH
Confidence 4445555678999999999999775 57754466666655
No 149
>cd04236 AAK_NAGS-Urea AAK_NAGS-Urea: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the urea cycle found in animals. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate; NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Ureogenic NAGS activity is dependent on the concentration of glutamate (substrate) and arginine (activator). Domain architecture of ureogenic NAGS consists of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal DUF619 domain. Members of this CD belong to the protein superfamily, the Amino Acid Kinase Family (AAKF).
Probab=26.26 E-value=96 Score=22.85 Aligned_cols=28 Identities=18% Similarity=0.210 Sum_probs=21.9
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhcccC
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQFLT 57 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~FkkecP 57 (73)
+++...++-+.+++|+++|-+.|..-.|
T Consensus 4 ~~~~~~~~~~~~~~e~~~~l~~f~~~~~ 31 (271)
T cd04236 4 DVKAFLHQKGGDPREARYWLTQFQIAMP 31 (271)
T ss_pred hHHHHHHHhCCCHHHHHHHHHHhhccCC
Confidence 4555556677899999999999998544
No 150
>PF13048 DUF3908: Protein of unknown function (DUF3908)
Probab=26.02 E-value=34 Score=22.97 Aligned_cols=40 Identities=15% Similarity=0.159 Sum_probs=28.9
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhc
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTN 71 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~ 71 (73)
.+.++...+.+...+=.++.+-+++-|+ .|....|+.+||
T Consensus 4 ~~~~~i~~~~~~~~~~~k~~~~~kkv~~--~Ld~~n~~~~Yp 43 (127)
T PF13048_consen 4 EFKRWIKSSNFRDEEDYKLRELLKKVCE--LLDISNIKQFYP 43 (127)
T ss_pred HHHHHHHhccCCchhhhHHHHHHHHHHh--hcCcccceEEEe
Confidence 5667788888876443388888888888 566777777776
No 151
>PRK14561 hypothetical protein; Provisional
Probab=25.95 E-value=62 Score=22.20 Aligned_cols=28 Identities=11% Similarity=0.030 Sum_probs=23.7
Q ss_pred CCCHHHHHHHHHHhhc--ccCCCCcCHHHH
Q psy12170 39 KFTRQEIRVMYRGFKQ--FLTFIGLNHPTL 66 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkk--ecPsG~Lt~~EF 66 (73)
+++++||.++-+.+.. .-|+|.+...+|
T Consensus 133 ~~~K~eI~~la~~l~~~~~~~~~~~~~~~~ 162 (194)
T PRK14561 133 GFGRKTIDRLVERLFEIEEGESEEIPKSDY 162 (194)
T ss_pred CCCHHHHHHHHHhhEEEEeccCCCcCccch
Confidence 7899999999999988 368888877655
No 152
>PTZ00232 variable surface protein Vir27; Provisional
Probab=25.93 E-value=85 Score=24.20 Aligned_cols=29 Identities=10% Similarity=-0.032 Sum_probs=22.3
Q ss_pred HHHHHHHHHhhcccCCCCcC---HHHHhhhhc
Q psy12170 43 QEIRVMYRGFKQFLTFIGLN---HPTLKDNTN 71 (73)
Q Consensus 43 ~EIk~lYk~FkkecPsG~Lt---~~EFk~i~~ 71 (73)
++++..|+.|...|+++.=+ -.+|...|.
T Consensus 167 ~~i~~AY~~v~~~C~~~~~~~~yC~~F~~~~~ 198 (363)
T PTZ00232 167 NKYINMYNDAYSDCNKINKKKYDCNKFSELFR 198 (363)
T ss_pred HHHHHHHHHHhhhcCCCCCCCcchHHHHHHhc
Confidence 57899999999999876544 367877764
No 153
>PF13560 HTH_31: Helix-turn-helix domain; PDB: 3F51_C 3F52_A 3PXP_A 2OFY_A.
Probab=25.83 E-value=95 Score=17.07 Aligned_cols=38 Identities=16% Similarity=0.192 Sum_probs=20.1
Q ss_pred hhhhhhhhcccccccCCCHHHHHHHHhcCCCCHHHHHHH
Q psy12170 10 EEVVAFEIESSKHVQKYRPIALEDLLKQTKFTRQEIRVM 48 (73)
Q Consensus 10 ~~~~~~~~e~~~~~skls~e~l~~L~~~T~Fs~~EIk~l 48 (73)
+..-...+|.|... ..++..+..|....+.+..++..|
T Consensus 27 s~s~v~~iE~G~~~-~p~~~~l~~l~~~l~~~~~~~~~l 64 (64)
T PF13560_consen 27 SQSTVSRIERGRRP-RPSPDTLQRLARALGVPPDERAEL 64 (64)
T ss_dssp -HHHHHHHHTTSSS-S-BHHHHHHHHHHTT--HHHHHC-
T ss_pred CHHHHHHHHCCCCC-CCCHHHHHHHHHHHCcCHHHHccC
Confidence 33445566666552 355666777777777776666543
No 154
>PRK00285 ihfA integration host factor subunit alpha; Reviewed
Probab=25.55 E-value=1.3e+02 Score=18.31 Aligned_cols=26 Identities=15% Similarity=0.245 Sum_probs=22.6
Q ss_pred HHHHHHhcCCCCHHHHHHHHHHhhcc
Q psy12170 30 ALEDLLKQTKFTRQEIRVMYRGFKQF 55 (73)
Q Consensus 30 ~l~~L~~~T~Fs~~EIk~lYk~Fkke 55 (73)
-++.+...+++++.++......|...
T Consensus 8 l~~~ia~~~~~s~~~v~~vl~~~~~~ 33 (99)
T PRK00285 8 LAEALFEKVGLSKREAKELVELFFEE 33 (99)
T ss_pred HHHHHHHHhCcCHHHHHHHHHHHHHH
Confidence 36678899999999999999999873
No 155
>PF09435 DUF2015: Fungal protein of unknown function (DUF2015); InterPro: IPR018559 This entry represents uncharacterised proteins found in fungi.
Probab=25.39 E-value=1.7e+02 Score=19.80 Aligned_cols=33 Identities=15% Similarity=0.127 Sum_probs=23.7
Q ss_pred HHHHHHHHhcC--CCCHHHHHHHHHHhhcc--cCCCC
Q psy12170 28 PIALEDLLKQT--KFTRQEIRVMYRGFKQF--LTFIG 60 (73)
Q Consensus 28 ~e~l~~L~~~T--~Fs~~EIk~lYk~Fkke--cPsG~ 60 (73)
++++..++++- +|+++-+-..-++|++. .|+|.
T Consensus 87 k~EI~~IM~~~~v~FDeARliy~~~~f~~NgI~pdG~ 123 (128)
T PF09435_consen 87 KREIRRIMKRRRVNFDEARLIYTERRFKKNGIGPDGR 123 (128)
T ss_pred HHHHHHHHHHcCCCHHHHHHHHHHHHHHHcCCCCCCC
Confidence 67888888776 45666666666889985 58875
No 156
>PF00165 HTH_AraC: Bacterial regulatory helix-turn-helix proteins, AraC family; PDB: 1WPK_A 1ZGW_A 1U8B_A.
Probab=25.36 E-value=96 Score=15.74 Aligned_cols=24 Identities=21% Similarity=0.421 Sum_probs=15.2
Q ss_pred HHHHHHHhcCCCCHHHHHHHHHHh
Q psy12170 29 IALEDLLKQTKFTRQEIRVMYRGF 52 (73)
Q Consensus 29 e~l~~L~~~T~Fs~~EIk~lYk~F 52 (73)
-.|+++....+||..-+.+.++.-
T Consensus 9 ~~l~~iA~~~g~S~~~f~r~Fk~~ 32 (42)
T PF00165_consen 9 LTLEDIAEQAGFSPSYFSRLFKKE 32 (42)
T ss_dssp --HHHHHHHHTS-HHHHHHHHHHH
T ss_pred CCHHHHHHHHCCCHHHHHHHHHHH
Confidence 357788888899877766665543
No 157
>PF01465 GRIP: GRIP domain; InterPro: IPR000237 The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain [, , ] is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi []. The GRIP domain contains a completely conserved tyrosine residue.; GO: 0005515 protein binding, 0000042 protein targeting to Golgi; PDB: 1R4A_H 1UPT_B.
Probab=25.22 E-value=87 Score=17.05 Aligned_cols=24 Identities=21% Similarity=0.198 Sum_probs=16.3
Q ss_pred CCHHHHHHHHhcCCCCHHHHHHHH
Q psy12170 26 YRPIALEDLLKQTKFTRQEIRVMY 49 (73)
Q Consensus 26 ls~e~l~~L~~~T~Fs~~EIk~lY 49 (73)
.+...+..|..--+||++|.++|+
T Consensus 23 ~~~~llpvi~tlL~fs~~e~~~i~ 46 (46)
T PF01465_consen 23 EREQLLPVIATLLKFSPEEKQKIL 46 (46)
T ss_dssp -HHHHHHHHHHHTT--HHHHHHHH
T ss_pred hHHHHHHHHHHHHCCCHHHHHhhC
Confidence 344466678888899999999874
No 158
>cd07971 OBF_DNA_ligase_LigD The Oligonucleotide/oligosaccharide binding (OB)-fold domain of ATP-dependent DNA ligase LigD is a DNA-binding module that is part of the catalytic core unit. ATP-dependent polynucleotide ligases catalyze phosphodiester bond formation using nicked nucleic acid substrates with the high energy nucleotide of ATP as a cofactor in a three step reaction mechanism. DNA ligases play a vital role in the diverse processes of DNA replication, recombination and repair. ATP-dependent ligases are present in many organisms such as viruses, bacteriohages, eukarya, archaea and bacteria. Bacterial DNA ligases are divided into two broad classes: NAD-dependent and ATP-dependent. All bacterial species have a NAD-dependent DNA ligase (LigA). Some bacterial genomes contain multiple genes for DNA ligases that are predicted to use ATP as their cofactor, including Mycobacterium tuberculosis LigB, LigC, and LigD. This group is composed of Mycobacterium tuberculosis LigD and similar ba
Probab=25.10 E-value=73 Score=19.88 Aligned_cols=18 Identities=17% Similarity=0.287 Sum_probs=16.2
Q ss_pred cCCCCHHHHHHHHHHhhc
Q psy12170 37 QTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 37 ~T~Fs~~EIk~lYk~Fkk 54 (73)
.|+|+.++++.|++.+.+
T Consensus 40 gtG~s~~~~~~l~~~l~~ 57 (115)
T cd07971 40 GTGFSAATLRELRERLAP 57 (115)
T ss_pred cCCCCHHHHHHHHHHhhc
Confidence 479999999999999976
No 159
>PRK00523 hypothetical protein; Provisional
Probab=25.04 E-value=1.7e+02 Score=17.98 Aligned_cols=32 Identities=16% Similarity=0.183 Sum_probs=23.0
Q ss_pred ccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhc
Q psy12170 23 VQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 23 ~skls~e~l~~L~~~T--~Fs~~EIk~lYk~Fkk 54 (73)
|+.++++.++.+..+- +-|++.|++.++..++
T Consensus 37 NPpine~mir~M~~QMGqKPSekki~Q~m~~mk~ 70 (72)
T PRK00523 37 NPPITENMIRAMYMQMGRKPSESQIKQVMRSVKN 70 (72)
T ss_pred CcCCCHHHHHHHHHHhCCCccHHHHHHHHHHHHh
Confidence 4567777887766554 6788888888887653
No 160
>PF13348 Y_phosphatase3C: Tyrosine phosphatase family C-terminal region; PDB: 1YWF_A 2OZ5_B.
Probab=24.99 E-value=1e+02 Score=17.22 Aligned_cols=19 Identities=11% Similarity=0.272 Sum_probs=12.9
Q ss_pred HHhcCCCCHHHHHHHHHHh
Q psy12170 34 LLKQTKFTRQEIRVMYRGF 52 (73)
Q Consensus 34 L~~~T~Fs~~EIk~lYk~F 52 (73)
|....+||+++|.+|.+.+
T Consensus 49 l~~~lgl~~~~i~~Lr~~l 67 (68)
T PF13348_consen 49 LREELGLSEEDIERLRERL 67 (68)
T ss_dssp HHHT-T--HHHHHHHHHHH
T ss_pred HHHcCCCCHHHHHHHHHHc
Confidence 6666799999999998776
No 161
>PF14550 Peptidase_U35_2: Putative phage protease XkdF
Probab=24.94 E-value=69 Score=21.21 Aligned_cols=22 Identities=23% Similarity=0.166 Sum_probs=18.7
Q ss_pred CCHHHHHHHHHHhhcccCCCCc
Q psy12170 40 FTRQEIRVMYRGFKQFLTFIGL 61 (73)
Q Consensus 40 Fs~~EIk~lYk~FkkecPsG~L 61 (73)
+|++||.+-...|++.+.++.+
T Consensus 35 ~taeeIekaA~~Fm~~~~~~d~ 56 (122)
T PF14550_consen 35 MTAEEIEKAAHKFMKNYRNIDK 56 (122)
T ss_pred cCHHHHHHHHHHHHHHcccCCc
Confidence 8999999999999998766543
No 162
>PF13720 Acetyltransf_11: Udp N-acetylglucosamine O-acyltransferase; Domain 2; PDB: 3I3A_A 3I3X_A 3HSQ_B 2JF2_A 1LXA_A 2AQ9_A 2QIV_X 2QIA_A 2JF3_A 4EQY_F ....
Probab=24.86 E-value=78 Score=19.17 Aligned_cols=28 Identities=18% Similarity=0.406 Sum_probs=19.9
Q ss_pred hcCCCCHHHHHHHHHHhhcccCCCCcCHH
Q psy12170 36 KQTKFTRQEIRVMYRGFKQFLTFIGLNHP 64 (73)
Q Consensus 36 ~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~ 64 (73)
+.-+|++++|..+-+.|+...-+| ++.+
T Consensus 24 rR~Gfs~~~i~~l~~ayr~l~~~~-~~~~ 51 (83)
T PF13720_consen 24 RRRGFSKEEISALRRAYRILFRSG-LTLE 51 (83)
T ss_dssp HHTTS-HHHHHHHHHHHHHHHTSS-S-HH
T ss_pred HHcCCCHHHHHHHHHHHHHHHhCC-CCHH
Confidence 356899999999999998876555 3443
No 163
>KOG0485|consensus
Probab=24.82 E-value=58 Score=24.39 Aligned_cols=26 Identities=8% Similarity=0.278 Sum_probs=19.3
Q ss_pred CCHHHHHHHHhcCCCCHHHHHHHHHH
Q psy12170 26 YRPIALEDLLKQTKFTRQEIRVMYRG 51 (73)
Q Consensus 26 ls~e~l~~L~~~T~Fs~~EIk~lYk~ 51 (73)
|+..+-..|.++.++|+.+|+-||+.
T Consensus 129 LSsaeRa~LA~sLqLTETQVKIWFQN 154 (268)
T KOG0485|consen 129 LSSAERAGLAASLQLTETQVKIWFQN 154 (268)
T ss_pred hhHHHHhHHHHhhhhhhhhhhhhhhh
Confidence 34444445777889999999999874
No 164
>PF07037 DUF1323: Putative transcription regulator (DUF1323); InterPro: IPR010749 This family consists of several hypothetical Enterobacterial proteins of around 120 residues in length. The function of this family is unknown.
Probab=24.80 E-value=79 Score=21.27 Aligned_cols=20 Identities=30% Similarity=0.340 Sum_probs=16.8
Q ss_pred HHHhcCCCCHHHHHHHHHHh
Q psy12170 33 DLLKQTKFTRQEIRVMYRGF 52 (73)
Q Consensus 33 ~L~~~T~Fs~~EIk~lYk~F 52 (73)
+|...|+++++-|.+|-|.-
T Consensus 5 ELA~~tG~srQTINrWvRke 24 (122)
T PF07037_consen 5 ELAELTGYSRQTINRWVRKE 24 (122)
T ss_pred HHHHHhCccHHHHHHHHHhc
Confidence 46778999999999998753
No 165
>smart00342 HTH_ARAC helix_turn_helix, arabinose operon control protein.
Probab=24.68 E-value=1.2e+02 Score=16.31 Aligned_cols=28 Identities=4% Similarity=0.172 Sum_probs=19.2
Q ss_pred HHHHHHhcCCC-CHHHHHHHHHHhhcccC
Q psy12170 30 ALEDLLKQTKF-TRQEIRVMYRGFKQFLT 57 (73)
Q Consensus 30 ~l~~L~~~T~F-s~~EIk~lYk~FkkecP 57 (73)
.+.++....+| +..-+.+.|+.+....|
T Consensus 52 ~~~~ia~~~g~~s~~~f~r~Fk~~~g~sp 80 (84)
T smart00342 52 SVTEIALRVGFSSQSYFSRAFKKLFGVTP 80 (84)
T ss_pred CHHHHHHHhCCCChHHHHHHHHHHHCcCh
Confidence 46677777788 88877777776654333
No 166
>KOG3218|consensus
Probab=24.17 E-value=64 Score=23.53 Aligned_cols=31 Identities=19% Similarity=0.094 Sum_probs=22.7
Q ss_pred HHHHHHHHHHhhcc----cCCC--------CcCHHHHhhhhcC
Q psy12170 42 RQEIRVMYRGFKQF----LTFI--------GLNHPTLKDNTNG 72 (73)
Q Consensus 42 ~~EIk~lYk~Fkke----cPsG--------~Lt~~EFk~i~~~ 72 (73)
+.|+.++|+.++.- --.| .|++++|+..|.+
T Consensus 6 e~E~~rl~~ar~T~~qMlrDRGY~vt~~el~ltLe~F~~~yg~ 48 (208)
T KOG3218|consen 6 EEEIYRLYLARKTAMQMLRDRGYTVTQEELDLTLEEFKARYGD 48 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCccccHHHhhhhHHHHHHHhcc
Confidence 77899999988762 1223 5788999998864
No 167
>COG4397 Mu-like prophage major head subunit gpT [General function prediction only]
Probab=23.96 E-value=68 Score=24.28 Aligned_cols=32 Identities=19% Similarity=0.323 Sum_probs=25.7
Q ss_pred CCCCHHHHHHHHHHhhcccCCC-CcCHHHHhhh
Q psy12170 38 TKFTRQEIRVMYRGFKQFLTFI-GLNHPTLKDN 69 (73)
Q Consensus 38 T~Fs~~EIk~lYk~FkkecPsG-~Lt~~EFk~i 69 (73)
|..+..-|+.|+.+|++.+..| ...+.+++.|
T Consensus 2 ~~iTPd~ikALftGf~knFqdGl~mApSQYkeI 34 (308)
T COG4397 2 TPITPDMIKALFTGFRKNFQDGLSMAPSQYKEI 34 (308)
T ss_pred CccCHHHHHHHHHhhhhhhhhhhhhChhhhhhh
Confidence 5678888999999999998888 5666677665
No 168
>PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional
Probab=23.89 E-value=90 Score=22.49 Aligned_cols=26 Identities=19% Similarity=0.268 Sum_probs=21.2
Q ss_pred hcCCCCHHHHHHHHHHhhcccCCCCc
Q psy12170 36 KQTKFTRQEIRVMYRGFKQFLTFIGL 61 (73)
Q Consensus 36 ~~T~Fs~~EIk~lYk~FkkecPsG~L 61 (73)
+.-+|++++|+.+.+.|+..+-+|..
T Consensus 196 ~r~g~~~~~~~~~~~~~~~~~~~~~~ 221 (255)
T PRK12461 196 RRRGFSSRAIRALKRAYKIIYRSGLS 221 (255)
T ss_pred hhcCCCHHHHHHHHHHHHHHHhcCCC
Confidence 45689999999999999887776643
No 169
>PF05120 GvpG: Gas vesicle protein G ; InterPro: IPR007804 Gas vesicles are intracellular, protein-coated, and hollow organelles found in cyanobacteria and halophilic archaea. They are permeable to ambient gases by diffusion and provide buoyancy, enabling cells to move upwards in water to access oxygen and/or light. Proteins containing this family are involved in the formation of gas vesicles [].
Probab=23.70 E-value=1.4e+02 Score=18.33 Aligned_cols=24 Identities=8% Similarity=0.202 Sum_probs=17.3
Q ss_pred HHHHHHHHHHhhcccCCCCcCHHHHhhh
Q psy12170 42 RQEIRVMYRGFKQFLTFIGLNHPTLKDN 69 (73)
Q Consensus 42 ~~EIk~lYk~FkkecPsG~Lt~~EFk~i 69 (73)
++++..|+..|- +|.||.++|...
T Consensus 34 ~~~L~~L~~~~e----~GEIseeEf~~~ 57 (79)
T PF05120_consen 34 RRELAELQEALE----AGEISEEEFERR 57 (79)
T ss_pred HHHHHHHHHHHH----cCCCCHHHHHHH
Confidence 455666677664 799999999653
No 170
>PF10281 Ish1: Putative stress-responsive nuclear envelope protein; InterPro: IPR018803 This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues [].
Probab=23.68 E-value=1.2e+02 Score=15.54 Aligned_cols=29 Identities=3% Similarity=0.063 Sum_probs=22.2
Q ss_pred CCHHHHHHHHHHhhcccCCCCcCHHHHhh
Q psy12170 40 FTRQEIRVMYRGFKQFLTFIGLNHPTLKD 68 (73)
Q Consensus 40 Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~ 68 (73)
-|..+|+.|-+...-..|.+..+++++.+
T Consensus 4 Ws~~~L~~wL~~~gi~~~~~~~~rd~Ll~ 32 (38)
T PF10281_consen 4 WSDSDLKSWLKSHGIPVPKSAKTRDELLK 32 (38)
T ss_pred CCHHHHHHHHHHcCCCCCCCCCCHHHHHH
Confidence 46788999988877777776678888754
No 171
>PF09124 Endonuc-dimeris: T4 recombination endonuclease VII, dimerisation; InterPro: IPR015208 This entry represents a dimerisation domain predominantly found in Bacteriophage T4 recombination endonuclease VII. It adopts a helical secondary structure, with three alpha helices oriented parallel to each other. As well as mediating dimerisation of the protein, this domain is also involved in binding to the DNA major groove []. ; PDB: 1EN7_B 1E7L_B 2QNF_A 2QNC_A 1E7D_A.
Probab=23.61 E-value=92 Score=18.07 Aligned_cols=34 Identities=9% Similarity=0.160 Sum_probs=20.2
Q ss_pred cccccccCCCHHHHHHHHhcC------CCCHHHHHHHHHH
Q psy12170 18 ESSKHVQKYRPIALEDLLKQT------KFTRQEIRVMYRG 51 (73)
Q Consensus 18 e~~~~~skls~e~l~~L~~~T------~Fs~~EIk~lYk~ 51 (73)
|+.+..|+|+..++......- .-++++|-+.|+.
T Consensus 8 D~~K~FSRl~k~eMiaem~~~G~~y~~~~tK~~Lvk~fkK 47 (54)
T PF09124_consen 8 DKVKWFSRLTKPEMIAEMDSYGFEYNEKDTKAQLVKIFKK 47 (54)
T ss_dssp HHHHHHHTS-HHHHHHHHHHTT----TTS-HHHHHHHHHH
T ss_pred HHHHHHHhcCHHHHHHHHHHhCCcCCccccHHHHHHHHHH
Confidence 344556788887766544453 4578888777764
No 172
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class. This model describes a DNA-binding domain restricted to (but common in) plant proteins, many of which also contain a response regulator domain. The domain appears related to the Myb-like DNA-binding domain described by Pfam model pfam00249. It is distinguished in part by a well-conserved motif SH[AL]QKY[RF] at the C-terminal end of the motif.
Probab=23.58 E-value=1.5e+02 Score=16.80 Aligned_cols=28 Identities=14% Similarity=0.334 Sum_probs=22.5
Q ss_pred CCHHHHHHHHhcCCCCHHHHHHHHHHhh
Q psy12170 26 YRPIALEDLLKQTKFTRQEIRVMYRGFK 53 (73)
Q Consensus 26 ls~e~l~~L~~~T~Fs~~EIk~lYk~Fk 53 (73)
-.|..|..++..+++|..+|+.-+..+.
T Consensus 27 a~pk~I~~~~~~~~lT~~qV~SH~QKy~ 54 (57)
T TIGR01557 27 ATPKRILELMVVDGLTRDQVASHLQKYR 54 (57)
T ss_pred cchHHHHHHcCCCCCCHHHHHHHHHHHH
Confidence 5677888888889999999998877654
No 173
>cd00043 CYCLIN Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
Probab=23.48 E-value=1.2e+02 Score=16.29 Aligned_cols=23 Identities=35% Similarity=0.401 Sum_probs=16.4
Q ss_pred HHHHHHhcCCC-CHHHHHHHHHHh
Q psy12170 30 ALEDLLKQTKF-TRQEIRVMYRGF 52 (73)
Q Consensus 30 ~l~~L~~~T~F-s~~EIk~lYk~F 52 (73)
.++++...++. +..+|.++++.+
T Consensus 64 ~~~~~~~~~~~~~~~~i~~~e~~i 87 (88)
T cd00043 64 WLKDLVHVTGYATEEEILRMEKLL 87 (88)
T ss_pred CHHHHhHHhCCCCHHHHHHHHHHh
Confidence 34556666777 888888888764
No 174
>PF09808 SNAPc_SNAP43: Small nuclear RNA activating complex (SNAPc), subunit SNAP43; InterPro: IPR019188 Members of this family are part of the SNAPc complex required for the transcription of both RNA polymerase II and III small-nuclear RNA genes. They bind to the proximal sequence element (PSE), a non-TATA-box basal promoter element common to these 2 types of genes. Furthermore, they also recruit TBP and BRF2 to the U6 snRNA TATA box. SNAPc consists of at least four stably associated subunits, SNAP43, SNAP45, SNAP50, and SNAP190. None of the three small subunits can bind to the PSE on their own [].
Probab=23.35 E-value=1.2e+02 Score=20.82 Aligned_cols=26 Identities=15% Similarity=0.241 Sum_probs=19.5
Q ss_pred HHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 42 RQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 42 ~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+++|..+-.+|++. ..++-+.|++++
T Consensus 3 ~~D~~~Ll~~F~~~---~~~~F~~F~~~W 28 (194)
T PF09808_consen 3 KEDIDELLQRFQQA---ESVRFEDFKRLW 28 (194)
T ss_pred HHHHHHHHHHHHHc---CCCCHHHHHHHH
Confidence 45777888888876 567788888776
No 175
>PF13373 DUF2407_C: DUF2407 C-terminal domain
Probab=23.03 E-value=1.1e+02 Score=20.63 Aligned_cols=24 Identities=25% Similarity=0.423 Sum_probs=18.1
Q ss_pred HHHhcCCCCHHHHHHHHHHhhcccC
Q psy12170 33 DLLKQTKFTRQEIRVMYRGFKQFLT 57 (73)
Q Consensus 33 ~L~~~T~Fs~~EIk~lYk~FkkecP 57 (73)
.|. ..+||++||..|-+-|.....
T Consensus 5 RLl-~~GFS~~eI~~LR~QF~~~~~ 28 (140)
T PF13373_consen 5 RLL-SAGFSPEEIQDLRSQFHSIYG 28 (140)
T ss_pred HHH-HcCCCHHHHHHHHHHHHHHhc
Confidence 444 448999999999888887543
No 176
>KOG2047|consensus
Probab=22.97 E-value=93 Score=26.84 Aligned_cols=53 Identities=17% Similarity=0.051 Sum_probs=40.2
Q ss_pred CCCchhhhhhhhhhcccccccCCCHHH-HHHHHhcCCCCHHHHHHHHHHhhcccCC
Q psy12170 4 DDESDVEEVVAFEIESSKHVQKYRPIA-LEDLLKQTKFTRQEIRVMYRGFKQFLTF 58 (73)
Q Consensus 4 ~~~~~~~~~~~~~~e~~~~~skls~e~-l~~L~~~T~Fs~~EIk~lYk~FkkecPs 58 (73)
-|....+|++|+|.|...++-. -.- +..+..+-+=+.+.+..+|.+|.+..|.
T Consensus 5 ~dl~~~~EDvpfEeEilRnp~s--vk~W~RYIe~k~~sp~k~~~~lYERal~~lp~ 58 (835)
T KOG2047|consen 5 VDLNFENEDVPFEEEILRNPFS--VKCWLRYIEHKAGSPDKQRNLLYERALKELPG 58 (835)
T ss_pred ccccccccccchHHHHHcCchh--HHHHHHHHHHHccCChHHHHHHHHHHHHHCCC
Confidence 4667778899999998877332 222 5567777788899999999999998774
No 177
>PF09570 RE_SinI: SinI restriction endonuclease; InterPro: IPR019070 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry includes the restriction endonuclease SinI, which recognises and cleaves the double-stranded sequence G^GWCC. ; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=22.96 E-value=73 Score=23.39 Aligned_cols=25 Identities=20% Similarity=0.297 Sum_probs=18.0
Q ss_pred HHHHHHHHhhccc-----------CCCCcCHHHHhh
Q psy12170 44 EIRVMYRGFKQFL-----------TFIGLNHPTLKD 68 (73)
Q Consensus 44 EIk~lYk~Fkkec-----------PsG~Lt~~EFk~ 68 (73)
.|++|||-|.+.. .+-.|++++|+.
T Consensus 177 ~IkKW~R~~sk~~~TNW~~lp~~~~~~~LsE~~F~~ 212 (221)
T PF09570_consen 177 PIKKWFRTFSKTGETNWDNLPEMMGSYNLSEEDFKN 212 (221)
T ss_pred chHHHHHhhcCCCccchhhhhhhhccCcCCHHHHHH
Confidence 5899999998832 233677777764
No 178
>smart00751 BSD domain in transcription factors and synapse-associated proteins.
Probab=22.75 E-value=1.4e+02 Score=16.13 Aligned_cols=36 Identities=17% Similarity=0.281 Sum_probs=22.3
Q ss_pred CHHHHHHHHhcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 27 RPIALEDLLKQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 27 s~e~l~~L~~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
+.+.+..|... -..|+++|.. .|| +.|+.++|=..|
T Consensus 8 ~~~~i~~il~~----~p~l~~~~~~---lVP-~~~se~~FW~ry 43 (51)
T smart00751 8 KKEEIESLLKE----NPLLKKLYNE---LVP-KVLSEEEFWARY 43 (51)
T ss_pred CHHHHHHHHHH----CHHHHHHHHH---HCC-CCCCHHHHHHHH
Confidence 34455555432 2347777766 577 489999995554
No 179
>PF01395 PBP_GOBP: PBP/GOBP family; InterPro: IPR006170 The olfactory receptors of terrestrial animals exist in an aqueous environment, yet detect odorants that are primarily hydrophobic. The aqueous solubility of hydrophobic odorants is thought to be greatly enhanced via odorant binding proteins which exist in the extracellular fluid surrounding the odorant receptors []. This family is composed of pheromone binding proteins (PBP), which are male-specific and associate with pheromone-sensitive neurons and general-odorant binding proteins (GOBP). ; GO: 0005549 odorant binding; PDB: 2KPH_A 3NHT_A 3NHI_A 3NGV_A 3K1E_B 3DZT_A 3DYE_A 3DXL_A 3DY9_A 3BJH_A ....
Probab=22.68 E-value=1.7e+02 Score=17.01 Aligned_cols=47 Identities=13% Similarity=0.061 Sum_probs=30.9
Q ss_pred CCCHHHHHHHHhcCCCCHHHHHHHHHHhhcc----cCCCCcCHHHHhhhhc
Q psy12170 25 KYRPIALEDLLKQTKFTRQEIRVMYRGFKQF----LTFIGLNHPTLKDNTN 71 (73)
Q Consensus 25 kls~e~l~~L~~~T~Fs~~EIk~lYk~Fkke----cPsG~Lt~~EFk~i~~ 71 (73)
..+.+.+..+....-.+..+++-+.+-|.+. .++|.++.+.+...+.
T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~C~~~C~~~~~g~~~~~g~~~~~~~~~~~~ 80 (121)
T PF01395_consen 30 GITEDDIEQLRKGKPPEDEEIKCFMRCVLKKLGLMDDDGKFDVDKIREQLK 80 (121)
T ss_dssp TSSHHHHHHHHTTTSTTSHHHHHHHHHHHHHTTSBETTSEBBHHHHHHHHH
T ss_pred CcCHHHHHHHHcCCCCCCcccCcHHHHHHHHhhhhhccCcccHHHHHHHHh
Confidence 3455666666655555546777665555552 4899999998887764
No 180
>PF02208 Sorb: Sorbin homologous domain; InterPro: IPR003127 Sorbin is an active peptide present in the digestive tract, where it has pro-absorptive and anti-secretory effects in different parts of the intestine, including the ability to decrease VIP (vasoactive intestinal peptide) and cholera toxin-induced secretion. It is expressed in some intestinal and pancreatic endocrine tumours in humans []. Sorbin-homology domains are found in adaptor proteins such as vinexin, CAP/ponsin and argBP2, which regulate various cellular functions, including cell adhesion, cytoskeletal organisation, and growth factor signalling []. In addition to the sorbin domain, these proteins contain three SH3 (src homology 3) domains. The sorbin homology domain mediates the interaction of vinexin and CAP with flotillin, which is crucial for the localisation of SH3-binding proteins to the lipid raft, a region of the plasma membrane rich in cholesterol and sphingolipids that acts to concentrate certain signalling molecules. The sorbin homology domain of adaptor proteins may mediate interactions with the lipid raft that are crucial to intracellular communication [].
Probab=22.61 E-value=45 Score=18.96 Aligned_cols=10 Identities=40% Similarity=0.617 Sum_probs=6.7
Q ss_pred HHHHHHH-hhc
Q psy12170 45 IRVMYRG-FKQ 54 (73)
Q Consensus 45 Ik~lYk~-Fkk 54 (73)
-+.|||. |++
T Consensus 25 ~kDWYktMFkq 35 (47)
T PF02208_consen 25 PKDWYKTMFKQ 35 (47)
T ss_pred hhHHHHHHHHH
Confidence 3568887 766
No 181
>TIGR03830 CxxCG_CxxCG_HTH putative zinc finger/helix-turn-helix protein, YgiT family. This model describes a family of predicted regulatory proteins with a conserved zinc finger/HTH architecture. The amino-terminal region contains a novel domain, featuring two CXXC motifs and occuring in a number of small bacterial proteins as well as in the present family. The carboxyl-terminal region consists of a helix-turn-helix domain, modeled by pfam01381. The predicted function is DNA binding and transcriptional regulation.
Probab=22.58 E-value=1.5e+02 Score=18.23 Aligned_cols=26 Identities=12% Similarity=0.221 Sum_probs=20.2
Q ss_pred cCCCHHHHHHHHhcCCCCHHHHHHHH
Q psy12170 24 QKYRPIALEDLLKQTKFTRQEIRVMY 49 (73)
Q Consensus 24 skls~e~l~~L~~~T~Fs~~EIk~lY 49 (73)
+.+.+..+..++...++|..++.++-
T Consensus 63 ~~~~~~~i~~~r~~~gltq~~lA~~l 88 (127)
T TIGR03830 63 GLLTPPEIRRIRKKLGLSQREAAELL 88 (127)
T ss_pred CCcCHHHHHHHHHHcCCCHHHHHHHh
Confidence 46788888888888888888877664
No 182
>KOG2758|consensus
Probab=22.56 E-value=1.2e+02 Score=24.27 Aligned_cols=33 Identities=24% Similarity=0.348 Sum_probs=26.8
Q ss_pred CHHHHHHHHhcCCCCHHHHHHHHH--HhhcccCCC
Q psy12170 27 RPIALEDLLKQTKFTRQEIRVMYR--GFKQFLTFI 59 (73)
Q Consensus 27 s~e~l~~L~~~T~Fs~~EIk~lYk--~FkkecPsG 59 (73)
+..-++.|.+..+|+.+.|..+|| .|+=+|+|=
T Consensus 111 ~~~~l~~L~e~ynf~~e~i~~lykyakfqyeCGNY 145 (432)
T KOG2758|consen 111 RVQNLQHLQEHYNFTPERIETLYKYAKFQYECGNY 145 (432)
T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHHHHhccCc
Confidence 345577899999999999999965 787789763
No 183
>cd05024 S-100A10 S-100A10: A subgroup of the S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A10 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=22.48 E-value=2.1e+02 Score=17.94 Aligned_cols=46 Identities=4% Similarity=-0.009 Sum_probs=32.0
Q ss_pred cCCCHHHHHHHHhc-------CCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 24 QKYRPIALEDLLKQ-------TKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 24 skls~e~l~~L~~~-------T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
.+|++.++..|.++ +.=++.-|.++.+.- -.+.+|.|+-.||...-
T Consensus 22 ~tLsk~Elk~Ll~~Elp~~l~~~~d~~~vd~im~~L-D~n~Dg~vdF~EF~~Lv 74 (91)
T cd05024 22 NYLNRDDLQKLMEKEFSEFLKNQNDPMAVDKIMKDL-DDCRDGKVGFQSFFSLI 74 (91)
T ss_pred CcCCHHHHHHHHHHHhHHHHcCCCCHHHHHHHHHHh-CCCCCCcCcHHHHHHHH
Confidence 37889998887632 122566677776654 34789999999997653
No 184
>PF12419 DUF3670: SNF2 Helicase protein ; InterPro: IPR022138 This domain family is found in bacteria, archaea and eukaryotes, and is approximately 140 amino acids in length. The family is found in association with PF00271 from PFAM, PF00176 from PFAM. Most of the proteins in this family are annotated as SNF2 helicases but there is little accompanying literature to confirm this.
Probab=22.29 E-value=2.4e+02 Score=18.44 Aligned_cols=59 Identities=14% Similarity=0.141 Sum_probs=42.5
Q ss_pred hhhhhhhhcccccccCCCHHHHHHHHhcC-----------CCCHHHHHHHHHHhhcccCCCCcCHHHHhh
Q psy12170 10 EEVVAFEIESSKHVQKYRPIALEDLLKQT-----------KFTRQEIRVMYRGFKQFLTFIGLNHPTLKD 68 (73)
Q Consensus 10 ~~~~~~~~e~~~~~skls~e~l~~L~~~T-----------~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~ 68 (73)
++.+.++.+...+-..++++++++|.+.. ..+.++|+++-+.+.+.-+.+.++..+-.+
T Consensus 68 ~~l~~f~W~lalGd~~Ls~eEf~~L~~~~~~LV~~rg~WV~ld~~~l~~~~~~~~~~~~~~~lt~~e~Lr 137 (141)
T PF12419_consen 68 DQLLDFDWELALGDEELSEEEFEQLVEQKRPLVRFRGRWVELDPEELRRALAFLEKAPKGEKLTLAEALR 137 (141)
T ss_pred HHHhcceEEEEECCEECCHHHHHHHHHcCCCeEEECCEEEEECHHHHHHHHHHHHhccccCCCCHHHHHH
Confidence 34445555666666689999999987663 369999999998888744433588777544
No 185
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.13 E-value=1.9e+02 Score=17.69 Aligned_cols=33 Identities=12% Similarity=0.146 Sum_probs=24.1
Q ss_pred cccCCCHHHHHHHHhcC--CCCHHHHHHHHHHhhc
Q psy12170 22 HVQKYRPIALEDLLKQT--KFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 22 ~~skls~e~l~~L~~~T--~Fs~~EIk~lYk~Fkk 54 (73)
.|+.++++.+..+..+- +-|++.|++.|+.-.+
T Consensus 35 ~NPpine~~iR~M~~qmGqKpSe~kI~Qvm~~i~k 69 (71)
T COG3763 35 DNPPINEEMIRMMMAQMGQKPSEKKINQVMRSIIK 69 (71)
T ss_pred hCCCCCHHHHHHHHHHhCCCchHHHHHHHHHHHHh
Confidence 45667788888755554 6788999999987554
No 186
>PF00612 IQ: IQ calmodulin-binding motif; InterPro: IPR000048 The IQ motif is an extremely basic unit of about 23 amino acids, whose conserved core usually fits the consensus A-x(3)-I-Q-x(2)-F-R-x(4)-K-K. The IQ motif, which can be present in one or more copies, serves as a binding site for different EF-hand proteins including the essential and regulatory myosin light chains, calmodulin (CaM), and CaM-like proteins [, ].Many IQ motifs are protein kinase C (PKC) phosphorylation sites [, ]. Resolution of the 3D structure of scallop myosin has shown that the IQ motif forms a basic amphipathic helix []. Some proteins known to contain an IQ motif are listed below: A number of conventional and unconventional myosins. Neuromodulin (GAP-43). This protein is associated with nerve growth. It is a major component of the motile "growth cones" that form the tips of elongating axons. Neurogranin (NG/p17). Acts as a "third messenger" substrate of protein kinase C-mediated molecular cascades during synaptic development and remodeling. Sperm surface protein Sp17. Ras GTPase-activating-like protein IQGAP1. IQGAP1 contains 4 IQ motifs. This entry covers the entire IQ motif.; GO: 0005515 protein binding; PDB: 2DFS_A 2IX7_C 1OE9_A 1W7J_A 1W7I_A 1KQM_A 1KK7_A 1WDC_A 1DFL_A 1B7T_A ....
Probab=21.98 E-value=73 Score=14.22 Aligned_cols=11 Identities=27% Similarity=0.600 Sum_probs=8.2
Q ss_pred HHHHHHHHhhc
Q psy12170 44 EIRVMYRGFKQ 54 (73)
Q Consensus 44 EIk~lYk~Fkk 54 (73)
-|+.++|+|+.
T Consensus 6 ~iQ~~~R~~~~ 16 (21)
T PF00612_consen 6 IIQSYWRGYLA 16 (21)
T ss_dssp HHHHHHHHHHH
T ss_pred HHHHHHHHHHH
Confidence 37888888864
No 187
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=21.87 E-value=1.2e+02 Score=16.05 Aligned_cols=22 Identities=18% Similarity=0.128 Sum_probs=17.0
Q ss_pred CCCHHHHHHHHHHhhccc-CCCC
Q psy12170 39 KFTRQEIRVMYRGFKQFL-TFIG 60 (73)
Q Consensus 39 ~Fs~~EIk~lYk~Fkkec-PsG~ 60 (73)
+.+..+|++-|++..+.+ |+-.
T Consensus 12 ~~~~~~ik~ay~~l~~~~HPD~~ 34 (60)
T smart00271 12 DASLDEIKKAYRKLALKYHPDKN 34 (60)
T ss_pred CCCHHHHHHHHHHHHHHHCcCCC
Confidence 478899999999988764 6543
No 188
>PRK12274 serine/threonine protein kinase; Provisional
Probab=21.80 E-value=1.2e+02 Score=21.96 Aligned_cols=25 Identities=12% Similarity=0.301 Sum_probs=20.8
Q ss_pred HHHHHHHHHHhhcccCCCCcCHHHHh
Q psy12170 42 RQEIRVMYRGFKQFLTFIGLNHPTLK 67 (73)
Q Consensus 42 ~~EIk~lYk~FkkecPsG~Lt~~EFk 67 (73)
..+|..+++-|++.|| +.||..|.+
T Consensus 154 ~rDl~~llk~~~~y~~-~~l~~~~~~ 178 (218)
T PRK12274 154 REDLRHLLKHKRMYCP-AALTPVERR 178 (218)
T ss_pred HHHHHHHHHHHHhcCC-CCCCHHHHh
Confidence 5889999999999898 588887754
No 189
>PF00767 Poty_coat: Potyvirus coat protein; InterPro: IPR001592 This protease is found in genome polyproteins of potyviruses. The genome polyprotein contains: N-terminal protein (P1), helper component protease (3.4.22 from EC, HC-PRO), protein P3, 6KD protein (6K1), cytoplasmic inclusion protein (CI), 6KD protein 2 (6K2), genome-linked protein (VPG), nuclear inclusion protein A (3.4.22 from EC), nuclear inclusion protein B (2.7.7.48 from EC) and coat protein (CP). The coat protein is at the C terminus of the polyprotein.; GO: 0019028 viral capsid
Probab=21.35 E-value=1.2e+02 Score=22.33 Aligned_cols=34 Identities=18% Similarity=0.264 Sum_probs=26.5
Q ss_pred hcCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhhcC
Q psy12170 36 KQTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNTNG 72 (73)
Q Consensus 36 ~~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~~~ 72 (73)
..|.-|..|+..||.+.+++.. ++.+++.-|.||
T Consensus 52 sNtrATq~Qf~~W~e~Vk~~y~---v~d~~m~iil~g 85 (237)
T PF00767_consen 52 SNTRATQSQFEAWYEAVKKDYG---VTDDQMQIILNG 85 (237)
T ss_pred hcccchHHHHHHHHHHhhhhcC---cChHHHHHHHHH
Confidence 3678899999999999998753 456677776665
No 190
>smart00015 IQ Short calmodulin-binding motif containing conserved Ile and Gln residues. Calmodulin-binding motif.
Probab=21.23 E-value=71 Score=14.92 Aligned_cols=11 Identities=27% Similarity=0.619 Sum_probs=8.6
Q ss_pred HHHHHHHHhhc
Q psy12170 44 EIRVMYRGFKQ 54 (73)
Q Consensus 44 EIk~lYk~Fkk 54 (73)
-|+.++|+|+.
T Consensus 8 ~IQa~~Rg~~~ 18 (26)
T smart00015 8 IIQAAWRGYLA 18 (26)
T ss_pred HHHHHHHHHHH
Confidence 47888888875
No 191
>PF00620 RhoGAP: RhoGAP domain; InterPro: IPR000198 Members of the Rho family of small G proteins transduce signals from plasma-membrane receptors and control cell adhesion, motility and shape by actin cytoskeleton formation. Like all other GTPases, Rho proteins act as molecular switches, with an active GTP-bound form and an inactive GDP-bound form. The active conformation is promoted by guanine-nucleotide exchange factors, and the inactive state by GTPase-activating proteins (GAPs) which stimulate the intrinsic GTPase activity of small G proteins. This entry is a Rho/Rac/Cdc42-like GAP domain, that is found in a wide variety of large, multi-functional proteins []. A number of structure are known for this family [, , ]. The domain is composed of seven alpha helices. This domain is also known as the breakpoint cluster region-homology (BH) domain.; GO: 0007165 signal transduction, 0005622 intracellular; PDB: 1RGP_A 1AM4_B 1GRN_B 2NGR_B 1OW3_A 1TX4_A 3BYI_B 1XA6_A 3FK2_B 1F7C_A ....
Probab=20.91 E-value=1.9e+02 Score=18.04 Aligned_cols=34 Identities=6% Similarity=0.051 Sum_probs=28.5
Q ss_pred cCCCCHHHHHHHHHHhhcccCCCCcCHHHHhhhh
Q psy12170 37 QTKFTRQEIRVMYRGFKQFLTFIGLNHPTLKDNT 70 (73)
Q Consensus 37 ~T~Fs~~EIk~lYk~FkkecPsG~Lt~~EFk~i~ 70 (73)
...++...+..+.|.|.++.|.+.++.+.+..+.
T Consensus 48 ~~~~~~~~va~~lK~~L~~lp~pli~~~~~~~~~ 81 (151)
T PF00620_consen 48 LENYDVHDVASLLKRFLRELPEPLIPSELYDKFI 81 (151)
T ss_dssp GTTSTHHHHHHHHHHHHHHSSSTSTTHHHHHHHH
T ss_pred ccccChhhccccceeeeeccccchhhhhHHHHHh
Confidence 3578899999999999999999999988766543
No 192
>TIGR03561 organ_hyd_perox peroxiredoxin, Ohr subfamily. Pfam model pfam02566, OsmC-like protein, contains several deeply split clades of homologous proteins. The clade modeled here includes the protein Ohr, or organic hydroperoxide resistance protein.
Probab=20.83 E-value=1.1e+02 Score=19.56 Aligned_cols=20 Identities=10% Similarity=0.217 Sum_probs=18.2
Q ss_pred CCCHHHHHHHHHHhhcccCC
Q psy12170 39 KFTRQEIRVMYRGFKQFLTF 58 (73)
Q Consensus 39 ~Fs~~EIk~lYk~FkkecPs 58 (73)
..+.+++++|...+.+.||=
T Consensus 101 ~~~~e~~~~l~~~a~~~CpV 120 (134)
T TIGR03561 101 GLDQAEAEALVEAAHQVCPY 120 (134)
T ss_pred CCCHHHHHHHHHHHhccCcc
Confidence 48999999999999999994
No 193
>KOG0384|consensus
Probab=20.67 E-value=72 Score=29.08 Aligned_cols=20 Identities=40% Similarity=0.712 Sum_probs=18.2
Q ss_pred cCCCCHHHHHHHHHHhhccc
Q psy12170 37 QTKFTRQEIRVMYRGFKQFL 56 (73)
Q Consensus 37 ~T~Fs~~EIk~lYk~Fkkec 56 (73)
..+|+++||+++||.+++.|
T Consensus 999 ~~~~~e~eir~~~ra~~kfg 1018 (1373)
T KOG0384|consen 999 KGGFTEKEIRRFYRAYLKFG 1018 (1373)
T ss_pred hcCCCHHHHHHHHHHHHHhc
Confidence 46899999999999999976
No 194
>COG3592 Uncharacterized conserved protein [Function unknown]
Probab=20.29 E-value=38 Score=20.94 Aligned_cols=20 Identities=15% Similarity=-0.006 Sum_probs=13.1
Q ss_pred CCCHHHHHHHHHHhhcccCCCCcC
Q psy12170 39 KFTRQEIRVMYRGFKQFLTFIGLN 62 (73)
Q Consensus 39 ~Fs~~EIk~lYk~FkkecPsG~Lt 62 (73)
+-+.++|++.- .-||||.|+
T Consensus 51 ~~~ve~i~~vi----~sCPSGAl~ 70 (74)
T COG3592 51 AVDVEEIVKVI----DTCPSGALK 70 (74)
T ss_pred CCCHHHHHHHH----HhCCchhhh
Confidence 34566666553 359999886
No 195
>PF06014 DUF910: Bacterial protein of unknown function (DUF910); InterPro: IPR009256 This family consists of several short bacterial proteins of unknown function.; PDB: 2NN4_A.
Probab=20.21 E-value=1.7e+02 Score=17.39 Aligned_cols=21 Identities=10% Similarity=0.153 Sum_probs=15.4
Q ss_pred CHHHHHHHHHHhhcccCCCCcCHHHHhh
Q psy12170 41 TRQEIRVMYRGFKQFLTFIGLNHPTLKD 68 (73)
Q Consensus 41 s~~EIk~lYk~FkkecPsG~Lt~~EFk~ 68 (73)
=+.||+.||+ +|.|++++|..
T Consensus 30 M~~El~~Ly~-------~~lidk~~y~~ 50 (62)
T PF06014_consen 30 MEIELKELYK-------SGLIDKKEYLT 50 (62)
T ss_dssp HHHHHHHHHH-------TTSS-HHHHHH
T ss_pred HHHHHHHHHH-------cCCCCHHHHHH
Confidence 3678888887 58888888865
No 196
>PF14591 AF0941-like: AF0941-like; PDB: 1YOZ_B.
Probab=20.14 E-value=73 Score=21.56 Aligned_cols=30 Identities=13% Similarity=0.076 Sum_probs=16.4
Q ss_pred CHHHHHHHHHHhhc----ccCCCCcCHHHHhhhh
Q psy12170 41 TRQEIRVMYRGFKQ----FLTFIGLNHPTLKDNT 70 (73)
Q Consensus 41 s~~EIk~lYk~Fkk----ecPsG~Lt~~EFk~i~ 70 (73)
++++|..+||.=.+ +.-+|.++.+|=++.+
T Consensus 34 ~~e~I~emFr~D~e~Il~~~~~Gdi~eEEA~~ll 67 (127)
T PF14591_consen 34 DKEEIEEMFRSDLEDILEDYKSGDIDEEEALQLL 67 (127)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHH
Confidence 66666666333222 2367888888766554
No 197
>TIGR03760 ICE_TraI_Pfluor integrating conjugative element relaxase, PFL_4751 family. Members of this protein family are the TraI putative relaxases required for transfer by a subclass of integrating conjugative elements (ICE) as found in Pseudomonas fluorescens Pf-5, and understood from study of two related ICE, SXT and R391. This model represents the N-terminal domain. Note that no homology is detected to the similarly named TraI relaxase of the F plasmid.
Probab=20.12 E-value=1e+02 Score=21.96 Aligned_cols=28 Identities=14% Similarity=0.222 Sum_probs=23.7
Q ss_pred CHHHHHHHHhcCCCCHHHHHHHHHHhhc
Q psy12170 27 RPIALEDLLKQTKFTRQEIRVMYRGFKQ 54 (73)
Q Consensus 27 s~e~l~~L~~~T~Fs~~EIk~lYk~Fkk 54 (73)
+.+-|++|.+.+.++..+++++|.....
T Consensus 18 r~~ll~~i~~~~~l~~~~~~~l~~~~l~ 45 (218)
T TIGR03760 18 RQQLLQQLWRNSSLSAAVYEQLYLQPLE 45 (218)
T ss_pred HHHHHHHHHHHcCCChHHHHHHHHHHHH
Confidence 4457889999999999999999977654
Done!