Query psy15328
Match_columns 76
No_of_seqs 100 out of 372
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 18:09:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15328.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15328hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0044|consensus 99.7 2.9E-18 6.3E-23 122.7 4.2 58 18-75 10-67 (193)
2 COG5126 FRQ1 Ca2+-binding prot 98.4 4E-07 8.7E-12 63.9 3.9 49 25-74 8-58 (160)
3 KOG0034|consensus 96.6 0.005 1.1E-07 44.0 4.8 40 23-62 15-61 (187)
4 cd00213 S-100 S-100: S-100 dom 93.6 0.15 3.2E-06 30.8 3.9 31 35-65 9-40 (88)
5 PF00036 EF-hand_1: EF hand; 92.9 0.15 3.3E-06 26.1 2.7 28 35-63 1-28 (29)
6 PTZ00183 centrin; Provisional 92.4 0.27 5.9E-06 31.4 4.0 39 26-64 6-46 (158)
7 PTZ00184 calmodulin; Provision 91.7 0.58 1.3E-05 29.3 4.9 35 30-64 4-40 (149)
8 PF13833 EF-hand_8: EF-hand do 91.4 0.97 2.1E-05 24.7 5.1 35 28-63 18-53 (54)
9 PF13405 EF-hand_6: EF-hand do 91.3 0.29 6.3E-06 24.6 2.6 27 35-62 1-27 (31)
10 KOG0038|consensus 89.6 0.77 1.7E-05 33.1 4.5 20 26-45 17-36 (189)
11 smart00054 EFh EF-hand, calciu 89.6 0.55 1.2E-05 20.8 2.6 28 35-63 1-28 (29)
12 smart00027 EH Eps15 homology d 88.4 1.3 2.7E-05 27.3 4.4 47 14-62 25-71 (96)
13 cd05029 S-100A6 S-100A6: S-100 87.9 3.2 7E-05 25.7 6.0 52 11-63 24-79 (88)
14 PF14658 EF-hand_9: EF-hand do 87.9 1.7 3.7E-05 26.6 4.6 43 21-63 20-64 (66)
15 cd05023 S-100A11 S-100A11: S-1 87.7 0.75 1.6E-05 28.8 3.0 34 34-67 6-43 (89)
16 smart00027 EH Eps15 homology d 86.6 4 8.6E-05 25.0 5.9 34 31-64 4-39 (96)
17 cd05029 S-100A6 S-100A6: S-100 86.6 1.1 2.3E-05 27.9 3.3 30 35-64 11-41 (88)
18 KOG0041|consensus 86.0 1.1 2.3E-05 33.7 3.5 36 30-65 92-129 (244)
19 cd05030 calgranulins Calgranul 84.4 3.4 7.5E-05 25.3 4.8 51 13-64 24-80 (88)
20 cd00052 EH Eps15 homology doma 84.3 3.9 8.5E-05 22.5 4.6 34 29-63 28-61 (67)
21 cd05030 calgranulins Calgranul 81.0 3.2 7E-05 25.5 3.7 18 48-65 23-40 (88)
22 cd05031 S-100A10_like S-100A10 80.3 7.9 0.00017 23.7 5.3 34 29-63 46-79 (94)
23 PF00446 GnRH: Gonadotropin-re 79.8 0.82 1.8E-05 18.9 0.5 9 3-11 2-10 (10)
24 PF13499 EF-hand_7: EF-hand do 77.7 2.8 6E-05 23.5 2.5 19 47-65 12-30 (66)
25 KOG0027|consensus 77.5 5.4 0.00012 26.3 4.2 34 32-66 83-116 (151)
26 cd05026 S-100Z S-100Z: S-100Z 74.3 15 0.00033 22.6 5.4 33 31-64 50-82 (93)
27 PF11569 Homez: Homeodomain le 73.3 3.7 8E-05 24.5 2.3 22 22-43 28-49 (56)
28 cd00213 S-100 S-100: S-100 dom 72.9 10 0.00022 22.5 4.3 32 31-63 48-79 (88)
29 PRK04387 hypothetical protein; 72.7 7.6 0.00017 25.2 3.8 33 28-64 31-63 (90)
30 cd05027 S-100B S-100B: S-100B 72.4 4.5 9.7E-05 25.1 2.6 33 34-66 8-41 (88)
31 cd05031 S-100A10_like S-100A10 72.4 5.5 0.00012 24.4 3.0 31 33-63 7-38 (94)
32 cd00086 homeodomain Homeodomai 71.8 4.9 0.00011 21.8 2.5 24 20-43 28-51 (59)
33 cd05026 S-100Z S-100Z: S-100Z 71.1 6.6 0.00014 24.3 3.2 31 34-64 10-41 (93)
34 cd00051 EFh EF-hand, calcium b 70.6 12 0.00027 18.9 5.4 32 29-61 31-62 (63)
35 PTZ00183 centrin; Provisional 69.4 11 0.00023 23.9 4.0 33 30-63 122-154 (158)
36 PF12174 RST: RCD1-SRO-TAF4 (R 69.1 9 0.00019 23.4 3.4 45 14-62 7-52 (70)
37 PF09279 EF-hand_like: Phospho 67.6 7.8 0.00017 23.0 2.9 26 35-62 1-26 (83)
38 cd05025 S-100A1 S-100A1: S-100 67.4 15 0.00033 22.2 4.3 33 30-63 48-80 (92)
39 cd05022 S-100A13 S-100A13: S-1 67.2 9.7 0.00021 23.9 3.4 29 35-63 9-37 (89)
40 KOG0027|consensus 65.9 19 0.00042 23.6 4.8 32 31-63 41-72 (151)
41 cd05022 S-100A13 S-100A13: S-1 65.9 23 0.00049 22.2 4.9 52 11-63 21-75 (89)
42 PF12872 OST-HTH: OST-HTH/LOTU 65.8 12 0.00025 21.5 3.4 25 49-73 21-45 (74)
43 cd05025 S-100A1 S-100A1: S-100 64.7 14 0.0003 22.4 3.6 30 34-63 9-39 (92)
44 KOG0488|consensus 64.1 4.6 0.0001 30.9 1.7 23 23-45 171-193 (309)
45 PLN02964 phosphatidylserine de 63.7 18 0.0004 30.5 5.2 31 19-50 124-156 (644)
46 PF09966 DUF2200: Uncharacteri 63.5 7.9 0.00017 26.1 2.5 32 17-48 24-61 (111)
47 smart00389 HOX Homeodomain. DN 63.5 8.8 0.00019 20.8 2.4 25 18-42 26-50 (56)
48 KOG0486|consensus 63.2 6.8 0.00015 31.0 2.5 39 18-56 106-144 (351)
49 PF09851 SHOCT: Short C-termin 63.0 19 0.00042 18.5 3.5 25 34-62 3-27 (31)
50 PF03941 INCENP_ARK-bind: Inne 62.8 4 8.6E-05 23.7 0.9 25 12-36 18-42 (57)
51 COG4875 Uncharacterized protei 61.8 5.8 0.00013 27.9 1.7 43 14-56 61-103 (156)
52 KOG0028|consensus 59.8 16 0.00035 26.3 3.8 35 32-67 104-138 (172)
53 cd05023 S-100A11 S-100A11: S-1 59.7 23 0.0005 21.9 4.1 33 30-63 48-80 (89)
54 PF13202 EF-hand_5: EF hand; P 59.2 9.2 0.0002 18.6 1.8 15 46-60 10-24 (25)
55 PF01023 S_100: S-100/ICaBP ty 58.6 25 0.00054 19.4 3.7 32 36-67 8-40 (44)
56 PF05193 Peptidase_M16_C: Pept 58.3 6.1 0.00013 24.7 1.3 36 30-65 1-41 (184)
57 cd05027 S-100B S-100B: S-100B 57.4 43 0.00093 20.6 5.6 34 29-63 46-79 (88)
58 PF07914 DUF1679: Protein of u 56.8 6.6 0.00014 30.8 1.5 36 33-68 328-366 (414)
59 PF15187 Augurin: Oesophageal 55.9 11 0.00023 25.5 2.2 25 21-45 35-59 (114)
60 TIGR01669 phage_XkdX phage unc 55.2 28 0.00061 19.5 3.6 18 46-63 26-43 (45)
61 KOG0046|consensus 55.0 16 0.00034 30.9 3.4 39 26-64 8-47 (627)
62 cd04790 HTH_Cfa-like_unk Helix 52.5 11 0.00023 26.2 1.8 28 24-51 123-150 (172)
63 PF05920 Homeobox_KN: Homeobox 52.5 14 0.0003 20.1 1.9 22 22-43 16-37 (40)
64 PF09693 Phage_XkdX: Phage unc 51.7 11 0.00024 20.5 1.5 19 45-63 20-38 (40)
65 PF13384 HTH_23: Homeodomain-l 51.5 9.9 0.00021 20.4 1.3 29 23-51 21-49 (50)
66 PF05256 UPF0223: Uncharacteri 51.0 9.9 0.00021 24.5 1.3 34 28-65 31-64 (88)
67 PF14788 EF-hand_10: EF hand; 50.7 29 0.00063 20.3 3.2 33 31-64 18-50 (51)
68 PF05280 FlhC: Flagellar trans 49.4 15 0.00032 26.0 2.2 30 24-53 28-58 (175)
69 PF12674 Zn_ribbon_2: Putative 49.3 48 0.001 20.6 4.3 37 30-68 41-77 (81)
70 PRK12722 transcriptional activ 49.3 22 0.00048 25.7 3.1 31 23-53 27-58 (187)
71 PF00620 RhoGAP: RhoGAP domain 49.1 54 0.0012 20.8 4.7 36 27-62 47-82 (151)
72 KOG0031|consensus 49.1 36 0.00079 24.5 4.1 35 31-65 26-62 (171)
73 PF00046 Homeobox: Homeobox do 47.8 24 0.00052 19.3 2.5 23 20-42 28-50 (57)
74 PF08707 PriCT_2: Primase C te 47.3 45 0.00096 19.9 3.8 28 28-58 48-75 (78)
75 PLN02964 phosphatidylserine de 47.2 33 0.00072 29.0 4.2 33 30-63 211-243 (644)
76 PF03871 RNA_pol_Rpb5_N: RNA p 46.0 26 0.00057 22.0 2.7 32 33-64 4-47 (93)
77 cd00159 RhoGAP RhoGAP: GTPase- 45.9 55 0.0012 21.0 4.3 35 29-63 48-82 (169)
78 PF03732 Retrotrans_gag: Retro 45.5 41 0.00089 19.3 3.4 33 34-66 9-41 (96)
79 COG5126 FRQ1 Ca2+-binding prot 45.0 33 0.00072 24.1 3.3 44 19-63 112-156 (160)
80 KOG0037|consensus 44.4 37 0.00081 25.3 3.7 19 46-64 135-153 (221)
81 PF08667 BetR: BetR domain; I 43.7 61 0.0013 22.6 4.5 38 19-63 37-74 (147)
82 COG4476 Uncharacterized protei 42.3 29 0.00062 22.6 2.5 32 29-64 32-63 (90)
83 PHA02970 hypothetical protein; 42.0 27 0.00059 23.5 2.4 29 20-48 26-54 (115)
84 PF09570 RE_SinI: SinI restric 41.4 28 0.00061 26.0 2.6 30 35-64 177-217 (221)
85 PF13518 HTH_28: Helix-turn-he 41.0 24 0.00052 18.7 1.7 30 23-52 16-45 (52)
86 PF09821 AAA_assoc_C: C-termin 40.7 42 0.00091 22.2 3.2 35 37-71 66-100 (120)
87 PRK12461 UDP-N-acetylglucosami 40.7 21 0.00045 26.2 1.9 36 28-63 197-232 (255)
88 cd00252 SPARC_EC SPARC_EC; ext 39.8 56 0.0012 21.4 3.7 13 48-60 61-73 (116)
89 PF02885 Glycos_trans_3N: Glyc 39.7 76 0.0017 18.3 5.0 38 19-60 3-40 (66)
90 TIGR03738 PRTRC_C PRTRC system 39.3 34 0.00073 21.1 2.4 22 46-67 17-38 (66)
91 PRK00095 mutL DNA mismatch rep 38.1 57 0.0012 26.9 4.2 39 25-63 568-616 (617)
92 PRK00915 2-isopropylmalate syn 37.3 98 0.0021 25.0 5.3 38 29-66 346-384 (513)
93 KOG0031|consensus 37.3 64 0.0014 23.3 3.8 35 27-62 130-164 (171)
94 PF02257 RFX_DNA_binding: RFX 37.1 68 0.0015 20.3 3.6 38 30-67 23-60 (85)
95 PHA03102 Small T antigen; Revi 37.0 1E+02 0.0022 21.4 4.8 40 21-60 7-49 (153)
96 smart00411 BHL bacterial (prok 36.6 58 0.0013 19.4 3.2 27 19-45 4-30 (90)
97 PF00220 Hormone_4: Neurohypop 36.6 15 0.00034 14.6 0.4 8 43-50 2-9 (9)
98 PF13950 Epimerase_Csub: UDP-g 36.2 77 0.0017 18.5 3.5 40 12-51 23-62 (62)
99 smart00324 RhoGAP GTPase-activ 35.9 98 0.0021 20.3 4.4 35 29-63 52-86 (174)
100 PF01465 GRIP: GRIP domain; I 35.6 55 0.0012 18.2 2.7 27 14-40 20-46 (46)
101 cd00591 HU_IHF Integration hos 35.3 58 0.0013 19.2 3.0 27 19-45 3-29 (87)
102 PF00216 Bac_DNA_binding: Bact 34.6 53 0.0012 19.4 2.7 28 18-45 3-30 (90)
103 PRK14561 hypothetical protein; 34.6 35 0.00076 23.8 2.2 29 30-58 133-163 (194)
104 COG4397 Mu-like prophage major 34.4 41 0.00089 25.9 2.6 45 29-73 2-47 (308)
105 TIGR01889 Staph_reg_Sar staphy 34.0 1.2E+02 0.0026 18.9 4.9 46 18-63 10-55 (109)
106 PF04857 CAF1: CAF1 family rib 33.5 65 0.0014 23.3 3.5 35 30-67 155-189 (262)
107 PF08356 EF_assoc_2: EF hand a 33.5 1.2E+02 0.0026 19.4 4.3 36 29-64 2-41 (89)
108 PRK12860 transcriptional activ 33.3 56 0.0012 23.7 3.1 31 23-53 27-58 (189)
109 PF00226 DnaJ: DnaJ domain; I 32.9 85 0.0018 17.3 3.3 24 28-51 9-32 (64)
110 PF15658 Latrotoxin_C: Latroto 32.7 86 0.0019 21.6 3.8 30 35-64 87-117 (127)
111 KOG4094|consensus 32.2 53 0.0011 23.6 2.7 20 27-46 120-139 (178)
112 TIGR00973 leuA_bact 2-isopropy 31.6 1E+02 0.0022 24.9 4.5 39 28-66 342-381 (494)
113 PF06902 Fer4_19: Divergent 4F 31.5 19 0.00042 21.4 0.4 11 44-54 51-61 (64)
114 PRK10664 transcriptional regul 30.7 68 0.0015 19.9 2.8 27 19-45 4-30 (90)
115 TIGR03760 ICE_TraI_Pfluor inte 30.6 50 0.0011 23.9 2.5 26 20-45 20-45 (218)
116 PF08763 Ca_chan_IQ: Voltage g 30.5 38 0.00082 18.5 1.4 10 36-45 15-24 (35)
117 PLN03228 methylthioalkylmalate 30.3 1.3E+02 0.0029 24.6 5.1 36 28-63 440-476 (503)
118 PF13174 TPR_6: Tetratricopept 29.9 65 0.0014 14.9 2.1 17 33-49 16-32 (33)
119 TIGR02688 conserved hypothetic 29.8 84 0.0018 25.7 3.8 31 31-61 399-430 (449)
120 PRK09389 (R)-citramalate synth 29.8 1.1E+02 0.0023 24.7 4.4 37 29-65 332-369 (488)
121 TIGR02675 tape_meas_nterm tape 29.2 83 0.0018 18.9 2.9 38 26-63 5-42 (75)
122 TIGR02653 Lon_rel_chp conserve 29.0 93 0.002 26.7 4.1 31 31-61 406-437 (675)
123 PRK08402 replication factor A; 28.7 1.6E+02 0.0035 22.9 5.2 42 19-63 7-48 (355)
124 PF02209 VHP: Villin headpiece 28.5 40 0.00086 18.2 1.3 12 52-63 2-13 (36)
125 PF13720 Acetyltransf_11: Udp 28.4 1.3E+02 0.0028 18.5 3.8 37 28-65 25-61 (83)
126 PF09129 Chol_subst-bind: Chol 28.2 38 0.00083 26.5 1.6 24 3-26 258-282 (321)
127 COG3592 Uncharacterized conser 28.1 52 0.0011 20.7 1.9 33 8-54 39-71 (74)
128 KOG1707|consensus 27.9 1.2E+02 0.0025 26.0 4.4 40 26-65 304-345 (625)
129 PF04282 DUF438: Family of unk 27.7 79 0.0017 19.4 2.7 19 21-40 37-55 (71)
130 KOG0384|consensus 27.2 50 0.0011 30.5 2.3 38 10-47 981-1018(1373)
131 cd07971 OBF_DNA_ligase_LigD Th 27.0 62 0.0014 20.5 2.2 20 26-45 38-57 (115)
132 PF14410 GH-E: HNH/ENDO VII su 26.7 63 0.0014 19.5 2.1 24 29-52 39-65 (70)
133 PF01785 Closter_coat: Closter 26.6 69 0.0015 22.4 2.6 39 26-64 8-47 (188)
134 KOG0844|consensus 26.1 41 0.00088 26.9 1.4 22 26-47 183-204 (408)
135 KOG0034|consensus 26.0 1.1E+02 0.0024 21.7 3.6 18 52-69 84-101 (187)
136 PF06014 DUF910: Bacterial pro 25.8 1.1E+02 0.0025 18.4 3.1 23 31-60 29-51 (62)
137 KOG3218|consensus 25.8 70 0.0015 23.7 2.5 35 33-67 6-52 (208)
138 cd04402 RhoGAP_ARHGAP20 RhoGAP 25.7 1.8E+02 0.0038 20.0 4.5 35 29-63 62-96 (192)
139 cd05071 PTKc_Src Catalytic dom 25.7 1.4E+02 0.003 20.2 3.9 33 34-66 230-262 (262)
140 PF12763 EF-hand_4: Cytoskelet 25.6 51 0.0011 21.2 1.6 33 31-63 4-37 (104)
141 PRK11858 aksA trans-homoaconit 25.4 98 0.0021 23.8 3.4 38 28-65 333-372 (378)
142 cd04377 RhoGAP_myosin_IX RhoGA 25.4 1.8E+02 0.0039 19.9 4.4 35 29-63 63-97 (186)
143 PHA02053 hypothetical protein 25.1 78 0.0017 21.3 2.4 27 38-64 7-33 (115)
144 TIGR02110 PQQ_syn_pqqF coenzym 24.7 81 0.0018 26.8 3.0 47 17-65 156-207 (696)
145 PF02256 Fe_hyd_SSU: Iron hydr 24.7 58 0.0013 18.9 1.6 15 32-46 28-42 (60)
146 smart00153 VHP Villin headpiec 24.4 55 0.0012 17.6 1.4 12 52-63 2-13 (36)
147 PF09808 SNAPc_SNAP43: Small n 23.8 1.5E+02 0.0032 20.6 3.8 29 33-64 3-31 (194)
148 KOG4347|consensus 23.8 84 0.0018 27.0 2.9 29 20-48 487-515 (671)
149 PRK13662 hypothetical protein; 23.6 98 0.0021 22.0 2.9 30 16-45 143-173 (177)
150 PF03471 CorC_HlyC: Transporte 23.5 65 0.0014 19.1 1.7 25 49-73 11-35 (81)
151 PF05120 GvpG: Gas vesicle pro 23.3 1.6E+02 0.0035 18.4 3.6 23 34-60 35-57 (79)
152 COG4306 Uncharacterized protei 23.2 1.4E+02 0.003 21.0 3.5 41 23-63 90-133 (160)
153 PF05263 DUF722: Protein of un 23.2 81 0.0018 21.4 2.3 22 25-46 105-126 (130)
154 PF08672 APC2: Anaphase promot 23.0 1.7E+02 0.0037 17.1 3.5 41 23-63 4-44 (60)
155 KOG2670|consensus 22.9 73 0.0016 25.8 2.3 20 30-49 272-291 (433)
156 PF01320 Colicin_Pyocin: Colic 22.9 1.3E+02 0.0027 19.2 3.0 31 21-51 18-50 (85)
157 PTZ00100 DnaJ chaperone protei 22.9 2.4E+02 0.0052 18.8 4.6 38 24-61 70-108 (116)
158 KOG0037|consensus 22.8 2.7E+02 0.0059 20.8 5.1 46 17-63 75-122 (221)
159 PF09373 PMBR: Pseudomurein-bi 22.7 76 0.0016 16.3 1.7 15 49-63 2-16 (33)
160 PRK05289 UDP-N-acetylglucosami 22.6 57 0.0012 23.7 1.6 37 28-64 201-237 (262)
161 PRK10753 transcriptional regul 22.4 1.2E+02 0.0026 18.7 2.8 27 19-45 4-30 (90)
162 PTZ00232 variable surface prot 22.4 1E+02 0.0023 24.1 3.1 31 34-64 167-200 (363)
163 cd06257 DnaJ DnaJ domain or J- 22.1 1.4E+02 0.003 15.7 3.8 20 29-48 10-29 (55)
164 PF14550 Peptidase_U35_2: Puta 21.9 1E+02 0.0022 20.7 2.6 24 28-51 32-55 (122)
165 PTZ00230 variable surface prot 21.9 1.1E+02 0.0023 24.0 3.1 33 34-66 164-199 (364)
166 PF07553 Lipoprotein_Ltp: Host 21.8 1.1E+02 0.0024 17.4 2.3 19 19-37 22-42 (48)
167 PF01614 IclR: Bacterial trans 21.8 2E+02 0.0043 17.7 3.8 41 16-62 19-61 (129)
168 TIGR00987 himA integration hos 21.8 1.3E+02 0.0029 18.4 3.0 27 19-45 5-31 (96)
169 PF13373 DUF2407_C: DUF2407 C- 21.6 88 0.0019 21.3 2.3 21 28-48 8-28 (140)
170 PF13758 Prefoldin_3: Prefoldi 21.5 2.4E+02 0.0051 18.6 4.2 40 20-63 16-58 (99)
171 KOG0484|consensus 21.5 1.2E+02 0.0027 20.6 2.9 33 24-57 17-50 (125)
172 cd07967 OBF_DNA_ligase_III The 21.4 91 0.002 20.9 2.3 20 26-45 43-62 (139)
173 PF14454 Prok_Ub: Prokaryotic 21.4 1E+02 0.0022 18.7 2.3 22 46-67 18-39 (65)
174 PF05603 DUF775: Protein of un 21.3 68 0.0015 23.0 1.7 18 29-46 177-194 (202)
175 PRK00285 ihfA integration host 20.7 1.5E+02 0.0033 18.2 3.1 27 19-45 6-32 (99)
176 COG1043 LpxA Acyl-[acyl carrie 20.7 96 0.0021 23.7 2.5 37 27-63 201-237 (260)
177 PF09820 AAA-ATPase_like: Pred 20.6 1.5E+02 0.0033 21.7 3.5 40 24-63 218-260 (284)
178 PF13011 LZ_Tnp_IS481: leucine 20.4 1.2E+02 0.0025 19.4 2.5 29 26-54 32-60 (85)
179 TIGR01565 homeo_ZF_HD homeobox 20.3 1.2E+02 0.0026 17.8 2.4 20 26-45 3-22 (58)
180 smart00386 HAT HAT (Half-A-TPR 20.2 1.1E+02 0.0023 13.7 2.2 16 33-48 3-18 (33)
181 PF11508 DUF3218: Protein of u 20.1 66 0.0014 23.7 1.4 38 35-72 11-56 (213)
182 PF05347 Complex1_LYR: Complex 20.1 1.3E+02 0.0027 16.5 2.4 30 35-65 2-34 (59)
183 PF00767 Poty_coat: Potyvirus 20.1 1.9E+02 0.0041 21.5 3.9 37 23-62 48-84 (237)
184 PF04994 TfoX_C: TfoX C-termin 20.0 1.2E+02 0.0027 18.5 2.5 33 16-48 10-47 (81)
No 1
>KOG0044|consensus
Probab=99.73 E-value=2.9e-18 Score=122.66 Aligned_cols=58 Identities=43% Similarity=0.878 Sum_probs=55.0
Q ss_pred hhHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCCCCCCCCccc
Q psy15328 18 GTKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFPYGDASKYSK 75 (76)
Q Consensus 18 ~~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP~Gda~~fa~ 75 (76)
++...+++.+.|+|+++||++|||+|+++||+|.++.++|+.||+++||.|+++.|++
T Consensus 10 ~~~~~e~l~~~t~f~~~ei~~~Yr~Fk~~cP~G~~~~~~F~~i~~~~fp~gd~~~y~~ 67 (193)
T KOG0044|consen 10 QPESLEQLVQQTKFSKKEIQQWYRGFKNECPSGRLTLEEFREIYASFFPDGDASKYAE 67 (193)
T ss_pred CcHHHHHHHHhcCCCHHHHHHHHHHhcccCCCCccCHHHHHHHHHHHCCCCCHHHHHH
Confidence 4667889999999999999999999999999999999999999999999999999875
No 2
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=98.37 E-value=4e-07 Score=63.91 Aligned_cols=49 Identities=16% Similarity=0.084 Sum_probs=44.5
Q ss_pred hhhcCCCCHHHHHHHHHHHhhhCC--CCCCCHHHHHHHHhhcCCCCCCCCcc
Q psy15328 25 REGRDGRRDAEIQEWYKGFLKDCP--TGHLSVEEFKKIYGNFFPYGDASKYS 74 (76)
Q Consensus 25 l~~~T~Fs~~EI~~wYr~FkkecP--sG~it~~eFk~iy~~~FP~Gda~~fa 74 (76)
+...|+|+++||++||++|...|| +|.|++++|..||+ ++|.+.++.++
T Consensus 8 ~~~~~~~t~~qi~~lkeaF~l~D~d~~G~I~~~el~~ilr-~lg~~~s~~ei 58 (160)
T COG5126 8 LLTFTQLTEEQIQELKEAFQLFDRDSDGLIDRNELGKILR-SLGFNPSEAEI 58 (160)
T ss_pred hhhcccCCHHHHHHHHHHHHHhCcCCCCCCcHHHHHHHHH-HcCCCCcHHHH
Confidence 678899999999999999999999 99999999999999 78887766654
No 3
>KOG0034|consensus
Probab=96.56 E-value=0.005 Score=43.96 Aligned_cols=40 Identities=30% Similarity=0.284 Sum_probs=32.7
Q ss_pred hhhhhcCC----CCHHHHHHHHHHHhhhCC---CCCCCHHHHHHHHh
Q psy15328 23 RGREGRDG----RRDAEIQEWYKGFLKDCP---TGHLSVEEFKKIYG 62 (76)
Q Consensus 23 ~~l~~~T~----Fs~~EI~~wYr~FkkecP---sG~it~~eFk~iy~ 62 (76)
+.+...|+ |+..||..||.+|++-++ .|.|+.+||..|+.
T Consensus 15 ~~~~~~~~~~~~fs~~EI~~L~~rF~kl~~~~~~g~lt~eef~~i~~ 61 (187)
T KOG0034|consen 15 EELQMYTGDPTQFSANEIERLYERFKKLDRNNGDGYLTKEEFLSIPE 61 (187)
T ss_pred HHHHhccCCCcccCHHHHHHHHHHHHHhccccccCccCHHHHHHHHH
Confidence 45577788 999999999999999444 47889999887763
No 4
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=93.60 E-value=0.15 Score=30.81 Aligned_cols=31 Identities=16% Similarity=0.245 Sum_probs=22.2
Q ss_pred HHHHHHHHHhhh-CCCCCCCHHHHHHHHhhcC
Q psy15328 35 EIQEWYKGFLKD-CPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 35 EI~~wYr~Fkke-cPsG~it~~eFk~iy~~~F 65 (76)
+++..|+.|=+. +++|.|+.++|+.++...+
T Consensus 9 ~l~~~F~~~D~~~~~~G~Is~~el~~~l~~~~ 40 (88)
T cd00213 9 TIIDVFHKYSGKEGDKDTLSKKELKELLETEL 40 (88)
T ss_pred HHHHHHHHHhhccCCCCcCcHHHHHHHHHHHh
Confidence 344445555443 8999999999999997643
No 5
>PF00036 EF-hand_1: EF hand; InterPro: IPR018248 Many calcium-binding proteins belong to the same evolutionary family and share a type of calcium-binding domain known as the EF-hand. This type of domain consists of a twelve residue loop flanked on both sides by a twelve residue alpha-helical domain. In an EF-hand loop the calcium ion is coordinated in a pentagonal bipyramidal configuration. The six residues involved in the binding are in positions 1, 3, 5, 7, 9 and 12; these residues are denoted by X, Y, Z, -Y, -X and -Z. The invariant Glu or Asp at position 12 provides two oxygens for liganding Ca (bidentate ligand).; PDB: 1BJF_A 1XFW_R 1XFV_O 2K0J_A 2F3Z_A 3BYA_A 1XFU_Q 2R28_B 1ZOT_B 3G43_D ....
Probab=92.85 E-value=0.15 Score=26.09 Aligned_cols=28 Identities=29% Similarity=0.534 Sum_probs=22.5
Q ss_pred HHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 35 EIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 35 EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
||+++++.|= ...+|.|+.+||+.+...
T Consensus 1 E~~~~F~~~D-~d~dG~I~~~Ef~~~~~~ 28 (29)
T PF00036_consen 1 ELKEAFREFD-KDGDGKIDFEEFKEMMKK 28 (29)
T ss_dssp HHHHHHHHHS-TTSSSEEEHHHHHHHHHH
T ss_pred CHHHHHHHHC-CCCCCcCCHHHHHHHHHh
Confidence 5777777774 556899999999998765
No 6
>PTZ00183 centrin; Provisional
Probab=92.36 E-value=0.27 Score=31.40 Aligned_cols=39 Identities=18% Similarity=0.198 Sum_probs=30.6
Q ss_pred hhcCCCCHHHHHHHHHHHhhh--CCCCCCCHHHHHHHHhhc
Q psy15328 26 EGRDGRRDAEIQEWYKGFLKD--CPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkke--cPsG~it~~eFk~iy~~~ 64 (76)
.+.++++..+++++.+.|..- ..+|.|+..+|..+....
T Consensus 6 ~~~~~~~~~~~~~~~~~F~~~D~~~~G~i~~~e~~~~l~~~ 46 (158)
T PTZ00183 6 SERPGLTEDQKKEIREAFDLFDTDGSGTIDPKELKVAMRSL 46 (158)
T ss_pred cccCCCCHHHHHHHHHHHHHhCCCCCCcccHHHHHHHHHHh
Confidence 466788888888888888873 447889999998887754
No 7
>PTZ00184 calmodulin; Provisional
Probab=91.72 E-value=0.58 Score=29.30 Aligned_cols=35 Identities=14% Similarity=0.300 Sum_probs=23.1
Q ss_pred CCCHHHHHHHHHHHhh--hCCCCCCCHHHHHHHHhhc
Q psy15328 30 GRRDAEIQEWYKGFLK--DCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 30 ~Fs~~EI~~wYr~Fkk--ecPsG~it~~eFk~iy~~~ 64 (76)
.+++++++++++.|+. ...+|.|+.++|..+....
T Consensus 4 ~~~~~~~~~~~~~F~~~D~~~~G~i~~~e~~~~l~~~ 40 (149)
T PTZ00184 4 QLTEEQIAEFKEAFSLFDKDGDGTITTKELGTVMRSL 40 (149)
T ss_pred ccCHHHHHHHHHHHHHHcCCCCCcCCHHHHHHHHHHh
Confidence 3566777777777766 3456777777777666553
No 8
>PF13833 EF-hand_8: EF-hand domain pair; PDB: 3KF9_A 1TTX_A 1WLZ_A 1ALV_A 1NX3_A 1ALW_A 1NX2_A 1NX1_A 1NX0_A 1DF0_A ....
Probab=91.36 E-value=0.97 Score=24.70 Aligned_cols=35 Identities=17% Similarity=0.281 Sum_probs=28.9
Q ss_pred cCC-CCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 28 RDG-RRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 28 ~T~-Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
... ++++|++.+++.|= ...+|.|+.+||..++..
T Consensus 18 g~~~~s~~e~~~l~~~~D-~~~~G~I~~~EF~~~~~~ 53 (54)
T PF13833_consen 18 GIKDLSEEEVDRLFREFD-TDGDGYISFDEFISMMQR 53 (54)
T ss_dssp TSSSSCHHHHHHHHHHHT-TSSSSSEEHHHHHHHHHH
T ss_pred CCCCCCHHHHHHHHHhcc-cCCCCCCCHHHHHHHHHh
Confidence 345 99999999999985 456899999999988653
No 9
>PF13405 EF-hand_6: EF-hand domain; PDB: 2AMI_A 3QRX_A 1W7J_B 1OE9_B 1W7I_B 1KFU_S 1KFX_S 2BL0_B 1Y1X_B 3MSE_B ....
Probab=91.30 E-value=0.29 Score=24.60 Aligned_cols=27 Identities=26% Similarity=0.459 Sum_probs=21.1
Q ss_pred HHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 35 EIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 35 EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
|++++|+.| -...+|.|+.+||+.++.
T Consensus 1 ~l~~~F~~~-D~d~dG~I~~~el~~~l~ 27 (31)
T PF13405_consen 1 RLREAFKMF-DKDGDGFIDFEELRAILR 27 (31)
T ss_dssp HHHHHHHHH--TTSSSEEEHHHHHHHHH
T ss_pred CHHHHHHHH-CCCCCCcCcHHHHHHHHH
Confidence 456666666 366899999999999987
No 10
>KOG0038|consensus
Probab=89.64 E-value=0.77 Score=33.11 Aligned_cols=20 Identities=15% Similarity=0.057 Sum_probs=18.6
Q ss_pred hhcCCCCHHHHHHHHHHHhh
Q psy15328 26 EGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkk 45 (76)
+.+|-|+++||-++|++|..
T Consensus 17 QDCTFFtrKdIlrl~~Rf~~ 36 (189)
T KOG0038|consen 17 QDCTFFTRKDILRLHKRFYE 36 (189)
T ss_pred cccccccHHHHHHHHHHHHH
Confidence 57999999999999999986
No 11
>smart00054 EFh EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Probab=89.57 E-value=0.55 Score=20.84 Aligned_cols=28 Identities=32% Similarity=0.508 Sum_probs=20.6
Q ss_pred HHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 35 EIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 35 EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
|++++++.|-. ..+|.|+..+|..++..
T Consensus 1 ~~~~~f~~~d~-~~~g~i~~~e~~~~~~~ 28 (29)
T smart00054 1 ELKEAFRLFDK-DGDGKIDFEEFKDLLKA 28 (29)
T ss_pred CHHHHHHHHCC-CCCCcEeHHHHHHHHHh
Confidence 45667766644 45789999999988764
No 12
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=88.40 E-value=1.3 Score=27.31 Aligned_cols=47 Identities=15% Similarity=0.077 Sum_probs=24.9
Q ss_pred cccchhHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 14 GWNSGTKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 14 ~~~~~~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
|.-+-..|.+-+.. .+++.+++.++.+.+-. .-+|.|+.+||..+..
T Consensus 25 G~Is~~el~~~l~~-~~~~~~ev~~i~~~~d~-~~~g~I~~~eF~~~~~ 71 (96)
T smart00027 25 GTVTGAQAKPILLK-SGLPQTLLAKIWNLADI-DNDGELDKDEFALAMH 71 (96)
T ss_pred CeEeHHHHHHHHHH-cCCCHHHHHHHHHHhcC-CCCCCcCHHHHHHHHH
Confidence 33344444444333 35666666666665532 2356666666666554
No 13
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=87.93 E-value=3.2 Score=25.75 Aligned_cols=52 Identities=19% Similarity=0.218 Sum_probs=36.9
Q ss_pred CCccccchhHHHhhhh----hcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 11 GAKGWNSGTKLERGRE----GRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 11 ~~~~~~~~~~~~~~l~----~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
|.+|.-+..-|.+-|. ....++++||.++.+.. ....+|.|+.+||..+...
T Consensus 24 ~~~g~Is~~EL~~~l~~~~~lg~k~t~~ev~~m~~~~-D~d~dG~Idf~EFv~lm~~ 79 (88)
T cd05029 24 GDKNTLSKKELKELIQKELTIGSKLQDAEIAKLMEDL-DRNKDQEVNFQEYVTFLGA 79 (88)
T ss_pred CCCCEECHHHHHHHHHHHHhcCCCCCHHHHHHHHHHh-cCCCCCCCcHHHHHHHHHH
Confidence 3455556666666553 35668999999998877 4577799999999776543
No 14
>PF14658 EF-hand_9: EF-hand domain
Probab=87.89 E-value=1.7 Score=26.62 Aligned_cols=43 Identities=16% Similarity=0.254 Sum_probs=33.3
Q ss_pred HHhhhhhcCC--CCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 21 LERGREGRDG--RRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 21 ~~~~l~~~T~--Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
|..-|.+.|+ .++.||+.|++..=-++-.|.|+.+.|..|..+
T Consensus 20 l~~~Lra~~~~~p~e~~Lq~l~~elDP~g~~~~v~~d~F~~iM~~ 64 (66)
T PF14658_consen 20 LITYLRAVTGRSPEESELQDLINELDPEGRDGSVNFDTFLAIMRD 64 (66)
T ss_pred HHHHHHHHcCCCCcHHHHHHHHHHhCCCCCCceEeHHHHHHHHHH
Confidence 3444555554 889999999999866666788999999988764
No 15
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=87.72 E-value=0.75 Score=28.76 Aligned_cols=34 Identities=24% Similarity=0.175 Sum_probs=24.6
Q ss_pred HHHHHHHHHHhh----hCCCCCCCHHHHHHHHhhcCCC
Q psy15328 34 AEIQEWYKGFLK----DCPTGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 34 ~EI~~wYr~Fkk----ecPsG~it~~eFk~iy~~~FP~ 67 (76)
+-|..|.+.|++ +..+|.|+.+||+.++...+|+
T Consensus 6 ~~i~~l~~~F~~y~~~dg~~~~Ls~~Elk~ll~~e~~~ 43 (89)
T cd05023 6 RCIESLIAVFQKYAGKDGDSYQLSKTEFLSFMNTELAS 43 (89)
T ss_pred HHHHHHHHHHHHHhccCCCcCeECHHHHHHHHHHhhhH
Confidence 345666666666 3434589999999999998873
No 16
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=86.62 E-value=4 Score=25.03 Aligned_cols=34 Identities=12% Similarity=0.003 Sum_probs=27.5
Q ss_pred CCHHHHHHHHHHHhhhC--CCCCCCHHHHHHHHhhc
Q psy15328 31 RRDAEIQEWYKGFLKDC--PTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 31 Fs~~EI~~wYr~Fkkec--PsG~it~~eFk~iy~~~ 64 (76)
+|.+|+..+.+.|..-+ .+|.|+.++++.++...
T Consensus 4 ls~~~~~~l~~~F~~~D~d~~G~Is~~el~~~l~~~ 39 (96)
T smart00027 4 ISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPILLKS 39 (96)
T ss_pred CCHHHHHHHHHHHHHhCCCCCCeEeHHHHHHHHHHc
Confidence 57788888888888843 56899999999988774
No 17
>cd05029 S-100A6 S-100A6: S-100A6 domain found in proteins similar to S100A6. S100A6 is a member of the S100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A6 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100A6 is normally expressed in the G1 phase of the cell cycle in neuronal cells. The function of S100A6 remains unclear, but evidence suggests that it is involved in cell cycle regulation and exocytosis. S100A6 may also be involved in tumorigenesis; the protein is overexpressed in several tumors. Ca2+ binding to S100A6 leads to a conformational change in the protein, which exposes a hydrophobic surface for interact
Probab=86.60 E-value=1.1 Score=27.94 Aligned_cols=30 Identities=13% Similarity=0.192 Sum_probs=23.1
Q ss_pred HHHHHHHHHhh-hCCCCCCCHHHHHHHHhhc
Q psy15328 35 EIQEWYKGFLK-DCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 35 EI~~wYr~Fkk-ecPsG~it~~eFk~iy~~~ 64 (76)
-|-.+|..|-. +.|.|.|+.+||+.++...
T Consensus 11 ~~i~~F~~y~~~~~~~g~Is~~EL~~~l~~~ 41 (88)
T cd05029 11 LLVAIFHKYSGREGDKNTLSKKELKELIQKE 41 (88)
T ss_pred HHHHHHHHHHccCCCCCEECHHHHHHHHHHH
Confidence 45667777776 6778899999999988753
No 18
>KOG0041|consensus
Probab=86.02 E-value=1.1 Score=33.67 Aligned_cols=36 Identities=14% Similarity=0.192 Sum_probs=33.2
Q ss_pred CCCHHHHHHHHHHHhh--hCCCCCCCHHHHHHHHhhcC
Q psy15328 30 GRRDAEIQEWYKGFLK--DCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 30 ~Fs~~EI~~wYr~Fkk--ecPsG~it~~eFk~iy~~~F 65 (76)
.|++++|+..|.-|++ +.-+|.|++.|+|.+...+.
T Consensus 92 eFsrkqIk~~~~~Fk~yDe~rDgfIdl~ELK~mmEKLg 129 (244)
T KOG0041|consen 92 EFSRKQIKDAESMFKQYDEDRDGFIDLMELKRMMEKLG 129 (244)
T ss_pred HHHHHHHHHHHHHHHHhcccccccccHHHHHHHHHHhC
Confidence 6999999999999999 88899999999999988854
No 19
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=84.39 E-value=3.4 Score=25.35 Aligned_cols=51 Identities=16% Similarity=0.168 Sum_probs=34.4
Q ss_pred ccccchhHHHhhhhh--cC----CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 13 KGWNSGTKLERGREG--RD----GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 13 ~~~~~~~~~~~~l~~--~T----~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
++.-+...|..-|.. .. ..++.+|..+.+.+- ...+|.|+.++|..++...
T Consensus 24 ~~~Is~~El~~ll~~~~g~~~t~~~~~~~v~~i~~~~D-~d~dG~I~f~eF~~~~~~~ 80 (88)
T cd05030 24 PDTLYKKEFKQLVEKELPNFLKKEKNQKAIDKIFEDLD-TNQDGQLSFEEFLVLVIKV 80 (88)
T ss_pred cccCCHHHHHHHHHHHhhHhhccCCCHHHHHHHHHHcC-CCCCCcCcHHHHHHHHHHH
Confidence 445555555554432 11 233899999999883 4678999999999887753
No 20
>cd00052 EH Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
Probab=84.26 E-value=3.9 Score=22.49 Aligned_cols=34 Identities=18% Similarity=0.119 Sum_probs=26.6
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.+++..+++++.+.|-. ..+|.|+.++|..++..
T Consensus 28 ~g~~~~~~~~i~~~~d~-~~~g~i~~~ef~~~~~~ 61 (67)
T cd00052 28 SGLPRSVLAQIWDLADT-DKDGKLDKEEFAIAMHL 61 (67)
T ss_pred cCCCHHHHHHHHHHhcC-CCCCcCCHHHHHHHHHH
Confidence 36789999999887743 44789999999887754
No 21
>cd05030 calgranulins Calgranulins: S-100 domain found in proteins belonging to the Calgranulin subgroup of the S100 family of EF-hand calcium-modulated proteins, including S100A8, S100A9, and S100A12 . Note that the S-100 hierarchy, to which this Calgranulin group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. These proteins are expressed mainly in granulocytes, and are involved in inflammation, allergy, and neuritogenesis, as well as in host-parasite response. Calgranulins are modulated not only by calcium, but also by other metals such as zinc and copper. Structural data suggested that calgranulins may exist in multiple structural forms, homodimers, as well as hetero-oligomers. For example, the S100A8/S100A9 complex called calprotectin plays important roles in the regulation of inflammatory processes, wound repair, and regulating zinc-dependent enzymes as well as microbial growth.
Probab=81.02 E-value=3.2 Score=25.46 Aligned_cols=18 Identities=22% Similarity=0.288 Sum_probs=13.3
Q ss_pred CCCCCCHHHHHHHHhhcC
Q psy15328 48 PTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 48 PsG~it~~eFk~iy~~~F 65 (76)
++|.|+.+||+.++...+
T Consensus 23 ~~~~Is~~El~~ll~~~~ 40 (88)
T cd05030 23 HPDTLYKKEFKQLVEKEL 40 (88)
T ss_pred CcccCCHHHHHHHHHHHh
Confidence 678888888888886433
No 22
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=80.29 E-value=7.9 Score=23.65 Aligned_cols=34 Identities=15% Similarity=0.297 Sum_probs=26.9
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..++.++|+.+.+.+- ...+|.|+..+|..+...
T Consensus 46 ~~~s~~ei~~~~~~~D-~~~dg~I~f~eF~~l~~~ 79 (94)
T cd05031 46 NQKDPMAVDKIMKDLD-QNRDGKVNFEEFVSLVAG 79 (94)
T ss_pred ccccHHHHHHHHHHhC-CCCCCcCcHHHHHHHHHH
Confidence 3678889999988873 345789999999887764
No 23
>PF00446 GnRH: Gonadotropin-releasing hormone; InterPro: IPR002012 The gonadotropin-releasing hormones (GnRH) (gonadoliberin) [] are a family of peptides that play a pivotal role in reproduction. The main function of GnRH is to act on the pituitary to stimulate the synthesis and secretion of luteinizing and follicle-stimulating hormones, but GnRH also acts on the brain, retina, sympathetic nervous system, gonads and placenta in certain species. There seems to be at least three forms of GnRH. The second form is expressed in midbrain and seems to be widespread. The third form has only been found so far in fish. GnRH is a C-terminal amidated decapeptide processed from a larger precursor protein. Four of the ten residues are perfectly conserved in all species where GnRH has been sequenced.; GO: 0005179 hormone activity, 0007275 multicellular organismal development, 0005576 extracellular region
Probab=79.79 E-value=0.82 Score=18.89 Aligned_cols=9 Identities=56% Similarity=1.616 Sum_probs=7.1
Q ss_pred ccccccccC
Q psy15328 3 LWSKGWNSG 11 (76)
Q Consensus 3 ~~~~~~~~~ 11 (76)
-||-||+.|
T Consensus 2 HwS~~w~PG 10 (10)
T PF00446_consen 2 HWSHGWKPG 10 (10)
T ss_pred ccccccCCC
Confidence 489999865
No 24
>PF13499 EF-hand_7: EF-hand domain pair; PDB: 1TCF_A 2TN4_A 1TN4_A 1A2X_A 2CT9_B 2OTG_B 2OS8_B 1SNL_A 3O4Y_A 3J04_E ....
Probab=77.73 E-value=2.8 Score=23.47 Aligned_cols=19 Identities=21% Similarity=0.240 Sum_probs=11.9
Q ss_pred CCCCCCCHHHHHHHHhhcC
Q psy15328 47 CPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 47 cPsG~it~~eFk~iy~~~F 65 (76)
..+|.|+.+||+.++....
T Consensus 12 d~~G~i~~~el~~~~~~~~ 30 (66)
T PF13499_consen 12 DGDGYISKEELRRALKHLG 30 (66)
T ss_dssp TSSSEEEHHHHHHHHHHTT
T ss_pred CccCCCCHHHHHHHHHHhc
Confidence 3466677777666666654
No 25
>KOG0027|consensus
Probab=77.49 E-value=5.4 Score=26.29 Aligned_cols=34 Identities=29% Similarity=0.442 Sum_probs=29.6
Q ss_pred CHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCC
Q psy15328 32 RDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFP 66 (76)
Q Consensus 32 s~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP 66 (76)
+.+||++-+|-|=.+ -+|.||..||+.+...+..
T Consensus 83 ~~~el~eaF~~fD~d-~~G~Is~~el~~~l~~lg~ 116 (151)
T KOG0027|consen 83 SSEELKEAFRVFDKD-GDGFISASELKKVLTSLGE 116 (151)
T ss_pred cHHHHHHHHHHHccC-CCCcCcHHHHHHHHHHhCC
Confidence 577999999999877 8999999999999988654
No 26
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=74.29 E-value=15 Score=22.62 Aligned_cols=33 Identities=6% Similarity=0.139 Sum_probs=24.4
Q ss_pred CCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 31 RRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 31 Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
-+..+|.++.+.+- ...+|.|+.+||..+...+
T Consensus 50 ~~~~~v~~i~~elD-~n~dG~Idf~EF~~l~~~l 82 (93)
T cd05026 50 KDPMLVDKIMNDLD-SNKDNEVDFNEFVVLVAAL 82 (93)
T ss_pred cCHHHHHHHHHHhC-CCCCCCCCHHHHHHHHHHH
Confidence 36667777777774 3568999999998887653
No 27
>PF11569 Homez: Homeodomain leucine-zipper encoding, Homez; PDB: 2YS9_A.
Probab=73.31 E-value=3.7 Score=24.53 Aligned_cols=22 Identities=5% Similarity=0.187 Sum_probs=16.9
Q ss_pred HhhhhhcCCCCHHHHHHHHHHH
Q psy15328 22 ERGREGRDGRRDAEIQEWYKGF 43 (76)
Q Consensus 22 ~~~l~~~T~Fs~~EI~~wYr~F 43 (76)
.+.|+..|+++.++|+.||..=
T Consensus 28 L~~L~~kS~ms~qqVr~WFa~~ 49 (56)
T PF11569_consen 28 LDELCDKSRMSYQQVRDWFAER 49 (56)
T ss_dssp HHHHHHHTT--HHHHHHHHHHH
T ss_pred HHHHHHHHCCCHHHHHHHHHHh
Confidence 3568999999999999999764
No 28
>cd00213 S-100 S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents th
Probab=72.92 E-value=10 Score=22.52 Aligned_cols=32 Identities=16% Similarity=0.311 Sum_probs=25.5
Q ss_pred CCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 31 RRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 31 Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.+.++|..+++.|-. .-+|.|+.++|..+...
T Consensus 48 ~~~~ei~~i~~~~d~-~~~g~I~f~eF~~~~~~ 79 (88)
T cd00213 48 KDPEAVDKIMKDLDV-NKDGKVDFQEFLVLIGK 79 (88)
T ss_pred CCHHHHHHHHHHhcc-CCCCcCcHHHHHHHHHH
Confidence 468999999988843 34689999999988765
No 29
>PRK04387 hypothetical protein; Provisional
Probab=72.72 E-value=7.6 Score=25.21 Aligned_cols=33 Identities=27% Similarity=0.410 Sum_probs=26.3
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
..+.++++|...||+||+-.|+ +.|=++|+..|
T Consensus 31 E~gv~re~ll~~Y~~FK~VVps----K~EEKql~reF 63 (90)
T PRK04387 31 EKGVDAEELLDAYRRFKEIVPS----KAEEKQIDREF 63 (90)
T ss_pred HcCCcHHHHHHHHHHHHHhcCC----hHHHHHHHHHH
Confidence 3578999999999999999996 55666666654
No 30
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=72.45 E-value=4.5 Score=25.10 Aligned_cols=33 Identities=12% Similarity=0.201 Sum_probs=22.3
Q ss_pred HHHHHHHHHHhhhCCCC-CCCHHHHHHHHhhcCC
Q psy15328 34 AEIQEWYKGFLKDCPTG-HLSVEEFKKIYGNFFP 66 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG-~it~~eFk~iy~~~FP 66 (76)
.+|+..|+.|-....+| .|+..||+.+...-||
T Consensus 8 ~~l~~aF~~fD~~dgdG~~I~~~eL~~ll~~~~~ 41 (88)
T cd05027 8 VALIDVFHQYSGREGDKHKLKKSELKELINNELS 41 (88)
T ss_pred HHHHHHHHHhcccCCCcCEECHHHHHHHHHHHhH
Confidence 34566666664345688 5999999998887443
No 31
>cd05031 S-100A10_like S-100A10_like: S-100A10 domain found in proteins similar to S100A10. S100A10 is a member of the S100 family of EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1_like group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. A unique feature of S100A10 is that it contains mutation in both of the calcium binding sites, making it calcium insensitive. S100A10 has been detected in brain, heart, gastrointestinal tract, kidney, liver, lung, spleen, testes, epidermis, aorta, and thymus. Structural data supports the homo- and hetero-dimeric as well as hetero-tetrameric nature of the protein. S100A10 has multiple binding partners in its calcium free state and is therefore involved in many diverse biological functions.
Probab=72.36 E-value=5.5 Score=24.35 Aligned_cols=31 Identities=19% Similarity=0.233 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHhh-hCCCCCCCHHHHHHHHhh
Q psy15328 33 DAEIQEWYKGFLK-DCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 33 ~~EI~~wYr~Fkk-ecPsG~it~~eFk~iy~~ 63 (76)
..+|+..|+.|-. +..+|.|+.+|++.++..
T Consensus 7 ~~~l~~~F~~~D~~dg~dG~Is~~El~~~l~~ 38 (94)
T cd05031 7 MESLILTFHRYAGKDGDKNTLSRKELKKLMEK 38 (94)
T ss_pred HHHHHHHHHHHhccCCCCCeECHHHHHHHHHH
Confidence 3456666666654 334699999999988875
No 32
>cd00086 homeodomain Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Probab=71.79 E-value=4.9 Score=21.85 Aligned_cols=24 Identities=8% Similarity=0.271 Sum_probs=19.5
Q ss_pred HHHhhhhhcCCCCHHHHHHHHHHH
Q psy15328 20 KLERGREGRDGRRDAEIQEWYKGF 43 (76)
Q Consensus 20 ~~~~~l~~~T~Fs~~EI~~wYr~F 43 (76)
...+.|...|+++..+|+.|++.=
T Consensus 28 ~~~~~la~~~~l~~~qV~~WF~nr 51 (59)
T cd00086 28 EEREELAKELGLTERQVKIWFQNR 51 (59)
T ss_pred HHHHHHHHHHCcCHHHHHHHHHHH
Confidence 445678899999999999998753
No 33
>cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac
Probab=71.11 E-value=6.6 Score=24.31 Aligned_cols=31 Identities=13% Similarity=0.158 Sum_probs=20.8
Q ss_pred HHHHHHHHHHhhhCCCC-CCCHHHHHHHHhhc
Q psy15328 34 AEIQEWYKGFLKDCPTG-HLSVEEFKKIYGNF 64 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG-~it~~eFk~iy~~~ 64 (76)
.+|++.|+.|-...-+| .|+..||+.++...
T Consensus 10 ~~~~~~F~~~dd~dgdg~~Is~~EL~~ll~~~ 41 (93)
T cd05026 10 DTLIRIFHNYSGKEGDRYKLSKGELKELLQRE 41 (93)
T ss_pred HHHHHHHHHHHccCCCCCEECHHHHHHHHHHH
Confidence 35566666666445566 58888888888663
No 34
>cd00051 EFh EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Probab=70.59 E-value=12 Score=18.95 Aligned_cols=32 Identities=28% Similarity=0.448 Sum_probs=20.9
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHH
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIY 61 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy 61 (76)
...+.++++.+++.|-. ..+|.|+.++|..++
T Consensus 31 ~~~~~~~~~~~~~~~~~-~~~~~l~~~ef~~~~ 62 (63)
T cd00051 31 EGLSEEEIDEMIREVDK-DGDGKIDFEEFLELM 62 (63)
T ss_pred CCCCHHHHHHHHHHhCC-CCCCeEeHHHHHHHh
Confidence 45677777777776633 345778888877653
No 35
>PTZ00183 centrin; Provisional
Probab=69.42 E-value=11 Score=23.90 Aligned_cols=33 Identities=30% Similarity=0.336 Sum_probs=21.8
Q ss_pred CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 30 GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
+++..+++.++..|-. ...|.|+.++|..+...
T Consensus 122 ~l~~~~~~~~~~~~d~-~~~g~i~~~ef~~~~~~ 154 (158)
T PTZ00183 122 TITDEELQEMIDEADR-NGDGEISEEEFYRIMKK 154 (158)
T ss_pred CCCHHHHHHHHHHhCC-CCCCcCcHHHHHHHHhc
Confidence 3677777777776632 34577777777777654
No 36
>PF12174 RST: RCD1-SRO-TAF4 (RST) plant domain; InterPro: IPR022003 This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors.
Probab=69.07 E-value=9 Score=23.43 Aligned_cols=45 Identities=20% Similarity=0.173 Sum_probs=30.8
Q ss_pred cccchhHHHhhhhhcCCCC-HHHHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 14 GWNSGTKLERGREGRDGRR-DAEIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 14 ~~~~~~~~~~~l~~~T~Fs-~~EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
.|.+=+.|...|...=.-+ ...|...|..|+ .+.|+++||.+-..
T Consensus 7 p~~~F~~L~~~l~~~l~~~~~~~l~~~Y~~~k----~~kIsR~~fvr~lR 52 (70)
T PF12174_consen 7 PWMPFPMLFSALSKHLPPSKMDLLQKHYEEFK----KKKISREEFVRKLR 52 (70)
T ss_pred CcccHHHHHHHHHHHCCHHHHHHHHHHHHHHH----HCCCCHHHHHHHHH
Confidence 4666666666555433322 356788999998 78999999976544
No 37
>PF09279 EF-hand_like: Phosphoinositide-specific phospholipase C, efhand-like; InterPro: IPR015359 This domain is predominantly found in the enzyme phosphoinositol-specific phospholipase C. It adopts a structure consisting of a core of four alpha helices, in an EF like fold, and is required for functioning of the enzyme []. ; PDB: 3OHM_B 3QR0_A 2ZKM_X 2FJU_B 3QR1_D 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B ....
Probab=67.63 E-value=7.8 Score=23.01 Aligned_cols=26 Identities=23% Similarity=0.530 Sum_probs=16.6
Q ss_pred HHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 35 EIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 35 EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
||+.+|+.+.. ..+.||.++|.+...
T Consensus 1 ei~~if~~ys~--~~~~mt~~~f~~FL~ 26 (83)
T PF09279_consen 1 EIEEIFRKYSS--DKEYMTAEEFRRFLR 26 (83)
T ss_dssp HHHHHHHHHCT--TSSSEEHHHHHHHHH
T ss_pred CHHHHHHHHhC--CCCcCCHHHHHHHHH
Confidence 56666666633 466777777776664
No 38
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=67.40 E-value=15 Score=22.18 Aligned_cols=33 Identities=12% Similarity=0.243 Sum_probs=24.1
Q ss_pred CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 30 GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..++.+|+.+.+.+- ..-+|.|+..+|..+...
T Consensus 48 ~~s~~~v~~i~~~~D-~d~~G~I~f~eF~~l~~~ 80 (92)
T cd05025 48 QKDADAVDKIMKELD-ENGDGEVDFQEFVVLVAA 80 (92)
T ss_pred CCCHHHHHHHHHHHC-CCCCCcCcHHHHHHHHHH
Confidence 347778888887773 334788999999887764
No 39
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=67.20 E-value=9.7 Score=23.85 Aligned_cols=29 Identities=10% Similarity=0.161 Sum_probs=20.4
Q ss_pred HHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 35 EIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 35 EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.|...|+.|-+.-.+|.|+..||+.+...
T Consensus 9 ~l~~~F~~fd~~~~~g~i~~~ELk~ll~~ 37 (89)
T cd05022 9 TLVSNFHKASVKGGKESLTASEFQELLTQ 37 (89)
T ss_pred HHHHHHHHHhCCCCCCeECHHHHHHHHHH
Confidence 45555555543367888888888888877
No 40
>KOG0027|consensus
Probab=65.88 E-value=19 Score=23.59 Aligned_cols=32 Identities=22% Similarity=0.377 Sum_probs=18.4
Q ss_pred CCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 31 RRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 31 Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.++.||+.+.+.+-.+ .+|.|+..+|..+...
T Consensus 41 ~t~~el~~~~~~~D~d-g~g~I~~~eF~~l~~~ 72 (151)
T KOG0027|consen 41 PTEEELRDLIKEIDLD-GDGTIDFEEFLDLMEK 72 (151)
T ss_pred CCHHHHHHHHHHhCCC-CCCeEcHHHHHHHHHh
Confidence 4566666665555433 4566666666666554
No 41
>cd05022 S-100A13 S-100A13: S-100A13 domain found in proteins similar to S100A13. S100A13 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A13 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100A13 is involved in the cellular export of interleukin-1 (IL-1) and of fibroblast growth factor-1 (FGF-1), which plays an important role in angiogenesis and tissue regeneration. Export is based on the CuII-dependent formation of multiprotein complexes containing the S100A13 protein. Assembly of these complexes occurs near the inner surface of the plasma membrane. Binding of two Ca(II) ions per monomer triggers key conformational changes leading to the creation of two identical and symmetrical Cu(II)-binding sites on the surface of the protein, close to the interface between the two monomers. These Cu
Probab=65.86 E-value=23 Score=22.15 Aligned_cols=52 Identities=21% Similarity=0.218 Sum_probs=35.8
Q ss_pred CCccccchhHHHhhhhh--cCCCCH-HHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 11 GAKGWNSGTKLERGREG--RDGRRD-AEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 11 ~~~~~~~~~~~~~~l~~--~T~Fs~-~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
+.++.-+..-|..-|.. ...++. +++..+.+..- ..-+|.|+.+||..+.+.
T Consensus 21 ~~~g~i~~~ELk~ll~~elg~~ls~~~~v~~mi~~~D-~d~DG~I~F~EF~~l~~~ 75 (89)
T cd05022 21 GGKESLTASEFQELLTQQLPHLLKDVEGLEEKMKNLD-VNQDSKLSFEEFWELIGE 75 (89)
T ss_pred CCCCeECHHHHHHHHHHHhhhhccCHHHHHHHHHHhC-CCCCCCCcHHHHHHHHHH
Confidence 34555555555555554 244666 88888876653 577899999999888765
No 42
>PF12872 OST-HTH: OST-HTH/LOTUS domain; PDB: 2KPM_A 3S93_B 3RCO_A 2KZV_A.
Probab=65.78 E-value=12 Score=21.55 Aligned_cols=25 Identities=24% Similarity=0.470 Sum_probs=17.3
Q ss_pred CCCCCHHHHHHHHhhcCCCCCCCCc
Q psy15328 49 TGHLSVEEFKKIYGNFFPYGDASKY 73 (76)
Q Consensus 49 sG~it~~eFk~iy~~~FP~Gda~~f 73 (76)
+|.++..++...|.+.+|.=++..|
T Consensus 21 ~g~v~ls~l~~~~~~~~~~f~~~~y 45 (74)
T PF12872_consen 21 DGWVSLSQLGQEYKKKYPDFDPRDY 45 (74)
T ss_dssp TSSEEHHHHHHHHHHHHTT--TCCT
T ss_pred CceEEHHHHHHHHHHHCCCCCcccc
Confidence 5678888888888887776665544
No 43
>cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins.
Probab=64.73 E-value=14 Score=22.41 Aligned_cols=30 Identities=13% Similarity=0.165 Sum_probs=18.5
Q ss_pred HHHHHHHHHHhhhCCCC-CCCHHHHHHHHhh
Q psy15328 34 AEIQEWYKGFLKDCPTG-HLSVEEFKKIYGN 63 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG-~it~~eFk~iy~~ 63 (76)
++|++.|+.|-...-+| .|+.+||+.++..
T Consensus 9 ~~l~~~F~~fDd~dg~G~~Is~~El~~~l~~ 39 (92)
T cd05025 9 ETLINVFHAHSGKEGDKYKLSKKELKDLLQT 39 (92)
T ss_pred HHHHHHHHHHhcccCCCCeECHHHHHHHHHH
Confidence 45666666664233467 4777777777754
No 44
>KOG0488|consensus
Probab=64.06 E-value=4.6 Score=30.86 Aligned_cols=23 Identities=30% Similarity=0.370 Sum_probs=21.1
Q ss_pred hhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
..++..|.||..+|+.|+++|.+
T Consensus 171 K~RksRTaFT~~Ql~~LEkrF~~ 193 (309)
T KOG0488|consen 171 KRRKSRTAFSDHQLFELEKRFEK 193 (309)
T ss_pred ccccchhhhhHHHHHHHHHHHHH
Confidence 56689999999999999999987
No 45
>PLN02964 phosphatidylserine decarboxylase
Probab=63.71 E-value=18 Score=30.46 Aligned_cols=31 Identities=13% Similarity=-0.129 Sum_probs=24.4
Q ss_pred hHHHhhhhhc--CCCCHHHHHHHHHHHhhhCCCC
Q psy15328 19 TKLERGREGR--DGRRDAEIQEWYKGFLKDCPTG 50 (76)
Q Consensus 19 ~~~~~~l~~~--T~Fs~~EI~~wYr~FkkecPsG 50 (76)
+.|+. ++.. |+|+.+|++++++.|..-+|+|
T Consensus 124 n~lv~-~~e~~~t~f~~kqi~elkeaF~lfD~dg 156 (644)
T PLN02964 124 NTLVG-YCELDLFDFVTQEPESACESFDLLDPSS 156 (644)
T ss_pred HHhhh-heeecHhhccHHHHHHHHHHHHHHCCCC
Confidence 34443 4666 8999999999999999988885
No 46
>PF09966 DUF2200: Uncharacterized protein conserved in bacteria (DUF2200); InterPro: IPR014580 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.; PDB: 3C9P_A.
Probab=63.54 E-value=7.9 Score=26.11 Aligned_cols=32 Identities=16% Similarity=0.320 Sum_probs=22.7
Q ss_pred chhHHHhhhhhcCCCCHHHHHHH------HHHHhhhCC
Q psy15328 17 SGTKLERGREGRDGRRDAEIQEW------YKGFLKDCP 48 (76)
Q Consensus 17 ~~~~~~~~l~~~T~Fs~~EI~~w------Yr~FkkecP 48 (76)
+...+.+.++..|+|+.++|+.+ |..|-.+-|
T Consensus 24 t~~Evd~vi~WLTGy~~~~l~~~~~~~~~~~~FF~~AP 61 (111)
T PF09966_consen 24 TKEEVDQVIRWLTGYDQEELQAQIESKVTFETFFAQAP 61 (111)
T ss_dssp -HHHHHHHHHHHH---HHHHHHHTTS--BHHHHHHT-S
T ss_pred CHHHHHHHHHHHhcCCHHHHHHHHHcCCCHHHHHHHcc
Confidence 45567888999999999999998 788877766
No 47
>smart00389 HOX Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes
Probab=63.46 E-value=8.8 Score=20.76 Aligned_cols=25 Identities=8% Similarity=0.368 Sum_probs=20.1
Q ss_pred hhHHHhhhhhcCCCCHHHHHHHHHH
Q psy15328 18 GTKLERGREGRDGRRDAEIQEWYKG 42 (76)
Q Consensus 18 ~~~~~~~l~~~T~Fs~~EI~~wYr~ 42 (76)
......+|...++++..+|+.|+..
T Consensus 26 ~~~~~~~la~~~~l~~~qV~~WF~n 50 (56)
T smart00389 26 SREEREELAAKLGLSERQVKVWFQN 50 (56)
T ss_pred CHHHHHHHHHHHCcCHHHHHHhHHH
Confidence 3455677889999999999999875
No 48
>KOG0486|consensus
Probab=63.22 E-value=6.8 Score=30.95 Aligned_cols=39 Identities=21% Similarity=0.221 Sum_probs=29.9
Q ss_pred hhHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHH
Q psy15328 18 GTKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEE 56 (76)
Q Consensus 18 ~~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~e 56 (76)
+++|.+|.++.|||+..+|++|=.-|...|-.-.-+++|
T Consensus 106 ~kki~KqrrQrthFtSqqlqele~tF~rNrypdMstrEE 144 (351)
T KOG0486|consen 106 KKKISKQRRQRTHFTSQQLQELEATFQRNRYPDMSTREE 144 (351)
T ss_pred cchhhhhhhhhhhhHHHHHHHHHHHHhhccCCccchhhH
Confidence 456788999999999999999999998855433334443
No 49
>PF09851 SHOCT: Short C-terminal domain; InterPro: IPR018649 This family of hypothetical prokaryotic proteins has no known function.
Probab=63.03 E-value=19 Score=18.45 Aligned_cols=25 Identities=16% Similarity=0.300 Sum_probs=16.6
Q ss_pred HHHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 34 AEIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
++|+.|-..+. +|.||.+||.+.-.
T Consensus 3 ~~L~~L~~l~~----~G~IseeEy~~~k~ 27 (31)
T PF09851_consen 3 DRLEKLKELYD----KGEISEEEYEQKKA 27 (31)
T ss_pred HHHHHHHHHHH----cCCCCHHHHHHHHH
Confidence 34445444443 89999999987543
No 50
>PF03941 INCENP_ARK-bind: Inner centromere protein, ARK binding region; InterPro: IPR005635 This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregation with cell division in yeast [].; PDB: 2BFX_C 2BFY_C 3ZTX_D 2VGO_D 2VGP_D 2VRX_D 4AF3_D.
Probab=62.82 E-value=4 Score=23.71 Aligned_cols=25 Identities=16% Similarity=0.340 Sum_probs=19.0
Q ss_pred CccccchhHHHhhhhhcCCCCHHHH
Q psy15328 12 AKGWNSGTKLERGREGRDGRRDAEI 36 (76)
Q Consensus 12 ~~~~~~~~~~~~~l~~~T~Fs~~EI 36 (76)
...||.++.|.++|.++-+.+..+|
T Consensus 18 iP~WA~~~~L~~~L~~Q~~~Dpd~I 42 (57)
T PF03941_consen 18 IPSWAQSPNLRQALKKQQNIDPDEI 42 (57)
T ss_dssp --GGGSHHHHHHHHHHHHHS-HHHH
T ss_pred CCCCcCcHHHHHHHHHHhccCHHHH
Confidence 4589999999999998888766654
No 51
>COG4875 Uncharacterized protein conserved in bacteria with a cystatin-like fold [Function unknown]
Probab=61.78 E-value=5.8 Score=27.86 Aligned_cols=43 Identities=19% Similarity=0.309 Sum_probs=36.7
Q ss_pred cccchhHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHH
Q psy15328 14 GWNSGTKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEE 56 (76)
Q Consensus 14 ~~~~~~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~e 56 (76)
..+..-+|.--|+.+-..++.||...++.|++.-|-|.|+-..
T Consensus 61 nyApDaVLLPT~Sn~vR~s~~ei~DYF~~FLk~KPqG~IdsR~ 103 (156)
T COG4875 61 NYAPDAVLLPTMSNQVRSSRSEILDYFSHFLKLKPQGYIDSRK 103 (156)
T ss_pred hcCCceEeecccccccccCHHHHHHHHHHHhccCCcceeccee
Confidence 4566667778888888999999999999999999999997543
No 52
>KOG0028|consensus
Probab=59.85 E-value=16 Score=26.29 Aligned_cols=35 Identities=23% Similarity=0.309 Sum_probs=30.7
Q ss_pred CHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCCC
Q psy15328 32 RDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 32 s~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP~ 67 (76)
+.+||++-+|-|. ..-+|.|+..+|+.+-..+.|+
T Consensus 104 t~eEi~~afrl~D-~D~~Gkis~~~lkrvakeLgen 138 (172)
T KOG0028|consen 104 TKEEIKKAFRLFD-DDKTGKISQRNLKRVAKELGEN 138 (172)
T ss_pred cHHHHHHHHHccc-ccCCCCcCHHHHHHHHHHhCcc
Confidence 8899999999886 5568999999999999998885
No 53
>cd05023 S-100A11 S-100A11: S-100A11 domain found in proteins similar to S100A11. S100A11 is a member of the S-100 domain family within EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100A11 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control . S100 proteins have also been associated with a variety of pathological events, including neoplastic transformation and neurodegenerative diseases such as Alzheimer's, usually via over expression of the protein. S100A11 is expressed in smooth muscle and other tissues and involves in calcium-dependent membrane aggregation, which is important for cell vesiculation . As is the case for many other S100 proteins, S
Probab=59.68 E-value=23 Score=21.94 Aligned_cols=33 Identities=18% Similarity=0.299 Sum_probs=22.5
Q ss_pred CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 30 GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.-+..++.++.+.+- ...+|.|+.+||..+...
T Consensus 48 ~~~~~~~~~ll~~~D-~d~DG~I~f~EF~~l~~~ 80 (89)
T cd05023 48 QKDPGVLDRMMKKLD-LNSDGQLDFQEFLNLIGG 80 (89)
T ss_pred CCCHHHHHHHHHHcC-CCCCCcCcHHHHHHHHHH
Confidence 345566666666553 467889999999877654
No 54
>PF13202 EF-hand_5: EF hand; PDB: 3DD4_A 2Q4U_A 2BE4_A 1UHJ_B 1UHI_A 1UHH_B 1EJ3_B 1UHK_A 2ZFD_A 1UHN_A ....
Probab=59.16 E-value=9.2 Score=18.56 Aligned_cols=15 Identities=33% Similarity=0.638 Sum_probs=12.9
Q ss_pred hCCCCCCCHHHHHHH
Q psy15328 46 DCPTGHLSVEEFKKI 60 (76)
Q Consensus 46 ecPsG~it~~eFk~i 60 (76)
..-+|.|+.+||+++
T Consensus 10 ~d~DG~is~~E~~~~ 24 (25)
T PF13202_consen 10 TDGDGKISFEEFQRL 24 (25)
T ss_dssp TTSSSEEEHHHHHHH
T ss_pred CCCCCcCCHHHHHHH
Confidence 567899999999875
No 55
>PF01023 S_100: S-100/ICaBP type calcium binding domain; InterPro: IPR013787 The calcium-binding domain found in S100 and CaBP-9k proteins is a subfamily of the EF-hand calcium-binding domain []. S100s are small dimeric acidic calcium and zinc-binding proteins abundant in the brain, with S100B playing an important role in modulating the proliferation and differentiation of neurons and glia cells []. S100 proteins have two different types of calcium-binding sites: a low affinity one with a special structure, and a 'normal' EF-hand type high-affinity site. Calbindin-D9k (CaBP-9k) also belong to this family of proteins, but it does not form dimers. CaBP-9k is a cytosolic protein expressed in a variety of tissues. Although its precise function is unknown, it appears to be under the control of the steroid hormones oestrogen and progesterone in the female reproductive system []. In the intestine, CaBP-9k may be involved in calcium absorption by mediating intracellular diffusion []. This entry represents a subdomain of the calcium-binding domain found in S100, CaBP-9k, and related proteins.; PDB: 2RGI_A 4DUQ_B 2KAY_B 2KAX_A 2CNP_A 1CNP_A 1A03_A 1JWD_B 2JTT_A 1XK4_B ....
Probab=58.61 E-value=25 Score=19.42 Aligned_cols=32 Identities=19% Similarity=0.338 Sum_probs=24.0
Q ss_pred HHHHHHHHhh-hCCCCCCCHHHHHHHHhhcCCC
Q psy15328 36 IQEWYKGFLK-DCPTGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 36 I~~wYr~Fkk-ecPsG~it~~eFk~iy~~~FP~ 67 (76)
|...|..+.. +.-...|++.||+.+...-||+
T Consensus 8 iI~vFhkYa~~~Gd~~~Lsk~Elk~Ll~~Elp~ 40 (44)
T PF01023_consen 8 IIDVFHKYAGKEGDKDTLSKKELKELLEKELPN 40 (44)
T ss_dssp HHHHHHHHHTSSSSTTSEEHHHHHHHHHHHSTT
T ss_pred HHHHHHHHhccCCCCCeEcHHHHHHHHHHHHHH
Confidence 4445555543 5667799999999999998885
No 56
>PF05193 Peptidase_M16_C: Peptidase M16 inactive domain; InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. The peptidases in this group of sequences include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=58.27 E-value=6.1 Score=24.72 Aligned_cols=36 Identities=19% Similarity=0.490 Sum_probs=28.1
Q ss_pred CCCHHHHHHHHHHHhh-----hCCCCCCCHHHHHHHHhhcC
Q psy15328 30 GRRDAEIQEWYKGFLK-----DCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 30 ~Fs~~EI~~wYr~Fkk-----ecPsG~it~~eFk~iy~~~F 65 (76)
++|.++|+++|++|.. -+-.|-++.++..++...+|
T Consensus 1 ~it~e~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~~ 41 (184)
T PF05193_consen 1 NITLEDLRAFYKKFYRPSNMTLVIVGDIDPDELEKLIEKYF 41 (184)
T ss_dssp C--HHHHHHHHHHHSSGGGEEEEEEESSGHHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHhcCccceEEEEEcCccHHHHHHHHHhhh
Confidence 4788999999999987 46678899999987776665
No 57
>cd05027 S-100B S-100B: S-100B domain found in proteins similar to S100B. S100B is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100B group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100B is most abundant in glial cells of the central nervous system, predominately in astrocytes. S100B is involved in signal transduction via the inhibition of protein phoshorylation, regulation of enzyme activity and by affecting the calcium homeostasis. Upon calcium binding the S100B homodimer changes conformation to expose a hydrophobic cleft, which represents the interaction site of S100B with its more than 20 known target proteins. These target proteins include several cellular architecture proteins such as tubulin and GFAP; S100B can inhibit polymerization of these oligomeric molecules. Furthermore, S100B i
Probab=57.40 E-value=43 Score=20.60 Aligned_cols=34 Identities=9% Similarity=0.157 Sum_probs=26.2
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
...++++|..+.+.. ...-+|.|+.++|..+...
T Consensus 46 ~~~~~~~v~~~i~~~-D~n~dG~v~f~eF~~li~~ 79 (88)
T cd05027 46 EIKEQEVVDKVMETL-DSDGDGECDFQEFMAFVAM 79 (88)
T ss_pred CCCCHHHHHHHHHHh-CCCCCCcCcHHHHHHHHHH
Confidence 346888899988877 3456789999999887654
No 58
>PF07914 DUF1679: Protein of unknown function (DUF1679); InterPro: IPR012877 This region is found in a number of Caenorhabditis elegans and Caenorhabditis briggsae proteins, in one case (Q19034 from SWISSPROT) as a repeat. In many of the family members, this region is associated with the CHK region described by SMART as being found in zinc finger-C4 and HLH domain-containing kinases. In fact, one member of this family (Q9GUC1 from SWISSPROT) is annotated as being a member of the nuclear hormone receptor family, and contains regions typical of such proteins (IPR000536 from INTERPRO, IPR008946 from INTERPRO, and IPR001628 from INTERPRO).
Probab=56.84 E-value=6.6 Score=30.82 Aligned_cols=36 Identities=25% Similarity=0.662 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHhhhC-CCC--CCCHHHHHHHHhhcCCCC
Q psy15328 33 DAEIQEWYKGFLKDC-PTG--HLSVEEFKKIYGNFFPYG 68 (76)
Q Consensus 33 ~~EI~~wYr~Fkkec-PsG--~it~~eFk~iy~~~FP~G 68 (76)
+.=|...|..|.+.+ +.| -.|.+++++.|..+||.+
T Consensus 328 ~~lL~~Yy~~f~~~l~~~~~~PfT~eqL~~sY~l~fp~~ 366 (414)
T PF07914_consen 328 EELLEYYYDTFTEALEDGGKAPFTLEQLKDSYRLYFPFG 366 (414)
T ss_pred HHHHHHHHHHHHHHHhccCCCCccHHHHHHHHHHHHHHH
Confidence 445788999999988 333 459999999999999975
No 59
>PF15187 Augurin: Oesophageal cancer-related gene 4
Probab=55.85 E-value=11 Score=25.48 Aligned_cols=25 Identities=32% Similarity=0.664 Sum_probs=18.1
Q ss_pred HHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 21 LERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 21 ~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
|.|+-...=.=|.-|+|+||..|+=
T Consensus 35 l~R~kR~lWDRsrPdVQQW~qQFlY 59 (114)
T PF15187_consen 35 LKRQKRQLWDRSRPDVQQWYQQFLY 59 (114)
T ss_pred hhhHHHhhhccCCHHHHHHHHHHHH
Confidence 4444444446688999999999983
No 60
>TIGR01669 phage_XkdX phage uncharacterized protein, XkdX family. This model represents a family of small (about 50 amino acid) phage proteins, found in at least 12 different phage and prophage regions of Gram-positive bacteria. In a number of these phage, the gene for this protein is found near the holin and endolysin genes.
Probab=55.19 E-value=28 Score=19.52 Aligned_cols=18 Identities=22% Similarity=0.296 Sum_probs=14.5
Q ss_pred hCCCCCCCHHHHHHHHhh
Q psy15328 46 DCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 46 ecPsG~it~~eFk~iy~~ 63 (76)
-.+-|.||.+|++.|-+.
T Consensus 26 ~V~~~~IT~eey~eITG~ 43 (45)
T TIGR01669 26 FVEKKLITREQYKVITGE 43 (45)
T ss_pred HhhcCccCHHHHHHHhCC
Confidence 356699999999999653
No 61
>KOG0046|consensus
Probab=54.95 E-value=16 Score=30.89 Aligned_cols=39 Identities=21% Similarity=0.205 Sum_probs=33.6
Q ss_pred hhcCCCCHHHHHHHHHHHhh-hCCCCCCCHHHHHHHHhhc
Q psy15328 26 EGRDGRRDAEIQEWYKGFLK-DCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkk-ecPsG~it~~eFk~iy~~~ 64 (76)
..++.||..|++.+-..|.+ +.-+|.++..+....|...
T Consensus 8 ~~~~~~tq~El~~l~~kF~~~d~~~G~v~~~~l~~~f~k~ 47 (627)
T KOG0046|consen 8 WLQSQLTQEELRELKEKFNKLDDQKGYVTVYELPDAFKKA 47 (627)
T ss_pred hhcccccHHHHHHHHHHHHhhcCCCCeeehHHhHHHHHHh
Confidence 56788999999999999988 7888899988888888764
No 62
>cd04790 HTH_Cfa-like_unk Helix-Turn-Helix DNA binding domain of putative Cfa-like transcription regulators. Putative helix-turn-helix (HTH) MerR-like transcription regulator; conserved, Cfa-like, unknown proteins (~172 a.a.). The N-terminal domain of these proteins appears to be related to the HTH domain of Cfa, a cyclopropane fatty acid synthase. These Cfa-like proteins have a unique C-terminal domain with conserved histidines (motif HXXFX7HXXF). Based on sequence similarity of the N-terminal domains, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domain
Probab=52.55 E-value=11 Score=26.16 Aligned_cols=28 Identities=25% Similarity=0.413 Sum_probs=24.2
Q ss_pred hhhhcCCCCHHHHHHHHHHHhhhCCCCC
Q psy15328 24 GREGRDGRRDAEIQEWYKGFLKDCPTGH 51 (76)
Q Consensus 24 ~l~~~T~Fs~~EI~~wYr~FkkecPsG~ 51 (76)
.|....++++..+++|+..|-+..|.|.
T Consensus 123 ~l~~~~g~~~~~m~~wh~~fe~~~p~~h 150 (172)
T cd04790 123 AILKAAGMDEADMRRWHIEFEKMEPEAH 150 (172)
T ss_pred HHHHHcCCChHHHHHHHHHHHHhCcHHH
Confidence 3456778999999999999999999875
No 63
>PF05920 Homeobox_KN: Homeobox KN domain; InterPro: IPR008422 This entry represents a homeobox transcription factor KN domain conserved from fungi to human and plants [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3K2A_B 2LK2_A 1X2N_A 2DMN_A.
Probab=52.45 E-value=14 Score=20.12 Aligned_cols=22 Identities=14% Similarity=0.374 Sum_probs=17.5
Q ss_pred HhhhhhcCCCCHHHHHHHHHHH
Q psy15328 22 ERGREGRDGRRDAEIQEWYKGF 43 (76)
Q Consensus 22 ~~~l~~~T~Fs~~EI~~wYr~F 43 (76)
.++|...|+++..+|..|+..-
T Consensus 16 k~~L~~~tgls~~Qi~~WF~Na 37 (40)
T PF05920_consen 16 KEELAKQTGLSRKQISNWFINA 37 (40)
T ss_dssp HHHHHHHHTS-HHHHHHHHHHH
T ss_pred HHHHHHHcCCCHHHHHHHHHHh
Confidence 3667889999999999998653
No 64
>PF09693 Phage_XkdX: Phage uncharacterised protein (Phage_XkdX); InterPro: IPR010022 This entry is represented by Bacteriophage 69, Orf86. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry identifies a family of small (about 50 amino acid) phage proteins, found in at least 12 different phage and prophage regions of Gram-positive bacteria. In a number of these phage, the gene for this protein is found near the holin and endolysin genes.
Probab=51.72 E-value=11 Score=20.50 Aligned_cols=19 Identities=32% Similarity=0.546 Sum_probs=15.5
Q ss_pred hhCCCCCCCHHHHHHHHhh
Q psy15328 45 KDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 45 kecPsG~it~~eFk~iy~~ 63 (76)
.-++.|.||.+||+.|-+.
T Consensus 20 ~~V~~g~IT~eey~eITGe 38 (40)
T PF09693_consen 20 NFVEAGWITKEEYKEITGE 38 (40)
T ss_pred HHhhcCeECHHHHHHhhCC
Confidence 3467899999999999664
No 65
>PF13384 HTH_23: Homeodomain-like domain; PDB: 2X48_C.
Probab=51.51 E-value=9.9 Score=20.37 Aligned_cols=29 Identities=21% Similarity=0.355 Sum_probs=15.7
Q ss_pred hhhhhcCCCCHHHHHHHHHHHhhhCCCCC
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFLKDCPTGH 51 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~FkkecPsG~ 51 (76)
+++....+.|+.-|.+|.++|.+.-+.|.
T Consensus 21 ~~ia~~lgvs~~Tv~~w~kr~~~~G~~gL 49 (50)
T PF13384_consen 21 REIAKRLGVSRSTVYRWIKRYREEGLEGL 49 (50)
T ss_dssp HHHHHHHTS-HHHHHHHHT----------
T ss_pred HHHHHHHCcCHHHHHHHHHHccccccccc
Confidence 55678889999999999999987666654
No 66
>PF05256 UPF0223: Uncharacterised protein family (UPF0223); InterPro: IPR007920 This family of proteins is functionally uncharacterised.; PDB: 2OY9_B.
Probab=50.96 E-value=9.9 Score=24.52 Aligned_cols=34 Identities=26% Similarity=0.379 Sum_probs=24.5
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcC
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~F 65 (76)
..+.++++|..-||+|++-.|+ +.|=++|...|-
T Consensus 31 E~gV~r~~ll~~Y~~FK~VVps----K~EEKql~r~Fe 64 (88)
T PF05256_consen 31 EKGVDREELLDAYRRFKKVVPS----KSEEKQLDREFE 64 (88)
T ss_dssp TT-EEHHHHHHHHHHHHHH-------HHHHHHHHHHHH
T ss_pred HccCcHHHHHHHHHHHHHHccc----HHHHHHHHHHHH
Confidence 4678999999999999999995 667777777653
No 67
>PF14788 EF-hand_10: EF hand; PDB: 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B 1DJZ_B 1DJY_B 1DJX_B 1QAT_A 1DJH_A ....
Probab=50.73 E-value=29 Score=20.26 Aligned_cols=33 Identities=21% Similarity=0.398 Sum_probs=21.4
Q ss_pred CCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 31 RRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 31 Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
.++.-...|++.-- ..-+|.|..+||..+|..+
T Consensus 18 ~~~~yA~~LFq~~D-~s~~g~Le~~Ef~~Fy~~L 50 (51)
T PF14788_consen 18 MDDEYARQLFQECD-KSQSGRLEGEEFEEFYKRL 50 (51)
T ss_dssp --HHHHHHHHHHH--SSSSSEBEHHHHHHHHHHH
T ss_pred cCHHHHHHHHHHhc-ccCCCCccHHHHHHHHHHh
Confidence 44444445544432 5679999999999999753
No 68
>PF05280 FlhC: Flagellar transcriptional activator (FlhC); InterPro: IPR007944 This family consists of several bacterial flagellar transcriptional activator (FlhC) proteins. FlhC combines with FlhD to form a regulatory complex in Escherichia coli, this complex has been shown to be a global regulator involved in many cellular processes as well as a flagellar transcriptional activator [].; GO: 0003677 DNA binding, 0030092 regulation of flagellum assembly, 0045893 positive regulation of transcription, DNA-dependent; PDB: 2AVU_E.
Probab=49.45 E-value=15 Score=26.04 Aligned_cols=30 Identities=23% Similarity=0.348 Sum_probs=18.8
Q ss_pred hhhhcCCCCHHHHHHHHHHHhh-hCCCCCCC
Q psy15328 24 GREGRDGRRDAEIQEWYKGFLK-DCPTGHLS 53 (76)
Q Consensus 24 ~l~~~T~Fs~~EI~~wYr~Fkk-ecPsG~it 53 (76)
.|...|.++.++|.+||+...- .-|.|.+.
T Consensus 28 ~l~~et~Ls~~rl~~Lykel~G~sppkG~lP 58 (175)
T PF05280_consen 28 VLESETGLSRERLRRLYKELHGVSPPKGMLP 58 (175)
T ss_dssp CHHHHSSS-HHHHHHHHHHHCSS----S---
T ss_pred HHHHHHCCCHHHHHHHHHHHcCCCCCCCCCC
Confidence 3567899999999999999875 46777764
No 69
>PF12674 Zn_ribbon_2: Putative zinc ribbon domain
Probab=49.33 E-value=48 Score=20.62 Aligned_cols=37 Identities=19% Similarity=0.317 Sum_probs=30.5
Q ss_pred CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCCCC
Q psy15328 30 GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFPYG 68 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP~G 68 (76)
++|-+|+...-..|+.+... +++++.+.++..++|.-
T Consensus 41 ~~t~eemie~~~~~~~~~~~--~~~~~a~~~~~~~lp~L 77 (81)
T PF12674_consen 41 DITMEEMIEFCVPFMDEFNG--MTPEEARKMMPRYLPTL 77 (81)
T ss_pred cCCHHHHHHHHHHHHHHhCC--CCHHHHHHHHHHHccCC
Confidence 47888888888888886654 89999999999988863
No 70
>PRK12722 transcriptional activator FlhC; Provisional
Probab=49.27 E-value=22 Score=25.68 Aligned_cols=31 Identities=19% Similarity=0.240 Sum_probs=25.4
Q ss_pred hhhhhcCCCCHHHHHHHHHHHhh-hCCCCCCC
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFLK-DCPTGHLS 53 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~Fkk-ecPsG~it 53 (76)
+.|...|.++.+.+.++|+...- --|.|.+.
T Consensus 27 q~leset~Ls~~rl~~Lyke~~G~spPkG~lP 58 (187)
T PRK12722 27 QVLESETQLSRERLIKLYKELRGVSPPKGMLP 58 (187)
T ss_pred HHHHHHHCcCHHHHHHHHHHHcCCCCCCCCCC
Confidence 45678899999999999999966 56677764
No 71
>PF00620 RhoGAP: RhoGAP domain; InterPro: IPR000198 Members of the Rho family of small G proteins transduce signals from plasma-membrane receptors and control cell adhesion, motility and shape by actin cytoskeleton formation. Like all other GTPases, Rho proteins act as molecular switches, with an active GTP-bound form and an inactive GDP-bound form. The active conformation is promoted by guanine-nucleotide exchange factors, and the inactive state by GTPase-activating proteins (GAPs) which stimulate the intrinsic GTPase activity of small G proteins. This entry is a Rho/Rac/Cdc42-like GAP domain, that is found in a wide variety of large, multi-functional proteins []. A number of structure are known for this family [, , ]. The domain is composed of seven alpha helices. This domain is also known as the breakpoint cluster region-homology (BH) domain.; GO: 0007165 signal transduction, 0005622 intracellular; PDB: 1RGP_A 1AM4_B 1GRN_B 2NGR_B 1OW3_A 1TX4_A 3BYI_B 1XA6_A 3FK2_B 1F7C_A ....
Probab=49.15 E-value=54 Score=20.84 Aligned_cols=36 Identities=17% Similarity=0.319 Sum_probs=31.7
Q ss_pred hcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 27 GRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 27 ~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
....++...|-.+.|.|.++.|.+.++.+.+..+..
T Consensus 47 ~~~~~~~~~va~~lK~~L~~lp~pli~~~~~~~~~~ 82 (151)
T PF00620_consen 47 NLENYDVHDVASLLKRFLRELPEPLIPSELYDKFIA 82 (151)
T ss_dssp TGTTSTHHHHHHHHHHHHHHSSSTSTTHHHHHHHHH
T ss_pred cccccChhhccccceeeeeccccchhhhhHHHHHhh
Confidence 346789999999999999999999999998887774
No 72
>KOG0031|consensus
Probab=49.11 E-value=36 Score=24.50 Aligned_cols=35 Identities=17% Similarity=0.326 Sum_probs=31.8
Q ss_pred CCHHHHHHHHHHHhh--hCCCCCCCHHHHHHHHhhcC
Q psy15328 31 RRDAEIQEWYKGFLK--DCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 31 Fs~~EI~~wYr~Fkk--ecPsG~it~~eFk~iy~~~F 65 (76)
|+..+||++-..|-- ..-.|.|.+++++.+|+...
T Consensus 26 f~q~QIqEfKEAF~~mDqnrDG~IdkeDL~d~~aSlG 62 (171)
T KOG0031|consen 26 FDQSQIQEFKEAFNLMDQNRDGFIDKEDLRDMLASLG 62 (171)
T ss_pred hhHHHHHHHHHHHHHHhccCCCcccHHHHHHHHHHcC
Confidence 899999999999966 78899999999999999865
No 73
>PF00046 Homeobox: Homeobox domain not present here.; InterPro: IPR001356 The homeobox domain was first identified in a number of drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates [, , ]. Hox genes encode homeodomain-containing transcriptional regulators that operate differential genetic programs along the anterior-posterior axis of animal bodies []. The domain binds DNA through a helix-turn-helix (HTH) structure. The HTH motif is characterised by two alpha-helices, which make intimate contacts with the DNA and are joined by a short turn. The second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA []. The first helix helps to stabilise the structure. The motif is very similar in sequence and structure in a wide range of DNA-binding proteins (e.g., cro and repressor proteins, homeotic proteins, etc.). One of the principal differences between HTH motifs in these different proteins arises from the stereo-chemical requirement for glycine in the turn which is needed to avoid steric interference of the beta-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2DA3_A 1LFB_A 2LFB_A 2ECB_A 2DA5_A 3D1N_O 3A03_A 2XSD_C 3CMY_A 1AHD_P ....
Probab=47.76 E-value=24 Score=19.26 Aligned_cols=23 Identities=9% Similarity=0.381 Sum_probs=18.3
Q ss_pred HHHhhhhhcCCCCHHHHHHHHHH
Q psy15328 20 KLERGREGRDGRRDAEIQEWYKG 42 (76)
Q Consensus 20 ~~~~~l~~~T~Fs~~EI~~wYr~ 42 (76)
.....|...++++...|+.||..
T Consensus 28 ~~~~~la~~l~l~~~~V~~WF~n 50 (57)
T PF00046_consen 28 EEREELAKELGLTERQVKNWFQN 50 (57)
T ss_dssp HHHHHHHHHHTSSHHHHHHHHHH
T ss_pred cccccccccccccccccccCHHH
Confidence 34566788889999999999864
No 74
>PF08707 PriCT_2: Primase C terminal 2 (PriCT-2) ; InterPro: IPR014819 This alpha helical domain is found at the C-terminal of primases. ; GO: 0016817 hydrolase activity, acting on acid anhydrides
Probab=47.27 E-value=45 Score=19.93 Aligned_cols=28 Identities=11% Similarity=0.243 Sum_probs=20.8
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCCCCHHHHH
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGHLSVEEFK 58 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk 58 (76)
..+|+.+|+...++.| .++|.||...+.
T Consensus 48 ~~ky~~~e~~~~W~s~---~~~~~it~~Tl~ 75 (78)
T PF08707_consen 48 SPKYDEEECERKWRSF---DRPGGITIGTLF 75 (78)
T ss_pred CCCCCHHHHHHHHHhC---CCCCCccHHHHH
Confidence 5579999999999999 444557766554
No 75
>PLN02964 phosphatidylserine decarboxylase
Probab=47.24 E-value=33 Score=28.97 Aligned_cols=33 Identities=12% Similarity=0.122 Sum_probs=28.1
Q ss_pred CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 30 GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..+++||+..++.|-+ .++|.|+.+||+.+...
T Consensus 211 ~~seEEL~eaFk~fDk-DgdG~Is~dEL~~vL~~ 243 (644)
T PLN02964 211 LVAANKKEELFKAADL-NGDGVVTIDELAALLAL 243 (644)
T ss_pred CCCHHHHHHHHHHhCC-CCCCcCCHHHHHHHHHh
Confidence 4788899999999855 46899999999999877
No 76
>PF03871 RNA_pol_Rpb5_N: RNA polymerase Rpb5, N-terminal domain; InterPro: IPR005571 Prokaryotes contain a single DNA-dependent RNA polymerase (RNAP; 2.7.7.6 from EC) that is responsible for the transcription of all genes, while eukaryotes have three classes of RNAPs (I-III) that transcribe different sets of genes. Each class of RNA polymerase is an assemblage of ten to twelve different polypeptides. Certain subunits of RNAPs, including RPB5 (POLR2E in mammals), are common to all three eukaryotic polymerases. RPB5 plays a role in the transcription activation process. Eukaryotic RPB5 has a bipartite structure consisting of a unique N-terminal region, plus a C-terminal region that is structurally homologous to the prokaryotic RPB5 homologue, subunit H (gene rpoH) (IPR000783 from INTERPRO) [, , , ]. This entry represents the N-terminal domain of eukaryotic RPB5, which has a core structure consisting of 3 layers alpha/beta/alpha []. The N-terminal domain is involved in DNA binding and is part of the jaw module in the RNA pol II structure []. This module is important for positioning the downstream DNA.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent, 0005634 nucleus; PDB: 3H0G_Q 1DZF_A 3GTG_E 2VUM_E 3GTP_E 3GTO_E 3S17_E 3S1R_E 1I3Q_E 1I50_E ....
Probab=46.01 E-value=26 Score=22.03 Aligned_cols=32 Identities=28% Similarity=0.367 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHhhh------------CCCCCCCHHHHHHHHhhc
Q psy15328 33 DAEIQEWYKGFLKD------------CPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 33 ~~EI~~wYr~Fkke------------cPsG~it~~eFk~iy~~~ 64 (76)
++|+.+|||..++- -..=.+++++|+..|+.-
T Consensus 4 ~~e~~rl~rirrTv~eMl~DRGY~V~~~el~~s~~~F~~~~~~~ 47 (93)
T PF03871_consen 4 DEEVSRLFRIRRTVMEMLRDRGYLVSEEELNMSLEEFKEKYGEN 47 (93)
T ss_dssp THHHHHHHHHHCCCCCCCCCCTEE--CCCCS--HHHHHHHCB-S
T ss_pred HHHHHHHHHHHHHHHHHHhcCCCccChhhhcCCHHHHHHHHccc
Confidence 48999999999882 223346999999999874
No 77
>cd00159 RhoGAP RhoGAP: GTPase-activator protein (GAP) for Rho-like GTPases; GAPs towards Rho/Rac/Cdc42-like small GTPases. Small GTPases (G proteins) cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when bound to GDP. The Rho family of small G proteins, which includes Cdc42Hs, activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. G proteins generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. The RhoGAPs are one of the major classes of regulators of Rho G proteins.
Probab=45.85 E-value=55 Score=20.99 Aligned_cols=35 Identities=11% Similarity=0.243 Sum_probs=29.3
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..++...+-.+.|.|.++.|.+.|+.+.+..+...
T Consensus 48 ~~~~~~~va~~lK~~l~~Lp~pli~~~~~~~~~~~ 82 (169)
T cd00159 48 EDYDVHDVASLLKLYLRELPEPLIPFELYDEFIEL 82 (169)
T ss_pred cccCHHHHHHHHHHHHHcCCCccCCHHHHHHHHHH
Confidence 45689999999999999999999998776666543
No 78
>PF03732 Retrotrans_gag: Retrotransposon gag protein ; InterPro: IPR005162 Transposable elements (TEs) promote various chromosomal rearrangements more efficiently, and often more specifically, than other cellular processes. Retrotransposons are structurally similar to retroviruses and are bounded by long terminal repeats. This entry represents eukaryotic Gag or capsid-related retrotranspon-related proteins. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved.
Probab=45.49 E-value=41 Score=19.31 Aligned_cols=33 Identities=21% Similarity=0.474 Sum_probs=26.9
Q ss_pred HHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCC
Q psy15328 34 AEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFP 66 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP 66 (76)
.....||+......+....+-++|+..+...|-
T Consensus 9 g~A~~w~~~~~~~~~~~~~~W~~~~~~~~~~f~ 41 (96)
T PF03732_consen 9 GPARQWYRNLRPNEIRDFITWEEFKDAFRKRFF 41 (96)
T ss_pred CHHHHHHHHhHhcCCCCCCCHHHHHHHHHHHHh
Confidence 456789999999888778899999888877763
No 79
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=44.97 E-value=33 Score=24.09 Aligned_cols=44 Identities=25% Similarity=0.392 Sum_probs=29.2
Q ss_pred hHHHhhhhh-cCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 19 TKLERGREG-RDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 19 ~~~~~~l~~-~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..|.+.++. .-.++.+|+..+.+.+- ..-+|.|+.++|.+++..
T Consensus 112 ~eL~~vl~~lge~~~deev~~ll~~~d-~d~dG~i~~~eF~~~~~~ 156 (160)
T COG5126 112 GELRRVLKSLGERLSDEEVEKLLKEYD-EDGDGEIDYEEFKKLIKD 156 (160)
T ss_pred HHHHHHHHhhcccCCHHHHHHHHHhcC-CCCCceEeHHHHHHHHhc
Confidence 344444442 33588888888887763 345888888888887643
No 80
>KOG0037|consensus
Probab=44.40 E-value=37 Score=25.31 Aligned_cols=19 Identities=11% Similarity=0.291 Sum_probs=12.2
Q ss_pred hCCCCCCCHHHHHHHHhhc
Q psy15328 46 DCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 46 ecPsG~it~~eFk~iy~~~ 64 (76)
..-||.|+.+|++.-+..+
T Consensus 135 ~D~SG~I~~sEL~~Al~~~ 153 (221)
T KOG0037|consen 135 RDRSGTIDSSELRQALTQL 153 (221)
T ss_pred cCCCCcccHHHHHHHHHHc
Confidence 4566777777776666554
No 81
>PF08667 BetR: BetR domain; InterPro: IPR013975 CheY-like phosphoacceptor (or receiver [REC]) domain is a common module in a variety of response regulators of the bacterial signal transduction systems. BetR is one of the many response regulators and is encoded mainly in Burkholderia spp. It is a N-terminal helix-turn-helix domain (HTH) and has been shown to be related to the XRE-type HTH domain (IPR001387 from INTERPRO), it has been suggested that BetR would have dimerization, protein-protein interaction, and activation/relief-of-inhibition properties [].
Probab=43.75 E-value=61 Score=22.58 Aligned_cols=38 Identities=24% Similarity=0.181 Sum_probs=30.2
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
...-|-|...+.|+-.||+++.+.| -++.++|.....+
T Consensus 37 ssa~RKL~G~~~ftl~EI~~Ia~~f-------gvS~d~l~g~~~~ 74 (147)
T PF08667_consen 37 SSAYRKLNGKSPFTLEEIKKIAKHF-------GVSPDELFGHSDN 74 (147)
T ss_pred HHHHHHhcCCCCCCHHHHHHHHHHh-------CcCHHHHHhhhhc
Confidence 3456778899999999999999999 4778888744444
No 82
>COG4476 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=42.34 E-value=29 Score=22.64 Aligned_cols=32 Identities=22% Similarity=0.365 Sum_probs=25.4
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
.+.++++|-.=||+||...|+ +.|=++|...|
T Consensus 32 ~gv~~~~ll~~Yr~FK~IVPs----K~eEKql~r~F 63 (90)
T COG4476 32 KGVDAEDLLGSYRRFKEIVPS----KAEEKQLGRDF 63 (90)
T ss_pred ccccHHHHHHHHHHHHHhcCc----hHHHHHHhHHH
Confidence 468899999999999999996 45556666554
No 83
>PHA02970 hypothetical protein; Provisional
Probab=42.02 E-value=27 Score=23.53 Aligned_cols=29 Identities=24% Similarity=0.252 Sum_probs=23.9
Q ss_pred HHHhhhhhcCCCCHHHHHHHHHHHhhhCC
Q psy15328 20 KLERGREGRDGRRDAEIQEWYKGFLKDCP 48 (76)
Q Consensus 20 ~~~~~l~~~T~Fs~~EI~~wYr~FkkecP 48 (76)
-|+.+++.-.-++.+||..+|-+|.+..-
T Consensus 26 dl~~D~sey~~YN~~EIn~lY~~FLk~h~ 54 (115)
T PHA02970 26 DLVHDLSEYEVYNPKEINSLYISFLKHHN 54 (115)
T ss_pred HHHccHHHHhcCCHHHHHHHHHHHHHHcc
Confidence 45567777778999999999999999654
No 84
>PF09570 RE_SinI: SinI restriction endonuclease; InterPro: IPR019070 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry includes the restriction endonuclease SinI, which recognises and cleaves the double-stranded sequence G^GWCC. ; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system
Probab=41.39 E-value=28 Score=25.98 Aligned_cols=30 Identities=30% Similarity=0.644 Sum_probs=21.6
Q ss_pred HHHHHHHHHhhh-----------CCCCCCCHHHHHHHHhhc
Q psy15328 35 EIQEWYKGFLKD-----------CPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 35 EI~~wYr~Fkke-----------cPsG~it~~eFk~iy~~~ 64 (76)
.|++|||-|.+. ..+-.|++++|+.+-.++
T Consensus 177 ~IkKW~R~~sk~~~TNW~~lp~~~~~~~LsE~~F~~fV~qY 217 (221)
T PF09570_consen 177 PIKKWFRTFSKTGETNWDNLPEMMGSYNLSEEDFKNFVEQY 217 (221)
T ss_pred chHHHHHhhcCCCccchhhhhhhhccCcCCHHHHHHHHHHH
Confidence 589999999872 224467888888766554
No 85
>PF13518 HTH_28: Helix-turn-helix domain
Probab=40.98 E-value=24 Score=18.67 Aligned_cols=30 Identities=17% Similarity=0.232 Sum_probs=23.6
Q ss_pred hhhhhcCCCCHHHHHHHHHHHhhhCCCCCC
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFLKDCPTGHL 52 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~FkkecPsG~i 52 (76)
.++...-+.+..-|..|.+.|...-+.|..
T Consensus 16 ~~~a~~~gis~~tv~~w~~~y~~~G~~~l~ 45 (52)
T PF13518_consen 16 REIAREFGISRSTVYRWIKRYREGGIEGLK 45 (52)
T ss_pred HHHHHHHCCCHhHHHHHHHHHHhcCHHHhc
Confidence 455677788999999999999986665543
No 86
>PF09821 AAA_assoc_C: C-terminal AAA-associated domain; InterPro: IPR018632 Members of this family are found in various prokaryotic ABC transporters, predominantly involved in nitrate, sulphonate and bicarbonate translocation.
Probab=40.75 E-value=42 Score=22.22 Aligned_cols=35 Identities=20% Similarity=0.324 Sum_probs=27.9
Q ss_pred HHHHHHHhhhCCCCCCCHHHHHHHHhhcCCCCCCC
Q psy15328 37 QEWYKGFLKDCPTGHLSVEEFKKIYGNFFPYGDAS 71 (76)
Q Consensus 37 ~~wYr~FkkecPsG~it~~eFk~iy~~~FP~Gda~ 71 (76)
...-++-..+.|++.+..+.|.++...++|..++.
T Consensus 66 ~~~I~~~L~~~~~~~~~~~~~~~~L~~~~~~~~ae 100 (120)
T PF09821_consen 66 AAHIRRVLRERPNHRLPEERFLDELEDHFSPEEAE 100 (120)
T ss_pred HHHHHHHHHhCCCCCCCHHHHHHHHHHHCChhHHH
Confidence 34445556678999999999999999999877654
No 87
>PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional
Probab=40.69 E-value=21 Score=26.18 Aligned_cols=36 Identities=11% Similarity=0.092 Sum_probs=27.5
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.-.|++++|..+.+.|+.-+-+|....+-.+++-..
T Consensus 197 r~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 232 (255)
T PRK12461 197 RRGFSSRAIRALKRAYKIIYRSGLSVQQAVAELELQ 232 (255)
T ss_pred hcCCCHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHh
Confidence 348999999999999999888877555555555444
No 88
>cd00252 SPARC_EC SPARC_EC; extracellular Ca2+ binding domain (containing 2 EF-hand motifs) of SPARC and related proteins (QR1, SC1/hevin, testican and tsc-36/FRP). SPARC (BM-40) is a multifunctional glycoprotein, a matricellular protein, that functions to regulate cell-matrix interactions; binds to such proteins as collagen and vitronectin and binds to endothelial cells thus inhibiting cellular proliferation. The EC domain interacts with a follistatin-like (FS) domain which appears to stabilize Ca2+ binding. The two EF-hands interact canonically but their conserved disulfide bonds confer a tight association between the EF-hand pair and an acid/amphiphilic N-terminal helix. Proposed active form involves a Ca2+ dependent symmetric homodimerization of EC-FS modules.
Probab=39.82 E-value=56 Score=21.40 Aligned_cols=13 Identities=38% Similarity=0.444 Sum_probs=7.5
Q ss_pred CCCCCCHHHHHHH
Q psy15328 48 PTGHLSVEEFKKI 60 (76)
Q Consensus 48 PsG~it~~eFk~i 60 (76)
.+|.|+.+|++.+
T Consensus 61 ~DG~Ls~~EL~~~ 73 (116)
T cd00252 61 YDGKLSHHELAPI 73 (116)
T ss_pred CCCcCCHHHHHHH
Confidence 5555665555544
No 89
>PF02885 Glycos_trans_3N: Glycosyl transferase family, helical bundle domain Prosite entry for Thymidine and pyrimidine-nucleoside phosphorylases; InterPro: IPR017459 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. The glycosyl transferase family includes anthranilate phosphoribosyltransferase (TrpD, 2.4.2.18 from EC) and thymidine phosphorylase (2.4.2.2 from EC). All these proteins can transfer a phosphorylated ribose substrate. Thymidine phosphorylase (2.4.2.2 from EC) catalyses the reversible phosphorolysis of thymidine, deoxyuridine and their analogues to their respective bases and 2-deoxyribose 1-phosphate. This enzyme regulates the availability of thymidine and is therefore essential to nucleic acid metabolism. This N-terminal domain is found in various family 3 glycosyl transferases, including anthranilate phosphoribosyltransferase (TrpD, 2.4.2.18 from EC) and thymidine phosphorylase (2.4.2.2 from EC). All these proteins can transfer a phosphorylated ribose substrate. Thymidine phosphorylase catalyses the reversible phosphorolysis of thymidine, deoxyuridine and their analogues to their respective bases and 2-deoxyribose 1-phosphate. This enzyme regulates the availability of thymidine and is therefore essential to nucleic acid metabolism.; PDB: 2DSJ_B 2ELC_B 2BPQ_A 1ZVW_B 3QR9_B 1V8G_B 2WK5_C 2J0F_C 2WK6_B 1UOU_A ....
Probab=39.72 E-value=76 Score=18.32 Aligned_cols=38 Identities=11% Similarity=0.205 Sum_probs=24.0
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHH
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKI 60 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~i 60 (76)
+.+.+.+...-+++.+|++.+++... +|.++..+.-.+
T Consensus 3 ~~~l~~l~~g~~Ls~~e~~~~~~~i~----~g~~s~~qiaAf 40 (66)
T PF02885_consen 3 KEILKKLRDGEDLSREEAKAAFDAIL----DGEVSDAQIAAF 40 (66)
T ss_dssp HHHHHHHHTT----HHHHHHHHHHHH----TTSS-HHHHHHH
T ss_pred HHHHHHHHcCCCCCHHHHHHHHHHHH----cCCCCHHHHHHH
Confidence 34667778888999999999999998 556666554433
No 90
>TIGR03738 PRTRC_C PRTRC system protein C. A novel genetic system characterized by six major proteins, included a ParB homolog and a ThiF homolog, is designated PRTRC, or ParB-Related,ThiF-Related Cassette. It is often found on plasmids. This protein family is designated PRTRC system protein C.
Probab=39.30 E-value=34 Score=21.11 Aligned_cols=22 Identities=23% Similarity=0.475 Sum_probs=20.3
Q ss_pred hCCCCCCCHHHHHHHHhhcCCC
Q psy15328 46 DCPTGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 46 ecPsG~it~~eFk~iy~~~FP~ 67 (76)
..|+..++.++-+++|+..+|-
T Consensus 17 ~DP~p~~spe~V~dfYs~~YPe 38 (66)
T TIGR03738 17 ADPSPAMSPEQVRDFYSAQYPE 38 (66)
T ss_pred CCCCCCCCHHHHHHHHhccCch
Confidence 6899999999999999999984
No 91
>PRK00095 mutL DNA mismatch repair protein; Reviewed
Probab=38.07 E-value=57 Score=26.85 Aligned_cols=39 Identities=18% Similarity=0.401 Sum_probs=32.0
Q ss_pred hhhcCCCCHHHHHHHHHHHhh-----hCCCCC-----CCHHHHHHHHhh
Q psy15328 25 REGRDGRRDAEIQEWYKGFLK-----DCPTGH-----LSVEEFKKIYGN 63 (76)
Q Consensus 25 l~~~T~Fs~~EI~~wYr~Fkk-----ecPsG~-----it~~eFk~iy~~ 63 (76)
++..+.++.+|++.|-+...+ .||-|+ |++.|+.+.|+.
T Consensus 568 Ik~g~~Ls~~E~~~Ll~~L~~~~~P~~CPHGRPt~i~l~~~elek~F~R 616 (617)
T PRK00095 568 IRAGRRLTLEEMNALLRQLEATENPGTCPHGRPTYIELSLSDLEKLFKR 616 (617)
T ss_pred hhccCCCCHHHHHHHHHHHHhcccccCCCCCCeeEEECCHHHHHHHhCc
Confidence 567788999999999988777 699996 588888887753
No 92
>PRK00915 2-isopropylmalate synthase; Validated
Probab=37.33 E-value=98 Score=24.95 Aligned_cols=38 Identities=11% Similarity=0.075 Sum_probs=30.5
Q ss_pred CCCCHHHHHHHHHHHhhh-CCCCCCCHHHHHHHHhhcCC
Q psy15328 29 DGRRDAEIQEWYKGFLKD-CPTGHLSVEEFKKIYGNFFP 66 (76)
Q Consensus 29 T~Fs~~EI~~wYr~Fkke-cPsG~it~~eFk~iy~~~FP 66 (76)
-.++++++..+++++++. .-.+.++.+++..|+...+.
T Consensus 346 ~~~~~~~~~~~~~~vk~~~~~~~~~~~~~l~~l~~~~~~ 384 (513)
T PRK00915 346 YKLSDEELDKAFERFKELADKKKEVFDEDLEALVEDETQ 384 (513)
T ss_pred CCCCHHHHHHHHHHHHHHHhcCCCcCHHHHHHHHHHHhc
Confidence 456889999999999983 23457999999999976665
No 93
>KOG0031|consensus
Probab=37.27 E-value=64 Score=23.26 Aligned_cols=35 Identities=23% Similarity=0.248 Sum_probs=27.6
Q ss_pred hcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 27 GRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 27 ~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
....|+.+||..+||.+- ..-.|.+....|..|..
T Consensus 130 ~gDr~~~eEV~~m~r~~p-~d~~G~~dy~~~~~~it 164 (171)
T KOG0031|consen 130 MGDRFTDEEVDEMYREAP-IDKKGNFDYKAFTYIIT 164 (171)
T ss_pred hcccCCHHHHHHHHHhCC-cccCCceeHHHHHHHHH
Confidence 456799999999999763 11269999999998876
No 94
>PF02257 RFX_DNA_binding: RFX DNA-binding domain; InterPro: IPR003150 RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1DP7_P 2KW3_A.
Probab=37.05 E-value=68 Score=20.32 Aligned_cols=38 Identities=18% Similarity=0.333 Sum_probs=29.6
Q ss_pred CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCCC
Q psy15328 30 GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP~ 67 (76)
-.++.+|-.=|..|=++..-..|+...|=++....||+
T Consensus 23 svpR~~lY~~Y~~~C~~~~~~pln~AsFGKlir~vFP~ 60 (85)
T PF02257_consen 23 SVPRSDLYAHYLSFCEKNGIKPLNAASFGKLIRQVFPN 60 (85)
T ss_dssp EEEHHHHHHHHHHHHHHTT-----HHHHHHHHHHHSTT
T ss_pred ccchHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHcCC
Confidence 46788999999999888888889999999999999997
No 95
>PHA03102 Small T antigen; Reviewed
Probab=37.00 E-value=1e+02 Score=21.41 Aligned_cols=40 Identities=13% Similarity=-0.090 Sum_probs=27.4
Q ss_pred HHhhhhhcCCC--CHHHHHHHHHHHhh-hCCCCCCCHHHHHHH
Q psy15328 21 LERGREGRDGR--RDAEIQEWYKGFLK-DCPTGHLSVEEFKKI 60 (76)
Q Consensus 21 ~~~~l~~~T~F--s~~EI~~wYr~Fkk-ecPsG~it~~eFk~i 60 (76)
+.+.|--...- |.+||++-||+..+ -+|+-.-+.+.|+.|
T Consensus 7 l~~vLGl~~~A~~s~~eIKkAYr~la~~~HPDkgg~~e~~k~i 49 (153)
T PHA03102 7 LMDLLGLPRSAWGNLPLMRKAYLRKCLEFHPDKGGDEEKMKEL 49 (153)
T ss_pred HHHHcCCCCCCCCCHHHHHHHHHHHHHHHCcCCCchhHHHHHH
Confidence 33444444556 89999999999854 677765566777654
No 96
>smart00411 BHL bacterial (prokaryotic) histone like domain.
Probab=36.58 E-value=58 Score=19.36 Aligned_cols=27 Identities=15% Similarity=0.244 Sum_probs=23.9
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
..|.+++...|+++++++......|..
T Consensus 4 ~eli~~ia~~~~~~~~~v~~vl~~l~~ 30 (90)
T smart00411 4 SELIDAIAEKAGLSKKDAKAAVDAFLE 30 (90)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 457788899999999999999999976
No 97
>PF00220 Hormone_4: Neurohypophysial hormones, N-terminal Domain; InterPro: IPR022423 Oxytocin (or ocytocin) and vasopressin [] are small (nine amino acid residues), structurally and functionally related neurohypophysial peptide hormones. Oxytocin causes contraction of the smooth muscle of the uterus and of the mammary gland while vasopressin has a direct antidiuretic action on the kidney and also causes vasoconstriction of the peripheral vessels. Like the majority of active peptides, both hormones are synthesized as larger protein precursors that are enzymatically converted to their mature forms. Peptides belonging to this family are also found in birds, fish, reptiles and amphibians (mesotocin, isotocin, valitocin, glumitocin, aspargtocin, vasotocin, seritocin, asvatocin, phasvatocin), in worms (annetocin), octopi (cephalotocin), locust (locupressin or neuropeptide F1/F2) and in molluscs (conopressins G and S) []. The pattern developed to detect this category of peptides spans their entire sequence and includes four invariant amino acid residues. .; GO: 0005185 neurohypophyseal hormone activity, 0005576 extracellular region
Probab=36.56 E-value=15 Score=14.62 Aligned_cols=8 Identities=38% Similarity=1.576 Sum_probs=5.3
Q ss_pred HhhhCCCC
Q psy15328 43 FLKDCPTG 50 (76)
Q Consensus 43 FkkecPsG 50 (76)
|.+.||.|
T Consensus 2 ~i~nCP~G 9 (9)
T PF00220_consen 2 YIRNCPIG 9 (9)
T ss_pred ccccCCCC
Confidence 44678876
No 98
>PF13950 Epimerase_Csub: UDP-glucose 4-epimerase C-term subunit; PDB: 1EK5_A 1I3K_B 1I3M_B 1HZJ_A 1EK6_A 1I3N_A 1I3L_A 2CNB_B 1GY8_D 1NAI_A ....
Probab=36.23 E-value=77 Score=18.47 Aligned_cols=40 Identities=13% Similarity=-0.058 Sum_probs=27.4
Q ss_pred CccccchhHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCC
Q psy15328 12 AKGWNSGTKLERGREGRDGRRDAEIQEWYKGFLKDCPTGH 51 (76)
Q Consensus 12 ~~~~~~~~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~ 51 (76)
+..++.-.+..+.|-....++-+|+-+=.=+|.+..|+|.
T Consensus 23 ~~~~Ad~~kA~~~LgW~p~~~L~~~i~~~w~W~~~np~Gy 62 (62)
T PF13950_consen 23 AHLVADISKAREELGWKPKYSLEDMIRDAWNWQKKNPNGY 62 (62)
T ss_dssp SEE-B--HHHHHHC----SSSHHHHHHHHHHHHHHSTTTT
T ss_pred hhhhCCHHHHHHHhCCCcCCCHHHHHHHHHHHHHHCcCCC
Confidence 4567888888899999999999998877777999999984
No 99
>smart00324 RhoGAP GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.
Probab=35.93 E-value=98 Score=20.29 Aligned_cols=35 Identities=14% Similarity=0.243 Sum_probs=30.5
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..++...+-.+.|.|.++-|.+.|+.+.+..+...
T Consensus 52 ~~~~~~~va~~lK~~Lr~Lp~pli~~~~~~~~~~~ 86 (174)
T smart00324 52 SEYDVHDVAGLLKLFLRELPEPLIPYELYEEFIEA 86 (174)
T ss_pred ccCCHHHHHHHHHHHHHhCCCccCCHHHHHHHHHH
Confidence 45789999999999999999999999888776543
No 100
>PF01465 GRIP: GRIP domain; InterPro: IPR000237 The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain [, , ] is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi []. The GRIP domain contains a completely conserved tyrosine residue.; GO: 0005515 protein binding, 0000042 protein targeting to Golgi; PDB: 1R4A_H 1UPT_B.
Probab=35.58 E-value=55 Score=18.17 Aligned_cols=27 Identities=11% Similarity=-0.226 Sum_probs=18.0
Q ss_pred cccchhHHHhhhhhcCCCCHHHHHHHH
Q psy15328 14 GWNSGTKLERGREGRDGRRDAEIQEWY 40 (76)
Q Consensus 14 ~~~~~~~~~~~l~~~T~Fs~~EI~~wY 40 (76)
.++....|..-|...=+||++|.++++
T Consensus 20 ~~~~~~~llpvi~tlL~fs~~e~~~i~ 46 (46)
T PF01465_consen 20 EPSEREQLLPVIATLLKFSPEEKQKIL 46 (46)
T ss_dssp S---HHHHHHHHHHHTT--HHHHHHHH
T ss_pred chhhHHHHHHHHHHHHCCCHHHHHhhC
Confidence 366677788888888899999998874
No 101
>cd00591 HU_IHF Integration host factor (IHF) and HU are small heterodimeric members of the DNABII protein family that bind and bend DNA, functioning as architectural factors in many cellular processes including transcription, site-specific recombination, and higher-order nucleoprotein complex assembly. The dimer subunits associate to form a compact globular core from which two beta ribbon arms (one from each subunit) protrude. The beta arms track and bind the DNA minor groove. Despite sequence and structural similarity, IHF and HU can be distinguished by their different DNA substrate preferences.
Probab=35.27 E-value=58 Score=19.17 Aligned_cols=27 Identities=15% Similarity=0.250 Sum_probs=23.7
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
..|.+.|...|+++..++......|..
T Consensus 3 ~~l~~~ia~~~~~~~~~v~~vl~~~~~ 29 (87)
T cd00591 3 SELIEAIAEKTGLSKKDAEAAVDAFLD 29 (87)
T ss_pred HHHHHHHHHHhCcCHHHHHHHHHHHHH
Confidence 457788999999999999999998875
No 102
>PF00216 Bac_DNA_binding: Bacterial DNA-binding protein; InterPro: IPR000119 Bacteria synthesise a set of small, usually basic proteins of about 90 residues that bind DNA and are known as histone-like proteins [, ]. Examples include the HU protein in Escherichia coli is a dimer of closely related alpha and beta chains and in other bacteria can be a dimer of identical chains. HU-type proteins have been found in a variety of eubacteria, cyanobacteria and archaebacteria, and are also encoded in the chloroplast genome of some algae []. The integration host factor (IHF), a dimer of closely related chains which seem to function in genetic recombination as well as in translational and transcriptional control [] is found in enterobacteria and viral proteins include the African Swine fever virus protein A104R (or LMW5-AR) []. The exact function of these proteins is not yet clear but they are capable of wrapping DNA and stabilising it from denaturation under extreme environmental conditions. The structure is known for one of these proteins []. The protein exists as a dimer and two "beta-arms" function as the non-specific binding site for bacterial DNA. ; GO: 0003677 DNA binding; PDB: 3C4I_B 2O97_A 1MUL_A 1P78_A 1P51_C 1P71_B 2HT0_A 1OWG_A 2IIF_A 1OUZ_A ....
Probab=34.60 E-value=53 Score=19.43 Aligned_cols=28 Identities=7% Similarity=0.209 Sum_probs=24.0
Q ss_pred hhHHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 18 GTKLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 18 ~~~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
-..|.+++...|.+++.++......|..
T Consensus 3 k~eli~~ia~~~~~s~~~v~~vl~~~~~ 30 (90)
T PF00216_consen 3 KKELIKRIAEKTGLSKKDVEAVLDALFD 30 (90)
T ss_dssp HHHHHHHHHHHHTSSHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHH
Confidence 3567888999999999999999998864
No 103
>PRK14561 hypothetical protein; Provisional
Probab=34.59 E-value=35 Score=23.76 Aligned_cols=29 Identities=7% Similarity=0.260 Sum_probs=25.2
Q ss_pred CCCHHHHHHHHHHHhh--hCCCCCCCHHHHH
Q psy15328 30 GRRDAEIQEWYKGFLK--DCPTGHLSVEEFK 58 (76)
Q Consensus 30 ~Fs~~EI~~wYr~Fkk--ecPsG~it~~eFk 58 (76)
.++++||+++-+.+.. .-|+|.|...++.
T Consensus 133 ~~~K~eI~~la~~l~~~~~~~~~~~~~~~~~ 163 (194)
T PRK14561 133 GFGRKTIDRLVERLFEIEEGESEEIPKSDYE 163 (194)
T ss_pred CCCHHHHHHHHHhhEEEEeccCCCcCccchH
Confidence 7999999999999998 7899999776654
No 104
>COG4397 Mu-like prophage major head subunit gpT [General function prediction only]
Probab=34.42 E-value=41 Score=25.86 Aligned_cols=45 Identities=18% Similarity=0.300 Sum_probs=34.6
Q ss_pred CCCCHHHHHHHHHHHhhhCCCC-CCCHHHHHHHHhhcCCCCCCCCc
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTG-HLSVEEFKKIYGNFFPYGDASKY 73 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG-~it~~eFk~iy~~~FP~Gda~~f 73 (76)
|..+++-|+.|+-+|++.+..| .+.++++++|-.........++|
T Consensus 2 ~~iTPd~ikALftGf~knFqdGl~mApSQYkeIAtvvpSttaSNTY 47 (308)
T COG4397 2 TPITPDMIKALFTGFRKNFQDGLSMAPSQYKEIATVVPSTTASNTY 47 (308)
T ss_pred CccCHHHHHHHHHhhhhhhhhhhhhChhhhhhheeeccccccccch
Confidence 5678999999999999999999 57888999987664333334443
No 105
>TIGR01889 Staph_reg_Sar staphylococcal accessory regulator family. This model represents a family of transcriptional regulatory proteins in Staphylococcus aureus and Staphylococcus epidermidis. Some members contain two tandem copies of this region. This family is related to the MarR transcriptional regulator family described by pfam model pfam01047.
Probab=34.01 E-value=1.2e+02 Score=18.89 Aligned_cols=46 Identities=17% Similarity=-0.045 Sum_probs=35.3
Q ss_pred hhHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 18 GTKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 18 ~~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
=..+.+.+....+++..|..-|+--.+-+.|.|.++..+.......
T Consensus 10 ~~~~~~~l~~~~~ls~~q~~vL~~l~~~~~~~~~~t~~eL~~~l~~ 55 (109)
T TIGR01889 10 IKSLKRYLKKEFNLSLEELLILYYLGKLENNEGKLTLKEIIKEILI 55 (109)
T ss_pred HHHHHHHHHHHcCCCHHHHHHHHHHHhhhccCCcCcHHHHHHHHCC
Confidence 3566777888889999999888776655667899999988766543
No 106
>PF04857 CAF1: CAF1 family ribonuclease; InterPro: IPR006941 CAF1 is an RNase of the DEDD superfamily, and a subunit of the Ccr4-Not complex that mediates 3' to 5' mRNA deadenylation. The major pathways of mRNA turnover in eukaryotes initiate with shortening of the poly(A) tail. CAF1 P39008 from SWISSPROT encodes a critical component of the major cytoplasmic deadenylase in yeast. Caf1p is required for normal mRNA deadenylation in vivo and localises to the cytoplasm. Caf1p copurifies with a Ccr4p-dependent poly(A)-specific exonuclease activity. Some members of this family contain a single-stranded nucleic acid binding domain, R3H.; GO: 0005634 nucleus; PDB: 3D45_B 1UG8_A 2D5R_A 2A1S_C 2A1R_A 2FC6_A 1UOC_A 3G10_A 2P51_A 3G0Z_A.
Probab=33.54 E-value=65 Score=23.34 Aligned_cols=35 Identities=29% Similarity=0.509 Sum_probs=27.7
Q ss_pred CCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCCC
Q psy15328 30 GRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP~ 67 (76)
|=---+|-.+|+.|...-| -+.+||+......||.
T Consensus 155 hn~~~Dl~~l~~~f~~~LP---~t~~eF~~~~~~~FP~ 189 (262)
T PF04857_consen 155 HNGLYDLMYLYKKFIGPLP---ETLEEFKELLRELFPR 189 (262)
T ss_dssp SSTHHHHHHHHHHHTTS-----SSHHHHHHHHHHHSSS
T ss_pred eChHhHHHHHHHHhcCCCC---CCHHHHHHHHHHHCcc
Confidence 3344578899999998777 6789999999999996
No 107
>PF08356 EF_assoc_2: EF hand associated; InterPro: IPR013567 This region predominantly appears near EF-hands (IPR002048 from INTERPRO) in GTP-binding proteins. It is found in all three eukaryotic kingdoms.
Probab=33.52 E-value=1.2e+02 Score=19.38 Aligned_cols=36 Identities=17% Similarity=0.289 Sum_probs=28.6
Q ss_pred CCCCHHHHHHHHHHHhhhCCCC----CCCHHHHHHHHhhc
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTG----HLSVEEFKKIYGNF 64 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG----~it~~eFk~iy~~~ 64 (76)
+.++.+||..+-+--.+.+|+| .||.+-|..+-..|
T Consensus 2 ~pL~~~el~~ik~~v~~~~~~gv~~~GiT~~GFl~L~~lf 41 (89)
T PF08356_consen 2 KPLQPQELEDIKKVVRENIPDGVNDNGITLDGFLFLNKLF 41 (89)
T ss_pred CCCCHHHHHHHHHHHHHHCCCCcCCCccchhhHHHHHHHH
Confidence 5688999999998888888887 46888887665544
No 108
>PRK12860 transcriptional activator FlhC; Provisional
Probab=33.32 E-value=56 Score=23.68 Aligned_cols=31 Identities=19% Similarity=0.207 Sum_probs=24.8
Q ss_pred hhhhhcCCCCHHHHHHHHHHHhh-hCCCCCCC
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFLK-DCPTGHLS 53 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~Fkk-ecPsG~it 53 (76)
+.|...|.++.+.+.++||.-.- --|.|.+.
T Consensus 27 q~leset~Ls~~rl~~Lyke~~G~SpPkG~lP 58 (189)
T PRK12860 27 QVLESETTLSRDRLIRLYKEVRGVSPPKGMLP 58 (189)
T ss_pred HHHHHHHCCCHHHHHHHHHHHcCCCCCCCCCC
Confidence 45678899999999999998865 56677664
No 109
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=32.95 E-value=85 Score=17.34 Aligned_cols=24 Identities=21% Similarity=0.304 Sum_probs=19.1
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCC
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGH 51 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~ 51 (76)
..+.+.+||++-|++..+.+..-.
T Consensus 9 ~~~~~~~eik~~y~~l~~~~HPD~ 32 (64)
T PF00226_consen 9 PPDASDEEIKKAYRRLSKQYHPDK 32 (64)
T ss_dssp TTTSSHHHHHHHHHHHHHHTSTTT
T ss_pred CCCCCHHHHHHHHHhhhhcccccc
Confidence 347889999999999999665444
No 110
>PF15658 Latrotoxin_C: Latrotoxin C-terminal domain
Probab=32.72 E-value=86 Score=21.57 Aligned_cols=30 Identities=30% Similarity=0.623 Sum_probs=21.1
Q ss_pred HHHHHHHHHhh-hCCCCCCCHHHHHHHHhhc
Q psy15328 35 EIQEWYKGFLK-DCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 35 EI~~wYr~Fkk-ecPsG~it~~eFk~iy~~~ 64 (76)
||-+.--.|.+ .||.|.|+...|.++...|
T Consensus 87 eIs~~L~Sy~e~a~p~~~Ls~kq~~kfl~~f 117 (127)
T PF15658_consen 87 EISKILCSYAEKACPGSKLSPKQFDKFLAEF 117 (127)
T ss_pred HHHHHHHHHHHHhCcccCCCHHHHHHHHHHH
Confidence 44444444444 7999999999998887664
No 111
>KOG4094|consensus
Probab=32.23 E-value=53 Score=23.63 Aligned_cols=20 Identities=30% Similarity=0.537 Sum_probs=16.7
Q ss_pred hcCCCCHHHHHHHHHHHhhh
Q psy15328 27 GRDGRRDAEIQEWYKGFLKD 46 (76)
Q Consensus 27 ~~T~Fs~~EI~~wYr~Fkke 46 (76)
...+.+.+|+-..||.|+.+
T Consensus 120 ~~~~vsA~ems~FYk~FL~k 139 (178)
T KOG4094|consen 120 RLEHVSANEMSEFYKDFLNK 139 (178)
T ss_pred cccccCHHHHHHHHHHHHhh
Confidence 46679999999999999764
No 112
>TIGR00973 leuA_bact 2-isopropylmalate synthase, bacterial type. A larger family of homologous proteins includes homocitrate synthase, distinct lineages of 2-isopropylmalate synthase, several distinct, uncharacterized, orthologous sets in the Archaea, and other related enzymes. This model describes a family of 2-isopropylmalate synthases found primarily in Bacteria. The homologous families in the Archaea may represent isozymes and/or related enzymes.
Probab=31.60 E-value=1e+02 Score=24.86 Aligned_cols=39 Identities=10% Similarity=0.046 Sum_probs=30.7
Q ss_pred cCCCCHHHHHHHHHHHhhh-CCCCCCCHHHHHHHHhhcCC
Q psy15328 28 RDGRRDAEIQEWYKGFLKD-CPTGHLSVEEFKKIYGNFFP 66 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~Fkke-cPsG~it~~eFk~iy~~~FP 66 (76)
.-.+++++++.+++++|.. .-.+.++.+++..|+...+.
T Consensus 342 g~~l~~~~~~~~~~~vk~~~~~~~~~~~~~l~~l~~~~~~ 381 (494)
T TIGR00973 342 GFKLDDEELDKLFEKFKELADKKKEVTDEDLEALVFEEKR 381 (494)
T ss_pred CCCCCHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHhc
Confidence 3457889999999999983 22457999999999976554
No 113
>PF06902 Fer4_19: Divergent 4Fe-4S mono-cluster; InterPro: IPR010693 This entry represents bacterial ferredoxins such Ferredoxin-1, -2 and -soy from Streptomyces griseolus and Ferredoxin fas2 from Rhodococcus fascians, plus several bacterial hypothetical proteins that contain three highly conserved cysteine residues. These ferredoxins each bind a 3Fe-4S cluster. Ferredoxin-soy (SoyB) act as electron transport protein for the cytochrome P450-SOY system []. Ferredoxin-1 (SuaB) and Ferredoxin-2 (SubB) act as electron transport proteins for the herbicide-metabolising cytochrome P-450 SU1 and SU2 systems, respectively [, ]. Ferredoxin-fas2 also plays a role in electrontransfer, the fas operon encoding genes involved in cytokinin production and in host plant fasciation (leafy gall).
Probab=31.54 E-value=19 Score=21.44 Aligned_cols=11 Identities=45% Similarity=1.014 Sum_probs=8.5
Q ss_pred hhhCCCCCCCH
Q psy15328 44 LKDCPTGHLSV 54 (76)
Q Consensus 44 kkecPsG~it~ 54 (76)
...||||.|+.
T Consensus 51 v~~CPSGAL~~ 61 (64)
T PF06902_consen 51 VDRCPSGALSY 61 (64)
T ss_pred HHcCCccCcEE
Confidence 34899999864
No 114
>PRK10664 transcriptional regulator HU subunit beta; Provisional
Probab=30.72 E-value=68 Score=19.91 Aligned_cols=27 Identities=7% Similarity=-0.004 Sum_probs=23.8
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
..|++.|...++++.+++...+..|..
T Consensus 4 ~eli~~ia~~~~~s~~~~~~~v~~~~~ 30 (90)
T PRK10664 4 SQLIDKIAAGADISKAAAGRALDAIIA 30 (90)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 457788899999999999999999976
No 115
>TIGR03760 ICE_TraI_Pfluor integrating conjugative element relaxase, PFL_4751 family. Members of this protein family are the TraI putative relaxases required for transfer by a subclass of integrating conjugative elements (ICE) as found in Pseudomonas fluorescens Pf-5, and understood from study of two related ICE, SXT and R391. This model represents the N-terminal domain. Note that no homology is detected to the similarly named TraI relaxase of the F plasmid.
Probab=30.60 E-value=50 Score=23.95 Aligned_cols=26 Identities=15% Similarity=0.038 Sum_probs=23.0
Q ss_pred HHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 20 KLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 20 ~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
.++++|...+.++..+++++|.....
T Consensus 20 ~ll~~i~~~~~l~~~~~~~l~~~~l~ 45 (218)
T TIGR03760 20 QLLQQLWRNSSLSAAVYEQLYLQPLE 45 (218)
T ss_pred HHHHHHHHHcCCChHHHHHHHHHHHH
Confidence 68899999999999999999987753
No 116
>PF08763 Ca_chan_IQ: Voltage gated calcium channel IQ domain; InterPro: IPR014873 Ca2+ ions are unique in that they not only carry charge but they are also the most widely used of diffusible second messengers. Voltage-dependent Ca2+ channels (VDCC) are a family of molecules that allow cells to couple electrical activity to intracellular Ca2+ signalling. The opening and closing of these channels by depolarizing stimuli, such as action potentials, allows Ca2+ ions to enter neurons down a steep electrochemical gradient, producing transient intracellular Ca2+ signals. Many of the processes that occur in neurons, including transmitter release, gene transcription and metabolism are controlled by Ca2+ influx occurring simultaneously at different cellular locales. The pore is formed by the alpha-1 subunit which incorporates the conduction pore, the voltage sensor and gating apparatus, and the known sites of channel regulation by second messengers, drugs, and toxins []. The activity of this pore is modulated by 4 tightly-coupled subunits: an intracellular beta subunit; a transmembrane gamma subunit; and a disulphide-linked complex of alpha-2 and delta subunits, which are proteolytically cleaved from the same gene product. Properties of the protein including gating voltage-dependence, G protein modulation and kinase susceptibility can be influenced by these subunits. Voltage-gated calcium channels are classified as T, L, N, P, Q and R, and are distinguished by their sensitivity to pharmacological blocks, single-channel conductance kinetics, and voltage-dependence. On the basis of their voltage activation properties, the voltage-gated calcium classes can be further divided into two broad groups: the low (T-type) and high (L, N, P, Q and R-type) threshold-activated channels. The voltage-gated calcium channel alpha 1 subunit contains an IQ domain, named for its isoleucine-glutamine (IQ) motif, which interacts with hydrophobic pockets of Ca2+/calmodulin []. The interaction regulates two self-regulatory calcium dependent feedback mechanisms, calcium dependent inactivation (CDI), and calcium-dependent facilitation (CDF). ; PDB: 3OXQ_F 2F3Z_B 3G43_E 2F3Y_B 2BE6_D 3DVM_B 3BXK_D 2VAY_B 3DVK_B 3BXL_B ....
Probab=30.54 E-value=38 Score=18.48 Aligned_cols=10 Identities=40% Similarity=0.946 Sum_probs=8.7
Q ss_pred HHHHHHHHhh
Q psy15328 36 IQEWYKGFLK 45 (76)
Q Consensus 36 I~~wYr~Fkk 45 (76)
|+.+||.|++
T Consensus 15 I~dyfr~~K~ 24 (35)
T PF08763_consen 15 IQDYFRQFKK 24 (35)
T ss_dssp HHHHHHHHHH
T ss_pred HHHHHHHHHH
Confidence 7889999986
No 117
>PLN03228 methylthioalkylmalate synthase; Provisional
Probab=30.26 E-value=1.3e+02 Score=24.55 Aligned_cols=36 Identities=14% Similarity=0.189 Sum_probs=29.7
Q ss_pred cCCCCHHHHHHHHHHHhh-hCCCCCCCHHHHHHHHhh
Q psy15328 28 RDGRRDAEIQEWYKGFLK-DCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~Fkk-ecPsG~it~~eFk~iy~~ 63 (76)
.-+.+++++..++++++. ..-.+.|+.+|+..|+..
T Consensus 440 G~~~~~~~~~~l~~~vk~~a~~~~~l~~~el~~i~~~ 476 (503)
T PLN03228 440 GYELDDEKLNEVFSRFRDLTKEKKRITDADLKALVVN 476 (503)
T ss_pred CCCCCHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHhc
Confidence 456789999999999998 233467999999999876
No 118
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=29.91 E-value=65 Score=14.86 Aligned_cols=17 Identities=24% Similarity=0.509 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHhhhCCC
Q psy15328 33 DAEIQEWYKGFLKDCPT 49 (76)
Q Consensus 33 ~~EI~~wYr~FkkecPs 49 (76)
.++-...|+++.+++|+
T Consensus 16 ~~~A~~~~~~~~~~~P~ 32 (33)
T PF13174_consen 16 YDEAIEYFQRLIKRYPD 32 (33)
T ss_dssp HHHHHHHHHHHHHHSTT
T ss_pred HHHHHHHHHHHHHHCcC
Confidence 35667888999888885
No 119
>TIGR02688 conserved hypothetical protein TIGR02688. Members of this family are uncharacterized proteins sporadically distributed in bacteria and archaea, about 470 amino acids in length. Several members of this family appear in public databases with annotation as ATP-dependent protease La, despite the lack of similarity to families TIGR00763 (ATP-dependent protease La) or pfam02190 (ATP-dependent protease La (LON) domain). This protein is repeatedly found downstream of another uncharacterized protein of about 880 amino acids in length, described by model TIGR02687.
Probab=29.84 E-value=84 Score=25.69 Aligned_cols=31 Identities=26% Similarity=0.523 Sum_probs=25.9
Q ss_pred CCHHHHHHHHHHHhh-hCCCCCCCHHHHHHHH
Q psy15328 31 RRDAEIQEWYKGFLK-DCPTGHLSVEEFKKIY 61 (76)
Q Consensus 31 Fs~~EI~~wYr~Fkk-ecPsG~it~~eFk~iy 61 (76)
=+..-|+++..+++| -+|+|.++.+||+.+.
T Consensus 399 RD~~aV~kt~SgllKLL~P~~~~~~ee~~~~l 430 (449)
T TIGR02688 399 RDVIAVKKTFSGLMKILFPHGTITKEEFTECL 430 (449)
T ss_pred hhHHHHHHHHHHHHHHhCCCCCCCHHHHHHHH
Confidence 356678899999999 5999999999998754
No 120
>PRK09389 (R)-citramalate synthase; Provisional
Probab=29.75 E-value=1.1e+02 Score=24.71 Aligned_cols=37 Identities=8% Similarity=0.003 Sum_probs=29.8
Q ss_pred CCCCHHHHHHHHHHHhhh-CCCCCCCHHHHHHHHhhcC
Q psy15328 29 DGRRDAEIQEWYKGFLKD-CPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 29 T~Fs~~EI~~wYr~Fkke-cPsG~it~~eFk~iy~~~F 65 (76)
-.++++++.+++++|+.. .-...++.+|+..|+....
T Consensus 332 ~~~~~~~~~~~~~~vk~~~~~~~~~~~~el~~l~~~~~ 369 (488)
T PRK09389 332 IEVSDDQLNEIVSRVKELGDRGKRVTDADLLAIAEDVL 369 (488)
T ss_pred CCCCHHHHHHHHHHHHHHHHcCCCcCHHHHHHHHHHHh
Confidence 456889999999999982 2345799999999997755
No 121
>TIGR02675 tape_meas_nterm tape measure domain. Proteins containing this domain are strictly bacterial, including bacteriophage and prophage regions of bacterial genomes. Most members are 800 to 1800 amino acids long, making them among the longest predicted proteins of their respective phage genomes, where they are encoded in tail protein regions. This roughly 80-residue domain described here usually begins between residue 100 and 250. Many members are known or predicted to act as phage tail tape measure proteins, a minor tail component that regulates tail length.
Probab=29.17 E-value=83 Score=18.95 Aligned_cols=38 Identities=16% Similarity=0.164 Sum_probs=33.0
Q ss_pred hhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 26 EGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
....+.+.+++...-..|-+-.-+|+++-+||..|...
T Consensus 5 ~~~~G~s~e~~~~~~~ql~Q~~~~Gkv~~ee~n~~~e~ 42 (75)
T TIGR02675 5 GAASGASAEEADGALIQLSQMLASGKLRGEEINSLLEA 42 (75)
T ss_pred HHHhCCCHHHHHHHHHHHHHHHHcCcccHHHHHHHHHH
Confidence 34567888999999999999999999999999999866
No 122
>TIGR02653 Lon_rel_chp conserved hypothetical protein. This model describes a protein family of unknown function, about 690 residues in length, in which some members show C-terminal sequence similarity to Pfam model pfam05362, which is the Lon protease C-terminal proteolytic domain, from MEROPS family S16. However, the annotated catalytic sites of E. coli Lon protease are not conserved in members of this family. Members have a motif GP[RK][GS]TGKS, similar to the ATP-binding P-loop motif GxxGxGK[ST].
Probab=29.02 E-value=93 Score=26.72 Aligned_cols=31 Identities=19% Similarity=0.388 Sum_probs=26.5
Q ss_pred CCHHHHHHHHHHHhhh-CCCCCCCHHHHHHHH
Q psy15328 31 RRDAEIQEWYKGFLKD-CPTGHLSVEEFKKIY 61 (76)
Q Consensus 31 Fs~~EI~~wYr~Fkke-cPsG~it~~eFk~iy 61 (76)
=+..-|+++..+|+|- +|+|.++++||+.+.
T Consensus 406 RD~~aV~kt~SgllKLl~P~~~~~~ee~e~~l 437 (675)
T TIGR02653 406 RDVIAVRKTVSGLLKLLYPDGEYTKDDVRECL 437 (675)
T ss_pred hhHHHHHHHHHHHHHHhCCCCCCCHHHHHHHH
Confidence 3567789999999995 999999999998754
No 123
>PRK08402 replication factor A; Reviewed
Probab=28.70 E-value=1.6e+02 Score=22.95 Aligned_cols=42 Identities=17% Similarity=0.212 Sum_probs=35.0
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..|.+++...++.+++||.+-.+.-..++| ++.++...+...
T Consensus 7 ~~i~e~~~~~~~vs~eei~~~l~~lv~~~~---~~e~~A~~~ia~ 48 (355)
T PRK08402 7 EEIIERIERKTGMSREEIEEKIREIMKEEG---ISEHAAALLLAE 48 (355)
T ss_pred HHHHHHHHHhhCCCHHHHHHHHHHHHHHcC---CCHHHHHHHHHH
Confidence 457788888999999999999999999987 888877665543
No 124
>PF02209 VHP: Villin headpiece domain; InterPro: IPR003128 Villin is an F-actin bundling protein involved in the maintenance of the microvilli of the absorptive epithelia. The villin-type "headpiece" domain is a modular motif found at the extreme C terminus of larger "core" domains in over 25 cytoskeletal proteins in plants and animals, often in assocation with the Gelsolin repeat. Although the headpiece is classified as an F-actin-binding domain, it has been shown that not all headpiece domains are intrinsically F-actin-binding motifs, surface charge distribution may be an important element for F-actin recognition []. An autonomously folding, 35 residue, thermostable subdomain (HP36) of the full-length 76 amino acid residue villin headpiece, is the smallest known example of a cooperatively folded domain of a naturally occurring protein. The structure of HP36, as determined by NMR spectroscopy, consists of three short helices surrounding a tightly packed hydrophobic core []. ; GO: 0003779 actin binding, 0007010 cytoskeleton organization; PDB: 1ZV6_A 1QZP_A 1UND_A 2PPZ_A 3TJW_B 1YU8_X 2JM0_A 1WY4_A 3MYC_A 1YU5_X ....
Probab=28.50 E-value=40 Score=18.24 Aligned_cols=12 Identities=50% Similarity=0.933 Sum_probs=7.0
Q ss_pred CCHHHHHHHHhh
Q psy15328 52 LSVEEFKKIYGN 63 (76)
Q Consensus 52 it~~eFk~iy~~ 63 (76)
|+.+||..+|+.
T Consensus 2 Lsd~dF~~vFgm 13 (36)
T PF02209_consen 2 LSDEDFEKVFGM 13 (36)
T ss_dssp S-HHHHHHHHSS
T ss_pred cCHHHHHHHHCC
Confidence 566677766654
No 125
>PF13720 Acetyltransf_11: Udp N-acetylglucosamine O-acyltransferase; Domain 2; PDB: 3I3A_A 3I3X_A 3HSQ_B 2JF2_A 1LXA_A 2AQ9_A 2QIV_X 2QIA_A 2JF3_A 4EQY_F ....
Probab=28.43 E-value=1.3e+02 Score=18.45 Aligned_cols=37 Identities=22% Similarity=0.269 Sum_probs=25.1
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcC
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~F 65 (76)
.-+|++++|..+-+.|+.-.-++ ++.++-.+....-+
T Consensus 25 R~Gfs~~~i~~l~~ayr~l~~~~-~~~~~a~~~l~~~~ 61 (83)
T PF13720_consen 25 RRGFSKEEISALRRAYRILFRSG-LTLEEALEELEEEY 61 (83)
T ss_dssp HTTS-HHHHHHHHHHHHHHHTSS-S-HHHHHHHHHHHT
T ss_pred HcCCCHHHHHHHHHHHHHHHhCC-CCHHHHHHHHHHhc
Confidence 34899999999999999977766 56666554444433
No 126
>PF09129 Chol_subst-bind: Cholesterol oxidase, substrate-binding; InterPro: IPR015213 The substrate-binding domain found in cholesterol oxidase is composed of an eight-stranded mixed beta-pleated sheet and six alpha-helices. This domain is positioned over the isoalloxazine ring system of the FAD cofactor bound by the FAD-binding domain (IPR006094 from INTERPRO) and forms the roof of the active site cavity, allowing for catalysis of oxidation and isomerisation of cholesterol to cholest-4-en-3-one []. ; PDB: 3JS8_A 1I19_B 2I0K_A.
Probab=28.24 E-value=38 Score=26.54 Aligned_cols=24 Identities=33% Similarity=0.693 Sum_probs=14.5
Q ss_pred ccccccccCCc-cccchhHHHhhhh
Q psy15328 3 LWSKGWNSGAK-GWNSGTKLERGRE 26 (76)
Q Consensus 3 ~~~~~~~~~~~-~~~~~~~~~~~l~ 26 (76)
-|||||.-+.+ .|....+|..-|-
T Consensus 258 EWSKGWayt~~~aw~d~~~l~~~iP 282 (321)
T PF09129_consen 258 EWSKGWAYTDDAAWTDETVLTTTIP 282 (321)
T ss_dssp -TTSB-S-SSS-TT--HHHHHTHHH
T ss_pred eeccccccCCCccccChhHHhhhch
Confidence 39999986655 7999988876543
No 127
>COG3592 Uncharacterized conserved protein [Function unknown]
Probab=28.13 E-value=52 Score=20.71 Aligned_cols=33 Identities=30% Similarity=0.511 Sum_probs=22.2
Q ss_pred cccCCccccchhHHHhhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCH
Q psy15328 8 WNSGAKGWNSGTKLERGREGRDGRRDAEIQEWYKGFLKDCPTGHLSV 54 (76)
Q Consensus 8 ~~~~~~~~~~~~~~~~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~ 54 (76)
.|-|.|-|-+-. +-+.+||++.- ..||||.|+.
T Consensus 39 F~~~rkPWI~Pd----------~~~ve~i~~vi----~sCPSGAl~Y 71 (74)
T COG3592 39 FNLGRKPWIMPD----------AVDVEEIVKVI----DTCPSGALKY 71 (74)
T ss_pred cccCCCCccCCC----------CCCHHHHHHHH----HhCCchhhhe
Confidence 355666675543 45778888764 4799999863
No 128
>KOG1707|consensus
Probab=27.95 E-value=1.2e+02 Score=26.00 Aligned_cols=40 Identities=20% Similarity=0.124 Sum_probs=35.5
Q ss_pred hhcCCCCHHHHHHHHHHHhh--hCCCCCCCHHHHHHHHhhcC
Q psy15328 26 EGRDGRRDAEIQEWYKGFLK--DCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkk--ecPsG~it~~eFk~iy~~~F 65 (76)
-..|.++.+=++.+-.-|.+ ..++|.++.+||+.+|..+.
T Consensus 304 ~~s~ELs~~~~~Fl~~~f~~~D~d~Dg~L~p~El~~LF~~~P 345 (625)
T KOG1707|consen 304 DQSVELSPKGYRFLVDVFEKFDRDNDGALSPEELKDLFSTAP 345 (625)
T ss_pred CcceeccHHHHHHHHHHHHhccCCCCCCcCHHHHHHHhhhCC
Confidence 36788999999999999998 89999999999999999853
No 129
>PF04282 DUF438: Family of unknown function (DUF438); InterPro: IPR007380 This is a a group of uncharacterised proteins.
Probab=27.67 E-value=79 Score=19.45 Aligned_cols=19 Identities=26% Similarity=0.200 Sum_probs=11.8
Q ss_pred HHhhhhhcCCCCHHHHHHHH
Q psy15328 21 LERGREGRDGRRDAEIQEWY 40 (76)
Q Consensus 21 ~~~~l~~~T~Fs~~EI~~wY 40 (76)
++++|.+ .+++.+||++|.
T Consensus 37 ~Eq~Li~-eG~~~eeiq~LC 55 (71)
T PF04282_consen 37 AEQELIQ-EGMPVEEIQKLC 55 (71)
T ss_pred HHHHHHH-cCCCHHHHHHHh
Confidence 4455544 567777777765
No 130
>KOG0384|consensus
Probab=27.18 E-value=50 Score=30.47 Aligned_cols=38 Identities=26% Similarity=0.434 Sum_probs=25.1
Q ss_pred cCCccccchhHHHhhhhhcCCCCHHHHHHHHHHHhhhC
Q psy15328 10 SGAKGWNSGTKLERGREGRDGRRDAEIQEWYKGFLKDC 47 (76)
Q Consensus 10 ~~~~~~~~~~~~~~~l~~~T~Fs~~EI~~wYr~Fkkec 47 (76)
.|.++|--+..+.-.-....+|+++||+.+||.+++-|
T Consensus 981 ~~~~~~~r~r~~~~~g~~~~~~~e~eir~~~ra~~kfg 1018 (1373)
T KOG0384|consen 981 LGKKGKGRWREILKRGEEKGGFTEKEIRRFYRAYLKFG 1018 (1373)
T ss_pred cccccccccccccccchhhcCCCHHHHHHHHHHHHHhc
Confidence 34455554443222223467899999999999999854
No 131
>cd07971 OBF_DNA_ligase_LigD The Oligonucleotide/oligosaccharide binding (OB)-fold domain of ATP-dependent DNA ligase LigD is a DNA-binding module that is part of the catalytic core unit. ATP-dependent polynucleotide ligases catalyze phosphodiester bond formation using nicked nucleic acid substrates with the high energy nucleotide of ATP as a cofactor in a three step reaction mechanism. DNA ligases play a vital role in the diverse processes of DNA replication, recombination and repair. ATP-dependent ligases are present in many organisms such as viruses, bacteriohages, eukarya, archaea and bacteria. Bacterial DNA ligases are divided into two broad classes: NAD-dependent and ATP-dependent. All bacterial species have a NAD-dependent DNA ligase (LigA). Some bacterial genomes contain multiple genes for DNA ligases that are predicted to use ATP as their cofactor, including Mycobacterium tuberculosis LigB, LigC, and LigD. This group is composed of Mycobacterium tuberculosis LigD and similar ba
Probab=27.01 E-value=62 Score=20.50 Aligned_cols=20 Identities=15% Similarity=0.121 Sum_probs=17.7
Q ss_pred hhcCCCCHHHHHHHHHHHhh
Q psy15328 26 EGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkk 45 (76)
...|.|+.++++.|++.+..
T Consensus 38 ~vgtG~s~~~~~~l~~~l~~ 57 (115)
T cd07971 38 RVGTGFSAATLRELRERLAP 57 (115)
T ss_pred eEcCCCCHHHHHHHHHHhhc
Confidence 35689999999999999976
No 132
>PF14410 GH-E: HNH/ENDO VII superfamily nuclease with conserved GHE residues
Probab=26.70 E-value=63 Score=19.53 Aligned_cols=24 Identities=21% Similarity=0.538 Sum_probs=14.2
Q ss_pred CCCCHHHHHHHHH---HHhhhCCCCCC
Q psy15328 29 DGRRDAEIQEWYK---GFLKDCPTGHL 52 (76)
Q Consensus 29 T~Fs~~EI~~wYr---~FkkecPsG~i 52 (76)
-..+.+|+..||+ .|.=+||+...
T Consensus 39 ~~~t~ke~~d~~n~p~nyrlE~~s~Nr 65 (70)
T PF14410_consen 39 GGITRKEFLDWYNDPDNYRLEDPSCNR 65 (70)
T ss_pred cCCCHHHHHHHHhCccceeecCCcCCc
Confidence 4567777777776 22236666544
No 133
>PF01785 Closter_coat: Closterovirus coat protein; InterPro: IPR002679 This family consist of coat proteins from closterovirus, which belong to the Closteroviridae, which have a positive strand ssRNA genome with no DNA stage during replication. The viral coat protein encapsulates and protects the viral genome. Both the large cp1 and smaller cp2 coat protein originate from the same primary transcript []. Members of the Closterovirus include Beet yellows virus (BYV) (Sugar beet yellows virus) and Grapevine leafroll-associated virus 7.; GO: 0044423 virion part
Probab=26.59 E-value=69 Score=22.36 Aligned_cols=39 Identities=8% Similarity=0.106 Sum_probs=31.7
Q ss_pred hhcCCCCHHHHHHHHHHHhhhCCCC-CCCHHHHHHHHhhc
Q psy15328 26 EGRDGRRDAEIQEWYKGFLKDCPTG-HLSVEEFKKIYGNF 64 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~FkkecPsG-~it~~eFk~iy~~~ 64 (76)
...+.++.+|++++...+++.|..- .++.+++..++..+
T Consensus 8 ~~~~~ls~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~ 47 (188)
T PF01785_consen 8 SDPDVLSEEDLKKLSEKLKKFLKEKLGVTDKDFIAFFVML 47 (188)
T ss_pred eCccccCHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHH
Confidence 3567899999999999999977777 88888887666554
No 134
>KOG0844|consensus
Probab=26.10 E-value=41 Score=26.89 Aligned_cols=22 Identities=18% Similarity=0.205 Sum_probs=19.7
Q ss_pred hhcCCCCHHHHHHHHHHHhhhC
Q psy15328 26 EGRDGRRDAEIQEWYKGFLKDC 47 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkkec 47 (76)
.=.|.|++++|-+|=|.|.+|.
T Consensus 183 RYRTAFTReQIaRLEKEFyrEN 204 (408)
T KOG0844|consen 183 RYRTAFTREQIARLEKEFYREN 204 (408)
T ss_pred HHHhhhhHHHHHHHHHHHHHhc
Confidence 5689999999999999999864
No 135
>KOG0034|consensus
Probab=25.95 E-value=1.1e+02 Score=21.72 Aligned_cols=18 Identities=28% Similarity=0.543 Sum_probs=10.7
Q ss_pred CCHHHHHHHHhhcCCCCC
Q psy15328 52 LSVEEFKKIYGNFFPYGD 69 (76)
Q Consensus 52 it~~eFk~iy~~~FP~Gd 69 (76)
|+.++|.+.++.|.|...
T Consensus 84 v~F~~Fv~~ls~f~~~~~ 101 (187)
T KOG0034|consen 84 VDFEEFVRLLSVFSPKAS 101 (187)
T ss_pred cCHHHHHHHHhhhcCCcc
Confidence 666666666666665543
No 136
>PF06014 DUF910: Bacterial protein of unknown function (DUF910); InterPro: IPR009256 This family consists of several short bacterial proteins of unknown function.; PDB: 2NN4_A.
Probab=25.84 E-value=1.1e+02 Score=18.43 Aligned_cols=23 Identities=26% Similarity=0.504 Sum_probs=17.0
Q ss_pred CCHHHHHHHHHHHhhhCCCCCCCHHHHHHH
Q psy15328 31 RRDAEIQEWYKGFLKDCPTGHLSVEEFKKI 60 (76)
Q Consensus 31 Fs~~EI~~wYr~FkkecPsG~it~~eFk~i 60 (76)
+=+.||+.+|+ +|.|+.++|.+.
T Consensus 29 lM~~El~~Ly~-------~~lidk~~y~~A 51 (62)
T PF06014_consen 29 LMEIELKELYK-------SGLIDKKEYLTA 51 (62)
T ss_dssp HHHHHHHHHHH-------TTSS-HHHHHHH
T ss_pred HHHHHHHHHHH-------cCCCCHHHHHHH
Confidence 34678999998 688999998653
No 137
>KOG3218|consensus
Probab=25.78 E-value=70 Score=23.72 Aligned_cols=35 Identities=31% Similarity=0.416 Sum_probs=26.1
Q ss_pred HHHHHHHHHHHhhh----CC--------CCCCCHHHHHHHHhhcCCC
Q psy15328 33 DAEIQEWYKGFLKD----CP--------TGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 33 ~~EI~~wYr~Fkke----cP--------sG~it~~eFk~iy~~~FP~ 67 (76)
+.|+.++|+..+.. .- .=.+++++|+..|+.--|.
T Consensus 6 e~E~~rl~~ar~T~~qMlrDRGY~vt~~el~ltLe~F~~~yg~~~p~ 52 (208)
T KOG3218|consen 6 EEEIYRLYLARKTAMQMLRDRGYTVTQEELDLTLEEFKARYGDKMPD 52 (208)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCccccHHHhhhhHHHHHHHhccCCcc
Confidence 78999999999871 11 2246899999999985343
No 138
>cd04402 RhoGAP_ARHGAP20 RhoGAP_ARHGAP20: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP20-like proteins. ArhGAP20, also known as KIAA1391 and RA-RhoGAP, contains a RhoGAP, a RA, and a PH domain, and ANXL repeats. ArhGAP20 is activated by Rap1 and induces inactivation of Rho, which in turn leads to neurite outgrowth. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
Probab=25.73 E-value=1.8e+02 Score=20.02 Aligned_cols=35 Identities=17% Similarity=0.265 Sum_probs=30.2
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..++...+-.+-|.|.++-|.+.|+.+.+..++..
T Consensus 62 ~~~~~~~va~~lK~flreLpepLi~~~~~~~~~~~ 96 (192)
T cd04402 62 KAEPVLLLASVLKDFLRNIPGSLLSSDLYEEWMSA 96 (192)
T ss_pred ccCCHHHHHHHHHHHHHhCCCccCCHHHHHHHHHH
Confidence 56788999999999999999999998888776654
No 139
>cd05071 PTKc_Src Catalytic domain of the Protein Tyrosine Kinase, Src. Protein Tyrosine Kinase (PTK) family; Src kinase; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Src (or c-Src) is a cytoplasmic (or non-receptor) tyr kinase, containing an N-terminal SH4 domain with a myristoylation site, followed by SH3 and SH2 domains, a tyr kinase domain, and a regulatory C-terminal region with a conserved tyr. It is activated by autophosphorylation at the tyr kinase domain, and is negatively regulated by phosphorylation at the C-terminal tyr by Csk (C-terminal Src Kinase). c-Src is the vertebrate homolog of the oncogenic protein (v-Src) from Rous sarcoma virus. Together with other Src subfamily proteins, it is invo
Probab=25.70 E-value=1.4e+02 Score=20.22 Aligned_cols=33 Identities=9% Similarity=0.265 Sum_probs=29.0
Q ss_pred HHHHHHHHHHhhhCCCCCCCHHHHHHHHhhcCC
Q psy15328 34 AEIQEWYKGFLKDCPTGHLSVEEFKKIYGNFFP 66 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG~it~~eFk~iy~~~FP 66 (76)
.++..+.++..+..|..+.+..++..+...+|+
T Consensus 230 ~~l~~li~~~l~~~p~~Rp~~~~~~~~l~~~~~ 262 (262)
T cd05071 230 ESLHDLMCQCWRKEPEERPTFEYLQAFLEDYFT 262 (262)
T ss_pred HHHHHHHHHHccCCcccCCCHHHHHHHHHHhcC
Confidence 678888999999999999999999999888774
No 140
>PF12763 EF-hand_4: Cytoskeletal-regulatory complex EF hand; PDB: 2QPT_A 2KSP_A 2KFG_A 2JQ6_A 2KFH_A 2KFF_A 1IQ3_A 3FIA_A 2KHN_A 2KGR_A ....
Probab=25.60 E-value=51 Score=21.24 Aligned_cols=33 Identities=18% Similarity=0.252 Sum_probs=22.5
Q ss_pred CCHHHHHHHHHHHhhhCC-CCCCCHHHHHHHHhh
Q psy15328 31 RRDAEIQEWYKGFLKDCP-TGHLSVEEFKKIYGN 63 (76)
Q Consensus 31 Fs~~EI~~wYr~FkkecP-sG~it~~eFk~iy~~ 63 (76)
++++|.++..+-|..-.| +|.|+-.+-+.+|..
T Consensus 4 ls~~e~~~y~~~F~~l~~~~g~isg~~a~~~f~~ 37 (104)
T PF12763_consen 4 LSPEEKQKYDQIFQSLDPQDGKISGDQAREFFMK 37 (104)
T ss_dssp -SCCHHHHHHHHHHCTSSSTTEEEHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHhcCCCCCeEeHHHHHHHHHH
Confidence 466777777777776444 677777777777665
No 141
>PRK11858 aksA trans-homoaconitate synthase; Reviewed
Probab=25.41 E-value=98 Score=23.82 Aligned_cols=38 Identities=16% Similarity=0.078 Sum_probs=30.5
Q ss_pred cCCCCHHHHHHHHHHHhhhCC--CCCCCHHHHHHHHhhcC
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCP--TGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecP--sG~it~~eFk~iy~~~F 65 (76)
.-..+++++..+++++|...- ...|+.+|+..||....
T Consensus 333 g~~~~~~~~~~~~~~vk~~~~~~~~~~~~~el~~~~~~~~ 372 (378)
T PRK11858 333 GIELSREELCELLEKVKELSERKKRSLTDEELKELVEDVR 372 (378)
T ss_pred CCCCCHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHh
Confidence 346789999999999999432 24899999999998753
No 142
>cd04377 RhoGAP_myosin_IX RhoGAP_myosin_IX: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in class IX myosins. Class IX myosins contain a characteristic head domain, a neck domain, a tail domain which contains a C6H2-zinc binding motif and a RhoGAP domain. Class IX myosins are single-headed, processive myosins that are partly cytoplasmic, and partly associated with membranes and the actin cytoskeleton. Class IX myosins are implicated in the regulation of neuronal morphogenesis and function of sensory systems, like the inner ear. There are two major isoforms, myosin IXA and IXB with several splice variants, which are both expressed in developing neurons. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell prolifer
Probab=25.38 E-value=1.8e+02 Score=19.90 Aligned_cols=35 Identities=14% Similarity=0.239 Sum_probs=29.8
Q ss_pred CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 29 DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
+.++...|-.+-|.|.++-|...|+.+-+..+...
T Consensus 63 ~~~~~~~va~~LK~flr~LpepLi~~~~~~~~~~~ 97 (186)
T cd04377 63 EDYPIHVITSVLKQWLRELPEPLMTFELYENFLRA 97 (186)
T ss_pred ccCCHHHHHHHHHHHHHcCCCccCCHHHHHHHHHH
Confidence 45789999999999999999999999887766543
No 143
>PHA02053 hypothetical protein
Probab=25.12 E-value=78 Score=21.32 Aligned_cols=27 Identities=19% Similarity=0.379 Sum_probs=19.9
Q ss_pred HHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 38 EWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 38 ~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
.+.-.+...|-+|+|++..-+.|...+
T Consensus 7 ~lm~~yr~r~~~gri~e~kak~iIe~Y 33 (115)
T PHA02053 7 ELMYQYRERCWSGRITEPKAKAIIEKY 33 (115)
T ss_pred HHHHHHHHHHhccccCcHHHHHHHHHH
Confidence 333444568999999999888887665
No 144
>TIGR02110 PQQ_syn_pqqF coenzyme PQQ biosynthesis probable peptidase PqqF. In a subset of species that make coenzyme PQQ (pyrrolo-quinoline-quinone), this probable peptidase is found in the PQQ biosynthesis region and is thought to act as a protease on PqqA (TIGR02107), a probable peptide precursor of the coenzyme. PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases.
Probab=24.75 E-value=81 Score=26.77 Aligned_cols=47 Identities=11% Similarity=0.206 Sum_probs=36.8
Q ss_pred chhHHHhhhhhcCCCCHHHHHHHHHHHhh-----hCCCCCCCHHHHHHHHhhcC
Q psy15328 17 SGTKLERGREGRDGRRDAEIQEWYKGFLK-----DCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 17 ~~~~~~~~l~~~T~Fs~~EI~~wYr~Fkk-----ecPsG~it~~eFk~iy~~~F 65 (76)
.|+. ..+...|..+.++|+..|+++.. -+=.|.++.++.+++-..+|
T Consensus 156 iGt~--esL~~it~~t~edL~~F~~~~Y~p~NmvLvIvGdvs~eel~~l~e~~f 207 (696)
T TIGR02110 156 AGSR--DSLALPNTAFQQALRDFHRRHYQAGNMQLWLQGPQSLDELEQLAARFG 207 (696)
T ss_pred CCCH--HHHhCcccchHHHHHHHHHHhcchhcEEEEEEeCCCHHHHHHHHHHHh
Confidence 3453 55666666789999999999986 35678899999998887776
No 145
>PF02256 Fe_hyd_SSU: Iron hydrogenase small subunit; InterPro: IPR003149 Many microorganisms, such as methanogenic, acetogenic, nitrogen-fixing, photosynthetic, or sulphate-reducing bacteria, metabolise hydrogen. Hydrogen activation is mediated by a family of enzymes, termed hydrogenases, which either provide these organisms with reducing power from hydrogen oxidation, or act as electron sinks. There are two hydrogenases families that differ functionally from each other: NiFe hydrogenases tend to be more involved in hydrogen oxidation, while Iron-only FeFe (Fe only) hydrogenases in hydrogen production. Fe only hydrogenases (1.12.7.2 from EC) show a common core structure, which contains a moiety, deeply buried inside the protein, with an Fe-Fe dinuclear centre, nonproteic bridging, terminal CO and CN- ligands attached to each of the iron atoms, and a dithio moiety, which also bridges the two iron atoms and has been tentatively assigned as a di(thiomethyl)amine. This common core also harbours three [4Fe-4S] iron-sulphur clusters []. In FeFe hydrogenases, as in NiFe hydrogenases, the set of iron-sulphur clusters is dispersed regularly between the dinuclear Fe-Fe centre and the molecular surface. These clusters are distant by about 1.2 nm from each other but the [4Fe-4S] cluster closest to the dinuclear centre is covalently bound to one of the iron atoms though a thiolate bridging ligand. The moiety including the dinuclear centre, the thiolate bridging ligand, and the proximal [4Fe-4S] cluster is known as the H-cluster. A channel, lined with hydrophobic amino acid side chains, nearly connects the dinuclear centre and the molecular surface. Furthermore hydrogen-bonded water molecule sites have been identified at the interior and at the surface of the protein. The small subunit is comprised of alternating random coil and alpha helical structures that encompass the large subunit in a novel protein fold [].; PDB: 3LX4_A 1C4C_A 1FEH_A 3C8Y_A 1C4A_A 1HFE_S 1GX7_D 1E08_D.
Probab=24.74 E-value=58 Score=18.92 Aligned_cols=15 Identities=47% Similarity=0.787 Sum_probs=11.5
Q ss_pred CHHHHHHHHHHHhhh
Q psy15328 32 RDAEIQEWYKGFLKD 46 (76)
Q Consensus 32 s~~EI~~wYr~Fkke 46 (76)
.-.+|++||+.|...
T Consensus 28 eNp~v~~lY~~~lg~ 42 (60)
T PF02256_consen 28 ENPEVQELYKEFLGG 42 (60)
T ss_dssp G-HHHHHHHHHTTSS
T ss_pred cCHHHHHHHHHHhCC
Confidence 346899999999863
No 146
>smart00153 VHP Villin headpiece domain.
Probab=24.42 E-value=55 Score=17.58 Aligned_cols=12 Identities=42% Similarity=0.877 Sum_probs=8.0
Q ss_pred CCHHHHHHHHhh
Q psy15328 52 LSVEEFKKIYGN 63 (76)
Q Consensus 52 it~~eFk~iy~~ 63 (76)
|+.+||+.+|+.
T Consensus 2 LsdeeF~~vfgm 13 (36)
T smart00153 2 LSDEDFEEVFGM 13 (36)
T ss_pred CCHHHHHHHHCC
Confidence 566777777664
No 147
>PF09808 SNAPc_SNAP43: Small nuclear RNA activating complex (SNAPc), subunit SNAP43; InterPro: IPR019188 Members of this family are part of the SNAPc complex required for the transcription of both RNA polymerase II and III small-nuclear RNA genes. They bind to the proximal sequence element (PSE), a non-TATA-box basal promoter element common to these 2 types of genes. Furthermore, they also recruit TBP and BRF2 to the U6 snRNA TATA box. SNAPc consists of at least four stably associated subunits, SNAP43, SNAP45, SNAP50, and SNAP190. None of the three small subunits can bind to the PSE on their own [].
Probab=23.77 E-value=1.5e+02 Score=20.63 Aligned_cols=29 Identities=21% Similarity=0.534 Sum_probs=24.7
Q ss_pred HHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 33 DAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 33 ~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
+++|..|-.+|.+. ..++.++|++++...
T Consensus 3 ~~D~~~Ll~~F~~~---~~~~F~~F~~~W~~~ 31 (194)
T PF09808_consen 3 KEDIDELLQRFQQA---ESVRFEDFKRLWREM 31 (194)
T ss_pred HHHHHHHHHHHHHc---CCCCHHHHHHHHHHC
Confidence 46789999999988 678999999998874
No 148
>KOG4347|consensus
Probab=23.75 E-value=84 Score=27.03 Aligned_cols=29 Identities=10% Similarity=0.202 Sum_probs=24.7
Q ss_pred HHHhhhhhcCCCCHHHHHHHHHHHhhhCC
Q psy15328 20 KLERGREGRDGRRDAEIQEWYKGFLKDCP 48 (76)
Q Consensus 20 ~~~~~l~~~T~Fs~~EI~~wYr~FkkecP 48 (76)
.++|.+-..|.|+..||..+|..|+++.-
T Consensus 487 t~lrs~~~~~~lt~~dL~~lYd~f~~e~~ 515 (671)
T KOG4347|consen 487 TILRSVVQTTSLTNTDLENLYDLFKEEHL 515 (671)
T ss_pred HHHHhhcccCccCHHHHHHHHHHHHHHHh
Confidence 45677788999999999999999999643
No 149
>PRK13662 hypothetical protein; Provisional
Probab=23.59 E-value=98 Score=22.02 Aligned_cols=30 Identities=13% Similarity=0.207 Sum_probs=21.5
Q ss_pred cchhHHHhhhhh-cCCCCHHHHHHHHHHHhh
Q psy15328 16 NSGTKLERGREG-RDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 16 ~~~~~~~~~l~~-~T~Fs~~EI~~wYr~Fkk 45 (76)
+.+..|++.+.+ .--|+..-|..||.+|++
T Consensus 143 ~~~~~l~~~i~~~~~PF~~~~i~~~~~~~~~ 173 (177)
T PRK13662 143 ENVKILVDWINQKKGPFSPDFVDIWYERYLE 173 (177)
T ss_pred HHHHHHHHHHHhCCCCCChHHHHHHHHHHHH
Confidence 345556665544 555999999999999964
No 150
>PF03471 CorC_HlyC: Transporter associated domain; InterPro: IPR005170 This small domain is found in a family of proteins with the CBS IPR002550 from INTERPRO domain and two CBS domains with this domain found at the C terminus of the proteins, the domain is also found at the C terminus of some Na+/H+ antiporters. This domain is also found in CorC that is involved in Magnesium and cobalt efflux. The function of this domain is uncertain but might be involved in modulating transport of ion substrates.; PDB: 3DED_F 2PLI_C 2R2Z_A 2P4P_A 2O3G_A 2P3H_A 3LLB_A 3LAE_A 2P13_B 2NQW_A ....
Probab=23.49 E-value=65 Score=19.07 Aligned_cols=25 Identities=16% Similarity=0.392 Sum_probs=18.4
Q ss_pred CCCCCHHHHHHHHhhcCCCCCCCCc
Q psy15328 49 TGHLSVEEFKKIYGNFFPYGDASKY 73 (76)
Q Consensus 49 sG~it~~eFk~iy~~~FP~Gda~~f 73 (76)
+|.++..+|.++|+--+|..+.++.
T Consensus 11 ~G~~~l~~l~~~~~~~l~~~~~~Tl 35 (81)
T PF03471_consen 11 SGSTPLDDLNELLGLDLPEEDYDTL 35 (81)
T ss_dssp ETTSBHHHHHHHHTS-TTTTTTSBH
T ss_pred EecCCHHHHHHHHCcCCCccchhhH
Confidence 5788889999988888887665543
No 151
>PF05120 GvpG: Gas vesicle protein G ; InterPro: IPR007804 Gas vesicles are intracellular, protein-coated, and hollow organelles found in cyanobacteria and halophilic archaea. They are permeable to ambient gases by diffusion and provide buoyancy, enabling cells to move upwards in water to access oxygen and/or light. Proteins containing this family are involved in the formation of gas vesicles [].
Probab=23.33 E-value=1.6e+02 Score=18.37 Aligned_cols=23 Identities=30% Similarity=0.495 Sum_probs=17.4
Q ss_pred HHHHHHHHHHhhhCCCCCCCHHHHHHH
Q psy15328 34 AEIQEWYKGFLKDCPTGHLSVEEFKKI 60 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG~it~~eFk~i 60 (76)
++|..++..|- .|-||.++|...
T Consensus 35 ~~L~~L~~~~e----~GEIseeEf~~~ 57 (79)
T PF05120_consen 35 RELAELQEALE----AGEISEEEFERR 57 (79)
T ss_pred HHHHHHHHHHH----cCCCCHHHHHHH
Confidence 45666777665 899999999754
No 152
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=23.24 E-value=1.4e+02 Score=21.05 Aligned_cols=41 Identities=22% Similarity=0.366 Sum_probs=33.2
Q ss_pred hhhhhcCCCCHHHHHHHHHHHh---hhCCCCCCCHHHHHHHHhh
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFL---KDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~Fk---kecPsG~it~~eFk~iy~~ 63 (76)
.-+.+.++++.+|+|+.....- ++.|.-++..--|+++...
T Consensus 90 elvea~~~l~pdevqqf~tdlt~ltkdspktqva~lrfkkvmsk 133 (160)
T COG4306 90 ELVEAGENLNPDEVQQFRTDLTDLTKDSPKTQVATLRFKKVMSK 133 (160)
T ss_pred HHHHccccCCHHHHHHHHhhHHHHhhcCchhHHHHHHHHHHHHH
Confidence 3446889999999999876554 4899999999999988765
No 153
>PF05263 DUF722: Protein of unknown function (DUF722); InterPro: IPR007927 This entry is represented by Bacteriophage bIL285, Orf38. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=23.15 E-value=81 Score=21.44 Aligned_cols=22 Identities=14% Similarity=0.255 Sum_probs=18.8
Q ss_pred hhhcCCCCHHHHHHHHHHHhhh
Q psy15328 25 REGRDGRRDAEIQEWYKGFLKD 46 (76)
Q Consensus 25 l~~~T~Fs~~EI~~wYr~Fkke 46 (76)
+....|.++..+++||..||..
T Consensus 105 IA~~l~i~erta~r~~~~fK~~ 126 (130)
T PF05263_consen 105 IAQKLHISERTARRWRDRFKND 126 (130)
T ss_pred HHHHhCccHHHHHHHHHHHHHH
Confidence 4566789999999999999974
No 154
>PF08672 APC2: Anaphase promoting complex (APC) subunit 2; InterPro: IPR014786 The anaphase-promoting complex (APC) or cyclosome is a multi-subunit E3 protein ubiquitin ligase that regulates important events in mitosis such as the initiation of anaphase and exit from telophase. The APC, in conjunction with other enzymes, assembles multi-ubiquitin chains on a variety of regulatory proteins, thereby targeting them for proteolysis by the 26S proteasome. Anaphase is initiated when the APC triggers the destruction of securin, thereby allowing the protease, separase, to disrupt sister-chromatid cohesion. Securin ubiquitination by the APC is inhibited by cyclin-dependent kinase 1 (Cdk1)-dependent phosphorylation []. Forkhead Box M1 (FoxM1), which is a transcription factor that is over-expressed in many cancers, is degraded in late mitosis and early G1 phase by the APC/cyclosome (APC/C) E3 ubiquitin ligase []. The APC/C targets mitotic cyclins for destruction in mitosis and G1 phase and is then inactivated at S phase. It thereby generates alternating states of high and low cyclin-Cdk activity, which is required for the alternation of mitosis and DNA replication []. The APC/C is composed of at least 13 subunits that stay tightly associated throughout the cell cycle: APC1, APC2, APC4, APC5, APC9, APC11, CDC16, CDC23, CDC26, CDC27, DOC1, MND2 and SWM1[], []. In fission yeast the 13 subunits are known as: Apc1, Apc2, Nuc2, Apc4, Apc5, Cut9, Apc8, Apc10, Apc11, Hcn1, Apc13, Apc14 and Apc15 []. This entry represents a C-terminal domain found in APC subunit 2. ; PDB: 1LDD_A.
Probab=22.97 E-value=1.7e+02 Score=17.11 Aligned_cols=41 Identities=20% Similarity=0.102 Sum_probs=27.2
Q ss_pred hhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.-|.....++-+.|+...|-|...-|...++.+|++.+...
T Consensus 4 gMLtN~gsl~l~RIh~mLkmf~~~~~~~~~s~~eL~~fL~~ 44 (60)
T PF08672_consen 4 GMLTNLGSLPLDRIHSMLKMFPKDPGGYDISLEELQEFLDR 44 (60)
T ss_dssp HHHHHH-SEEHHHHHHHHHHH-GGG--TT--HHHHHHHHHH
T ss_pred HHhhcCCCCCHHHHHHHHHhccCCCCCCCCCHHHHHHHHHH
Confidence 34556666888999999999955557778899999887654
No 155
>KOG2670|consensus
Probab=22.90 E-value=73 Score=25.75 Aligned_cols=20 Identities=30% Similarity=0.589 Sum_probs=18.2
Q ss_pred CCCHHHHHHHHHHHhhhCCC
Q psy15328 30 GRRDAEIQEWYKGFLKDCPT 49 (76)
Q Consensus 30 ~Fs~~EI~~wYr~FkkecPs 49 (76)
.++.++|-.+|+.|.+++|-
T Consensus 272 ~~s~~~L~dlY~~~~k~yPi 291 (433)
T KOG2670|consen 272 WLSGDQLADLYKSFIKDYPI 291 (433)
T ss_pred ccCHHHHHHHHHHHHhcCCe
Confidence 47889999999999999995
No 156
>PF01320 Colicin_Pyocin: Colicin immunity protein / pyocin immunity protein; InterPro: IPR023802 Bacterial colicin and pyocin immunity proteins [, ] can bind specifically to the DNase-type colicins and pyocins and inhibit their bactericidal activity. The 1.8-angstrom crystal structure of the ImmE7 protein consists of four antiparallel alpha-helices []. Sequence similarities between colicins E2, A and E1 [] are less striking. The colicin E2 (pyocin) immunity protein does not share similarity with either the colicin E3 or cloacin DF13 [] immunity proteins. Pyocin protects a cell that harbours the plasmid ColE2 encoding colicin E2 against colicin E2; it is thus essential both for autonomous replication and colicin E2 immunity []. This entry represents the structural domain of colicin and pyocin immunity proteins.; GO: 0015643 toxin binding, 0030153 bacteriocin immunity; PDB: 1GXH_A 1GXG_A 1MZ8_C 2ERH_A 1ZNV_C 1AYI_A 1UNK_A 2JBG_A 7CEI_A 1CEI_A ....
Probab=22.89 E-value=1.3e+02 Score=19.19 Aligned_cols=31 Identities=19% Similarity=0.271 Sum_probs=16.9
Q ss_pred HHhhhhhcCCCCHHHHHHHHHHHhh--hCCCCC
Q psy15328 21 LERGREGRDGRRDAEIQEWYKGFLK--DCPTGH 51 (76)
Q Consensus 21 ~~~~l~~~T~Fs~~EI~~wYr~Fkk--ecPsG~ 51 (76)
|++.+...+.=+++++..|-..|.+ ++|.|.
T Consensus 18 ~v~~i~~~~~~~ee~~d~lv~hF~~iteHP~gS 50 (85)
T PF01320_consen 18 FVKEIFNAELKTEEEHDELVDHFEKITEHPDGS 50 (85)
T ss_dssp HHHHHHHTCSSSCHHHHHHHHHHHHHH--TTTT
T ss_pred HHHHHHcCCCCCHHHHHHHHHHHHHcCCCCCCC
Confidence 4455555554455566666666666 566553
No 157
>PTZ00100 DnaJ chaperone protein; Provisional
Probab=22.88 E-value=2.4e+02 Score=18.80 Aligned_cols=38 Identities=16% Similarity=-0.010 Sum_probs=25.8
Q ss_pred hhhhcCCCCHHHHHHHHHHHhh-hCCCCCCCHHHHHHHH
Q psy15328 24 GREGRDGRRDAEIQEWYKGFLK-DCPTGHLSVEEFKKIY 61 (76)
Q Consensus 24 ~l~~~T~Fs~~EI~~wYr~Fkk-ecPsG~it~~eFk~iy 61 (76)
.|--....|.+||++-||+... -+|+-.-+.+.|++|-
T Consensus 70 ILGv~~~As~~eIkkaYRrLa~~~HPDkgGs~~~~~kIn 108 (116)
T PTZ00100 70 ILNISPTASKERIREAHKQLMLRNHPDNGGSTYIASKVN 108 (116)
T ss_pred HcCCCCCCCHHHHHHHHHHHHHHhCCCCCCCHHHHHHHH
Confidence 3334456889999999999866 5666434566666653
No 158
>KOG0037|consensus
Probab=22.80 E-value=2.7e+02 Score=20.80 Aligned_cols=46 Identities=13% Similarity=0.136 Sum_probs=34.7
Q ss_pred chhHHHhhhh--hcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 17 SGTKLERGRE--GRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 17 ~~~~~~~~l~--~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
+-+.|.+.|. ..++|+.+-++-+-.-|-+ .-+|+|...||+.++..
T Consensus 75 ~~~eLq~aLsn~~~~~Fs~~TcrlmI~mfd~-~~~G~i~f~EF~~Lw~~ 122 (221)
T KOG0037|consen 75 LAKELQQALSNGTWSPFSIETCRLMISMFDR-DNSGTIGFKEFKALWKY 122 (221)
T ss_pred cHHHHHHHhhcCCCCCCCHHHHHHHHHHhcC-CCCCccCHHHHHHHHHH
Confidence 3455555555 5678999998888887764 45799999999998764
No 159
>PF09373 PMBR: Pseudomurein-binding repeat; InterPro: IPR018975 Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum) is a methanogenic Gram-positive microorganism with a cell wall consisting of pseudomurein. This repeat specifically binds to pseudomurein. This repeat is found at the N terminus of PeiW and PeiP which are pseudomurein binding phage proteins.
Probab=22.67 E-value=76 Score=16.33 Aligned_cols=15 Identities=27% Similarity=0.507 Sum_probs=12.0
Q ss_pred CCCCCHHHHHHHHhh
Q psy15328 49 TGHLSVEEFKKIYGN 63 (76)
Q Consensus 49 sG~it~~eFk~iy~~ 63 (76)
+|.|+.+++.++...
T Consensus 2 ~~~i~~~~~~d~a~r 16 (33)
T PF09373_consen 2 SGTISKEEYLDMASR 16 (33)
T ss_pred CceecHHHHHHHHHH
Confidence 578889998888765
No 160
>PRK05289 UDP-N-acetylglucosamine acyltransferase; Provisional
Probab=22.59 E-value=57 Score=23.68 Aligned_cols=37 Identities=16% Similarity=0.245 Sum_probs=29.7
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhhc
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGNF 64 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~~ 64 (76)
.-+|+++++.++-+.|+.-.+.|....+...++-..+
T Consensus 201 ~~~~~~~~~~~i~~a~~~~~~~~~~~~~~~~~~~~~~ 237 (262)
T PRK05289 201 RRGFSREEIHALRRAYKLLYRSGLTLEEALEELAEEY 237 (262)
T ss_pred hCCCCHHHHHHHHHHHHHHHHcCccHHHHHHHHHhhc
Confidence 3479999999999999999999987777776665543
No 161
>PRK10753 transcriptional regulator HU subunit alpha; Provisional
Probab=22.44 E-value=1.2e+02 Score=18.70 Aligned_cols=27 Identities=11% Similarity=0.021 Sum_probs=23.3
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
..|++.+...++++.+++......|..
T Consensus 4 ~eli~~ia~~~~~s~~~~~~~v~~~~~ 30 (90)
T PRK10753 4 TQLIDVIADKAELSKTQAKAALESTLA 30 (90)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 357788889999999999999999965
No 162
>PTZ00232 variable surface protein Vir27; Provisional
Probab=22.43 E-value=1e+02 Score=24.09 Aligned_cols=31 Identities=16% Similarity=0.334 Sum_probs=23.4
Q ss_pred HHHHHHHHHHhhhCCCCCCCH---HHHHHHHhhc
Q psy15328 34 AEIQEWYKGFLKDCPTGHLSV---EEFKKIYGNF 64 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG~it~---~eFk~iy~~~ 64 (76)
++++..|+.|...|+++.-+. .+|.+.|+.-
T Consensus 167 ~~i~~AY~~v~~~C~~~~~~~~yC~~F~~~~~~~ 200 (363)
T PTZ00232 167 NKYINMYNDAYSDCNKINKKKYDCNKFSELFRNK 200 (363)
T ss_pred HHHHHHHHHHhhhcCCCCCCCcchHHHHHHhccC
Confidence 678999999999999875443 5777776543
No 163
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=22.14 E-value=1.4e+02 Score=15.68 Aligned_cols=20 Identities=20% Similarity=0.285 Sum_probs=16.0
Q ss_pred CCCCHHHHHHHHHHHhhhCC
Q psy15328 29 DGRRDAEIQEWYKGFLKDCP 48 (76)
Q Consensus 29 T~Fs~~EI~~wYr~FkkecP 48 (76)
-..+.++|++-|++..+.+.
T Consensus 10 ~~~~~~~ik~~y~~l~~~~H 29 (55)
T cd06257 10 PDASDEEIKKAYRKLALKYH 29 (55)
T ss_pred CCCCHHHHHHHHHHHHHHHC
Confidence 45788999999999988543
No 164
>PF14550 Peptidase_U35_2: Putative phage protease XkdF
Probab=21.93 E-value=1e+02 Score=20.73 Aligned_cols=24 Identities=21% Similarity=0.167 Sum_probs=19.5
Q ss_pred cCCCCHHHHHHHHHHHhhhCCCCC
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCPTGH 51 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecPsG~ 51 (76)
.--++++||++-...|++.+.++-
T Consensus 32 gD~~taeeIekaA~~Fm~~~~~~d 55 (122)
T PF14550_consen 32 GDYMTAEEIEKAAHKFMKNYRNID 55 (122)
T ss_pred CCccCHHHHHHHHHHHHHHcccCC
Confidence 334899999999999999866553
No 165
>PTZ00230 variable surface protein Vir7; Provisional
Probab=21.90 E-value=1.1e+02 Score=24.02 Aligned_cols=33 Identities=21% Similarity=0.416 Sum_probs=23.0
Q ss_pred HHHHHHHHHHhhhCCCCCCCH---HHHHHHHhhcCC
Q psy15328 34 AEIQEWYKGFLKDCPTGHLSV---EEFKKIYGNFFP 66 (76)
Q Consensus 34 ~EI~~wYr~FkkecPsG~it~---~eFk~iy~~~FP 66 (76)
+++...|+.|...|+++.-+. .+|.+.+.+..|
T Consensus 164 ~~~~~AY~~~~~~C~~~~~~~~YC~~F~~~~~~~~p 199 (364)
T PTZ00230 164 DTHVNIYNEFYSECEVEDKEKNYCKNFKKYFPDKKP 199 (364)
T ss_pred HHHHHHHHHHhCcCCCCCCCCchhHHHHHHccccCc
Confidence 468899999999999765322 456666555544
No 166
>PF07553 Lipoprotein_Ltp: Host cell surface-exposed lipoprotein; InterPro: IPR011434 This domain is found as 1-3 copies in a small family of proteins of unknown function.
Probab=21.83 E-value=1.1e+02 Score=17.39 Aligned_cols=19 Identities=21% Similarity=0.204 Sum_probs=10.3
Q ss_pred hHHHhhhhhc--CCCCHHHHH
Q psy15328 19 TKLERGREGR--DGRRDAEIQ 37 (76)
Q Consensus 19 ~~~~~~l~~~--T~Fs~~EI~ 37 (76)
..|.+||... -+||++|.+
T Consensus 22 ~~l~~QL~se~ge~Ft~e~A~ 42 (48)
T PF07553_consen 22 QGLYDQLTSEYGEGFTEEEAQ 42 (48)
T ss_pred HHHHHHHHhhcccCCCHHHHH
Confidence 4455555544 456666654
No 167
>PF01614 IclR: Bacterial transcriptional regulator This Pfam family contains some of the members of the iclR family; InterPro: IPR014757 The many bacterial transcription regulation proteins which bind DNA through a 'helix-turn-helix' motif can be classified into subfamilies on the basis of sequence similarities. One of these subfamilies, called 'iclR', groups several proteins including: gylR, a possible activator protein for the gylABX glycerol operon in Streptomyces. iclR, the repressor of the acetate operon (also known as glyoxylate bypass operon) in Escherichia coli and Salmonella typhimurium. These proteins have a Helix-Turn-Helix motif at the N terminus that is similar to that of other DNA-binding proteins [].; PDB: 3MQ0_A 2IA2_D 3OBF_B 2G7U_C 3R4K_A 1TF1_D 1MKM_A 1YSP_A 3BJN_A 1TD5_A ....
Probab=21.79 E-value=2e+02 Score=17.75 Aligned_cols=41 Identities=15% Similarity=0.032 Sum_probs=26.1
Q ss_pred cchhHHHhhhhhcCCCCHHHHHHHHHH--HhhhCCCCCCCHHHHHHHHh
Q psy15328 16 NSGTKLERGREGRDGRRDAEIQEWYKG--FLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 16 ~~~~~~~~~l~~~T~Fs~~EI~~wYr~--FkkecPsG~it~~eFk~iy~ 62 (76)
++|.+|.- ..+++++..++.. |...-+....+.++|.+-..
T Consensus 19 A~Gr~lLA------~~~~~~~~~~l~~~~~~~~~~~~~~~~~~l~~~l~ 61 (129)
T PF01614_consen 19 ASGRALLA------FLPEEEREALLARLPLEPYTERTITDPEELRQELA 61 (129)
T ss_dssp HHHHHHHC------TSGHHHHHHHHHHHTSSBSSTTSCCSHHHHHHHHH
T ss_pred HHHHHHHH------hCCHHHHHHHHhccccccccCCCCCCHHHHHHHHH
Confidence 45666655 5789999999998 44333333344677655443
No 168
>TIGR00987 himA integration host factor, alpha subunit. This protein forms a site-specific DNA-binding heterodimer with the integration host factor beta subunit. It is closely related to the DNA-binding protein HU.
Probab=21.78 E-value=1.3e+02 Score=18.44 Aligned_cols=27 Identities=15% Similarity=0.111 Sum_probs=23.3
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
..|.+.+...++++..+++.....|..
T Consensus 5 ~eli~~ia~~~~~s~~~v~~vv~~~~~ 31 (96)
T TIGR00987 5 AEMSEYLFDELGLSKREAKELVELFFE 31 (96)
T ss_pred HHHHHHHHHHhCcCHHHHHHHHHHHHH
Confidence 457788889999999999999999975
No 169
>PF13373 DUF2407_C: DUF2407 C-terminal domain
Probab=21.56 E-value=88 Score=21.34 Aligned_cols=21 Identities=24% Similarity=0.175 Sum_probs=18.1
Q ss_pred cCCCCHHHHHHHHHHHhhhCC
Q psy15328 28 RDGRRDAEIQEWYKGFLKDCP 48 (76)
Q Consensus 28 ~T~Fs~~EI~~wYr~FkkecP 48 (76)
..+||++||..|-+.|.....
T Consensus 8 ~~GFS~~eI~~LR~QF~~~~~ 28 (140)
T PF13373_consen 8 SAGFSPEEIQDLRSQFHSIYG 28 (140)
T ss_pred HcCCCHHHHHHHHHHHHHHhc
Confidence 449999999999999998665
No 170
>PF13758 Prefoldin_3: Prefoldin subunit
Probab=21.54 E-value=2.4e+02 Score=18.55 Aligned_cols=40 Identities=20% Similarity=0.281 Sum_probs=31.2
Q ss_pred HHHhhhhhc---CCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 20 KLERGREGR---DGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 20 ~~~~~l~~~---T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
.|..++.+. ..-+.++|.+.-|.|. .-.++..|++.|++.
T Consensus 16 ~LKEEi~~l~~~~~~~~e~l~~i~r~f~----g~lv~~kEi~~ilG~ 58 (99)
T PF13758_consen 16 GLKEEIEALPEDDDATREDLLRIRRDFG----GSLVTEKEIKEILGE 58 (99)
T ss_pred HHHHHHHhccccCCCCHHHHHHHHHhcC----cccccHHHHHHHhCC
Confidence 355555555 4688999999999995 337899999999887
No 171
>KOG0484|consensus
Probab=21.48 E-value=1.2e+02 Score=20.64 Aligned_cols=33 Identities=24% Similarity=0.287 Sum_probs=24.7
Q ss_pred hhhhcCCCCHHHHHHHHHHHhh-hCCCCCCCHHHH
Q psy15328 24 GREGRDGRRDAEIQEWYKGFLK-DCPTGHLSVEEF 57 (76)
Q Consensus 24 ~l~~~T~Fs~~EI~~wYr~Fkk-ecPsG~it~~eF 57 (76)
|..-.|-|+..+|++|=|-|.. .+|+ .-|++|.
T Consensus 17 QRRIRTTFTS~QLkELErvF~ETHYPD-IYTREEi 50 (125)
T KOG0484|consen 17 QRRIRTTFTSAQLKELERVFAETHYPD-IYTREEI 50 (125)
T ss_pred hhhhhhhhhHHHHHHHHHHHHhhcCCc-chhHHHH
Confidence 4567799999999999999998 4554 3455544
No 172
>cd07967 OBF_DNA_ligase_III The Oligonucleotide/oligosaccharide binding (OB)-fold domain of ATP-dependent DNA ligase III is a DNA-binding module that is part of the catalytic core unit. ATP-dependent polynucleotide ligases catalyze phosphodiester bond formation using nicked nucleic acid substrates with the high energy nucleotide of ATP as a cofactor in a three step reaction mechanism. DNA ligases play a vital role in the diverse processes of DNA replication, recombination and repair. ATP-dependent ligases are present in many organisms such as viruses, bacteriohages, eukarya, archaea and bacteria. There are three classes of ATP-dependent DNA ligases in eukaryotic cells (I, III and IV). DNA ligase III is not found in lower eukaryotes and is present both in the nucleus and mitochondria. It has several isoforms; two splice forms, III-alpha and III-beta, differ in their carboxy-terminal sequences. DNA ligase III-beta is believed to play a role in homologous recombination during meiotic proph
Probab=21.44 E-value=91 Score=20.95 Aligned_cols=20 Identities=20% Similarity=0.265 Sum_probs=17.7
Q ss_pred hhcCCCCHHHHHHHHHHHhh
Q psy15328 26 EGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkk 45 (76)
+-.|+||.++++++.+.+..
T Consensus 43 kVGTGfs~~~l~~l~~~L~~ 62 (139)
T cd07967 43 KCGNGHDDATLARLQKELKM 62 (139)
T ss_pred EECCCCCHHHHHHHHHHhhh
Confidence 46799999999999999975
No 173
>PF14454 Prok_Ub: Prokaryotic Ubiquitin
Probab=21.40 E-value=1e+02 Score=18.67 Aligned_cols=22 Identities=32% Similarity=0.511 Sum_probs=19.4
Q ss_pred hCCCCCCCHHHHHHHHhhcCCC
Q psy15328 46 DCPTGHLSVEEFKKIYGNFFPY 67 (76)
Q Consensus 46 ecPsG~it~~eFk~iy~~~FP~ 67 (76)
..|+-.++.++-+++|+..+|-
T Consensus 18 ~DP~p~~spe~V~~~ya~~YPe 39 (65)
T PF14454_consen 18 PDPNPSLSPEEVRDFYAAQYPE 39 (65)
T ss_pred CCCCCCCCHHHHHHHHhhhChh
Confidence 4688899999999999999984
No 174
>PF05603 DUF775: Protein of unknown function (DUF775); InterPro: IPR008493 This family consists of several eukaryotic proteins of unknown function.
Probab=21.28 E-value=68 Score=22.97 Aligned_cols=18 Identities=11% Similarity=0.383 Sum_probs=14.4
Q ss_pred CCCCHHHHHHHHHHHhhh
Q psy15328 29 DGRRDAEIQEWYKGFLKD 46 (76)
Q Consensus 29 T~Fs~~EI~~wYr~Fkke 46 (76)
.-.+-+-+++||++|.++
T Consensus 177 ~~VP~~~~~~W~~kFe~K 194 (202)
T PF05603_consen 177 EVVPLSVFDKWWEKFERK 194 (202)
T ss_pred eEEeHHHHHHHHHHHHHH
Confidence 346777899999999873
No 175
>PRK00285 ihfA integration host factor subunit alpha; Reviewed
Probab=20.69 E-value=1.5e+02 Score=18.20 Aligned_cols=27 Identities=19% Similarity=0.201 Sum_probs=23.9
Q ss_pred hHHHhhhhhcCCCCHHHHHHHHHHHhh
Q psy15328 19 TKLERGREGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 19 ~~~~~~l~~~T~Fs~~EI~~wYr~Fkk 45 (76)
..|.+.+...+.++..++......|..
T Consensus 6 ~el~~~ia~~~~~s~~~v~~vl~~~~~ 32 (99)
T PRK00285 6 ADLAEALFEKVGLSKREAKELVELFFE 32 (99)
T ss_pred HHHHHHHHHHhCcCHHHHHHHHHHHHH
Confidence 457788889999999999999999976
No 176
>COG1043 LpxA Acyl-[acyl carrier protein]
Probab=20.66 E-value=96 Score=23.73 Aligned_cols=37 Identities=22% Similarity=0.201 Sum_probs=28.3
Q ss_pred hcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHhh
Q psy15328 27 GRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYGN 63 (76)
Q Consensus 27 ~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~~ 63 (76)
..-.|++++|..|-|.||.-+-+|.--.+-.+.+...
T Consensus 201 kRrgf~~e~i~alr~ayk~lfr~~~~~~e~~~~i~~~ 237 (260)
T COG1043 201 KRRGFSREEIHALRKAYKLLFRSGLTLREALEEIAEE 237 (260)
T ss_pred eccCCCHHHHHHHHHHHHHHeeCCCCHHHHHHHHHHH
Confidence 3458999999999999999888886555555555444
No 177
>PF09820 AAA-ATPase_like: Predicted AAA-ATPase; InterPro: IPR018631 This entry is predicted to be an AAA-ATPase domain []. It is usually found together with IPR012547 from INTERPRO.
Probab=20.62 E-value=1.5e+02 Score=21.72 Aligned_cols=40 Identities=18% Similarity=0.243 Sum_probs=26.9
Q ss_pred hhhhcCCCCHHHHHHHHHHHhhhCCCCC---CCHHHHHHHHhh
Q psy15328 24 GREGRDGRRDAEIQEWYKGFLKDCPTGH---LSVEEFKKIYGN 63 (76)
Q Consensus 24 ~l~~~T~Fs~~EI~~wYr~FkkecPsG~---it~~eFk~iy~~ 63 (76)
....--+||++||+.+.+....+.+... ...++.+.-|--
T Consensus 218 ~f~~~~GFT~~Ev~~ll~~~~~~~~~~~~~~~~~~~lk~wYdG 260 (284)
T PF09820_consen 218 RFSEYFGFTEEEVETLLKYYIENLAEEQDREELLEELKEWYDG 260 (284)
T ss_pred hHhhhcCcCHHHHHHHHHHHHHHhhhccchHHHHHHHHHHcCC
Confidence 5556779999999999999977655333 333444444443
No 178
>PF13011 LZ_Tnp_IS481: leucine-zipper of insertion element IS481
Probab=20.45 E-value=1.2e+02 Score=19.36 Aligned_cols=29 Identities=10% Similarity=0.188 Sum_probs=24.4
Q ss_pred hhcCCCCHHHHHHHHHHHhhhCCCCCCCH
Q psy15328 26 EGRDGRRDAEIQEWYKGFLKDCPTGHLSV 54 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~FkkecPsG~it~ 54 (76)
.+.-+.|..-..+|..+|..+.+.|..+.
T Consensus 32 A~~~gVS~~Ta~kW~~Ryra~G~~GL~DR 60 (85)
T PF13011_consen 32 AAEFGVSRRTAYKWLARYRAEGEAGLQDR 60 (85)
T ss_pred HHHhCCCHHHHHHHHHHHHHcCccccccc
Confidence 45567899999999999999999888753
No 179
>TIGR01565 homeo_ZF_HD homeobox domain, ZF-HD class. This model represents a class of homoebox domain that differs substantially from the typical homoebox domain described in pfam model pfam00046. It is found in both C4 and C3 plants.
Probab=20.35 E-value=1.2e+02 Score=17.84 Aligned_cols=20 Identities=10% Similarity=-0.124 Sum_probs=17.9
Q ss_pred hhcCCCCHHHHHHHHHHHhh
Q psy15328 26 EGRDGRRDAEIQEWYKGFLK 45 (76)
Q Consensus 26 ~~~T~Fs~~EI~~wYr~Fkk 45 (76)
...|.||.+++..|-+.|.+
T Consensus 3 R~RT~Ft~~Q~~~Le~~fe~ 22 (58)
T TIGR01565 3 RRRTKFTAEQKEKMRDFAEK 22 (58)
T ss_pred CCCCCCCHHHHHHHHHHHHH
Confidence 35799999999999999987
No 180
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=20.21 E-value=1.1e+02 Score=13.73 Aligned_cols=16 Identities=19% Similarity=0.312 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHhhhCC
Q psy15328 33 DAEIQEWYKGFLKDCP 48 (76)
Q Consensus 33 ~~EI~~wYr~FkkecP 48 (76)
.++++.+|.+-.+.||
T Consensus 3 ~~~~r~i~e~~l~~~~ 18 (33)
T smart00386 3 IERARKIYERALEKFP 18 (33)
T ss_pred HHHHHHHHHHHHHHCC
Confidence 4677889999999988
No 181
>PF11508 DUF3218: Protein of unknown function (DUF3218); InterPro: IPR021593 This family of proteins with unknown function appears to be restricted to Pseudomonas. ; PDB: 2IGS_H.
Probab=20.13 E-value=66 Score=23.69 Aligned_cols=38 Identities=32% Similarity=0.614 Sum_probs=23.9
Q ss_pred HHHHHHHHHhhhCCCCCCCHHH-H-------HHHHhhcCCCCCCCC
Q psy15328 35 EIQEWYKGFLKDCPTGHLSVEE-F-------KKIYGNFFPYGDASK 72 (76)
Q Consensus 35 EI~~wYr~FkkecPsG~it~~e-F-------k~iy~~~FP~Gda~~ 72 (76)
-|--+||+|.+.|..|..-++- . ..+.-.|-|.||.++
T Consensus 11 Sl~~lYKGlanqCsPG~vFPEaqt~EAWDIPLrLHPeFvPgGDvsk 56 (213)
T PF11508_consen 11 SLSNLYKGLANQCSPGQVFPEAQTIEAWDIPLRLHPEFVPGGDVSK 56 (213)
T ss_dssp HHHHHHHHHHHHH-TTS---HHHHHHHHHHHHHH-GGGSGGG-GGG
T ss_pred hHHHHHHHHHhcCCCCCCCcchhhhhhcccceeeccccCCCCchhH
Confidence 4678999999999999876542 1 234567778888765
No 182
>PF05347 Complex1_LYR: Complex 1 protein (LYR family); InterPro: IPR008011 This family of short proteins includes proteins from the NADH-ubiquinone oxidoreductase complex I. The family includes the B14 subunit from bovine NADH-ubiquinone oxidoreductase B14 subunit Q02366 from SWISSPROT, and the B22 subunit from the human enzyme Q9Y6M9 from SWISSPROT. The family has been named LYR after a highly conserved tripeptide motif close to the N terminus of these proteins. Members of this family also found in yeast which do contain this complex. In these organisms they are believed to be be required for iron-sulphur custer biogenesis.
Probab=20.10 E-value=1.3e+02 Score=16.47 Aligned_cols=30 Identities=17% Similarity=0.319 Sum_probs=18.7
Q ss_pred HHHHHHHHHhh---hCCCCCCCHHHHHHHHhhcC
Q psy15328 35 EIQEWYKGFLK---DCPTGHLSVEEFKKIYGNFF 65 (76)
Q Consensus 35 EI~~wYr~Fkk---ecPsG~it~~eFk~iy~~~F 65 (76)
++-+|||.+.. .+|... ....+......-|
T Consensus 2 ~vl~LYR~lLR~~~~~~~~~-~r~~~~~~iR~~F 34 (59)
T PF05347_consen 2 RVLSLYRQLLRAARSFPDDS-EREYIRAEIRQEF 34 (59)
T ss_pred hHHHHHHHHHHHHHHcCCcc-hHHHHHHHHHHHH
Confidence 56788999888 356555 4555555444444
No 183
>PF00767 Poty_coat: Potyvirus coat protein; InterPro: IPR001592 This protease is found in genome polyproteins of potyviruses. The genome polyprotein contains: N-terminal protein (P1), helper component protease (3.4.22 from EC, HC-PRO), protein P3, 6KD protein (6K1), cytoplasmic inclusion protein (CI), 6KD protein 2 (6K2), genome-linked protein (VPG), nuclear inclusion protein A (3.4.22 from EC), nuclear inclusion protein B (2.7.7.48 from EC) and coat protein (CP). The coat protein is at the C terminus of the polyprotein.; GO: 0019028 viral capsid
Probab=20.06 E-value=1.9e+02 Score=21.55 Aligned_cols=37 Identities=14% Similarity=0.368 Sum_probs=29.2
Q ss_pred hhhhhcCCCCHHHHHHHHHHHhhhCCCCCCCHHHHHHHHh
Q psy15328 23 RGREGRDGRRDAEIQEWYKGFLKDCPTGHLSVEEFKKIYG 62 (76)
Q Consensus 23 ~~l~~~T~Fs~~EI~~wYr~FkkecPsG~it~~eFk~iy~ 62 (76)
+.....|.-|..++..||.+.++++- ++.+++..|..
T Consensus 48 Q~disNtrATq~Qf~~W~e~Vk~~y~---v~d~~m~iil~ 84 (237)
T PF00767_consen 48 QEDISNTRATQSQFEAWYEAVKKDYG---VTDDQMQIILN 84 (237)
T ss_pred hhhhhcccchHHHHHHHHHHhhhhcC---cChHHHHHHHH
Confidence 34457888999999999999999886 67776666644
No 184
>PF04994 TfoX_C: TfoX C-terminal domain; InterPro: IPR007077 This domain is found in a number of bacterial proteins including the TfoX gene product of Haemophilus influenzae. TfoX may play a key role in the development of genetic competence by regulating the expression of late competence-specific genes []. This family corresponds to the C-terminal presumed domain of TfoX. The domain is found in association with the N-terminal domain in some, but not all members of this group, suggesting this is an autonomous and functionally unrelated domain. For example it is found associated with Q9JZR1 from SWISSPROT in IPR002125 from INTERPRO.; PDB: 3BQT_A 3MAB_A.
Probab=20.00 E-value=1.2e+02 Score=18.54 Aligned_cols=33 Identities=24% Similarity=0.230 Sum_probs=21.9
Q ss_pred cchhHHHhhhhhcCCCCHHHHHHH-----HHHHhhhCC
Q psy15328 16 NSGTKLERGREGRDGRRDAEIQEW-----YKGFLKDCP 48 (76)
Q Consensus 16 ~~~~~~~~~l~~~T~Fs~~EI~~w-----Yr~FkkecP 48 (76)
|.|+++++.|...-=-|.+|++++ |.+.++..+
T Consensus 10 Nig~~~e~~L~~vGI~t~~~L~~~Ga~~a~~~Lk~~~~ 47 (81)
T PF04994_consen 10 NIGPKSERMLAKVGIHTVEDLRELGAVEAYLRLKASGP 47 (81)
T ss_dssp T--HHHHHHHHHTT--SHHHHHHHHHHHHHHHHHHH-T
T ss_pred CCCHHHHHHHHHcCCCCHHHHHHhCHHHHHHHHHHHCC
Confidence 788999999888888889998864 666666544
Done!