Query psy5907
Match_columns 56
No_of_seqs 126 out of 1035
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 21:19:53 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5907.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5907hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd01389 MATA_HMG-box MATA_HMG- 99.8 1.8E-20 3.9E-25 92.8 7.5 54 1-54 7-60 (77)
2 cd01388 SOX-TCF_HMG-box SOX-TC 99.8 2.8E-20 6.1E-25 91.2 7.5 54 1-54 7-60 (72)
3 PTZ00199 high mobility group p 99.8 7.9E-20 1.7E-24 93.7 7.4 54 1-54 28-83 (94)
4 cd01390 HMGB-UBF_HMG-box HMGB- 99.8 1.4E-19 3.1E-24 86.7 6.9 54 1-54 6-59 (66)
5 PF00505 HMG_box: HMG (high mo 99.8 1.7E-18 3.7E-23 83.6 7.0 54 1-54 6-59 (69)
6 smart00398 HMG high mobility g 99.8 4E-18 8.6E-23 82.1 7.5 54 1-54 7-60 (70)
7 cd00084 HMG-box High Mobility 99.8 1.1E-17 2.4E-22 79.8 7.4 54 1-54 6-59 (66)
8 KOG0527|consensus 99.7 4.1E-18 8.9E-23 102.4 5.4 54 1-54 68-121 (331)
9 PF09011 HMG_box_2: HMG-box do 99.7 3.8E-16 8.3E-21 76.6 6.7 54 1-54 9-63 (73)
10 KOG0381|consensus 99.6 1.3E-15 2.8E-20 77.6 7.2 54 1-54 28-81 (96)
11 COG5648 NHP6B Chromatin-associ 99.6 1.9E-15 4.1E-20 86.0 6.2 54 1-54 76-129 (211)
12 KOG4715|consensus 99.4 6.2E-13 1.3E-17 79.9 4.6 54 1-54 70-123 (410)
13 KOG3248|consensus 99.2 1.9E-11 4.2E-16 74.0 5.6 54 1-54 197-250 (421)
14 KOG0528|consensus 99.1 3.8E-11 8.2E-16 75.0 0.6 52 1-52 331-382 (511)
15 KOG0526|consensus 99.0 2.9E-10 6.3E-15 72.0 4.1 50 1-54 541-590 (615)
16 KOG2746|consensus 98.6 2.6E-08 5.6E-13 64.4 3.2 54 1-54 187-242 (683)
17 PF06382 DUF1074: Protein of u 98.4 1.4E-06 2.9E-11 49.1 5.7 47 1-51 84-130 (183)
18 PF08073 CHDNT: CHDNT (NUC034) 97.8 0.00013 2.8E-09 34.2 4.7 37 2-38 15-51 (55)
19 PF04690 YABBY: YABBY protein; 97.6 0.00019 4.1E-09 40.4 4.7 38 1-38 127-164 (170)
20 PF14887 HMG_box_5: HMG (high 97.5 0.00091 2E-08 33.4 5.5 50 4-54 12-61 (85)
21 COG5648 NHP6B Chromatin-associ 97.4 0.00012 2.7E-09 42.2 2.2 52 3-54 151-202 (211)
22 PF06244 DUF1014: Protein of u 97.0 0.0029 6.2E-08 34.0 4.5 39 2-40 79-117 (122)
23 KOG3223|consensus 95.6 0.035 7.5E-07 32.1 4.1 45 2-49 171-215 (221)
24 PRK15117 ABC transporter perip 95.4 0.044 9.4E-07 31.7 4.2 33 19-51 66-99 (211)
25 TIGR03481 HpnM hopanoid biosyn 95.1 0.049 1.1E-06 31.2 3.9 33 19-51 62-95 (198)
26 PF05494 Tol_Tol_Ttg2: Toluene 94.8 0.035 7.7E-07 30.7 2.6 33 19-51 36-69 (170)
27 PF06945 DUF1289: Protein of u 92.9 0.46 9.9E-06 21.6 4.0 29 20-53 21-49 (51)
28 PF13875 DUF4202: Domain of un 92.4 0.63 1.4E-05 26.7 4.9 40 2-44 131-170 (185)
29 PF09164 VitD-bind_III: Vitami 90.1 1.3 2.8E-05 21.5 4.6 32 2-33 10-41 (68)
30 PF01352 KRAB: KRAB box; Inte 88.0 0.65 1.4E-05 20.3 2.1 29 23-51 3-32 (41)
31 PF12650 DUF3784: Domain of un 87.0 0.55 1.2E-05 23.8 1.7 16 33-48 25-40 (97)
32 PF11304 DUF3106: Protein of u 86.1 3.4 7.3E-05 21.5 6.1 40 12-51 32-73 (107)
33 PRK10236 hypothetical protein; 84.9 1.5 3.3E-05 26.1 3.0 31 19-49 106-141 (237)
34 PF04769 MAT_Alpha1: Mating-ty 83.5 3.6 7.9E-05 23.9 4.1 42 2-49 50-91 (201)
35 PF11304 DUF3106: Protein of u 83.1 4 8.7E-05 21.3 3.9 17 30-46 34-50 (107)
36 PF13945 NST1: Salt tolerance 82.7 3.3 7.1E-05 23.9 3.7 30 22-51 99-128 (190)
37 PRK12751 cpxP periplasmic stre 81.8 4.3 9.3E-05 22.8 3.9 27 25-51 118-144 (162)
38 PF12881 NUT_N: NUT protein N 81.5 9.8 0.00021 23.8 5.6 49 3-51 232-280 (328)
39 KOG1610|consensus 79.7 7.4 0.00016 24.3 4.7 30 23-52 218-247 (322)
40 PF13412 HTH_24: Winged helix- 77.7 3.5 7.5E-05 17.8 2.3 22 15-36 12-33 (48)
41 PRK10363 cpxP periplasmic repr 74.8 9.3 0.0002 21.7 3.9 29 23-51 110-138 (166)
42 PF15581 Imm35: Immunity prote 73.3 11 0.00023 19.4 3.9 28 21-48 30-57 (93)
43 PF12290 DUF3802: Protein of u 73.0 12 0.00027 19.9 4.7 28 23-50 73-100 (113)
44 cd08317 Death_ank Death domain 72.9 1.9 4E-05 21.3 0.8 22 19-40 3-24 (84)
45 PF05388 Carbpep_Y_N: Carboxyp 72.4 12 0.00025 19.9 3.7 28 23-50 45-72 (113)
46 COG2854 Ttg2D ABC-type transpo 71.0 5 0.00011 23.4 2.3 33 19-51 68-101 (202)
47 PF08367 M16C_assoc: Peptidase 67.6 18 0.00039 21.1 4.2 33 23-55 12-44 (248)
48 COG3313 Predicted Fe-S protein 66.4 8.6 0.00019 19.0 2.3 22 20-46 26-47 (74)
49 PF06628 Catalase-rel: Catalas 66.0 6.4 0.00014 18.7 1.8 19 30-48 13-31 (68)
50 PRK10455 periplasmic protein; 64.7 23 0.0005 19.8 4.1 27 25-51 118-144 (161)
51 PF07813 LTXXQ: LTXXQ motif fa 63.5 16 0.00036 17.8 3.9 25 24-48 75-99 (100)
52 PRK00523 hypothetical protein; 62.6 18 0.00038 17.8 5.0 36 6-41 25-60 (72)
53 PF03672 UPF0154: Uncharacteri 61.4 18 0.00038 17.4 5.1 34 8-41 19-52 (64)
54 PRK14291 chaperone protein Dna 61.1 33 0.00072 21.6 4.7 37 8-44 22-62 (382)
55 PRK12750 cpxP periplasmic repr 60.5 29 0.00062 19.6 4.1 24 28-51 128-151 (170)
56 PRK10266 curved DNA-binding pr 59.4 38 0.00082 20.6 5.2 40 8-47 23-67 (306)
57 PRK14299 chaperone protein Dna 58.6 39 0.00084 20.4 4.8 39 8-46 23-66 (291)
58 PRK14281 chaperone protein Dna 58.5 28 0.0006 22.0 4.1 40 7-46 21-66 (397)
59 PHA03102 Small T antigen; Revi 58.2 26 0.00057 19.5 3.6 37 7-43 25-62 (153)
60 PF02026 RyR: RyR domain; Int 58.0 14 0.0003 18.8 2.3 19 33-51 60-78 (94)
61 PRK14279 chaperone protein Dna 57.7 27 0.00059 22.1 3.9 40 8-47 28-73 (392)
62 PF11521 TFIIE-A_C-term: C-ter 57.5 14 0.00031 18.7 2.2 30 22-52 51-80 (86)
63 PF02376 CUT: CUT domain; Int 57.4 14 0.00031 18.5 2.2 32 19-50 37-70 (87)
64 PRK14283 chaperone protein Dna 57.3 46 0.00099 20.9 4.9 40 7-46 23-67 (378)
65 PRK14280 chaperone protein Dna 55.7 49 0.0011 20.8 4.9 40 8-47 23-67 (376)
66 smart00271 DnaJ DnaJ molecular 55.6 11 0.00023 16.8 1.5 33 8-40 20-58 (60)
67 PRK14296 chaperone protein Dna 55.2 46 0.001 20.9 4.6 39 8-46 23-66 (372)
68 PRK01844 hypothetical protein; 54.3 26 0.00057 17.2 5.1 36 6-41 24-59 (72)
69 KOG1827|consensus 53.4 0.35 7.5E-06 32.3 -5.0 41 2-42 555-595 (629)
70 PF02650 HTH_WhiA: WhiA C-term 52.9 12 0.00025 18.9 1.5 20 13-32 47-66 (85)
71 TIGR00787 dctP tripartite ATP- 51.7 24 0.00051 20.6 2.9 21 31-51 213-233 (257)
72 COG1638 DctP TRAP-type C4-dica 50.1 27 0.00058 21.7 3.0 24 31-54 244-267 (332)
73 PTZ00037 DnaJ_C chaperone prot 49.9 60 0.0013 20.9 4.6 40 7-46 46-87 (421)
74 COG3763 Uncharacterized protei 48.8 34 0.00073 16.8 4.9 34 7-40 25-58 (71)
75 PRK14284 chaperone protein Dna 48.6 53 0.0012 20.7 4.2 39 8-46 20-64 (391)
76 PF11588 DUF3243: Protein of u 48.1 22 0.00047 17.9 2.0 19 27-45 53-71 (81)
77 PRK14285 chaperone protein Dna 47.4 53 0.0011 20.6 4.0 41 7-47 21-67 (365)
78 PLN02150 terpene synthase/cycl 47.1 11 0.00025 19.2 1.0 19 22-40 24-42 (96)
79 PF01630 Glyco_hydro_56: Hyalu 46.1 75 0.0016 20.1 4.5 32 4-35 123-154 (337)
80 PRK14287 chaperone protein Dna 45.5 76 0.0016 19.9 4.9 39 8-46 23-66 (371)
81 cd08805 Death_ank1 Death domai 45.4 6.2 0.00013 19.8 -0.2 21 20-40 4-24 (84)
82 PRK14276 chaperone protein Dna 45.1 78 0.0017 19.9 4.6 39 8-46 23-66 (380)
83 KOG0713|consensus 45.1 79 0.0017 20.0 4.9 40 8-47 35-79 (336)
84 PRK09897 hypothetical protein; 44.5 53 0.0012 21.8 3.8 29 23-51 353-381 (534)
85 PRK14294 chaperone protein Dna 43.7 80 0.0017 19.7 4.9 39 8-46 23-67 (366)
86 PF00226 DnaJ: DnaJ domain; I 43.0 33 0.00071 15.4 2.1 38 8-45 19-63 (64)
87 cd01670 Death Death Domain: a 42.7 14 0.0003 17.4 0.8 17 24-40 3-19 (79)
88 PF08373 RAP: RAP domain; Int 42.6 22 0.00047 15.8 1.5 15 33-47 41-56 (58)
89 PRK14298 chaperone protein Dna 42.5 76 0.0017 20.0 4.2 37 8-44 24-64 (377)
90 PF03480 SBP_bac_7: Bacterial 42.4 28 0.0006 20.5 2.2 20 32-51 214-233 (286)
91 PF02936 COX4: Cytochrome c ox 41.9 15 0.00032 20.2 0.9 22 23-44 29-54 (142)
92 PRK05439 pantothenate kinase; 41.2 20 0.00043 22.2 1.5 49 1-50 8-58 (311)
93 PF05914 RIB43A: RIB43A; Inte 40.8 4.9 0.00011 25.4 -1.2 34 14-47 252-287 (379)
94 PRK14292 chaperone protein Dna 40.7 89 0.0019 19.6 4.3 39 8-46 21-64 (371)
95 PF09829 DUF2057: Uncharacteri 40.0 39 0.00084 19.0 2.5 24 23-46 164-187 (189)
96 smart00351 PAX Paired Box doma 39.4 58 0.0013 17.1 3.0 20 13-32 86-105 (125)
97 cd08318 Death_NMPP84 Death dom 39.1 17 0.00037 18.0 0.9 16 23-38 10-25 (86)
98 PRK14295 chaperone protein Dna 38.6 1E+02 0.0022 19.5 4.4 40 8-47 28-73 (389)
99 PRK14290 chaperone protein Dna 38.5 83 0.0018 19.7 3.9 36 8-43 22-63 (365)
100 cd08779 Death_PIDD Death Domai 38.4 15 0.00032 18.3 0.5 17 23-39 5-21 (86)
101 PF14098 SSPI: Small, acid-sol 38.3 49 0.0011 15.9 2.3 17 32-48 44-60 (65)
102 COG4735 Uncharacterized protei 38.0 81 0.0018 18.7 3.5 33 19-51 35-67 (211)
103 PF01710 HTH_Tnp_IS630: Transp 38.0 48 0.001 17.3 2.5 20 12-31 63-82 (119)
104 PRK14278 chaperone protein Dna 37.2 1.1E+02 0.0023 19.3 4.3 39 8-46 22-65 (378)
105 PF11616 EZH2_WD-Binding: WD r 37.2 32 0.00069 14.0 1.4 14 24-37 12-25 (30)
106 PF06242 DUF1013: Protein of u 36.6 33 0.00071 19.0 1.7 18 15-32 78-95 (140)
107 PF11334 DUF3136: Protein of u 36.5 55 0.0012 15.7 3.3 30 2-36 9-38 (64)
108 cd00922 Cyt_c_Oxidase_IV Cytoc 36.5 19 0.00041 19.6 0.8 14 33-46 43-56 (136)
109 PRK14286 chaperone protein Dna 36.4 1.1E+02 0.0024 19.2 5.0 40 8-47 23-68 (372)
110 COG2938 Uncharacterized conser 36.3 66 0.0014 16.6 2.8 24 28-51 35-58 (94)
111 PF03810 IBN_N: Importin-beta 35.3 52 0.0011 15.1 2.4 23 25-47 39-69 (77)
112 PF11903 DUF3423: Protein of u 35.2 60 0.0013 15.8 2.8 20 13-32 37-56 (72)
113 PRK14288 chaperone protein Dna 34.9 1.2E+02 0.0026 19.1 5.2 39 8-46 22-66 (369)
114 PF09556 RE_HaeIII: HaeIII res 34.9 16 0.00035 22.6 0.4 28 24-51 150-177 (300)
115 cd07911 RNRR2_Rv0233_like Ribo 34.8 41 0.00089 20.1 2.1 16 33-48 35-50 (280)
116 PRK09731 putative general secr 34.2 95 0.0021 17.8 3.6 34 3-44 6-39 (178)
117 KOG0715|consensus 33.9 1.1E+02 0.0025 18.6 4.6 37 9-45 63-103 (288)
118 PRK14297 chaperone protein Dna 33.5 1E+02 0.0022 19.5 3.7 39 8-46 23-67 (380)
119 PRK14289 chaperone protein Dna 33.5 1.3E+02 0.0027 19.0 4.7 40 8-47 24-69 (386)
120 PF13542 HTH_Tnp_ISL3: Helix-t 33.3 37 0.0008 14.6 1.4 32 6-37 12-44 (52)
121 smart00413 ETS erythroblast tr 33.3 26 0.00056 17.8 1.0 24 30-53 39-62 (87)
122 smart00206 NTR Tissue inhibito 33.1 26 0.00056 20.0 1.0 18 33-50 101-118 (172)
123 PF12376 DUF3654: Protein of u 33.1 63 0.0014 17.9 2.5 21 35-55 6-26 (137)
124 TIGR02349 DnaJ_bact chaperone 32.8 1.2E+02 0.0027 18.7 4.3 39 8-46 19-62 (354)
125 PHA02947 S-S bond formation pa 32.7 49 0.0011 19.6 2.1 28 18-45 58-85 (215)
126 PF02216 B: B domain; InterPr 32.6 60 0.0013 15.1 2.4 17 35-51 20-36 (54)
127 PHA02955 hypothetical protein; 32.4 49 0.0011 19.6 2.1 28 18-45 58-85 (213)
128 PF00356 LacI: Bacterial regul 32.4 53 0.0012 14.4 2.9 32 19-51 9-40 (46)
129 cd08804 Death_ank2 Death domai 32.1 20 0.00042 17.8 0.4 20 21-40 5-24 (84)
130 cd06257 DnaJ DnaJ domain or J- 31.6 52 0.0011 14.0 3.5 31 8-38 19-54 (55)
131 cd09071 FAR_C C-terminal domai 31.4 33 0.00071 16.7 1.2 15 31-45 53-67 (92)
132 PF04363 DUF496: Protein of un 31.3 71 0.0015 16.5 2.3 17 19-35 47-63 (95)
133 cd03585 NTR_TIMP NTR domain, T 31.2 28 0.0006 20.1 1.0 18 33-50 106-123 (183)
134 PRK05423 hypothetical protein; 31.1 87 0.0019 16.4 3.0 18 19-36 54-71 (104)
135 PRK14277 chaperone protein Dna 31.0 1.4E+02 0.0031 18.8 4.8 39 8-46 24-68 (386)
136 PF06424 PRP1_N: PRP1 splicing 31.0 48 0.001 18.1 1.8 26 14-39 95-126 (133)
137 PHA02662 ORF131 putative membr 30.9 55 0.0012 19.6 2.1 28 18-45 71-98 (226)
138 PF10675 DUF2489: Protein of u 30.5 76 0.0017 17.1 2.6 19 30-48 94-112 (131)
139 cd08319 Death_RAIDD Death doma 30.1 24 0.00053 17.6 0.5 16 23-38 5-20 (83)
140 cd08306 Death_FADD Fas-associa 30.1 27 0.00059 17.3 0.7 17 22-38 4-20 (86)
141 COG4282 SMI1 Protein involved 29.2 1.2E+02 0.0027 17.5 3.3 29 18-46 30-58 (191)
142 PF12926 MOZART2: Mitotic-spin 28.8 91 0.002 15.9 3.7 28 19-46 5-32 (88)
143 PF07757 AdoMet_MTase: Predict 28.6 70 0.0015 17.1 2.1 25 27-51 19-43 (112)
144 PF11460 DUF3007: Protein of u 28.3 1E+02 0.0022 16.3 4.1 20 29-48 82-101 (104)
145 PRK13965 ribonucleotide-diphos 28.2 61 0.0013 20.1 2.1 17 33-49 60-76 (335)
146 PRK14293 chaperone protein Dna 27.9 1.6E+02 0.0035 18.5 4.4 41 7-47 21-66 (374)
147 PF13404 HTH_AsnC-type: AsnC-t 27.9 63 0.0014 13.8 2.2 16 16-31 13-28 (42)
148 PF04255 DUF433: Protein of un 27.9 71 0.0015 14.4 2.9 19 12-30 35-53 (56)
149 PF14647 FAM91_N: FAM91 N-term 27.8 1.2E+02 0.0027 19.0 3.3 31 1-31 87-122 (308)
150 TIGR00647 MG103 conserved hypo 27.7 74 0.0016 19.5 2.4 20 12-31 236-255 (279)
151 TIGR03092 SASP_sspI small, aci 27.6 85 0.0018 15.2 2.2 18 31-48 42-59 (65)
152 PF11625 DUF3253: Protein of u 27.5 94 0.002 15.7 3.2 38 13-50 15-55 (83)
153 PLN02492 ribonucleoside-diphos 27.4 63 0.0014 19.8 2.1 17 32-48 45-61 (324)
154 PF07800 DUF1644: Protein of u 27.2 30 0.00066 19.6 0.7 30 8-37 122-151 (162)
155 PF00887 ACBP: Acyl CoA bindin 27.0 89 0.0019 15.2 3.7 49 3-52 30-82 (87)
156 PRK06946 lipid A biosynthesis 27.0 1.3E+02 0.0028 18.0 3.3 26 13-38 49-74 (293)
157 PRK07920 lipid A biosynthesis 27.0 85 0.0018 18.8 2.6 26 13-38 44-69 (298)
158 TIGR02208 lipid_A_msbB lipid A 26.9 1.3E+02 0.0028 18.1 3.3 26 13-38 60-85 (305)
159 PF14374 Ribos_L4_asso_C: 60S 26.8 33 0.00072 17.1 0.7 14 19-32 8-21 (80)
160 PF11961 DUF3475: Domain of un 26.8 82 0.0018 14.7 2.1 21 26-46 9-29 (57)
161 PRK13967 nrdF1 ribonucleotide- 26.7 67 0.0014 19.8 2.1 17 33-49 47-63 (322)
162 PRK05645 lipid A biosynthesis 26.7 95 0.0021 18.5 2.7 26 13-38 49-74 (295)
163 PRK08734 lipid A biosynthesis 26.5 96 0.0021 18.7 2.7 26 13-38 50-75 (305)
164 PF12550 GCR1_C: Transcription 26.4 81 0.0017 15.3 2.1 22 19-40 24-45 (81)
165 PF13867 SAP30_Sin3_bdg: Sin3 26.4 77 0.0017 14.3 2.8 26 19-44 19-44 (53)
166 PRK10767 chaperone protein Dna 26.4 1.7E+02 0.0037 18.3 4.7 39 8-46 23-67 (371)
167 smart00509 TFS2N Domain in the 26.2 47 0.001 16.0 1.2 21 17-37 52-72 (75)
168 PF01527 HTH_Tnp_1: Transposas 26.2 35 0.00076 15.8 0.7 25 6-30 7-33 (76)
169 PF01047 MarR: MarR family; I 26.1 37 0.0008 14.9 0.8 16 16-31 13-28 (59)
170 cd01776 Rin1_RA Ubiquitin doma 25.8 1.1E+02 0.0023 15.7 2.8 23 18-40 21-43 (87)
171 PRK09614 nrdF ribonucleotide-d 25.5 73 0.0016 19.4 2.1 16 33-48 47-62 (324)
172 PF05047 L51_S25_CI-B8: Mitoch 25.0 76 0.0017 13.8 2.0 17 6-22 4-20 (52)
173 COG2926 Uncharacterized protei 24.9 1.2E+02 0.0026 15.9 3.0 18 19-36 54-71 (109)
174 TIGR00534 OpcA opcA protein. T 24.9 83 0.0018 19.6 2.3 19 18-36 5-23 (311)
175 PRK13966 nrdF2 ribonucleotide- 24.9 75 0.0016 19.6 2.1 16 33-48 49-64 (324)
176 PRK08943 lipid A biosynthesis 24.8 1.5E+02 0.0032 18.0 3.3 26 13-38 69-94 (314)
177 COG0484 DnaJ DnaJ-class molecu 24.8 2E+02 0.0043 18.5 4.5 40 8-47 23-67 (371)
178 PRK14282 chaperone protein Dna 24.7 1.9E+02 0.0041 18.2 4.0 39 8-46 23-68 (369)
179 cd08780 Death_TRADD Death Doma 24.6 44 0.00096 17.1 0.9 17 23-39 5-21 (90)
180 PRK14301 chaperone protein Dna 24.2 1.9E+02 0.0042 18.2 4.1 39 8-46 23-67 (373)
181 PF12339 DNAJ_related: DNA-J r 24.1 86 0.0019 17.1 2.0 18 20-37 114-131 (132)
182 PTZ00211 ribonucleoside-diphos 24.1 79 0.0017 19.5 2.1 17 32-48 56-72 (330)
183 PF10407 Cytokin_check_N: Cdc1 24.1 87 0.0019 15.3 1.9 23 19-41 11-33 (73)
184 TIGR02269 Myxococcus xanthus p 23.6 1.7E+02 0.0037 17.3 3.8 24 9-32 174-197 (211)
185 PF07055 Eno-Rase_FAD_bd: Enoy 23.4 62 0.0013 15.6 1.3 19 23-41 16-34 (65)
186 PF10077 DUF2314: Uncharacteri 23.3 1.4E+02 0.003 16.1 2.8 20 29-48 109-128 (133)
187 cd08803 Death_ank3 Death domai 22.8 31 0.00067 17.2 0.2 24 22-45 6-29 (84)
188 cd07765 KRAB_A-box KRAB (Krupp 22.6 51 0.0011 11.0 3.0 18 32-49 13-30 (40)
189 PRK08733 lipid A biosynthesis 22.1 1.8E+02 0.0038 17.6 3.3 26 13-38 64-89 (306)
190 PRK08326 ribonucleotide-diphos 21.9 95 0.0021 19.0 2.1 17 32-48 51-67 (311)
191 PF09349 OHCU_decarbox: OHCU d 21.7 1.6E+02 0.0034 16.2 3.1 28 18-45 31-58 (159)
192 PF11953 DUF3470: Domain of un 21.5 37 0.00081 14.9 0.3 19 22-40 3-21 (43)
193 cd00183 TFIIS_I N-terminal dom 21.2 65 0.0014 15.5 1.1 21 17-37 54-74 (76)
194 PRK06860 lipid A biosynthesis 21.1 2E+02 0.0042 17.4 3.3 27 12-38 63-89 (309)
195 PF04031 Las1: Las1-like ; In 21.1 98 0.0021 17.1 1.9 29 2-30 83-111 (154)
196 PRK09101 nrdB ribonucleotide-d 20.9 1E+02 0.0022 19.5 2.1 16 33-48 62-77 (376)
197 cd08777 Death_RIP1 Death Domai 20.9 47 0.001 16.5 0.6 17 23-39 5-21 (86)
198 PRK02955 small acid-soluble sp 20.8 1.2E+02 0.0027 14.7 2.2 17 32-48 46-62 (68)
199 PF10562 CaM_bdg_C0: Calmoduli 20.6 85 0.0018 12.7 1.4 14 23-36 13-26 (29)
200 cd03577 NTR_TIMP_like NTR doma 20.6 41 0.00089 17.2 0.3 17 33-49 97-113 (116)
201 cd00155 RasGEF Guanine nucleot 20.6 1.4E+02 0.0031 17.0 2.6 22 29-50 111-132 (237)
202 TIGR02147 Fsuc_second hypothet 20.5 1.9E+02 0.0041 17.6 3.2 27 5-31 12-38 (271)
203 PF04545 Sigma70_r4: Sigma-70, 20.4 91 0.002 13.3 1.5 14 18-31 18-31 (50)
No 1
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.84 E-value=1.8e-20 Score=92.81 Aligned_cols=54 Identities=35% Similarity=0.508 Sum_probs=52.4
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+|||+|+++.+..++.+||++++++|+++||++|+.||+++|.+|.++|++++.
T Consensus 7 naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~ 60 (77)
T cd01389 7 NAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKE 60 (77)
T ss_pred cHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999999999999999999999876
No 2
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.84 E-value=2.8e-20 Score=91.17 Aligned_cols=54 Identities=24% Similarity=0.406 Sum_probs=52.4
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+|||+|+++.+..++.+||++++.+|+++||++|+.||+++|.+|.+++..++.
T Consensus 7 naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~ 60 (72)
T cd01388 7 NAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKE 60 (72)
T ss_pred cHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999999999999999999999886
No 3
>PTZ00199 high mobility group protein; Provisional
Probab=99.82 E-value=7.9e-20 Score=93.67 Aligned_cols=54 Identities=26% Similarity=0.628 Sum_probs=51.8
Q ss_pred ChHHhHHHHHHHHHHHhCCCCC--HHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLK--LWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~--~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
|||++|++++|..|..+||+++ +++|+++||++|+.||+++|.+|.+.|+.++.
T Consensus 28 sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~ 83 (94)
T PTZ00199 28 SAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKV 83 (94)
T ss_pred cHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 6899999999999999999986 89999999999999999999999999999976
No 4
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.81 E-value=1.4e-19 Score=86.69 Aligned_cols=54 Identities=31% Similarity=0.609 Sum_probs=52.2
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+||++|+++.+..++.+||++++.+|++.||++|++||+++|.+|.+.+++++.
T Consensus 6 saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~ 59 (66)
T cd01390 6 SAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKE 59 (66)
T ss_pred cHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999999999999999999999876
No 5
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.78 E-value=1.7e-18 Score=83.61 Aligned_cols=54 Identities=37% Similarity=0.634 Sum_probs=51.6
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+||++|+.+.+..++.+||+++..+|++++|++|++||+++|.+|.+.+.+++.
T Consensus 6 ~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~ 59 (69)
T PF00505_consen 6 NAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKE 59 (69)
T ss_dssp -HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999999999999999999999875
No 6
>smart00398 HMG high mobility group.
Probab=99.77 E-value=4e-18 Score=82.12 Aligned_cols=54 Identities=33% Similarity=0.546 Sum_probs=52.0
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+||++|+++.+..+..+||++++.+|++.||.+|+.||+++|.+|.+.+++++.
T Consensus 7 ~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~ 60 (70)
T smart00398 7 SAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKE 60 (70)
T ss_pred cHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999999999999999999999865
No 7
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.76 E-value=1.1e-17 Score=79.80 Aligned_cols=54 Identities=39% Similarity=0.674 Sum_probs=52.1
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+||++|+++.+..++.+||+++..+|++.+|.+|+.||+++|.+|.+.+++++.
T Consensus 6 ~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~ 59 (66)
T cd00084 6 SAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKE 59 (66)
T ss_pred cHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999999999999999999999875
No 8
>KOG0527|consensus
Probab=99.74 E-value=4.1e-18 Score=102.43 Aligned_cols=54 Identities=26% Similarity=0.450 Sum_probs=52.2
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
.|||+|++..|.++..+||+++++||||.||..|+.|+++||.+|+++|++.|+
T Consensus 68 NAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~ 121 (331)
T KOG0527|consen 68 NAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRA 121 (331)
T ss_pred chhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHH
Confidence 489999999999999999999999999999999999999999999999999875
No 9
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.67 E-value=3.8e-16 Score=76.61 Aligned_cols=54 Identities=24% Similarity=0.447 Sum_probs=48.7
Q ss_pred ChHHhHHHHHHHHHHHh-CCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQ-NLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~-~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+||++|+.+++..++.. .+.....|+++.||..|++||+++|.+|++.|++++.
T Consensus 9 say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~ 63 (73)
T PF09011_consen 9 SAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKE 63 (73)
T ss_dssp SHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHH
Confidence 68999999999999988 7888999999999999999999999999999999886
No 10
>KOG0381|consensus
Probab=99.65 E-value=1.3e-15 Score=77.64 Aligned_cols=54 Identities=30% Similarity=0.561 Sum_probs=51.5
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+||++|+.+.+..++.+||+++..+|++++|++|++|++++|.+|...+..++.
T Consensus 28 sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~ 81 (96)
T KOG0381|consen 28 SAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKE 81 (96)
T ss_pred cHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence 589999999999999999999999999999999999999999999999988775
No 11
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.62 E-value=1.9e-15 Score=86.00 Aligned_cols=54 Identities=24% Similarity=0.611 Sum_probs=52.0
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
||||+|+..+|+.++..+|++++.+|++.+|++|++|++++|++|..++..++.
T Consensus 76 sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~e 129 (211)
T COG5648 76 SAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRE 129 (211)
T ss_pred hHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHH
Confidence 689999999999999999999999999999999999999999999999988775
No 12
>KOG4715|consensus
Probab=99.39 E-value=6.2e-13 Score=79.92 Aligned_cols=54 Identities=76% Similarity=1.375 Sum_probs=52.4
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
++||.|++.+|++|+.+||++..-||.++||.+|..|++++|..|..+++.+|+
T Consensus 70 ~pymrySrkvWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKi 123 (410)
T KOG4715|consen 70 MPYMRYSRKVWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKI 123 (410)
T ss_pred chhhHHhhhhhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHH
Confidence 579999999999999999999999999999999999999999999999999887
No 13
>KOG3248|consensus
Probab=99.25 E-value=1.9e-11 Score=74.04 Aligned_cols=54 Identities=22% Similarity=0.405 Sum_probs=51.3
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
.|||+|.+++|.+|..++--...++|.++||.+|..||.+|..+|.++|.++++
T Consensus 197 NAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerq 250 (421)
T KOG3248|consen 197 NAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQ 250 (421)
T ss_pred HHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHH
Confidence 389999999999999999878899999999999999999999999999999886
No 14
>KOG0528|consensus
Probab=99.05 E-value=3.8e-11 Score=75.03 Aligned_cols=52 Identities=19% Similarity=0.412 Sum_probs=48.4
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQE 52 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~ 52 (56)
.|||+|.++.|.++...+|++++.+|+++||.+|+.|+..||++|.++-...
T Consensus 331 NAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRL 382 (511)
T KOG0528|consen 331 NAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARL 382 (511)
T ss_pred chhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHH
Confidence 3899999999999999999999999999999999999999999999876653
No 15
>KOG0526|consensus
Probab=99.04 E-value=2.9e-10 Score=72.01 Aligned_cols=50 Identities=18% Similarity=0.440 Sum_probs=47.7
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
+|||+|.+..|..++.+ +.++++|++..|++|+.|+. |.+|++.|+.+|+
T Consensus 541 sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ 590 (615)
T KOG0526|consen 541 SAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQ 590 (615)
T ss_pred hHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHH
Confidence 58999999999999987 99999999999999999999 9999999999987
No 16
>KOG2746|consensus
Probab=98.64 E-value=2.6e-08 Score=64.43 Aligned_cols=54 Identities=22% Similarity=0.340 Sum_probs=51.0
Q ss_pred ChHHhHHHHHH--HHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 1 MPYMRYSRKVW--DQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 1 ~ay~lf~~~~~--~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
++|++|++.++ ..+...||+..+.-|+++||+.|-.|.+.||+.|.+++.+.|.
T Consensus 187 naf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~ 242 (683)
T KOG2746|consen 187 NAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKE 242 (683)
T ss_pred HHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHH
Confidence 47999999999 8999999999999999999999999999999999999998764
No 17
>PF06382 DUF1074: Protein of unknown function (DUF1074); InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=98.41 E-value=1.4e-06 Score=49.14 Aligned_cols=47 Identities=17% Similarity=0.306 Sum_probs=41.6
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
.||+-|..+++.. |.+++..|+....+..|..|++.+|..|..++..
T Consensus 84 naYLNFLReFRrk----h~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~ 130 (183)
T PF06382_consen 84 NAYLNFLREFRRK----HCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPS 130 (183)
T ss_pred hHHHHHHHHHHHH----ccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcch
Confidence 3799999887775 7899999999999999999999999999986543
No 18
>PF08073 CHDNT: CHDNT (NUC034) domain; InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=97.77 E-value=0.00013 Score=34.16 Aligned_cols=37 Identities=16% Similarity=0.360 Sum_probs=33.4
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.|-+|++..|+.+.+.||++..+.+..+++.+|+.-+
T Consensus 15 ~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~ 51 (55)
T PF08073_consen 15 NYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQ 51 (55)
T ss_pred HHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHH
Confidence 3668999999999999999999999999999998543
No 19
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=97.62 E-value=0.00019 Score=40.41 Aligned_cols=38 Identities=24% Similarity=0.432 Sum_probs=36.0
Q ss_pred ChHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 1 MPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
+||..|.++.-+.++..+|+++..|.-...+..|...|
T Consensus 127 saYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p 164 (170)
T PF04690_consen 127 SAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP 164 (170)
T ss_pred hhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence 68999999999999999999999999999999998765
No 20
>PF14887 HMG_box_5: HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=97.47 E-value=0.00091 Score=33.41 Aligned_cols=50 Identities=10% Similarity=0.233 Sum_probs=42.1
Q ss_pred HhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 4 MRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 4 ~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
-+|.++....+...+++-...+. +.+...|++|++.+|-+|+..|.++..
T Consensus 12 e~Wqq~vi~dYla~~~~dr~K~~-kam~~~W~~me~Kekl~WIkKA~EdqK 61 (85)
T PF14887_consen 12 EIWQQSVIGDYLAKFRNDRKKAL-KAMEAQWSQMEKKEKLKWIKKAAEDQK 61 (85)
T ss_dssp HHHHHHHHHHHHHHTTSTHHHHH-HHHHHHHHTTGGGHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhHhHHHHH-HHHHHHHHHhhhhhhhHHHHHHHHHHH
Confidence 46778888888888888777774 599999999999999999999988754
No 21
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=97.41 E-value=0.00012 Score=42.23 Aligned_cols=52 Identities=29% Similarity=0.533 Sum_probs=45.3
Q ss_pred HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
|+-+....+..+...+|+-+..+..+++|..|+.|+++-|.+|.+.+..++.
T Consensus 151 ~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~ 202 (211)
T COG5648 151 FIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKE 202 (211)
T ss_pred hhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHH
Confidence 4556667777777888888999999999999999999999999999998764
No 22
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=96.95 E-value=0.0029 Score=33.99 Aligned_cols=39 Identities=21% Similarity=0.498 Sum_probs=36.2
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHH
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~ 40 (56)
||.-|....-+.++.++|++-.+.+-.+|...|...|+.
T Consensus 79 Ay~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPeN 117 (122)
T PF06244_consen 79 AYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPEN 117 (122)
T ss_pred HHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCCC
Confidence 689999999999999999999999999999999887753
No 23
>KOG3223|consensus
Probab=95.58 E-value=0.035 Score=32.13 Aligned_cols=45 Identities=13% Similarity=0.414 Sum_probs=39.3
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHH
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDY 49 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a 49 (56)
||.-|-...-+.++.+||++..+..-.+|-..|...|+. ||.+.+
T Consensus 171 A~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN---P~Nq~~ 215 (221)
T KOG3223|consen 171 AFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN---PFNQAA 215 (221)
T ss_pred HHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC---hhhHHh
Confidence 688899999999999999999999999999999988875 665543
No 24
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=95.43 E-value=0.044 Score=31.67 Aligned_cols=33 Identities=15% Similarity=0.432 Sum_probs=29.3
Q ss_pred CCCCHHHHHH-HHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 19 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 19 p~~~~~ei~k-~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
|...+..+++ .+|.-|+.+|+++|..|.+....
T Consensus 66 p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~ 99 (211)
T PRK15117 66 PYVQVKYAGALVLGRYYKDATPAQREAYFAAFRE 99 (211)
T ss_pred ccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHH
Confidence 7788999986 68999999999999999988765
No 25
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=95.14 E-value=0.049 Score=31.16 Aligned_cols=33 Identities=15% Similarity=0.533 Sum_probs=28.7
Q ss_pred CCCCHHHHHH-HHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 19 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 19 p~~~~~ei~k-~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
|...+..+++ .+|..|+.+|+++|+.|.+....
T Consensus 62 ~~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~ 95 (198)
T TIGR03481 62 EAFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRE 95 (198)
T ss_pred HhCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHH
Confidence 5678888887 68999999999999999988765
No 26
>PF05494 Tol_Tol_Ttg2: Toluene tolerance, Ttg2 ; InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=94.77 E-value=0.035 Score=30.66 Aligned_cols=33 Identities=21% Similarity=0.545 Sum_probs=26.9
Q ss_pred CCCCHHHHHHH-HHHHhcCCCHHHHHHHHHHHHH
Q psy5907 19 LDLKLWEIGKI-IGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 19 p~~~~~ei~k~-l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
|...+..+++. ||..|+.+|++++..|.+....
T Consensus 36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~ 69 (170)
T PF05494_consen 36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQ 69 (170)
T ss_dssp GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHH
T ss_pred HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHH
Confidence 77888888865 8889999999999999988765
No 27
>PF06945 DUF1289: Protein of unknown function (DUF1289); InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=92.86 E-value=0.46 Score=21.65 Aligned_cols=29 Identities=17% Similarity=0.463 Sum_probs=22.6
Q ss_pred CCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhh
Q psy5907 20 DLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEK 53 (56)
Q Consensus 20 ~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k 53 (56)
..+..||.. |+.|++++|..........+
T Consensus 21 ~RT~dEI~~-----W~~~s~~er~~i~~~l~~R~ 49 (51)
T PF06945_consen 21 GRTLDEIRD-----WKSMSDDERRAILARLRARR 49 (51)
T ss_pred CCcHHHHHH-----HhhCCHHHHHHHHHHHHHHh
Confidence 457888877 99999999988776665544
No 28
>PF13875 DUF4202: Domain of unknown function (DUF4202)
Probab=92.45 E-value=0.63 Score=26.74 Aligned_cols=40 Identities=15% Similarity=0.341 Sum_probs=33.4
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHH
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTE 44 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~ 44 (56)
+.++|...+...+...| ....+..+|...|+.||+.-++-
T Consensus 131 acLVFL~~~f~~F~~~~---deeK~v~Il~KTw~KMS~~g~~~ 170 (185)
T PF13875_consen 131 ACLVFLEYYFEDFAAKH---DEEKIVDILRKTWRKMSERGHEA 170 (185)
T ss_pred HHHHhHHHHHHHHHhcC---CHHHHHHHHHHHHHHCCHHHHHH
Confidence 46788888888888766 77889999999999999987753
No 29
>PF09164 VitD-bind_III: Vitamin D binding protein, domain III; InterPro: IPR015247 This domain is predominantly found in Vitamin D binding proteins, and adopts a multihelical structure. It is required for formation of an actin 'clamp', allowing the protein to bind to actin []. ; PDB: 1MA9_A 1KW2_A 1KXP_D 1J7E_A 1J78_A 1LOT_A.
Probab=90.08 E-value=1.3 Score=21.47 Aligned_cols=32 Identities=13% Similarity=0.345 Sum_probs=24.9
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHH
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQM 33 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~ 33 (56)
.|.-|-+...+.++...|+.+..++..++.++
T Consensus 10 tFtEyKKrL~e~l~~k~P~at~~~l~~lve~R 41 (68)
T PF09164_consen 10 TFTEYKKRLAERLRAKLPDATPTELKELVEKR 41 (68)
T ss_dssp -HHHHHHHHHHHHHHH-TTS-HHHHHHHHHHH
T ss_pred cHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH
Confidence 36778888889999999999999999887654
No 30
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=88.05 E-value=0.65 Score=20.27 Aligned_cols=29 Identities=17% Similarity=0.274 Sum_probs=16.4
Q ss_pred HHHHHHHHH-HHhcCCCHHHHHHHHHHHHH
Q psy5907 23 LWEIGKIIG-QMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 23 ~~ei~k~l~-~~W~~l~~~eK~~y~~~a~~ 51 (56)
+.+|+--.+ +.|..|.+.+|..|.+.-.+
T Consensus 3 f~Dvav~fs~eEW~~L~~~Qk~ly~dvm~E 32 (41)
T PF01352_consen 3 FEDVAVYFSQEEWELLDPAQKNLYRDVMLE 32 (41)
T ss_dssp ----TT---HHHHHTS-HHHHHHHHHHHHH
T ss_pred EEEEEEEcChhhcccccceecccchhHHHH
Confidence 444544444 56999999999999876543
No 31
>PF12650 DUF3784: Domain of unknown function (DUF3784); InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=86.99 E-value=0.55 Score=23.76 Aligned_cols=16 Identities=13% Similarity=0.430 Sum_probs=13.8
Q ss_pred HhcCCCHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVED 48 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~ 48 (56)
-+++||++||+.|++.
T Consensus 25 Gyntms~eEk~~~D~~ 40 (97)
T PF12650_consen 25 GYNTMSKEEKEKYDKK 40 (97)
T ss_pred hcccCCHHHHHHhhHH
Confidence 4899999999999854
No 32
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=86.14 E-value=3.4 Score=21.54 Aligned_cols=40 Identities=15% Similarity=0.288 Sum_probs=25.4
Q ss_pred HHHHHhCCCCCHHHHHHHHH--HHhcCCCHHHHHHHHHHHHH
Q psy5907 12 DQVKAQNLDLKLWEIGKIIG--QMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 12 ~~~~~~~p~~~~~ei~k~l~--~~W~~l~~~eK~~y~~~a~~ 51 (56)
-.+....+.+++.+-.++.. ..|..||++++..-...+..
T Consensus 32 l~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~~~ 73 (107)
T PF11304_consen 32 LQIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENYQR 73 (107)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 34555567777776665544 46788888877766555543
No 33
>PRK10236 hypothetical protein; Provisional
Probab=84.93 E-value=1.5 Score=26.11 Aligned_cols=31 Identities=19% Similarity=0.610 Sum_probs=23.6
Q ss_pred CCCCHHHH-----HHHHHHHhcCCCHHHHHHHHHHH
Q psy5907 19 LDLKLWEI-----GKIIGQMWRDLPEDQKTEYVEDY 49 (56)
Q Consensus 19 p~~~~~ei-----~k~l~~~W~~l~~~eK~~y~~~a 49 (56)
+..+..+| .+.+++.|+.||++|++.+.+.-
T Consensus 106 ~~~st~~iE~~il~kll~~a~~kms~eE~~~L~~~l 141 (237)
T PRK10236 106 KEMSTFEIEQQLLEQFLRNTWKKMDEEHKQEFLHAV 141 (237)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH
Confidence 34555554 47789999999999998877543
No 34
>PF04769 MAT_Alpha1: Mating-type protein MAT alpha 1; InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=83.50 E-value=3.6 Score=23.88 Aligned_cols=42 Identities=17% Similarity=0.212 Sum_probs=29.4
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHH
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDY 49 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a 49 (56)
+||.|+.=.- ...|+....++|..|+..|..=+- |..|.-.+
T Consensus 50 ~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~dp~--k~~W~l~a 91 (201)
T PF04769_consen 50 GFMAFRSYYS----PIFPPLPQKELSGILTKLWEKDPF--KNKWSLMA 91 (201)
T ss_pred HHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCCcc--HhHHHHHh
Confidence 5777766444 446899999999999999986433 34444333
No 35
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=83.06 E-value=4 Score=21.26 Aligned_cols=17 Identities=18% Similarity=0.517 Sum_probs=8.3
Q ss_pred HHHHhcCCCHHHHHHHH
Q psy5907 30 IGQMWRDLPEDQKTEYV 46 (56)
Q Consensus 30 l~~~W~~l~~~eK~~y~ 46 (56)
++..|..||++++..+.
T Consensus 34 ~a~r~~~mspeqq~r~~ 50 (107)
T PF11304_consen 34 IAERWPSMSPEQQQRLR 50 (107)
T ss_pred HHHHHhcCCHHHHHHHH
Confidence 34455555555554443
No 36
>PF13945 NST1: Salt tolerance down-regulator
Probab=82.71 E-value=3.3 Score=23.94 Aligned_cols=30 Identities=17% Similarity=0.340 Sum_probs=23.6
Q ss_pred CHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 22 KLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 22 ~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
+..+-...|-+.|-+|+++||...+..-++
T Consensus 99 Ss~eEre~LkeFW~SL~eeERr~LVkIEKe 128 (190)
T PF13945_consen 99 SSQEEREKLKEFWESLSEEERRSLVKIEKE 128 (190)
T ss_pred hhHHHHHHHHHHHHccCHHHHHHHHHhhHH
Confidence 355667789999999999999987755443
No 37
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=81.84 E-value=4.3 Score=22.81 Aligned_cols=27 Identities=26% Similarity=0.385 Sum_probs=22.3
Q ss_pred HHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 25 EIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 25 ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
+..+...+++..|++++|..|.+..++
T Consensus 118 ~~~~~~~qmy~lLTPEQra~l~~~~e~ 144 (162)
T PRK12751 118 EMAKVRNQMYNLLTPEQKEALNKKHQE 144 (162)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 445666788899999999999988775
No 38
>PF12881 NUT_N: NUT protein N terminus; InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=81.47 E-value=9.8 Score=23.77 Aligned_cols=49 Identities=14% Similarity=0.030 Sum_probs=39.9
Q ss_pred HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
+--|..-.-..+....|.++..|-....-..|.-.|.-+|..|.++|++
T Consensus 232 lSCFLIpvLrsLar~kPtMtlEeGl~ra~qEW~~~SnfdRmifyemaek 280 (328)
T PF12881_consen 232 LSCFLIPVLRSLARLKPTMTLEEGLWRAVQEWQHTSNFDRMIFYEMAEK 280 (328)
T ss_pred hhhhHHHHHHHHHhcCCCccHHHHHHHHHHHhhccccccHHHHHHHHHH
Confidence 4445555555566677999999998888899999999999999999876
No 39
>KOG1610|consensus
Probab=79.65 E-value=7.4 Score=24.30 Aligned_cols=30 Identities=33% Similarity=0.680 Sum_probs=26.5
Q ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHHHh
Q psy5907 23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQE 52 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~ 52 (56)
...+.+.+.+.|..+|++.|+.|-+.+-.+
T Consensus 218 ~~~~~~~~~~~w~~l~~e~k~~YGedy~~~ 247 (322)
T KOG1610|consen 218 PEKLEKRMKEIWERLPQETKDEYGEDYFED 247 (322)
T ss_pred hHHHHHHHHHHHhcCCHHHHHHHHHHHHHH
Confidence 578889999999999999999999887664
No 40
>PF13412 HTH_24: Winged helix-turn-helix DNA-binding; PDB: 1I1G_B 2IA0_B 3I4P_A 2GQQ_A 2L4A_A 2CFX_B 2DBB_B 2EFO_A 2EFQ_A 2PN6_A ....
Probab=77.67 E-value=3.5 Score=17.85 Aligned_cols=22 Identities=14% Similarity=0.128 Sum_probs=15.3
Q ss_pred HHhCCCCCHHHHHHHHHHHhcC
Q psy5907 15 KAQNLDLKLWEIGKIIGQMWRD 36 (56)
Q Consensus 15 ~~~~p~~~~~ei~k~l~~~W~~ 36 (56)
..++|+++..++++.+|=.+..
T Consensus 12 l~~~~~~t~~ela~~~~is~~t 33 (48)
T PF13412_consen 12 LRENPRITQKELAEKLGISRST 33 (48)
T ss_dssp HHHCTTS-HHHHHHHHTS-HHH
T ss_pred HHHcCCCCHHHHHHHhCCCHHH
Confidence 3568999999999988754443
No 41
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=74.78 E-value=9.3 Score=21.69 Aligned_cols=29 Identities=31% Similarity=0.367 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
..+..++-.++.+-|+|++|..|.+..++
T Consensus 110 ~Vem~k~~nqmy~lLTPEQKaq~~~~~~~ 138 (166)
T PRK10363 110 QVEMAKVRNQMYRLLTPEQQAVLNEKHQQ 138 (166)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 34566777789999999999999988765
No 42
>PF15581 Imm35: Immunity protein 35
Probab=73.29 E-value=11 Score=19.37 Aligned_cols=28 Identities=29% Similarity=0.527 Sum_probs=21.2
Q ss_pred CCHHHHHHHHHHHhcCCCHHHHHHHHHH
Q psy5907 21 LKLWEIGKIIGQMWRDLPEDQKTEYVED 48 (56)
Q Consensus 21 ~~~~ei~k~l~~~W~~l~~~eK~~y~~~ 48 (56)
.+...+...|.+.|+.||+++=..-.+.
T Consensus 30 ~~i~~l~~lIe~eWRGl~~~qV~~kl~a 57 (93)
T PF15581_consen 30 RTIRNLESLIEHEWRGLPEEQVLYKLEA 57 (93)
T ss_pred HHHHHHHHHHHHHHcCCCHHHHHHHHHH
Confidence 4567788899999999999875544433
No 43
>PF12290 DUF3802: Protein of unknown function (DUF3802); InterPro: IPR020979 This family of proteins is found in bacteria and are typically between 114 and 143 amino acids in length. There is a conserved KNLFD sequence motif. The annotation with this family suggests that it may be the B subunit of bacterial type IIA DNA topoisomerase but there is no evidence to support this annotation.
Probab=73.01 E-value=12 Score=19.93 Aligned_cols=28 Identities=7% Similarity=0.657 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHH
Q psy5907 23 LWEIGKIIGQMWRDLPEDQKTEYVEDYE 50 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~ 50 (56)
..++...|+.-|...+-.+..-|.+++-
T Consensus 73 v~DLeEVLa~V~~~~aT~eQ~~Fi~Ef~ 100 (113)
T PF12290_consen 73 VYDLEEVLASVWNQKATNEQIAFIEEFI 100 (113)
T ss_pred HHHHHHHHHHHHcCCCCHHHHHHHHHHH
Confidence 3567788999999988888888877653
No 44
>cd08317 Death_ank Death domain associated with Ankyrins. Death Domain (DD) associated with Ankyrins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. Ankyrins function as adaptor proteins and they interact, through ANK repeats, with structurally diverse membrane proteins, including ion channels/pumps, calcium release channels, and cell adhesion molecules. They play critical roles in the proper expression and membrane localization of these proteins. In mammals, this family includes ankyrin-R for restricted (or ANK1), ankyrin-B for broadly expressed (or ANK2) and ankyrin-G for general or giant (or ANK3). They are expressed in different combinations in many tissues and play non-overlapping functions. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-associati
Probab=72.88 E-value=1.9 Score=21.28 Aligned_cols=22 Identities=27% Similarity=0.732 Sum_probs=18.1
Q ss_pred CCCCHHHHHHHHHHHhcCCCHH
Q psy5907 19 LDLKLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~l~~~ 40 (56)
|+.....|++.||.-|+.|..+
T Consensus 3 ~~~~l~~ia~~lG~dW~~LAr~ 24 (84)
T cd08317 3 ADIRLADISNLLGSDWPQLARE 24 (84)
T ss_pred ccchHHHHHHHHhhHHHHHHHH
Confidence 5778899999999999877543
No 45
>PF05388 Carbpep_Y_N: Carboxypeptidase Y pro-peptide; InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=72.35 E-value=12 Score=19.87 Aligned_cols=28 Identities=21% Similarity=0.233 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHH
Q psy5907 23 LWEIGKIIGQMWRDLPEDQKTEYVEDYE 50 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~ 50 (56)
...+++.+++..+.|+.+-|..|.+...
T Consensus 45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~ 72 (113)
T PF05388_consen 45 LEKISKYLNEPLKSLTSEAKALWDEMML 72 (113)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHH
Confidence 5566778999999999999999998765
No 46
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=70.98 E-value=5 Score=23.45 Aligned_cols=33 Identities=9% Similarity=0.354 Sum_probs=25.0
Q ss_pred CCCCHHHHHH-HHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 19 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 19 p~~~~~ei~k-~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
|-..+.=+++ .||.-|+.+|+++++.|.+...+
T Consensus 68 p~vd~~~~a~~vLGk~~k~aspeQ~~~F~~aF~~ 101 (202)
T COG2854 68 PYVDFKYAAKLVLGKYYKTASPEQRQAFFKAFRT 101 (202)
T ss_pred hhhcHHHHHHHHhccccccCCHHHHHHHHHHHHH
Confidence 4445444443 48999999999999999987765
No 47
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=67.56 E-value=18 Score=21.14 Aligned_cols=33 Identities=18% Similarity=0.120 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHHHhhhc
Q psy5907 23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKIY 55 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~k~~ 55 (56)
..+..+.|.+...+|++++++...+..++.+.+
T Consensus 12 ~~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~ 44 (248)
T PF08367_consen 12 EEEEKEKLAAYKASLSEEEKEKIIEQTKELKER 44 (248)
T ss_dssp HHHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHH
Confidence 356778899999999999999999888876653
No 48
>COG3313 Predicted Fe-S protein [General function prediction only]
Probab=66.38 E-value=8.6 Score=19.04 Aligned_cols=22 Identities=18% Similarity=0.362 Sum_probs=16.9
Q ss_pred CCCHHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907 20 DLKLWEIGKIIGQMWRDLPEDQKTEYV 46 (56)
Q Consensus 20 ~~~~~ei~k~l~~~W~~l~~~eK~~y~ 46 (56)
..+..||.. |..||++||.---
T Consensus 26 ~Rt~~Ei~~-----W~~msd~Er~aVl 47 (74)
T COG3313 26 GRTRDEIFN-----WSSMSDDERRAVL 47 (74)
T ss_pred cccHHHHHH-----HhhCCHHHHHHHH
Confidence 346778877 9999999987544
No 49
>PF06628 Catalase-rel: Catalase-related immune-responsive; InterPro: IPR010582 Catalases (1.11.1.6 from EC) are antioxidant enzymes that catalyse the conversion of hydrogen peroxide to water and molecular oxygen, serving to protect cells from its toxic effects []. Hydrogen peroxide is produced as a consequence of oxidative cellular metabolism and can be converted to the highly reactive hydroxyl radical via transition metals, this radical being able to damage a wide variety of molecules within a cell, leading to oxidative stress and cell death. Catalases act to neutralise hydrogen peroxide toxicity, and are produced by all aerobic organisms ranging from bacteria to man. Most catalases are mono-functional, haem-containing enzymes, although there are also bifunctional haem-containing peroxidase/catalases (IPR000763 from INTERPRO) that are closely related to plant peroxidases, and non-haem, manganese-containing catalases (IPR007760 from INTERPRO) that are found in bacteria []. This entry represents a small conserved region within catalase enzymes that carries the immune-responsive amphipathic octa-peptide that is recognised by T cells [].; PDB: 2CAH_A 1NM0_A 1H7K_A 1E93_A 1H6N_A 3HB6_A 2CAG_A 1M85_A 1MQF_A 1A4E_C ....
Probab=66.04 E-value=6.4 Score=18.71 Aligned_cols=19 Identities=21% Similarity=0.553 Sum_probs=15.5
Q ss_pred HHHHhcCCCHHHHHHHHHH
Q psy5907 30 IGQMWRDLPEDQKTEYVED 48 (56)
Q Consensus 30 l~~~W~~l~~~eK~~y~~~ 48 (56)
-+..|+.|++++|..+...
T Consensus 13 a~~ly~~l~~~er~~lv~n 31 (68)
T PF06628_consen 13 ARDLYRVLSDEERERLVEN 31 (68)
T ss_dssp HHHHHHHSSHHHHHHHHHH
T ss_pred HHHHHHHCCHHHHHHHHHH
Confidence 4567999999999988754
No 50
>PRK10455 periplasmic protein; Reviewed
Probab=64.65 E-value=23 Score=19.81 Aligned_cols=27 Identities=15% Similarity=0.349 Sum_probs=21.3
Q ss_pred HHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 25 EIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 25 ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
+..+.-...+..|++++|..|.+..++
T Consensus 118 ~~~~~~~qiy~vLTPEQr~q~~~~~ek 144 (161)
T PRK10455 118 AHMETQNKIYNVLTPEQKKQFNANFEK 144 (161)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 344555677899999999999988765
No 51
>PF07813 LTXXQ: LTXXQ motif family protein; InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT). This is a periplasmic protein that aids in combating extracytoplasmic protein-mediated toxicity, and may also be involved in the response to alkaline pH []. Another member of this family, Spy (P77754 from SWISSPROT) is also a periplasmic protein that may be involved in the response to stress []. The homology between CpxP and Spy may indicate that these two proteins are functionally related []. The motif is found repeated twice in many members of this entry. ; GO: 0042597 periplasmic space; PDB: 3ITF_B 3QZC_B 3OEO_D 3O39_A.
Probab=63.52 E-value=16 Score=17.76 Aligned_cols=25 Identities=20% Similarity=0.174 Sum_probs=18.6
Q ss_pred HHHHHHHHHHhcCCCHHHHHHHHHH
Q psy5907 24 WEIGKIIGQMWRDLPEDQKTEYVED 48 (56)
Q Consensus 24 ~ei~k~l~~~W~~l~~~eK~~y~~~ 48 (56)
..+.......+..|++++|..|..+
T Consensus 75 ~~~~~~~~~~~~vLt~eQk~~~~~l 99 (100)
T PF07813_consen 75 EERAKAQHALYAVLTPEQKEKFDQL 99 (100)
T ss_dssp HHHHHHHHHHHTTS-HHHHHHHHHH
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHh
Confidence 4455667788999999999988754
No 52
>PRK00523 hypothetical protein; Provisional
Probab=62.56 E-value=18 Score=17.83 Aligned_cols=36 Identities=25% Similarity=0.196 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHH
Q psy5907 6 YSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQ 41 (56)
Q Consensus 6 f~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~e 41 (56)
+.+..-.+.-.+||.++..-|..+...+-+..|+.+
T Consensus 25 iark~~~k~l~~NPpine~mir~M~~QMGqKPSekk 60 (72)
T PRK00523 25 VSKKMFKKQIRENPPITENMIRAMYMQMGRKPSESQ 60 (72)
T ss_pred HHHHHHHHHHHHCcCCCHHHHHHHHHHhCCCccHHH
Confidence 334455566678999999999999999998887653
No 53
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=61.42 E-value=18 Score=17.42 Aligned_cols=34 Identities=26% Similarity=0.208 Sum_probs=26.5
Q ss_pred HHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHH
Q psy5907 8 RKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQ 41 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~e 41 (56)
.....+.-.+||.++..-|..+...+-+..|+.+
T Consensus 19 r~~~~k~l~~NPpine~mir~M~~QMG~kpSekq 52 (64)
T PF03672_consen 19 RKYMEKQLKENPPINEKMIRAMMMQMGRKPSEKQ 52 (64)
T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHhCCCccHHH
Confidence 3334445578999999999999999999888764
No 54
>PRK14291 chaperone protein DnaJ; Provisional
Probab=61.08 E-value=33 Score=21.56 Aligned_cols=37 Identities=16% Similarity=0.151 Sum_probs=27.5
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKTE 44 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~~ 44 (56)
+.+|...+.-||+.+ ..+.-+.|.+.|..|++..|..
T Consensus 22 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vLsd~~kR~ 62 (382)
T PRK14291 22 KAYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRK 62 (382)
T ss_pred HHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhcCHHHHH
Confidence 456677777889875 3466678999999998877553
No 55
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=60.46 E-value=29 Score=19.56 Aligned_cols=24 Identities=25% Similarity=0.231 Sum_probs=19.4
Q ss_pred HHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 28 KIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 28 k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
+...+....|++++|..|.+..++
T Consensus 128 ~~~~~~~~vLTpEQRak~~e~~~~ 151 (170)
T PRK12750 128 EKRHQMLSILTPEQKAKFQELQQE 151 (170)
T ss_pred HHHHHHHHhCCHHHHHHHHHHHHH
Confidence 345567999999999999987655
No 56
>PRK10266 curved DNA-binding protein CbpA; Provisional
Probab=59.41 E-value=38 Score=20.60 Aligned_cols=40 Identities=18% Similarity=0.251 Sum_probs=28.8
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~~ 47 (56)
+.+|...+.-||+.+ ..+.-+.+.+.|..|+...|. .|..
T Consensus 23 ~ayr~la~k~HPD~~~~~~~~~~f~~i~~Ay~~L~~~~kr~~yD~ 67 (306)
T PRK10266 23 TAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQ 67 (306)
T ss_pred HHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 456666777789865 667778999999999865544 5543
No 57
>PRK14299 chaperone protein DnaJ; Provisional
Probab=58.59 E-value=39 Score=20.43 Aligned_cols=39 Identities=18% Similarity=0.136 Sum_probs=27.6
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+..|...+.-||+.+ ..+.-+.|.+.|..|++.+|. .|.
T Consensus 23 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~kr~~yD 66 (291)
T PRK14299 23 KAFKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYD 66 (291)
T ss_pred HHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhcCHHHHHHHH
Confidence 455666677788865 456667899999999876654 444
No 58
>PRK14281 chaperone protein DnaJ; Provisional
Probab=58.52 E-value=28 Score=22.05 Aligned_cols=40 Identities=18% Similarity=0.104 Sum_probs=29.1
Q ss_pred HHHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 7 SRKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 7 ~~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
.+.+|...+.-||+.+. .+.-+.|.+.|..|++.+|. .|.
T Consensus 21 kkayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD 66 (397)
T PRK14281 21 KKAYRKLALKYHPDKNPDNKEAEEHFKEVNEAYEVLSNDDKRRRYD 66 (397)
T ss_pred HHHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhhhhhhhhhhh
Confidence 35567777788898753 46778999999999977654 444
No 59
>PHA03102 Small T antigen; Reviewed
Probab=58.18 E-value=26 Score=19.51 Aligned_cols=37 Identities=14% Similarity=0.084 Sum_probs=27.4
Q ss_pred HHHHHHHHHHhCCCCC-HHHHHHHHHHHhcCCCHHHHH
Q psy5907 7 SRKVWDQVKAQNLDLK-LWEIGKIIGQMWRDLPEDQKT 43 (56)
Q Consensus 7 ~~~~~~~~~~~~p~~~-~~ei~k~l~~~W~~l~~~eK~ 43 (56)
.+..|...+.-||+.. ..+..+.|++.|..|++..+.
T Consensus 25 KkAYr~la~~~HPDkgg~~e~~k~in~Ay~~L~d~~~r 62 (153)
T PHA03102 25 RKAYLRKCLEFHPDKGGDEEKMKELNTLYKKFRESVKS 62 (153)
T ss_pred HHHHHHHHHHHCcCCCchhHHHHHHHHHHHHHhhHHHh
Confidence 3556777788899875 456678889999888876654
No 60
>PF02026 RyR: RyR domain; InterPro: IPR003032 This domain is called RyR for Ryanodine receptor []. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.; PDB: 4ETV_A 3RQR_A 4ETT_A 4ERT_A 4ESU_A 4ETU_A 4ERV_A 3NRT_E.
Probab=57.96 E-value=14 Score=18.82 Aligned_cols=19 Identities=16% Similarity=0.295 Sum_probs=14.6
Q ss_pred HhcCCCHHHHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~a~~ 51 (56)
-|..||+++|..+...+.+
T Consensus 60 py~~L~e~eK~~dr~~~~e 78 (94)
T PF02026_consen 60 PYDELSEEEKEKDRDMVRE 78 (94)
T ss_dssp -GGGS-HHHHHHHHHHHHH
T ss_pred ChhhCCHHHHHHhHHHHHH
Confidence 4899999999998877765
No 61
>PRK14279 chaperone protein DnaJ; Provisional
Probab=57.69 E-value=27 Score=22.06 Aligned_cols=40 Identities=20% Similarity=0.140 Sum_probs=28.9
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~~ 47 (56)
+.+|...+.-||+.+. .+.-+.|.+.+..|++.+|. .|..
T Consensus 28 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vLsD~~KR~~YD~ 73 (392)
T PRK14279 28 KAYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYDE 73 (392)
T ss_pred HHHHHHHHHHCcCCCCCChHHHHHHHHHHHHHHHhcchhhhhHHHH
Confidence 4567777788898753 46667899999999987765 4443
No 62
>PF11521 TFIIE-A_C-term: C-terminal general transcription factor TFIIE alpha; InterPro: IPR021600 TFIIE is compiled of two subunits, alpha and beta. This family of proteins are the C-terminal domain of the alpha subunit of the protein which is the largest subunit and contains several functional domains which are important for basal transcription and cell growth. The C-terminal end of the protein binds directly to the amino-terminal PH domain of p62/Tfb1 (of IIH) which is involved in the recruitment of the general transcription factor IIH to the transcription preinitiation complex. P53 competes for the same binding site as TFIIE alpha which shows their structural similarity. Like p53, TFIIE alpha 336-439 can activate transcription in vivo []. ; PDB: 2RNR_A 2RNQ_A 2JTX_A.
Probab=57.47 E-value=14 Score=18.68 Aligned_cols=30 Identities=10% Similarity=0.206 Sum_probs=19.0
Q ss_pred CHHHHHHHHHHHhcCCCHHHHHHHHHHHHHh
Q psy5907 22 KLWEIGKIIGQMWRDLPEDQKTEYVEDYEQE 52 (56)
Q Consensus 22 ~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~~ 52 (56)
...+|+. --+.=..|+++||.-|++..++.
T Consensus 51 ~~~eVtq-~p~LV~qMT~~EKEaYi~v~Q~~ 80 (86)
T PF11521_consen 51 PYSEVTQ-RPELVAQMTPEEKEAYIQVGQEM 80 (86)
T ss_dssp EHHHHHH--HHHHHHS-HHHHHHHHHHHHHH
T ss_pred ehhhcCc-chHHHHHcCHHHHHHHHHHHHHH
Confidence 3566660 01222479999999999988764
No 63
>PF02376 CUT: CUT domain; InterPro: IPR003350 A class, also called ONECUT, of homeodomain proteins. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain (IPR001356 from INTERPRO), often found downstream of the CUT domain. Proteins display two modes of DNA binding, which hinge on the homeodomain and on the linker that separates it from the cut domain, and two modes of transcriptional stimulation, which hinge on the homeodomain [].; GO: 0003677 DNA binding; PDB: 1WH6_A 2D5V_A 2CSF_A 1X2L_A 2O49_A 2O4A_A 1YSE_A 1S7E_A 1WIZ_A 1WH8_A.
Probab=57.36 E-value=14 Score=18.55 Aligned_cols=32 Identities=16% Similarity=0.442 Sum_probs=25.3
Q ss_pred CCCCHHHHHHHHH--HHhcCCCHHHHHHHHHHHH
Q psy5907 19 LDLKLWEIGKIIG--QMWRDLPEDQKTEYVEDYE 50 (56)
Q Consensus 19 p~~~~~ei~k~l~--~~W~~l~~~eK~~y~~~a~ 50 (56)
-+.+.+-++.+|. ..|..|.+..|..|..++.
T Consensus 37 L~rsqgtlsdlL~~PKpW~~L~~~gre~y~RM~n 70 (87)
T PF02376_consen 37 LNRSQGTLSDLLRKPKPWNKLKPKGREPYIRMYN 70 (87)
T ss_dssp TSS-HHHHHHHHHSCCTTCHHHHHHHHHHHHHHH
T ss_pred hccChhHHHHHhCCCCCHHHcCccccHHHHHHHH
Confidence 4677888888888 6799999999999988763
No 64
>PRK14283 chaperone protein DnaJ; Provisional
Probab=57.31 E-value=46 Score=20.90 Aligned_cols=40 Identities=20% Similarity=0.201 Sum_probs=29.2
Q ss_pred HHHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 7 SRKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 7 ~~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
.+.++...++-||+.+ ..+.-+.|.+.|..|++.+|. .|.
T Consensus 23 k~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~~Lsd~~kR~~YD 67 (378)
T PRK14283 23 KKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYD 67 (378)
T ss_pred HHHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhchhHHHHHHh
Confidence 3456667777889875 456778899999999888754 443
No 65
>PRK14280 chaperone protein DnaJ; Provisional
Probab=55.67 E-value=49 Score=20.77 Aligned_cols=40 Identities=25% Similarity=0.250 Sum_probs=28.3
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHH-HHHHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQK-TEYVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK-~~y~~ 47 (56)
+.+|...+.-||+.+ ..+.-+.|.+.|..|++.+| ..|..
T Consensus 23 ~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD~ 67 (376)
T PRK14280 23 KAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDQKRAQYDQ 67 (376)
T ss_pred HHHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhccHhHHHHHHh
Confidence 455666667788764 55777889999999986664 45543
No 66
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=55.60 E-value=11 Score=16.76 Aligned_cols=33 Identities=15% Similarity=0.006 Sum_probs=20.5
Q ss_pred HHHHHHHHHhCCCCCH------HHHHHHHHHHhcCCCHH
Q psy5907 8 RKVWDQVKAQNLDLKL------WEIGKIIGQMWRDLPED 40 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~------~ei~k~l~~~W~~l~~~ 40 (56)
+..+..++.-||+... .+....|.+.|..|...
T Consensus 20 ~ay~~l~~~~HPD~~~~~~~~~~~~~~~l~~Ay~~L~~~ 58 (60)
T smart00271 20 KAYRKLALKYHPDKNPGDKEEAEEKFKEINEAYEVLSDP 58 (60)
T ss_pred HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHHHcCC
Confidence 4455566666887654 35666677777666543
No 67
>PRK14296 chaperone protein DnaJ; Provisional
Probab=55.16 E-value=46 Score=20.92 Aligned_cols=39 Identities=18% Similarity=0.103 Sum_probs=27.6
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...++-||+.+ ..+-=+.|++.+..|++.+|. .|.
T Consensus 23 ~ayrkla~~~HPD~n~~~~a~~~F~~i~~AyevLsD~~KR~~YD 66 (372)
T PRK14296 23 QAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYD 66 (372)
T ss_pred HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHhcCHHHhhhhh
Confidence 455666677788865 445557799999999977765 444
No 68
>PRK01844 hypothetical protein; Provisional
Probab=54.32 E-value=26 Score=17.23 Aligned_cols=36 Identities=14% Similarity=0.072 Sum_probs=27.9
Q ss_pred HHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHH
Q psy5907 6 YSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQ 41 (56)
Q Consensus 6 f~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~e 41 (56)
+.+..-.+.-++||.++..-|..+...+-+..|+..
T Consensus 24 ~ark~~~k~lk~NPpine~mir~Mm~QMGqkPSekk 59 (72)
T PRK01844 24 IARKYMMNYLQKNPPINEQMLKMMMMQMGQKPSQKK 59 (72)
T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHhCCCccHHH
Confidence 334455566778999999999999999988877653
No 69
>KOG1827|consensus
Probab=53.39 E-value=0.35 Score=32.34 Aligned_cols=41 Identities=27% Similarity=0.414 Sum_probs=35.7
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHH
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQK 42 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK 42 (56)
+|++|+.+.+..+-.++|+..+++++.++|..|..++..-|
T Consensus 555 ~~~~~s~~~~~~~~~~np~v~~~~~~~~vg~~~~~lp~~~k 595 (629)
T KOG1827|consen 555 PYILDSIENRTIIWFENPTVGFGEVSIIVGNDWDKLPNINK 595 (629)
T ss_pred cccccccccCceeeeeCCCcccceeEEeecCCcccCccccc
Confidence 57778888888888899999999999999999999995544
No 70
>PF02650 HTH_WhiA: WhiA C-terminal HTH domain; InterPro: IPR023054 This domain is found at the C terminus of the sporulation regulator WhiA. It is predicted to form a DNA binding helix-turn-helix structure []. ; PDB: 3HYI_A.
Probab=52.88 E-value=12 Score=18.88 Aligned_cols=20 Identities=25% Similarity=0.519 Sum_probs=13.8
Q ss_pred HHHHhCCCCCHHHHHHHHHH
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQ 32 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~ 32 (56)
.++.+||+++-.|++..+..
T Consensus 47 ~lRl~~Pd~SL~EL~~~~~~ 66 (85)
T PF02650_consen 47 ELRLENPDASLKELGELLEP 66 (85)
T ss_dssp HHHHH-TTS-HHHHHHTT--
T ss_pred HHHHHCccccHHHHHHHHcC
Confidence 57788999999999887654
No 71
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=51.67 E-value=24 Score=20.55 Aligned_cols=21 Identities=24% Similarity=0.529 Sum_probs=17.1
Q ss_pred HHHhcCCCHHHHHHHHHHHHH
Q psy5907 31 GQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 31 ~~~W~~l~~~eK~~y~~~a~~ 51 (56)
.+.|..||++.|.-..+.+.+
T Consensus 213 ~~~~~~L~~e~q~~i~~a~~~ 233 (257)
T TIGR00787 213 KAFWKSLPPDLQAVVKEAAKE 233 (257)
T ss_pred HHHHhcCCHHHHHHHHHHHHH
Confidence 467999999999988776655
No 72
>COG1638 DctP TRAP-type C4-dicarboxylate transport system, periplasmic component [Carbohydrate transport and metabolism]
Probab=50.14 E-value=27 Score=21.66 Aligned_cols=24 Identities=25% Similarity=0.522 Sum_probs=19.6
Q ss_pred HHHhcCCCHHHHHHHHHHHHHhhh
Q psy5907 31 GQMWRDLPEDQKTEYVEDYEQEKI 54 (56)
Q Consensus 31 ~~~W~~l~~~eK~~y~~~a~~~k~ 54 (56)
...|..||++.|.-..+.+.+.-.
T Consensus 244 ~~~w~~L~~e~q~il~~aa~e~~~ 267 (332)
T COG1638 244 KAFWDSLPEEDQTILLEAAKEAAE 267 (332)
T ss_pred HHHHhcCCHHHHHHHHHHHHHHHH
Confidence 467999999999999888877543
No 73
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=49.87 E-value=60 Score=20.91 Aligned_cols=40 Identities=18% Similarity=0.047 Sum_probs=29.0
Q ss_pred HHHHHHHHHHhCCCCCH-HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 7 SRKVWDQVKAQNLDLKL-WEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 7 ~~~~~~~~~~~~p~~~~-~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
.+.+|...++-||+.+. .+.-+.|++.+..|++.+|. .|.
T Consensus 46 KkAYrkla~k~HPDk~~~~e~F~~i~~AYevLsD~~kR~~YD 87 (421)
T PTZ00037 46 KKAYRKLAIKHHPDKGGDPEKFKEISRAYEVLSDPEKRKIYD 87 (421)
T ss_pred HHHHHHHHHHHCCCCCchHHHHHHHHHHHHHhccHHHHHHHh
Confidence 35667777788998763 46667899999999877655 444
No 74
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.81 E-value=34 Score=16.81 Aligned_cols=34 Identities=21% Similarity=0.149 Sum_probs=26.5
Q ss_pred HHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHH
Q psy5907 7 SRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 7 ~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~ 40 (56)
++..-.+.-.+||.++..-|.-+.+.+-..-|+.
T Consensus 25 ark~~~k~lk~NPpine~~iR~M~~qmGqKpSe~ 58 (71)
T COG3763 25 ARKQMKKQLKDNPPINEEMIRMMMAQMGQKPSEK 58 (71)
T ss_pred HHHHHHHHHhhCCCCCHHHHHHHHHHhCCCchHH
Confidence 3444455667899999999999999998887765
No 75
>PRK14284 chaperone protein DnaJ; Provisional
Probab=48.56 E-value=53 Score=20.74 Aligned_cols=39 Identities=15% Similarity=0.073 Sum_probs=27.9
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+. .+.-+.|.+.+..|++.+|. .|.
T Consensus 20 kayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD 64 (391)
T PRK14284 20 KAYRKLAVKYHPDKNPGDAEAEKRFKEVSEAYEVLSDAQKRESYD 64 (391)
T ss_pred HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhcCHHHHHHHH
Confidence 4566777777898753 45678899999998866654 444
No 76
>PF11588 DUF3243: Protein of unknown function (DUF3243); InterPro: IPR021637 This family of proteins with unknown function includes uncharacterised proteins ymfJ and yflH. The family appears to be restricted to Firmicutes.; PDB: 3D0W_B.
Probab=48.11 E-value=22 Score=17.90 Aligned_cols=19 Identities=11% Similarity=0.513 Sum_probs=14.2
Q ss_pred HHHHHHHhcCCCHHHHHHH
Q psy5907 27 GKIIGQMWRDLPEDQKTEY 45 (56)
Q Consensus 27 ~k~l~~~W~~l~~~eK~~y 45 (56)
.+.|.+.|..-+++||..-
T Consensus 53 erlLkELW~va~e~Eq~~L 71 (81)
T PF11588_consen 53 ERLLKELWDVADEEEQHAL 71 (81)
T ss_dssp HHHHHHHHHC--HHHHHHH
T ss_pred HHHHHHHHHhCCHHHHHHH
Confidence 3789999999999999764
No 77
>PRK14285 chaperone protein DnaJ; Provisional
Probab=47.44 E-value=53 Score=20.59 Aligned_cols=41 Identities=15% Similarity=0.092 Sum_probs=28.7
Q ss_pred HHHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHH-HHHHHH
Q psy5907 7 SRKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQ-KTEYVE 47 (56)
Q Consensus 7 ~~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~e-K~~y~~ 47 (56)
.+.+|...+.-||+.+ ..+.-+.|.+.|..|++.+ |..|..
T Consensus 21 k~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yd~ 67 (365)
T PRK14285 21 KKAYRKIAIKYHPDKNKGNKEAESIFKEATEAYEVLIDDNKRAQYDR 67 (365)
T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHcCcchhHHHHh
Confidence 3566777778899875 2355678999999998765 445543
No 78
>PLN02150 terpene synthase/cyclase family protein
Probab=47.13 E-value=11 Score=19.16 Aligned_cols=19 Identities=21% Similarity=0.343 Sum_probs=15.2
Q ss_pred CHHHHHHHHHHHhcCCCHH
Q psy5907 22 KLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 22 ~~~ei~k~l~~~W~~l~~~ 40 (56)
...++..++.+.|+.|.++
T Consensus 24 A~~~i~~li~~~WK~iN~e 42 (96)
T PLN02150 24 AVSELKKMIRDNYKIVMEE 42 (96)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4678889999999986654
No 79
>PF01630 Glyco_hydro_56: Hyaluronidase; InterPro: IPR018155 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 56 GH56 from CAZY comprises enzymes with only one known activity; hyaluronidase 3.2.1.35 from EC. The venom of Apis mellifera (Honeybee) contains several biologically-active peptides and two enzymes, one of which is a hyaluronidase []. The amino acid sequence of bee venom hyaluronidase contains 349 amino acids, and includes four cysteines and a number of potential glycosylation sites []. The sequence shows a high degree of similarity to PH-20, a membrane protein of mammalian sperm involved in sperm-egg adhesion, supporting the view that hyaluronidases play a role in fertilisation []. PH-20 is required for sperm adhesion to the egg zona pellucida; it is located on both the sperm plasma membrane and acrosomal membrane []. The amino acid sequence of the mature protein contains 468 amino acids, and includes six potential N-linked glycosylation sites and twelve cysteines, eight of which are tightly clustered near the C terminus [].; GO: 0004415 hyalurononglucosaminidase activity, 0005975 carbohydrate metabolic process; PDB: 1FCQ_A 1FCV_A 1FCU_A 2J88_A 2PE4_A 2ATM_A.
Probab=46.07 E-value=75 Score=20.07 Aligned_cols=32 Identities=19% Similarity=0.162 Sum_probs=24.2
Q ss_pred HhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhc
Q psy5907 4 MRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWR 35 (56)
Q Consensus 4 ~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~ 35 (56)
.+|...-...++.+||+++..++.+.....+.
T Consensus 123 ~iYr~~S~~lv~~~hp~ws~~~v~~~A~~~FE 154 (337)
T PF01630_consen 123 DIYRNESIELVRQQHPDWSEKEVEKEAKKEFE 154 (337)
T ss_dssp HHHHHHHHHHHHHHSTTS-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHH
Confidence 35677778889999999999999886665554
No 80
>PRK14287 chaperone protein DnaJ; Provisional
Probab=45.50 E-value=76 Score=19.95 Aligned_cols=39 Identities=15% Similarity=0.176 Sum_probs=27.1
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+ ..+.-+.|.+.+..|++.+|. .|.
T Consensus 23 ~ayr~la~~~HpD~~~~~~~~~~f~~i~~Ay~~L~d~~kR~~YD 66 (371)
T PRK14287 23 KAYRKLARKYHPDVNKAPDAEDKFKEVKEAYDTLSDPQKKAHYD 66 (371)
T ss_pred HHHHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhCcHhHHHHHH
Confidence 455666667788865 345667899999999877654 444
No 81
>cd08805 Death_ank1 Death domain of Ankyrin-1. Death Domain (DD) of the human protein ankyrin-1 (ANK-1) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-1, also called ankyrin-R (for restricted), is found in brain, muscle, and erythrocytes and is thought to function in linking integral membrane proteins to the underlying cytoskeleton. It plays a critical nonredundant role in erythroid development and is associated with hereditary spherocytosis (HS), a common disorder of the red cell membrane. The small alternatively-spliced variant, sANK-1, found in striated muscle and concentrated in the sarcoplasmic reticulum (SR) binds obscurin and titin, which facilitates the anchoring of the network SR to the contractile apparatus. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common featur
Probab=45.38 E-value=6.2 Score=19.78 Aligned_cols=21 Identities=24% Similarity=0.605 Sum_probs=15.7
Q ss_pred CCCHHHHHHHHHHHhcCCCHH
Q psy5907 20 DLKLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 20 ~~~~~ei~k~l~~~W~~l~~~ 40 (56)
++....|+..||..|..|..+
T Consensus 4 ~~~l~~Ia~~LG~dW~~Lar~ 24 (84)
T cd08805 4 EMKMAVIREHLGLSWAELARE 24 (84)
T ss_pred hhHHHHHHHHhcchHHHHHHH
Confidence 456678888899999876544
No 82
>PRK14276 chaperone protein DnaJ; Provisional
Probab=45.10 E-value=78 Score=19.95 Aligned_cols=39 Identities=21% Similarity=0.198 Sum_probs=27.7
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+ ..+.-+.|.+.|..|++.+|. .|.
T Consensus 23 ~ayr~la~~~HpD~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD 66 (380)
T PRK14276 23 KAYRKLSKKYHPDINKEPGAEEKYKEVQEAYETLSDPQKRAAYD 66 (380)
T ss_pred HHHHHHHHHHCcCCCCCcCHHHHHHHHHHHHHHhcCHhhhhhHh
Confidence 455666667788754 457778899999988877754 444
No 83
>KOG0713|consensus
Probab=45.07 E-value=79 Score=20.05 Aligned_cols=40 Identities=15% Similarity=0.048 Sum_probs=27.7
Q ss_pred HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHHHHHH
Q psy5907 8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKTEYVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~~y~~ 47 (56)
+..|....+-||+.+ ..+.=+.|+..|.-|++.+|..-.+
T Consensus 35 kAYRKLALk~HPDkNpddp~A~e~F~~in~AYEVLsDpekRk~YD 79 (336)
T KOG0713|consen 35 KAYRKLALKYHPDKNPDDPNANEKFKEINAAYEVLSDPEKRKHYD 79 (336)
T ss_pred HHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhcCHHHHHHHH
Confidence 445666667788654 5566678999999998777655443
No 84
>PRK09897 hypothetical protein; Provisional
Probab=44.46 E-value=53 Score=21.78 Aligned_cols=29 Identities=7% Similarity=0.272 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 23 LWEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
...+...+..-|..||.++|+.|......
T Consensus 353 i~~l~~~~~~iw~~l~~~d~~rf~~~l~~ 381 (534)
T PRK09897 353 ILRLHEAVQEIVPHLNEHDHKRFSKGLAR 381 (534)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHhhhH
Confidence 34455677899999999999999876543
No 85
>PRK14294 chaperone protein DnaJ; Provisional
Probab=43.73 E-value=80 Score=19.75 Aligned_cols=39 Identities=15% Similarity=0.073 Sum_probs=27.4
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+. .+.-+.+.+.|..|++.++. .|.
T Consensus 23 ~ayr~la~~~HPD~~~~~~~~~~~f~~~~~Ay~vL~d~~~r~~yD 67 (366)
T PRK14294 23 KSYRKLAMKYHPDRNPGDKEAEELFKEAAEAYEVLSDPKKRGIYD 67 (366)
T ss_pred HHHHHHHHHHCCCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHH
Confidence 4566666777888753 46667899999999876654 444
No 86
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=42.98 E-value=33 Score=15.41 Aligned_cols=38 Identities=16% Similarity=0.094 Sum_probs=25.7
Q ss_pred HHHHHHHHHhCCCCC--H----HHHHHHHHHHhcCCCHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK--L----WEIGKIIGQMWRDLPEDQK-TEY 45 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~--~----~ei~k~l~~~W~~l~~~eK-~~y 45 (56)
+..+..++.-||+.. . .+....|.+.|..|+...+ ..|
T Consensus 19 ~~y~~l~~~~HPD~~~~~~~~~~~~~~~i~~Ay~~L~~~~~R~~Y 63 (64)
T PF00226_consen 19 KAYRRLSKQYHPDKNSGDEAEAEEKFARINEAYEILSDPERRRRY 63 (64)
T ss_dssp HHHHHHHHHTSTTTGTSTHHHHHHHHHHHHHHHHHHHSHHHHHHH
T ss_pred HHHHhhhhccccccchhhhhhhhHHHHHHHHHHHHhCCHHHHHhc
Confidence 455666777788762 2 3778889999998765554 444
No 87
>cd01670 Death Death Domain: a protein-protein interaction domain. Death Domains (DDs) are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. Structural analysis of DD-DD complexes show that the domains interact with each other in many different ways. DD-containing proteins serve as adaptors in signaling pathways and they can recruit other proteins into signaling complexes. In mammals, they are prominent components of the programmed cell death (apoptosis) pathway and are found in a number of other signaling pathways. In invertebrates, they are involved in transcriptional regulation of zygotic patterning genes in insect embryogenesis, and are components of the ToII/NF-kappaB pathway, a conserved innate immune pathway in a
Probab=42.69 E-value=14 Score=17.39 Aligned_cols=17 Identities=24% Similarity=0.607 Sum_probs=12.0
Q ss_pred HHHHHHHHHHhcCCCHH
Q psy5907 24 WEIGKIIGQMWRDLPED 40 (56)
Q Consensus 24 ~ei~k~l~~~W~~l~~~ 40 (56)
..|+..+|..|+.|...
T Consensus 3 ~~ia~~lg~~W~~la~~ 19 (79)
T cd01670 3 DKLAKKLGKDWKKLARK 19 (79)
T ss_pred HHHHHHHhhHHHHHHHH
Confidence 46778888888766543
No 88
>PF08373 RAP: RAP domain; InterPro: IPR013584 The ~60-residue RAP (an acronym for RNA-binding domain abundant in Apicomplexans) domain is found in various proteins in eukaryotes. It is particularly abundant in apicomplexans and might mediate a range of cellular functions through its potential interactions with RNA []. The RAP domain consists of multiple blocks of charged and aromatics residues and is predicted to be composed of alpha helical and beta strand structures. Two predicted loop regions that are dominated by glycine and tryptophan residues are found before and after the central beta sheet []. Some proteins known to contain a RAP domain are listed below: Human hypothetical protein MGC5297, Mammalian FAST kinase domain-containing proteins (FASTKDs), Chlamydomonas reinhardtii chloroplastic trans-splicing factor Raa3.
Probab=42.64 E-value=22 Score=15.82 Aligned_cols=15 Identities=33% Similarity=0.871 Sum_probs=11.8
Q ss_pred HhcCC-CHHHHHHHHH
Q psy5907 33 MWRDL-PEDQKTEYVE 47 (56)
Q Consensus 33 ~W~~l-~~~eK~~y~~ 47 (56)
.|..| +.++|..|..
T Consensus 41 eW~~l~~~~~k~~YL~ 56 (58)
T PF08373_consen 41 EWNKLKSREEKIEYLK 56 (58)
T ss_pred HHHhcCCHHHHHHHHh
Confidence 48888 8888888764
No 89
>PRK14298 chaperone protein DnaJ; Provisional
Probab=42.46 E-value=76 Score=20.00 Aligned_cols=37 Identities=16% Similarity=0.058 Sum_probs=25.6
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKTE 44 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~~ 44 (56)
+.+|...+.-||+.+ ..+.-+.|.+.|..|++.+|..
T Consensus 24 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~kR~ 64 (377)
T PRK14298 24 KAYRKLAMKYHPDKNKEPDAEEKFKEISEAYAVLSDAEKRA 64 (377)
T ss_pred HHHHHHHHHhCccccCChhHHHHHHHHHHHHHHhcchHhhh
Confidence 445666667788875 3345577999999888877543
No 90
>PF03480 SBP_bac_7: Bacterial extracellular solute-binding protein, family 7; InterPro: IPR018389 This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein DctP [, ] and the sialic acid-binding protein SiaP. The structure of the SiaP receptor has revealed an overall topology similar to ATP binding cassette ESR (extracytoplasmic solute receptors) proteins []. Upon binding of sialic acid, SiaP undergoes domain closure about a hinge region and kinking of an alpha-helix hinge component [].; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space; PDB: 2HZK_C 2HZL_B 2HPG_C 2XWI_A 2XWK_A 2WX9_A 2CEY_A 2WYP_A 3B50_A 2CEX_B ....
Probab=42.43 E-value=28 Score=20.53 Aligned_cols=20 Identities=15% Similarity=0.589 Sum_probs=15.3
Q ss_pred HHhcCCCHHHHHHHHHHHHH
Q psy5907 32 QMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 32 ~~W~~l~~~eK~~y~~~a~~ 51 (56)
+.|..||++.|+-..+.+.+
T Consensus 214 ~~w~~L~~e~q~~l~~~~~~ 233 (286)
T PF03480_consen 214 DWWDSLPDEDQEALDDAADE 233 (286)
T ss_dssp HHHHHS-HHHHHHHHHHHHH
T ss_pred HHHhcCCHHHHHHHHHHHHH
Confidence 56999999999888776655
No 91
>PF02936 COX4: Cytochrome c oxidase subunit IV; InterPro: IPR004203 Cytochrome c oxidase, a 13 sub-unit complex (1.9.3.1 from EC) is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium discoideum (Slime mould) member of this family is called COX VI. The Saccharomyces cerevisiae protein YGX6_YEAST appears to be the yeast COX IV subunit.; GO: 0004129 cytochrome-c oxidase activity; PDB: 3ABK_Q 3AG1_Q 3ASN_Q 1OCZ_D 2EIN_Q 2OCC_D 2YBB_O 3AG3_D 1OCO_Q 1V55_Q ....
Probab=41.89 E-value=15 Score=20.18 Aligned_cols=22 Identities=23% Similarity=0.347 Sum_probs=12.7
Q ss_pred HHHHHHHHH----HHhcCCCHHHHHH
Q psy5907 23 LWEIGKIIG----QMWRDLPEDQKTE 44 (56)
Q Consensus 23 ~~ei~k~l~----~~W~~l~~~eK~~ 44 (56)
.+++.+.|. .-|+.||.+||.-
T Consensus 29 ~~~~~~~LkeKekg~Wk~LS~eEKka 54 (142)
T PF02936_consen 29 QTEIIEALKEKEKGDWKKLSLEEKKA 54 (142)
T ss_dssp --HHHHHHHHHTTS-GGGS-HHHHHH
T ss_pred hHHHHHHHHHHHhCCHhhCCHHHHHH
Confidence 345555555 4599999998864
No 92
>PRK05439 pantothenate kinase; Provisional
Probab=41.18 E-value=20 Score=22.16 Aligned_cols=49 Identities=20% Similarity=0.437 Sum_probs=30.3
Q ss_pred ChHHhHHHHHHHHHHHhCC-CCCHHHHHHHHHHHhcCCC-HHHHHHHHHHHH
Q psy5907 1 MPYMRYSRKVWDQVKAQNL-DLKLWEIGKIIGQMWRDLP-EDQKTEYVEDYE 50 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~p-~~~~~ei~k~l~~~W~~l~-~~eK~~y~~~a~ 50 (56)
++|..|..+.|..+....| .++..|+.++-| .-..++ ++-...|..++.
T Consensus 8 ~~~~~~~r~~w~~l~~~~~~~l~~~~~~~l~~-~~~~~~~~~v~~iy~plar 58 (311)
T PRK05439 8 SPYLEFSREQWAALRDSTPLTLTEEELERLRG-LNDPISLEEVAEIYLPLSR 58 (311)
T ss_pred CCceeECHHHHHHHHhcCCCCCCHHHHHHhhc-CCCCCCHHHHHHHHHHHHH
Confidence 4788899999999987776 567777766433 222222 333445544443
No 93
>PF05914 RIB43A: RIB43A; InterPro: IPR008805 This family consists of several RIB43A-like eukaryotic proteins. Ciliary and flagellar microtubules contain a specialised set of protofilaments, termed ribbons, that are composed of tubulin and several associated proteins. RIB43A was first characterised in the unicellular biflagellate, Chlamydomonas reinhardtii although highly related sequences are present in several higher eukaryotes including humans. The function of this protein is unknown although the structure of RIB43A and its association with the specialised protofilament ribbons and with basal bodies is relevant to the proposed role of ribbons in forming and stabilising doublet and triplet microtubules and in organising their three-dimensional structure. Human RIB43A homologues could represent a structural requirement in centriole replication in dividing cells [].
Probab=40.84 E-value=4.9 Score=25.41 Aligned_cols=34 Identities=15% Similarity=0.344 Sum_probs=23.9
Q ss_pred HHHhCCCCC--HHHHHHHHHHHhcCCCHHHHHHHHH
Q psy5907 14 VKAQNLDLK--LWEIGKIIGQMWRDLPEDQKTEYVE 47 (56)
Q Consensus 14 ~~~~~p~~~--~~ei~k~l~~~W~~l~~~eK~~y~~ 47 (56)
+..++|... ...-.+.+...|+.||+++...+..
T Consensus 252 lLtEnp~~a~s~~gp~Rv~~d~wKGMs~eQl~~i~~ 287 (379)
T PF05914_consen 252 LLTENPQVAQSSFGPHRVIPDRWKGMSPEQLEEIRK 287 (379)
T ss_pred cccCCchhccccCCCCCCCCcccCCCCHHHHHHHHH
Confidence 335667653 2233567889999999999988764
No 94
>PRK14292 chaperone protein DnaJ; Provisional
Probab=40.74 E-value=89 Score=19.55 Aligned_cols=39 Identities=15% Similarity=0.058 Sum_probs=27.3
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHH-HHHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQK-TEYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK-~~y~ 46 (56)
+.++...+.-||+.. ..+.-+.|.+....|++.++ ..|.
T Consensus 21 ~ayr~l~~~~hpD~~~~~~a~~~~~~i~~Ay~vL~d~~~r~~yd 64 (371)
T PRK14292 21 SAYRKLALKYHPDRNKEKGAAEKFAQINEAYAVLSDAEKRAHYD 64 (371)
T ss_pred HHHHHHHHHHCCCCCCChhHHHHHHHHHHHHHHhcchhhhhhHh
Confidence 456667777788865 45666788899998886654 4444
No 95
>PF09829 DUF2057: Uncharacterized protein conserved in bacteria (DUF2057); InterPro: IPR018635 The proteins in this entry are functionally uncharacterised.
Probab=40.01 E-value=39 Score=18.97 Aligned_cols=24 Identities=4% Similarity=0.178 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907 23 LWEIGKIIGQMWRDLPEDQKTEYV 46 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~eK~~y~ 46 (56)
..+...+|-..|...++++|+.|.
T Consensus 164 ~~~~~~~Lq~wy~qAs~~eRk~F~ 187 (189)
T PF09829_consen 164 ESEALQMLQYWYLQASKEERKAFL 187 (189)
T ss_pred cccHHHHHHHHHHhCCHHHHHHHh
Confidence 456678899999999999999875
No 96
>smart00351 PAX Paired Box domain.
Probab=39.40 E-value=58 Score=17.15 Aligned_cols=20 Identities=25% Similarity=0.366 Sum_probs=16.6
Q ss_pred HHHHhCCCCCHHHHHHHHHH
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQ 32 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~ 32 (56)
.+..+||+++..||...|.+
T Consensus 86 ~~~~~~p~~t~~el~~~L~~ 105 (125)
T smart00351 86 DYKQENPGIFAWEIRDRLLS 105 (125)
T ss_pred HHHHHCCCCCHHHHHHHHHH
Confidence 35678999999999988875
No 97
>cd08318 Death_NMPP84 Death domain of Nuclear Matrix Protein P84. Death domain (DD) found in the Nuclear Matrix Protein P84 (also known as HPR1 or THOC1). HPR1/p84 resides in the nuclear matrix and is part of the THO complex, also called TREX (transcription/export) complex, which functions in mRNP biogenesis at the interface between transcription and export of mRNA from the nucleus. Mice lacking THOC1 have abnormal testis development and are sterile. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.
Probab=39.06 E-value=17 Score=17.99 Aligned_cols=16 Identities=19% Similarity=0.561 Sum_probs=12.6
Q ss_pred HHHHHHHHHHHhcCCC
Q psy5907 23 LWEIGKIIGQMWRDLP 38 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~ 38 (56)
...|+..||..|+.|.
T Consensus 10 l~~ia~~iG~~Wk~La 25 (86)
T cd08318 10 ITVFANKLGEDWKTLA 25 (86)
T ss_pred HHHHHHHHhhhHHHHH
Confidence 5668888999998664
No 98
>PRK14295 chaperone protein DnaJ; Provisional
Probab=38.58 E-value=1e+02 Score=19.52 Aligned_cols=40 Identities=23% Similarity=0.223 Sum_probs=28.1
Q ss_pred HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHH-HHHH
Q psy5907 8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKT-EYVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~-~y~~ 47 (56)
+.+|...+.-||+.+ ..+.-+.|.+.|..|++.+|. .|..
T Consensus 28 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD~ 73 (389)
T PRK14295 28 KAYRKLAREYHPDANKGDAKAEERFKEISEAYDVLSDEKKRKEYDE 73 (389)
T ss_pred HHHHHHHHHHCCCcCCCchhHHHHHHHHHHHHHHHCchhhHHHHHH
Confidence 455666677788864 346677899999988877654 4543
No 99
>PRK14290 chaperone protein DnaJ; Provisional
Probab=38.50 E-value=83 Score=19.69 Aligned_cols=36 Identities=25% Similarity=0.184 Sum_probs=25.9
Q ss_pred HHHHHHHHHhCCCCCH------HHHHHHHHHHhcCCCHHHHH
Q psy5907 8 RKVWDQVKAQNLDLKL------WEIGKIIGQMWRDLPEDQKT 43 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~------~ei~k~l~~~W~~l~~~eK~ 43 (56)
+.+|...++-||+.+. .+.-+.|.+.|..|++..+.
T Consensus 22 ~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~~L~d~~~r 63 (365)
T PRK14290 22 KAFRELAKKWHPDLHPGNKAEAEEKFKEISEAYEVLSDPQKR 63 (365)
T ss_pred HHHHHHHHHHCcCCCCCchhHHHHHHHHHHHHHHHhcChhhh
Confidence 4566677777888753 35557799999998887754
No 100
>cd08779 Death_PIDD Death Domain of p53-induced protein with a death domain. Death domain (DD) found in PIDD (p53-induced protein with a death domain) and similar proteins. PIDD is a component of the PIDDosome complex, which is an oligomeric caspase-activating complex involved in caspase-2 activation and plays a role in mediating stress-induced apoptosis. The PIDDosome complex is composed of three components, PIDD, RAIDD and caspase-2, which interact through their DDs and DD-like domains. The DD of PIDD interacts with the DD of RAIDD, which also contains a Caspase Activation and Recruitment Domain (CARD) that interacts with the caspase-2 CARD. Autoproteolysis of PIDD determines the downstream signaling event, between pro-survival NF-kB or pro-death caspase-2 activation. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members
Probab=38.42 E-value=15 Score=18.29 Aligned_cols=17 Identities=24% Similarity=0.454 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHhcCCCH
Q psy5907 23 LWEIGKIIGQMWRDLPE 39 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~ 39 (56)
...|+..||..|+.+..
T Consensus 5 l~~ia~~LG~~Wk~lar 21 (86)
T cd08779 5 LLSIAGRLGLDWQAIGL 21 (86)
T ss_pred HHHHHHHHhHHHHHHHH
Confidence 45788899999987653
No 101
>PF14098 SSPI: Small, acid-soluble spore protein I
Probab=38.35 E-value=49 Score=15.94 Aligned_cols=17 Identities=29% Similarity=0.667 Sum_probs=13.9
Q ss_pred HHhcCCCHHHHHHHHHH
Q psy5907 32 QMWRDLPEDQKTEYVED 48 (56)
Q Consensus 32 ~~W~~l~~~eK~~y~~~ 48 (56)
-.|+.+++++|....+.
T Consensus 44 ~~W~~~~~~ek~~m~~~ 60 (65)
T PF14098_consen 44 VIWKNSDESEKQEMVNT 60 (65)
T ss_pred HHHHhCCHHHHHHHHHH
Confidence 46999999999887654
No 102
>COG4735 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=38.03 E-value=81 Score=18.67 Aligned_cols=33 Identities=15% Similarity=0.213 Sum_probs=26.5
Q ss_pred CCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 19 LDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
.+.++...-+..|..|+.++.+++....-.+.+
T Consensus 35 gg~s~anflr~~G~lyk~il~d~~~~Lk~k~~k 67 (211)
T COG4735 35 GGLSAANFLRDLGQLYKKILCDEKDKLKVKVQK 67 (211)
T ss_pred ccHHHHHHHHHHHHHHHHcCHHHHHHHHHHHHH
Confidence 467788888889999999999998877665554
No 103
>PF01710 HTH_Tnp_IS630: Transposase; InterPro: IPR002622 Transposase proteins are necessary for efficient DNA transposition. This entry includes insertion sequences from Synechocystis sp. (strain PCC 6803) three of which are characterised as homologous to bacterial IS5- and IS4- and to several members of the IS630-Tc1-mariner superfamily []. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=37.98 E-value=48 Score=17.26 Aligned_cols=20 Identities=25% Similarity=0.413 Sum_probs=14.7
Q ss_pred HHHHHhCCCCCHHHHHHHHH
Q psy5907 12 DQVKAQNLDLKLWEIGKIIG 31 (56)
Q Consensus 12 ~~~~~~~p~~~~~ei~k~l~ 31 (56)
..+...+|+++..|+++.+|
T Consensus 63 ~~~v~~~pd~tl~Ela~~l~ 82 (119)
T PF01710_consen 63 KALVEENPDATLRELAERLG 82 (119)
T ss_pred HHHHHHCCCcCHHHHHHHcC
Confidence 44556789999999887643
No 104
>PRK14278 chaperone protein DnaJ; Provisional
Probab=37.22 E-value=1.1e+02 Score=19.34 Aligned_cols=39 Identities=18% Similarity=0.079 Sum_probs=26.9
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+ ..+.-+.|++.|..|++.+|. .|.
T Consensus 22 ~ayr~la~~~hpD~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~YD 65 (378)
T PRK14278 22 RAYRKLARELHPDVNPDEEAQEKFKEISVAYEVLSDPEKRRIVD 65 (378)
T ss_pred HHHHHHHHHHCCCCCCcHHHHHHHHHHHHHHHHhchhhhhhhhh
Confidence 456667777788875 335556799999999766544 444
No 105
>PF11616 EZH2_WD-Binding: WD repeat binding protein EZH2; InterPro: IPR021654 This family of proteins represents Enhancer of zest homologue 2, (EZH2) a 30 residue peptide which binds to a WD-repeat domain of EED by residues 39-68. EED is a component of PRC2 complex which is involved in gene expression []. This interaction is required for the HMTase activity of PCR2 []. ; GO: 0018024 histone-lysine N-methyltransferase activity; PDB: 2QXV_B.
Probab=37.21 E-value=32 Score=14.05 Aligned_cols=14 Identities=29% Similarity=0.624 Sum_probs=10.0
Q ss_pred HHHHHHHHHHhcCC
Q psy5907 24 WEIGKIIGQMWRDL 37 (56)
Q Consensus 24 ~ei~k~l~~~W~~l 37 (56)
.|-+.++.+.|+.+
T Consensus 12 ~e~t~iLN~eWk~l 25 (30)
T PF11616_consen 12 QERTDILNEEWKKL 25 (30)
T ss_dssp HHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHh
Confidence 45677888999865
No 106
>PF06242 DUF1013: Protein of unknown function (DUF1013); InterPro: IPR010421 This is a family of uncharacterised proteins found in Proteobacteria.
Probab=36.61 E-value=33 Score=19.01 Aligned_cols=18 Identities=28% Similarity=0.362 Sum_probs=15.2
Q ss_pred HHhCCCCCHHHHHHHHHH
Q psy5907 15 KAQNLDLKLWEIGKIIGQ 32 (56)
Q Consensus 15 ~~~~p~~~~~ei~k~l~~ 32 (56)
...||.++-+.|++.+|.
T Consensus 78 vk~hPel~DaQI~kLiGT 95 (140)
T PF06242_consen 78 VKNHPELSDAQIAKLIGT 95 (140)
T ss_pred HhcCCcccHHHHHHHhcC
Confidence 356899999999999884
No 107
>PF11334 DUF3136: Protein of unknown function (DUF3136); InterPro: IPR021483 This family of proteins with unknown function appear to be restricted to Cyanobacteria.
Probab=36.47 E-value=55 Score=15.74 Aligned_cols=30 Identities=20% Similarity=0.274 Sum_probs=22.2
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcC
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRD 36 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~ 36 (56)
.|.+||+..+-.+. -+.+..+|.+.+ .|.-
T Consensus 9 ~y~~YckALr~Lv~---~G~~~~~i~rTv--CW~r 38 (64)
T PF11334_consen 9 GYPLYCKALRRLVA---DGRSEEEIRRTV--CWDR 38 (64)
T ss_pred CchHHHHHHHHHHH---cCCCHHHHHHHH--HHHH
Confidence 37899999999887 477888887654 4643
No 108
>cd00922 Cyt_c_Oxidase_IV Cytochrome c oxidase subunit IV. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit IV is the largest of the nuclear-encoded subunits. It binds ATP at the matrix side, leading to an allosteric inhibition of enzyme activity at high intramitochondrial ATP/ADP ratios. In mammals, subunit IV has a lung-specific isoform and a ubiquitously expressed isoform.
Probab=36.47 E-value=19 Score=19.60 Aligned_cols=14 Identities=21% Similarity=0.541 Sum_probs=11.3
Q ss_pred HhcCCCHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYV 46 (56)
Q Consensus 33 ~W~~l~~~eK~~y~ 46 (56)
-|+.||.+||.-..
T Consensus 43 dW~~LT~~EKkAlY 56 (136)
T cd00922 43 DWKQLTLEEKKALY 56 (136)
T ss_pred CHhhCCHHHHhhHh
Confidence 49999999997644
No 109
>PRK14286 chaperone protein DnaJ; Provisional
Probab=36.37 E-value=1.1e+02 Score=19.23 Aligned_cols=40 Identities=15% Similarity=0.006 Sum_probs=28.3
Q ss_pred HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHH-HHHHH
Q psy5907 8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQK-TEYVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK-~~y~~ 47 (56)
+.+|...+.-||+.+ ..+.-+.|.+.+..|++.+| ..|..
T Consensus 23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD~ 68 (372)
T PRK14286 23 SAYRKLAIKYHPDKNKGNKESEEKFKEATEAYEILRDPKKRQAYDQ 68 (372)
T ss_pred HHHHHHHHHHCcCCCCCchHHHHHHHHHHHHHHHhccHHHHHHHHH
Confidence 456677777889875 24566789999999987664 45553
No 110
>COG2938 Uncharacterized conserved protein [Function unknown]
Probab=36.27 E-value=66 Score=16.59 Aligned_cols=24 Identities=13% Similarity=0.223 Sum_probs=18.6
Q ss_pred HHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 28 KIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 28 k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
.-....|..||++++..|..+-+.
T Consensus 35 ~Fae~~~~~lsd~el~~f~~LLe~ 58 (94)
T COG2938 35 PFAEKEFDSLSDEELDEFERLLEC 58 (94)
T ss_pred HHHHHHHhhCCHHHHHHHHHHHcC
Confidence 334578899999999999877543
No 111
>PF03810 IBN_N: Importin-beta N-terminal domain; InterPro: IPR001494 Karyopherins are a group of proteins involved in transporting molecules through the pores of the nuclear envelope. Karyopherins, which may act as importins or exportins, are part of the Importin-beta super-family, which all share a similar three-dimensional structure. Members of the importin-beta (karyopherin-beta) family can bind and transport cargo by themselves, or can form heterodimers with importin-alpha. As part of a heterodimer, importin-beta mediates interactions with the pore complex, while importin-alpha acts as an adaptor protein to bind the nuclear localisation signal (NLS) on the cargo through the classical NLS import of proteins. Importin-beta is a helicoidal molecule constructed from 19 HEAT repeats. Many nuclear pore proteins contain FG sequence repeats that can bind to HEAT repeats within importins [, ], which is important for importin-beta mediated transport. Ran GTPase helps to control the unidirectional transfer of cargo. The cytoplasm contains primarily RanGDP and the nucleus RanGTP through the actions of RanGAP and RanGEF, respectively. In the nucleus, RanGTP binds to importin-beta within the importin/cargo complex, causing a conformational change in importin-beta that releases it from importin-alpha-bound cargo. As a result, the N-terminal auto-inhibitory region on importin-alpha is free to loop back and bind to the major NLS-binding site, causing the cargo to be released []. There are additional release factors as well. This entry represents the N-terminal domain of karyopherins that is important for the binding of the Ran protein []. More information about these proteins can be found at Protein of the Month: Importins [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport; PDB: 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 1IBR_D 1QGR_A 3LWW_A 1F59_A 2Q5D_A ....
Probab=35.34 E-value=52 Score=15.11 Aligned_cols=23 Identities=26% Similarity=0.606 Sum_probs=16.5
Q ss_pred HHHHHHHHHhc--------CCCHHHHHHHHH
Q psy5907 25 EIGKIIGQMWR--------DLPEDQKTEYVE 47 (56)
Q Consensus 25 ei~k~l~~~W~--------~l~~~eK~~y~~ 47 (56)
-+-..|...|. .+|+++|..-..
T Consensus 39 ~LKn~I~~~W~~~~~~~~~~~~~~~k~~Ik~ 69 (77)
T PF03810_consen 39 LLKNLIKKNWSPSKQKGWSQLPEEEKEQIKS 69 (77)
T ss_dssp HHHHHHHHSGGHHHHHHHHGSSHHHHHHHHH
T ss_pred HHHHHHHHcCchhhccCCCCCCHHHHHHHHH
Confidence 34455777788 999999976543
No 112
>PF11903 DUF3423: Protein of unknown function (DUF3423); InterPro: IPR021831 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 73 to 118 amino acids in length.
Probab=35.20 E-value=60 Score=15.83 Aligned_cols=20 Identities=20% Similarity=0.174 Sum_probs=15.4
Q ss_pred HHHHhCCCCCHHHHHHHHHH
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQ 32 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~ 32 (56)
.+...||+++...|...+-.
T Consensus 37 r~~E~np~l~~~~I~~~l~~ 56 (72)
T PF11903_consen 37 RAAEDNPDLSYSFIEEMLLA 56 (72)
T ss_pred HHHHHCCCCCHHHHHHHHHH
Confidence 44578999999999887643
No 113
>PRK14288 chaperone protein DnaJ; Provisional
Probab=34.90 E-value=1.2e+02 Score=19.10 Aligned_cols=39 Identities=18% Similarity=0.138 Sum_probs=27.7
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+. .+.-+.|.+.+..|++.+|. .|.
T Consensus 22 kayrkla~k~HPD~~~~~~~a~~~f~~i~~AYevLsd~~kR~~YD 66 (369)
T PRK14288 22 KSYRKLALKYHPDRNAGDKEAEEKFKLINEAYGVLSDEKKRALYD 66 (369)
T ss_pred HHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHHhccHHHHHHHH
Confidence 4567777788898753 34556799999999876654 454
No 114
>PF09556 RE_HaeIII: HaeIII restriction endonuclease; InterPro: IPR019059 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry represents the restriction endonuclease HaeIII, which recognises and cleaves the double-stranded sequence GG^CC.
Probab=34.89 E-value=16 Score=22.59 Aligned_cols=28 Identities=29% Similarity=0.548 Sum_probs=21.0
Q ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 24 WEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 24 ~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
-+.-+.-+..|+.++..+-..|..+-..
T Consensus 150 L~~~k~~~~~W~~l~nKe~~vYiPLL~A 177 (300)
T PF09556_consen 150 LEEEKKEGTKWRELPNKEEDVYIPLLKA 177 (300)
T ss_pred HHHHHHcCcchhhcccccccchHHHHHH
Confidence 3444556679999999999999876543
No 115
>cd07911 RNRR2_Rv0233_like Ribonucleotide Reductase R2-like protein, Mn/Fe-binding domain. Rv0233 is a Mycobacterium tuberculosis ribonucleotide reductase R2 protein with a heterodinuclear manganese/iron-carboxylate cofactor located in its metal center. The Rv0233-like family may represent a structural/functional counterpart of the evolutionary ancestor of the RNRR2's (Ribonucleotide Reductase, R2/beta subunit) and the bacterial multicomponent monooxygenases. RNRR2s belong to a broad superfamily of ferritin-like diiron-carboxylate proteins. The RNR protein catalyzes the conversion of ribonucleotides to deoxyribonucleotides and is found in prokaryotes and archaea. The catalytically active form of RNR is a proposed alpha2-beta2 tetramer. The homodimeric alpha subunit (R1) contains the active site and redox active cysteines as well as the allosteric binding sites.
Probab=34.79 E-value=41 Score=20.06 Aligned_cols=16 Identities=19% Similarity=0.435 Sum_probs=13.7
Q ss_pred HhcCCCHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVED 48 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~ 48 (56)
.|+.|++.||..|...
T Consensus 35 ~w~~L~~~Er~~~~~~ 50 (280)
T cd07911 35 DWEQLSEEERDLALRL 50 (280)
T ss_pred HHHhCCHHHHHHHHHH
Confidence 5999999999988754
No 116
>PRK09731 putative general secretion pathway protein YghD; Provisional
Probab=34.17 E-value=95 Score=17.80 Aligned_cols=34 Identities=15% Similarity=0.199 Sum_probs=23.3
Q ss_pred HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCCHHHHHH
Q psy5907 3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTE 44 (56)
Q Consensus 3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~~~eK~~ 44 (56)
|+.|++..|.. .++.+| .-++..|..+++-|+..
T Consensus 6 ~~~~~~~~~~~------~~~~~~--~~~~~~W~~ls~REq~l 39 (178)
T PRK09731 6 FIHYFQQWRER------QLSRGE--HWLAQHLAGRSPREKGM 39 (178)
T ss_pred HHHHHHHHHHH------Hhcchh--hHHHHHHccCCHHHHHH
Confidence 67777766654 233333 36888999999998864
No 117
>KOG0715|consensus
Probab=33.87 E-value=1.1e+02 Score=18.62 Aligned_cols=37 Identities=16% Similarity=0.127 Sum_probs=26.8
Q ss_pred HHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHHHH
Q psy5907 9 KVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKTEY 45 (56)
Q Consensus 9 ~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~~y 45 (56)
..+...+.-||+.+ ..+.-..|.+.+..|++++|..=
T Consensus 63 Af~~LaKkyHPD~n~~~~a~~kF~eI~~AYEiLsd~eKR~~ 103 (288)
T KOG0715|consen 63 AFRKLAKKYHPDVNKDKEASKKFKEISEAYEILSDEEKRQE 103 (288)
T ss_pred HHHHHHHhhCCCCCCCcchhhHHHHHHHHHHHhcCHHHHHH
Confidence 45566677788764 66677788999998887777543
No 118
>PRK14297 chaperone protein DnaJ; Provisional
Probab=33.49 E-value=1e+02 Score=19.45 Aligned_cols=39 Identities=21% Similarity=0.141 Sum_probs=27.4
Q ss_pred HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+ ..+.-+.|.+.|..|++.+|. .|.
T Consensus 23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~~r~~yD 67 (380)
T PRK14297 23 KAFRKLAIKYHPDKNKGNKEAEEKFKEINEAYQVLSDPQKKAQYD 67 (380)
T ss_pred HHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhcCHhhhCchh
Confidence 455666677788875 346678899999998886644 444
No 119
>PRK14289 chaperone protein DnaJ; Provisional
Probab=33.47 E-value=1.3e+02 Score=19.02 Aligned_cols=40 Identities=15% Similarity=0.086 Sum_probs=27.5
Q ss_pred HHHHHHHHHhCCCCC-----HHHHHHHHHHHhcCCCHHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK-----LWEIGKIIGQMWRDLPEDQKTE-YVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~-----~~ei~k~l~~~W~~l~~~eK~~-y~~ 47 (56)
+..+...+.-||+.+ ..+.-+.+.+.|..|++.++.. |..
T Consensus 24 ~ayr~la~~~HpD~~~~~~~a~~~f~~i~~Ay~~L~d~~~R~~yD~ 69 (386)
T PRK14289 24 KAYRKKAIQYHPDKNPGDKEAEEKFKEAAEAYDVLSDPDKRSRYDQ 69 (386)
T ss_pred HHHHHHHHHHCCCCCCCChHHHHHHHHHHHHHHHhcCHHHHHHHHH
Confidence 345556666788875 3455677889999998887554 664
No 120
>PF13542 HTH_Tnp_ISL3: Helix-turn-helix domain of transposase family ISL3
Probab=33.29 E-value=37 Score=14.58 Aligned_cols=32 Identities=6% Similarity=0.257 Sum_probs=20.2
Q ss_pred HHHHHHHHHHHhCCCC-CHHHHHHHHHHHhcCC
Q psy5907 6 YSRKVWDQVKAQNLDL-KLWEIGKIIGQMWRDL 37 (56)
Q Consensus 6 f~~~~~~~~~~~~p~~-~~~ei~k~l~~~W~~l 37 (56)
+.......+....-+. +..+|++.+|=.|...
T Consensus 12 ~T~~~~~~i~~~~~~~~s~~~vA~~~~vs~~TV 44 (52)
T PF13542_consen 12 ITKRLEQYILKLLRESRSFKDVARELGVSWSTV 44 (52)
T ss_pred HHHHHHHHHHHHHhhcCCHHHHHHHHCCCHHHH
Confidence 3344444455444444 8999999888777643
No 121
>smart00413 ETS erythroblast transformation specific domain. variation of the helix-turn-helix motif
Probab=33.25 E-value=26 Score=17.84 Aligned_cols=24 Identities=8% Similarity=0.318 Sum_probs=10.8
Q ss_pred HHHHhcCCCHHHHHHHHHHHHHhh
Q psy5907 30 IGQMWRDLPEDQKTEYVEDYEQEK 53 (56)
Q Consensus 30 l~~~W~~l~~~eK~~y~~~a~~~k 53 (56)
+++.|..-......-|+.++...+
T Consensus 39 vA~lWG~~Knk~~M~YeklSRaLR 62 (87)
T smart00413 39 VARLWGQRKNKPNMNYEKLSRALR 62 (87)
T ss_pred HHHHHhhhcCCCCCCHHHHHHHHH
Confidence 444554433333344555554443
No 122
>smart00206 NTR Tissue inhibitor of metalloproteinase family. Form complexes with metalloproteinases, such as collagenases, and irreversibly inactivate them.
Probab=33.14 E-value=26 Score=20.02 Aligned_cols=18 Identities=22% Similarity=0.462 Sum_probs=15.5
Q ss_pred HhcCCCHHHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVEDYE 50 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~a~ 50 (56)
.|.+|+..+|.-+...+.
T Consensus 101 pW~~lT~~Qr~Gl~~~Y~ 118 (172)
T smart00206 101 PWDSLSLAQRKGLNKRYH 118 (172)
T ss_pred ehHhCCHHHHhhhhHhhc
Confidence 599999999999887764
No 123
>PF12376 DUF3654: Protein of unknown function (DUF3654) ; InterPro: IPR022115 This family of proteins is found in eukaryotes. Proteins in this family are typically between 193 and 612 amino acids in length.
Probab=33.08 E-value=63 Score=17.86 Aligned_cols=21 Identities=29% Similarity=0.373 Sum_probs=15.3
Q ss_pred cCCCHHHHHHHHHHHHHhhhc
Q psy5907 35 RDLPEDQKTEYVEDYEQEKIY 55 (56)
Q Consensus 35 ~~l~~~eK~~y~~~a~~~k~~ 55 (56)
..||+++|..+...-..-|.|
T Consensus 6 ~~~s~eEk~k~~e~l~~ikey 26 (137)
T PF12376_consen 6 SELSEEEKKKEEEMLNKIKEY 26 (137)
T ss_pred cccCHHHHHHHHHHHHHHHHH
Confidence 467888888888777666654
No 124
>TIGR02349 DnaJ_bact chaperone protein DnaJ. This model represents bacterial forms of DnaJ, part of the DnaK-DnaJ-GrpE chaperone system. The three components typically are encoded by consecutive genes. DnaJ homologs occur in many genomes, typically not near DnaK and GrpE-like genes; most such genes are not included by this family. Eukaryotic (mitochondrial and chloroplast) forms are not included in the scope of this family.
Probab=32.78 E-value=1.2e+02 Score=18.73 Aligned_cols=39 Identities=21% Similarity=0.155 Sum_probs=27.5
Q ss_pred HHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.++...+.-||+.. ..+.-+.|.+.|..|++..+. .|.
T Consensus 19 ~ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vL~d~~~R~~yd 62 (354)
T TIGR02349 19 KAYRKLAKKYHPDRNKDKEAEEKFKEINEAYEVLSDPEKRAQYD 62 (354)
T ss_pred HHHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhhChHHHHhhh
Confidence 455666677788875 345667899999988877744 444
No 125
>PHA02947 S-S bond formation pathway protein; Provisional
Probab=32.73 E-value=49 Score=19.62 Aligned_cols=28 Identities=25% Similarity=0.321 Sum_probs=23.6
Q ss_pred CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907 18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY 45 (56)
Q Consensus 18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y 45 (56)
++..++.-+.+.+.+--..||+++|..-
T Consensus 58 na~~sf~lli~a~~Et~~~Lp~~qk~~i 85 (215)
T PHA02947 58 NKRLSFTLLIKTFKEVISTLPEKERREL 85 (215)
T ss_pred CHHHHHHHHHHHHHHHHHhCCHHHHHHH
Confidence 4567788889999999999999999754
No 126
>PF02216 B: B domain; InterPro: IPR003132 This entry represents the immunoglobulin-binding domain found in the Staphylococcus aureus virulence factor protein A (SpA). Protein A contains five highly homologous Ig-binding domains in tandem (designated domains E, D, A, B and C), which share a common structure consisting of three helices in a closed left-handed twist. Protein A can exist in both secreted and membrane-bound forms, and has two distinct Ig-binding activities: each domain can bind Fc-gamma (the constant region of IgG involved in effector functions) and Fab (the Ig fragment responsible for antigen recognition) [].; GO: 0019865 immunoglobulin binding, 0009405 pathogenesis; PDB: 1EDL_A 1EDI_A 1EDJ_A 1EDK_A 2B88_A 2B87_A 2B89_A 1FC2_C 1DEE_H 1ZXG_A ....
Probab=32.64 E-value=60 Score=15.06 Aligned_cols=17 Identities=24% Similarity=0.628 Sum_probs=11.8
Q ss_pred cCCCHHHHHHHHHHHHH
Q psy5907 35 RDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 35 ~~l~~~eK~~y~~~a~~ 51 (56)
..|++++|.-|++.-+.
T Consensus 20 ~nLteeQrn~yI~~lKd 36 (54)
T PF02216_consen 20 PNLTEEQRNGYIQSLKD 36 (54)
T ss_dssp TTS-HHHHHHHHHHHHH
T ss_pred CCcCHHHHHhHHHHHhh
Confidence 46788888888876554
No 127
>PHA02955 hypothetical protein; Provisional
Probab=32.43 E-value=49 Score=19.61 Aligned_cols=28 Identities=11% Similarity=0.194 Sum_probs=23.9
Q ss_pred CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907 18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY 45 (56)
Q Consensus 18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y 45 (56)
++..++.-+.+.+.+--..||+++|..-
T Consensus 58 na~~sf~lli~a~~Et~~~Lp~~qk~~i 85 (213)
T PHA02955 58 DEEKNFQLLIEALIETIENFPEKEQKEI 85 (213)
T ss_pred CHHHHHHHHHHHHHHHHHhCCHHHHHHH
Confidence 4567788899999999999999999765
No 128
>PF00356 LacI: Bacterial regulatory proteins, lacI family; InterPro: IPR000843 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. These proteins are very diverse, but for convenience may be grouped into subfamilies on the basis of sequence similarity. One such family groups together a range of proteins, including ascG, ccpA, cytR, ebgR, fruR, galR, galS, lacI, malI, opnR, purF, rafR, rbtR and scrR [, ]. Within this family, the HTH motif is situated towards the N terminus.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 3KJX_C 1ZAY_A 1VPW_A 2PUA_A 1QQA_A 1PNR_A 1JFT_A 1QP4_A 2PUD_A 1JH9_A ....
Probab=32.42 E-value=53 Score=14.39 Aligned_cols=32 Identities=9% Similarity=0.233 Sum_probs=25.1
Q ss_pred CCCCHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 19 LDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
-+.+.+-|++.|... ..++++.++.-.+.+++
T Consensus 9 agvS~~TVSr~ln~~-~~vs~~tr~rI~~~a~~ 40 (46)
T PF00356_consen 9 AGVSKSTVSRVLNGP-PRVSEETRERILEAAEE 40 (46)
T ss_dssp HTSSHHHHHHHHTTC-SSSTHHHHHHHHHHHHH
T ss_pred HCcCHHHHHHHHhCC-CCCCHHHHHHHHHHHHH
Confidence 478889999999887 78899988876655544
No 129
>cd08804 Death_ank2 Death domain of Ankyrin-2. Death Domain (DD) of Ankyrin-2 (ANK-2) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-2, also called ankyrin-B (for broadly expressed), is required for proper function of the Na/Ca ion exchanger-1 in cardiomyocytes, and is thought to function in linking integral membrane proteins to the underlying cytoskeleton. Human ANK-2 is associated with "Ankyrin-B syndrome", an atypical arrythmia disorder with risk of sudden cardiac death. It also plays key roles in the brain and striated muscle. Loss of ANK-2 is associated with significant nervous system defects and sarcomere disorganization. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other
Probab=32.05 E-value=20 Score=17.78 Aligned_cols=20 Identities=25% Similarity=0.513 Sum_probs=14.9
Q ss_pred CCHHHHHHHHHHHhcCCCHH
Q psy5907 21 LKLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 21 ~~~~ei~k~l~~~W~~l~~~ 40 (56)
.....|+..||..|+.+..+
T Consensus 5 ~~l~~ia~~LG~dWk~LAr~ 24 (84)
T cd08804 5 ERLAVIADHLGFSWTELARE 24 (84)
T ss_pred hHHHHHHHHHhhhHHHHHHH
Confidence 44577888999999877544
No 130
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=31.65 E-value=52 Score=14.04 Aligned_cols=31 Identities=16% Similarity=-0.016 Sum_probs=18.7
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCC
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLP 38 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~ 38 (56)
+..+..++.-||+... .+....|.+.|..|+
T Consensus 19 ~~y~~l~~~~HPD~~~~~~~~~~~~~~l~~Ay~~L~ 54 (55)
T cd06257 19 KAYRKLALKYHPDKNPDDPEAEEKFKEINEAYEVLS 54 (55)
T ss_pred HHHHHHHHHHCcCCCCCcHHHHHHHHHHHHHHHHhc
Confidence 4456666777887653 345556666666554
No 131
>cd09071 FAR_C C-terminal domain of fatty acyl CoA reductases. C-terminal domain of fatty acyl CoA reductases, a family of SDR-like proteins. SDRs or short-chain dehydrogenases/reductases are Rossmann-fold NAD(P)H-binding proteins. Many proteins in this FAR_C family may function as fatty acyl-CoA reductases (FARs), acting on medium and long chain fatty acids, and have been reported to be involved in diverse processes such as the biosynthesis of insect pheromones, plant cuticular wax production, and mammalian wax biosynthesis. In Arabidopsis thaliana, proteins with this particular architecture have also been identified as the MALE STERILITY 2 (MS2) gene product, which is implicated in male gametogenesis. Mutations in MS2 inhibit the synthesis of exine (sporopollenin), rendering plants unable to reduce pollen wall fatty acids to corresponding alcohols. The function of this C-terminal domain is unclear.
Probab=31.43 E-value=33 Score=16.66 Aligned_cols=15 Identities=20% Similarity=0.720 Sum_probs=10.8
Q ss_pred HHHhcCCCHHHHHHH
Q psy5907 31 GQMWRDLPEDQKTEY 45 (56)
Q Consensus 31 ~~~W~~l~~~eK~~y 45 (56)
-..|+.|++++|+.|
T Consensus 53 ~~L~~~l~~~Dr~~F 67 (92)
T cd09071 53 RALWERLSEEDRELF 67 (92)
T ss_pred HHHHHHCCHHHHHhC
Confidence 345788888888755
No 132
>PF04363 DUF496: Protein of unknown function (DUF496); InterPro: IPR007458 Members of this family are uncharacterised proteins.
Probab=31.26 E-value=71 Score=16.47 Aligned_cols=17 Identities=29% Similarity=0.384 Sum_probs=15.0
Q ss_pred CCCCHHHHHHHHHHHhc
Q psy5907 19 LDLKLWEIGKIIGQMWR 35 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~ 35 (56)
|+++..+|..+|..|-.
T Consensus 47 ~~Ms~edi~~II~nMr~ 63 (95)
T PF04363_consen 47 PDMSIEDIRAIIENMRS 63 (95)
T ss_pred CCCCHHHHHHHHHHHHh
Confidence 99999999999988754
No 133
>cd03585 NTR_TIMP NTR domain, TIMP subfamily; TIMPs, or tissue inibitors of metalloproteases, are essential regulators of extracellular matrix turnover and remodeling. They form complexes with matrix metalloproteases (MMPs) and inactivate them irreversibly by non-covalently binding their active zinc-binding sites. The levels of activated membrane-type MMPs, MMPs, and free TIMPs determine the balance between matrix degradation and matrix formation or stabilization. Consequently, TIMPs play roles in processes that require the remodeling and degradation of connective tissue, such as development, morphogenesis, wound healing, as well as in various diseases and pathological states such as tumor cell metastasis, arthritis, and artherosclerosis. Most TIMPs bind to a variety of MMPs. TIMP-1 and TIMP-2 appear to be multifunctional proteins with diverse biological action. They may exhibit growth factor-like activity and can inhibit angiogenesis. TIMP-3 has been implicated in apoptosis.
Probab=31.20 E-value=28 Score=20.08 Aligned_cols=18 Identities=28% Similarity=0.486 Sum_probs=15.4
Q ss_pred HhcCCCHHHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVEDYE 50 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~a~ 50 (56)
.|.+|+..+|.-+...+.
T Consensus 106 pW~~lT~~Qr~gl~~~Y~ 123 (183)
T cd03585 106 PWDSLSLTQKKGLNHRYQ 123 (183)
T ss_pred ehHhCCHHHHhhhhHhhc
Confidence 599999999998887764
No 134
>PRK05423 hypothetical protein; Provisional
Probab=31.05 E-value=87 Score=16.38 Aligned_cols=18 Identities=33% Similarity=0.357 Sum_probs=15.4
Q ss_pred CCCCHHHHHHHHHHHhcC
Q psy5907 19 LDLKLWEIGKIIGQMWRD 36 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~ 36 (56)
|+++..+|..+|..|-..
T Consensus 54 ~~Ms~e~i~~II~nMr~D 71 (104)
T PRK05423 54 PGMSIEEIQGIIANMKSD 71 (104)
T ss_pred CCCCHHHHHHHHHHHHhh
Confidence 899999999999888543
No 135
>PRK14277 chaperone protein DnaJ; Provisional
Probab=31.01 E-value=1.4e+02 Score=18.85 Aligned_cols=39 Identities=23% Similarity=0.176 Sum_probs=27.3
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+. .+.-+.|.+.+..|++.+|. .|.
T Consensus 24 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD 68 (386)
T PRK14277 24 KAYRRLAKKYHPDLNPGDKEAEQKFKEINEAYEILSDPQKRAQYD 68 (386)
T ss_pred HHHHHHHHHHCCCcCCCchHHHHHHHHHHHHHHHhCCHHHHHHHH
Confidence 4566666777888753 45677899999999866654 444
No 136
>PF06424 PRP1_N: PRP1 splicing factor, N-terminal; InterPro: IPR010491 This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1 []. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005634 nucleus
Probab=30.96 E-value=48 Score=18.15 Aligned_cols=26 Identities=15% Similarity=0.599 Sum_probs=16.1
Q ss_pred HHHhCCCCC--HHHHHHHHH----HHhcCCCH
Q psy5907 14 VKAQNLDLK--LWEIGKIIG----QMWRDLPE 39 (56)
Q Consensus 14 ~~~~~p~~~--~~ei~k~l~----~~W~~l~~ 39 (56)
...++|.++ |+++-+.|+ +.|-+||+
T Consensus 95 ~~~~~pkI~~QFaDLKR~La~VS~eeW~~IPE 126 (133)
T PF06424_consen 95 YRKENPKIQQQFADLKRSLATVSEEEWENIPE 126 (133)
T ss_pred hhccCchHHHHHHHHHHHHccCCHHHHhcCCc
Confidence 334456554 666666654 67888876
No 137
>PHA02662 ORF131 putative membrane protein; Provisional
Probab=30.87 E-value=55 Score=19.59 Aligned_cols=28 Identities=11% Similarity=0.133 Sum_probs=23.7
Q ss_pred CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907 18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY 45 (56)
Q Consensus 18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y 45 (56)
++..++.-+.+.+.+--..||+++|..-
T Consensus 71 na~~sf~lll~Al~Et~~~Lp~~qK~~i 98 (226)
T PHA02662 71 DAADALALASAALAETLAELPRADRLAV 98 (226)
T ss_pred CHHHHHHHHHHHHHHHHHhCCHHHHHHH
Confidence 4566788899999999999999999763
No 138
>PF10675 DUF2489: Protein of unknown function (DUF2489); InterPro: IPR019617 This entry represents bacterial uncharacterised proteins.
Probab=30.51 E-value=76 Score=17.14 Aligned_cols=19 Identities=11% Similarity=0.267 Sum_probs=15.4
Q ss_pred HHHHhcCCCHHHHHHHHHH
Q psy5907 30 IGQMWRDLPEDQKTEYVED 48 (56)
Q Consensus 30 l~~~W~~l~~~eK~~y~~~ 48 (56)
+++.|+.|+..++..++..
T Consensus 94 theaRk~L~k~ermk~D~e 112 (131)
T PF10675_consen 94 THEARKALPKKERMKLDLE 112 (131)
T ss_pred cHHHHHcCCHHHHHHHhHH
Confidence 4677999999999887744
No 139
>cd08319 Death_RAIDD Death domain of RIP-associated ICH-1 homologous protein with a death domain. Death domain (DD) of RAIDD (RIP-associated ICH-1 homologous protein with a death domain), also known as CRADD (Caspase and RIP adaptor). RAIDD is an adaptor protein that together with the p53-inducible protein PIDD and caspase-2, forms the PIDDosome complex, which is required for caspase-2 activation and plays a role in mediating stress-induced apoptosis. RAIDD contains an N-terminal Caspase Activation and Recruitment Domain (CARD), which interacts with the caspase-2 CARD, and a C-terminal DD, which interacts with the DD of PIDD. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD, DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other pr
Probab=30.12 E-value=24 Score=17.56 Aligned_cols=16 Identities=19% Similarity=0.557 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhcCCC
Q psy5907 23 LWEIGKIIGQMWRDLP 38 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~ 38 (56)
...|+..||..|+.|.
T Consensus 5 L~~la~~LG~~W~~La 20 (83)
T cd08319 5 LNQLAQRLGPEWEQVL 20 (83)
T ss_pred HHHHHHHHhhhHHHHH
Confidence 4568888999998764
No 140
>cd08306 Death_FADD Fas-associated Death Domain protein-protein interaction domain. Death domain (DD) found in FAS-associated via death domain (FADD). FADD is a component of the death-inducing signaling complex (DISC) and serves as an adaptor in the signaling pathway of death receptor proteins. It modulates apoptosis as well as non-apoptotic processes such as cell cycle progression, survival, innate immune signaling, and hematopoiesis. FADD contains an N-terminal DED and a C-terminal DD. Its DD interacts with the DD of the activated death receptor, FAS, and its DED recruits the initiator caspases, caspase-8 and -10, to the DISC complex via a homotypic interaction with the N-terminal DED of the caspase. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain),
Probab=30.11 E-value=27 Score=17.28 Aligned_cols=17 Identities=29% Similarity=0.526 Sum_probs=12.0
Q ss_pred CHHHHHHHHHHHhcCCC
Q psy5907 22 KLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 22 ~~~ei~k~l~~~W~~l~ 38 (56)
.+.-|+..+|..|+.+.
T Consensus 4 ~f~~i~~~lG~~Wk~la 20 (86)
T cd08306 4 AFDVICENVGRDWRKLA 20 (86)
T ss_pred HHHHHHHHHhhhHHHHH
Confidence 35667788888887543
No 141
>COG4282 SMI1 Protein involved in beta-1,3-glucan synthesis [Carbohydrate transport and metabolism]
Probab=29.21 E-value=1.2e+02 Score=17.55 Aligned_cols=29 Identities=17% Similarity=0.214 Sum_probs=25.6
Q ss_pred CCCCCHHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907 18 NLDLKLWEIGKIIGQMWRDLPEDQKTEYV 46 (56)
Q Consensus 18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y~ 46 (56)
+|+++..+|.+.=.++-..+|++-|+.|.
T Consensus 30 ~pgat~~di~~aE~dlg~tlPpdvResl~ 58 (191)
T COG4282 30 RPGATCGDIQRAEADLGRTLPPDVRESLA 58 (191)
T ss_pred CCCccHHHHHHHHHHhcCcCChHHHHHHH
Confidence 37888999999999999999999998764
No 142
>PF12926 MOZART2: Mitotic-spindle organizing gamma-tubulin ring associated; InterPro: IPR024332 The MOZART2 family of proteins (also known as FAM128 and Mitotic-spindle organizing protein 2) operate as part of the gamma-tubulin ring complex, gamma-TuRC, one of the complexes necessary for chromosome segregation. This complex is located at centrosomes and mediates the formation of bipolar spindles in mitosis; it consists of six subunits. However, unlike the other four known subunits, the MOZART proteins, both 1 and 2, do not carry the conserved 'Spc97-Spc98' GCP domain, so the TUBGCP nomenclature cannot be used for it. The exact function of MOZART2 is not clear [].
Probab=28.81 E-value=91 Score=15.94 Aligned_cols=28 Identities=11% Similarity=-0.074 Sum_probs=22.2
Q ss_pred CCCCHHHHHHHHHHHhcCCCHHHHHHHH
Q psy5907 19 LDLKLWEIGKIIGQMWRDLPEDQKTEYV 46 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~l~~~eK~~y~ 46 (56)
|+....++.+.--..-+.|+++|.+-|+
T Consensus 5 ~~~~~~~~~k~~~~rk~~Ls~eE~EL~E 32 (88)
T PF12926_consen 5 PGSPTAQVYKYSLRRKKVLSAEEVELYE 32 (88)
T ss_pred CCCChHHHHHHHHHHHhccCHHHHHHHH
Confidence 5556677777777888999999998775
No 143
>PF07757 AdoMet_MTase: Predicted AdoMet-dependent methyltransferase; InterPro: IPR011671 tRNA (uracil-O(2)-)-methyltransferase catalyses the formation of O(2)-methyl-uracil at position 44 (m2U44) in tRNA(Ser) [].; GO: 0008168 methyltransferase activity
Probab=28.62 E-value=70 Score=17.09 Aligned_cols=25 Identities=12% Similarity=0.217 Sum_probs=19.5
Q ss_pred HHHHHHHhcCCCHHHHHHHHHHHHH
Q psy5907 27 GKIIGQMWRDLPEDQKTEYVEDYEQ 51 (56)
Q Consensus 27 ~k~l~~~W~~l~~~eK~~y~~~a~~ 51 (56)
++.|-+.|..-++-.|..|++.+=+
T Consensus 19 a~~lv~~W~E~TdP~K~VfEDlaIA 43 (112)
T PF07757_consen 19 ARWLVDNWPESTDPQKHVFEDLAIA 43 (112)
T ss_pred HHHHHHhCcccCCchhhHHHHHHHH
Confidence 4566778998888889999987643
No 144
>PF11460 DUF3007: Protein of unknown function (DUF3007); InterPro: IPR021562 This is a family of uncharacterised proteins found in bacteria and eukaryotes.
Probab=28.30 E-value=1e+02 Score=16.27 Aligned_cols=20 Identities=5% Similarity=0.146 Sum_probs=14.1
Q ss_pred HHHHHhcCCCHHHHHHHHHH
Q psy5907 29 IIGQMWRDLPEDQKTEYVED 48 (56)
Q Consensus 29 ~l~~~W~~l~~~eK~~y~~~ 48 (56)
.+.++..+||++|.+.-...
T Consensus 82 ~lqkRle~l~~eE~~~L~~e 101 (104)
T PF11460_consen 82 ELQKRLEELSPEELEALQAE 101 (104)
T ss_pred HHHHHHHhCCHHHHHHHHHH
Confidence 34557788999988765543
No 145
>PRK13965 ribonucleotide-diphosphate reductase subunit beta; Provisional
Probab=28.16 E-value=61 Score=20.12 Aligned_cols=17 Identities=29% Similarity=0.579 Sum_probs=14.0
Q ss_pred HhcCCCHHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVEDY 49 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~a 49 (56)
.|+.||++||..|...-
T Consensus 60 dw~~Lt~~Er~~~~~~l 76 (335)
T PRK13965 60 SWRSLGEDWQQLITRTF 76 (335)
T ss_pred HHHhCCHHHHHHHHHHH
Confidence 38999999999987643
No 146
>PRK14293 chaperone protein DnaJ; Provisional
Probab=27.91 E-value=1.6e+02 Score=18.50 Aligned_cols=41 Identities=15% Similarity=0.181 Sum_probs=28.3
Q ss_pred HHHHHHHHHHhCCCCC----HHHHHHHHHHHhcCCCHHH-HHHHHH
Q psy5907 7 SRKVWDQVKAQNLDLK----LWEIGKIIGQMWRDLPEDQ-KTEYVE 47 (56)
Q Consensus 7 ~~~~~~~~~~~~p~~~----~~ei~k~l~~~W~~l~~~e-K~~y~~ 47 (56)
.+.++...+.-||+.+ ..+.-+.|.+.|..|++.. |..|..
T Consensus 21 k~ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~vL~~~~~R~~yd~ 66 (374)
T PRK14293 21 KRAYRRLARKYHPDVNKEPGAEDRFKEINRAYEVLSDPETRARYDQ 66 (374)
T ss_pred HHHHHHHHHHHCCCCCCCcCHHHHHHHHHHHHHHHhchHHHHHHhh
Confidence 3455666667788754 5566778999999888666 555553
No 147
>PF13404 HTH_AsnC-type: AsnC-type helix-turn-helix domain; PDB: 2ZNY_E 2ZNZ_G 1RI7_A 2CYY_A 2E1C_A 2VC1_B 2QZ8_A 2W29_C 2IVM_B 2VBX_B ....
Probab=27.87 E-value=63 Score=13.78 Aligned_cols=16 Identities=13% Similarity=0.171 Sum_probs=10.1
Q ss_pred HhCCCCCHHHHHHHHH
Q psy5907 16 AQNLDLKLWEIGKIIG 31 (56)
Q Consensus 16 ~~~p~~~~~ei~k~l~ 31 (56)
..++..++.+|++.+|
T Consensus 13 q~d~r~s~~~la~~lg 28 (42)
T PF13404_consen 13 QEDGRRSYAELAEELG 28 (42)
T ss_dssp HH-TTS-HHHHHHHHT
T ss_pred HHcCCccHHHHHHHHC
Confidence 4457778888887765
No 148
>PF04255 DUF433: Protein of unknown function (DUF433); InterPro: IPR007367 This is a family of uncharacterised proteins.; PDB: 2GA1_B.
Probab=27.86 E-value=71 Score=14.38 Aligned_cols=19 Identities=16% Similarity=0.281 Sum_probs=11.3
Q ss_pred HHHHHhCCCCCHHHHHHHH
Q psy5907 12 DQVKAQNLDLKLWEIGKII 30 (56)
Q Consensus 12 ~~~~~~~p~~~~~ei~k~l 30 (56)
+.+...+|+++..+|...+
T Consensus 35 eeI~~~yp~Lt~~~i~aAl 53 (56)
T PF04255_consen 35 EEIAEDYPSLTLEDIRAAL 53 (56)
T ss_dssp HHHHHHSTT--HHHHHHHH
T ss_pred HHHHHHCCCCCHHHHHHHH
Confidence 4566677878777776554
No 149
>PF14647 FAM91_N: FAM91 N-terminus
Probab=27.84 E-value=1.2e+02 Score=19.00 Aligned_cols=31 Identities=13% Similarity=0.326 Sum_probs=23.3
Q ss_pred ChHHhHHHHHHHHHHHhC-----CCCCHHHHHHHHH
Q psy5907 1 MPYMRYSRKVWDQVKAQN-----LDLKLWEIGKIIG 31 (56)
Q Consensus 1 ~ay~lf~~~~~~~~~~~~-----p~~~~~ei~k~l~ 31 (56)
+||-.|+.-....++.+. |+-+.+|.-+.+|
T Consensus 87 TPF~YY~~ml~~lm~~ekSYDsLPNFTAaD~LRllG 122 (308)
T PF14647_consen 87 TPFSYYIDMLAELMKSEKSYDSLPNFTAADCLRLLG 122 (308)
T ss_pred CcHHHHHHHHHHHHHcCCCcccCCCCcHHHHHHHhc
Confidence 467777777777777554 8889999888776
No 150
>TIGR00647 MG103 conserved hypothetical protein.
Probab=27.73 E-value=74 Score=19.53 Aligned_cols=20 Identities=20% Similarity=0.217 Sum_probs=16.8
Q ss_pred HHHHHhCCCCCHHHHHHHHH
Q psy5907 12 DQVKAQNLDLKLWEIGKIIG 31 (56)
Q Consensus 12 ~~~~~~~p~~~~~ei~k~l~ 31 (56)
..++.+||+++-.|+...+.
T Consensus 236 a~lRl~~Pd~SL~ELgell~ 255 (279)
T TIGR00647 236 CLLKIDHPDWSLEQIAEFFA 255 (279)
T ss_pred HHHHHhCcccCHHHHHHHhc
Confidence 35788999999999988874
No 151
>TIGR03092 SASP_sspI small, acid-soluble spore protein I. This protein family is restricted to a subset of endospore-forming bacteria such as Bacillus subtilis, all of which are in the Firmicutes (low-GC Gram-positive) lineage. It is a minor SASP (small, acid-soluble spore protein) designated SspI. The gene in Bacillus subtilis previously was designated ysfA.
Probab=27.57 E-value=85 Score=15.15 Aligned_cols=18 Identities=28% Similarity=0.647 Sum_probs=13.9
Q ss_pred HHHhcCCCHHHHHHHHHH
Q psy5907 31 GQMWRDLPEDQKTEYVED 48 (56)
Q Consensus 31 ~~~W~~l~~~eK~~y~~~ 48 (56)
-..|+.+++.+|....+.
T Consensus 42 E~~W~~~~~~ek~~m~~~ 59 (65)
T TIGR03092 42 EAIWKHANEQEKDEMLET 59 (65)
T ss_pred HHHHHhcCHHHHHHHHHH
Confidence 346999999999876543
No 152
>PF11625 DUF3253: Protein of unknown function (DUF3253); InterPro: IPR021660 This bacterial family of proteins has no known function. ; PDB: 2NS0_A.
Probab=27.51 E-value=94 Score=15.66 Aligned_cols=38 Identities=18% Similarity=0.246 Sum_probs=23.8
Q ss_pred HHHHhCCC--CCHHHHHHHHHH-HhcCCCHHHHHHHHHHHH
Q psy5907 13 QVKAQNLD--LKLWEIGKIIGQ-MWRDLPEDQKTEYVEDYE 50 (56)
Q Consensus 13 ~~~~~~p~--~~~~ei~k~l~~-~W~~l~~~eK~~y~~~a~ 50 (56)
.+....|+ +.++|+++.++- .|+.+-+..+.-=..++.
T Consensus 15 ll~~R~~~ktiCPSevARal~~~~WR~lm~~vR~~A~~L~~ 55 (83)
T PF11625_consen 15 LLAARGPGKTICPSEVARALGPDDWRDLMPPVRAAARRLAR 55 (83)
T ss_dssp HHHHS-TT--B-HHHHHHHH-TTS-GGGHHHHHHHHHHHHH
T ss_pred HHHhcCCCCccCHHHHHHHHCchhhHHHHHHHHHHHHHHHH
Confidence 34444455 458999999984 499998888876555554
No 153
>PLN02492 ribonucleoside-diphosphate reductase
Probab=27.36 E-value=63 Score=19.81 Aligned_cols=17 Identities=18% Similarity=0.415 Sum_probs=14.1
Q ss_pred HHhcCCCHHHHHHHHHH
Q psy5907 32 QMWRDLPEDQKTEYVED 48 (56)
Q Consensus 32 ~~W~~l~~~eK~~y~~~ 48 (56)
..|+.||++||..|...
T Consensus 45 ~dw~~Lt~~Er~~~~~i 61 (324)
T PLN02492 45 KDWEKLTDDERHFISHV 61 (324)
T ss_pred HHHHhCCHHHHHHHHHH
Confidence 34999999999988754
No 154
>PF07800 DUF1644: Protein of unknown function (DUF1644); InterPro: IPR012866 This family consists of sequences found in a number of hypothetical plant proteins of unknown function. The region of interest contains nine highly conserved cysteine residues and is approximately 160 amino acids in length, which probably represent a zinc-binding domain.
Probab=27.21 E-value=30 Score=19.60 Aligned_cols=30 Identities=10% Similarity=0.241 Sum_probs=24.1
Q ss_pred HHHHHHHHHhCCCCCHHHHHHHHHHHhcCC
Q psy5907 8 RKVWDQVKAQNLDLKLWEIGKIIGQMWRDL 37 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l 37 (56)
.+.+.-++.+||...+.+|--.....|+.|
T Consensus 122 ~eLrKHar~~HP~~rP~~vDP~rq~~W~~l 151 (162)
T PF07800_consen 122 SELRKHARSEHPSARPSEVDPQRQRDWERL 151 (162)
T ss_pred HHHHHHHHhhCCCCCCccCCHHHHHHHHHH
Confidence 456677888999999999888888888765
No 155
>PF00887 ACBP: Acyl CoA binding protein; InterPro: IPR000582 Acyl-CoA-binding protein (ACBP) is a small (10 Kd) protein that binds medium- and long-chain acyl-CoA esters with very high affinity and may function as an intracellular carrier of acyl-CoA esters []. ACBP is also known as diazepam binding inhibitor (DBI) or endozepine (EP) because of its ability to displace diazepam from the benzodiazepine (BZD) recognition site located on the GABA type A receptor. It is therefore possible that this protein also acts as a neuropeptide to modulate the action of the GABA receptor []. ACBP is a highly conserved protein of about 90 residues that is found in all four eukaryotic kingdoms, Animalia, Plantae, Fungi and Protista, and in some eubacterial species []. Although ACBP occurs as a completely independent protein, intact ACB domains have been identified in a number of large, multifunctional proteins in a variety of eukaryotic species. These include large membrane-associated proteins with N-terminal ACB domains, multifunctional enzymes with both ACB and peroxisomal enoyl-CoA Delta(3), Delta(2)-enoyl-CoA isomerase domains, and proteins with both an ACB domain and ankyrin repeats (IPR002110 from INTERPRO) []. The ACB domain consists of four alpha-helices arranged in a bowl shape with a highly exposed acyl-CoA-binding site. The ligand is bound through specific interactions with residues on the protein, most notably several conserved positive charges that interact with the phosphate group on the adenosine-3'phosphate moiety, and the acyl chain is sandwiched between the hydrophobic surfaces of CoA and the protein []. Other proteins containing an ACB domain include: Endozepine-like peptide (ELP) (gene DBIL5) from mouse []. ELP is a testis-specific ACBP homologue that may be involved in the energy metabolism of the mature sperm. MA-DBI, a transmembrane protein of unknown function which has been found in mammals. MA-DBI contains a N-terminal ACB domain. DRS-1 [], a human protein of unknown function that contains a N-terminal ACB domain and a C-terminal enoyl-CoA isomerase/hydratase domain. ; GO: 0000062 fatty-acyl-CoA binding; PDB: 2CB8_A 2FJ9_A 2LBB_A 1ST7_A 3EPY_B 2FDQ_C 1NTI_A 1HB8_A 1ACA_A 1NVL_A ....
Probab=27.02 E-value=89 Score=15.24 Aligned_cols=49 Identities=20% Similarity=0.300 Sum_probs=27.9
Q ss_pred HHhHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCC----HHHHHHHHHHHHHh
Q psy5907 3 YMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP----EDQKTEYVEDYEQE 52 (56)
Q Consensus 3 y~lf~~~~~~~~~~~~p~~~~~ei~k~l~~~W~~l~----~~eK~~y~~~a~~~ 52 (56)
|-+|.+...+.+....|+.. .-+.+..-+.|+.+. ++-+..|++...+.
T Consensus 30 YalyKQAt~Gd~~~~~P~~~-d~~~~~K~~AW~~l~gms~~eA~~~Yi~~v~~~ 82 (87)
T PF00887_consen 30 YALYKQATHGDCDTPRPGFF-DIEGRAKWDAWKALKGMSKEEAMREYIELVEEL 82 (87)
T ss_dssp HHHHHHHHTSS--S-CTTTT-CHHHHHHHHHHHTTTTTHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhCCCcCCCCcch-hHHHHHHHHHHHHccCCCHHHHHHHHHHHHHHH
Confidence 45666666655555556533 334455567897654 55677788777653
No 156
>PRK06946 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.99 E-value=1.3e+02 Score=18.05 Aligned_cols=26 Identities=12% Similarity=0.012 Sum_probs=20.4
Q ss_pred HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.+..-.|+++..|+.+++.+.|.++.
T Consensus 49 NL~~~fPe~s~~e~~~i~r~~~~~~~ 74 (293)
T PRK06946 49 NLKLCFPDWSDARREELARRHFRHVI 74 (293)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence 34444699999999999988887665
No 157
>PRK07920 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.97 E-value=85 Score=18.82 Aligned_cols=26 Identities=8% Similarity=0.037 Sum_probs=20.0
Q ss_pred HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.+..-.|+++..|+.+++.+.|.++.
T Consensus 44 NL~~~fPe~s~~e~~~i~~~~~~~~~ 69 (298)
T PRK07920 44 NLARVVPDAPAAVLDALVRAAMRSYA 69 (298)
T ss_pred HHHHhcCCCCHHHHHHHHHHHHHHHH
Confidence 44445699999999999888887654
No 158
>TIGR02208 lipid_A_msbB lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase. This family consists of MsbB in E. coli and closely related proteins in other species. MsbB is homologous to HtrB (TIGR02207) and acts immediately after it in the biosynthesis of KDO-2 lipid A (also called Re LPS and Re endotoxin). These two enzymes act after creation of KDO-2 lipid IV-A by addition of the KDO sugars.
Probab=26.94 E-value=1.3e+02 Score=18.13 Aligned_cols=26 Identities=15% Similarity=0.026 Sum_probs=20.4
Q ss_pred HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.+..-.|+++..|+.+++.+.|.++.
T Consensus 60 NL~~~fP~~s~~e~~~i~~~~~~~~g 85 (305)
T TIGR02208 60 NLSACFPEKSEAERETIIDNNFATFV 85 (305)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence 34445699999999999999887664
No 159
>PF14374 Ribos_L4_asso_C: 60S ribosomal protein L4 C-terminal domain; PDB: 3O5H_D 3IZS_D 3O58_D 3IZR_D 4A17_C 4A1C_C 4A1E_C 4A1A_C.
Probab=26.83 E-value=33 Score=17.12 Aligned_cols=14 Identities=14% Similarity=0.280 Sum_probs=7.3
Q ss_pred CCCCHHHHHHHHHH
Q psy5907 19 LDLKLWEIGKIIGQ 32 (56)
Q Consensus 19 p~~~~~ei~k~l~~ 32 (56)
|-++++|++++|.+
T Consensus 8 ~~MtN~Dl~RiInS 21 (80)
T PF14374_consen 8 PKMTNADLSRIINS 21 (80)
T ss_dssp -S-S---HHHHHHS
T ss_pred chhccccHHHHhcc
Confidence 67889999999863
No 160
>PF11961 DUF3475: Domain of unknown function (DUF3475); InterPro: IPR021864 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 60 amino acids in length. This domain is found associated with PF05003 from PFAM.
Probab=26.79 E-value=82 Score=14.75 Aligned_cols=21 Identities=10% Similarity=0.343 Sum_probs=13.7
Q ss_pred HHHHHHHHhcCCCHHHHHHHH
Q psy5907 26 IGKIIGQMWRDLPEDQKTEYV 46 (56)
Q Consensus 26 i~k~l~~~W~~l~~~eK~~y~ 46 (56)
..-.+...|++|++++=....
T Consensus 9 ~msk~~~L~~SLsd~~i~~Lr 29 (57)
T PF11961_consen 9 TMSKLVNLWQSLSDEEIARLR 29 (57)
T ss_pred HHHHHHHHHHHcChHHHHHHH
Confidence 334456789999988654443
No 161
>PRK13967 nrdF1 ribonucleotide-diphosphate reductase subunit beta; Provisional
Probab=26.72 E-value=67 Score=19.80 Aligned_cols=17 Identities=12% Similarity=0.507 Sum_probs=13.7
Q ss_pred HhcCCCHHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVEDY 49 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~a 49 (56)
.|+.|+++||..|....
T Consensus 47 dw~~Lt~~Er~~i~~~l 63 (322)
T PRK13967 47 SWQTLSSTEQQTTIRVF 63 (322)
T ss_pred HHHhCCHHHHHHHHHHH
Confidence 49999999999877543
No 162
>PRK05645 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.67 E-value=95 Score=18.53 Aligned_cols=26 Identities=23% Similarity=0.276 Sum_probs=20.2
Q ss_pred HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.+..-.|+++..|+.+++.+.|.++.
T Consensus 49 NL~~~fP~~s~~e~~~i~~~~~~~~~ 74 (295)
T PRK05645 49 NLSKCFPELSPAELEKLVGQSLMDIG 74 (295)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence 34445699999999999888887654
No 163
>PRK08734 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=26.47 E-value=96 Score=18.72 Aligned_cols=26 Identities=12% Similarity=0.150 Sum_probs=20.2
Q ss_pred HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.+..-.|+++..++.+++.+.|.++.
T Consensus 50 NL~~~fP~~s~~e~~~i~~~~~~~~g 75 (305)
T PRK08734 50 NLELAYPELSPQQRAQLHAQILRSTA 75 (305)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence 44455699999999999888887654
No 164
>PF12550 GCR1_C: Transcriptional activator of glycolytic enzymes; InterPro: IPR022210 This domain family is found in eukaryotes, and is approximately 80 amino acids in length. This family is activates the transcription of glycolytic enzymes.
Probab=26.44 E-value=81 Score=15.31 Aligned_cols=22 Identities=18% Similarity=0.341 Sum_probs=17.0
Q ss_pred CCCCHHHHHHHHHHHhcCCCHH
Q psy5907 19 LDLKLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~l~~~ 40 (56)
.+.+..++.+.-|..|+.-+.+
T Consensus 24 g~psI~~le~~yG~~WR~~~~~ 45 (81)
T PF12550_consen 24 GQPSIRSLEKKYGSKWRRDSKE 45 (81)
T ss_pred CCCCHHHHHHHhChhhccCccc
Confidence 3567899999999999974443
No 165
>PF13867 SAP30_Sin3_bdg: Sin3 binding region of histone deacetylase complex subunit SAP30; PDB: 2LD7_A.
Probab=26.44 E-value=77 Score=14.28 Aligned_cols=26 Identities=8% Similarity=0.130 Sum_probs=17.6
Q ss_pred CCCCHHHHHHHHHHHhcCCCHHHHHH
Q psy5907 19 LDLKLWEIGKIIGQMWRDLPEDQKTE 44 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~l~~~eK~~ 44 (56)
|+.+-.+++..+..-+.+++-+|..-
T Consensus 19 ~~~sK~qLa~~V~kHF~s~~v~E~ev 44 (53)
T PF13867_consen 19 PRSSKEQLANAVRKHFNSQPVDENEV 44 (53)
T ss_dssp SS--HHHHHHHHHHHHTT----HHHH
T ss_pred CCCCHHHHHHHHHHHHhcCCCCHHHH
Confidence 68899999999999999999887754
No 166
>PRK10767 chaperone protein DnaJ; Provisional
Probab=26.38 E-value=1.7e+02 Score=18.30 Aligned_cols=39 Identities=21% Similarity=0.095 Sum_probs=27.0
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHH-HHHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQK-TEYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK-~~y~ 46 (56)
+.+|...+.-||+... .+.-+.|.+.|..|++.++ ..|.
T Consensus 23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~~L~d~~~r~~yd 67 (371)
T PRK10767 23 KAYRKLAMKYHPDRNPGDKEAEEKFKEIKEAYEVLSDPQKRAAYD 67 (371)
T ss_pred HHHHHHHHHHCCCCCCCcHHHHHHHHHHHHHHHHhcchhhhhHhh
Confidence 4566666777898752 3555789999999987665 4444
No 167
>smart00509 TFS2N Domain in the N-terminus of transcription elongation factor S-II (and elsewhere).
Probab=26.21 E-value=47 Score=16.01 Aligned_cols=21 Identities=14% Similarity=0.442 Sum_probs=15.4
Q ss_pred hCCCCCHHHHHHHHHHHhcCC
Q psy5907 17 QNLDLKLWEIGKIIGQMWRDL 37 (56)
Q Consensus 17 ~~p~~~~~ei~k~l~~~W~~l 37 (56)
.|++.....+++.|-+.|+.+
T Consensus 52 kh~~~~I~~~A~~Li~~WK~~ 72 (75)
T smart00509 52 KHKNEEIRKLAKKLIKSWKKL 72 (75)
T ss_pred cCCcHHHHHHHHHHHHHHHHH
Confidence 356666778888888888754
No 168
>PF01527 HTH_Tnp_1: Transposase; InterPro: IPR002514 Transposase proteins are necessary for efficient DNA transposition. This family consists of various Escherichia coli insertion elements and other bacterial transposases some of which are members of the IS3 family. This region includes a helix-turn-helix motif (HTH) at the N terminus followed by a leucine zipper (LZ) motif. The LZ motif has been shown to mediate oligomerisation of the transposase components in IS911 []. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated; PDB: 2JN6_A 2RN7_A.
Probab=26.16 E-value=35 Score=15.80 Aligned_cols=25 Identities=16% Similarity=0.056 Sum_probs=13.2
Q ss_pred HHHHHHHHHHHhC--CCCCHHHHHHHH
Q psy5907 6 YSRKVWDQVKAQN--LDLKLWEIGKII 30 (56)
Q Consensus 6 f~~~~~~~~~~~~--p~~~~~ei~k~l 30 (56)
|+.+.+..+.... ++.+..+|++..
T Consensus 7 ys~e~K~~~v~~~~~~g~sv~~va~~~ 33 (76)
T PF01527_consen 7 YSPEFKLQAVREYLESGESVSEVAREY 33 (76)
T ss_dssp --HHHHHHHHHHHHHHHCHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHCCCceEeeeccc
Confidence 4555555554444 566667766654
No 169
>PF01047 MarR: MarR family; InterPro: IPR000835 The MarR-type HTH domain is a DNA-binding, winged helix-turn-helix (wHTH) domain of about 135 amino acids present in transcription regulators of the MarR/SlyA family, involved in the development of antibiotic resistance. This family of transcription regulators is named after Escherichia coli MarR, a repressor of genes which activate the multiple antibiotic resistance and oxidative stress regulons, and after slyA from Salmonella typhimurium and E. coli, a transcription regulator that is required for virulence and survival in the macrophage environment. Regulators with the MarR-type HTH domain are present in bacteria and archaea and control a variety of biological functions, including resistance to multiple antibiotics, household disinfectants, organic solvents, oxidative stress agents and regulation of the virulence factor synthesis in pathogens of humans and plants. Many of the MarR-like regulators respond to aromatic compounds [, , ]. The crystal structures of MarR, MexR and SlyA have been determined and show a winged HTH DNA-binding core flanked by helices involved in dimerisation. The DNA-binding domains are ascribed to the superfamily of winged helix proteins, containing a three (four)-helix (H) bundle and a three-stranded antiparallel beta-sheet (B) in the topology: H1-(H1')-H2-B1-H3-H4-B2-B3-H5-H6. Helices 3 and 4 comprise the helix-turn-helix motif and the beta-sheet is called the wing. Helix 4 is termed the recognition helix, like in other HTHs where it binds the DNA major groove. The helices 1, 5 and 6 are involved in dimerisation, as most MarR-like transcription regulators form dimers [, ]. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1JGS_A 2NYX_D 2PEX_B 2PFB_A 3BPX_A 3BPV_A 2BV6_A 3BJA_A 3E6M_B 2ETH_A ....
Probab=26.09 E-value=37 Score=14.91 Aligned_cols=16 Identities=19% Similarity=0.260 Sum_probs=11.3
Q ss_pred HhCCCCCHHHHHHHHH
Q psy5907 16 AQNLDLKLWEIGKIIG 31 (56)
Q Consensus 16 ~~~p~~~~~ei~k~l~ 31 (56)
.++|+++..++++.++
T Consensus 13 ~~~~~~~~~~la~~~~ 28 (59)
T PF01047_consen 13 YENGGITQSELAEKLG 28 (59)
T ss_dssp HHHSSEEHHHHHHHHT
T ss_pred HHcCCCCHHHHHHHHC
Confidence 3467788888877654
No 170
>cd01776 Rin1_RA Ubiquitin domain of RIN1 RAS effector. Rin1_RA RIN1 is a RAS effector that binds with specificity and high affinity to activated RAS via its carboxy-terminal RA (RAS-associated) domain. RIN1 competes directly with RAF1 for RAS binding and is thought to divert signaling away from RAF and the MAPK pathway while also shunting RAS signals through alternate pathways. In addition, Rin1 and Rin2 are Rab5-binding proteins, binding preferentially to the GTP-bound form, that enhance the GDP-GTP exchange reaction on Rab5 that regulate the docking and fusion processes of endocytic vesicles. In addition to the RA domain, RIN1 and RIN2 have an SH2 (Src homology 2) domain, a proline-rich SH3 domain, and a Vps9 domain.
Probab=25.76 E-value=1.1e+02 Score=15.65 Aligned_cols=23 Identities=0% Similarity=0.066 Sum_probs=18.3
Q ss_pred CCCCCHHHHHHHHHHHhcCCCHH
Q psy5907 18 NLDLKLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 18 ~p~~~~~ei~k~l~~~W~~l~~~ 40 (56)
-|+.+..++..+++++-+--.++
T Consensus 21 ~P~~tt~~vc~lcA~Kf~V~qPe 43 (87)
T cd01776 21 RPYITTEDVCQLCAEKFKVTQPE 43 (87)
T ss_pred CCCCcHHHHHHHHHHHhccCChh
Confidence 38899999999999887655444
No 171
>PRK09614 nrdF ribonucleotide-diphosphate reductase subunit beta; Reviewed
Probab=25.45 E-value=73 Score=19.44 Aligned_cols=16 Identities=25% Similarity=0.669 Sum_probs=13.5
Q ss_pred HhcCCCHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVED 48 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~ 48 (56)
.|+.||++||..|...
T Consensus 47 dw~~Lt~~Er~~~~~~ 62 (324)
T PRK09614 47 DWKKLSDEEKNLYTRV 62 (324)
T ss_pred HHHhCCHHHHHHHHHH
Confidence 4899999999988754
No 172
>PF05047 L51_S25_CI-B8: Mitochondrial ribosomal protein L51 / S25 / CI-B8 domain ; InterPro: IPR007741 Proteins containing this domain are located in the mitochondrion and include ribosomal protein L51, and S25. This domain is also found in mitochondrial NADH-ubiquinone oxidoreductase B8 subunit (CI-B8) 1.6.5.3 from EC. It is not known whether all members of this family form part of the NADH-ubiquinone oxidoreductase and whether they are also all ribosomal proteins.; PDB: 1S3A_A.
Probab=25.02 E-value=76 Score=13.79 Aligned_cols=17 Identities=12% Similarity=0.249 Sum_probs=12.0
Q ss_pred HHHHHHHHHHHhCCCCC
Q psy5907 6 YSRKVWDQVKAQNLDLK 22 (56)
Q Consensus 6 f~~~~~~~~~~~~p~~~ 22 (56)
|.....+.++..||++.
T Consensus 4 F~~~~lp~l~~~NP~v~ 20 (52)
T PF05047_consen 4 FLKNNLPTLKYHNPQVQ 20 (52)
T ss_dssp HHHHTHHHHHHHSTT--
T ss_pred HHHHhHHHHHHHCCCcE
Confidence 55777788888899864
No 173
>COG2926 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.94 E-value=1.2e+02 Score=15.92 Aligned_cols=18 Identities=39% Similarity=0.409 Sum_probs=14.8
Q ss_pred CCCCHHHHHHHHHHHhcC
Q psy5907 19 LDLKLWEIGKIIGQMWRD 36 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~ 36 (56)
|+++..||..+|..|-..
T Consensus 54 ~~Ms~eei~~II~~MksD 71 (109)
T COG2926 54 PDMSIEEIQGIIESMKSD 71 (109)
T ss_pred CCCCHHHHHHHHHHHHHH
Confidence 899999999998887543
No 174
>TIGR00534 OpcA opcA protein. The opcA gene is found immediately downstream of zwf, the glucose-6-phosphate dehydrogenase (G6PDH) gene, in a number of species, including Mycobacterium tuberculosis, Streptomyces coelicolor, Nostoc punctiforme, and Synechococcus sp. PCC 7942. In the latter, disruption of opcA was shown to block assembly of G6PDH into active oligomeric forms.
Probab=24.92 E-value=83 Score=19.60 Aligned_cols=19 Identities=37% Similarity=0.688 Sum_probs=16.7
Q ss_pred CCCCCHHHHHHHHHHHhcC
Q psy5907 18 NLDLKLWEIGKIIGQMWRD 36 (56)
Q Consensus 18 ~p~~~~~ei~k~l~~~W~~ 36 (56)
-|+.+..+|++.|.+.|.+
T Consensus 5 l~dtt~~~I~~~L~~l~~~ 23 (311)
T TIGR00534 5 PTDITLNEINKELNQLWQS 23 (311)
T ss_pred CCCCCHHHHHHHHHHHHHh
Confidence 3688999999999999975
No 175
>PRK13966 nrdF2 ribonucleotide-diphosphate reductase subunit beta; Provisional
Probab=24.90 E-value=75 Score=19.62 Aligned_cols=16 Identities=19% Similarity=0.320 Sum_probs=13.5
Q ss_pred HhcCCCHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVED 48 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~ 48 (56)
.|+.||+.+|..|...
T Consensus 49 dw~~Lt~~Ek~~~~~~ 64 (324)
T PRK13966 49 SWGTLTAGEKQLTMRV 64 (324)
T ss_pred HHHhCCHHHHHHHHHH
Confidence 4899999999988754
No 176
>PRK08943 lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase; Validated
Probab=24.78 E-value=1.5e+02 Score=18.01 Aligned_cols=26 Identities=19% Similarity=0.148 Sum_probs=20.5
Q ss_pred HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.+..-.|+++..++.+++.+.|.++.
T Consensus 69 NL~~afPe~s~~e~~~i~~~~~~n~~ 94 (314)
T PRK08943 69 NLSLCFPEKSEAEREAIIDEMFATAP 94 (314)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence 44445699999999999999887664
No 177
>COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones]
Probab=24.75 E-value=2e+02 Score=18.53 Aligned_cols=40 Identities=18% Similarity=0.082 Sum_probs=28.0
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHHHHHHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQKTEYVE 47 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK~~y~~ 47 (56)
+.+|...++-||+.+. .|-=+.|.+....|++.+|..-.+
T Consensus 23 kAYRkLA~kyHPD~n~g~~~AeeKFKEI~eAYEVLsD~eKRa~YD 67 (371)
T COG0484 23 KAYRKLAKKYHPDRNPGDKEAEEKFKEINEAYEVLSDPEKRAAYD 67 (371)
T ss_pred HHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHHHHhCCHHHHHHhh
Confidence 4556667777887764 445577889999998887765333
No 178
>PRK14282 chaperone protein DnaJ; Provisional
Probab=24.66 E-value=1.9e+02 Score=18.17 Aligned_cols=39 Identities=26% Similarity=0.110 Sum_probs=26.6
Q ss_pred HHHHHHHHHhCCCCCH------HHHHHHHHHHhcCCCHHHHH-HHH
Q psy5907 8 RKVWDQVKAQNLDLKL------WEIGKIIGQMWRDLPEDQKT-EYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~------~ei~k~l~~~W~~l~~~eK~-~y~ 46 (56)
+.+|...+.-||+.+. .+.-+.|.+.+..|++.+|. .|.
T Consensus 23 ~ayr~la~~~HPD~~~~~~~~a~~~f~~i~~Ay~vL~d~~kR~~YD 68 (369)
T PRK14282 23 RAYKRLVKEWHPDRHPENRKEAEQKFKEIQEAYEVLSDPQKRAMYD 68 (369)
T ss_pred HHHHHHHHHHCCCCCccchhHHHHHHHHHHHHHHHhcChhhHHHHh
Confidence 4566667777898753 34556788889888877654 444
No 179
>cd08780 Death_TRADD Death Domain of Tumor Necrosis Factor Receptor 1-Associated Death Domain protein. Death domain (DD) of TRADD (TNF Receptor 1-Associated Death Domain or TNFRSF1A-associated via death domain) protein. TRADD is a central signaling adaptor for TNF-receptor 1 (TNFR1), mediating activation of Nuclear Factor -kappaB (NF-kB) and c-Jun N-terminal kinase (JNK), as well as caspase-dependent apoptosis. It also carries important immunological roles including germinal center formation, DR3-mediated T-cell stimulation, and TNFalpha-mediated inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into s
Probab=24.60 E-value=44 Score=17.15 Aligned_cols=17 Identities=18% Similarity=0.503 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHhcCCCH
Q psy5907 23 LWEIGKIIGQMWRDLPE 39 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~ 39 (56)
+..++.-+|..|+.+..
T Consensus 5 ~q~~~~nvGr~WK~laR 21 (90)
T cd08780 5 QQHFAKSVGKKWKPVGR 21 (90)
T ss_pred HHHHHHHHhHHHHHHHH
Confidence 45566777777765443
No 180
>PRK14301 chaperone protein DnaJ; Provisional
Probab=24.18 E-value=1.9e+02 Score=18.18 Aligned_cols=39 Identities=13% Similarity=0.034 Sum_probs=26.9
Q ss_pred HHHHHHHHHhCCCCCH-----HHHHHHHHHHhcCCCHHHH-HHHH
Q psy5907 8 RKVWDQVKAQNLDLKL-----WEIGKIIGQMWRDLPEDQK-TEYV 46 (56)
Q Consensus 8 ~~~~~~~~~~~p~~~~-----~ei~k~l~~~W~~l~~~eK-~~y~ 46 (56)
+.+|...+.-||+.+. .+.-+.|.+.|..|++..| ..|.
T Consensus 23 ~ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vL~d~~kr~~yD 67 (373)
T PRK14301 23 KAYRKLALQYHPDRNPDNPEAEQKFKEAAEAYEVLRDAEKRARYD 67 (373)
T ss_pred HHHHHHHHHhCCCcCCCChHHHHHHHHHHHHHHHhcchhhhhhhh
Confidence 4566677777888753 3455688899998887664 4444
No 181
>PF12339 DNAJ_related: DNA-J related protein ; InterPro: IPR021059 This domain family is approximately 130 amino acids in length and contains a conserved YYLD sequence motif. The proteins have a C-terminal DNA-J domain PF00226 from PFAM and most of the sequences are annotated as DNA-J related proteins, other annotations include: DnaJ-class molecular chaperon and formate dehydrogenase; but there is currently no publications to support these annotations.
Probab=24.15 E-value=86 Score=17.05 Aligned_cols=18 Identities=11% Similarity=0.499 Sum_probs=15.3
Q ss_pred CCCHHHHHHHHHHHhcCC
Q psy5907 20 DLKLWEIGKIIGQMWRDL 37 (56)
Q Consensus 20 ~~~~~ei~k~l~~~W~~l 37 (56)
+.+..+|.+.+.+.|+.+
T Consensus 114 ~t~~~~V~~LL~~FW~r~ 131 (132)
T PF12339_consen 114 ETSEAEVERLLNSFWQRY 131 (132)
T ss_pred hcCHHHHHHHHHHHHHhc
Confidence 567899999999999754
No 182
>PTZ00211 ribonucleoside-diphosphate reductase small subunit; Provisional
Probab=24.13 E-value=79 Score=19.47 Aligned_cols=17 Identities=12% Similarity=0.292 Sum_probs=14.1
Q ss_pred HHhcCCCHHHHHHHHHH
Q psy5907 32 QMWRDLPEDQKTEYVED 48 (56)
Q Consensus 32 ~~W~~l~~~eK~~y~~~ 48 (56)
..|+.||+++|..|...
T Consensus 56 ~dw~~Lt~~Er~~~~~~ 72 (330)
T PTZ00211 56 KDWEKLNDGERHFIKHV 72 (330)
T ss_pred HHHHhCCHHHHHHHHHH
Confidence 34999999999988754
No 183
>PF10407 Cytokin_check_N: Cdc14 phosphatase binding protein N-terminus ; InterPro: IPR018844 Cytokinesis in yeasts involves a family of proteins whose essential function is to bind Cdc14-family phosphatase and prevent this from being sequestered and inhibited in the nucleolus. This is the highly conserved N terminus of a family of proteins which act as cytokinesis checkpoint controls by allowing cells to cope with cytokinesis defects. These proteins are required for rDNA silencing and mini-chromosome maintenance [].
Probab=24.13 E-value=87 Score=15.33 Aligned_cols=23 Identities=13% Similarity=0.203 Sum_probs=18.7
Q ss_pred CCCCHHHHHHHHHHHhcCCCHHH
Q psy5907 19 LDLKLWEIGKIIGQMWRDLPEDQ 41 (56)
Q Consensus 19 p~~~~~ei~k~l~~~W~~l~~~e 41 (56)
|+.+..++...|.++|+.|=+.+
T Consensus 11 ~~~tl~~L~~eI~~~f~kLYP~~ 33 (73)
T PF10407_consen 11 PNNTLSQLKEEIEERFKKLYPNE 33 (73)
T ss_pred CCCcHHHHHHHHHHHHHHHCCCC
Confidence 78889999999999998765544
No 184
>TIGR02269 Myxococcus xanthus paralogous lipoprotein family TIGR02269. This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown.
Probab=23.55 E-value=1.7e+02 Score=17.29 Aligned_cols=24 Identities=17% Similarity=0.242 Sum_probs=18.3
Q ss_pred HHHHHHHHhCCCCCHHHHHHHHHH
Q psy5907 9 KVWDQVKAQNLDLKLWEIGKIIGQ 32 (56)
Q Consensus 9 ~~~~~~~~~~p~~~~~ei~k~l~~ 32 (56)
+.+..+...||+.+..+|-+-.++
T Consensus 174 ~aWr~f~~~~~~as~e~i~~hage 197 (211)
T TIGR02269 174 QAWRDFARRYPGASPEEIWRHAGE 197 (211)
T ss_pred HHHHHHHHhCCCCCHHHHHHHHHH
Confidence 345567778999999999877665
No 185
>PF07055 Eno-Rase_FAD_bd: Enoyl reductase FAD binding domain; InterPro: IPR010758 This family contains a number of bacterial putative reductases. ; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3S8M_A 3ZU5_A 3ZU3_A 3ZU4_A 3ZU2_A.
Probab=23.44 E-value=62 Score=15.55 Aligned_cols=19 Identities=5% Similarity=0.427 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHhcCCCHHH
Q psy5907 23 LWEIGKIIGQMWRDLPEDQ 41 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~~e 41 (56)
..+|...+.+.|..++.+-
T Consensus 16 ~~dvQ~~V~~lw~~it~en 34 (65)
T PF07055_consen 16 RPDVQAEVAELWEQITTEN 34 (65)
T ss_dssp SHHHHHHHHHHHCCT-CCC
T ss_pred CHHHHHHHHHHHHHhcccc
Confidence 3577888999999888653
No 186
>PF10077 DUF2314: Uncharacterized protein conserved in bacteria (DUF2314); InterPro: IPR018756 This domain of unkown function is found in various bacterial hypothetical proteins, as well as putative ankyrin repeat proteins.
Probab=23.31 E-value=1.4e+02 Score=16.09 Aligned_cols=20 Identities=25% Similarity=0.337 Sum_probs=15.1
Q ss_pred HHHHHhcCCCHHHHHHHHHH
Q psy5907 29 IIGQMWRDLPEDQKTEYVED 48 (56)
Q Consensus 29 ~l~~~W~~l~~~eK~~y~~~ 48 (56)
.+--+.+.||++|+..|.+.
T Consensus 109 Ti~~~~~~m~~~e~~~~d~~ 128 (133)
T PF10077_consen 109 TIRAMRSRMSEEERDEHDEA 128 (133)
T ss_pred HHHHHHhhCCHHHHHHHHHH
Confidence 34556788999999888764
No 187
>cd08803 Death_ank3 Death domain of Ankyrin-3. Death Domain (DD) of the human protein ankyrin-3 (ANK-3) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-3, also called anykyrin-G (for general or giant), is found in neurons and at least one splice variant has been shown to be essential for propagation of action potentials as a binding partner to neurofascin and voltage-gated sodium channels. It is required for maintaining axo-dendritic polarity, and may be a genetic risk factor associated with bipolar disorder. ANK-3 may also play roles in other cell types. Mutations affecting ANK-3 pathways for Na channel localization are associated with Brugada syndrome, a potentially fata arrythmia. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by se
Probab=22.81 E-value=31 Score=17.17 Aligned_cols=24 Identities=13% Similarity=0.386 Sum_probs=14.4
Q ss_pred CHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907 22 KLWEIGKIIGQMWRDLPEDQKTEY 45 (56)
Q Consensus 22 ~~~ei~k~l~~~W~~l~~~eK~~y 45 (56)
....|+..||.-|..|..+-.-++
T Consensus 6 ~l~~ia~~LG~dW~~LA~eLg~s~ 29 (84)
T cd08803 6 RMAIVADHLGLSWTELARELNFSV 29 (84)
T ss_pred HHHHHHHHhhccHHHHHHHcCCCH
Confidence 345677777777776655443333
No 188
>cd07765 KRAB_A-box KRAB (Kruppel-associated box) domain -A box. The KRAB domain is a transcription repression module, found in a subgroup of the zinc finger proteins (ZFPs) of the C2H2 family, KRAB-ZFPs. KRAB-ZFPs comprise the largest group of transcriptional regulators in mammals, and are only found in tetrapods. These proteins have been shown to play important roles in cell differentiation and organ development, and in regulating viral replication and transcription. A KRAB domain may consist of an A-box, or of an A-box plus either a B-box, a divergent B-box (b), or a C-box. Only the A-box is included in this model. The A-box is needed for repression, the B- and C- boxes are not. KRAB-ZFPs have one or two KRAB domains at their amino-terminal end, and multiple C2H2 zinc finger motifs at their C-termini. Some KRAB-ZFPs also contain a SCAN domain which mediates homo- and hetero-oligomerization. The KRAB domain is a protein-protein interaction module which represses transcription through
Probab=22.62 E-value=51 Score=10.95 Aligned_cols=18 Identities=22% Similarity=0.387 Sum_probs=13.4
Q ss_pred HHhcCCCHHHHHHHHHHH
Q psy5907 32 QMWRDLPEDQKTEYVEDY 49 (56)
Q Consensus 32 ~~W~~l~~~eK~~y~~~a 49 (56)
+.|..+...++..|.+..
T Consensus 13 ~~~~~~~~~~~~~~~~~~ 30 (40)
T cd07765 13 EEWELLDPAQRDLYRDVM 30 (40)
T ss_pred HHHhcCCHHHHHHHHHHH
Confidence 578888888887776543
No 189
>PRK08733 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=22.07 E-value=1.8e+02 Score=17.57 Aligned_cols=26 Identities=8% Similarity=-0.039 Sum_probs=19.6
Q ss_pred HHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 13 QVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 13 ~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
.+..-.|+++..|+.+++.+.|.++.
T Consensus 64 NL~~~fP~~s~~e~~~i~~~~~~~~~ 89 (306)
T PRK08733 64 NLKLCFPEQDDAWRARLLRDSFDALG 89 (306)
T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence 34444699999999998888887544
No 190
>PRK08326 ribonucleotide-diphosphate reductase subunit beta; Validated
Probab=21.93 E-value=95 Score=19.02 Aligned_cols=17 Identities=12% Similarity=0.384 Sum_probs=14.0
Q ss_pred HHhcCCCHHHHHHHHHH
Q psy5907 32 QMWRDLPEDQKTEYVED 48 (56)
Q Consensus 32 ~~W~~l~~~eK~~y~~~ 48 (56)
..|+.||+.||..|...
T Consensus 51 ~dw~~Lt~~Er~~~~~i 67 (311)
T PRK08326 51 EDWEKLSDEERDYATRL 67 (311)
T ss_pred HHHHhCCHHHHHHHHHH
Confidence 45999999999988754
No 191
>PF09349 OHCU_decarbox: OHCU decarboxylase; InterPro: IPR018020 The proteins in this entry are OHCU decarboxylase, an enzyme of the purine catabolism that catalyses the conversion of OHCU into S(+)-allantoin []; it is the third step of the conversion of uric acid (a purine derivative) to allantoin. Step one is catalysed by urate oxidase (IPR002042 from INTERPRO) and step two is catalysed by hydroxyisourate hydrolase (IPR000895 from INTERPRO). ; PDB: 3O7I_B 3O7H_B 3O7J_A 3O7K_A 2Q37_A 2O70_B 2O73_C 2O74_C 2O8I_A.
Probab=21.75 E-value=1.6e+02 Score=16.22 Aligned_cols=28 Identities=18% Similarity=0.049 Sum_probs=22.9
Q ss_pred CCCCCHHHHHHHHHHHhcCCCHHHHHHH
Q psy5907 18 NLDLKLWEIGKIIGQMWRDLPEDQKTEY 45 (56)
Q Consensus 18 ~p~~~~~ei~k~l~~~W~~l~~~eK~~y 45 (56)
.|-.+..++...+...|..++++++.-.
T Consensus 31 rPf~s~~~L~~a~~~~~~~~~~~~~~~~ 58 (159)
T PF09349_consen 31 RPFASVDALIAAADEAVRSLSEEDKLEA 58 (159)
T ss_dssp GS-SSHHHHHHHHHHHHHCS-HHHHHHH
T ss_pred CCCCCHHHHHHHHHHHHHhCCHHHHHHH
Confidence 5889999999999999999999887653
No 192
>PF11953 DUF3470: Domain of unknown function (DUF3470); InterPro: IPR022569 This functionally uncharacterised domain is found in bacteria and is about 50 amino acids in length. It is found C-terminal to PF00037 from PFAM. It contains a single completely conserved residue N that may be functionally important. ; PDB: 1FRX_A 1GAO_A 1FRL_A 1AXQ_A 1FTC_B 1FDA_A 1A6L_A 1FDD_A 1F5B_A 1F5C_A ....
Probab=21.50 E-value=37 Score=14.87 Aligned_cols=19 Identities=16% Similarity=0.534 Sum_probs=11.6
Q ss_pred CHHHHHHHHHHHhcCCCHH
Q psy5907 22 KLWEIGKIIGQMWRDLPED 40 (56)
Q Consensus 22 ~~~ei~k~l~~~W~~l~~~ 40 (56)
.+.+|...++..|-+++..
T Consensus 3 ~f~elN~ela~~WP~It~k 21 (43)
T PF11953_consen 3 HFIELNAELAKKWPNITEK 21 (43)
T ss_dssp HHHHHHHHHHTTS-BE-S-
T ss_pred HHHHHHHHHHHhCCCcCcc
Confidence 3567778888888776654
No 193
>cd00183 TFIIS_I N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a domain found in elongin A and CRSP70; likely to be involved in transcription; domain I from TFIIS interacts with RNA polymerase II holoenzyme
Probab=21.19 E-value=65 Score=15.45 Aligned_cols=21 Identities=19% Similarity=0.495 Sum_probs=14.4
Q ss_pred hCCCCCHHHHHHHHHHHhcCC
Q psy5907 17 QNLDLKLWEIGKIIGQMWRDL 37 (56)
Q Consensus 17 ~~p~~~~~ei~k~l~~~W~~l 37 (56)
.|++-....+++.|-..|+.+
T Consensus 54 kh~~~~i~~~A~~Lv~~Wk~~ 74 (76)
T cd00183 54 KHSNEKIRKLAKALIKSWKKL 74 (76)
T ss_pred cCCcHHHHHHHHHHHHHHHHh
Confidence 355555677778888888753
No 194
>PRK06860 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=21.10 E-value=2e+02 Score=17.39 Aligned_cols=27 Identities=7% Similarity=0.031 Sum_probs=20.6
Q ss_pred HHHHHhCCCCCHHHHHHHHHHHhcCCC
Q psy5907 12 DQVKAQNLDLKLWEIGKIIGQMWRDLP 38 (56)
Q Consensus 12 ~~~~~~~p~~~~~ei~k~l~~~W~~l~ 38 (56)
..+..-.|+++..|+.+++.+.|.++.
T Consensus 63 ~NL~~~fPe~s~~e~~~i~~~~~~~~g 89 (309)
T PRK06860 63 RNLELCFPEMSEQEREAIVVKNFESVG 89 (309)
T ss_pred HHHHHhCCCCCHHHHHHHHHHHHHHHH
Confidence 344455699999999998888887654
No 195
>PF04031 Las1: Las1-like ; InterPro: IPR007174 Las1 is an essential nuclear protein involved in cell morphogenesis and cell surface growth [].
Probab=21.10 E-value=98 Score=17.09 Aligned_cols=29 Identities=14% Similarity=0.206 Sum_probs=13.2
Q ss_pred hHHhHHHHHHHHHHHhCCCCCHHHHHHHH
Q psy5907 2 PYMRYSRKVWDQVKAQNLDLKLWEIGKII 30 (56)
Q Consensus 2 ay~lf~~~~~~~~~~~~p~~~~~ei~k~l 30 (56)
|++.|....-+.........+...+++.+
T Consensus 83 aivRfVNgl~D~~Q~~~~a~si~~~A~~i 111 (154)
T PF04031_consen 83 AIVRFVNGLVDPSQQGKYARSIASLAKEI 111 (154)
T ss_pred hHHHHHHHhhhHhhccchhhhHHHHHHHc
Confidence 45556555555443323333444444443
No 196
>PRK09101 nrdB ribonucleotide-diphosphate reductase subunit beta; Reviewed
Probab=20.93 E-value=1e+02 Score=19.49 Aligned_cols=16 Identities=25% Similarity=0.634 Sum_probs=13.2
Q ss_pred HhcCCCHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVED 48 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~ 48 (56)
.|..||+++|..|...
T Consensus 62 dw~~Lt~~Er~~~~~~ 77 (376)
T PRK09101 62 DYQALPEHEKHIFISN 77 (376)
T ss_pred HHHhCCHHHHHHHHHH
Confidence 4999999999988643
No 197
>cd08777 Death_RIP1 Death Domain of Receptor-Interacting Protein 1. Death domain (DD) found in Receptor-Interacting Protein 1 (RIP1) and related proteins. RIP kinases serve as essential sensors of cellular stress. Vertebrates contain several types containing a homologous N-terminal kinase domain and varying C-terminal domains. RIP1 harbors a C-terminal DD, which binds death receptors (DRs) including TNF receptor 1, Fas, TNF-related apoptosis-inducing ligand receptor 1 (TRAILR1), and TRAILR2. It also interacts with other DD-containing adaptor proteins such as TRADD and FADD. RIP1 plays a crucial role in determining a cell's fate, between survival or death, following exposure to stress signals. It is important in the signaling of NF-kappaB and MAPKs, and it links DR-associated signaling to reactive oxygen species (ROS) production. Abnormal RIP1 function may result in ROS accumulation affecting inflammatory responses, innate immunity, stress responses, and cell survival. In general, DDs ar
Probab=20.89 E-value=47 Score=16.52 Aligned_cols=17 Identities=18% Similarity=0.423 Sum_probs=12.2
Q ss_pred HHHHHHHHHHHhcCCCH
Q psy5907 23 LWEIGKIIGQMWRDLPE 39 (56)
Q Consensus 23 ~~ei~k~l~~~W~~l~~ 39 (56)
..-|+..+|..|+.+..
T Consensus 5 l~~l~~~lG~~Wk~lar 21 (86)
T cd08777 5 LDLLRENLGKKWKRCAR 21 (86)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 45677888899986653
No 198
>PRK02955 small acid-soluble spore protein SspI; Provisional
Probab=20.83 E-value=1.2e+02 Score=14.71 Aligned_cols=17 Identities=29% Similarity=0.739 Sum_probs=13.3
Q ss_pred HHhcCCCHHHHHHHHHH
Q psy5907 32 QMWRDLPEDQKTEYVED 48 (56)
Q Consensus 32 ~~W~~l~~~eK~~y~~~ 48 (56)
..|+.+++.+|....+.
T Consensus 46 ~~W~~~~~~ek~~m~~~ 62 (68)
T PRK02955 46 VIWKNADENEKDEMLET 62 (68)
T ss_pred HHHHhcCHHHHHHHHHH
Confidence 35999999999876543
No 199
>PF10562 CaM_bdg_C0: Calmodulin-binding domain C0 of NMDA receptor NR1 subunit; InterPro: IPR018882 This is a very short highly conserved domain that is C-terminal to the cytosolic transmembrane region IV of the NMDA-receptor 1. It has been shown to bind Calmodulin-Calcium with high affinity. The ionotropic N-methyl-D-aspartate receptor (NMDAR) is a major source of calcium flux into neurons in the brain and plays a critical role in learning, memory, neural development, and synaptic plasticity. Calmodulin (CaM) regulates NMDARs by binding tightly to the C0 and C1 regions of their NR1 subunit. The conserved tryptophan is considered to be the anchor residue [].
Probab=20.63 E-value=85 Score=12.69 Aligned_cols=14 Identities=21% Similarity=0.556 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHhcC
Q psy5907 23 LWEIGKIIGQMWRD 36 (56)
Q Consensus 23 ~~ei~k~l~~~W~~ 36 (56)
--|+++...+.|+.
T Consensus 13 ~~elAr~a~dkWR~ 26 (29)
T PF10562_consen 13 QLELARHAADKWRG 26 (29)
T ss_pred HHHHHHHHHHHHHh
Confidence 35788888889974
No 200
>cd03577 NTR_TIMP_like NTR domain, TIMP-like subfamily; TIMPs, or tissue inibitors of metalloproteases, are essential regulators of extracellular matrix turnover and remodeling. They form complexes with matrix metalloproteases (MMPs) and inactivate them irreversibly by non-covalently binding their active zinc-binding sites. This group contains domains similar to the TIMP NTR domain, which binds MMPs. Members of this group may or may not function as MMP inhibitors.
Probab=20.57 E-value=41 Score=17.17 Aligned_cols=17 Identities=35% Similarity=0.597 Sum_probs=12.2
Q ss_pred HhcCCCHHHHHHHHHHH
Q psy5907 33 MWRDLPEDQKTEYVEDY 49 (56)
Q Consensus 33 ~W~~l~~~eK~~y~~~a 49 (56)
.|+.|+.++|.-....+
T Consensus 97 ~W~~ls~~qkr~l~~~y 113 (116)
T cd03577 97 PWDDLTKEQKRGLKGLY 113 (116)
T ss_pred EhHHCCHHHHhhhhhcc
Confidence 58888888887665544
No 201
>cd00155 RasGEF Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
Probab=20.57 E-value=1.4e+02 Score=17.00 Aligned_cols=22 Identities=23% Similarity=0.299 Sum_probs=17.4
Q ss_pred HHHHHhcCCCHHHHHHHHHHHH
Q psy5907 29 IIGQMWRDLPEDQKTEYVEDYE 50 (56)
Q Consensus 29 ~l~~~W~~l~~~eK~~y~~~a~ 50 (56)
.+...|..++++.+..|.++..
T Consensus 111 RL~~tw~~l~~~~~~~~~~l~~ 132 (237)
T cd00155 111 RLKKTWEVLSSKLKKLFEELEE 132 (237)
T ss_pred hHHHHHHHCCHHHHHHHHHHHH
Confidence 3577899999999988876654
No 202
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=20.51 E-value=1.9e+02 Score=17.61 Aligned_cols=27 Identities=15% Similarity=0.294 Sum_probs=15.8
Q ss_pred hHHHHHHHHHHHhCCCCCHHHHHHHHH
Q psy5907 5 RYSRKVWDQVKAQNLDLKLWEIGKIIG 31 (56)
Q Consensus 5 lf~~~~~~~~~~~~p~~~~~ei~k~l~ 31 (56)
.|.++....-+..+|..+..++++.+|
T Consensus 12 ~fl~d~ye~rk~~~p~fS~R~fa~~~G 38 (271)
T TIGR02147 12 KYLRDYYEERKKTDPAFSWRFFAEKAG 38 (271)
T ss_pred HHHHHHHHHHhccCcCcCHHHHHHHhC
Confidence 455565555555566666666655555
No 203
>PF04545 Sigma70_r4: Sigma-70, region 4; InterPro: IPR007630 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. Region 4 of sigma-70 like sigma-factors is involved in binding to the -35 promoter element via a helix-turn-helix motif []. Due to the way Pfam works, the threshold has been set artificially high to prevent overlaps with other helix-turn-helix families. Therefore there are many false negatives.; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 2P7V_B 3IYD_F 1TLH_B 1KU7_A 1RIO_H 3N97_A 1KU3_A 1RP3_C 1SC5_A 1NR3_A ....
Probab=20.45 E-value=91 Score=13.27 Aligned_cols=14 Identities=36% Similarity=0.342 Sum_probs=10.1
Q ss_pred CCCCCHHHHHHHHH
Q psy5907 18 NLDLKLWEIGKIIG 31 (56)
Q Consensus 18 ~p~~~~~ei~k~l~ 31 (56)
..+++..||+..+|
T Consensus 18 ~~~~t~~eIa~~lg 31 (50)
T PF04545_consen 18 FEGLTLEEIAERLG 31 (50)
T ss_dssp TST-SHHHHHHHHT
T ss_pred cCCCCHHHHHHHHC
Confidence 36788888888776
Done!