Query psy8269
Match_columns 386
No_of_seqs 294 out of 1297
Neff 6.5
Searched_HMMs 46136
Date Fri Aug 16 23:00:09 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy8269.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8269hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK10907 intramembrane serine 99.9 2.4E-25 5.2E-30 215.3 17.0 177 170-364 93-270 (276)
2 PTZ00101 rhomboid-1 protease; 99.9 3.9E-25 8.5E-30 213.6 17.1 183 172-364 54-240 (278)
3 COG0705 Membrane associated se 99.9 8.9E-24 1.9E-28 198.7 13.7 187 171-364 17-211 (228)
4 KOG2980|consensus 99.9 2.9E-24 6.3E-29 205.5 5.0 211 15-370 96-308 (310)
5 COG5208 HAP5 CCAAT-binding fac 99.9 5.2E-23 1.1E-27 186.9 7.5 108 86-194 82-189 (286)
6 PF01694 Rhomboid: Rhomboid fa 99.8 2.3E-21 4.9E-26 168.7 5.9 141 216-365 3-143 (145)
7 KOG2289|consensus 99.8 1.3E-21 2.9E-26 190.9 3.0 142 213-364 112-254 (316)
8 KOG1657|consensus 99.7 1.6E-18 3.5E-23 163.5 2.4 104 91-196 53-156 (236)
9 KOG2632|consensus 99.6 1.9E-15 4.1E-20 142.9 12.5 182 163-360 7-194 (258)
10 KOG2290|consensus 99.6 5.9E-16 1.3E-20 154.5 3.6 136 218-364 450-585 (652)
11 PF00808 CBFD_NFYB_HMF: Histon 99.2 3.8E-11 8.2E-16 91.7 5.8 64 114-177 2-65 (65)
12 KOG1659|consensus 99.0 2.9E-10 6.4E-15 104.0 6.2 84 109-192 8-91 (224)
13 COG5247 BUR6 Class 2 transcrip 99.0 1.2E-09 2.6E-14 88.9 6.0 79 114-192 23-101 (113)
14 PF08551 DUF1751: Eukaryotic i 98.4 1.9E-07 4.1E-12 77.4 3.6 82 218-299 7-96 (99)
15 KOG1658|consensus 98.3 3.5E-07 7.6E-12 80.2 3.2 82 111-192 56-137 (162)
16 PF04511 DER1: Der1-like famil 98.2 1.7E-05 3.8E-10 73.4 12.6 137 216-361 39-179 (197)
17 KOG2890|consensus 98.1 1.3E-05 2.8E-10 77.8 8.1 154 212-370 58-221 (326)
18 KOG0858|consensus 97.8 7.8E-05 1.7E-09 70.4 8.6 132 215-361 49-184 (239)
19 COG2036 HHT1 Histones H3 and H 96.2 0.0092 2E-07 48.7 4.9 74 105-179 10-83 (91)
20 cd00074 H2A Histone 2A; H2A is 95.8 0.036 7.8E-07 47.2 7.1 74 115-188 21-94 (115)
21 PF00125 Histone: Core histone 95.5 0.037 8.1E-07 42.8 5.6 64 114-177 8-72 (75)
22 KOG0869|consensus 95.2 0.052 1.1E-06 48.2 6.2 72 113-184 31-103 (168)
23 smart00803 TAF TATA box bindin 95.2 0.058 1.3E-06 41.3 5.8 63 114-177 2-64 (65)
24 cd00076 H4 Histone H4, one of 95.2 0.067 1.5E-06 43.2 6.3 67 111-178 10-76 (85)
25 PLN00035 histone H4; Provision 94.8 0.093 2E-06 43.8 6.3 66 113-179 28-93 (103)
26 PTZ00015 histone H4; Provision 94.2 0.16 3.4E-06 42.4 6.4 68 110-178 26-93 (102)
27 COG5262 HTA1 Histone H2A [Chro 93.8 0.21 4.5E-06 42.3 6.4 78 111-188 23-100 (132)
28 COG5291 Predicted membrane pro 93.5 0.27 5.8E-06 47.1 7.3 45 212-256 53-99 (313)
29 smart00417 H4 Histone H4. 93.2 0.24 5.2E-06 39.0 5.5 63 111-174 10-72 (74)
30 cd07981 TAF12 TATA Binding Pro 92.0 0.53 1.2E-05 36.6 6.1 64 116-179 3-66 (72)
31 smart00414 H2A Histone 2A. 90.0 0.96 2.1E-05 38.0 6.1 73 115-187 10-82 (106)
32 cd08048 TAF11 TATA Binding Pro 89.9 1.2 2.6E-05 36.0 6.4 71 106-177 8-81 (85)
33 KOG4463|consensus 89.6 0.24 5.3E-06 47.8 2.5 60 217-277 49-108 (323)
34 KOG0870|consensus 87.4 1.9 4.1E-05 38.9 6.5 76 114-189 10-87 (172)
35 PTZ00017 histone H2A; Provisio 86.5 1.9 4.2E-05 37.7 6.0 73 115-187 28-100 (134)
36 PF09415 CENP-X: CENP-S associ 85.9 2.1 4.7E-05 33.4 5.4 60 116-175 1-63 (72)
37 KOG1756|consensus 84.7 3.6 7.7E-05 35.6 6.6 77 113-189 26-102 (131)
38 PF13150 DUF3989: Protein of u 83.7 0.66 1.4E-05 37.5 1.7 68 18-87 4-76 (85)
39 PLN00154 histone H2A; Provisio 83.1 3.3 7.1E-05 36.3 5.8 74 115-188 39-113 (136)
40 KOG0871|consensus 81.2 5.7 0.00012 35.2 6.6 77 110-186 8-85 (156)
41 COG5150 Class 2 transcription 79.6 5.1 0.00011 34.6 5.6 84 109-192 6-93 (148)
42 smart00428 H3 Histone H3. 78.2 8.5 0.00018 32.3 6.5 81 97-177 15-98 (105)
43 PLN00156 histone H2AX; Provisi 77.3 6.1 0.00013 34.8 5.6 72 115-186 30-101 (139)
44 PLN00157 histone H2A; Provisio 77.0 6.8 0.00015 34.2 5.8 70 115-184 27-96 (132)
45 PLN00153 histone H2A; Provisio 76.8 6 0.00013 34.4 5.4 70 115-184 25-94 (129)
46 PF03419 Peptidase_U4: Sporula 67.7 57 0.0012 31.9 10.7 40 233-276 11-50 (293)
47 COG4769 Predicted membrane pro 66.3 67 0.0014 29.3 9.7 119 237-364 41-163 (181)
48 smart00576 BTP Bromodomain tra 65.3 14 0.00031 28.8 4.8 55 126-180 17-71 (77)
49 PF04719 TAFII28: hTAFII28-lik 61.8 21 0.00045 29.2 5.2 73 105-177 14-87 (90)
50 cd08050 TAF6 TATA Binding Prot 61.4 17 0.00037 36.6 5.8 61 118-179 3-63 (343)
51 cd07979 TAF9 TATA Binding Prot 59.3 28 0.00061 29.6 5.9 61 118-179 5-65 (117)
52 PF02969 TAF: TATA box binding 58.7 22 0.00048 27.3 4.6 49 129-177 17-65 (66)
53 PLN00161 histone H3; Provision 57.4 38 0.00082 29.7 6.4 81 97-177 41-123 (135)
54 TIGR02854 spore_II_GA sigma-E 57.1 1E+02 0.0022 30.3 10.2 41 232-276 10-50 (288)
55 COG4603 ABC-type uncharacteriz 54.3 2.2E+02 0.0048 29.0 12.3 123 170-299 142-291 (356)
56 PTZ00252 histone H2A; Provisio 54.3 35 0.00077 29.9 5.7 70 115-184 26-97 (134)
57 PF13105 DUF3959: Protein of u 53.4 66 0.0014 29.8 7.5 19 346-364 134-152 (239)
58 PF04632 FUSC: Fusaric acid re 50.3 3.3E+02 0.0072 29.3 13.8 35 347-381 474-511 (650)
59 PF03847 TFIID_20kDa: Transcri 48.3 57 0.0012 25.1 5.5 60 119-178 4-63 (68)
60 TIGR00834 ae anion exchange pr 46.8 2.2E+02 0.0047 32.7 11.8 64 232-299 376-440 (900)
61 PF09011 HMG_box_2: HMG-box do 46.2 4.3 9.4E-05 31.1 -1.1 43 15-64 29-71 (73)
62 KOG1658|consensus 45.5 24 0.00053 31.6 3.4 45 110-154 7-51 (162)
63 PF06728 PIG-U: GPI transamida 44.7 40 0.00088 34.4 5.4 88 211-309 254-342 (382)
64 PF03540 TFIID_30kDa: Transcri 43.5 74 0.0016 23.2 5.1 48 114-162 2-49 (51)
65 smart00398 HMG high mobility g 43.5 6.9 0.00015 28.9 -0.3 40 17-63 28-67 (70)
66 cd01390 HMGB-UBF_HMG-box HMGB- 40.9 7.1 0.00015 28.7 -0.6 24 17-40 27-50 (66)
67 PTZ00199 high mobility group p 38.3 9.4 0.0002 31.1 -0.3 41 18-65 52-92 (94)
68 cd01388 SOX-TCF_HMG-box SOX-TC 36.9 7.7 0.00017 29.6 -1.0 42 17-65 28-69 (72)
69 cd01389 MATA_HMG-box MATA_HMG- 36.9 7.3 0.00016 30.1 -1.1 44 17-67 28-71 (77)
70 PF15510 CENP-W: Centromere ki 35.4 84 0.0018 25.8 4.7 63 113-176 15-92 (102)
71 PF12805 FUSC-like: FUSC-like 34.7 2.4E+02 0.0051 27.3 8.8 12 344-355 70-81 (284)
72 COG0705 Membrane associated se 33.9 29 0.00063 32.3 2.2 69 216-302 137-205 (228)
73 PLN00160 histone H3; Provision 33.8 1.4E+02 0.0031 24.7 6.0 80 97-176 7-88 (97)
74 PF00505 HMG_box: HMG (high mo 33.7 6.9 0.00015 29.1 -1.7 25 16-40 26-50 (69)
75 PF10979 DUF2786: Protein of u 33.7 49 0.0011 23.1 2.8 35 117-151 4-38 (43)
76 PLN00121 histone H3; Provision 33.6 1.2E+02 0.0026 26.7 5.7 80 97-176 48-128 (136)
77 PF09527 ATPase_gene1: Putativ 33.2 1.8E+02 0.004 20.8 6.3 41 234-274 9-50 (55)
78 PF06609 TRI12: Fungal trichot 32.3 1.9E+02 0.0042 31.6 8.4 41 233-274 86-126 (599)
79 PF07524 Bromo_TP: Bromodomain 31.4 1.5E+02 0.0033 22.7 5.7 55 125-179 16-70 (77)
80 COG3105 Uncharacterized protei 31.1 81 0.0018 27.5 4.2 24 341-364 4-27 (138)
81 COG0370 FeoB Fe2+ transport sy 30.8 4E+02 0.0087 29.5 10.4 71 292-364 393-464 (653)
82 PTZ00018 histone H3; Provision 30.4 1.5E+02 0.0032 26.2 5.8 79 97-176 48-128 (136)
83 cd08045 TAF4 TATA Binding Prot 28.6 1.9E+02 0.004 27.0 6.7 66 118-184 52-123 (212)
84 PF14898 DUF4491: Domain of un 28.4 90 0.002 25.7 3.9 44 249-302 26-69 (94)
85 PF15511 CENP-T: Centromere ki 27.8 72 0.0016 33.1 4.1 67 106-172 343-414 (414)
86 KOG1172|consensus 26.0 8.8E+02 0.019 27.8 12.1 59 236-298 371-430 (876)
87 cd00084 HMG-box High Mobility 24.5 21 0.00045 25.9 -0.4 25 16-40 26-50 (66)
88 KOG1142|consensus 24.4 1.1E+02 0.0023 29.9 4.2 71 107-177 147-217 (258)
89 PRK09554 feoB ferrous iron tra 22.8 6.3E+02 0.014 28.5 10.5 51 312-364 425-475 (772)
90 PF14241 DUF4341: Domain of un 22.7 1.5E+02 0.0032 22.3 3.9 34 262-299 6-39 (62)
91 PF04892 VanZ: VanZ like famil 22.1 4.6E+02 0.0099 21.7 9.2 28 341-368 105-132 (133)
92 PF04632 FUSC: Fusaric acid re 21.9 3.3E+02 0.0071 29.3 8.0 31 341-371 464-494 (650)
93 COG4956 Integral membrane prot 21.8 3.1E+02 0.0067 27.7 6.9 86 259-349 49-140 (356)
94 smart00427 H2B Histone H2B. 21.0 2.7E+02 0.0059 22.7 5.3 60 120-180 7-67 (89)
No 1
>PRK10907 intramembrane serine protease GlpG; Provisional
Probab=99.93 E-value=2.4e-25 Score=215.26 Aligned_cols=177 Identities=12% Similarity=0.170 Sum_probs=126.9
Q ss_pred hhhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCC-CCccccceeecccccCChhHHHHHHHHHHHHHHH
Q psy8269 170 RNDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNP-DQYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPS 248 (386)
Q Consensus 170 ~~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p-~~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~ 248 (386)
..+++..++.+|+.+|+.+.+.+....... ..+..++ .++|+||++|++|+|.|+.|+++||+.+|.+|..
T Consensus 93 ~~p~T~~li~i~i~vf~l~~~~~~~~~~~~--------l~~~~~~~~~~q~WRl~T~~flH~~~~Hl~fNml~l~~lG~~ 164 (276)
T PRK10907 93 AGPLTLGVMIACVVVFILMQILGDQTVMLW--------LAWPFDPSLKFELWRYFTHALLHFSLLHILFNLLWWWYLGGA 164 (276)
T ss_pred CCCHHHHHHHHHHHHHHHHHHhccHHHHHH--------HhccccccccCCcHHHHhHHHHhCCHHHHHHHHHHHHHHHHH
Confidence 456888999999999999877553211000 0011111 3679999999999999999999999999999999
Q ss_pred HHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHH
Q psy8269 249 SVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGI 328 (386)
Q Consensus 249 lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~ 328 (386)
+|+.+|+++++.+|++++++|+++++++. ....+|+||+|||++++......+.+.....+|. ....++.++
T Consensus 165 iE~~~G~~~~l~l~l~s~i~~~~~~~~~~-----~~~~gGaSGvVygL~g~~~~~~~~~p~~~~~lp~---~~~~f~llw 236 (276)
T PRK10907 165 VEKRLGSGKLIVITLISALLSGWVQSKFS-----GPWFGGLSGVVYALMGYVWLRGERDPQSGIYLPR---GLIAFALLW 236 (276)
T ss_pred HHHHHChHHHHHHHHHHHHHHHHHHHHHc-----cchhhHHHHHHHHHHHHHHHHhccccccchhhhH---HHHHHHHHH
Confidence 99999999999999999999999988765 3467899999999999876543222221111221 111122222
Q ss_pred HHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 329 MLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 329 l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
+.+...+. + ..+|++.||+||+++|+++++...+
T Consensus 237 l~~g~~~~-~-g~~Ian~AHlgGli~Gll~g~~~~~ 270 (276)
T PRK10907 237 LVAGYFDL-F-GMSIANAAHVAGLAVGLAMAFWDTR 270 (276)
T ss_pred HHHHHHHc-c-CcccHHHHHHHHHHHHHHHHHHhhh
Confidence 22222221 2 3689999999999999999987654
No 2
>PTZ00101 rhomboid-1 protease; Provisional
Probab=99.93 E-value=3.9e-25 Score=213.62 Aligned_cols=183 Identities=22% Similarity=0.216 Sum_probs=122.6
Q ss_pred hHHHHHHHHHHHHHHHhhcccccccccCCCCc--cccc-cccccCCCCccccceeecccccCChhHHHHHHHHHHHHHHH
Q psy8269 172 DIAMAITKYDMFDFLIDIVPREEVKTATGPHR--RDDA-TKTALNPDQYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPS 248 (386)
Q Consensus 172 DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~--~~~~-~~~~l~p~~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~ 248 (386)
-+...|+.++++.|+.++.-+.+ ...+|.. .... ........++|+||++|++|+|.|+.|+++||++++.+|..
T Consensus 54 ~l~~~Iiii~iivfil~l~~~~~--~~l~p~~~~L~~~Ga~~~~~i~~gq~WRLiT~~FlH~~~~HLl~Nm~~l~~~G~~ 131 (278)
T PTZ00101 54 SFIMAISIIQIIVFIISVSIKPA--DFLTPSDSLLVTLGANVASRIKQGEIHRLILPIFLHANIFHTFFNVFFQLRMGFT 131 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHhccc--ccCCCCHHHHHHHhCcchhhhhcCCCHHHHHHHHHccCHHHHHHHHHHHHHHHHH
Confidence 34567788888888887763211 1111110 0000 00111123789999999999999999999999999999999
Q ss_pred HHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHH
Q psy8269 249 SVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGI 328 (386)
Q Consensus 249 lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~ 328 (386)
+|+.+|+.|++.+|+++|++|++++.++. +...++||||++||++|++....-...... +. +......+..+
T Consensus 132 lE~~~G~~r~~ilYl~sGi~G~l~s~~~~----~~~~svGASgAifGLiGa~~~~lil~w~~~---~~-~~~~~~~~i~~ 203 (278)
T PTZ00101 132 LEKNYGIVKIIILYFLTGIYGNILSSSVT----YCPIKVGASTSGMGLLGIVTSELILLWHVI---RH-RERVVFNIIFF 203 (278)
T ss_pred HHHHHChHHHHHHHHHHHHHHHHHHHHHc----cCCcEEehhHHHHHHHHHHHHHHHHHHHhh---cc-HHHHHHHHHHH
Confidence 99999999999999999999999988775 345799999999999999764210000000 00 00000001111
Q ss_pred HHHHHH-HhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 329 MLLDFL-GVLFRWRLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 329 l~~~~~-~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
+++.+. ......+++|+.||+||+++|++++..+.+
T Consensus 204 ~li~~~l~~~~~g~~Id~~aHlGG~i~G~llg~~~~~ 240 (278)
T PTZ00101 204 SLISFFYYFTFNGSNIDHVGHLGGLLSGISMGILYNS 240 (278)
T ss_pred HHHHHHHHHhcCCCCccHHHHHHHHHHHHHHHHHHHh
Confidence 111111 111224789999999999999999998765
No 3
>COG0705 Membrane associated serine protease [Amino acid transport and metabolism]
Probab=99.91 E-value=8.9e-24 Score=198.71 Aligned_cols=187 Identities=23% Similarity=0.277 Sum_probs=139.0
Q ss_pred hhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCC-------CccccceeecccccCChhHHHHHHHHHH
Q psy8269 171 NDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPD-------QYLSVPMLLSTFSHQSPLHIFANMFVLH 243 (386)
Q Consensus 171 ~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~-------~g~~wrllTs~FlH~~~~HLl~Nm~~L~ 243 (386)
..++..+..+|+..|+.+.+...+........ .....+.|. ..|+||++|++|+|.|+.|+++||+.+|
T Consensus 17 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~w~lit~~FlH~~~~Hll~N~~~l~ 92 (228)
T COG0705 17 PPVTLFLILLNILVFLLELVLGWSAIFLLTFL----FRLFGLYPLNLLGALARDQLWRLITAIFLHAGFLHLLFNMLALW 92 (228)
T ss_pred chHHHHHHHHHHHHHHHHHHccchHHHHHHHh----hhHHhhcchhhhccccccchHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 56677888899999999887664311110000 000111111 1189999999999999999999999999
Q ss_pred HHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHH
Q psy8269 244 SFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEH 323 (386)
Q Consensus 244 ~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~ 323 (386)
.+|..+|+.+|+.+|+.+|+.+|+++++++..+.... ..+++||||+++|++++++...|..+....... .+..+..
T Consensus 93 ~fg~~le~~~G~~~f~~~yl~~gl~~~~~~~~~~~~~--~~~~~GASG~i~gllga~~~~~~~~~~~~~~~~-~~~~~~~ 169 (228)
T COG0705 93 VFGSNLERRLGTLRFLLFYLLSGLLAGLAQVLFGPKG--GAPSLGASGAIFGLLGAYFLLFPFARILLLFLS-LPRPALI 169 (228)
T ss_pred HhhHHHHHHhchhHHHHHHHHHHHHHHHHHHHHcccc--cCcccchhHHHHHHHHHHHHHccccchhhhhcc-CchhHHH
Confidence 9999999999999999999999999999988877321 258999999999999999999999877655411 2233444
Q ss_pred HHHHHHHHHHHHhhhcC-chhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 324 AIQGIMLLDFLGVLFRW-RLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 324 ~~~~~l~~~~~~~~~~~-~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
++.+.+..++....... ++|++.||++|+++|++++....+
T Consensus 170 ~i~~~~~~~~~~~~~~~~~~va~~aHl~G~i~G~l~~~~~~~ 211 (228)
T COG0705 170 LILIWLLYSLFSGAGSFGPSVAWSAHLGGLIGGLLLAALLSR 211 (228)
T ss_pred HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44455555554433221 379999999999999999998776
No 4
>KOG2980|consensus
Probab=99.89 E-value=2.9e-24 Score=205.46 Aligned_cols=211 Identities=35% Similarity=0.525 Sum_probs=189.2
Q ss_pred ccchhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhhhccCCCCcchhhcccCCCccccccccccc
Q psy8269 15 SNDIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKYFASNPQSMGITLRLHAVSPEFDSRLGRSF 94 (386)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (386)
.++.+++||++.+.|..++|| ++.+++.+|..||..|++|..+-.|..|+.+++..++
T Consensus 96 ~~~f~~F~~~~~k~w~~~~~g--~v~~ll~~n~~vf~lWrv~~~~~~~~~~mls~~~~~t-------------------- 153 (310)
T KOG2980|consen 96 FGGFTGFQKRGWKHWISGANG--VVFGLLIANAFVFTLWRVPQKQFTMIPWMLSRNAYKT-------------------- 153 (310)
T ss_pred cCCcccceecchHHHhhcCCc--chhHHHHHHHHHHHHHHhcchhhhhhhHHhhcccccc--------------------
Confidence 567889999999999999999 9999999999999999999999999999998876555
Q ss_pred hhhhhHHhhhhhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269 95 FGSVLEVQGALSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA 174 (386)
Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa 174 (386)
T Consensus 154 -------------------------------------------------------------------------------- 153 (310)
T KOG2980|consen 154 -------------------------------------------------------------------------------- 153 (310)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred HHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCCCccccceeecccccCChhHHHHHHHHHHHHHH-HHHhhh
Q psy8269 175 MAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPDQYLSVPMLLSTFSHQSPLHIFANMFVLHSFMP-SSVEDL 253 (386)
Q Consensus 175 ~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~-~lE~~~ 253 (386)
-+|.+++|.|+|.+.+|+..||+.++.+.. .+.-.+
T Consensus 154 -------------------------------------------~~w~i~~s~Fsh~~a~h~g~~~~~~~~y~~~a~~~~~ 190 (310)
T KOG2980|consen 154 -------------------------------------------GCWKIILSTFSHYSALHLGPNMLVLKSYLAGALKGSL 190 (310)
T ss_pred -------------------------------------------cceeEEeehhcchhHhhhcHHHHHHHHHhcccccCCc
Confidence 378999999999999999999999999988 888889
Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHH
Q psy8269 254 GKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDF 333 (386)
Q Consensus 254 G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~ 333 (386)
|...+.++|+++++.|......-.....+.++++||||++|++++..+.++|+.+..+.|++..+..+...+..++.+++
T Consensus 191 ~~~~~~AlylSa~~~~~~i~~~~~v~~~~~gp~LGAsGav~ai~a~~~~lfP~~~~~i~f~~~v~~ga~~~~~~i~~~~~ 270 (310)
T KOG2980|consen 191 GFSSFFALYLSAGVKGLFISVKDKVPTSWAGPSLGASGAVYAILALDCTLFPKTTLYILFVFPVPAGAGLAFKAIAAYDF 270 (310)
T ss_pred chhhcccceeccccccceeEeeccccccccccccccchHHHHHHHHHhhcCcCcceeEEEeecccccchhHHHHHHHhhh
Confidence 99999999998888887766655444456689999999999999999999999999999988888888888889999999
Q ss_pred HHhhhcCchhHHHHHHHHHHHHHHHHHHhhh-cccccc
Q psy8269 334 LGVLFRWRLFDHAAHLGGALFGILYSKYGEQ-TWAHRA 370 (386)
Q Consensus 334 ~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~-~w~~r~ 370 (386)
.+...++...+|+||++|.++|++++.++.. +|++|.
T Consensus 271 a~~~l~~~~~n~~Ah~~gsl~Gv~va~~~~~ri~kgR~ 308 (310)
T KOG2980|consen 271 AGLILGWGFFNHAAHLSGSLFGVVVATYLWARIRKGRF 308 (310)
T ss_pred cceeeccccchhHhhhcchHHHHHHHHHHHHHHHcCcc
Confidence 9988899999999999999999999999744 666664
No 5
>COG5208 HAP5 CCAAT-binding factor, subunit C [Transcription]
Probab=99.88 E-value=5.2e-23 Score=186.93 Aligned_cols=108 Identities=57% Similarity=0.905 Sum_probs=101.4
Q ss_pred ccccccccchhhhhHHhhhhhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccc
Q psy8269 86 FDSRLGRSFFGSVLEVQGALSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKR 165 (386)
Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr 165 (386)
+++...|.|=+++.+.++.+ +.++|.+.+|++||||+||.|+||+||++|+|+||+|+||.||.|+|.|+|..++++||
T Consensus 82 ~~e~i~ryWq~ti~~~e~~~-q~~~k~h~LPlARIkkvMKtdedVkMisaEaPvlFak~~EiFI~ELTmRAW~~ae~NkR 160 (286)
T COG5208 82 LDERISRYWQQTIKAAEEER-QILLKDHNLPLARIKKVMKTDEDVKMISAEAPVLFAKITEIFIEELTMRAWINAEENKR 160 (286)
T ss_pred HHHHHHHHHHHHHHHHHHhH-HHHHHhccCcHHHHHHHHhcccchhheecccchHHHHHHHHHHHHHHHHHHHHHhHhhh
Confidence 67788887777888887755 58899999999999999999999999999999999999999999999999999999999
Q ss_pred cccchhhHHHHHHHHHHHHHHHhhccccc
Q psy8269 166 RTLQRNDIAMAITKYDMFDFLIDIVPREE 194 (386)
Q Consensus 166 ~~iq~~DIa~aI~~i~i~~FL~~ivp~~~ 194 (386)
|++|++||+.++...++||||++++|+..
T Consensus 161 RtLQksDia~Av~kSeMfDFLidivpr~p 189 (286)
T COG5208 161 RTLQKSDIAAAVKKSEMFDFLIDIVPRNP 189 (286)
T ss_pred hHHHHHHHHHHHHHHHHHhHHhhhccCCC
Confidence 99999999999999999999999999964
No 6
>PF01694 Rhomboid: Rhomboid family; InterPro: IPR022764 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of proteins contain serine peptidases belonging to the MEROPS peptidase family S54 (Rhomboid, clan ST). They are integral membrane proteins related to the Drosophila melanogaster (Fruit fly) rhomboid protein P20350 from SWISSPROT. Members of this family are found in archaea, bacteria and eukaryotes. The D. melanogaster rhomboid protease cleaves type-1 transmembrane domains using a catalytic triad composed of serine, histidine and asparagine contributed by different transmembrane domains. It cleaves the transmembrane proteins Spitz, Gurken and Keren within their transmembrane domains to release a soluble TGFalpha-like growth factor. Cleavage occurs in the Golgi, following translocation of the substrates from the endoplasmic reticulum membrane by Star, another transmembrane protein. The growth factors are then able to activate the epidermal growth factor receptor [, ]. Few substrates of mammalian rhomboid homologues have been determined, but rhomboid-like protein 2 (MEROPS S54.002) has been shown to cleave ephrin B3 []. Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite. In Saccharomyces cerevisiae (Baker's yeast) the Pcp1 (MDM37) protein (MEROPS S54.007) is a mitochondrial endopeptidase required for the activation of cytochrome c peroxidase and for the processing of the mitochondrial dynamin-like protein Mgm1 [, ]. Mutations in Pcp1 result in cells have fragmented mitochondria, which have very few short tubulues []. This entry represents the 6 transmembrane helix rhomboid domain.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=99.84 E-value=2.3e-21 Score=168.72 Aligned_cols=141 Identities=26% Similarity=0.336 Sum_probs=103.0
Q ss_pred CccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHH
Q psy8269 216 QYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMA 295 (386)
Q Consensus 216 ~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~G 295 (386)
++|+||++|++|+|.|+.|+++|++.++.+|..+|+.+|+.+++.+|+++++++++.+.++.. ...+.+|+||+++|
T Consensus 3 ~~~~wrl~T~~f~h~~~~hl~~n~~~l~~~g~~lE~~~G~~~~~~~~l~~~~~~~l~~~~~~~---~~~~~~G~Sg~~~~ 79 (145)
T PF01694_consen 3 NGQWWRLFTSPFVHANFLHLLFNLLALWFFGSLLERRLGSRRFLALYLLSGLLGSLLSLLFSP---PNQPYVGASGAVFG 79 (145)
T ss_dssp CC-TTHHHHGGG--SSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----S-----SSHHHHHH
T ss_pred CCcchhhhHHHHHccCHHHHHHHHHHHHHhhhhHhhhccchHHHHHHHHHHHhhhhccccccc---cccccCCCcccchH
Confidence 569999999999999999999999999999999999999999999999999999999988772 22279999999999
Q ss_pred HHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhhc
Q psy8269 296 VLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQT 365 (386)
Q Consensus 296 ll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~~ 365 (386)
++++.+...|+.+........ ........+........ .+++++.+|++|+++|++++...+++
T Consensus 80 l~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~--~~~~~~~~hl~G~~~G~~~~~~~~~~ 143 (145)
T PF01694_consen 80 LLGAFLFLYPQNKKRLRFIYL----ALVVPIIVLVIILLLGF--IPNISFLGHLGGFLAGLLYGFLILRR 143 (145)
T ss_dssp HHHHHHHHHHCCCCCS---HC----CCCCCCCCCCHHHCTSS--SSTTTHHHHHHHHHHHHHHHHHHCH-
T ss_pred HHHHHHHHHhhccchhhcchH----HHHHHHHHHHHHHHHHH--HHhHHHHHHHHHHHHHHHHHHHHHHc
Confidence 999999988877544332000 00000001111111111 57999999999999999999987653
No 7
>KOG2289|consensus
Probab=99.83 E-value=1.3e-21 Score=190.93 Aligned_cols=142 Identities=23% Similarity=0.213 Sum_probs=109.2
Q ss_pred CCC-CccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHH
Q psy8269 213 NPD-QYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASG 291 (386)
Q Consensus 213 ~p~-~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASG 291 (386)
.|. ++|+||++|++|+|+|+.||++||+.+.++|..+|..+|..|+.++|++||++||+++.++. +..++|||||
T Consensus 112 ~~~~r~E~WRllTym~LHaGi~HL~~N~~~ql~iGi~LE~~~G~~RiglIYl~gg~aGSlls~l~d----~~~~sVGASg 187 (316)
T KOG2289|consen 112 KPVHRGELWRLLTYMWLHAGIFHLLLNMLSQLFIGIPLEQVHGFLRIGLIYLAGGVAGSLLSSLFD----PNSISVGASG 187 (316)
T ss_pred ChhhhchhHHHHHHHHHhcCHHHHHHHHHHHHhccccHHhhcCceEEeeehhhhhhhhHHHHHHhc----cCCceecccH
Confidence 444 78999999999999999999999999999999999999999999999999999999999987 5778999999
Q ss_pred HHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 292 AIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 292 aI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
++|||+|+....--..+-... -+..+...+..++.+++.. ...+.+++++|+||++.|..+++....
T Consensus 188 gvfaLlgA~Ls~l~~Nw~~m~----~~~~~l~~ll~Ii~i~l~~--G~~~~~~~~~h~gg~~~G~~~~fil~~ 254 (316)
T KOG2289|consen 188 GVFALLGAHLSNLLTNWTIMK----NKFAALRTLLIIIFINLDL--GFAPYVDNFAHIGGLLAGFLLGFVLHI 254 (316)
T ss_pred HHHHHHHHHHHHHHhhHHHhc----chHHHHHHHHHHHHHHHhh--ccccceeccccccccCCCcchhHHhhh
Confidence 999999998753211111000 0122223333333333321 123678889999999999999987664
No 8
>KOG1657|consensus
Probab=99.72 E-value=1.6e-18 Score=163.54 Aligned_cols=104 Identities=59% Similarity=0.891 Sum_probs=94.5
Q ss_pred cccchhhhhHHhhhhhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccch
Q psy8269 91 GRSFFGSVLEVQGALSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQR 170 (386)
Q Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~ 170 (386)
.++||.... .+.+...|++.+.+|++|||||||.|++|+||++|||++++||||+|+.+++.|+|.++++.+|+++++
T Consensus 53 l~~fw~~~~--~e~e~~~d~~~~~lPlaRiKkimK~dedv~mI~~Eapvl~aka~E~Fi~elt~~sw~~Tee~~rrtl~~ 130 (236)
T KOG1657|consen 53 LQSFWSKQA--IEKEGQLDFKNHILPLARIKKIMKSDEDVSMITAEAPVLFAKACELFITELTLRSWVHTEENKRRTLQK 130 (236)
T ss_pred HHhhhhccc--cccccccchhhccCcHhhccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhhhhcccccccchH
Confidence 356777444 224567899999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhHHHHHHHHHHHHHHHhhccccccc
Q psy8269 171 NDIAMAITKYDMFDFLIDIVPREEVK 196 (386)
Q Consensus 171 ~DIa~aI~~i~i~~FL~~ivp~~~~~ 196 (386)
+|++.++....+|+||+|++|.....
T Consensus 131 sdia~av~~s~~fdFL~DivP~~~~~ 156 (236)
T KOG1657|consen 131 SDIAAAVTQSETFDFLRDIVPRKILA 156 (236)
T ss_pred HHHHHHhccCCCccceeccccchhcc
Confidence 99999999999999999999997544
No 9
>KOG2632|consensus
Probab=99.64 E-value=1.9e-15 Score=142.93 Aligned_cols=182 Identities=14% Similarity=0.140 Sum_probs=134.4
Q ss_pred ccccccchhhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCCCccccceeecccccCChhHHHHHHHHH
Q psy8269 163 NKRRTLQRNDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPDQYLSVPMLLSTFSHQSPLHIFANMFVL 242 (386)
Q Consensus 163 ~kr~~iq~~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~~g~~wrllTs~FlH~~~~HLl~Nm~~L 242 (386)
..+.+......+.++...|+..|++-..+... +..........+.|.||++||+++|.+..|+++||+.+
T Consensus 7 ~~~~~~~~p~~ts~~~~~~~~i~lv~~~~~i~----------~~~~l~~~~l~~~ql~RL~Ty~l~H~s~~hllfnmlaL 76 (258)
T KOG2632|consen 7 VGQFWMKIPLLTSIVVVLAILIYLVSFFPGIV----------EVLGLPSELLINWQLYRLITYALVHLSLPHLLFNMLAL 76 (258)
T ss_pred CccccccchHHHHHHHHHHHHHHHHhccchhh----------hHhcCCHHHhhhHHHHHHHHHHHHhccHHHHHHHHHHH
Confidence 34445666677888899999999998887642 11112223344679999999999999999999999999
Q ss_pred HHHHHHHHhhhc-hhHHHHHHHHHHHHHHHHHHHHHhh--cCC---CcceechHHHHHHHHHHHHHHcCCCcceeEechh
Q psy8269 243 HSFMPSSVEDLG-KEQFVGFYLTAGVVASLLSYVHKIL--VRK---PGLSIGASGAIMAVLAHTCITHPDTELGILFVPY 316 (386)
Q Consensus 243 ~~fG~~lE~~~G-~~rfl~lyl~sgv~gsl~s~l~~~~--~~~---~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~ 316 (386)
|.+|...|+.+| +.+++.+....++.++++.+++... ..+ ...++|.||..|++++..+...|.....++..
T Consensus 77 ~~~g~~fE~~~G~t~~~l~~~~llalf~gIl~ll~~~~~~~~d~~~~~~a~G~s~v~Fam~~~~~~~sp~r~~~~fg~-- 154 (258)
T KOG2632|consen 77 WPLGSQFERTHGTTVRILMFTVLLALFSGILYLLAYHVFLLSDLVYVEGAIGFSGVLFAMMAVLEVQSPVRSRSVFGL-- 154 (258)
T ss_pred HhchhHHHhhccceehHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcccccccHHHHHHHHHHhhcCcccchhhccc--
Confidence 999999999999 8899999999999999988877641 111 24579999999999999998888877433321
Q ss_pred hhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHH
Q psy8269 317 VRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSK 360 (386)
Q Consensus 317 ~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~ 360 (386)
+..+++......++... ...++.++.+|++|+++|+.+++
T Consensus 155 ~siP~~l~Pw~lLi~~~----~lvp~aSFlghl~GllvG~ay~~ 194 (258)
T KOG2632|consen 155 FSIPIVLAPWALLIATQ----ILVPQASFLGHLCGLLVGYAYAF 194 (258)
T ss_pred ccccHHHHHHHHHHHHH----HHccCchHHHHHHHHHHHHHHHH
Confidence 11122222222222221 11378999999999999999998
No 10
>KOG2290|consensus
Probab=99.58 E-value=5.9e-16 Score=154.54 Aligned_cols=136 Identities=20% Similarity=0.240 Sum_probs=108.2
Q ss_pred cccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHH
Q psy8269 218 LSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVL 297 (386)
Q Consensus 218 ~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll 297 (386)
|.+|++||.|+|+++.|++..|.+++.+...+|+..|+.|..++|++||+.||+++.++. ++.+.||.||+-+|++
T Consensus 450 QfYRL~~SLFlHagviH~~vSi~FQm~vmrdlEkL~g~~riAIiy~~SGitGNLASAIFl----pY~~eVgPa~sQ~Gil 525 (652)
T KOG2290|consen 450 QFYRLWLSLFLHAGVIHLLVSICFQMTVMRDLEKLAGWHRIAIIYFLSGITGNLASAIFL----PYRAEVGPAGSQFGIL 525 (652)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhcchhhheeeecccccccchheeee----ccccccCCcccccchH
Confidence 789999999999999999999999999999999999999999999999999999999887 5789999999999999
Q ss_pred HHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 298 AHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 298 ~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
+.+.+-....+..+- .+..+..-+...+++..+|+ .|.||++||+.|+++|++..+.+..
T Consensus 526 a~l~vEl~qs~~il~----~~w~a~~~Lia~~L~L~iGl---iPWiDN~aHlfG~i~GLl~s~~~~P 585 (652)
T KOG2290|consen 526 ACLFVELFQSWQILE----RPWRAFFHLIATLLVLCIGL---IPWIDNWAHLFGTIFGLLTSIIFLP 585 (652)
T ss_pred HHHHHHHHhhhHhhh----hHHHHHHHHHHHHHHHHhcc---ccchhhHHHHHHHHHHHHHHHHhhc
Confidence 998775444332111 11222111222222222333 3889999999999999999987654
No 11
>PF00808 CBFD_NFYB_HMF: Histone-like transcription factor (CBF/NF-Y) and archaeal histone; InterPro: IPR003958 The CCAAT-binding factor (CBF) is a mammalian transcription factor that binds to a CCAAT motif in the promoters of a wide variety of genes, including type I collagen and albumin. The factor is a heteromeric complex of A and B subunits, both of which are required for DNA-binding [, ]. The subunits can interact in the absence of DNA-binding, conserved regions in each being important in mediating this interaction. The A subunit can be split into 3 domains on the basis of sequence similarity, a non-conserved N-terminal 'A domain'; a highly-conserved central 'B domain' involved in DNA-binding; and a C-terminal 'C domain', which contains a number of glutamine and acidic residues involved in protein-protein interactions []. The A subunit shows striking similarity to the HAP3 subunit of the yeast CCAAT-binding heterotrimeric transcription factor [, ]. The Kluyveromyces lactis HAP3 protein has been predicted to contain a 4-cysteine zinc finger, which is thought to be present in similar HAP3 and CBF subunit A proteins, in which the third cysteine is replaced by a serine []. This domain is found in the CCAAT transcription factor and archaeal histones.; GO: 0043565 sequence-specific DNA binding, 0005622 intracellular; PDB: 1F1E_A 2BYM_D 2BYK_D 1HTA_A 1B67_A 1JFI_B 1KU5_B 1N1J_A 1BFM_A 1B6W_A ....
Probab=99.18 E-value=3.8e-11 Score=91.72 Aligned_cols=64 Identities=44% Similarity=0.538 Sum_probs=59.0
Q ss_pred cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269 114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI 177 (386)
Q Consensus 114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI 177 (386)
.+|+++||||||.++++.++++|+..++.+++|.|+++++..++..+.+++|++++..||..++
T Consensus 2 ~lP~a~vkri~k~~~~~~~vs~ea~~~i~~a~e~Fi~~l~~~A~~~a~~~~rkti~~~Dv~~Av 65 (65)
T PF00808_consen 2 SLPLARVKRIMKSDPDVMRVSKEAVEAIAKAAEEFIQYLAKEANEIAQRDKRKTITYEDVAKAV 65 (65)
T ss_dssp SS-HHHHHHHHHHTSTTSEE-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSSEE-HHHHHHHH
T ss_pred CCChHHHHHHhccCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccCHHHHHHHC
Confidence 5899999999999999999999999999999999999999999999999999999999998875
No 12
>KOG1659|consensus
Probab=99.04 E-value=2.9e-10 Score=104.02 Aligned_cols=84 Identities=29% Similarity=0.479 Sum_probs=79.0
Q ss_pred hhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269 109 DLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID 188 (386)
Q Consensus 109 ~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~ 188 (386)
+......|.+|||||||.|++|.+|+.-+|+++++|.|+|++.+..++...++.++.++++.+.+..+|-....||||.+
T Consensus 8 ~~~~trfp~aRiKKIMQ~dEdIGKvaqavPViisralElFl~~l~~~t~~~t~~~~aKt~s~~hlkq~v~~~~~FdFLk~ 87 (224)
T KOG1659|consen 8 KKYKTRFPPARIKKIMQSDEDIGKVAQAVPVIISRALELFLESLLQKTLEITRSRGAKTVSSSHLKQAVESDPKFDFLKE 87 (224)
T ss_pred hhhhccCCHHHHHHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHhcCccccCHHHHHHHHhccchhHHHHH
Confidence 34455689999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hccc
Q psy8269 189 IVPR 192 (386)
Q Consensus 189 ivp~ 192 (386)
++..
T Consensus 88 ~v~~ 91 (224)
T KOG1659|consen 88 VVEK 91 (224)
T ss_pred HHHh
Confidence 7654
No 13
>COG5247 BUR6 Class 2 transcription repressor NC2, alpha subunit (DRAP1 homolog) [Transcription]
Probab=98.96 E-value=1.2e-09 Score=88.91 Aligned_cols=79 Identities=27% Similarity=0.463 Sum_probs=75.3
Q ss_pred cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhhccc
Q psy8269 114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDIVPR 192 (386)
Q Consensus 114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~ivp~ 192 (386)
-.|.+|+|||||+|+|+.++++..|++.+||.|+|++++...+-..++....+.++.+.+..++-+.+-||||.+++..
T Consensus 23 rFP~ar~KkIMQ~deDiGKV~q~tPVIaskalE~Fl~~iv~~s~k~aR~~~skR~t~e~lk~a~~sdekFdFL~~~~~~ 101 (113)
T COG5247 23 RFPIARLKKIMQLDEDIGKVGQSTPVIASKALEMFLTEIVGLSLKEARKKSSKRMTSEFLKRATESDEKFDFLKNMEQF 101 (113)
T ss_pred cCCHHHHHHHHHhhhhhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhccchhhHHHHHHHHHhhhHHHHHHHHHHHh
Confidence 4799999999999999999999999999999999999999999999999999999999999999999999999998754
No 14
>PF08551 DUF1751: Eukaryotic integral membrane protein (DUF1751); InterPro: IPR013861 This entry is found in eukaryotic integral membrane proteins. Q12239 from SWISSPROT, a Saccharomyces cerevisiae (Baker's yeast) protein, has been shown to localise COP II vesicles [].
Probab=98.42 E-value=1.9e-07 Score=77.42 Aligned_cols=82 Identities=16% Similarity=0.235 Sum_probs=67.6
Q ss_pred cccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhh----cCCC----cceech
Q psy8269 218 LSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKIL----VRKP----GLSIGA 289 (386)
Q Consensus 218 ~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~----~~~~----~~~vGA 289 (386)
.+|+++|+.|++.+++.+++|...++..|+.+|+.||+++++.++++.++++|+...+.... .+.. .+.-|.
T Consensus 7 ~pWtl~T~~fve~~i~~~l~~~~~l~~~g~~lE~~WGs~E~lkFi~vv~~~tnl~~~~~~~~~y~i~~~~~~l~~~i~G~ 86 (99)
T PF08551_consen 7 YPWTLFTAGFVETNIIGLLFSLLTLFYGGRYLEPIWGSREFLKFILVVNVITNLLTFLLYLLLYAITGNESYLFVPISGF 86 (99)
T ss_pred ehHHHHHHHHHHhHHHHHHHHHHHHHHhhHHHHHhcChHHHHHHHHHHHHHhHHHHHHHHHHHHHHhCCCceeEEEecCc
Confidence 58999999999999999999999999999999999999999999999999998877665422 2222 345677
Q ss_pred HHHHHHHHHH
Q psy8269 290 SGAIMAVLAH 299 (386)
Q Consensus 290 SGaI~Gll~a 299 (386)
.|.+.|++.+
T Consensus 87 ~~~~~g~lVa 96 (99)
T PF08551_consen 87 MGVLAGFLVA 96 (99)
T ss_pred HHhHhheEEE
Confidence 7777776544
No 15
>KOG1658|consensus
Probab=98.32 E-value=3.5e-07 Score=80.16 Aligned_cols=82 Identities=43% Similarity=0.584 Sum_probs=77.2
Q ss_pred hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhhc
Q psy8269 111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDIV 190 (386)
Q Consensus 111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~iv 190 (386)
+..-+|+.|||.+|+++|++.|.-+|+..++.||.|.|++++...+++|+..+++.++++.|+.++|...+-+-|+-...
T Consensus 56 ~l~rLpL~rik~vvkl~pdl~l~~dea~~l~a~aaelfi~~Ln~~~~~~~q~~k~kt~qr~d~D~ai~~~de~~fle~~~ 135 (162)
T KOG1658|consen 56 SLSRLPLARIKQVVKLDPDLTLLNDEASQLIAKAAELFIQELNDVAYTTAQLRKRKTEQRRDYDTAIEAVDEFAFLEGAL 135 (162)
T ss_pred hhhhccHHHHHhhccCCcchhhhhhHHHHHHHHHHHHHHHHHHhccchhHHHHHhhhhhhhcccccccchHHHHHHhhhc
Confidence 34458999999999999999999999999999999999999999999999999999999999999999999999998776
Q ss_pred cc
Q psy8269 191 PR 192 (386)
Q Consensus 191 p~ 192 (386)
+.
T Consensus 136 d~ 137 (162)
T KOG1658|consen 136 DT 137 (162)
T ss_pred cc
Confidence 65
No 16
>PF04511 DER1: Der1-like family; InterPro: IPR007599 The endoplasmic reticulum (ER) of the yeast Saccharomyces cerevisiae (Baker's yeast) contains a proteolytic system able to selectively degrade misfolded lumenal secretory proteins. For examination of the components involved in this degradation process, mutants were isolated. They could be divided into four complementation groups. The mutations led to stabilisation of two different substrates for this process, and the classes were called der for degradation in the ER. DER1 was cloned by complementation of the der1-2 mutation. The DER1 gene codes for a novel, hydrophobic protein that is localized to the ER. Deletion of DER1 abolished degradation of the substrate proteins, suggesting that the function of the Der1 protein may be specifically required for the degradation process associated with the ER []. Interestingly this family seems distantly related to the Rhomboid family of membrane peptidases. This family may also mediate degradation of misfolded proteins.
Probab=98.23 E-value=1.7e-05 Score=73.43 Aligned_cols=137 Identities=17% Similarity=0.150 Sum_probs=93.7
Q ss_pred CccccceeecccccCCh-hHHHHHHHHHHHHHHHHHhh-hc-h-hHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHH
Q psy8269 216 QYLSVPMLLSTFSHQSP-LHIFANMFVLHSFMPSSVED-LG-K-EQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASG 291 (386)
Q Consensus 216 ~g~~wrllTs~FlH~~~-~HLl~Nm~~L~~fG~~lE~~-~G-~-~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASG 291 (386)
++|+||++|+.|.-++. .++++|++.++..+..+|+. ++ + ..++...+.+++.-.+.+.+.... .-+.+.+| ++
T Consensus 39 ~~q~WRl~Tsff~~g~~~~~~l~~~~~l~~~s~~LE~~~f~~~~ady~~~ll~~~~~i~~~~~~~~~~-~~~~~~l~-~~ 116 (197)
T PF04511_consen 39 KFQIWRLFTSFFYFGPFSLNFLFNLYFLYQYSSSLEEGHFQGRSADYLWFLLFGASLILILSLLIGPY-FFNIPFLG-SS 116 (197)
T ss_pred hcCceeeEEEEEEEcCCCHHHHHHHHHHHHHhhHhccCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc-hhHHHHHH-HH
Confidence 57999999999975444 68999999999999999998 32 2 456666665555555555443210 00112232 44
Q ss_pred HHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHH
Q psy8269 292 AIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKY 361 (386)
Q Consensus 292 aI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~ 361 (386)
.+..++-.++-..|+.++.++++ ++++++++..++++++++.. +-+...++-|+++|-++-++
T Consensus 117 l~~~l~Y~wsr~np~~~v~~~g~--~~i~a~ylP~~~~~~~~l~~-----~~~~~~~l~Gi~~Ghly~fl 179 (197)
T PF04511_consen 117 LSFALTYIWSRKNPNAQVSFFGL--FTIKAKYLPWVLLAFSLLFG-----GSSPIPDLLGILVGHLYYFL 179 (197)
T ss_pred HHHHHHHHHHHhCcccceeeEEE--EEEChhhHHHHHHHHHHHhC-----CCcHHHHHHHHHHHHHHHHH
Confidence 56666666777789999887764 35577777777777776531 22455899999999998865
No 17
>KOG2890|consensus
Probab=98.06 E-value=1.3e-05 Score=77.83 Aligned_cols=154 Identities=18% Similarity=0.187 Sum_probs=105.2
Q ss_pred cCCCC--ccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHH----HHhhcCCC--
Q psy8269 212 LNPDQ--YLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYV----HKILVRKP-- 283 (386)
Q Consensus 212 l~p~~--g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l----~~~~~~~~-- 283 (386)
+.|.+ ..+|+++|+.|+-.+++-.+.|...|.+-|..+|+.||+..++.+|.+.-...+++..+ .++++...
T Consensus 58 lvp~~~~~~~WtliTs~fie~~vw~V~~sv~~L~v~G~~lEp~Wg~~e~lkff~ivn~~~~l~v~v~~~l~Y~it~n~v~ 137 (326)
T KOG2890|consen 58 LVPNALFFFPWTLITSGFIELNVWDVLVSVLTLSVGGKFLEPNWGSLELLKFFAIVNGSTTLVVLVPALLLYMITDNHVY 137 (326)
T ss_pred cCCcchhhhhHHHHhcchhhhhHHHHHHHHHheeecceeeccCCCCHHHHHHHHHhhchhHHHHHHHHHHHHHHhcCceE
Confidence 34443 36999999999999999999999999999999999999999999887765555544332 23333322
Q ss_pred --cceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHH
Q psy8269 284 --GLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKY 361 (386)
Q Consensus 284 --~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~ 361 (386)
.+..|+.|...|++.++-..+|+..+... +.-++..+.+....+.+.++..+ -...+.+.+.-+.+|.+.+|.
T Consensus 138 L~~~i~G~~gilaGilVa~kQllpd~~il~~--~~~r~~~~~lP~~~l~~~~il~i---~~f~~f~~l~s~~~g~~~sWt 212 (326)
T KOG2890|consen 138 LYIPIHGTTGILAGILVAWKQLLPDTIILEL--KSGRFLYAHLPLLVLFLSLILSI---ITFLVFASLPSITFGVLVSWT 212 (326)
T ss_pred EEEEeccchHHHHHHHHHHHHHcCceeEEec--cchhhhhhhCCHHHHHHHHHHHH---HHHHHhhhhHHHHHhhhhhhh
Confidence 35889999999999999999999764332 22223322222222222111111 134566777777888888887
Q ss_pred hhhcccccc
Q psy8269 362 GEQTWAHRA 370 (386)
Q Consensus 362 ~~~~w~~r~ 370 (386)
+.|..++++
T Consensus 213 YLRfyq~h~ 221 (326)
T KOG2890|consen 213 YLRFYQRHP 221 (326)
T ss_pred hheecccCC
Confidence 766555554
No 18
>KOG0858|consensus
Probab=97.82 E-value=7.8e-05 Score=70.41 Aligned_cols=132 Identities=14% Similarity=0.146 Sum_probs=91.4
Q ss_pred CCccccceeecccccCC-hhHHHHHHHHHHHHHHHHHhhhc---hhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechH
Q psy8269 215 DQYLSVPMLLSTFSHQS-PLHIFANMFVLHSFMPSSVEDLG---KEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGAS 290 (386)
Q Consensus 215 ~~g~~wrllTs~FlH~~-~~HLl~Nm~~L~~fG~~lE~~~G---~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGAS 290 (386)
.+.|+||++|+.+.-.. -+|.++||++++--++.+|+-.= +..|+.+++.++++-.+.++... ...+| +
T Consensus 49 kk~QiWRliTs~lyfg~~gf~fl~n~~FlyrY~~~LE~g~f~~rtadf~~mllf~~~l~~~~~~~~~------~~fLg-~ 121 (239)
T KOG0858|consen 49 KKFQIWRLITSFLYFGPFGFDFLMNLYFLYRYSSMLEEGSFRGRTADFLYMLLFGAVLLTLTGLFVY------IVFLG-Q 121 (239)
T ss_pred hHhHHHHhhhhhheeccccHHHHHHHHHHHHHHHHHhcCCCCCchhHHHHHHHHHHHHHHHHHHHHH------HHHHH-H
Confidence 48899999999987765 79999999999999999999742 25677777777777776555433 12233 2
Q ss_pred HHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHH
Q psy8269 291 GAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKY 361 (386)
Q Consensus 291 GaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~ 361 (386)
+.++.+.=..+...|+..+..++ .+++++.++...+++++.+. + +.+ ..-+-|++.|-+|.++
T Consensus 122 ~l~~~l~YvWs~~Np~~~v~F~g--~~~f~a~YlPwvll~fs~l~---g--~~~-~~dllGi~~GHiy~fl 184 (239)
T KOG0858|consen 122 SLVFMLVYVWSKRNPDVIVSFFG--LITFKAPYLPWVLLGFSFLF---G--GSI-LVDLLGIIVGHIYYFL 184 (239)
T ss_pred HHHHHHHHHHHhhCCCceEEEEE--EecCccccchHHHHHHHHHh---C--Cch-HHHHHhhhhheeEEEE
Confidence 33444444445556776666554 35778888887777777653 1 223 6778888999776653
No 19
>COG2036 HHT1 Histones H3 and H4 [Chromatin structure and dynamics]
Probab=96.17 E-value=0.0092 Score=48.74 Aligned_cols=74 Identities=23% Similarity=0.312 Sum_probs=65.4
Q ss_pred hhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269 105 LSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK 179 (386)
Q Consensus 105 ~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~ 179 (386)
++.+......+|.+.+++||+..-.- .++.||...+.++.|.|..+++..+-.|+.-.+|+|++..||..++-.
T Consensus 10 r~~~~~~~~~Lp~apv~Ri~r~~~~~-Rvs~~A~~~l~~~~e~~~~~i~~~A~~~A~ha~RKTV~~~DI~la~~~ 83 (91)
T COG2036 10 RRYQRSTDLLLPKAPVRRILRKAGAE-RVSSSAIEELQEALEEYLEEIAEDAVELAEHAKRKTVKAEDIKLALKR 83 (91)
T ss_pred HhhhhhhhhhcCchHHHHHHHHHhHH-HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeecHHHHHHHHHH
Confidence 45566677789999999999985433 999999999999999999999999999999999999999999887744
No 20
>cd00074 H2A Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
Probab=95.78 E-value=0.036 Score=47.23 Aligned_cols=74 Identities=20% Similarity=0.349 Sum_probs=67.1
Q ss_pred CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269 115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID 188 (386)
Q Consensus 115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~ 188 (386)
.|++||+++|+.+..-.-|+.+|+......+|-+..|+...+...+...+++.|++.||..+|-...-++.|..
T Consensus 21 fPV~ri~R~Lk~~~~a~RVs~~A~VyLaAvLEYL~aEIlelA~n~ak~~k~krItp~hi~lAi~nD~EL~~L~~ 94 (115)
T cd00074 21 FPVGRIHRYLKKGRYAERVGAGAPVYLAAVLEYLTAEVLELAGNAARDNKKKRITPRHLQLAVRNDEELNKLLK 94 (115)
T ss_pred CcHHHHHHHHHcCccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEcHHHHHHHHhccHHHHHHHc
Confidence 79999999999877778999999999999999999999999999999999999999999999977666665553
No 21
>PF00125 Histone: Core histone H2A/H2B/H3/H4 histone h2a signature histone h2b signature histone h3 signature histone h4 signature; InterPro: IPR007125 The core histones together with some other DNA binding proteins appear to form a superfamily defined by a common fold and distant sequence similarities [, ]. Some proteins contain local homology domains related to the histone fold [].; GO: 0003677 DNA binding; PDB: 2YFW_D 2YFV_B 1U35_H 2F8N_D 2PYO_D 2NQB_D 3AN2_C 3AZJ_C 3AV1_G 3AZM_G ....
Probab=95.46 E-value=0.037 Score=42.84 Aligned_cols=64 Identities=25% Similarity=0.314 Sum_probs=58.0
Q ss_pred cCchHHHHHhhhcCcccc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269 114 ALPLARIKKIMKLDDGVK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI 177 (386)
Q Consensus 114 ~~p~~rik~im~~~~~v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI 177 (386)
..|..|+.|-+..+-+-+ .|+.++......++|-+..++...+..++...||.+++..||..++
T Consensus 8 ~~~~~r~~r~i~~~~~~~~ris~~a~~~L~~~~E~~~~~il~~A~~~a~~~kR~tI~~~DI~~A~ 72 (75)
T PF00125_consen 8 KFPFSRLLREIGEEILSKYRISSEALVALQSVLEYLLVEILEEAGNLARHAKRKTITPRDIQLAV 72 (75)
T ss_dssp HHHHHHHHHHHHHTTSSSSEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTBSEEGHHHHHHHH
T ss_pred ceEEeeeeehhhcccccccccccccchhhhhhhhhhhhhhhhHHHHHHhhcCCcEecHHHHHHHH
Confidence 467788888888877775 9999999999999999999999999999999999999999998876
No 22
>KOG0869|consensus
Probab=95.21 E-value=0.052 Score=48.24 Aligned_cols=72 Identities=25% Similarity=0.306 Sum_probs=63.1
Q ss_pred ccCchHHHHHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHH
Q psy8269 113 QALPLARIKKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFD 184 (386)
Q Consensus 113 ~~~p~~rik~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~ 184 (386)
+-+|+++|-||||..= .-.+|+.||.-.+-.-+--||..+|-.|-.-+++.||+||+-.||--|+..+..-+
T Consensus 31 r~LPIANV~RIMK~~lP~naKIsKDAKE~vQECVSEfISFvT~EAsekC~~EkRKTIngdDllwAm~tLGFe~ 103 (168)
T KOG0869|consen 31 RFLPIANVSRIMKKALPANAKISKDAKETVQECVSEFISFVTGEASEKCQREKRKTINGDDLLWAMSTLGFEN 103 (168)
T ss_pred hhccHHHHHHHHHhcCCcccccchHHHHHHHHHHHHHHHHHhhHHHHHHHHHhcCcccHHHHHHHHHHcCcHh
Confidence 4599999999999874 55599999999999999999999999999999999999999999988776654433
No 23
>smart00803 TAF TATA box binding protein associated factor. TAFs (TATA box binding protein associated factors) are part of the transcription initiation factor TFIID multimeric protein complex. TFIID is composed of the TATA box binding protein (TBP) and a number of TAFs. The TAFs provide binding sites for many different transcriptional activators and co-activators that modulate transcription initiation by Pol II. TAF proteins adopt a histone-like fold.
Probab=95.19 E-value=0.058 Score=41.27 Aligned_cols=63 Identities=32% Similarity=0.385 Sum_probs=57.7
Q ss_pred cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269 114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI 177 (386)
Q Consensus 114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI 177 (386)
.+|.+-||++.+.. |+.-++.|+-.......|-++.++...+-.+.+-.+|++++..||..++
T Consensus 2 ~~p~~~i~ria~~~-Gi~ris~~a~~~l~~~~e~rl~~i~~~A~k~~~hakRktlt~~DI~~Al 64 (65)
T smart00803 2 WLPKETIKDVAESL-GIGNLSDEAAKLLAEDVEYRIKEIVQEALKFMRHSKRTTLTTSDIDSAL 64 (65)
T ss_pred CCCHHHHHHHHHHC-CCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCeecHHHHHHHh
Confidence 37889999999865 7888999999999999999999999999999999999999999998764
No 24
>cd00076 H4 Histone H4, one of the four histones, along with H2A, H2B and H3, which forms the eukaryotic nucleosome core; along with H3, it plays a central role in nucleosome formation; histones bind to DNA and wrap the genetic material into "beads on a string" in which DNA (the string) is wrapped around small blobs of histones (the beads) at regular intervals; play a role in the inheritance of specialized chromosome structures and the control of gene activity; defects in the establishment of proper chromosome structure by histones may activate or silence genes aberrantly and thus lead to disease; the sequence of histone H4 has remained almost invariant in more than 2 billion years of evolution
Probab=95.17 E-value=0.067 Score=43.19 Aligned_cols=67 Identities=21% Similarity=0.300 Sum_probs=61.1
Q ss_pred hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHH
Q psy8269 111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAIT 178 (386)
Q Consensus 111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~ 178 (386)
..+.+|.+-|+++.+.+ |+.-||.|+-.-.....+.|..++...+-.+++-.+|+|++..||..++-
T Consensus 10 ~~~gi~k~~I~RLarr~-GvkRIS~d~y~e~~~~l~~~l~~I~~dav~ya~Ha~RKTVt~~DV~~alk 76 (85)
T cd00076 10 NIKGITKPAIRRLARRG-GVKRISGGVYDEVRNVLKSYLEDVIRDAVTYTEHAKRKTVTAMDVVYALK 76 (85)
T ss_pred hhccCCHHHHHHHHHHc-CcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCcHHHHHHHHH
Confidence 34569999999999976 69999999999999999999999999999999999999999999988774
No 25
>PLN00035 histone H4; Provisional
Probab=94.81 E-value=0.093 Score=43.82 Aligned_cols=66 Identities=23% Similarity=0.315 Sum_probs=60.9
Q ss_pred ccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269 113 QALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK 179 (386)
Q Consensus 113 ~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~ 179 (386)
+.+|.+-|+++.+.. |+.-||.|+-..+..+.|.|..++...+-.+++-.+|+|++..||..++-.
T Consensus 28 ~~ipk~~IrRLARr~-GvkRIS~~ay~elr~vle~~l~~I~~dav~ya~HA~RKTV~~~DV~~Alkr 93 (103)
T PLN00035 28 QGITKPAIRRLARRG-GVKRISGLIYEETRGVLKIFLENVIRDAVTYTEHARRKTVTAMDVVYALKR 93 (103)
T ss_pred ccCCHHHHHHHHHHc-CcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCcHHHHHHHHHH
Confidence 449999999999976 699999999999999999999999999999999999999999999888744
No 26
>PTZ00015 histone H4; Provisional
Probab=94.24 E-value=0.16 Score=42.45 Aligned_cols=68 Identities=19% Similarity=0.263 Sum_probs=61.7
Q ss_pred hhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHH
Q psy8269 110 LKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAIT 178 (386)
Q Consensus 110 ~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~ 178 (386)
.+.+.+|.+-|+++.+.+ |+.-||.|+-.-.....|.|..++...+-.+++-.+|+|++..||..++-
T Consensus 26 ~~i~gI~k~~IrRLarr~-GvkRIS~d~y~e~r~vle~~l~~I~rdav~~aeHA~RKTVt~~DV~~AlK 93 (102)
T PTZ00015 26 DNIRGITKGAIRRLARRG-GVKRISGDIYEEVRGVLKAFLENVVRDSTAYTEYARRKTVTAMDVVYALK 93 (102)
T ss_pred hcccCCCHHHHHHHHHHc-CCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccHHHHHHHHH
Confidence 344669999999999964 99999999999999999999999999999999999999999999988763
No 27
>COG5262 HTA1 Histone H2A [Chromatin structure and dynamics]
Probab=93.80 E-value=0.21 Score=42.35 Aligned_cols=78 Identities=22% Similarity=0.328 Sum_probs=69.1
Q ss_pred hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269 111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID 188 (386)
Q Consensus 111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~ 188 (386)
.--..|++||||+||.+..-+-|+.++++..+.-+|-.+-|++.-+=..+...+...+++.++..+|-.-+=.+||+-
T Consensus 23 agl~fpvgrvkr~lk~~~~~~Rig~~A~Vyl~AvleYL~aEilelAgNaA~d~kkkri~PrHlqlAIrnD~EL~~l~~ 100 (132)
T COG5262 23 AGLIFPVGRVKRLLKKGNYRMRIGAGAPVYLAAVLEYLAAEILELAGNAARDNKKKRIIPRHLQLAIRNDEELNKLLG 100 (132)
T ss_pred cCccccHHHHHHHHHcCccceeecCCcHHHHHHHHHHHHHHHHHHhhhhhhhcCcceechHHHHHHhcCcHHHHHHhh
Confidence 344589999999999887888899999999999989889999999999999999999999999999977777777764
No 28
>COG5291 Predicted membrane protein [Function unknown]
Probab=93.46 E-value=0.27 Score=47.05 Aligned_cols=45 Identities=13% Similarity=0.174 Sum_probs=35.6
Q ss_pred cCCCCccccceeecccccC-ChhHHHHHHHHHHHHHHHHHhh-hchh
Q psy8269 212 LNPDQYLSVPMLLSTFSHQ-SPLHIFANMFVLHSFMPSSVED-LGKE 256 (386)
Q Consensus 212 l~p~~g~~wrllTs~FlH~-~~~HLl~Nm~~L~~fG~~lE~~-~G~~ 256 (386)
+...+-|+||++||...-+ --+..++|++.+|--.+.+|+- +|+.
T Consensus 53 L~~k~~qiwRlfTs~~~~~~~~~d~~M~vyf~Y~yS~~LE~g~f~~~ 99 (313)
T COG5291 53 LFLKRLQIWRLFTSFLYFGKPTLDMFMHVYFLYRYSRMLEEGCFNTS 99 (313)
T ss_pred hHHHHHHHHHHHHHHHhhcCcchhHHHHHHHHHHHHHHHhccccCcc
Confidence 3445679999999876554 5678999999999999999986 4444
No 29
>smart00417 H4 Histone H4.
Probab=93.22 E-value=0.24 Score=38.98 Aligned_cols=63 Identities=22% Similarity=0.271 Sum_probs=57.4
Q ss_pred hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269 111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA 174 (386)
Q Consensus 111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa 174 (386)
..+.+|.+-|+++.+.+ |+.-||.|+-.-.....|.|..++...+-.+++-.+|+|++..||.
T Consensus 10 ~i~gI~k~~IrRLaRr~-GvkRIS~~~y~elr~vle~~l~~I~rdav~~a~ha~RKTV~~~DV~ 72 (74)
T smart00417 10 NIQGITKPAIRRLARRG-GVKRISGLIYDETRNVLKSFLENVVRDAVTYTEHARRKTVTAMDVV 72 (74)
T ss_pred hhcCCCHHHHHHHHHHc-CcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccHHHhe
Confidence 34569999999999965 9999999999999999999999999999999999999999988874
No 30
>cd07981 TAF12 TATA Binding Protein (TBP) Associated Factor 12 (TAF12) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 12 (TAF12) is one of several TAFs that bind TBP and are involved in forming the TFIID complex. TFIID is one of the seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs function such as serving as activator-bind
Probab=92.00 E-value=0.53 Score=36.60 Aligned_cols=64 Identities=16% Similarity=0.286 Sum_probs=54.8
Q ss_pred chHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269 116 PLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK 179 (386)
Q Consensus 116 p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~ 179 (386)
+-.++.+.++.-+.-.-+++|+..++....|-|+..++..+-.++.-++|.++...||..++-.
T Consensus 3 ~k~~l~~lv~~id~~~~~~~da~~~l~~~~e~fv~~v~~~a~~lAkHr~~~tv~~~Di~l~l~r 66 (72)
T cd07981 3 TKRKLQELLKEIDPREQLDPDVEELLLEIADDFVDDVVEDACRLAKHRKSDTLEVKDVQLHLER 66 (72)
T ss_pred cHHHHHHHHHhhCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHH
Confidence 3455666666665557789999999999999999999999999999999999999999887744
No 31
>smart00414 H2A Histone 2A.
Probab=89.96 E-value=0.96 Score=38.04 Aligned_cols=73 Identities=23% Similarity=0.388 Sum_probs=64.1
Q ss_pred CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHH
Q psy8269 115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLI 187 (386)
Q Consensus 115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~ 187 (386)
.|++||++.|+....-.-|+..+++......|=...|+-..+...+..++++-|++.++..+|-...-++-|.
T Consensus 10 fPVgRi~r~Lk~~~~~~Rv~~~A~VyLaAvLEYLtaEILeLagn~a~~~k~~rItp~hi~lAi~nD~EL~~L~ 82 (106)
T smart00414 10 FPVGRIHRLLRKGTYAKRVGAGAPVYLAAVLEYLTAEVLELAGNAARDNKKRRITPRHLQLAIRNDEELNKLL 82 (106)
T ss_pred CchHHHHHHHHcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchHHHhhhccCCHHHHHHH
Confidence 7999999999988777799999999999999999999999999999999999999999999996654444333
No 32
>cd08048 TAF11 TATA Binding Protein (TBP) Associated Factor 11 (TAF11) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 11 (TAF11) is one of several TAFs that bind TBP and are involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. Several hypothes
Probab=89.90 E-value=1.2 Score=35.95 Aligned_cols=71 Identities=20% Similarity=0.259 Sum_probs=60.3
Q ss_pred hhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccc---cccchhhHHHHH
Q psy8269 106 SMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKR---RTLQRNDIAMAI 177 (386)
Q Consensus 106 ~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr---~~iq~~DIa~aI 177 (386)
+=+++....+|-+.+||+|+.--+ +.+++++.++++-..-.|+.+++..|-.-.++... ..+++++|..|.
T Consensus 8 Rye~~Rra~f~k~~iKr~~~~~~~-~~v~~~v~i~v~glaKvFVGeivE~A~~V~~~~~~~~~~Pl~P~HireA~ 81 (85)
T cd08048 8 RYEMFRRSSFPKAAIKRLIQSVTG-QSVSQNVVIAVAGIAKVFVGEIVEEARDVQEEWGEANTGPLQPRHLREAY 81 (85)
T ss_pred HHHHHHHhhccHHHHHHHHHHHcC-CCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCcHHHHHHH
Confidence 445677778999999999998888 78899999999999999999999999888777666 677788877664
No 33
>KOG4463|consensus
Probab=89.57 E-value=0.24 Score=47.77 Aligned_cols=60 Identities=20% Similarity=0.351 Sum_probs=52.2
Q ss_pred ccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHH
Q psy8269 217 YLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHK 277 (386)
Q Consensus 217 g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~ 277 (386)
+|+||++.+.|.-.+-..+.+-++.+|.| +.+|+.+|+.+|..+.+.++..+-++...+.
T Consensus 49 ~qywrlL~~qF~~~n~~e~~~~l~I~Y~f-R~~ERlLGShky~~fiv~s~~~~~l~~~il~ 108 (323)
T KOG4463|consen 49 FQYWRLLMSQFAFSNTPELMFGLYILYYF-RVFERLLGSHKYSVFIVFSGTVSLLLEVILL 108 (323)
T ss_pred HHHHHHHHHHHHhcCChHHHHHHHHHHHH-HHHHHHhccccceeehhHHHHHHHHHHHHHH
Confidence 58999999999999988888888877766 6699999999999999999999888776654
No 34
>KOG0870|consensus
Probab=87.42 E-value=1.9 Score=38.86 Aligned_cols=76 Identities=28% Similarity=0.340 Sum_probs=65.0
Q ss_pred cCchHHHHHhhhcCc--cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhh
Q psy8269 114 ALPLARIKKIMKLDD--GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDI 189 (386)
Q Consensus 114 ~~p~~rik~im~~~~--~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~i 189 (386)
.+|.+-|.++.|-.. +---|+.||...++.+.-.|+..+|..|=.-+...+|++++..||..++--++--.|+..+
T Consensus 10 ~lP~AiI~rlvke~l~E~~vsisKeA~~Ai~raAtVFv~~Lts~s~e~A~~q~rKt~sadDVl~aL~Eiefs~f~~pl 87 (172)
T KOG0870|consen 10 NLPNAIITRLVKEVLPESNVSISKEARLAIARAATVFVIFLTSVSNEIAKDQKRKTISADDVLKALDEIEFSSFVNPL 87 (172)
T ss_pred hccHHHHHHHHHHhCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccHHHHHHHHHHhchHHHhhHH
Confidence 489999998887554 3347899999999999999999999999999999999999999999999777766665533
No 35
>PTZ00017 histone H2A; Provisional
Probab=86.50 E-value=1.9 Score=37.66 Aligned_cols=73 Identities=23% Similarity=0.369 Sum_probs=64.7
Q ss_pred CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHH
Q psy8269 115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLI 187 (386)
Q Consensus 115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~ 187 (386)
.|++||.+.|+...--.-|+..+++......|-..-|+...+...+..+++.-|++.++..+|-..+-++-|+
T Consensus 28 FPVgRi~R~Lk~g~~a~RV~a~A~VYLAAVLEYLtaEILELAgNaa~d~kk~RItPrHi~lAI~nDeEL~~Ll 100 (134)
T PTZ00017 28 FPVGRVHRYLKKGRYAKRVGAGAPVYLAAVLEYLTAEVLELAGNAAKDNKKKRITPRHIQLAIRNDEELNKLL 100 (134)
T ss_pred cchHHHHHHHhccchhccccccchhhhHHHHHHHHHHHHHHHHHHHHhcCCCeecHHHHHhhccCcHHHHHHH
Confidence 7999999999887666689999999999999999999999999999999999999999999996655555444
No 36
>PF09415 CENP-X: CENP-S associating Centromere protein X; InterPro: IPR018552 Centromere protein X (CENP-X) is a component of the CENP-S complex. The CENP-S complex is composed of at least of CENP-S and CENP-X and is essential for the stable assembly of the outer kinetchore []. CENP-X is also a DNA-binding component of the Fanconi anemia (FA) core complex involved in DNA damage repair and genome maintenance. The FA complex is composed of CENPS, FANCA, FANCB, FANCC, FANCE, FANCF, FANCG, FANCL/PHF9, FANCM, FAAP24 and CENPX. Interacts with CENPS, FANCM and FAAP24 [, ].; PDB: 4DRB_L 4DRA_H 3V9R_D.
Probab=85.87 E-value=2.1 Score=33.40 Aligned_cols=60 Identities=22% Similarity=0.274 Sum_probs=44.6
Q ss_pred chHHHHHhhhc--CccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccc-cchhhHHH
Q psy8269 116 PLARIKKIMKL--DDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRT-LQRNDIAM 175 (386)
Q Consensus 116 p~~rik~im~~--~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~-iq~~DIa~ 175 (386)
|..-|.|+++. .+....|++|+..+..+..+.|+.|-..||....+..+... +.-.|+..
T Consensus 1 p~~li~rll~~~f~~~~tkIs~dal~l~~eyl~iFV~EAv~Ra~~~a~~e~~~~~le~e~LEk 63 (72)
T PF09415_consen 1 PPELIARLLHEHFKDDKTKISKDALKLSAEYLRIFVREAVARAAEQAEAEGDEGFLEVEHLEK 63 (72)
T ss_dssp -CHHHHHHHCTTSSSTT-EE-CCCHHHHHHHHHHHHHHHHHHHHHHHHHTT-SSEE-HHHHHH
T ss_pred ChHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCHHHHHH
Confidence 55667888885 34566999999999999999999999999999777777666 66555544
No 37
>KOG1756|consensus
Probab=84.73 E-value=3.6 Score=35.59 Aligned_cols=77 Identities=19% Similarity=0.327 Sum_probs=68.3
Q ss_pred ccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhh
Q psy8269 113 QALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDI 189 (386)
Q Consensus 113 ~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~i 189 (386)
-.-|++||.++|+......-++.++|+..+--.|-..-+++..+=..+..+++.-+++.++..+|-.-+-..+|+.-
T Consensus 26 l~fPvgri~r~Lr~~~~~~ri~~gapV~laavLeYL~Aeile~agnaardnkk~ri~PrH~~lAI~NDeEL~~lL~~ 102 (131)
T KOG1756|consen 26 LQFPVGRIHRLLRKGRYAQRVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRITPRHLQLAIRNDEELNKLLGK 102 (131)
T ss_pred cccCHHHHHHHHHccchhhhccCCChHHHHHHHHHHHHHHHHHhHHHhhhcCccccChHHHHHHHhCcHHHHHHhcc
Confidence 34799999999999888889999999999988887788888888889999999999999999999887778888764
No 38
>PF13150 DUF3989: Protein of unknown function (DUF3989)
Probab=83.69 E-value=0.66 Score=37.46 Aligned_cols=68 Identities=22% Similarity=0.253 Sum_probs=42.6
Q ss_pred hhhHHHH----HHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhhhccC-CCCcchhhcccCCCcccc
Q psy8269 18 IPKWRKD----LRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKYFASN-PQSMGITLRLHAVSPEFD 87 (386)
Q Consensus 18 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 87 (386)
+.++++. |+..|..|+|.+|..+-++|+ +.|++-..=..-...++.=-.+ ...-.+||++...+++.|
T Consensus 4 ~~~~~~~~~~~Lr~~c~~Lsp~~R~~vvl~ml--~~fa~l~ly~~~~ai~~~Gk~~g~~~~IeHI~~l~l~~~~D 76 (85)
T PF13150_consen 4 IRKIKDRADDRLRRYCGRLSPKQRLRVVLVML--VLFAALCLYMTVSAIYDIGKEDGERMEIEHIRPLELPQPKD 76 (85)
T ss_pred HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhccCcccCccceeccccCCCCCCC
Confidence 3444444 999999999999999988887 7777766433333333332222 334456676666664433
No 39
>PLN00154 histone H2A; Provisional
Probab=83.14 E-value=3.3 Score=36.29 Aligned_cols=74 Identities=22% Similarity=0.253 Sum_probs=65.4
Q ss_pred CchHHHHHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269 115 LPLARIKKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID 188 (386)
Q Consensus 115 ~p~~rik~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~ 188 (386)
.|++||++.|+..- .-.-|+..+|+.....+|=..-|+..-+...+..++++-|++.++..+|-..+-++.|..
T Consensus 39 FPVgRi~r~Lk~g~~~~~RVga~ApVYLAAVLEYLtAEVLELAGNaA~d~kk~RItPrHi~lAIrnDeEL~~Ll~ 113 (136)
T PLN00154 39 FPVGRIHRQLKQRVSAHGRVGATAAVYTAAILEYLTAEVLELAGNASKDLKVKRITPRHLQLAIRGDEELDTLIK 113 (136)
T ss_pred CchHHHHHHHHhhhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceecHHHhhhhccCcHHHHHHhc
Confidence 79999999999976 456999999999999999999999999999999999999999999999977666665554
No 40
>KOG0871|consensus
Probab=81.23 E-value=5.7 Score=35.23 Aligned_cols=77 Identities=17% Similarity=0.211 Sum_probs=63.4
Q ss_pred hhcccCchHHHHHhhhcCcccc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHH
Q psy8269 110 LKTQALPLARIKKIMKLDDGVK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFL 186 (386)
Q Consensus 110 ~~~~~~p~~rik~im~~~~~v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL 186 (386)
.+...+|-+.|-||++--=.+. ++++|+.-++-..|--||+.++..+=.-+...+++||-..+|-.++-....-+|+
T Consensus 8 dde~sLPkAtv~KmIke~lP~d~rvakeareliincCvEFI~liSsEAneic~~e~KKTIa~EHV~KALe~LgF~eYi 85 (156)
T KOG0871|consen 8 DDELSLPKATVNKMIKEMLPKDVRVAKEARELIINCCVEFINLISSEANEICNKEAKKTIAPEHVIKALENLGFGEYI 85 (156)
T ss_pred cccccCcHHHHHHHHHHhCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhHHhcccCCHHHHHHHHHHcchHHHH
Confidence 3455699999999998544434 6799999999999999999999999999999999999999998888554443443
No 41
>COG5150 Class 2 transcription repressor NC2, beta subunit (Dr1) [Transcription]
Probab=79.61 E-value=5.1 Score=34.61 Aligned_cols=84 Identities=18% Similarity=0.237 Sum_probs=65.5
Q ss_pred hhhcccCchHHHHHhhhcCcccc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHH---HHHH
Q psy8269 109 DLKTQALPLARIKKIMKLDDGVK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKY---DMFD 184 (386)
Q Consensus 109 ~~~~~~~p~~rik~im~~~~~v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i---~i~~ 184 (386)
+.+...+|-+.+.|+.---=.+. |.+.|+.-++-.+|.-|++.++..+=..++..++++|...+|-.++-.. .-+.
T Consensus 6 ~dDe~sLPKATVqKMvS~iLp~dl~ftKearei~in~cieFi~~lsseAne~ce~EaKKTIa~EHviKALenLef~eyi~ 85 (148)
T COG5150 6 NDDENSLPKATVQKMVSSILPKDLVFTKEAREIFINACIEFINMLSSEANEACEEEAKKTIAYEHVIKALENLEFEEYIE 85 (148)
T ss_pred ccccccCcHHHHHHHHHHhccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHhccHHHHHH
Confidence 34455699999998876544555 6699999999999999999999999999999999999999998887443 3333
Q ss_pred HHHhhccc
Q psy8269 185 FLIDIVPR 192 (386)
Q Consensus 185 FL~~ivp~ 192 (386)
++.++...
T Consensus 86 ~~~e~~~n 93 (148)
T COG5150 86 SCMEEHEN 93 (148)
T ss_pred HHHHHHHH
Confidence 44444443
No 42
>smart00428 H3 Histone H3.
Probab=78.25 E-value=8.5 Score=32.34 Aligned_cols=81 Identities=23% Similarity=0.267 Sum_probs=65.3
Q ss_pred hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCcc-cc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhH
Q psy8269 97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDDG-VK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDI 173 (386)
Q Consensus 97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~~-v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DI 173 (386)
..+|+...++..|.--+.+|-.|+ |+|.+.-.. -. -++.||.....-|.|.|.-.+...+..|+.-.||-|++..|+
T Consensus 15 aLrEIr~yQkst~lLI~k~pF~RLVREI~~~~~~~~~~R~~~~Al~aLQeasE~ylv~lfeda~~~a~HAkRvTl~~kDi 94 (105)
T smart00428 15 ALREIRKYQKSTDLLIRKAPFQRLVREIAQKFTTGVDLRFQSSAIMALQEAAEAYLVGLFEDTNLLAIHAKRVTIMPKDI 94 (105)
T ss_pred HHHHHHHHccCcccccccccHHHHHHHHHHHcCCCCCceeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCcHhhH
Confidence 445665556777888888998885 677665432 11 679999999999999999999999999999999999999999
Q ss_pred HHHH
Q psy8269 174 AMAI 177 (386)
Q Consensus 174 a~aI 177 (386)
..+.
T Consensus 95 ~La~ 98 (105)
T smart00428 95 QLAR 98 (105)
T ss_pred HHHH
Confidence 6654
No 43
>PLN00156 histone H2AX; Provisional
Probab=77.31 E-value=6.1 Score=34.77 Aligned_cols=72 Identities=24% Similarity=0.338 Sum_probs=62.5
Q ss_pred CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHH
Q psy8269 115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFL 186 (386)
Q Consensus 115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL 186 (386)
.|++||.+.|+...--.=|+..+|+..+...|=..-|+...+...+..++++-|++.++..+|-..+=++-|
T Consensus 30 FPVgRi~R~Lk~g~ya~RVga~ApVYLAAVLEYLtaEVLELAgNaa~d~kk~RItPrHi~lAIrnDeEL~~L 101 (139)
T PLN00156 30 FPVGRIARFLKAGKYAERVGAGAPVYLSAVLEYLAAEVLELAGNAARDNKKNRIVPRHIQLAVRNDEELSKL 101 (139)
T ss_pred cchHHHHHHHhcCChhhccCCccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcCcHHHHHhhccCcHHHHHH
Confidence 899999999988765568899999999999999999999999999999999999999999999554444433
No 44
>PLN00157 histone H2A; Provisional
Probab=77.04 E-value=6.8 Score=34.23 Aligned_cols=70 Identities=21% Similarity=0.323 Sum_probs=61.9
Q ss_pred CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHH
Q psy8269 115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFD 184 (386)
Q Consensus 115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~ 184 (386)
.|++||++.|+...--.-|+..+++..+...|=..-|+...+...+..++++-|++.++..+|-..+-++
T Consensus 27 FPVgRi~R~Lk~g~~a~RIg~~A~VYLAAVLEYLtaEVLELAgnaa~d~kk~RItPrHi~lAI~nDeEL~ 96 (132)
T PLN00157 27 FPVGRIARYLKAGKYATRVGAGAPVYLAAVLEYLAAEVLELAGNAARDNKKSRIVPRHIQLAVRNDEELS 96 (132)
T ss_pred cchHHHHHHHhcCchhhhcCCCcHhHHHHHHHHHHHHHHHHHHHHHHhcCCccccHHHHhhcccCcHHHH
Confidence 8999999999887656688999999999999999999999999999999999999999999995544344
No 45
>PLN00153 histone H2A; Provisional
Probab=76.85 E-value=6 Score=34.43 Aligned_cols=70 Identities=24% Similarity=0.334 Sum_probs=62.1
Q ss_pred CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHH
Q psy8269 115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFD 184 (386)
Q Consensus 115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~ 184 (386)
.|++||.+.|+..---.=|+..+|+..+...|=..-|+...+...+..++++-|++.++..+|-..+=++
T Consensus 25 FpVgRi~R~Lr~g~~a~Rvga~A~VYLAAVLEYLtaEVLELAgnaa~d~kk~RItPrHi~lAI~nDeEL~ 94 (129)
T PLN00153 25 FPVGRIARYLKKGKYAERIGAGAPVYLAAVLEYLTAEVLELAGNAARDNKKNRIVPRHIQLAIRNDEELG 94 (129)
T ss_pred cchHHHHHHHhcCchhhccCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCCCccChHHHHhhccCcHHHH
Confidence 7999999999887666688999999999999999999999999999999999999999999996544444
No 46
>PF03419 Peptidase_U4: Sporulation factor SpoIIGA This family belongs to family U4 of the peptidase classification.; InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-). Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=67.73 E-value=57 Score=31.87 Aligned_cols=40 Identities=10% Similarity=0.175 Sum_probs=29.0
Q ss_pred hHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHH
Q psy8269 233 LHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVH 276 (386)
Q Consensus 233 ~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~ 276 (386)
..+++|.+.|+..+..+......+|.+ +++.+|++.+++.
T Consensus 11 ~N~~md~~lL~~t~~~~~~~~~~~Rll----~~A~~Gal~~~~~ 50 (293)
T PF03419_consen 11 VNFLMDYFLLWLTARLLKRRASRWRLL----LGAAIGALYSLLI 50 (293)
T ss_pred HHHHHHHHHHHHHHHHhcCCCcHHHHH----HHHHHHHHHHHHH
Confidence 467889999999999988888888775 4555555544443
No 47
>COG4769 Predicted membrane protein [Function unknown]
Probab=66.29 E-value=67 Score=29.33 Aligned_cols=119 Identities=21% Similarity=0.171 Sum_probs=55.9
Q ss_pred HHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCC-cceeEech
Q psy8269 237 ANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDT-ELGILFVP 315 (386)
Q Consensus 237 ~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~-~~~l~fip 315 (386)
.|..++..+ +.++....+.+-++-.+++++.+.-+. .+.-..+++|++.+.++.+.+..-+- ...+..+.
T Consensus 41 aNivvlIa~-----~~l~~~~~~~~i~lr~il~AL~sGtlf----s~~Fl~sfaG~i~S~L~m~~l~~f~~k~~S~lgiS 111 (181)
T COG4769 41 ANIVVLIAF-----ETLNFKDALQTILLRVILQALFSGTLF----SPVFLYSFAGAILSTLFMYFLYQFGPKYLSLLGIS 111 (181)
T ss_pred hhhHHHHhH-----HhccHHHHHHHHHHHHHHHHHHcCccc----hHHHHHHHHHHHHHHHHHHHHHHcCCceEeeeehh
Confidence 355555433 224555555555555555554433211 12345677788777777776654442 22233222
Q ss_pred hhhHHHHHHHHHHHHHHHH---HhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 316 YVRFSAEHAIQGIMLLDFL---GVLFRWRLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 316 ~~~~~a~~~~~~~l~~~~~---~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
...--.-.+..++++--+. ......|-....+-+.|...|++....+.+
T Consensus 112 ~mGaF~hNl~QLivas~Lv~~~~v~l~lPll~flGivsG~~vg~~~~~~i~~ 163 (181)
T COG4769 112 VMGAFTHNLGQLIVASFLVFTTSVMLYLPLLIFLGIVSGTAVGILANTLIIT 163 (181)
T ss_pred hHHHHHHhHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1111111111112111111 111122556667777777777777766554
No 48
>smart00576 BTP Bromodomain transcription factors and PHD domain containing proteins. subdomain of archael histone-like transcription factors
Probab=65.28 E-value=14 Score=28.81 Aligned_cols=55 Identities=18% Similarity=0.191 Sum_probs=48.9
Q ss_pred cCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHH
Q psy8269 126 LDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKY 180 (386)
Q Consensus 126 ~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i 180 (386)
.+-|..-+++++.-.+....+.|++++...+-.+.+-.+|..++..||.+++...
T Consensus 17 ~~~Gf~~~~~sale~ltdi~~~yl~~l~~~~~~~a~~agR~~~~~~Dv~~Al~~~ 71 (77)
T smart00576 17 ESAGFDSFQESALETLTDILQSYIQELGRTAHSYAELAGRTEPNLGDVVLALENL 71 (77)
T ss_pred HHcCccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHh
Confidence 3558888999999999999999999999999999999999999999998887553
No 49
>PF04719 TAFII28: hTAFII28-like protein conserved region; InterPro: IPR006809 The general transcription factor, TFIID, consists of the TATA-binding protein (TBP) associated with a series of TBP-associated factors (TAFs) that together participate in the assembly of the transcription preinitiation complex. The conserved region is found at the C terminus of most member proteins. The crystal structure of hTAFII28 with hTAFII18 shows that this region is involved in the binding of these two subunits. The conserved region contains four alpha helices and three loops arranged as in histone H3 [, ].; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005634 nucleus; PDB: 1BH9_B 1BH8_B.
Probab=61.76 E-value=21 Score=29.19 Aligned_cols=73 Identities=19% Similarity=0.193 Sum_probs=47.1
Q ss_pred hhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhcccc-ccchhhHHHHH
Q psy8269 105 LSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRR-TLQRNDIAMAI 177 (386)
Q Consensus 105 ~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~-~iq~~DIa~aI 177 (386)
++=+.+....+|.+.|||+|+.-.+=+-+++.+.++++--.-+|+=|+...|-.-.++.+.. .+++.++..+.
T Consensus 14 ~Rye~fRRs~~~k~~ikkli~~~~~~qsv~~~v~i~v~g~aKvFVGEiVE~A~~Vq~~~~~~~pl~P~hlreA~ 87 (90)
T PF04719_consen 14 DRYEAFRRSSFNKAAIKKLINQVLGNQSVSQNVVIAVAGIAKVFVGEIVEEARDVQEEWGETGPLQPDHLREAY 87 (90)
T ss_dssp HHHHHHHH----HHHHHHHHHHHHS-S---HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT--SS--HHHHHHHH
T ss_pred HHHHHHHHccCCHHHHHHHHHHHcCCCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHH
Confidence 45667888889999999999996664566777788888888999999998887666554433 66666666553
No 50
>cd08050 TAF6 TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs functions such as serving as
Probab=61.38 E-value=17 Score=36.59 Aligned_cols=61 Identities=26% Similarity=0.330 Sum_probs=52.9
Q ss_pred HHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269 118 ARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK 179 (386)
Q Consensus 118 ~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~ 179 (386)
..||.|.+. -|+.-++.|+....+...|-++.++...|-...+..||++++..||..|+-.
T Consensus 3 ~~i~~ia~~-~Gi~~~~~~a~~~La~~~e~~~~~i~~~A~k~~~hskR~~l~~~Di~~Al~~ 63 (343)
T cd08050 3 ESIKLIAES-LGIDSLSDEVAQLLAEDVEYRLREIIQEAAKFMRHSKRRKLTTSDVNHALRL 63 (343)
T ss_pred hHHHHHHHH-cCCCcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCHHHHHHHHHH
Confidence 345555554 3778889999999999999999999999999999999999999999999844
No 51
>cd07979 TAF9 TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and are involved in forming the TFIID complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. The TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAFs orthologs and paralogs. Human TAF9 has a paralogue gene (TAF9L) whi
Probab=59.34 E-value=28 Score=29.62 Aligned_cols=61 Identities=16% Similarity=0.195 Sum_probs=53.4
Q ss_pred HHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269 118 ARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK 179 (386)
Q Consensus 118 ~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~ 179 (386)
.-|++|++.. |+.-.+++++.-.-.-.+.+..++...+-.+.+-.+|++++..||..+|..
T Consensus 5 ~~v~~iLk~~-Gv~~~~~~v~~~Lle~~~ry~~~il~dA~~~a~hA~r~tV~~eDV~lAi~~ 65 (117)
T cd07979 5 RVIAAILKSM-GITEYEPRVINQLLEFAYRYTTDVLDDAKVYSEHAGKANIDADDVKLAIQS 65 (117)
T ss_pred HHHHHHHHHC-CCCccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHH
Confidence 3466777754 888889999999999999999999999999999999999999999999944
No 52
>PF02969 TAF: TATA box binding protein associated factor (TAF); InterPro: IPR004823 The TATA box binding protein associated factor (TAF) is part of the transcription initiation factor TFIID multimeric protein complex. TFIID plays a central role in mediating promoter responses to various activators and repressors. It binds tightly to TAFII-250 and directly interacts with TAFII-40. TFIID is composed of TATA binding protein (TBP)and a number of TBP-associated factors (TAFS). TAF proteins adopt a histone-like fold.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus; PDB: 1TAF_B.
Probab=58.68 E-value=22 Score=27.29 Aligned_cols=49 Identities=27% Similarity=0.374 Sum_probs=39.6
Q ss_pred cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269 129 GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI 177 (386)
Q Consensus 129 ~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI 177 (386)
|+.-++.|+-...+..+|--++++...+-......||+.++..||..++
T Consensus 17 Gi~~l~de~a~~La~dveyrlreiiq~a~kfm~hskR~~Lt~~Di~~AL 65 (66)
T PF02969_consen 17 GISNLSDEAAKALAEDVEYRLREIIQEALKFMRHSKRTKLTTDDINSAL 65 (66)
T ss_dssp T---B-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-SSB-HHHHHHHH
T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHHHHh
Confidence 5667888999999999999999999999999999999999999998875
No 53
>PLN00161 histone H3; Provisional
Probab=57.42 E-value=38 Score=29.74 Aligned_cols=81 Identities=23% Similarity=0.235 Sum_probs=63.6
Q ss_pred hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269 97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA 174 (386)
Q Consensus 97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa 174 (386)
..+|+...++..|.--+.+|-.|+ |+|++.-. .---+..+|....--|.|.|+-.+-.-+..|+--.||-|++..|+.
T Consensus 41 aLrEIR~yQkst~lLIpklPF~RLVREI~~~~~~~~~Rfq~~Al~ALQEAsEayLV~lFeda~lcaiHAkRVTlm~kDm~ 120 (135)
T PLN00161 41 ALREIRKYQKSTELLIRKLPFARLVREISNEMLREPFRWTAEALLALQEATEDFLVHLFEDCNLCAIHAKRVTIMPKDMQ 120 (135)
T ss_pred HHHHHHHHccccccccccccHHHHHHHHHHhcCCCCcEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccchhhHH
Confidence 345665556778888888999885 66765432 1125688888999999999999999999999999999999999986
Q ss_pred HHH
Q psy8269 175 MAI 177 (386)
Q Consensus 175 ~aI 177 (386)
.+-
T Consensus 121 La~ 123 (135)
T PLN00161 121 LAR 123 (135)
T ss_pred HHH
Confidence 553
No 54
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=57.14 E-value=1e+02 Score=30.30 Aligned_cols=41 Identities=12% Similarity=0.169 Sum_probs=30.1
Q ss_pred hhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHH
Q psy8269 232 PLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVH 276 (386)
Q Consensus 232 ~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~ 276 (386)
+..+++|.+.|+..+..+.+....+|.+ +++++|++.+.+.
T Consensus 10 l~Nf~~d~~LL~~t~~~lk~~~~~~Rll----~ga~iGa~~~~~~ 50 (288)
T TIGR02854 10 LENFIIDYFLLYLTARTLKDKVSQWRLL----LAALIGSLYVLFM 50 (288)
T ss_pred HHHHHHHHHHHHHHHHHhhccchHHHHH----HHHHHHHHHHHHH
Confidence 3567889999999999998888888765 4555555544444
No 55
>COG4603 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=54.33 E-value=2.2e+02 Score=29.04 Aligned_cols=123 Identities=18% Similarity=0.187 Sum_probs=61.4
Q ss_pred hhhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCCCccccceeecc-cccCChhHHHHHHHHHHHHHHH
Q psy8269 170 RNDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPDQYLSVPMLLST-FSHQSPLHIFANMFVLHSFMPS 248 (386)
Q Consensus 170 ~~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~~g~~wrllTs~-FlH~~~~HLl~Nm~~L~~fG~~ 248 (386)
-+++-.+|+.+.+..++.+.+-........+ . +...+..-|+..+.-+++... -+|.++.--+.-....|+ .
T Consensus 142 ~nEVIttiMlNyIa~~l~~~lV~~~~~~p~~---~-~~~~T~~i~~~a~lp~~~~~~~~~~~g~lial~~av~~~~---l 214 (356)
T COG4603 142 VNEVITTIMLNYIAVYLLNYLVEGPLRDPAG---T-NFPQTRPIPDAARLPSLFLASGRLHWGFLLALAAAVLVWF---L 214 (356)
T ss_pred ccHHHHHHHHHHHHHHHHHHHHcCcccCCCc---C-CCccCccCchhhcCCccccccchHHHHHHHHHHHHHHHHH---H
Confidence 3567778888888888777765531111111 0 011222223333333332222 455555444444444443 2
Q ss_pred HHhh-hch--------------------hHHHHHHHHHHHHHHHHHHHHH-----hhcCCCcceechHHHHHHHHHH
Q psy8269 249 SVED-LGK--------------------EQFVGFYLTAGVVASLLSYVHK-----ILVRKPGLSIGASGAIMAVLAH 299 (386)
Q Consensus 249 lE~~-~G~--------------------~rfl~lyl~sgv~gsl~s~l~~-----~~~~~~~~~vGASGaI~Gll~a 299 (386)
+++. +|- +..+..+++||.+++++..+.. ....+..+-.|.+|...++++-
T Consensus 215 l~kT~~G~~lr~~G~sp~Aa~yaGi~~~r~i~~sm~iSGalAGLaG~~ev~G~~~~~~~~~~~G~Gf~gI~Vallg~ 291 (356)
T COG4603 215 LKKTVFGYELRVVGLSPRAARYAGISVKRTIFLSMLISGALAGLAGAVEVLGAFGRLRPSISPGYGFDGIAVALLGR 291 (356)
T ss_pred HHHccCCeEEEEecCChhHHHHCCCChhHHHHHHHHHHHHHHHHHHHHHHhcchhhcccccCCCcChHHHHHHHHhc
Confidence 4432 111 1234555666666666654432 1122346788999999888885
No 56
>PTZ00252 histone H2A; Provisional
Probab=54.32 E-value=35 Score=29.86 Aligned_cols=70 Identities=16% Similarity=0.281 Sum_probs=58.4
Q ss_pred CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhh--ccccccchhhHHHHHHHHHHHH
Q psy8269 115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTED--NKRRTLQRNDIAMAITKYDMFD 184 (386)
Q Consensus 115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~--~kr~~iq~~DIa~aI~~i~i~~ 184 (386)
.|++||.+.|+..---.=|+..+++......|=..-|+-..+...+.+ +++.-|++.++..+|-..+=++
T Consensus 26 FPVgRi~R~Lr~g~ya~RIga~ApVYLAAVLEYLtaEVLELAgnaa~d~~~kk~RItPrHi~lAIrNDeEL~ 97 (134)
T PTZ00252 26 FPVGRVGSLLRRGQYARRIGASGAVYMAAVLEYLTAELLELSVKAAAQQAKKPKRLTPRTVTLAVRHDDDLG 97 (134)
T ss_pred CchHHHHHHHHcCCcccccCCccHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccHHHHHhhccChHHHH
Confidence 899999999988766668899999999999998888888888888754 6778899999999995544333
No 57
>PF13105 DUF3959: Protein of unknown function (DUF3959)
Probab=53.40 E-value=66 Score=29.84 Aligned_cols=19 Identities=21% Similarity=0.308 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHHHHhhh
Q psy8269 346 AAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 346 ~aHLgG~L~G~l~~~~~~~ 364 (386)
.--+||++.|-+++.+++|
T Consensus 134 llLvgGli~GGLlA~~~hR 152 (239)
T PF13105_consen 134 LLLVGGLILGGLLAMLIHR 152 (239)
T ss_pred HHHHHHHHHHHHHHHHHHh
Confidence 3457999999999998887
No 58
>PF04632 FUSC: Fusaric acid resistance protein family; InterPro: IPR006726 This entry represents the p-hydroxybenzoic acid efflux pump subunit AaeB (pHBA efflux pump protein B) whose substrates are p-hydroxybenzoic acid (pHBA), 6-hydroxy-2-naphthoic and 2-hydroxycinnamate. It could function as a metabolic relief valve, allowing to eliminate certain compounds when they accumulate to high levels in the cell []. This family also includes fusaric acid resistance proteins [], which are likely to be membrane transporter proteins, and uncharacterised transporter YdhK.; GO: 0006810 transport, 0005886 plasma membrane
Probab=50.26 E-value=3.3e+02 Score=29.28 Aligned_cols=35 Identities=20% Similarity=0.289 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHHHhhh---ccccccchHHHHHHhhc
Q psy8269 347 AHLGGALFGILYSKYGEQ---TWAHRAPVVEYWKSLKK 381 (386)
Q Consensus 347 aHLgG~L~G~l~~~~~~~---~w~~r~~~~~~W~~~R~ 381 (386)
|.+.|.+++.+...+... .|..|+-....|+++++
T Consensus 474 a~l~G~~~a~l~~~li~p~~~~~~~rrl~~~~~~~l~~ 511 (650)
T PF04632_consen 474 AILLGIVIAALVFRLIRPFSPEWRRRRLLRALRRDLAR 511 (650)
T ss_pred HHHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHHHH
Confidence 444444444443333322 23344444557777665
No 59
>PF03847 TFIID_20kDa: Transcription initiation factor TFIID subunit A; InterPro: IPR003228 Human transcription initiation factor TFIID is composed of the TATA-binding polypeptide (TBP) and at least 13 TBP-associated factors (TAFs) that collectively or individually are involved in activator-dependent transcription [].; GO: 0006352 transcription initiation, DNA-dependent, 0005669 transcription factor TFIID complex; PDB: 1H3O_B.
Probab=48.32 E-value=57 Score=25.11 Aligned_cols=60 Identities=17% Similarity=0.289 Sum_probs=43.8
Q ss_pred HHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHH
Q psy8269 119 RIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAIT 178 (386)
Q Consensus 119 rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~ 178 (386)
++...++.=+.-..+.+|+.-++..-.+-|+..++.++-..+..++..++...||...+-
T Consensus 4 ~l~~Lv~~iDp~~~ld~~vee~Ll~laddFv~~v~~~ac~lAKhR~s~tle~~Dv~~~Le 63 (68)
T PF03847_consen 4 KLQELVKQIDPNEKLDPDVEELLLELADDFVDDVVSFACRLAKHRKSSTLEVKDVQLHLE 63 (68)
T ss_dssp HHHHHHHCC-SS----HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-SEE-HHHHHHHHH
T ss_pred HHHHHHHHcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCHHHHHHHHH
Confidence 344555555555567888999999999999999999999999999999999988876653
No 60
>TIGR00834 ae anion exchange protein. They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
Probab=46.77 E-value=2.2e+02 Score=32.74 Aligned_cols=64 Identities=17% Similarity=0.165 Sum_probs=37.8
Q ss_pred hhHHHHHHHHHHHHHHHHHhhhchh-HHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHH
Q psy8269 232 PLHIFANMFVLHSFMPSSVEDLGKE-QFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAH 299 (386)
Q Consensus 232 ~~HLl~Nm~~L~~fG~~lE~~~G~~-rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a 299 (386)
++=.+.|+.--+.||..+++.-+.. -..-+.+++++ +++...++. +++-..+|.+|-+.-+..+
T Consensus 376 ifiyFA~L~PaIaFG~ll~~~T~g~~gv~E~Llstai-~Giifslf~---GQPL~IlG~TGPilvF~~~ 440 (900)
T TIGR00834 376 IFIYFAALSPAITFGGLLGEKTRNMMGVSELLISTAV-QGVLFALLA---AQPLLVVGFSGPLLVFEEA 440 (900)
T ss_pred HHHHHHHhhHHhhHHHHHHHhhCCcchHHHHHHHHHH-HHHHHhhhc---CCceEEecCcccHHHHHHH
Confidence 3344556666677888888774433 22333344444 444444444 5777889999966555544
No 61
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=46.22 E-value=4.3 Score=31.11 Aligned_cols=43 Identities=12% Similarity=0.117 Sum_probs=31.9
Q ss_pred ccchhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHh
Q psy8269 15 SNDIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYK 64 (386)
Q Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (386)
.....+..+.+...|++|++.||..| ..-|..+..++...|..
T Consensus 29 ~~~~~e~~k~~~~~Wk~Ls~~EK~~Y-------~~~A~~~k~~y~~e~~~ 71 (73)
T PF09011_consen 29 KQSFREVMKEISERWKSLSEEEKEPY-------EERAKEDKERYEREMKE 71 (73)
T ss_dssp -SSHHHHHHHHHHHHHHS-HHHHHHH-------HHHHHHHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHHHHhcCHHHHHHH-------HHHHHHHHHHHHHHHHh
Confidence 34577889999999999999999988 66666666666665544
No 62
>KOG1658|consensus
Probab=45.46 E-value=24 Score=31.58 Aligned_cols=45 Identities=27% Similarity=0.256 Sum_probs=40.9
Q ss_pred hhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhH
Q psy8269 110 LKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTI 154 (386)
Q Consensus 110 ~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~ 154 (386)
.+.+.+|.+..|||=|.+|-..+.+.++-.-..-+.|.|+|-++.
T Consensus 7 e~~p~~p~ekvkkiak~dPey~~te~~a~~etafatE~fvq~lv~ 51 (162)
T KOG1658|consen 7 ECSPKLPMEKVKKIAKNDPEYMDTEDDAFVETAFATEQFVQVLVH 51 (162)
T ss_pred hhCccccHHHHHHhhcCCchhhhcccchHHHHHHHHHHHHhhhhh
Confidence 355679999999999999999999999998999999999998886
No 63
>PF06728 PIG-U: GPI transamidase subunit PIG-U; InterPro: IPR009600 Many eukaryotic proteins are anchored to the cell surface via glycosylphosphatidylinositol (GPI), which is posttranslationally attached to the C terminus by GPI transamidase. The mammalian GPI transamidase is a complex of at least four subunits, GPI8, GAA1, PIG-S, and PIG-T. PIG-U is thought to represent a fifth subunit in this complex and may be involved in the recognition of either the GPI attachment signal or the lipid portion of GPI [].; GO: 0006506 GPI anchor biosynthetic process, 0005789 endoplasmic reticulum membrane, 0016021 integral to membrane
Probab=44.71 E-value=40 Score=34.41 Aligned_cols=88 Identities=18% Similarity=0.338 Sum_probs=66.8
Q ss_pred ccCCCCccccceeecccccC-ChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceech
Q psy8269 211 ALNPDQYLSVPMLLSTFSHQ-SPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGA 289 (386)
Q Consensus 211 ~l~p~~g~~wrllTs~FlH~-~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGA 289 (386)
-+.|+-|-+|=++|-+|-|. .+....+|...+....+..-| +.+..+.++.+..|+++-+ .+.+++|=
T Consensus 254 dltPNlGl~WYfF~EmFd~Fr~fF~~vF~~~~~~y~~Pl~ir-l~~~p~~~~~~~~~~~~~f----------k~Ypt~~D 322 (382)
T PF06728_consen 254 DLTPNLGLWWYFFTEMFDHFRPFFLFVFQLHPFIYIIPLTIR-LRKQPLFAFFILLGIISVF----------KPYPTVGD 322 (382)
T ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccCcHHHHHHHHHHHHHh----------CCCCchhH
Confidence 35688899999999999995 888899999988777775554 4555666666666655432 46799999
Q ss_pred HHHHHHHHHHHHHHcCCCcc
Q psy8269 290 SGAIMAVLAHTCITHPDTEL 309 (386)
Q Consensus 290 SGaI~Gll~a~~~~~p~~~~ 309 (386)
.|...+++..+--.++..+.
T Consensus 323 ~~~~lsll~~~~~~~~~~r~ 342 (382)
T PF06728_consen 323 VGLYLSLLPLFSHLFPYMRY 342 (382)
T ss_pred HHHHHHHHHHhHHHHhhccc
Confidence 99999999887666666553
No 64
>PF03540 TFIID_30kDa: Transcription initiation factor TFIID 23-30kDa subunit; InterPro: IPR003923 Transcription initiation factor TFIID is a multimeric protein complex that plays a central role in mediating promoter responses to various activators and repressors. The complex includes TATA binding protein (TBP) and various TBP-associated factors (TAFS). TFIID a bona fide RNA polymerase II-specific TATA-binding protein-associated factor (TAF) and is essential for viability []. TFIID acts to nucleate the transcription complex, recruiting the rest of the factors through a direct interaction with TFIIB. The TBP subunit of TFIID is sufficient for TATA-element binding and TFIIB interaction, and can support basal transcription. The protein belongs to the TAF2H family.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus
Probab=43.53 E-value=74 Score=23.25 Aligned_cols=48 Identities=19% Similarity=0.181 Sum_probs=36.0
Q ss_pred cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhh
Q psy8269 114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTED 162 (386)
Q Consensus 114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~ 162 (386)
.+|.+.+.-.|+.. |++.--....-+++.|+++|+.++...++.|..-
T Consensus 2 ~IPD~v~~~yL~~~-G~~~~D~rv~RLvSLaaQKFisdI~~dA~q~~k~ 49 (51)
T PF03540_consen 2 TIPDEVTDYYLERS-GFQTSDPRVKRLVSLAAQKFISDIANDAMQYCKI 49 (51)
T ss_pred CCCHHHHHHHHHHC-CCCCCCHhHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 46777777777764 3344444556788999999999999999988754
No 65
>smart00398 HMG high mobility group.
Probab=43.50 E-value=6.9 Score=28.87 Aligned_cols=40 Identities=15% Similarity=0.090 Sum_probs=29.0
Q ss_pred chhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHH
Q psy8269 17 DIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLY 63 (386)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 63 (386)
..+++.+.+.+.|+.|.+.+|.++ ..-+..+...+...+.
T Consensus 28 ~~~~i~~~~~~~W~~l~~~ek~~y-------~~~a~~~~~~y~~~~~ 67 (70)
T smart00398 28 SNAEISKKLGERWKLLSEEEKAPY-------EEKAKKDKERYEEEMP 67 (70)
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHH-------HHHHHHHHHHHHHHHH
Confidence 367889999999999999999988 5555544444444433
No 66
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=40.88 E-value=7.1 Score=28.67 Aligned_cols=24 Identities=17% Similarity=0.368 Sum_probs=22.0
Q ss_pred chhhHHHHHHHHhccCCCCCeeee
Q psy8269 17 DIPKWRKDLRKRWNALHPGDRVYV 40 (386)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~ 40 (386)
...++-+.+.+.|+.|++.+|..+
T Consensus 27 ~~~~i~~~~~~~W~~ls~~eK~~y 50 (66)
T cd01390 27 SVTEVTKILGEKWKELSEEEKKKY 50 (66)
T ss_pred CHHHHHHHHHHHHHhCCHHHHHHH
Confidence 568899999999999999999888
No 67
>PTZ00199 high mobility group protein; Provisional
Probab=38.34 E-value=9.4 Score=31.05 Aligned_cols=41 Identities=15% Similarity=0.073 Sum_probs=34.2
Q ss_pred hhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhh
Q psy8269 18 IPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKY 65 (386)
Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 65 (386)
..++-+.|...|+.|+++||..+ ..-+..+..+|...|-.|
T Consensus 52 ~~evsk~ige~Wk~ls~eeK~~y-------~~~A~~dk~rY~~e~~~Y 92 (94)
T PTZ00199 52 VAAVGKMVGEAWNKLSEEEKAPY-------EKKAQEDKVRYEKEKAEY 92 (94)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHH-------HHHHHHHHHHHHHHHHHH
Confidence 56778899999999999999988 777778877777777665
No 68
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=36.95 E-value=7.7 Score=29.62 Aligned_cols=42 Identities=17% Similarity=0.096 Sum_probs=29.9
Q ss_pred chhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhh
Q psy8269 17 DIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKY 65 (386)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 65 (386)
...++-|.+.+.|+.|++++|..+ ...+..+...+...+++|
T Consensus 28 ~~~eisk~l~~~Wk~ls~~eK~~y-------~~~a~~~k~~y~~~~p~y 69 (72)
T cd01388 28 ENRAISKILGDRWKALSNEEKQPY-------YEEAKKLKELHMKLYPDY 69 (72)
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHH-------HHHHHHHHHHHHHHCcCC
Confidence 456788899999999999999887 555555554444444443
No 69
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=36.89 E-value=7.3 Score=30.11 Aligned_cols=44 Identities=11% Similarity=0.067 Sum_probs=34.4
Q ss_pred chhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhhhc
Q psy8269 17 DIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKYFA 67 (386)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 67 (386)
...++-+.+.+.|+.|++.+|..+ ...|..+...+...+++|=.
T Consensus 28 ~~~eisk~~g~~Wk~ls~eeK~~y-------~~~A~~~k~~~~~~~p~Yky 71 (77)
T cd01389 28 TNNEISRIIGRMWRSESPEVKAYY-------KELAEEEKERHAREYPDYKY 71 (77)
T ss_pred CHHHHHHHHHHHHhhCCHHHHHHH-------HHHHHHHHHHHHHHCCCCcc
Confidence 457889999999999999999988 66666666666666666543
No 70
>PF15510 CENP-W: Centromere kinetochore component W
Probab=35.40 E-value=84 Score=25.78 Aligned_cols=63 Identities=21% Similarity=0.371 Sum_probs=47.4
Q ss_pred ccCchHHHHHhhhcCc-cccccccchhhH--------------HHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHH
Q psy8269 113 QALPLARIKKIMKLDD-GVKMISAEAPML--------------FSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMA 176 (386)
Q Consensus 113 ~~~p~~rik~im~~~~-~v~mi~~~~~~l--------------~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~a 176 (386)
+..|.+-+|+++|-.+ -.. +..+.+.+ +-.-|=+|++.+...+-+.+-++|-.+|.+.+|..+
T Consensus 15 rkaPrgfLkrv~Kr~KphlR-l~~~~Dllv~~~~f~~~~~~~~vhLncLLFvhrLAEEaRtnA~EnK~~~Ik~~Hv~Aa 92 (102)
T PF15510_consen 15 RKAPRGFLKRVFKRQKPHLR-LETSGDLLVRFCPFSGWQWGGEVHLNCLLFVHRLAEEARTNACENKCGTIKKEHVLAA 92 (102)
T ss_pred HhCchHHHHHHHHhcCCcee-ecccccHHHhhcccccccccceeehhHHHHHHHHHHHHHHHHHHHhhccccHHHHHHH
Confidence 3579999999999443 333 22223333 556899999999999999999999999998887544
No 71
>PF12805 FUSC-like: FUSC-like inner membrane protein yccS
Probab=34.69 E-value=2.4e+02 Score=27.30 Aligned_cols=12 Identities=25% Similarity=0.077 Sum_probs=4.8
Q ss_pred HHHHHHHHHHHH
Q psy8269 344 DHAAHLGGALFG 355 (386)
Q Consensus 344 ~~~aHLgG~L~G 355 (386)
+.+.|.+=+++|
T Consensus 70 ~~~~~~~l~~~G 81 (284)
T PF12805_consen 70 EALEHALLFLAG 81 (284)
T ss_pred HHHHHHHHHHHH
Confidence 334444333333
No 72
>COG0705 Membrane associated serine protease [Amino acid transport and metabolism]
Probab=33.93 E-value=29 Score=32.29 Aligned_cols=69 Identities=13% Similarity=-0.011 Sum_probs=49.3
Q ss_pred CccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHH
Q psy8269 216 QYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMA 295 (386)
Q Consensus 216 ~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~G 295 (386)
+|.++.++++.++|....|...+... ..+...+++...++.++...... ..+.++.++-+.|
T Consensus 137 SG~i~gllga~~~~~~~~~~~~~~~~-------------~~~~~~~~i~~~~~~~~~~~~~~-----~~~~va~~aHl~G 198 (228)
T COG0705 137 SGAIFGLLGAYFLLFPFARILLLFLS-------------LPRPALILILIWLLYSLFSGAGS-----FGPSVAWSAHLGG 198 (228)
T ss_pred hHHHHHHHHHHHHHccccchhhhhcc-------------CchhHHHHHHHHHHHHHHHHhcC-----CchHHHHHHHHHH
Confidence 67899999999999988888777654 34455667777777776666544 2257888888888
Q ss_pred HHHHHHH
Q psy8269 296 VLAHTCI 302 (386)
Q Consensus 296 ll~a~~~ 302 (386)
+++....
T Consensus 199 ~i~G~l~ 205 (228)
T COG0705 199 LIGGLLL 205 (228)
T ss_pred HHHHHHH
Confidence 8866543
No 73
>PLN00160 histone H3; Provisional
Probab=33.84 E-value=1.4e+02 Score=24.71 Aligned_cols=80 Identities=23% Similarity=0.231 Sum_probs=62.2
Q ss_pred hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269 97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA 174 (386)
Q Consensus 97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa 174 (386)
+.+|+..-++..|.--+.+|-.|+ |+|.+.-. +---+..+|....--|.|.|.-.+-.-+..|+--.||-|++..|+.
T Consensus 7 aLrEIR~yQkst~lLI~k~pF~RLVREI~~~~~~~~~Rfq~~Al~ALQeAsEayLv~lfed~~lca~HakRVTl~~kD~~ 86 (97)
T PLN00160 7 ALKEIKMYQKSTDLLIRRLPFARLVREIQMEMSREAYRWQGSAILALQEAAEAHLVGLFEDSNLCAIHGKRVTIMPKDMQ 86 (97)
T ss_pred HHHHHHHHccchhhhhccccHHHHHHHHHHHcCCCCcEeeHHHHHHHHHHHHHHHHHHHhhhHHHHHHhcccccchhhHH
Confidence 345555556777778888898885 66665322 2126688888899999999999999999999999999999999986
Q ss_pred HH
Q psy8269 175 MA 176 (386)
Q Consensus 175 ~a 176 (386)
.+
T Consensus 87 L~ 88 (97)
T PLN00160 87 LA 88 (97)
T ss_pred HH
Confidence 54
No 74
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=33.69 E-value=6.9 Score=29.06 Aligned_cols=25 Identities=16% Similarity=0.194 Sum_probs=22.0
Q ss_pred cchhhHHHHHHHHhccCCCCCeeee
Q psy8269 16 NDIPKWRKDLRKRWNALHPGDRVYV 40 (386)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~ 40 (386)
...+++.+.+.+.|+.|++.+|..+
T Consensus 26 ~~~~~i~~~~~~~W~~l~~~eK~~y 50 (69)
T PF00505_consen 26 LSNKEISKILAQMWKNLSEEEKAPY 50 (69)
T ss_dssp STHHHHHHHHHHHHHCSHHHHHHHH
T ss_pred cccccchhhHHHHHhcCCHHHHHHH
Confidence 3578899999999999999998877
No 75
>PF10979 DUF2786: Protein of unknown function (DUF2786); InterPro: IPR024498 This domain is found in proteins that have no known function.
Probab=33.69 E-value=49 Score=23.14 Aligned_cols=35 Identities=26% Similarity=0.362 Sum_probs=29.1
Q ss_pred hHHHHHhhhcCccccccccchhhHHHHHHHHHHHH
Q psy8269 117 LARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHE 151 (386)
Q Consensus 117 ~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~ 151 (386)
+.||||.+.+..+-.--..|+.....+|.++..++
T Consensus 4 l~kI~kLLalA~~~~~~~~EA~~A~~kAq~Lm~ky 38 (43)
T PF10979_consen 4 LEKIRKLLALAESTGSNEHEAEAALAKAQRLMAKY 38 (43)
T ss_pred HHHHHHHHHHhhCCCCCHHHHHHHHHHHHHHHHHh
Confidence 57999999999987777789999999998766543
No 76
>PLN00121 histone H3; Provisional
Probab=33.58 E-value=1.2e+02 Score=26.74 Aligned_cols=80 Identities=21% Similarity=0.251 Sum_probs=63.2
Q ss_pred hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHH
Q psy8269 97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAM 175 (386)
Q Consensus 97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~ 175 (386)
..+|+...++..|.--+.+|-.|+ |+|.+.-..---+..++....--|.|.|.-.+-.-+..|+--.||-+++..|+..
T Consensus 48 aLrEIr~yQkst~lLI~k~pF~RLVREI~~~~~~~~Rf~~~Al~ALQeaaE~yLv~lfed~~lca~HakRVTl~~kD~~L 127 (136)
T PLN00121 48 ALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQL 127 (136)
T ss_pred HHHHHHHhccccccccccccHHHHHHHHHHHhCccceeeHHHHHHHHHHHHHHHHHHHhhhHHHHHHhcceecchhhHHH
Confidence 345665566788888888999885 6666542211156889999999999999999999999999999999999999855
Q ss_pred H
Q psy8269 176 A 176 (386)
Q Consensus 176 a 176 (386)
+
T Consensus 128 ~ 128 (136)
T PLN00121 128 A 128 (136)
T ss_pred H
Confidence 4
No 77
>PF09527 ATPase_gene1: Putative F0F1-ATPase subunit (ATPase_gene1)
Probab=33.25 E-value=1.8e+02 Score=20.81 Aligned_cols=41 Identities=17% Similarity=0.052 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHHHHHhhhch-hHHHHHHHHHHHHHHHHHH
Q psy8269 234 HIFANMFVLHSFMPSSVEDLGK-EQFVGFYLTAGVVASLLSY 274 (386)
Q Consensus 234 HLl~Nm~~L~~fG~~lE~~~G~-~rfl~lyl~sgv~gsl~s~ 274 (386)
.++.+++.-..+|.-+++.+|+ .-+..+.++-|+++++.+.
T Consensus 9 ~~~~~i~~g~~~G~~lD~~~~t~p~~~~~g~llG~~~g~~~~ 50 (55)
T PF09527_consen 9 TMAAPILVGFFLGYWLDKWFGTSPWFTLIGLLLGIAAGFYNV 50 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHHH
Confidence 4566777778889999999888 5566666666776666544
No 78
>PF06609 TRI12: Fungal trichothecene efflux pump (TRI12); InterPro: IPR010573 This family consists of several fungal specific trichothecene efflux pump proteins. Many of the genes involved in trichothecene toxin biosynthesis in Fusarium sporotrichioides are present within a gene cluster. It has been suggested that TRI12 may play a role in F. sporotrichioides self-protection against trichothecenes [].
Probab=32.33 E-value=1.9e+02 Score=31.55 Aligned_cols=41 Identities=17% Similarity=0.117 Sum_probs=27.2
Q ss_pred hHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHH
Q psy8269 233 LHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSY 274 (386)
Q Consensus 233 ~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~ 274 (386)
...+.+....-.+|+ +.+.+|++.++..=.+.+++|.+...
T Consensus 86 ~~~l~~av~~~~~G~-LSDlfGRr~~~i~g~~l~vvG~Iv~a 126 (599)
T PF06609_consen 86 AWTLASAVSFPFVGR-LSDLFGRRYFFIIGSLLGVVGSIVCA 126 (599)
T ss_pred HHHHHHHHHHHhhHH-HHHHhcchHHHHHHHHHHHhHHHHhh
Confidence 355556666666666 89999999888666665665555443
No 79
>PF07524 Bromo_TP: Bromodomain associated; InterPro: IPR006565 This bromodomain is found in eukaryotic transcription factors and PHD domain containing proteins (IPR001965 from INTERPRO). The tandem PHD finger-bromodomain is found in many chromatin-associated proteins. It is involved in gene silencing by the human co-repressor KRAB-associated protein 1 (KAP1). The tandem PHD finger-bromodomain of KAP1 has a distinct structure that joins the two protein modules. The first helix, alpha(Z), of an atypical bromodomain forms the central hydrophobic core that anchors the other three helices of the bromodomain on one side and the zinc binding PHD finger on the other []. The Rap1 GTPase-activating protein, Sipa1, is modulated by the cellular bromodomain protein, Brd4. Brd4 belongs to the BET family and is a multifunctional protein involved in transcription, replication, the signal transduction pathway, and cell cycle progression. All of these functions are linked to its association with acetylated chromatin. It has tandem bromodomains []. The dysregulation of the Brd4-associated pathways may play an important role in breast cancer progression []. Bovine papillomavirus type 1 E2 also binds to chromosomes in a complex with Brd4. Interaction with Brd4 is additionally important for E2-mediated transcriptional regulation [, ].
Probab=31.38 E-value=1.5e+02 Score=22.70 Aligned_cols=55 Identities=18% Similarity=0.158 Sum_probs=47.5
Q ss_pred hcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269 125 KLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK 179 (386)
Q Consensus 125 ~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~ 179 (386)
....|..-+++.+.-.+......|++.+...+=.+++..+|......|+.+++..
T Consensus 16 l~~~GF~~~~~~al~~Ltdi~~~yl~~l~~~~~~~ae~~gRt~~~~~Dv~~al~~ 70 (77)
T PF07524_consen 16 LKHAGFDSASPSALDTLTDILQRYLQELGRTAKRYAEHAGRTEPNLQDVEQALEE 70 (77)
T ss_pred HHHcCccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHH
Confidence 3556788888999999999999999999999999999888888888999877744
No 80
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.15 E-value=81 Score=27.54 Aligned_cols=24 Identities=17% Similarity=0.225 Sum_probs=16.2
Q ss_pred chhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 341 RLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 341 ~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
....|..-+.|+++|+++|++.-|
T Consensus 4 t~~~W~~a~igLvvGi~IG~li~R 27 (138)
T COG3105 4 TFMTWEYALIGLVVGIIIGALIAR 27 (138)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445566667777777777776665
No 81
>COG0370 FeoB Fe2+ transport system protein B [Inorganic ion transport and metabolism]
Probab=30.82 E-value=4e+02 Score=29.49 Aligned_cols=71 Identities=20% Similarity=0.250 Sum_probs=44.1
Q ss_pred HHHHHHHHHHHHcCCCcc-eeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 292 AIMAVLAHTCITHPDTEL-GILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 292 aI~Gll~a~~~~~p~~~~-~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
.+-|++++=..-.++.++ ..+.+|+....++.-+...++-.+.... ...+.+.-.+.|++..++.++.+++
T Consensus 393 nVPaImaTRtle~~reRllt~lv~pfmsCsARLpVy~l~~~affp~~--~~~v~f~lY~lgi~~all~a~vl~k 464 (653)
T COG0370 393 NVPAIMATRTLETERERLLTILVIPFMSCSARLPVYVLFAAAFFPSN--GGLVLFSLYLLGIVVALLTAKVLKK 464 (653)
T ss_pred chHHHHHHHHhcchhHHHHHHHHhhccchhhhHHHHHHHHHHHcccc--cchHHHHHHHHHHHHHHHHHHHHHH
Confidence 355566655444444443 3334677777777776555443332211 2456777889999999999998887
No 82
>PTZ00018 histone H3; Provisional
Probab=30.44 E-value=1.5e+02 Score=26.16 Aligned_cols=79 Identities=22% Similarity=0.307 Sum_probs=63.4
Q ss_pred hhhHHhhhhhhhhhhcccCchHHH-HHhhhc-CccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269 97 SVLEVQGALSMLDLKTQALPLARI-KKIMKL-DDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA 174 (386)
Q Consensus 97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~-~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa 174 (386)
..+|+..-++..|.--+.+|-.|+ |+|.+. .+++ -+..++....--|.|.|.-.+-.-+..|+--.||-+++..|+.
T Consensus 48 aLrEIr~yQkst~lLI~k~pF~RLVREI~~~~~~~~-rf~~~al~aLQeaaE~yLv~lfed~~lca~HakRVTl~~kD~~ 126 (136)
T PTZ00018 48 ALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDL-RFQSSAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQ 126 (136)
T ss_pred HHHHHHHHcccchhccccccHHHHHHHHHHHcCCcc-eeeHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcceecchhhHH
Confidence 445665556788888888999885 566543 2333 6688999999999999999999999999999999999999985
Q ss_pred HH
Q psy8269 175 MA 176 (386)
Q Consensus 175 ~a 176 (386)
.+
T Consensus 127 L~ 128 (136)
T PTZ00018 127 LA 128 (136)
T ss_pred HH
Confidence 44
No 83
>cd08045 TAF4 TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and are involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryote. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. Several hypotheses are
Probab=28.58 E-value=1.9e+02 Score=27.00 Aligned_cols=66 Identities=21% Similarity=0.232 Sum_probs=47.7
Q ss_pred HHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhcc------ccccchhhHHHHHHHHHHHH
Q psy8269 118 ARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNK------RRTLQRNDIAMAITKYDMFD 184 (386)
Q Consensus 118 ~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~k------r~~iq~~DIa~aI~~i~i~~ 184 (386)
.++++|++.. +++.+++|+..+++.||+.+++.+....-.+.+-+. .+....+|+...+..+...+
T Consensus 52 ~~~~~i~~~~-g~~~~~~d~~~lis~a~e~rlr~li~k~~~~s~hR~~~~~~~~r~~~~sdvr~qL~~l~~~e 123 (212)
T cd08045 52 KKIRKIAKKH-GLKEVDEDVLDLISLALEERLRNLLEKLIEVSEHRVDSEKEDERYEITSDVRKQLRFLEQLE 123 (212)
T ss_pred HHHHHHHHHc-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCCceeecchHHHHHHHHHHHH
Confidence 4566666654 444889999999999999999998888776665543 34566788877776654433
No 84
>PF14898 DUF4491: Domain of unknown function (DUF4491)
Probab=28.43 E-value=90 Score=25.69 Aligned_cols=44 Identities=20% Similarity=0.353 Sum_probs=30.8
Q ss_pred HHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHH
Q psy8269 249 SVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCI 302 (386)
Q Consensus 249 lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~ 302 (386)
.|.++|+ +.+-+|++.|+++-.++++... +.. .++.|++|+.+.
T Consensus 26 ~EYyfg~-~~W~~FL~~Gi~~~~~Sl~~~~--------~~~-S~llgv~g~s~l 69 (94)
T PF14898_consen 26 GEYYFGT-RIWPIFLLAGIACIIASLFVSN--------VIW-SALLGVLGFSCL 69 (94)
T ss_pred EEEecCC-CcHHHHHHHHHHHHHHHHHHcc--------hHH-HHHHHHHHHHHH
Confidence 4667888 5777999999999888888761 122 246677776553
No 85
>PF15511 CENP-T: Centromere kinetochore component CENP-T; PDB: 3B0D_T 3B0C_T 3VH5_T 3VH6_T.
Probab=27.77 E-value=72 Score=33.07 Aligned_cols=67 Identities=21% Similarity=0.229 Sum_probs=40.4
Q ss_pred hhhhhhcccCchHHHHHhhhcCc-----cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhh
Q psy8269 106 SMLDLKTQALPLARIKKIMKLDD-----GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRND 172 (386)
Q Consensus 106 ~~~~~~~~~~p~~rik~im~~~~-----~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~D 172 (386)
+.+-..+..+|.+-|||+..--- .+++|++|+...+.+|.+.|-++++.-==.|+.=-+|+||..+|
T Consensus 343 Skhgi~~P~lP~~~vK~la~~~ak~s~~sK~kiskdal~aleqasdwfFeQl~dDL~aYA~HAgRKTIdesD 414 (414)
T PF15511_consen 343 SKHGIPYPSLPPGVVKKLAQHFAKSSGGSKMKISKDALEALEQASDWFFEQLGDDLEAYAKHAGRKTIDESD 414 (414)
T ss_dssp ---------S-HHHHHHHHHHHH-------S-B-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-SEE-HHH
T ss_pred CCCCCCCCCCCccHHHHHHHHHHHhhcccCCccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcCCCCC
Confidence 55666677799999999865432 45689999999999999999999986555556666788887654
No 86
>KOG1172|consensus
Probab=25.96 E-value=8.8e+02 Score=27.83 Aligned_cols=59 Identities=17% Similarity=0.195 Sum_probs=33.2
Q ss_pred HHHHHHHHHHHHHHHhhhchh-HHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHH
Q psy8269 236 FANMFVLHSFMPSSVEDLGKE-QFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLA 298 (386)
Q Consensus 236 l~Nm~~L~~fG~~lE~~~G~~-rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~ 298 (386)
+.|+.--..||..+++.-... ...-. ++|..++++...++. +++-..+|.+|-+.=+--
T Consensus 371 fa~l~P~ItFG~ll~~~Tdg~~~v~E~-L~stal~Giifslfg---gQPLlIlg~TgP~lVfe~ 430 (876)
T KOG1172|consen 371 FACLLPAITFGGLLGEATDGLIGVVET-LLSTALCGIIFSLFG---GQPLLILGVTGPLLVFEK 430 (876)
T ss_pred HHhhhhHhhHHHHhhhhccchHHHHHH-HHHHHHHHHHHHHhc---CCceEEEecCccHHHHHH
Confidence 344555667888887764222 22223 344444444444444 567778899986544333
No 87
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=24.54 E-value=21 Score=25.88 Aligned_cols=25 Identities=16% Similarity=0.258 Sum_probs=22.2
Q ss_pred cchhhHHHHHHHHhccCCCCCeeee
Q psy8269 16 NDIPKWRKDLRKRWNALHPGDRVYV 40 (386)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~ 40 (386)
....++.+.+.+.|+.|++.+|.++
T Consensus 26 ~~~~~i~~~~~~~W~~l~~~~k~~y 50 (66)
T cd00084 26 LSVGEISKILGEMWKSLSEEEKKKY 50 (66)
T ss_pred CCHHHHHHHHHHHHHhCCHHHHHHH
Confidence 3567899999999999999999887
No 88
>KOG1142|consensus
Probab=24.35 E-value=1.1e+02 Score=29.87 Aligned_cols=71 Identities=15% Similarity=0.234 Sum_probs=57.3
Q ss_pred hhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269 107 MLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI 177 (386)
Q Consensus 107 ~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI 177 (386)
+++..++.+=..++.+.++.-++-..+-.|+.-+...-++-||..++.++=..+.-+|..++--.||.-.+
T Consensus 147 ~~~~~~~il~k~kl~dLvqqId~~~~LD~dVedlLleiADdFV~sii~~sC~LAKHRKsdtlEvrDIqLhL 217 (258)
T KOG1142|consen 147 DEPGNNPILSKRKLDDLVQQIDGTTKLDDDVEDLLLEIADDFVSSIIHRSCKLAKHRKSDTVEVRDIQLHL 217 (258)
T ss_pred ccCCCCccccccchhHHHHhhcCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccchhheeeee
Confidence 34455555666777777777788889999999999999999999999999999999998888766665444
No 89
>PRK09554 feoB ferrous iron transport protein B; Reviewed
Probab=22.75 E-value=6.3e+02 Score=28.51 Aligned_cols=51 Identities=22% Similarity=0.257 Sum_probs=33.7
Q ss_pred EechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269 312 LFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ 364 (386)
Q Consensus 312 ~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~ 364 (386)
+..|+.+..++.-+..+++-.+.. .....+-+...+.|+++.++.++++++
T Consensus 425 l~~pfm~C~ArlpV~~l~~~aff~--~~~~~v~~~~y~~gi~~~~~~~~ll~~ 475 (772)
T PRK09554 425 MMAPFMSCGARLAIFAVFAAAFFG--QNGALAVFSLYLLGIVMAILTGLMLKY 475 (772)
T ss_pred HHhccCCccchHHHHHHHHHHHcC--cccHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 346667777777765554433321 112356777888999999999988876
No 90
>PF14241 DUF4341: Domain of unknown function (DUF4341)
Probab=22.65 E-value=1.5e+02 Score=22.28 Aligned_cols=34 Identities=18% Similarity=0.188 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHH
Q psy8269 262 YLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAH 299 (386)
Q Consensus 262 yl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a 299 (386)
|+.+.++|-.+...+. .++...|+||.+.+++..
T Consensus 6 l~GG~lIGla~~~ll~----~~Gri~GiSGil~~~~~~ 39 (62)
T PF14241_consen 6 LIGGLLIGLAASLLLL----LNGRIAGISGILSGLLSP 39 (62)
T ss_pred HHHHHHHHHHHHHHHH----HcCcceehHHHHHHHhCC
Confidence 3444444444333333 257889999999988775
No 91
>PF04892 VanZ: VanZ like family ; InterPro: IPR006976 This entry represents a conserved sequence region found in the VanZ protein and also several phosphotransbutyrylases. VanZ confers low-level resistance to the glycopeptide antibiotic teicoplanin (Te). Analysis of cytoplasmic peptidoglycan precursors, accumulated in the presence of ramoplanin, showed that VanZ-mediated Te resistance does not involve incorporation of a substituent of D-alanine into the peptidoglycan precursors [].
Probab=22.11 E-value=4.6e+02 Score=21.71 Aligned_cols=28 Identities=29% Similarity=0.416 Sum_probs=21.4
Q ss_pred chhHHHHHHHHHHHHHHHHHHhhhcccc
Q psy8269 341 RLFDHAAHLGGALFGILYSKYGEQTWAH 368 (386)
Q Consensus 341 ~~i~~~aHLgG~L~G~l~~~~~~~~w~~ 368 (386)
+-.|......|++.|+++....+|+|++
T Consensus 105 d~~Dv~~n~~G~~lG~~l~~~~~~~~~r 132 (133)
T PF04892_consen 105 DIDDVLANTLGALLGYLLYRLIRKRWQR 132 (133)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 4567788889999999988887765544
No 92
>PF04632 FUSC: Fusaric acid resistance protein family; InterPro: IPR006726 This entry represents the p-hydroxybenzoic acid efflux pump subunit AaeB (pHBA efflux pump protein B) whose substrates are p-hydroxybenzoic acid (pHBA), 6-hydroxy-2-naphthoic and 2-hydroxycinnamate. It could function as a metabolic relief valve, allowing to eliminate certain compounds when they accumulate to high levels in the cell []. This family also includes fusaric acid resistance proteins [], which are likely to be membrane transporter proteins, and uncharacterised transporter YdhK.; GO: 0006810 transport, 0005886 plasma membrane
Probab=21.89 E-value=3.3e+02 Score=29.30 Aligned_cols=31 Identities=13% Similarity=0.066 Sum_probs=21.5
Q ss_pred chhHHHHHHHHHHHHHHHHHHhhhccccccc
Q psy8269 341 RLFDHAAHLGGALFGILYSKYGEQTWAHRAP 371 (386)
Q Consensus 341 ~~i~~~aHLgG~L~G~l~~~~~~~~w~~r~~ 371 (386)
+...+.-...|.+.|+++++...+......+
T Consensus 464 d~~~f~n~~la~l~G~~~a~l~~~li~p~~~ 494 (650)
T PF04632_consen 464 DFATFLNRALAILLGIVIAALVFRLIRPFSP 494 (650)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCh
Confidence 3455667778899999999888774443333
No 93
>COG4956 Integral membrane protein (PIN domain superfamily) [General function prediction only]
Probab=21.83 E-value=3.1e+02 Score=27.67 Aligned_cols=86 Identities=17% Similarity=0.256 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhh------cCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHH
Q psy8269 259 VGFYLTAGVVASLLSYVHKIL------VRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLD 332 (386)
Q Consensus 259 l~lyl~sgv~gsl~s~l~~~~------~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~ 332 (386)
+.+|+++..++..+.-.+..+ .+......|.=|.+.|++.++.+..| +..+|+.-++....+.+.+.+.
T Consensus 49 i~~~li~~~~~~~~~~~~~~le~~i~k~~~~~ilf~tiGLiiGLlia~l~~~p-----L~~~~ip~~~~ii~vi~t~il~ 123 (356)
T COG4956 49 IIFFLISFWFGKYVLNWLKRLEEQIRKLPVTTILFGTIGLIIGLLIAVLLSSP-----LFLLPIPFISTIIPVILTIILA 123 (356)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhhH-----HhhCCccHHHhHHHHHHHHHHH
Q ss_pred HHHhhhcCchhHHHHHH
Q psy8269 333 FLGVLFRWRLFDHAAHL 349 (386)
Q Consensus 333 ~~~~~~~~~~i~~~aHL 349 (386)
.++...+..+-+-.+++
T Consensus 124 y~G~~~~~k~~de~~~l 140 (356)
T COG4956 124 YFGFQLADKKRDEFLRL 140 (356)
T ss_pred HHhhHHhhhhhHHHHHh
No 94
>smart00427 H2B Histone H2B.
Probab=20.99 E-value=2.7e+02 Score=22.71 Aligned_cols=60 Identities=22% Similarity=0.242 Sum_probs=36.9
Q ss_pred HHHhhh-cCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHH
Q psy8269 120 IKKIMK-LDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKY 180 (386)
Q Consensus 120 ik~im~-~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i 180 (386)
|.|++| .+|+. .|+.++-.+..--..-..+.+...+-+....+||++++..||..++-.+
T Consensus 7 i~kvLKqVhpd~-giS~kam~imnSfvnDiferIa~EAs~L~~~nkr~TltsreIqtAvrl~ 67 (89)
T smart00427 7 IYKVLKQVHPDT-GISSKAMSIMNSFVNDIFERIAAEASKLARYNKKSTLSSREIQTAVRLI 67 (89)
T ss_pred HHHHHHHhCCCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcCCHHHHHHHHHHH
Confidence 334433 34444 3555554444433333344556667777888999999999998887553
Done!